PRACTICAL BASICS OF TRAINING UNDERGRADUATES IN BIG DATA (PROCESSING BIG DATA IN CHUNKS USING THE R PROGRAMMING LANGUAGE)

Authors

  • Nurbekova Gulmira Fazylgalamovna L.N. Gumilyov Eurasian National University
  • Yerlanova Gulmira Zhumagaliyevna Alikhan Bokeikhan University
  • Zulpykhar Zhandos Yensebekuly L.N. Gumilyov Eurasian National University
  • Nariman Saniya Aslbekovna L.N. Gumilyov Eurasian National University

DOI:

https://doi.org/10.52269/RWEP2522187

Keywords:

big data, big data analysis, R programming environment, nycflights13 package, flights files.

Abstract

The article considers the training of big data specialists and ways to deepen the knowledge of students in higher educational institutions through the processing, storage, and analysis of big data in the R programming environment. The presented results are part of a research project aimed at a comprehensive study and integration of knowledge related to hardware-software systems and programming languages. Special attention is given to familiarizing students with R language packages and data storage formats. It is demonstrated that data can be stored in two different formats, .rds and .csv, each offering distinct features and advantages for subsequent big data processing. Big data is divided into structured, semi-structured (for example, XML and JSON), and unstructured (texts, images, and videos) data, and this variety makes storage, processing, and analysis more complex. Objective: to consider the case in which a complete data set cannot be loaded into R memory at once. In this case the data can be processed in fragments (chunks), and the chunk.apply function from the iotools package by Simon Urbanek and Taylor Arnold is discussed for this purpose. An analysis of big data related to the training of undergraduates is carried out, and data from the results of the practical part of the research are presented.
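
As a rough illustration of the two ideas named in the abstract (the .rds and .csv storage formats, and processing in chunks when a full data set does not fit in memory), the following minimal sketch may help. It is not the authors' code. It deliberately uses only base R rather than chunk.apply from iotools, so that it runs without extra packages; the toy data frame, its column names, and the helper chunk_mean are all hypothetical stand-ins for the flights data.

```r
# Toy stand-in data (hypothetical columns; not the real flights table).
set.seed(1)
toy <- data.frame(
  carrier   = sample(c("AA", "DL", "UA"), 1000, replace = TRUE),
  dep_delay = round(rnorm(1000, mean = 10, sd = 30))
)

# Store the same data in both formats.
csv_path <- tempfile(fileext = ".csv")
rds_path <- tempfile(fileext = ".rds")
write.csv(toy, csv_path, row.names = FALSE)  # plain text, readable line by line
saveRDS(toy, rds_path)                       # binary, keeps R types, read whole

# An .rds file restores the object exactly as it was saved.
restored <- readRDS(rds_path)
stopifnot(identical(restored, toy))

# Hypothetical helper: mean of one numeric column, reading the CSV
# chunk_rows lines at a time so that only one chunk is in memory at once.
chunk_mean <- function(path, column, chunk_rows = 250L) {
  con <- file(path, open = "r")
  on.exit(close(con))

  # The first line holds the (quoted) column names.
  header <- strsplit(readLines(con, n = 1L), ",", fixed = TRUE)[[1]]
  header <- gsub('"', "", header, fixed = TRUE)

  total <- 0
  n <- 0L
  repeat {
    lines <- readLines(con, n = chunk_rows)
    if (length(lines) == 0L) break           # end of file reached
    chunk <- read.csv(text = lines, header = FALSE, col.names = header)
    values <- chunk[[column]]
    total <- total + sum(values, na.rm = TRUE)  # running sum across chunks
    n <- n + sum(!is.na(values))                # running count across chunks
  }
  total / n
}

# The chunked result agrees with the in-memory result.
stopifnot(isTRUE(all.equal(chunk_mean(csv_path, "dep_delay"),
                           mean(toy$dep_delay))))
```

The design point is that a mean can be rebuilt from per-chunk sums and counts, so no chunk needs to be kept after it has been processed. The .csv format suits this line-by-line pattern, whereas an .rds file is read back as a whole object.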

Author Biographies

  • Nurbekova Gulmira Fazylgalamovna, L.N. Gumilyov Eurasian National University

    PhD, Senior Lecturer of the Department of computer science

  • Yerlanova Gulmira Zhumagaliyevna, Alikhan Bokeikhan University

    PhD, acting Associate Professor of the Department of information and technical sciences

  • Zulpykhar Zhandos Yensebekuly, L.N. Gumilyov Eurasian National University

    Candidate of Pedagogical Sciences, Associate Professor, Head of the Department of computer science

  • Nariman Saniya Aslbekovna, L.N. Gumilyov Eurasian National University

    Senior Lecturer of the Department of computer science

Published

2025-07-03