ACM - A Systematic Review On Big Data Analytics Frameworks For PDF
ACM - A Systematic Review On Big Data Analytics Frameworks For PDF
1
A Systematic Review on Big Data Analytics Frameworks for
EBIMCS 2019, December, 2019, Kuala Lumpur, Malaysia
Higher Education - Tools and Algorithms
framework for higher education is emerging to be a problem [4]. RQ 1: Do higher educational institutions include any Big Data
Many frameworks proposed for Big Data analytics in higher and related frameworks in addressing concerns and challenges of
education focus on predictive analytics. Building a data-sharing improving teaching and learning?
network and managing data generated from multiple sources is also RQ 2: Which tools and algorithms are used to model academic
a problem [5]. performance in the field of education?
For this purpose, this paper undertakes a systematic literature RQ 3: What are the educational community forecasting?
review (SLR) to establish a general overview of the existing body We planned the rest of the paper as follows. We present the
of knowledge and also outline the gaps that need to be addressed in methodology in section 2. Section 3 discusses the results, and
Big Data analytics in higher education. We intend to answer the finally, we conclude the study in section 4.
subsequent questions to provide direction to researchers about the
unexplored areas.
296 from ISI Web. These studies summed up to 1543. Based on the
title, we rejected 1061 papers. We further rejected 155 studies
RESEARCH METHODOLOGY whose abstract were not thematically related to our central idea.
For this review, we followed Kofod-Petersen [6] methodology, Again, we rejected 92 studies based on the general study of the
which emphasises on getting a snapshot of research piloted in a paper. Based on primary study, we rejected 88 studies. We
particular field of study. We focus on works that combine considered 92 studies not detailed enough, hence were rejected.
Educational Data Mining, Learning Analytics, Big Data These processes narrowed down the studies to 55, which we finally
frameworks in education and tools and algorithms for modelling used for the SLR. Figure 2 shows an overview of the selection and
students’ academic performance. rejection procedure adopted for this study. Table 1 provides the
search criteria that produced the results for the study.
Inclusion and Quality Standards
Table 3: Search Criteria
We selected research works from 2007 to 2018 that deals with Big
Data framework, Educational Data Mining framework for
S/No. Related String Structure of the Search Strings
academic performance prediction and frameworks for Learning
Analytics. Our focus was to get relevant works that address issues Big Data framework AND Education
of Big Data and higher education concerning framework, tools and Big Data OR Big Data AND Higher Education
algorithms from reputable databases. 1 Framework OR Big Data AND E-learning AND
Blended Learning
Search Process Big Data AND (Students OR
We covered six (6) primary data sources, which are ACM, IEEE, 2 Education Students Performance OR Pedagogy
Springer, Elsevier, SAGE and ISI Web. The total number of OR Schools OR Educational Data
research work conducted on Big Data framework for modelling Mining OR Learning Analytics)
students’ academic performance was 286 from ACM, 428 from
IEEE, 193 from Springer, 203 from Elsevier, 137 from SAGE and Quality of Assessment
2
EBIMCS2019, December, 2019, Kuala Lumpur, Malaysia David Otoo-Arthur, Terence Van Zyl.
To ensure the quality valuation of the SLR, we examined the most RESULTS AND ANALYSIS
recent and standard papers from the six selected databases. These
The motivation of this study was to get the idea of Big Data
papers were of two types, journal papers and conference papers as
frameworks, tools and algorithms used broadly in higher education.
shown in Table 2.
The sequence of results are 1) Big Data frameworks, 2) tools and
algorithms, and 3) what the education community is forecasting.
Extraction and Synthesis of Data
Data abstraction and synthesis involve selecting the results from the Table 5: Data Extraction and Synthesis Specimen
pool or studies relevant to the review questions. Table 3 shows the
collections of evidence from selected 55 studies. Narration Aspect
Type of Paper (Conference or Journal),
Title, Author(s), Publication Year,
Meta Data Statistics
Volume, Issue, Pages
Data Extraction
Fundamental objective espoused by the
Abstract
study
Different Case Studies most selected
Case Studies
papers used
Results Selected studies results
Various criteria measures are used to
Assessment
evaluate the results
Data Synthesis
Research Study data sources are
Research
outlined in Table 2
Identified BD frameworks are outlined
Frameworks
in Table 4
Tools and algorithms used are detailed
Tools and Algorithms
in Tables 5 and 6
Identified Areas of Predictions in
Focus
Education are detailed in table 6
Figure 2: An Overview of the Selection Process
3
A Systematic Review on Big Data Analytics Frameworks for
EBIMCS 2019, December, 2019, Kuala Lumpur, Malaysia
Higher Education - Tools and Algorithms
a strong theoretical basis. From Figure 3, we note that the visualisations in the result would have led to a better understanding
integration of Big Data integration into education is a new and of the usage and impact of this framework.
evolving endeavour. Most of the studies selected were between 3.1.2 Framework by Udupi, Malali and Noronha (2016)
2015 and 2018, with 2017 recording the highest number (20) of Udupi, Malali and Noronha [20] presented a Smart Learning
studies that focused on Big Data and Education. The upsurge in framework that integrates E-Learning, Big Data and Smart
2017 is mainly due to the quest of building effective learning Technology. The e-Learning framework (layer 1) focused on
systems based on Big Data to support rapid and timely learning pedagogical activities and educational technologies comprising
analytics. A notable number of higher educational institutions were three sub-layers that enabled data processing. Information from the
utilising Big Data technologies to handle the massive data e-learning systems is synthesized from their sources into three
generated within their settings and to gain a competitive advantage forms (contents, user information and data evaluation) and then
in the education space as highlighted by authors in [29], [34], [61], passed onto the Big Data framework (layer 2) for data extraction,
[62] Our comprehensive checks further revealed ten (10) Big Data exploration and analytics. At the top layer (layer 3) is the smart
frameworks related to educational settings, as shown in Table 4. learning systems used by learners. Despite their framework
integrating Big Data into teaching and learning, detailed
Table 6: Identified Big Data Frameworks for Education explanations on analytics techniques, data storage mechanisms and
report visualization are lacking. Their proposed concept failed to
S/No Big Data Framework Work capture administrative data, demographic data and unstructured
data as well.
A Big Data Framework for Early 3.1.3 Framework by Machova, Komarkova and Lnenicka (2016)
Identification of Dropout Students in [30]
1 Machova, Komarkova and Lnenicka [25] recommended
Massive Open Online Course (MOOC)
Apache Hadoop cluster to examine complex students interaction
Big Data Integration for Transition data from Moodle system on the cloud. Apache Hadoop,
from e-Learning to Smart Learning [20]
2 MapReduce, OpenStack, and Ubuntu Servers anchored their
Framework framework. Although their framework tries to open up the issue of
Processing of Big Educational Data [24] data security, however, it suffers from several limitations; in
3 in the Cloud using Apache Hadoop particular, the authors restricted their work to historical data failing
Five-Sided Educational Data Mining to discuss the various related activities such as analytic techniques,
4 [28] user roles and data security.
Framework (5S-EDMF)
Data Science in Education: Big Data 3.1.4 Framework by Zeng (2017)
5 [4]
and Learning Analytics [28] Zeng [29] proposed a five-sided educational data-mining
Implementation of Learning Analytics framework (5S-EDMF) to analyse college students' diligence and
Framework for MOOCs using state-of- [23] effectiveness of study and to recommend learning resources
6
the-art In-Memory Computing accordingly. Their framework discusses the sources of Big Data in
A Big Data Analytic Framework for education and how these captured data support teaching and
investigating Streaming Educational [15] learning. The framework comprised four main modules:
7
Data 1. Data acquisition module outlining the various sources of
[34] data from the national level to the individual learner.
8 Big Data for Online Learning Systems
A Novel Adaptive e-Learning Model 2. Data processing module where the framework processes
Based on Big Data by using structured, semi-structured, and unstructured historical
[43]
9 competence-based Knowledge and data.
Social Learner Activities
3. Data mining analysis module which evokes the mining
Harnessing the power of Big Data algorithm in Mahout to carry on parallel computation on
Analytics in the Cloud to support Learning [46] data to obtain knowledge hidden pattern(s), and
10 Analytics in Mobile Learning Environment
4. Data application module that deals with the presentation
and visualisation of data analysis results for decision-
Frameworks Review making. Their framework, however, failed to consider
streaming data.
3.1.1 Framework by Tang et al. (2015)
Tang et al. [31] investigated the application of Big Data methods to While their framework highlights significant issues such as
identify students who are likely to drop out of MOOC. To this analytics techniques and visualisation, they are not detailed.
extent, they proposed an automatic framework that used historical 3.1.5 Framework by Klašnja-Milicevic, Ivanovic and Budimac
data to construct a classification model to identify the potential (2017)
dropout student. Their framework presented six (6) modules: data Klašnja-Milicevic, Ivanovic and Budimac [4] also suggested a
collection, data processing, model construction, online framework for analysing and processing multi-structured data sets
classification and result presentation. In spite of their framework for Higher learning institutions. Their work proposed five modules
achieving an average accuracy of 94.9\% on a real dataset, the that anchored the framework; Data Capture and Collection, ETL
inclusion of user roles, storage infrastructure, data types and (Extractions, Transformations, Loading), Hadoop platform,
4
EBIMCS2019, December, 2019, Kuala Lumpur, Malaysia David Otoo-Arthur, Terence Van Zyl.
Analysis Engine and Presentation Layer. The framework aims at analysing, optimisation, and visualisation. Users profile, enrolment
improving teaching and learning by identifying the roles of actors, facts and other relevant contents are synthesised at the third layer
namely learners, teachers/researchers and administrators/data (e-Learning) and presented to the actors of the framework at layer
scientist. Their framework captures various sources of data in four. Despite proposing an extensive and detailed framework, the
education, manage these data within the Hadoop platform and authors failed to discuss the pedagogical design of online learning
stores them in a compatible repository. The analysis engine tasks for the various users of the architecture. Again, their
executes predefined and standard procedures and other complex framework did not look at analytics methods and how learners’
statistical analysis. The presentation layer provides a user-friendly behaviour could be monitored in a real-time environment, as
graphical visualisation interface for the users of the system. Even suggested by the authors for future work.
though their framework is comprehensive and elaborate, the
3.1.9 Framework by Birjali, Beni-Hssane and Erritali (2018)
authors failed to delve deep into the analytics techniques. How their
Birjali, Beni-Hssane and Erritali [44] offered an adaptive e-
framework would secure data and optimise this platform with other
learning model architecture based on Big Data technology and
learning services is also not mentioned. Besides, no real data was
optimisation algorithms. The model presented two levels of
used to test the framework.
adaptation. The first level employed the MapReduce-based GA to
3.1.6 Framework by Laveti et al. (2017)
retrieve the necessary future educational objectives (FEO) based on
Laveti et al. [24] proposed a workflow learning analytic for
the learner prerequisites through learner e-assessment method.
MOOCs that applied the Spark framework. The framework
Using FEO's MapReduce-based Ant Colony Optimisation (ACO)
advanced three main components; Data Store, Processing and
generates adaptive personalised learning path (PLP) that contains
Visualisation, and Reporting and Visualisation. Their framework
the learning content that the learner needs. The second level of their
primarily focused on machine learning algorithms and its
framework determined social indicators from social networks that
application to analytics in education. Contrary to testing their
are in correlation with their learning activities through social
framework with a real dataset, the type of data was not explicit.
network analysis (SNA). Despite producing a framework that
Data captured were historical and excluded streaming data.
blends Big Data technologies to an e-learning platform, the
Moreover, the authors failed to explain how the framework would
framework failed to look into the type of data such as administrative
handle data storage process and its related security issues.
and demographic. Besides, the framework did not include any
3.1.7 Framework by Yang et al. (2017)
mechanisms for securing data and visualisation.
Yang et al. [15] proposed a framework for learners streaming
3.1.10 Framework by Shorfuzzaman et al. (2018)
data using term frequency-inverse document frequency (TF-IDF)
Shorfuzzaman et al. [47] proposed a cloud-based mobile
and fuzzy representation techniques to uncover significant patterns
learning framework that uses Big Data analytics technique to
from usage data. Their work suggested three critical stages;
extract values from a vast volume of mobile learners' data. Their
1. Data collection stage where students' metadata and
behavioural data were collected, framework provided on-demand scalable computing and data
storage resources. However, the framework failed to consider
2. Data persistence stage which comprises Kafka and administrative and demographics data, real-time learning analytics
storage systems for streaming and storing data and and result visualisation, as suggested by authors for future work.
3. Data mining stage where they extracted knowledge from
data. Big Data Frameworks Correlation
Besides, the framework adopted a unified storage mechanism, In general, we consider the framework in this study as a conceptual
which combined traditional RDBMS, HDFS and NoSQL intending or real-time architectural model intended to guide or support Big
to handle small to a large amount of structured and semi-structured Data analytics in higher education.
data, and streaming data. However, their framework could not Our frameworks review revealed several similarities and gaps
handle unstructured data and left out features such as reporting and within the ten (10) identified frameworks. Specifically, the
visualisation. frameworks present four key thematic modules. These modules
3.1.8 Framework by Dahdouh et al. (2018) include data capture and collection, data processing and
Drawing on a broad range of Big Data technologies, Dahdouh et al. visualisation, model construction and data mining, and result
[35] proposed an architecture framework that integrates traditional presentation and visualisation. We realised authors failed to address
e-Learning systems with cloud computing. The Architecture thoroughly issues of data security, privacy, ownership and theories
provided a detailed description of how Hadoop and other Big Data that support the whole data cycle in education.
technologies (Spark and NoSQL) could work together. Four layers Demchenko, de Laat and Membrey [64] suggests that Big Data
characterised their framework; cloud infrastructure, Big Data, e- infrastructure should support the whole data lifecycle, data security
Learning and user. At the infrastructure layer is the hardware and data ownership protection. Similarly, authors in [11], [65], [66]
resources, built with virtualising compute, storage and network argue that the impact of Big Data thrives not only on building
resources. This layer provides abstraction and flexibility when effective models, but informed theories which serve as the basis to
managing various hardware resources. The second layer, Big Data, analysing large-scale data. Therefore, having an educational
provides various Big Data tools for data storage, processing, framework that integrates these pertinent issues is imperative.
5
A Systematic Review on Big Data Analytics Frameworks for
EBIMCS 2019, December, 2019, Kuala Lumpur, Malaysia
Higher Education - Tools and Algorithms
1 *
The Category of Hadoop includes HDFS, YARN and Common 2
Other category includes SPSS, STATA, Vertica, WEKA, KNIME, KEEL,
Tableau, Cassandra, Kibana, Elastic Search, OpenRefine, Scriptella, Pentaho,
KETL. Each tool within this category appeared once.
6
EBIMCS2019, December, 2019, Kuala Lumpur, Malaysia David Otoo-Arthur, Terence Van Zyl.
variety of other tools in recent years. However, Grolinger et al. [69] inability to execute the in-memory job efficiently due to the slow
and Gupta et al. [70] point out that, Hadoop comes with some nature of the Hadoop-MapReduce.
challenges such as difficulty in programming, thus requiring
abstraction, ineffectiveness in processing streaming data and its
Table 8: Comparison of Algorithms used for Modelling Students Performance
Regression6 [17],[37],[10],[23],[42],[27]
Latent Profile
[37]
Analysis (LPA)
TF-IDF [15]
ıve Bayes
Na¨ [23]
KC Model [11]
Gradient Boosting
[23],[46]
(XGBoost)
Student Performance
Table 9: Domains of Forecasting in Education 1 Prediction [15],[26]
Planning and Management [12],[27],[18],
What Education Community is 2 of Stakeholders
S\No Works [33],[55]
Forecasting
7
A Systematic Review on Big Data Analytics Frameworks for
EBIMCS 2019, December, 2019, Kuala Lumpur, Malaysia
Higher Education - Tools and Algorithms
8
EBIMCS2019, December, 2019, Kuala Lumpur, Malaysia David Otoo-Arthur, Terence Van Zyl.
[21] P. K. Udupi, P. Malali, and H. Noronha, “Big data integration for transition from [46] A. De Mauro, M. Greco, M. Grimaldi, and P. Ritala, “Human resources for big
e-learning to smart learning framework,” in 2016 3rd MEC International data professions: A systematic classification of job roles and required skill sets,”
Conference on Big Data and Smart City (ICBDSC). IEEE, 2016, pp. 1–4. Information Processing & Management, vol. 54, no. 5, pp. 807–817, 2018.
[22] B. Liu, X. Li, Y. Wang, H. Wang, and F. Xu, “The system framework of data [47] M. Shorfuzzaman, M. S. Hossain, A. Nazir, G. Muhammad, and A. Alamri,
mining and learning analysis for smart classroom,” in 2018 International Joint “Harnessing the power of big data analytics in the cloud to support learning
Conference on Information, Media and Engineering (ICIME). IEEE, 2018, pp. analytics in mobile learning environment,” Computers in Human Behavior, vol.
331–336. 92, pp. 578–588, 2018.
[23] S. T. Konstantinidis, A. Fecowycz, K. Coolin, H. Wharrad, G. Konstantinidis, [48] B. Cope and M. Kalantzis, “Big data comes to school: Implications for learning,
and P. D. Bamidis, “A proposed learner activity taxonomy and a framework for assessment, and research,” AERA Open, vol. 2, no. 2, p. 2332858416641907,
analysing learner engagement versus performance using big educational data,” 2016.
in 2017 IEEE 30th International Symposium on Computer-Based Medical [49] G. Veletsianos, J. Reich, and L. A. Pasquini, “The life between big data log events:
Systems (CBMS). IEEE, 2017, pp. 429–434. Learners’ strategies to overcome challenges in moocs,” AERA Open, vol. 2, no.
[24] R. N. Laveti, S. Kuppili, J. Ch, S. N. Pal, and N. S. C. Babu, “Implementation of 3, p. 2332858416657002, 2016.
learning analytics framework for moocs using state-of-the-art in-memory [50] M. Clayton and D. Halliday, “Big data and the liberal conception of education,”
computing,” in 2017 5th National Conference on E-Learning & E-Learning Theory and Research in Education, vol. 15, no. 3, pp. 290–305, 2017.
Technologies (ELELTECH). IEEE, 2017, pp. 1–6. [51] G. Dishon, “New data, old tensions: Big data, personalized learning, and the
[25] R. Machova, J. Komarkova, and M. Lnenicka, “Processing of big educational challenges of progressive education,” Theory and Research in Education, vol. 15,
data in the cloud using apache hadoop,” in 2016 International Conference on no. 3, pp. 272–289, 2017.
Information Society (i-Society). IEEE, 2016, pp. 46–49. [52] C. F. Lynch, “Who prophets from big data in education? New insights and new
[26] F. Matsebula and E. Mnkandla, “A big data architecture for learning analytics in challenges,” Theory and Research in Education, vol. 15, no. 3, pp. 249–271, 2017.
higher education,” in 2017 IEEE AFRICON. IEEE, 2017, pp. 951–956. [53] J. Scott and T. P. Nichols, “Learning analytics as assemblage: Criticality and
[27] R. Soltanpoor and A. Yavari, “Coala: contextualization framework for smart contingency in online education,” Research in Education, vol. 98, no. 1, pp. 83–
learning analytics,” in 2017 IEEE 37th International Conference on Distributed 105, 2017.
Computing Systems Workshops (ICDCSW). IEEE, 2017, pp. 226–231. [54] S. Slater, S. Joksimovi´c, V. Kovanovic, R. S. Baker, and D. Gasevic, “Tools for
[28] K. Stefanova and D. Kabakchieva, “Educational data mining perspectives within educational data mining: A review,” Journal of Educational and Behavioral
university big data environment,” in 2017 International Conference on Statistics, vol. 42, no. 1, pp. 85–106, 2017.
Engineering, Technology and Innovation (ICE/ITMC). IEEE, 2017, pp. 264–270. [55] B. Williamson, “Who owns educational theory? Big data, algorithms and the
[29] T. Zeng, “The research and practice of a five-sided educational data mining expert power of education data science,” E-Learning and Digital Media, vol. 14,
framework,” in 2017 IEEE 3rd Information Technology and Mechatronics no. 3, pp. 105–122, 2017.
Engineering Conference (ITOEC). IEEE, 2017, pp. 1050–1053. [56] M. Attaran, J. Stark, and D. Stotler, “Opportunities and challenges for big data
[30] J. Lam, K. K. Ng, S. K. Cheung, T. L. Wong, K. C. Li, and F. L. Wang, analytics in us higher education: A conceptual model for implementation,”
Technology in education. Technology-mediated proactive learning: Second Industry and Higher Education, vol. 32, no. 3, pp. 169–182, 2018.
international conference, ICTE 2015, Hong Kong, China, July 2-4, 2015, revised [57] Y.-F. Zhao, Z.-G. Fu, and F. Chen, “Research on big data preprocessing
selected papers. Springer, 2015, vol. 559. technology of thermal system,” in 2nd Annual International Conference on
[31] J. K. Tang, H. Xie, and T.-L. Wong, “A big data framework for early Energy, Environmental & Sustainable Ecosystem Development (EESED 2016).
identification of dropout students in mooc,” in International Conference on Atlantis Press, 2016.
Technology in Education. Springer, 2015, pp. 127–132. [58] G. Zhang, J. Li, and L. Hao, “Research on cloud computing and its application
[32] S. Velampalli and V. M. Jonnalagedda, “Intelligent computing for skill-set in big data processing of distance higher education.” International Journal of
analytics in a big data framework—a practical approach,” in Proceedings of the Emerging Technologies in Learning, vol. 10, no. 8, 2015.
First International Conference on Intelligent Computing and Communication. [59] S. S. Chaurasia and A. Frieda Rosin, “From big data to big impact: analytics for
Springer, 2017, pp. 267–275. teaching and learning in higher education,” Industrial and Commercial Training,
[33] J. A. Reyes, “The skinny on big data in education: Learning analytics simplified,” vol. 49, no. 7/8, pp. 321–328, 2017.
TechTrends, vol. 59, no. 2, pp. 75–80, 2015. [60] S. S. Chaurasia, D. Kodwani, H. Lachhwani, and M. A. Ketkar, “Big data
[34] C. Laux, N. Li, C. Seliger, and J. Springer, “Impacting big data analytics in academic and learning analytics: Connecting the dots for academic excellence
higher education through six sigma techniques,” International Journal of in higher education,” International Journal of Educational Management, vol. 32,
Productivity and Performance Management, vol. 66, no. 5, pp. 662–679, 2017. no. 6, pp. 1099–1117, 2018.
[35] K. Dahdouh, A. Dakkak, L. Oughdir, and F. Messaoudi, “Big data for online [61] J. McCarthy, “Enhancing feedback in higher education: Students’ attitudes
learning systems,” Education and Information Technologies, vol. 23, no. 6, pp. towards online and in-class formative assessment feedback models,” Active
2783–2800, 2018. Learning in Higher Education, vol. 18, no. 2, pp. 127–141, 2017.
[36] B. Williamson, “The hidden architecture of higher education: building a big data [62] M. Nu´n˜ez-del Prado and R. Gom´ez, “Learning data analytics through a
infrastructure for the ‘smarter university’,” International Journal of Educational problem based learning course,” in 2017 IEEE World Engineering Education
Technology in Higher Education, vol. 15, no. 1, p. 12, 2018. Conference (EDUNINE). IEEE, 2017, pp. 52–56.
[37] T. De Feyter, R. Caers, C. Vigna, and D. Berings, “Unraveling the impact of the [63] A. Klaˇsnja-Mili´cevi´c, M. Ivanovi´c, and Z. Budimac, “Data science in
big five personality traits on academic performance: The moderating and education: Big data and learning analytics,” Computer Applications in
mediating effects of self-efficacy and academic motivation,” Learning and Engineering Education, vol. 25, no. 6, pp. 1066–1078, 2017.
individual Differences, vol. 22, no. 4, pp. 439–448, 2012. [64] Y. Demchenko, C. De Laat, and P. Membrey, “Defining architecture components
[38] W. Smidt, G. Kammermeyer, and S. Roux, “Relations between the big five of the big data ecosystem,” in 2014 International Conference on Collaboration
personality traits of prospective early childhood pedagogues and their beliefs Technologies and Systems (CTS). IEEE, 2014, pp. 104–112.
about the education of preschool children: Evidence from a german study,” [65] P. Nilsen, “Making sense of implementation theories, models and frameworks,”
Learning and Individual Differences, vol. 37, pp. 96–106, 2015. Implementation science, vol. 10, no. 1, p. 53, 2015.
[39] J. T. Wassan, “Discovering big data modelling for educational world,” Procedia- [66] P. V. Coveney, E. R. Dougherty, and R. R. Highfield, “Big data need big theory
Social and Behavioral Sciences, vol. 176, pp. 642–649, 2015. too,” Philosophical Transactions of the Royal Society A: Mathematical, Physical
[40] G. Bello-Orgaz, J. J. Jung, and D. Camacho, “Social big data: Recent and Engineering Sciences, vol. 374, no. 2080, p. 20160153, 2016.
achievements and new challenges,” Information Fusion, vol. 28, pp. 45–59, 2016. [67] L. P. Macfadyen and S. Dawson, “Mining lms data to develop an “early warning
[41] J. Zhang and M. Ziegler, “How do the big five influence scholastic performance? system” for educators: A proof of concept,” Computers & education, vol. 54, no.
A big five-narrow traits model or a double mediation model,” Learning and 2, pp. 588–599, 2010.
Individual Differences, vol. 50, pp. 93–102, 2016. [68] M. Franklin, “Making sense of big data with the berkeley data analytics stack,”
[42] O. Bent, P. Dey, K. Weldemariam, and M. K. Mohania, “Modeling user behavior in Proceedings of the Eighth ACM International Conference on Web Search and
data in systems of engagement,” Future Generation Computer Systems, vol. 68, Data Mining. ACM, 2015, pp. 1–2.
pp. 456–464, 2017. [69] K. Grolinger, M. Hayes, W. A. Higashino, A. L’Heureux, D. S. Allison, and M.
[43] S. Singh, R. Misra, and S. Srivastava, “An empirical investigation of student’s A. Capretz, “Challenges for mapreduce in big data,” in Services (SERVICES),
motivation towards learning quantitative courses,” The International Journal of 2014 IEEE World Congress on. IEEE, 2014, pp. 182–189.
Management Education, vol. 15, no. 2, pp. 47–59, 2017. [70] A. Gupta, H. K. Thakur, R. Shrivastava, P. Kumar, and S. Nag, “A big data
[44] M. Birjali, A. Beni-Hssane, and M. Erritali, “A novel adaptive e-learning model analysis framework using apache spark and deep learning,” in Data Mining
based on big data by using competence-based knowledge and social learner Workshops (ICDMW), 2017 IEEE International Conference on. IEEE, 2017, pp.
activities,” Applied Soft Computing, vol. 69, pp. 14–32, 2018. 9–16.
[45] K. T. Chui, D. C. L. Fung, M. D. Lytras, and T. M. Lam, “Predicting at-risk
university students in a virtual learning environment via a machine learning
algorithm,” Computers in Human Behavior, 2018.