Dynamic Educational Recommender System Based On Improved Recurrent Neural Networks Using Attention Technique
Dynamic Educational Recommender System Based On Improved Recurrent Neural Networks Using Attention Technique
An International Journal
Hadis Ahmadian Yazdi, Seyyed Javad Seyyed Mahdavi Chabok & Maryam
Kheirabadi
To cite this article: Hadis Ahmadian Yazdi, Seyyed Javad Seyyed Mahdavi Chabok & Maryam
Kheirabadi (2022) Dynamic Educational Recommender System Based on Improved Recurrent
Neural Networks Using Attention Technique, Applied Artificial Intelligence, 36:1, 2005298, DOI:
10.1080/08839514.2021.2005298
Introduction
Many years ago, the book was well known as the primary tool for education.
Today’s, however, the development of computers and the World Wide Web
and the increase in heterogeneous information has created a sense of the need
to design systems to generate the most meaningful recommendations, which
simplifies selection and activity processes. Recommender systems have
evolved over the past two decades (Adomavicius and Tuzhilin 2005;
Cremonesi et al. 2011). By becoming the Internet to a comprehensive medium
and the rapid growth of e-learning, users’ expectations of these systems have
risen. In this regard, one of the advantages of these systems is the lack of space
and time constraints on educating. Traditional education systems are time-
consuming, as compared to modern education (Shishehchi et al. 2011).
Nevertheless, web-based systems also have some significant drawbacks, as
compared to traditional classrooms. For instance, there is no interaction
between the learner and the educator. Meanwhile, the presentation of content
and feedback is not personalized (Romero, Ventura, and García 2008). Besides, a
large number of resources in e-learning environments makes it difficult to make
the proper choice. Another point is that the student has individual differences
such as educational background, study method, age, etc. This requires us to get
feedback from students to better guide them in the educating process (Lu 2004).
Concerning such limitations, students in e-learning systems are eager for perso
nalized services to monitor, support automatically, and evaluate student learn
ing. Student loyalty also increases with personalized service (Huimin, Ming, and
Mingming 2010; Muthukumar and Bhalaji 2020).
So far, counseling systems, especially educational counseling, have been
studied with different techniques and methods. Most of these researches,
especially in the issue of educational recommenders, have tried to solve the
problem with linear forms and models and data mining such as ontology to
recommend scientific resources. (Bourkoukou and Achbarou 2018;
Bourkoukou and El Bachari 2018; Gulzar, Leema, and Deepak 2018; Qiao
and Hu 2018; Yago et al. 2018) One of the problems and limitations of these
methods is not accepting a large amount of information. Only a framework for
recommendation is introduced; there has not been much discussion about
a training advisor who automatically offers helpful advice. One of the newest
and most complete ways to solve the problem is to use deep learning networks.
Due to a large amount of problem data, neural network integration, learning
techniques, and structural dynamics informing several hidden layers, these
networks can solve problems with high accuracy and not encounter overfit
errors. (Alashkar et al. 2017; Chen et al. 2017b, 2017a; Hu et al. 2018; Wang
et al. 2018, 2017; Xu et al. 2017; Zheng, Noroozi, and Ph 2017) On the other
hand, in issues such as our problem that we are faced with a variety of areas
and parameters influencing decision-making, Deep learning networks are
more powerful than traditional methods.
Educational Recommender Systems (ERS) is utilized to assist students and
teachers during the learning process. Here, it is worth mentioning that the main
difference between ERS and their business counterparts is the appropriate
educational principles for both the learning and teaching process. This differ
ence in the educational methods used in different academic conditions deter
mines the primary guidelines for the ERS design. Based on this analysis, along
with educating and upgrading of existing algorithms, five specific areas are
introduced, in which the future research and development can be expected:
construction of universal ERS, ERS intended primarily for teachers, ERS that
e2005298-470 H. AHMADIAN YAZDI ET AL.
links student achievements across different courses, ERS which take into account
physical distance between students and use of ERS to motivate students to work
continuously. (Muthukumar and Bhalaji 2020; Zhang et al. 2018) The use of
recommenders when analyzing facts in the decision-making process is one of the
basic elements that many people apply during this process (Prem and Vikas
2010). The development and improvement of existing systems are one of the
current researches in the world. These developments are applied based on the
continuous evolution of statistical methods, machine learning, artificial intelli
gence, data mining, and information retrieval (Prem and Vikas 2010;
Manouselis et al. 2012). This paper aims to design an Educational
Recommender System (ERS) that recommends the relevant resources to users
based on their interests and features in the relevant dataset. Indeed, our recom
mender system contains individual features such as educational background,
age, and so on, including the individual’s interests in pre-clicked, the down
loaded resources, and the user score given to each resource. Hence, such
a system must be educated to be able to recommend new resources to users.
if there is an inherent structure that the model can exploit, deep neural
networks are very efficient for this issue. For instance, both CNN and RNN
have long employed the internal structure of machine vision or natural
language. Because the nature of recommending textbooks depends on the
time and long-term review of student performance, the sequential structure
of sessions or report clicks is very appropriate for inferential errors in con
ventional or recursive models. In many of these methods, the same weight is
considered in learning for all users’ interests, and only the user’s past informa
tion is used in learning. While in the present article, having a network that
looks both backward and forwards can also cover changes in learner behavior
and offer more up-to-date recommendations.
In the following, the sources and descriptions are reviewed on the basic
methods on which ERS operates. Then, some examples of different ERS
classifications are provided according to their specific characteristics and
basic methods. In the next section, the proposed method is presented, which
is a combination of the architecture of MLP deep learning networks. In the
fourth section, the results of the implementation of the proposed algorithm
based on accuracy and efficiency are surveyed. Ultimately, some future sugges
tions are provided in the fifth section.
system usually focuses on a specific type of entity. For example, we can refer to
an article or news item to provide valuable and practical recommendations for
that particular type of entity. In recent years, research on recommender
systems has increased compared to other information systems and methods
(such as datasets with search engines) (Covington, Adams, and Sargin. 2016).
Nowadays, many recommender systems are running, which are based on
different approaches and methods. The methods are shown in Figure 1.
Today, many different ERSs are utilized. Their objective is to facilitate the
modernization of the educational process in both formal and open educational
environments. These systems are usually a combination of design and beha
vior, methods, and strategies to create the recommendations. In this way, ERS
can be divided into systems that recommend educating, learning objects,
teamwork to implement joint activities, different educating methods through
learning cases based on the user’s unique preferences, or helping to create
e2005298-474 H. AHMADIAN YAZDI ET AL.
a personal learning path (PLP) (Zhang et al. 2018). Likewise, the ERS educat
ing method recommends dividing users into methods in formal learning
environments and freely available methods on the World Wide Web.
Regarding the widespread use of web2.0 tools for e-learning, most ERSs
recommend a combination of these methods. Besides, some ERSs help tea
chers to perform a part of student supervision (Tejeda-Lorente et al. 2015) or
find some ways to recommend the learning topics (Huang, Zeng, and Chen
2007; Mubarak, Cao, and Ahmed 2021). Sunita et al. (Aher and Lobo 2012)
recommend ERS courses available to students. Then, they developed their
recommender system based on the best combination of lessons available and
each user’s unique interests. In (Imran et al. 2016) developed their recom
mender system of Personalized Learning Object Recommender System
(PLORS), which is an ERS in the LMS by various learning objects to persona
lize the formal educational process, based on monitoring the activities of
previous students and then comparing them with other students and their
activities. In (El-Bishouty et al. 2014), they proposed an ERS model that helps
teachers proportion e-learning content to their students’ different learning
styles.
Moreover, the E-learning Activities Recommender System (ELARS) (Hoic-
Bozic, HolenkoDlab, and Mornar 2016) exploits visual, auditory, read/write,
mobile (VARK) descriptions (Fleming 1995), and learning styles as an essen
tial element in a user profile. (Marian, Popescu, and Costel 2015) suggest the
use of ERS to help students find the groups who can help them solve
a particular problem in learning the content of a specific course. In several
different systems, the use of ERS to communicate with students has anon
ymously been proposed.
In addition, one of the objectives of ELARS is to be able to recommend
when forming a group to work on a specific problem or on a particular project.
When this feature is added to the ERS, students usually have the freedom to
decide independently whether to accept recommendations and communicate
with appropriate colleagues or to ignore them (Zhang et al. 2018).
Determining a personal learning path is one of the objectives of a number of
ERSs. These systems employ different input parameters to define a unique
path of educational content for each user. China Ming et al. (Chin Ming, Chih
Ming, and Mei Hui 2005) arranged the syllabus in such a way that the system
utilizes the student’s incorrect answers to devise more learning paths so that
the user can obtain sufficient knowledge of the course content.
On the other hand, Lata et al. (Latha and Kirubakaran 2013) developed a
model of ERS that its algorithm employed the graph theory and knowledge of
different learning styles to recommend different PLPs for each user. In (Chin
Ming et al. 2007), each user’s basic knowledge level is compared to the complex
ity of individual learning objects. According to the results of this comparison,
the ERS provides some recommendations for different learning paths. Besides,
APPLIED ARTIFICIAL INTELLIGENCE e2005298-475
in (Onah and Sinclair 2015), the PLP building was designed based on
a comparison of the user profile and the aim of proper learning defined by
the user. The ERS monitors the user’s progress and changes the learning path to
ensure that all the required knowledge is acquired to succeed in further learning.
To obtain the better performance of the algorithms used in ERS, several
methods of artificial intelligence (fuzzy sets, artificial neural networks, evolution
ary strategies) or their interactions are utilized (Zhang et al. 2018) In the papers of
(Tejeda-Lorente et al. 2015; Jamsandekar and Mudholkar 2013), they use fuzzy
inference methods to process data on student success with the aim of better
monitoring students’ progress through course content. Artificial neural networks
are considered to develop algorithms that are capable of self-learning based on
data from a given domain (Negnevitsky 2005), on the ERS of artificial neural
networks for complex modeling relationships between user profiles and their
interests (De Gemmis et al. 2009), as well as for modeling the relationship between
recommended objects and other parameters in the ERS to determine specific
recommendations that are unique for each user (Adomavicius and Tuzhilin 2005;
Jamsandekar and Mudholkar 2013) This method can obtain better overall results
in the same environment, as compared to cases where only one of these methods is
employed (Zhang et al. 2018). Some methods of artificial intelligence based on
evolutionary calculations include genetic algorithms, evolutionary strategies, and
genetic programming (Negnevitsky 2005). It should be noted that genetic algo
rithms and various evolutionary strategies are commonly used in ERS (Zhang
et al. 2018). In (Sengupta, Sahu, and Dasgupta 2011), they used the Ant Colony
Optimization approach (an evolutionary algorithm) to identify system users’
effective and optimal learning paths. This system is exploited to access informa
tion about unfamiliar terms that the user encounters during the learning process.
The paper (Chin Ming et al. 2007) utilized the genetic algorithm to create
a personal learning path for the user, while (Cayzer and Aickelin 2002) used the
biological immune system model to obtain a set of possible recommendations.
Proposed Method
This paper aims first to obtain a dataset of users, including their interest in the
resources under study and the extent to which they use and click on these
resources and related features. After that, the practical items from them are
chosen. In the second phase, using deep neural networks, we educate our
recommender system with acceptable accuracy, and finally, we recommend
resources to users using the educational network. The recommender algo
rithm encompasses data extraction from OULAD data source files, data pre-
processing, construction of a deep learning network of MLP, LSTM, and
BiLSTM networks improved by Attention method, initialization of para
meters, educating, and finally, educating point predicting. In our proposed
architecture, as illustrated in Figure 2, in each layer for each feature in the
e2005298-476 H. AHMADIAN YAZDI ET AL.
dataset, one BiLSTM cell and one LSTM cell are considered so that the cells are
focused on one feature of each record. Each cell considers only the pattern of
one feature. Finally, the total of these patterns leads to obtaining a better result.
determines the probability of belonging to each class, dropout has been used to
use the secret aggregation, ensemble feature; Also, it prevents network over
load and increases the generalization capability of the model.
Implementation
The volume of dataset data used is about 11 million data, which after elim
inating those records via missing fields, finally remains 10543682 records
consisting of 12 features. It should be noted that this set of records is gathered
from the activities of 23326 different students. The implementation steps are
carried out in two parts: the first part contains the hybrid architecture (see
Figure 2) that refers to our idea in the paper. On the other hand, the second
part includes implementing several traditional methods and the deep learning
based on the DBN network related to the idea of the study of (Zhang et al.
2018), which developed scientific resources. In this article, we have used 3
divisions 70% – 30%, 80% – 20%, 90% – 10% to train and test our proposed
model, and for each case 3 values of 0.1, 0.2, 0.3 For validation split.(Table 7)
After the completion of the Epochs, the graphs and the results of the perfor
mances show that the accuracy and loss of our work are far better than the
results of the implementation of the proposed network (Zhang et al. 2018; Lic.
Mar´ıa Emilia Charnelli et al.2019; Hui Chen et al. 2020; Rumei Li et al. 2019;
Lingyao Yan et al. 2021)
Dataset
Students generate a diversity of behavioral data by learning in an online
learning environment. This behavioral data is gathered and stored through
data collection methods (Kuzilek, Hlosta, and Zdrahal 2017). The
resources dataset provides data sources for this platform. This resources
dataset can be exploited to extract content features that reflect students’
interest in resources. The feature vectors of students are generated by
combining student features and lesson features. Afterward, the combina
tion vectors of behavioral feature and user-lesson feature are created. The
dataset consists of 3 students, teachers, and lessons. It includes informa
tion about 22 courses provided, 32,593 students, their assessment results,
and their interaction with the Virtual Learning Environment (VLE), which
is summarized by Students’ daily clicks on various “resources” (10,655,280
inputs) are provided. The dataset is anonymous using the ARX data
encryption tool [PK15]. The data is investigated for loss detection and
then confirmed and published by the Open Data Institute1.
OULAD is a sample subset of the collected student data, which contains
student demographics, student performance in the course assessment, and last
but not least, student behavior in the VLE. This resource provides a unique
dataset of student performance and prepares an opportunity to create new
e2005298-478 H. AHMADIAN YAZDI ET AL.
Dataset Plan
Figure 3 depicts the overall structure of the presented data set. This data set
generally represents students and then the course. The main table of
StudentInfo contains the student files that are linked to the courses (A student
can have more than a one registered course). Each course has several assess
ments related to the student using the student assessment table, including the
history of the student assessment results.
● courses.csv: The file contains a list of all available and presented courses.
● assessments.csv: This file contains information about the assessments of
the presented courses. Usually, each course has a number of assessments
followed by a final exam.
Data Description
● vle.csv: Contains information about the tool’s existence in the VLE. It is
usually HTML, pdf, etc., pages. Students have access to these resources
online, and their interactions with them are then recorded.
● studentInfo.csv: This information includes demographic information
about students and their results. Each student may have several rows.
Each row contains information about one student in the studied course,
including the following columns:
● studentRegistration.csv: This table encompasses information about the
student registration time to participate in the module. Besides, it is
recorded for students who have not registered the registration date.
● StudentAssessment.csv: This file contains student assessment results. If
the student does not submit an assessment, no result is recorded.
Meanwhile, if the assessments are not stored in the system, there will be
no final exams.
● studentVle.csv: it includes information about each student’s interaction
with resources in the VLE.
APPLIED ARTIFICIAL INTELLIGENCE e2005298-479
Figure 3. Dataset structure of the data set (Kuzilek, Hlosta, and Zdrahal 2017).
Data Pre-processing
Figure 4 shows the pre-processing steps. The input consists of four sections:
provided resources, student features, held courses, and student performance
and assessment history in each course. These four sections are combined to
perform further analysis, categorization, feature mapping. Table 1 clearing
blank or mistake data, and feature normalization.
The features are normalized before implementing any work (the data range
is [0, 1]). To do so, Eq. (1) is utilized to perform the normalization as follows:
x xmin
x� ¼ (1)
xmax xmin
Where Xmin denotes the lowest eigenvalue, which is Xmin = min{X1, X2, . . .,
Xn}. Xmax denotes the maximum eigenvalue that is Xmax = max{X1, X2, . . .,
Xn}. X*indicates the normalized value, x means the primary data. Ultimately,
after pre-processing, the data is divided into an education set and a test set.
Records Labeling
The dataset is labeled after the initial stages of pre-processing as follows:
For each student from the set of registered activities for the joint courses, the
resource with the most clicks (which can indicate the student’s taste and interest)
in the course that had the highest grade (which can be the influence of the
e2005298-480 H. AHMADIAN YAZDI ET AL.
resources studied in that course to show the student to be more successful) was
chosen as the label. Therefore, 562 labels were generated, which were mapped
from 0 to 561. The frequency of repetition of labels is illustrated in Figure 5.
In the end, after the pre-processing step, the data is divided into an educa
tion set, a validation set, and a test set. The education set enters our proposed
network as input.
Network Construction
The current research was conducted on a Google Colab server. Here, it is
worthwhile to mention that Google Colab is a cloud service provided by
Google that allows Python programming, which prepares to install and work
with several Python language packages and deep learning frameworks such as
Tensorflow, Keras Pytorch, and more. In terms of this service, it provides
a free GPU to users, which has practically multiplied the efficiency of this
service. The service has been provided with an Nvidia Tesla P100 and 25.51GB
of RAM. Besides, both LSTM and Bidirectional Keras library have been used to
build the recommended network. After entering data into the network, the
data enters the Bidirectional layer with 1536 neurons. At the end of the output
of this layer, they enter the LSTM layer with 512 neurons, and at the end of the
output of this layer, they enter the Attention layer. In this implementation,
SeqWeightedAttention existence in the Keras library has been employed to
implement the attention technique.
Initialization of network parameters:
In the model educating process, we must repeatedly adjust the model
parameters to achieve better results in feature extraction. During the learning
process, the minibatch process method is exploited to solve the problem of
large data volumes. Besides, some parameters such as learning rate, number of
repetitions, and Bach-Size are set as follows:
Bach-size: 2048, Learning rate: 0.0001, Epoch: 120, Activation function:
Softmax
Table 1. Values of mapped features to numbers.
code_module AAA = 0.1 BBB = 0.2 CCC = 0.3 DDD = 0.4 EEE = 0.5 FFF = 0.6 GGG = 0.7
code_presentation 2013B = 540 2013 J = 720 2014B = 180 2014 J = 360 - - -
age_band 0–35 = 0.1 35–55 = 0.2 55≤ = 0.3 - - - -
highest_education A Level or Equivalent = 0.1 HE Qualification = 0.2 Lower Than A Level = 0.3 No Formal quals = 0.4 Post Graduate Qualification = 0.5 - -
final_result Distinction = 0.1 Fail = 0.2 Pass = 0.3 Withdrawn = 0.4 - - -
gender F = 0.1 M = 0.2 - - - - -
APPLIED ARTIFICIAL INTELLIGENCE
e2005298-481
e2005298-482 H. AHMADIAN YAZDI ET AL.
The higher the number of network parameters, the higher its computational
load in the network educating phase. In our proposed network, in the educat
ing phase, the first val accuracy can be observed = 0.95 at epoch = 74 via
a minimum loss of 0.08 at epoch = 115. After completing the educating phase,
we entered the test data as input to the network, and then the obtained final
result was equal to 0.96 with one percentage of generalization.
Figure 6. Results of educating and testing of our proposed network(test-split = 0.3, split-valid
= 0.1).
Table 4. Results of educating and testing of LSTM and BiLSTM networks (n, 12, 1).
accuracy test val_ accuracy accuracy val_loss loss train LSTM
0.79 0.7876 0.7696 0.5544 0.6194
accuracy test val_ accuracy accuracy val_loss loss train BiLSTM
0.70 0.7086 0.6868 1.0078 1.1137
e2005298-484 H. AHMADIAN YAZDI ET AL.
Table 5. Results of educating and testing of LSTM and BiLSTM network (n, 12, 1) and the attention
technique.
accuracy test val_ accuracy accuracy val_loss loss train LSTM & Attention Techniques
0.94 0.9366 0.9423 0.1254 0.1457
accuracy test val_ accuracy accuracy val_loss loss train BiLSTM & Attention Techniques
0.70 0.758 0.7458 0.6675 0.72223
that this part of the architecture performs the general understanding of the
features in the database. Here, to survey the effectiveness of the relationship
between the layers, we implemented and educated two-layer two architectures
LSTM and two-layer Gru. As shown in Table 6, the results of two-layer archi
tectures are more favorable than those of single-layer architectures.
As the BiLSTM deep neural network considers both long-term and short-
term interests of the user, and due to their gradual learning natures, they
support learners’ behavioral changes. Hence, we implemented our proposed
two-layer architecture as a combination of the improved LSTM and BiLSTM
network using the attention technique. As can be seen in Table 7, the results of
our proposed architecture are very acceptable and desirable
According to Figure 6 considering the loss rate, it can be seen that the
number of selected epics is appropriate. Besides, according to the accuracy
diagram, it can be seen that as the val and val accuracy diagrams are almost the
same, so that overfitting does not occur in this experiment.
As shown in Figure 5, we are faced with an unbalanced data set. In these cases,
in addition to the model’s accuracy, it is better to use other evaluation
parameters such as recall and F1 score. To ensure that the model has not
Table 6. Results of educating and testing of two-layer LSTM and GRU networks.
accuracy test val_ accuracy accuracy val_loss loss train LSTM
0.93 0.9265 0.911 0.1622 0.198
accuracy test val_ accuracy accuracy val_loss loss train GRU
0.94 0.9205 0.9066 0.1806 0.2175
Table 7. The result of proposed network in terms of train and test accuracy.
test train
accuracy val_ accuracy accuracy val_loss loss Split-valid Test-split
0.9664 0.9597 0.9511 0.0822 0.1055 0.1 30
0.9674 0.9564 0.9473 0.0909 0.1134 0.2
0.965 0.9581 0.949 0.087 0.1123 0.3
0.9684 0.965 0.9576 0.0704 0.0902 0.1 20
0.9593 0.9516 0.9438 0.102 0.1236 0.2
0.9631 0.9577 0.9484 0.0863 0.1125 0.3
0.964 0.9568 0.9574 0.088 0.1025 0.1 10
0.9611 0.9574 0.9468 0.0878 0.1145 0.2
0.9603 0.9567 0.9462 0.091 0.1175 0.3
APPLIED ARTIFICIAL INTELLIGENCE e2005298-485
intelligently categorized all the data presented into a repetitive class in the
training process to achieve high accuracy. At the end of the test phase, for all
three parameters precision-recall f1-score, the average value is 96%. We
examined the classification accuracy of each group separately and found that
the data of all classes were very well categorized. To evaluate, we have trained
and tested our model with 3 different data sharing modes. In Table 8, as an
example, we have ten cases of the groups that had the lowest frequency of
repetition and ten cases of the groups that had the highest frequency of
repetition from the test results.
As you can see, in all three divisions of 30–70, 20–80 and 10–90 in the
groups with the value support = 1, the result of most recalls and f1-scores is
equal to 1. On the other hand, the number 1 obtained for recalls and f1-score is
very small in the groups with the most members. This shows that our model
has also met the unconventional data challenge in addition to the data volume
challenge.
support f1-score recall precision label support f1-score recall precision label support f1-score recall precision label
1 0.4 1 0.25 530 1 0.4 1 0.25 209 1 1 1 1 326 10 groups with the lowest frequency
1 1 1 1 498 1 1 1 1 228 1 1 1 1 418
1 1 1 1 453 1 1 1 1 547 1 1 1 1 463
2 0.02 0.5 0.01 477 1 0.4 1 0.25 228 1 0.4 1 0.25 391
2 1 1 1 305 1 1 1 1 90 2 1 1 1 446
H. AHMADIAN YAZDI ET AL.
In (Lingyao Yan et al. 2021), the results show that OLS characters can make
the recommendation algorithm more accurate and robust, but as seen in the
first row, the proposed model performed better than both studied methods.
Disclosure Statement
No potential conflict of interest was reported by the author(s).
ORCID
Seyyed Javad Seyyed Mahdavi Chabok http://orcid.org/0000-0003-1856-4579
References
Adomavicius, G., and A. Tuzhilin. 2005. Toward the next generation of recommender systems:
A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge
and Data Engineering 17 (6):734–49. doi:10.1109/TKDE.2005.99.
Aher, S. B., and L. M. R. J. Lobo. 2012. Course recommender system in E-learning.
International Journal of Computer Science and Communication 3 (1):159–64.
Al Mamunur, R., G. Karypis, and J. Riedl. 2008. Learning preferences of new users in
recommender systems: An information theoretic approach. ACM SIGKDD Explorations
Newsletter Journal 10 (2):90–100. doi:10.1145/1540276.1540302.
Alashkar, T., S. Jiang, S. Wang, and Y. Fu (2017, February). Examples-rules guided deep neural
network for makeup recommendation. In Proceedings of the AAAI Conference on Artificial
Intelligence (Vol. 31, No. 1).
Bernhard, T. J. (1997). Challenges and strategies for electrical engineering education. In 27th
Annual Conference Frontiers in Education: Teaching and Learning in an Era of Change (pp.
1459–1462). Pittsburg, Pennsylvania, USA. doi:10.1109/FIE.1997.632708.
Bhojak, H., S. Jain, and V. Muralidharan (2012). Instructor guided personalization of
learningpath-adopting SCORM. In 4th International Congress on Engineering
Education – ImprovingEngineering Education: Towards Sustainable Development.
New York, USA. doi:10.1109/ICEED.2012.6779280.
Bourkoukou, O., and E. El Bachari. 2018. Toward a hybrid recommender system for e-learning
personnalization based on data mining techniques. JOIV: International Journal on
Informatics Visualization 2 (4):271–78. doi:10.30630/joiv.2.4.158.
APPLIED ARTIFICIAL INTELLIGENCE e2005298-489
Bourkoukou, O., and O. Achbarou. 2018. Weighting based approach for learning resources
recommendations. JOIV: International Journal on Informatics Visualization 2 (3):104–09.
doi:10.30630/joiv.2.3.124.
Cayzer, S., and U. Aickelin (2002). A Recommender System based on the Immune Network.In
Congress on Evolutionary Computation CEC2002, (pp. 807–813), Honolulu, USA.
doi:10.1109/CEC.2002.1007029.
Chen, C., P. Zhao, L. Li, J. Zhou, X. Li, and M. Qiu (2017a, April). Locally connected deep
learning framework for industrial-scale recommender systems. In Proceedings of the 26th
international conference on World Wide Web companion (pp. 769–770).
Chen, J., H. Zhang, X. He, L. Nie, W. Liu, and T. S. Chua (2017b, August). Attentive
collaborative filtering: Multimedia recommendation with item-and component-level
attention. In Proceedings of the 40th International ACM SIGIR conference on Research
and Development in Information Retrieval (pp. 335–344).
Chin Ming, H., C. Chih Ming, and C. Mei Hui (2005). Personalized Learning Path Generation
Approach for Web-based Learning. In 4th WSEAS International Conference on E-activities
(pp. 62–68). Miami, Florida, USA.
Chin Ming, H., C. Chih Ming, C. Mei Hui, and C. Shin Chia (2007). Intelligent web-basedtutoring
system with personalized learning path guidance. In 7th IEEE InternationalConference on
Advanced Learning Technologies, ICALT 2007 (pp. 512–516). Nigata, Japan.
Covington, P., J. Adams, and E. Sargin. (2016). Deep neural networks for youtube
recommendations. In Recsys. 191–98.
Cremonesi, P., F. Garzotto, S. Negro, A. V. Papadopoulos, and R. Turrin (2011). Looking for
good Recommendations: A comparative Evaluation of Recommender Systems. In
P. Campos, N. Graham, J. Jorge, N. Nunes, P. Palanque, and M. Winckler (Eds.), 13th
IFIP TC 13 International Conference INTERACT,2011,(pp. 152–168).Lisbon, Portugal.
doi:10.1007/978-3-642-23765-2_11.
De Gemmis, M., Iaquinta, L., Lops, P., Musto, C., Narducci, F., and Semeraro, G. (2009).
Preference learning in recommender systems. Preference Learning, 41, 41-55.
El-Bishouty, M. M., T. W. Chang, S. Graf, and N. S. Chen. 2014. Smart e-course recommender
based on learning styles. Journal of Computers in Education 1 (1):99–111. doi:10.1007/
s40692-014-0003-0.
Fleming, N. D. (1995, July). I’m different; not dumb. Modes of presentation (VARK) in the
tertiary classroom. In Research and development in higher education, Proceedings of the
1995 Annual Conference of the Higher Education and Research Development Society of
Australasia (HERDSA), HERDSA (Vol. 18, pp. 308–313).
Gallego, D. L., E. Barra, A. Gordillo, and G. Huecas (2013).Enhanced recommendationsfor
e-learning authoring tools based on a proactive context-aware recommender. In IEEE
Frontiers in Education Conference (pp. 1393–1395). Oklahoma City, USA.doi:10.1109/
FIE.2013.6685060.
Gong, Y., and Q. Zhang (2016). Hashtag recommendation using attention-based convolutional
neural network. In IJCAI, 2782–88.
Gulzar, Z., A. A. Leema, and G. Deepak. 2018. Pcrs: Personalized course recommender system
based on hybrid approach. Procedia Computer Science 125:518–24. doi:10.1016/j.
procs.2017.12.067.
Hidasi, B., A. Karatzoglou, L. Baltrunas, and D. Tikk (2016). Session-based recommendations
with recurrent neural networks. In Proceedings of the International Conference on Learning
Representations, San Juan, Puerto Rico, 2–4 May 2016, 1–10.
Hoic-Bozic, N., M. HolenkoDlab, and V. Mornar. 2016. Recommender system and web 2.0
Tools to enhance a blended learning model. IEEE Transactions on Education 59 (1):39–44.
doi:10.1109/TE.2015.2427116.
e2005298-490 H. AHMADIAN YAZDI ET AL.
Hu, B., C. Shi, W. X. Zhao, and P. S. Yu (2018, July). Leveraging meta-path based context for
top-n recommendation with a neural co-attention model. In Proceedings of the 24th ACM
SIGKDD International Conference on Knowledge Discovery & Data Mining (pp. 1531–40).
Huang, Z., D. Zeng, and H. Chen. 2007. A comparison of collaborative-filtering recommenda
tion algorithms for ecommerce. IEEE Intelligent Systems 22 (5):5. doi:10.1109/
MIS.2007.4338497.
Huimin, Q., C. Ming, and X. Mingming (2010, May). A personalized resource recommenda
tion system using data mining. In 2010 International Conference on E-Business and
E-Government (pp. 5365–5368). IEEE.
Imran, H., M. Belghis-Zadeh, T. W. Chang, and S. Graf. 2016. PLORS: A personalized learning
object recommender system. Vietnam Journal of Computer Science 3 (1):3–13. doi:10.1007/
s40595-015-0049-6.
Kuzilek, J., M. Hlosta, and Z. Zdrahal, Open university learning analytics dataset, https://www.
kaggle.com/vjcalling/oulad-open-university-learning-analytics-dataset
Latha, C. B. C., and E. Kirubakaran. 2013. Personalized learning path delivery in webbased
educational systems using a graph theory based approach. Journal of American Science
9 (12):981–92.
Lu, J., A personalized e-learning material recommender system, Macquarie Scientific
Publishing, pp. 374–379, 2004.
Manouselis, N., H. Drachsler, K. Verbert, and E. Duval. 2012. Recommender Systems for
Learning. Springer Science & Business Media.
Marian, C. M., P. S. Popescu, and I. Costel (2015). Intelligent tutor recommender system for
on-line educational environments. In 8th International Conference on Educational Data
Mining (pp. 516–519). Madrid, Spain.
Mubarak, A. A., H. Cao, and S. A. Ahmed. 2021. Predictive learning analytics using deep
learning model in MOOCs’ courses videos. Education and Information Technologies
26 (1):371–92. doi:10.1007/s10639-020-10273-6.
Muthukumar, V., and N. Bhalaji. 2020. MOOCVERSITY-deep learning based dropout predic
tion in MOOCs over weeks. Journal of Soft Computing Paradigm (JSCP) 2 (3):140–52.
doi:10.36548/jscp.2020.3.001.
Negnevitsky, M. 2005. Artificial intelligence – A Guide to Intelligent Systems. London: Addison-
Wesley, Pearson Education Ltd.
Onah, D. F., and J. E. Sinclair (2015, March). Massive open online courses: An adaptive
learning framework. In Proceedings of the 9th International Conference on Technology,
Education and Development (pp. 1258–1266).
Qiao, C., and X. Hu (2018, July). Discovering student behavior patterns from event logs:
Preliminary results on a novel probabilistic latent variable model. In 2018 ieee 18th inter
national conference on advanced learning technologies (ICALT) (pp. 207–211). IEEE.
Romero, C., S. Ventura, and E. García. 2008. Data mining in course management systems:
Moodle case study and tutorial. Computers & Education 51 (1):368–84. doi:10.1016/j.
compedu.2007.05.016.
Santos, C. O., and G. J. Boticario. 2011. Requirements for semantic educational recommender
systems in formal e-learning scenarios. Open Access Journal: Algorithms 4 (1):131–54.
doi:10.3390/a4030131.
Sengupta, S., S. Sahu, and R. Dasgupta. 2011. Construction of learning path using ant colony
optimization from a frequent pattern graph. International Journal of ComputerScience
8 (6):314–21.
Shishehchi, S., S. Y. Banihashem, N. A. M. Zin, and S. A. M. Noah (2011, June). Review of
personalized recommendation techniques for learners in e-learning systems. In 2011 inter
national conference on semantic technology and information retrieval (pp. 277–281). IEEE.
APPLIED ARTIFICIAL INTELLIGENCE e2005298-491
Sunil, L., and D. Saini (2013). Design of a Recommender System for Web Based Learning. In
World Congress on Engineering WCE2013 (pp. 3–8). London, England.
Tejeda-Lorente, A., J. Bernabe-Moreno, C. Porcel, P. Galindo-Moreno, and E. Herrera-
Viedma. 2015. A dynamic recommender system as reinforcement for personalized educa
tion by a fuzzy linguistic web system. Elsevier - Procedia ComputerScience 55 (1):1143–50.
doi:10.1016/j.procs.2015.07.084.
Wang, J., L. Yu, W. Zhang, Y. Gong, Y. Xu, B. Wang, . . . D. Zhang (2017, August). Irgan:
A minimax game for unifying generative and discriminative information retrieval models.
In Proceedings of the 40th International ACM SIGIR conference on Research and
Development in Information Retrieval (pp. 515–24).
Wang, Q., H. Yin, Z. Hu, D. Lian, H. Wang, and Z. Huang (2018, July). Neural memory
streaming recommender networks with adversarial training. In Proceedings of the 24th
ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (pp.
2467–75).
Xu, Z., T. Lukasiewicz, C. Chen, Y. Miao, and X. Meng (2017, August). Tag-aware personalized
recommendation using a hybrid deep model. AAAI Press/International Joint Conferences
on Artificial Intelligence.
Yago, H., J. Clemente, D. Rodriguez, and P. Fernandez-de-cordoba. 2018. On-smmile:
Ontology network-based student model for multiple learning environments. Data &
Knowledge Engineering 115:48–67. doi:10.1016/j.datak.2018.02.002.
Zhang, H., T. Huang, L. V. Zhihan, S. Liu, and H. Yang. (2018). MOOCRC: A highly accurate
resource Recommendation model for use in MOOC environments. Springer, Mobile
Networks and Applications.
Zhang, S., L. Yao, and A. Sun (2017). Deep learning based recommender system: A survey and
new perspectives. arXiv, arXiv:1707.07435.
Zheng, L., V. Noroozi, and Y. Ph. (2017). Joint deep modeling of users and items using reviews
for recommendation. In WSDM.