Intelligent Tutoring System Using Expert Knowledge and Kohonen Maps With Automated Training
Intelligent Tutoring System Using Expert Knowledge and Kohonen Maps With Automated Training
https://doi.org/10.1007/s00521-020-04767-0 (0123456789().,-volV)(0123456789().
,- volV)
ORIGINAL ARTICLE
Abstract
This paper presents an intelligent tutoring system (ITS) model that is capable of driving the didactic transposition of
contents. Initially, the tutoring system reactions bases its behavior on rules defined by an expert teacher; after this, a neural
network that learns from the student’s behavior when they are studying adjusts these rules. This way, the neural network
improves the teacher’s rules and, consequently, defines a learning strategy that is more adaptive and reactive to the
student’s profile. Thus, it is possible to offer the student a personalized and individualized education form. The model is
able to guide the student throughout the didactic transposition of contents, aiding the consolidation of desired competencies
established on educational propositions. This work shows the development process of the ITS, including the expert
guidance system and the hybrid system, which improves the expert rules from SOM neural network use. The obtained
results indicate that the application of hybrid technology in ITSs is feasible because, for defining the teaching strategies, it
incorporates the teacher’s knowledge and by neural network use, it assimilates the students’ learning process behavior. The
results show that proposed model has great agreement between the actions of the ‘‘ITS’’ and the students’ actions. The
model showed satisfactory performance when compared to other systems proposed in the literature that use connectionist
approach in its conception.
Keywords Intelligent tutoring systems Hybrid ITS Didactic transposition of contents Artificial neural networks
Self-organizing maps
1 Introduction
123
Neural Computing and Applications
personalized [10]. Intelligent tutoring systems apply sev- assimilate knowledge from the content. The content is
eral AI techniques in order to enable different forms of related to the knowledge that is conveyed, is what the
learning as adaptive, contextual, directed and flexible, student will learn [8, 24, 25].
according to student’s profile [31]. ITS using symbolic Face-to-face interaction is still the oldest and most
paradigms of AI, and a mediation of the teacher, has shown widely used method to transmit knowledge. A key aspect
promising results [11, 27, 33]. Others paradigms as con- of this model, within the physical environment where the
nectionist and nonlinear paradigms have also been used in teaching–learning process takes place, is the simultaneous
the design of intelligent tutoring systems and showing presence of both the receiving (student) and the transmit-
satisfactory results [3, 19]. ting (instructor) agents. In this kind of education, the
To each one of these used techniques in an ITS, there are knowledge transmission process lies in the interaction
several open problems to reach a ‘‘perfect’’ system. When between teacher and student; the teacher guides such pro-
an ITS uses connectionist paradigms, it is necessary to cess according to his or her perception of the student’s
determine correct parameters to obtain a good system reactions [12]. Operational and economic limitations, along
performance. It is hard to set the number of layers, the with the impossibility of individualized contact between
number of neurons by layers and others parameters to the teacher and the several students, have resulted in seri-
achieve an appropriate performance. However, a frequent ous imbalances in the outcome of the teaching–learning
and complex problem is to adapt ITS neural network to process.
students’ skills. This issue comes out because ITS only has With the expansion of technology and the assumption
contact with students after the course starts. Using an ITS that part of the teaching–learning process may not require
with symbolic paradigm, it will behave by a prior knowl- the physical presence of teachers, educators and psychol-
edge inserted in the system. An expert can insert this ogists developed the field of research and educational
knowledge, and in this case, ITS will ignore the particular technology called ‘‘Distance Learning’’ (DL) [7, 13]. This
students’ skills. field of research is an approach without the limitations
Despite all problems, the benefits of using information imposed by a face-to-face teaching environment. DL
technology for aiding teaching activities have increasingly allows a greater dissemination of knowledge; thus, it pro-
motivated the interest for researches in virtual environ- motes the education popularization [15].
ments for education [o]. In this context, the use of com- In a face-to-face teaching environment, the teacher
puter has supported teaching in the classroom and in interacts with students during the presentation of content,
distance learning environments. Currently, virtual univer- identifies their difficulties and offers options to conduct the
sities are an actuality, with proposals of training, under- learning process. In DL environment, the interaction
graduate and postgraduate levels of education [27]. between teacher and student occurs throughout the process
Thus, this work’s goal is to propose and implement a of generating the educational contents [12]. Due to its
hybrid ITS model. This model will allow an expert teacher, asynchronous aspect, DL does not enable immediate
aided by a self-organizing map (SOM) artificial neural interaction between teacher and student when learning
network, create an optimized form to the didactic trans- problems arise. Thus, its delay in the teacher’s intervention
position of contents [6]. The propose is to use the good jeopardizes the student’s learning. The interaction between
characteristics of two stated approaches and, as contribu- the different elements involved in a non-face-to-face edu-
tion of this work, increase a retraining process of neural cational process has established disciplinary ties between
network to adapt ITS with particular students’ skills of a the Exact Sciences and the Humanities [12].
specific course. In a distance learning environment, interactivity is a
complicating factor. During the delivery of the contents,
there may be a time lapse before the teacher administers his
2 Virtual learning environments pedagogical intervention toward the student’s issues. It is
hard to apply general pedagogical interventions in conse-
The process of transmitting knowledge can be represented quence of the plurality of student skills and other typical
by the combined actions of teacher and student, with the situations of a distance learning environment. An learning
purpose of assimilating knowledge. Three elements take analytics about the practice in virtual learning environ-
part in this representation: the teacher, the student and the ments is important to reflect on the challenges of this
delivered content. Students are the interacting part to which environment [30].
all the effort of the process is directed, in order to develop This context requires from distance learning systems the
his or her skills. The teacher is the key agent of the learning capability of perceive students’ skills and changes itself to
process. He is responsible for giving the direction and offer the content in the best way. The use of artificial
organization of the actions that enable students to intelligence techniques can aid these systems to perform
123
Neural Computing and Applications
2.1 Symbolic approach to intelligent tutoring Teuvo Kohonen developed the self-organizing map neural
network, or SOM, in the 1980s [16]. The topological map
The most traditional ICAI models use symbolic AI. These observed in the cerebral cortex inspires its design. In the
systems usually comprise in five parts: the domain basis, SOM network, neurons that are near one each other on the
the student model, teaching strategies, interface control map layout are responsible for similar functions, as occurs
module and the user (the student). Figure 1 shows as these in the brain of more highly evolved animals.
five parts interact. The self-organizing maps are composed by two layers,
The domain basis is the component that acts as the the input layer and a second layer that is the own classi-
expert; it is the origin of all knowledge presented in the fying map. In the input layer, there are several sensing
course. The student model represents the knowledge and neurons to perceive each signal from the external world.
cognitive skills of the user at any given time. The teaching This RNA model uses to training a competitive learning
strategies encompass the pedagogical model in itself. The algorithm, where the neurons on the output layer compete
interface is responsible for the presentation of instructional to define which one has the greatest similarity with the
123
Neural Computing and Applications
presented pattern. However, there are no physical con- 2.3.1 Map topological preservation
nections between them. A neighborhood relationship
determines the map topology. There are many possible This metric describes if after the training, the network
metrics to determine the winning neuron. Yet, the most topology maintains the features of the training data set. If
usual way to determine the winner neuron is the Euclidean the network maintains these characteristics, it indicates that
distance. The one that has the smallest Euclidean distance network learned with success. A simple form to measure
in relation to the presented pattern [17] is the winner the topological quality of a SOM network is using the
neuron. average of the distance between the first and the second
In SOM neural network, weights are associated with winner neurons. Thus, the highest is the obtained result and
each neuron and the number of inputs defines the dimen- the lower is the global ordination of the network. Equa-
sion of the weight vector. For example, if one considers tions (1–3) show how to calculate this measure to a two-
three inputs for the colors red, green and blue, there will be dimensional Kohonen map.
a weight vector with three elements. 0 0
h ¼ maxðLvLv j;jCv Cv Þ ð1Þ
The basic algorithm for training an SOM neural network
does not have a high complexity and two phases comprise h; se h [ 1
s¼ ð2Þ
the training process: global organization and fine-tuning 0; otherwise
(convergence). The topological organization of the SOM
1X N
network occurs in the initial cycles; however, as it is a QT ¼ s ð3Þ
stochastic process, the accuracy of the map depends on the N i¼1
number of cycles during the convergence or fine-tuning.
where QT topological quality, Lv line of the winner neuron;
There is no specific rule for choosing the number of iter- 0
Lv line of the second winner neuron, Cv column of the
ations or cycles, but studies have shown that five hundred 0
cycles per neuron are enough to obtain good statistical winner neuron, Cv column of the second winner neuron,
precision [17]. Therefore, the cycles’ number depends on N number of samples in the data set training.
the number of neurons in the network. The ability to group A map topological preservation quality indicates only
similar patterns together makes the SOM network a good the map ordination state of the network. To measure the
decision-making tool for tasks that involve data classifying precision of the network answers, it is necessary other
and clustering, among others. measure called map precision.
This paper considers that navigation form or the learning
strategy defined by SOM neural network maps the students 2.3.2 Map precision
learning profiles. This is the main hypothesis of this work.
A traditional problem when one uses a neural network is This metric shows the accuracy of the neuron answer to the
to define the best network parameters to execute the net- training data set used in training process. A usual way to
work training, as number of the neuron, the number of measure the map precision in a Kohonen network is using
layers and others characteristics. Normally, the most the average of the Euclidian distance sum retrieved from
common approach is to try several configurations until each used sample in training data set. Thus, the lower is the
reach a good result, or at least, a configuration that can result, better is the map precision. Equation (4) shows how
solve the problem at hand. can be applied, using as base the trained network weights
and the training data set.
2.3 Kohonen quality metrics 1X N
QT ¼ xi wji ð4Þ
N i¼1
As the neural network training is a stochastic process, it is
difficult to measure the quality of learning process. Nev- where xi i-th pattern presented to network; and wji winner
ertheless, after the training process, as tutoring system will neuron weight.
use the neural network, it is important to know the quality These measures allow the verification of the training
of the network learning. This work uses two quality mea- process quality, indicating if the chosen network configu-
sures proposed by Carvalho [4]. These measures verify the ration is enough to correct classification.
quality of network training. They are the map topological The unsupervised learning feature allows the SOM
preservation and the map precision. neural network to perform the classification of data sets not
related to their respective classes. The wide variety of
actions and attributes in a learning environment are pre-
sented as a complicating factor in establishing its classifi-
cation. In this work, the SOM neural network presented
123
Neural Computing and Applications
itself as a viable alternative to the task, enabling the teaching strategy. Figure 2 illustrates the basic format of
structuring of the environment data, relating the diversity this design. The guiding navigator is the part of the inter-
of data in appropriate classes to provide a custom and face that is responsible for providing guidelines for the
favorable learning environment to the student [19]. study activity.
Kohonen’s maps were used in this work because it is one of Each proposed course in that structure should have its
the most applied models for unsupervised learning appli- didactic units organized and distributed in several levels,
cations. It is a classic ANN and used in several studies each one aiming to develop different skills.
where the objective is to classify. Works such as A sequence of texts, arranged in five distinct levels,
[1, 9, 23, 29] deal with the analysis of student performance composes the teaching unit, in addition by two tests, an
using SOM networks. initial and a final. The principal levels among those five are
easy, medium and advanced. The other two levels are
auxiliary: frequently asked questions (FAQs) and exam-
3 Didactic transposition of contents guided ples. Each one of the levels has an exercise related to the
by the hybrid model didactic content and to the level.
The students begin the course by performing an initial
This paper presents a proposal of hybrid ITS model. In our test in order to measure their prior knowledge about the
proposed approach, rules mapped from teacher’s knowl- topic. After the initial test, the tutoring system chooses an
edge and an artificial neural network SOM model guide the appropriate difficulty level to the students according to
didactic transposition of contents. The neural network their knowledge degree. After the student study, the
operates by adjusting the teacher’s rules according to how defined level and answer the proposed exercise, the system
the students navigate through the educational contents. shows a navigation page. This page informs to the student
Two phases compose the process of constructing the the teacher’s suggestion to carry on the study process, or,
proposed model. In the first phase, the tutoring system alternatively, which are the possible steps to be taken by
works using only the teaching strategies based on the rules own student. Figure 3 shows the navigation screen.
defined by expert, the teacher. In the second phase, the The teacher’s suggestion, the first option in Fig. 3,
hybrid decision-making alters the pedagogic strategy cre- shows the tutoring systems decision. This option directs the
ated in first phase. The latter phase uses SOM artificial student toward another level within the same topic or, if
neural networks to refine the expert strategy. deemed appropriate by the system, toward another didactic
The first phase approach allows students to learn in a
guided manner, according to the rules of the teacher who
created the course. However, the students have the freedom
to decide about how to navigate over contents, and which
study strategy they will adopt. Concurrently, this model is
also a source of data for training the neural network used in
the hybrid model.
The hybrid guidance is determined according to the
characteristics of student who used the system in its first
phase, where the expert rules define the decision-making
process. The students’ behavior, while conducting their
study activities, offers information to the tutoring system to
adapt itself, resembling human interaction. Thus, the
hybrid model acts from a domain basis composed by expert
rules and by neural network rules.
123
Neural Computing and Applications
unit. Even after the students visit other topic levels, if they 3.2 The use of hybrid guidance in the didactic
wish to advance to another unit, the ITS will always lead to transposition of contents
the most appropriate level according to each student’s
profile. The ITS with hybrid guidance used for the didactic trans-
At the beginning of the navigation, the tutoring system, position of contents follows the same organization of
based on the score of the initial test and on the rule spec- didactic units shown in Fig. 2, i.e., an initial and final test,
ified by the teacher, defines the most appropriate level to be easy, medium and advanced levels, FAQs and examples.
selected. For navigating through the remaining units, the The greatest difference between this model and the one
system determines the next level to be followed according shown in the previous section is the teaching strategy. In
to the student’s global performance. This performance the hybrid ITS model, the decision-making provided by the
score is defined by the average score of all answered SOM neural network complements the guidance decision-
exercises and the rules defined by the teacher. making that already exists within the expert model. Thus,
In the expert guidance model, the teacher is responsible the purpose of this network is to adjust the teaching strat-
for defining a teaching strategy that will guide the student egy that guides the students throughout the didactic trans-
throughout the didactic transposition of contents. This position that is best suited to their profile and performance.
means that the teacher should define the strategy for nav- Following the same rationale defined for the expert
igating to different units and through the different levels guidance model described in the section before, the hybrid
within a unit. model ITS also presents to the student an initial and final
Aiming to facilitate the teacher’s work in generating of test. However, in this model, the system’s guidance defines
the content, the proposed system suggests the criteria for the guidance process according to the decisions of both
local navigation or expert decision, based on the research system structures, the neural network and the expert
presented [22]. That research considers the same five levels knowledge base. In this case, the neural network operates
used in the structure of our proposed tutoring system, by adjusting the expert’s teaching strategy based on the
without, however, the evaluate process of the contents at learning experience of students that used the expert system
the end of each topic. before. The characteristics of texts and exercises remain
In our proposed approach, the teachers, as the ones in unchanged in the hybrid ITS, i.e., they are the same as in
charge of defining the contents for their courses, have the expert model. Thus, both systems may use the same
complete freedom to modify the expert guidance according interface layouts, colors, texts, etc.
to their own experience or specificity of the topic at hand. A single neural network is responsible for guiding the
Figure 4 shows a simplified map of the expert suggestions student’s navigation in the hybrid ITS. After reading the
depending on the student’s success or failure to a given contents of a didactic unit and answering its respective
local exercise. exercise, the tutoring system submits collected data to the
Therefore, if the student fails in medium-level exercise, neural network for the decision-making process to take
for instance, the tutoring system will perceive in the new place. The neural network guides the transposition between
classifying process and will create a sequence of possible content levels, whereas the expert decision guides the
suggestions to continue the study. transposition between different didactic units.
The proposed ITS, through the expert decisions descri- As occurs in the expert guidance ITS, every step of the
bed in this section, may be used immediately after the student that used the hybrid system is stored in a database.
creation of the course, using only that knowledge basis to The system updates the ITS neural network just after the
help the learning process. At the same time, the tutoring end of data collect process. Thus, the ITS uses the new
system, with its hybrid model, acts gathering data for neural network status to decide about the next step that
training its neural networks. Next section describes how student should be guided.
this process works. These data can also be used later to improve the per-
formance of the neural network if they are used as the data
set for a new training stage. The collected data permit to
123
Neural Computing and Applications
analyze the students’ behavior along their studying neighborhood reducing tax, the neighborhood shape and
activities. the learning tax [17].
The hybrid ITS supports any course or content, but it is The proposed model in this work uses an algorithm to
necessary to use the same distribution structure levels that execute an automatic training. This algorithm minimizes
allow the didactic transposition. After the creation of a new the necessity of a neural network professional to define the
course in this style, one could insert it on the tutoring best configuration to SOM network. To define the ideal
system. Furthermore, the automation implemented for network structure, the algorithm training several bi-di-
training the neural networks enables immediate utilization mensional SOM network, each one with different config-
of the new course. The training of neural network can uration parameters. After this, the algorithm selects the best
happen at any time, without assistance from a neural net- network. This optimized network must have the best
work specialist. topological preservation map and precision map [4].
Aiming guarantee the generalization in the training pro-
3.3 Defining the best network configuration cess, the algorithm also defines the number of neurons
based in the numbers of training patterns. The tutoring
The neural network training process is a complex task, system repeats this process to each new course inserted in
independently from the adopted model. This task requires the system. This model of training and the tutoring system
from the engineer a high knowledge and experience to architecture allows the use of several different SOM net-
define the adequate configuration, which depending on the work in the system, one to each course. After the training
treated problem. In particular, in the SOM networks, some parameters definition and the network training, the system
factors are responsible to its performance after the training builds the neuron labeling process.
as the neuron number, the initial weight values, the
123
Neural Computing and Applications
In its simplest form, the SOM neural network has only one This work used the RubyÒ technology to build both
layer. This layer shows which neuron is the winning one, structures, the hybrid system and the expert tutoring sys-
i.e., the one that most adapted to a given pattern. In this tem, which operate on the server side. This technology
work, each neuron must be able to choose one among five brings dynamism to web pages and allows access to the
different levels of guidance (five predefined classes). For database stored on a PostgreSQL service. The values of
instance, from the medium level of a particular didactic temporal and behavior variables (latency, satisfaction, etc.)
unit, students can visit the easy level of that unit, the are retrieved with JavaScript. This ensures greater accuracy
medium level of the next topic, the advanced level, FAQs and fidelity of the records because the JavaScript works on
or examples. the client side (browser), thus avoiding the delays that are
In order to enable neurons to make various decisions, inherent to the communication.
this work developed a labeling process. This process The choice of technology used for developing the
associates with each neuron a decision probability vector. described systems follows the philosophy of free software,
The algorithm shown in Fig. 5 illustrates the labeling thus minimizing investments with proprietary software.
process used in this work. In the proposed algorithm, all A Xen-based cloud computing solution hosts the systems,
neurons in the network have an updated label that indicates which enables server virtualization with high performance.
the next step. This is possible by insertion of a labeling Figure 6 shows a screenshot of the proposed intelligent
process. tutoring system.
This process associates with each neuron a vector that
contains the five indication levels. The labeling process
works by updating, at each neuron, of the level (label) that 4 Experiments and results
is equivalent to the next visit made by the student, which
generated that pattern. For conducting the experiment, this work developed two
This updating process adds at neuron’s indication point systems with similar software interfaces: an expert navi-
(equivalent to the forthcoming visit of the pattern), the gation system and a hybrid navigation system. The dif-
similarity value (Euclidean distance) between the presented ference between these two systems is the decision
pattern and the weights of the neuron. This process labels mechanism for choosing the more convenient level from
every neuron in the network; however, the neuron must the contents organization. The hybrid navigation system
indicate the level where similar patterns are most repeated applies the modeling techniques proposed and already
(strengthened points). described in this paper.
The labeling process occurs only after the training stage
of the neural networks, which is the moment when every 4.1 Data collection
mapped example is available to the network. The mapping
strengthens the next step that the neuron should indicate In order to compile the experimental results, this work
according to the next step chosen by the (successful) stu- created four data sets to both approaches, the expert system
dent that generated that pattern. and the hybrid guiding system. Fifty-four students from
early years of vocational courses in Mechanics, Computer
Science and Chemistry from the Institute Federal de Goiás,
123
Neural Computing and Applications
in the federal state of Goiás, Brazil, participated in those created for the experiment has as the theme the research
experiments. presented in this paper and information about the system’s
Before the students began navigating the systems to navigation modes.
generate the experiments data, they watched an explanation
about the main characteristics of each system. The main 4.2 Training set
goal of these explanations was to minimize problems the
students could face during their navigation. They also had This work used the data of twenty-two students to compose
information about the software, such as the structure, the the set of examples used for training the neural networks.
development environment and the average time for com- The initial and final tests consisted of fifteen questions each
pletion of the proposed course. Furthermore, the course one and resulted in 660 answer records. Each of these
123
Neural Computing and Applications
123
Neural Computing and Applications
123
Neural Computing and Applications
5 Conclusions
123
Neural Computing and Applications
123