Teaching Evaluation System by Use of Machine Learning and Artificial Intelligence Methods
Jingjing Hu
Zhejiang Business College, Hangzhou, China
[email protected]
1 Introduction
technology would promote the reform and innovation of college teaching and
establish an education and teaching system suitable for students’ lifelong
development, which helps education become more advanced and precise [3].
The teaching evaluation system helps schools judge teachers’ teaching effect
and students’ learning outcomes, and it serves as the evidence many universities use
to assess teachers’ teaching process. However, the current education evaluation system
fails to reflect how teachers teach under the new technology, in the following
respects [4]. I. The evaluation method is outdated and inefficient, and
the data credibility is low. II. The evaluation indexes are incomplete, and there is no
evaluation content for the multimedia teaching mode. III. The weight distribution of
the evaluation indexes is unreasonable and lacks objectivity and fairness. IV. The
later-stage analysis and processing of the data is deficient, so
practical information cannot be extracted. Moreover, implementing the teaching
evaluation process is complex and requires massive data calculation.
Therefore, it is urgent to establish an objective, efficient, and feasible teaching
evaluation system and evaluation optimization process.
In this work, a teaching evaluation system based on machine learning is designed.
Firstly, correlation analysis is performed on the acquired evaluation data. Secondly,
association rules are applied to determine the relationships between the indexes in
teaching evaluation. Finally, a machine learning algorithm is adopted to optimize
the data processing and build the teaching evaluation model, so as to automate
teaching evaluation.
2 Methods
88 http://www.i-jet.org
Paper—Teaching Evaluation System by use of Machine Learning and Artificial Intelligence Methods
[Figure: system flowchart. Recoverable node labels: User Login, Algorithm Training, Data Collection, Evaluation Model, Extract Attribute Features, Submit To, Leave]
An index is a stipulation on one aspect of a specific target and reflects the
characteristics of that aspect. Therefore, the overall characteristics should not be
judged from a single index; instead, the relationships between the indexes should be
handled, and a series of related index combinations should be set up to completely
reflect the characteristics of the target. The evaluation of each teaching content in
Fig. 2 is described as 1, 2, 3, 4, or 5, corresponding to unqualified, qualified,
medium, good, and excellent, respectively. The letters A, B, C, ..., Q, and R
correspond to the 18 evaluation indexes in Fig. 2, and the letter T represents the
comprehensive evaluation score. Each column of Table 1 contains a complete teaching
evaluation record, in which each letter is paired with the evaluation value of the
corresponding evaluation index.
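The record layout described above can be illustrated with a minimal Python sketch. The function name `make_record` and the dictionary layout are assumptions made for illustration; only the five rating levels, the index letters A to R, and the comprehensive score T come from the text.

```python
import string

# Rating levels as described in the text: 1..5 map to these labels.
LEVELS = {1: "unqualified", 2: "qualified", 3: "medium", 4: "good", 5: "excellent"}

# Letters A..R stand for the 18 evaluation indexes; T is the comprehensive score.
INDEX_LETTERS = list(string.ascii_uppercase[:18])


def make_record(index_scores, total):
    """Build one evaluation record (hypothetical layout: letter -> score)."""
    if len(index_scores) != len(INDEX_LETTERS):
        raise ValueError("expected one score per evaluation index (18 in total)")
    for score in list(index_scores) + [total]:
        if score not in LEVELS:
            raise ValueError(f"score {score!r} is not one of the levels 1..5")
    record = dict(zip(INDEX_LETTERS, index_scores))
    record["T"] = total
    return record


# Illustrative record with made-up scores.
example = make_record([4, 5, 3, 4, 4, 5, 3, 4, 5, 4, 4, 3, 5, 4, 4, 5, 3, 4], total=4)
```

Validating every score against the five levels at construction time keeps malformed questionnaire rows out of the later correlation and classification steps.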
$$
R = \begin{pmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & \ddots & \vdots \\ a_{m1} & \cdots & a_{mn} \end{pmatrix} \tag{2}
$$
The correlation analysis relies on association rules to mine strong rules that
reveal the actual relationships between evaluation indexes. An association rule
expresses a correlation between two sets 𝑋 and 𝑌, and its strength
is determined by the confidence 𝑐 and the support 𝑠 [9].
The confidence 𝑐 measures the credibility of the rule. It is defined as the
ratio of the number of events containing both 𝑋 and 𝑌 to the number of events
containing 𝑋 in database 𝐴; that is, at least 𝑐% of the events in the event
database that contain 𝑋 also contain 𝑌. The expression is shown in equation (3).
The support 𝑠 refers to the statistical importance of the rule in the entire data
set. It is the ratio of the number of events containing 𝑋 in the event
database 𝐴 to the total number of events, indicating that at least 𝑠% of the events in
the event database 𝐴 contain 𝑋, as shown in equation (4).
The association rule equations are applied to the processed data set to
obtain strong rules between the evaluation indexes. The correlation analysis process
is shown in Fig. 3.
[Fig. 3: flowchart of the correlation analysis process: calculate s and c, then judge whether s >= min_s and c >= min_c; if yes, the rule is a strong association rule]
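The support and confidence checks can be sketched in a few lines of Python. This is a minimal illustration, not the paper's implementation: the function names, the item strings such as "B=5", and the sample events are hypothetical, and support follows the definition given in the text (events containing 𝑋 over all events).

```python
def support(events, x):
    """Share of events that contain every item of x (definition used in the text).

    Note: many references define the support of a rule X -> Y with the joint
    count of X and Y instead; the text's definition is followed here.
    """
    x = set(x)
    return sum(1 for event in events if x <= set(event)) / len(events)


def confidence(events, x, y):
    """Share of the events containing x that also contain y."""
    x, y = set(x), set(y)
    n_x = sum(1 for event in events if x <= set(event))
    if n_x == 0:
        return 0.0
    n_xy = sum(1 for event in events if (x | y) <= set(event))
    return n_xy / n_x


def is_strong_rule(events, x, y, min_s, min_c):
    """A rule is strong when s >= min_s and c >= min_c."""
    return support(events, x) >= min_s and confidence(events, x, y) >= min_c


# Hypothetical events: each event holds "index=score" items of one record.
events = [
    {"B=5", "M=5"},
    {"B=5", "M=5"},
    {"B=5", "M=4"},
    {"B=3", "M=3"},
]
s = support(events, {"B=5"})            # 3 of 4 events contain B=5
c = confidence(events, {"B=5"}, {"M=5"})  # 2 of those 3 also contain M=5
strong = is_strong_rule(events, {"B=5"}, {"M=5"}, min_s=0.3, min_c=0.5)
```

Encoding each index and score pair as one item lets ordinary set inclusion decide whether an event contains 𝑋 or 𝑌.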
Through data acquisition, processing, and correlation analysis, a scientific and
effective set of evaluation indexes is designed for the teaching evaluation model.
Moreover, the correlation between the indexes is analyzed to eliminate large
deviations in the data, which optimizes the teaching evaluation system.
In equation (6), 𝑝(𝐶𝑗 ) is the class prior probability, which is estimated from a
large training data set and is calculated as follows.
$$
p(C_j) = \frac{s_j}{s} \tag{7}
$$
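As a small illustration of equation (7), the prior can be computed by counting labels. The sketch assumes that s_j is the number of training samples labelled with class C_j and s is the total number of training samples; the function name and sample labels are hypothetical.

```python
from collections import Counter


def class_priors(labels):
    """Return p(C_j) = s_j / s for every class label (assumed meaning of s_j and s)."""
    counts = Counter(labels)   # s_j for each class
    total = len(labels)        # s
    return {label: n / total for label, n in counts.items()}


# Made-up comprehensive scores (T) of four training records.
priors = class_priors([5, 4, 4, 3])
```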
In equation (10), count denotes the number of samples whose attribute 𝐴𝑖 takes the
value 𝑎𝑘 and which belong to category 𝐶𝑗. The attribute weight is calculated as
follows.
$$
w(A_i, a_k, j) = \frac{p(A_i \mid \mathit{rel})}{p(A_i \mid \mathit{norel})} \tag{12}
$$
In dataset 𝐴, if there are 𝑚 class labels, 𝑛 attributes, and 𝑘 possible values for each
attribute, the total number of attribute weights is 𝑚 × 𝑛 × 𝑘. The same attribute
therefore has different weights for different values and class labels. According to
the specific value of each attribute, the weight associated with the current class
label is selected for calculation, and the resulting values of the categories are
compared. The category corresponding to the maximum value is the classification result.
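The selection of a per-class, per-value weight followed by an arg-max over categories can be sketched as below. This is a sketch under stated assumptions, not the paper's implementation: "rel" is read as "the sample belongs to 𝐶𝑗" and "norel" as "it does not", Laplace smoothing is added to avoid zero probabilities, and the weight is assumed to multiply the conditional probability (the paper's combination rule is not reproduced in this excerpt). The class name `WeightedNB` and the toy data are hypothetical.

```python
import math
from collections import Counter


class WeightedNB:
    """Naive Bayes with one weight per (attribute, value, class) triple."""

    def fit(self, rows, labels, n_values=5):
        self.classes = sorted(set(labels))
        self.n_values = n_values            # k possible values per attribute
        self.n = len(labels)
        self.class_count = Counter(labels)
        self.count = Counter()              # (i, a, c): samples of class c with A_i == a
        self.value_count = Counter()        # (i, a): samples with A_i == a, any class
        for row, c in zip(rows, labels):
            for i, a in enumerate(row):
                self.count[(i, a, c)] += 1
                self.value_count[(i, a)] += 1
        return self

    def cond(self, i, a, c):
        """Laplace-smoothed p(A_i = a | C_j), read here as p(A_i | rel)."""
        return (self.count[(i, a, c)] + 1) / (self.class_count[c] + self.n_values)

    def cond_not(self, i, a, c):
        """Laplace-smoothed p(A_i = a | not C_j), read here as p(A_i | norel)."""
        other = self.value_count[(i, a)] - self.count[(i, a, c)]
        return (other + 1) / (self.n - self.class_count[c] + self.n_values)

    def weight(self, i, a, c):
        """w(A_i, a_k, j) as the ratio of the two probabilities above."""
        return self.cond(i, a, c) / self.cond_not(i, a, c)

    def predict(self, row):
        def score(c):
            # Assumption: the weight multiplies the conditional probability,
            # so in log space it is added to the log-probability.
            total = math.log(self.class_count[c] / self.n)
            for i, a in enumerate(row):
                total += math.log(self.weight(i, a, c)) + math.log(self.cond(i, a, c))
            return total
        return max(self.classes, key=score)


# Toy data with two attributes and two classes (made up for illustration).
rows = [[5, 5], [5, 4], [1, 1], [1, 2], [5, 5], [1, 1]]
labels = ["good", "good", "poor", "poor", "good", "poor"]
model = WeightedNB().fit(rows, labels)
```

Working in log space avoids numerical underflow when many attribute probabilities are multiplied together.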
Based on Python and the PyCharm platform, correlation analysis is performed on 440
pre-processed teaching evaluation records. The correlation coefficients
between evaluation indexes are calculated from the experimental data. Table 2
shows the maximum correlation coefficient for each evaluation index.
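A minimal sketch of how the maximum correlation coefficient per index could be computed is given below. It assumes the Pearson coefficient, which the text does not specify; the function names and the three short score columns are hypothetical.

```python
import math


def pearson(u, v):
    """Pearson correlation coefficient of two equally long score lists."""
    n = len(u)
    mean_u, mean_v = sum(u) / n, sum(v) / n
    cov = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    norm_u = math.sqrt(sum((a - mean_u) ** 2 for a in u))
    norm_v = math.sqrt(sum((b - mean_v) ** 2 for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # a constant column has no defined correlation
    return cov / (norm_u * norm_v)


def max_correlations(columns):
    """For each index letter, return (partner, r) with the largest |r|."""
    result = {}
    for a in columns:
        best = None
        for b in columns:
            if a == b:
                continue
            r = pearson(columns[a], columns[b])
            if best is None or abs(r) > abs(best[1]):
                best = (b, r)
        result[a] = best
    return result


# Made-up scores for three indexes over five records.
columns = {
    "B": [5, 4, 3, 5, 2],
    "M": [5, 4, 3, 4, 2],
    "A": [3, 4, 4, 2, 4],
}
strongest = max_correlations(columns)
```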
The data in Table 2 show that pairs of evaluation indexes with large correlation
coefficients are strongly correlated. Therefore, an association rule experiment
is conducted on this data set, with the minimum confidence set to 0.500 and the
minimum support set to 0.300. Some of the resulting strong rules are listed in
Table 3.
According to Table 2, the correlation coefficients between indexes B
and M, E and P, and C and O are relatively high, indicating a strong mutual
influence. Combined with Table 3, indexes such as B, C, and P have
high confidence and are strongly correlated with other indexes. Because other
indexes can stand in for them, they need not be retained in teaching evaluation
and are removed, while A, D, E, F, G, H, I, J, K,
L, M, N, O, Q, and R are retained. The original 18 evaluation indexes are thus
reduced to a system of 15 independent evaluation indexes.
The accuracy and feasibility of each algorithm are judged by applying the four
commonly used machine learning classification methods in Section 2.4 to the
existing evaluation data. A training set of 440 records and a test set of 140
records are adopted, and 20 cross-validation tests and two parallel tests are
conducted to calculate the accuracy of each classification algorithm. Fig. 4 shows
the average classification accuracy of these algorithms.
[Fig. 4: average classification accuracy of SVM, DT, BP, and NB in the first and second tests; y-axis from 0.66 to 0.78]
Fig. 4 shows that, among the four common machine learning algorithms, the DT
algorithm has the lowest average classification accuracy, about 0.67, while the NB
algorithm has the highest, about 0.76. This indicates that the NB classification
algorithm achieves good accuracy in constructing the teaching evaluation model.
The average running duration of the four classification algorithms is then tested
on the same data set. Fig. 5 shows the average running duration of the algorithms
when processing the same amount of data.
[Fig. 5: average running duration in seconds of SVM, DT, BP, and NB in the first and second tests; y-axis from 14 to 35]
Fig. 5 shows that the DT and NB classification algorithms take less time on
average than the SVM and BP algorithms to process the same data set, and that the
NB algorithm takes less time than the DT algorithm. Taken together, the
experimental results show that, compared with the other classification algorithms,
the NB algorithm has higher classification accuracy on the data set of the
teaching evaluation system and the shortest running duration.
Therefore, the NB algorithm is chosen to construct the teaching evaluation model.
From the teaching evaluation database, 440 records are extracted as the
training set and 140 records as the test set. Ten cross-validation experiments are
carried out to test the classification accuracy of the NB algorithm and the WNB
algorithm. Fig. 6 shows the comparison of the accuracy of the NB algorithm and the
WNB algorithm.
[Fig. 6: classification accuracy of NB and WNB over experiments 1 to 10; y-axis from 0.70 to 0.90]
In the field of teaching evaluation, the most commonly used classification method
is the BP neural network [12]. This work adopts the weighted NB (WNB)
classification method to construct the teaching evaluation model, so the
classification accuracy of the WNB and BP algorithms is compared below.
The BP and WNB classification algorithms are trained with 440 records and tested
with 140 records. For the BP algorithm, the number of input layer nodes is set
to 15, hidden layer nodes to 6, and output layer nodes to 1; the activation
function is “tanh”, the learning rate is 0.01, and the number of training cycles
is 20,000. The BP algorithm and the WNB algorithm are then compared in 10
cross-validation experiments. Fig. 7 shows the comparison of the classification
accuracy of the BP algorithm and the WNB algorithm.
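The BP configuration stated above (15 input nodes, 6 hidden nodes, 1 output node, tanh activation, learning rate 0.01) can be sketched in plain Python. This is an illustrative sketch only: the weight initialisation range, the linear output node, the squared-error loss, and the single-sample update are assumptions that the text does not specify, and the 20,000 training cycles are not run here.

```python
import math
import random

N_IN, N_HIDDEN, N_OUT = 15, 6, 1   # layer sizes from the text
LEARNING_RATE = 0.01               # from the text
N_CYCLES = 20_000                  # from the text (not run in this sketch)


def init_params(seed=0):
    """Random small weights; the range (-0.5, 0.5) is an assumption."""
    rng = random.Random(seed)
    w1 = [[rng.uniform(-0.5, 0.5) for _ in range(N_IN)] for _ in range(N_HIDDEN)]
    b1 = [0.0] * N_HIDDEN
    w2 = [rng.uniform(-0.5, 0.5) for _ in range(N_HIDDEN)]
    b2 = 0.0
    return w1, b1, w2, b2


def forward(params, x):
    """Return hidden activations (tanh) and the single output value."""
    w1, b1, w2, b2 = params
    h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
         for row, b in zip(w1, b1)]
    y = sum(w * hj for w, hj in zip(w2, h)) + b2   # linear output (assumption)
    return h, y


def sgd_step(params, x, target, lr=LEARNING_RATE):
    """One backpropagation update for the loss 0.5 * (y - target)^2."""
    w1, b1, w2, b2 = params
    h, y = forward(params, x)
    err = y - target
    # Hidden-layer gradients use the output weights from before the update.
    grad_h = [err * w2[j] * (1.0 - h[j] ** 2) for j in range(N_HIDDEN)]
    new_w2 = [w2[j] - lr * err * h[j] for j in range(N_HIDDEN)]
    new_b2 = b2 - lr * err
    new_w1 = [[w1[j][i] - lr * grad_h[j] * x[i] for i in range(N_IN)]
              for j in range(N_HIDDEN)]
    new_b1 = [b1[j] - lr * grad_h[j] for j in range(N_HIDDEN)]
    return new_w1, new_b1, new_w2, new_b2


# Fit one made-up, rescaled record for a few hundred steps.
params = init_params()
x_demo, t_demo = [0.5] * N_IN, 0.8
for _ in range(500):
    params = sgd_step(params, x_demo, t_demo)
_, y_demo = forward(params, x_demo)
```

The 15 input nodes match the 15 evaluation indexes retained after the correlation analysis, and the single output node corresponds to the comprehensive evaluation score.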
[Fig. 7: classification accuracy of BP and WNB over experiments 1 to 10; y-axis from 0.65 to 0.90]
4 Conclusion
5 References
[1] Zou S. (2017). Designing and practice of a college English teaching platform based on artificial intelligence. Journal of Computational and Theoretical Nanoscience, 14(1): 104-108. https://doi.org/10.1166/jctn.2017.6133
[2] Eaton E., Koenig S., Schulz C., et al. (2018). Blue sky ideas in artificial intelligence education from the EAAI 2017 new and future AI educator program. AI Matters, 3(4): 23-31. https://doi.org/10.1145/3175502.3175509
[3] Kim K., Park Y. (2017). A Development and Application of the Teaching and Learning Model of Artificial Intelligence Education for Elementary Students. Journal of The Korean Association of Information Education, 21(1): 139-149.
[4] Zhao X., Guo H. T., Huang C. L., et al. (2017). Teaching evaluation system research based on structure entropy weight method. Journal of Discrete Mathematical Sciences and Cryptography, 20(1): 179-191. https://doi.org/10.1080/09720529.2016.1178915
[5] Wei Z., Baocun L. (2018). The undergraduate teaching evaluation system in China: Progress, problems and suggestions. Chinese Education & Society, 51(4): 248-259. https://doi.org/10.1080/10611932.2018.1493865
[6] Wang Z., Liu J. (2019). A teaching quality evaluation system of massive open online courses based on big data analysis. International Journal of Emerging Technologies in Learning (iJET), 14(14): 81-91. https://doi.org/10.3991/ijet.v14i14.10818
[7] Rizzuto M. (2017). Design recommendations for self-paced online faculty development courses. TechTrends, 61(1): 77-86. https://doi.org/10.1007/s11528-016-0130-8
[8] Ho J., Plewa C., Lu V. N. (2016). Examining strategic orientation complementarity using multiple regression analysis and fuzzy set QCA. Journal of Business Research, 69(6): 2199-2205. https://doi.org/10.1016/j.jbusres.2015.12.030
[9] Osadchiy T., Poliakov I., Olivier P., et al. (2019). Recommender system based on pairwise association rules. Expert Systems with Applications, 115: 535-542. https://doi.org/10.1016/j.eswa.2018.07.077
[10] Brewster L. R., Dale J. J., Guttridge T. L., et al. (2018). Development and application of a machine learning algorithm for classification of elasmobranch behaviour from accelerometry data. Marine Biology, 165(4): 62. https://doi.org/10.1007/s00227-018-3318-y
[11] Jiang L., Zhang L., Li C., et al. (2018). A correlation-based feature weighting filter for naive Bayes. IEEE Transactions on Knowledge and Data Engineering, 31(2): 201-213. https://doi.org/10.1109/tkde.2018.2836440
[12] Zhou Y., Zheng S., Zhang G. (2020). Machine-learning based study on the on-site renewable electrical performance of an optimal hybrid PCMs integrated renewable system with high-level parameters’ uncertainties. Renewable Energy, 151: 403-418. https://doi.org/10.1016/j.renene.2019.11.037
6 Author
Article submitted 2020-12-05. Resubmitted 2021-01-13. Final acceptance 2021-01-16. Final version
published as submitted by the authors.