Enhancing convolutional neural network based model for cheating at online examinations detection
Enhancing convolutional neural network based model for cheating at online examinations detection
Corresponding Author:
Sara Ouahabi
Laboratory of Modeling and Information Processing, Faculty of Sciences Ben M'sik, Hassan II University
Casablanca, Morocco
Email: [email protected]
1. INTRODUCTION
E-learning is playing a vital role in the existing educational setting, as it changes the entire
education system and becomes one of the greatest preferred topics for academics [1]. This shift has been
driven by the need for a safe and efficient alternative to in-person learning. In fact, e-learning allow
providing effective teaching methods, catering to diverse learning styles and offers accessibility to a vast
array of educational resources and interactive opportunities [2], [3], promoting active engagement and critical
thinking. However, e-learning faces also major challenges such as cheating. Exam fraud is widespread
globally [4]–[6], whatever the level of development. As a result, traditional cheating detection methods may
no longer be totally effective in preventing examination fraud. Online exams are an integral part of e-learning
solutions for authentic and fair assessment of student performance [7]. The design and execution of online
exams are the most challenging aspects of e-learning. In particular, online exams are usually conducted on
e-learning platforms without the physical presence of students and instructors in the same place. This creates
several deficiencies in terms of the integrity and security of online exams. For example, candidate
authenticity verification is extremely problematic in an online environment, particularly in the absence of
continuous monitoring. What's more, online exams are highly conducive to cheating, as thousands of
information resources are accessible to students without any controls. In this context, where preserving the
integrity of online exams is crucial, artificial intelligence (AI) offers advanced analysis and detection skills,
making it an invaluable asset for guaranteeing the reliability of online assessments. Cheating on online exams
can be detected and classified in multitude forms, from collusion between students to the use of mobile
devices, such as phones, to more subtle indicators such as eye movements [8], [9], mouth movements head
movements [10], and many other unauthorized behaviors.
In the face of the diverse methods of cheating, this study concentrates on analyzing candidates' head
movements during exams since it is considered as key element to detect several other behaviors and serve as
potentially significant indicators of cheating. The primary goal is to employ the convolutional neural network
(CNN) approach on the FEI dataset, specifically designed to detect head movements. In addition, a
comparison of the approach's performance against other models including VGG16, ResNet50, DenseNet21,
EfficientNetB0, and MobileNetV2 is performed to evaluate its efficacy in cheating detection is done. Our
approach outperforms the other models and has the potential to significantly contribute to the prevention of
cheating in online exams by offering a precise and dependable solution to this intricate issue.
The remainder of this article is structured as follow. Section 2 provides an in-depth review of related
work in the field of online cheating detection. Section 3 details the methodology developed, which is
centered on deep learning (DL) for analyzing candidates' head movements. In section 4, more details about
the experiment and the obtained results are presented, offering essential insights into the approach's
effectiveness. Finally, section 5 serves as the conclusion of the study.
2. RELATED WORKS
In the field of online assessment, which is evolving rapidly, researchers face several challenges and
explore numerous research possibilities. Many studies have been conducted to improve the integrity of online
exams and address the issue of cheating. Bawarith et al. [11] proposes a methodology based on continuous
authentication, eye tracking, and fingerprint scanning, which was applied to a private dataset. The results
demonstrated a sensitivity of 100%, a specificity of 95.56%, an accuracy of 95.74%, an overall accuracy of
97.78%, and an F-measure of 97.83. Furthermore, Jalali and Noorbehbahani [12] presents two distinct
methods for detecting cheating in online exams, both of them were developed and tested using a private
dataset. The first method is based on image processing using MATLAB, calculating the difference between
pixels in the images. When the threshold is set at 9, this method boasts high accuracy, particularly for
detecting empty seats, with an accuracy of 100%, however, it is sensitive to variations in the color of walls,
students' clothing and objects such as sheets of paper. The second method is based on clustering reference
images using the k-medoids algorithm. This method reduces complexity compared to the first method, but its
accuracy is slightly lower, with an average accuracy of 68%. It excels in detecting empty seats, also
achieving 100% accuracy. Geetha et al. [13] used the Eigenface method for extracting facial features from
facial vectors and the support vector machine (SVM) model to improve detection accuracy. This approach
was applied to a private dataset, and the obtained matching accuracy was approximately 61% with 50 real-
time images in the dataset. This accuracy can be improved by increasing the number of images in the dataset.
To addresses the issue of cheating in exams, whether in paper or electronic exam copies,
Rhein et al. [14] introduce FLEXauth, an application that utilizes AI techniques for author verification in
electronic programming exams using a private dataset. The idea is based on the principle that each student
develops an individual style for answering certain types of exercises, which can be extracted using AI tools
and then compared to reference material with verified authors. The concept is applied to Java programming
exams, but the goal is to extend support to other programming languages and types of assignments in the
future. The results show that the random forest (RF) method achieved the best performance with an accuracy
of up to 67.15% for classifying the top three options. Moreover, the GoogleCodeJam2017 dataset was used
for testing, where an accuracy of 93.80% was achieved.
In addition to machine learning (ML) and authentification based cheating detection techniques, others
papers used DL techniques and transfer learning. In fact, Ozdamli et al. [15] employed computer vision
algorithms from the OpenCV library for image acquisition, preprocessing, feature extraction,
detection/segmentation, high-level processing, and decision making. CNNs, particularly the mini Xception
model, were used to predict facial emotions. Various datasets were used for training and testing, including
images for facial verification, emotion recognition, gaze tracking, and head movements. The system achieved
an accuracy of approximately 99.38% on the LFW dataset. For in-class emotion tracking, the average accuracy
was approximately 66% on the FER dataset. Finally, for monitoring behaviors during online exams, the
system displayed an eye-tracking accuracy of approximately 96.95% on the gi4E dataset and a head movement
tracking accuracy of approximately 96.24% on the FEI dataset. Yulita et al. [16] use MobileNetV2
architecture for recognizing activities during online exams, which was applied to the OEP dataset. Optimal
hyperparameters were found, resulting in an F1 score of 84.52%. A web application was developed to assist
teachers in detecting academic fraud. This represents a significant advancement in educational technology.
In various studies [17]–[20], researchers have explored emotion recognition from facial expressions
from different perspectives. El Hammoumi et al. [17] focused on developing a facial expression recognition
system based on CNNs, with a clear goal of integration into an e-learning system. This methodology
involved image preprocessing, feature extraction, and image classification, resulting in an impressive test
accuracy of 97.53% on CK+ and 97.18% on Japanese female facial expression (JAFFE). Ozdemir et al. [18]
adopted a similar approach using DL and CNNs to create an emotional classification model, merging various
datasets. Their model based on the LeNet architecture achieved a training accuracy of 96.43% and a
validation accuracy of 91.81%. Furthermore, Pranav et al. [19] focused on developing a deep convolutional
neural network (DCNN) model to classify five different human facial emotions, achieving an accuracy of
78.04%. Finally, Zbaida et al. [20] examined the importance of emotion recognition in human-machine
interaction, especially in e-learning platforms, by evaluating various approaches such as VGG16, VGG19,
ResNet50V2, EfficientNetB0, and EfficientNetB7 to develop a specialized neural network for accurate
emotion identification.
The analysis of the current state of the art, reveals that CNNs have become important for their
excellent performance in various online assessment areas. Many different types of CNNs have been studied in
different situations and with different data, showing flexibility and potential. The related works studies have
utilized CNNs for various tasks such as recognizing emotions, analyzing facial expressions, and monitoring
behavior during online exams. However, when exploring research, we can conclude that great efforts are still
needed to enhance the performance of CNNs, especially in detecting cheating. The focus of this research is on
improving CNNs using the FEI dataset designed specifically for detecting head movements.
The choice to focus on the analysis of head movements was made due to its significance in
comprehending a student's behavior during an examination. Detecting head movements is a pivotal approach
for recognizing potential student cheating, and it constitutes a fundamental aspect of the research. An
overview of the various studies covered in the state of the art is provided. This summary aims to provide a
concise perspective on the objectives, datasets, methodologies used and results obtained by each article
included in this literature review. Furthermore, in the results column, the abbreviation "Acc" stands for
"Accuracy", representing the accuracy value obtained by each method adopted, as detailed in Table 1.
3. METHOD
The main objective of this paper is to implement a CNN model architecture for head movement
detection, with the aim of identifying cheating behavior in online exams using the FEI dataset. In this section,
Enhancing convolutional neural network based model for cheating at online … (Sara Ouahabi)
846 ISSN: 2252-8938
the methodology used is detailed, and the crucial steps followed to ensure the rigor and reliability of the
results are highlighted. The approach is structured in several fundamental steps. First, a data collection was
carrifed out. This initial phase was followed by a critical data cleaning step aimed at eliminating any
inconsistencies or anomalies. Next, the data was labeled into two distinct categories: cheating and
non-cheating, to facilitate the identification of behavioral patterns. Afterwards, a data pre-processing phase
was carried out. This included data augmentation to diversify and enrich the dataset, as well as images
resizing to ensure consistency of model inputs. To maintain a balance between classes, a balancing strategy
was implemented. Finally, DL models were implemented and their performance was evaluated in the context
of the study. Figure 1 presents an illustrative diagram of the methodology adopted.
3.7. Modeling
In this study, the cheating detection model is based on CNN architecture. However, in order to
evaluate the effectiveness of the approach, several other reference models were also included in the analysis.
These reference models are identified from the literature review and derived from different deep neural
network and serve as points of comparison to assess the capability of the applied CNN model. In the
following section, a closer look is taken at these reference models.
Enhancing convolutional neural network based model for cheating at online … (Sara Ouahabi)
848 ISSN: 2252-8938
3.7.5. MobileNetV2
Is a compact, swift, and precise DCNN tailored for classification and detection assignments.
Engineered to excel in terms of both speed and size efficiency. This network ensures noteworthy accuracy in
various computer vision tasks such as object classification and detection [26].
3.7.6. EfficientNet
Stands out as a CNN architecture acclaimed for its efficiency and outstanding performance across
diverse computer vision tasks, including the discernment of emotions based on facial expressions. Employing
a compound scaling approach, the EfficientNetB0 architecture uniformly adjusts the network's depth, width,
and resolution. This scaling strategy enables the model to strike a commendable balance between capacity
and computational efficiency [27].
to optimize its predictions. These findings collectively demonstrate the positive progression and effectiveness
of the CNN model throughout the training process. Furthermore, the reduction in loss corroborates the
model's capability to minimize errors, thereby enhancing its overall performance.
Table 3. Hyperparameters
Category Hyperparameters Value or configuration
Layer hyperparameters Dropout 50%
Kernel size (3x3)
Activation function of the final layer Sigmoid
Activation function of hidden layers ReLU
Compilation hyperparameters Optimization function Adam
Error function binary_crossentropy
Learning rate 0.001
Execution hyperparameters Batch size 32
Number of epochs 20
Figure 3. Evolution of training and validation accuracy Figure 4. Evolution of of training and validation loss
Additionally, Figure 5 presents the confusion matrix, allowing for a comprehensive assessment of
the model's classification capability. This matrix provides valuable insights into the model's strengths and
Enhancing convolutional neural network based model for cheating at online … (Sara Ouahabi)
850 ISSN: 2252-8938
weaknesses, highlighting areas for potential improvement. The confusion matrix obtained after training the
model to classify cheating and non-cheating cases reveals impressive performance, meaning that the model
correctly identified cheating and non-cheating cases in the majority of situations.
After examining the obtained metrics, it is clear that the CNN model has achieved outstanding
performance. With an accuracy of 97.28%, the model has demonstrated a strong ability to correctly classify
input data. Furthermore, a precision score of 98.00% attests to its capability in minimizing false positives.
These results reveal the efficiency and reliability of the model, positioning it as a promising solution for the
given task. They also confirm that the model is well-tailored to the data and provides a suitable balance
between identifying true positives and minimizing classification errors. These results are encouraging for the
future of the project and suggest that the model is ready for deployment in real-world scenarios.
On the other hand, the 97.28% accuracy rate achieved illustrates the remarkable effectiveness of the
method in classifying head movements in the FEI dataset, which is divided into two distinct classes: cheating
and non-cheating. Comparing these results with those of previous studies documented in the literature, also
conducted on the same FEI dataset, it is interesting to note that these earlier works achieved an accuracy rate
of 96.24% [15]. Therefore, these results significantly surpass previously reported performance, demonstrating
that the CNN-based approach represents a substantial advance in solving this particular problem.
5. CONCLUSION
In conclusion, this research explored the use of DL to detect cheating in online exams, focusing
specifically on the detection of head movements. The proposed CNN model demonstrated exceptional
performance, outperforming other evaluated models such as ResNet, VGG16, DenseNet21, MobileNetV2,
and EfficientNetB0. The proposed approach achieved 97.28% accuracy, with 98.00% precision and 96.60%
recall. These results are significantly better than previous work on the same FEI dataset, which achieved a
precision of 96.24%. The effectiveness of the CNN model in detecting cheating in online exams suggests that
it could be successfully deployed in real-world scenarios, helping to ensure the integrity of online
assessments. However, it is important to note that this model is not infallible and may not be as effective in
detecting more subtle forms of cheating, such as verbal cheating or discreet collaboration between students.
These forms of cheating may not involve visible head movements, making their detection more difficult for
our current model.
6. PERSPECTIVES
In our future work, we plan to explore other perspectives and study various forms of online cheating,
including those that are more complex and difficult to detect. For example, we plan to study eye movements,
facial expressions and voice recognition as potential means of detecting online cheating. By expanding our
cheating detection toolkit, we hope to strengthen our model and improve its ability to detect a wider range of
online cheating behaviors.
REFERENCES
[1] A. M. Maatuk, E. K. Elberkawi, S. Aljawarneh, H. Rashaideh, and H. Alharbi, “The covid-19 pandemic and e-learning:
challenges and opportunities from the perspective of students and instructors,” Journal of Computing in Higher Education, vol.
34, no. 1, pp. 21–38, 2022, doi: 10.1007/s12528-021-09274-2.
[2] S. Ouahabi, A. Eddaoui, E. H. Labriji, E. Benlahmar, and K. El Guemmat, “Implementation of a novel educational modeling
approach for cloud computing,” International Journal of Emerging Technologies in Learning, vol. 9, no. 6, pp. 49–53, 2014, doi:
10.3991/ijet.v9i6.4153.
[3] S. Ouahabi, K. El Guemmat, M. Azouazi, and S. El Filali, “A survey of distance learning in morocco during covid-19,” Indonesian
Journal of Electrical Engineering and Computer Science, vol. 22, no. 2, pp. 1087–1095, 2021, doi: 10.11591/ijeecs.v22.i2.pp1087-1095.
[4] A. A. Bamanikar, A. Patil, D. Singh, M. Kumbhar, and V. Badhe, “Online exam portal using machine learning and face
detection,” International Journal for Research in Applied Science and Engineering Technology, vol. 11, no. 5, pp. 4624–4630,
2023, doi: 10.22214/ijraset.2023.52692.
[5] A. M. Sendur, “Academic malpractice in tests and exams from an international perspective,” Przegląd Badań Edukacyjnych, vol.
36, no. 1, 2022, doi: 10.12775/pbe.2022.007.
[6] L. C. O. Tiong and H. J. Lee, “E-cheating prevention measures: detection of cheating at online examinations using deep learning
approach -- a case study,” arXiv-Computer Science, pp. 1–9, 2021.
[7] A. W. Muzaffar, M. Tahir, M. W. Anwar, Q. Chaudry, S. R. Mir, and Y. Rasheed, “A systematic review of online exams
solutions in e-learning: techniques, tools, and global adoption,” IEEE Access, vol. 9, pp. 32689–32712, 2021, doi:
10.1109/ACCESS.2021.3060192.
[8] N. Dilini, A. Senaratne, T. Yasarathna, N. Warnajith, and L. Seneviratne, “Cheating detection in browser-based online exams
through eye gaze tracking,” in 2021 6th International Conference on Information Technology Research (ICITR), IEEE, 2021, pp.
1–8, doi: 10.1109/ICITR54349.2021.9657277.
[9] A. Javed and Z. Aslam, “An intelligent alarm based visual eye tracking algorithm for cheating free examination system,”
International Journal of Intelligent Systems and Applications, vol. 5, no. 10, pp. 86–92, 2013, doi: 10.5815/ijisa.2013.10.11.
[10] M. M. Masud, K. Hayawi, S. S. Mathew, T. Michael, and M. El Barachi, “Smart online exam proctoring assist for cheating
detection,” in Advanced Data Mining and Applications, 2022, pp. 118–132, doi: 10.1007/978-3-030-95405-5_9.
[11] R. Bawarith, A. Basuhail, A. Fattouh, and S. Gamalel-Din, “E-exam cheating detection system,” International Journal of
Advanced Computer Science and Applications, vol. 8, no. 4, 2017, doi: 10.14569/ijacsa.2017.080425.
[12] K. Jalali and F. Noorbehbahani, “An automatic method for cheating detection in online exams by processing the students webcam
images,” in 3rd Conference on Electrical and Computer Engineering Technology (E-Tech 2017), Tehran, Iran, 2017, pp. 1–6.
[13] M. Geetha, R. S. Latha, S. K. Nivetha, S. Hariprasath, S. Gowtham, and C. S. Deepak, “Design of face detection and recognition
system to monitor students during online examinations using machine learning algorithms,” in 2021 International Conference on
Computer Communication and Informatics (ICCCI), IEEE, 2021, pp. 1–4, doi: 10.1109/ICCCI50826.2021.9402553.
[14] J. O.-Rhein, B. Küppers, and U. Schroeder, “An application to discover cheating in digital exams,” in Proceedings of the 18th
Koli Calling International Conference on Computing Education Research, 2018, pp. 1–5, doi: 10.1145/3279720.3279740.
[15] F. Ozdamli, A. Aljarrah, D. Karagozlu, and M. Ababneh, “Facial recognition system to detect student emotions and cheating in
distance learning,” Sustainability, vol. 14, no. 20, 2022, doi: 10.3390/su142013230.
[16] I. N. Yulita, F. A. Hariz, I. Suryana, and A. S. Prabuwono, “Educational innovation faced with covid-19: deep learning for online
exam cheating detection,” Education Sciences, vol. 13, no. 2, 2023, doi: 10.3390/educsci13020194.
[17] O. El Hammoumi, F. Benmarrakchi, N. Ouherrou, J. El Kafi, and A. El Hore, “Emotion recognition in e-learning systems,” in 2018
6th International Conference on Multimedia Computing and Systems (ICMCS), 2018, pp. 1–6, doi: 10.1109/ICMCS.2018.8525872.
[18] M. A. Ozdemir, B. Elagoz, A. Alaybeyoglu, R. Sadighzadeh, and A. Akan, “Real time emotion recognition from facial
expressions using cnn architecture,” in 2019 Medical Technologies Congress (TIPTEKNO), IEEE, 2019, pp. 1–4, doi:
10.1109/TIPTEKNO.2019.8895215.
[19] E. Pranav, S. Kamal, C. S. Chandran, and M. H. Supriya, “Facial emotion recognition using deep convolutional neural network,”
in 2020 6th International Conference on Advanced Computing and Communication Systems, ICACCS 2020, 2020, pp. 317–320,
doi: 10.1109/ICACCS48705.2020.9074302.
Enhancing convolutional neural network based model for cheating at online … (Sara Ouahabi)
852 ISSN: 2252-8938
[20] A. Zbaida, M. Kodad, Y. Mohamed, and N. Benmoussa, “Deep learning and emotion recognition for an e-learning platform,”
Research Gate, pp. 1–11, 2023.
[21] W. Lu, J. Li, J. Wang, and L. Qin, “A CNN-biLSTM-AM method for stock price prediction,” Neural Computing and
Applications, vol. 33, no. 10, pp. 4741–4753, 2021, doi: 10.1007/s00521-020-05532-z.
[22] M. Coskun, A. Ucar, O. Yildirim, and Y. Demir, “Face recognition based on convolutional neural network,” in 2017 International
Conference on Modern Electrical and Energy Systems (MEES), Nov. 2017, pp. 376–379, doi: 10.1109/MEES.2017.8248937.
[23] V. Atliha and D. Sesok, “Comparison of VGG and resnet used as encoders for image captioning,” in 2020 IEEE Open Conference
of Electrical, Electronic and Information Sciences (eStream), IEEE, 2020, pp. 1–4, doi: 10.1109/eStream50540.2020.9108880.
[24] A. K. Dubey and V. Jain, “Automatic facial recognition using VGG16 based transfer learning model,” Journal of Information and
Optimization Sciences, vol. 41, no. 7, pp. 1589–1596, 2020, doi: 10.1080/02522667.2020.1809126.
[25] N. Hasan, Y. Bao, A. Shawon, and Y. Huang, “DenseNet convolutional neural networks application for predicting covid-19 using
ct image,” SN Computer Science, vol. 2, no. 5, 2021, doi: 10.1007/s42979-021-00782-7.
[26] H. Abdo, K. M. Amin, and A. M. Hamad, “Fall detection based on retinanet and mobilenet convolutional neural networks,” in 2020 15th
International Conference on Computer Engineering and Systems (ICCES), 2020, pp. 1–7, doi: 10.1109/ICCES51560.2020.9334570.
[27] Ü. Atila, M. Uçar, K. Akyol, and E. Uçar, “Plant leaf disease classification using efficientnet deep learning model,” Ecological
Informatics, vol. 61, 2021, doi: 10.1016/j.ecoinf.2020.101182.
BIOGRAPHIES OF AUTHORS