Automatic Grading of Answer Sheets Using Machine L
Automatic Grading of Answer Sheets Using Machine L
Learning Techniques
[email protected], [email protected]
1 Introduction
there's a chance to totally change this procedure. This research aims to tackle this prob-
lem by automating the grading of hard copy question-answer sheets using advanced
NLP and ML algorithms.
The most objective of this research work is to reduce the time and expenses related
to manual grading, which takes two to three days for teachers to do on average. By
employing methods such as XGBoost, Ridge Regression, and Naive Bayes, the system
seeks to deliver precise evaluations of student responses, relieving educators of the la-
borious hand grading duty.
The implementation of NLP and ML algorithms is made possible by the use of op-
tical character recognition technology, which transforms hard copy question-answer
sheets into machine-readable format. The system attempts to understand patterns and
subtleties present in responses by utilizing historical datasets of student question-an-
swers, with a major focus on scoring short-answer questions.
Beyond just assigning grades, the system also seeks to give students insightful feed-
back that will expedite their learning and support their academic growth. Strict criteria
like Precision, Recall, and F-measure are utilized to assess the system's performance,
guaranteeing the precision and dependability of the automated grading procedure.
All things considered, this paper is a big step toward improving efficacy and effi-
ciency in the field of education, demonstrating how cutting-edge technologies may sim-
plify routine tasks and give teachers and students more control.
2 Literature Survey
Liu et al. [4] present an innovative method for extracting and understanding global
features from short text. Through a fusion of Convolutional Neural Networks (CNN)
and Latent Dirichlet Allocation (LDA), the authors devised a technique capable of cap-
turing both local and global features effectively. This approach holds substantial prom-
ise for various natural language processing tasks, especially in the realm of short text
comprehension.
Neslihan et al. [5] conducted research focusing on the UK's GSEC exam, offering
feedback to students. Initially, the authors employed standard data mining techniques
to analyze student answers alongside model responses. They subsequently devised sim-
ilarity measures based on common word occurrences, utilizing a clustering algorithm
for this purpose.
3 Proposed Methodology
This paper uses three different algorithms: XGBoost, Ridge Regression, and Naive
Bayes to evaluate question-answer sheets autonomously. Each algorithm is unique and
provides a fresh approach to solving problems associated with assessing students' re-
sponses.
Feature Extraction:
The next step involved extracting relevant features from the pre-processed text data.
Features such as word frequency, sentence structure, and semantic meaning were
identified and extracted using techniques from Natural Language Processing.
Additionally, domain-specific features relevant to the subject matter were considered
to enhance the accuracy of the grading process.
Automatic Grading of Answer Sheets using Machine Learning 279
XGBoost:
Ridge Regression:
Ridge Regression is a linear regression technique that addresses the issue of
multicollinearity and over-fitting by introducing a regularization term to the cost
function. By penalizing large parameter values, Ridge Regression produces more stable
and interpretable models, particularly in high-dimensional datasets. In our
implementation, Ridge Regression serves as a complementary approach to Naive Bayes
and XGBoost, providing a linear model that captures the underlying relationships
between features and scores in a concise manner.
scores using the classifier. For XGBoost and Ridge Regression, we directly predict
scores based on the learned models.
Evaluation: At last, we assess the performance of each show utilizing suitable
measurements such as accuracy, precision, recall, and F1 score. By comparing the
comes about, we point to distinguish the foremost compelling calculation for reviewing
question-answer sheets.
4 Results
The research introduces a solution for automating the assessment of question answer
sheets specifically focusing on copies to reduce the time and expenses linked with grad-
ing. By applying Natural Language Processing and Machine Learning methods like
XGBoost, Ridge Regression and Naive Bayes the system uses an OCR dataset gener-
ated from hard copy papers. It is tailored to grade short answer questions and offer
feedback to students by referencing a database of student responses. Performance eval-
uation metrics such, as Accuracy, Precision, Recall and F measure are used to gauge
the effectiveness of the system. The experimental results were analyzed to identify
trends, strengths, and limitations of the proposed system. Comparative investigation
was performed to compare the execution of each calculation and distinguish the fore-
most viable approach for computerizing the evaluating prepare. The implications of the
findings were discussed, highlighting the potential impact of automated grading sys-
tems on educational practices and student outcomes. The below are the mathematical
equations of Accuracy, Precision, Recall, F-measure i.e., Eqn - 1 to Eqn - 4 respectively.
(𝑇𝑟𝑢𝑒 𝑝𝑜𝑠𝑖𝑡𝑖𝑣𝑒𝑠 )
𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 = 𝑇𝑟𝑢𝑒 𝑃𝑜𝑠𝑖𝑡𝑖𝑣𝑒𝑠+𝐹𝑎𝑙𝑠𝑒 𝑝𝑜𝑠𝑖𝑡𝑖𝑣𝑒𝑠 -------- (2)
(𝑇𝑟𝑢𝑒 𝑝𝑜𝑠𝑖𝑡𝑖𝑣𝑒𝑠 )
𝑅𝑒𝑐𝑎𝑙𝑙 = -------- (3)
𝑇𝑟𝑢𝑒 𝑃𝑜𝑠𝑖𝑡𝑖𝑣𝑒𝑠 +𝐹𝑎𝑙𝑠𝑒 𝑁𝑒𝑔𝑎𝑡𝑖𝑣𝑒𝑠
(𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛∗𝑅𝑒𝑐𝑎𝑙𝑙)
𝐹 − 𝑚𝑒𝑎𝑠𝑢𝑟𝑒 = 2𝑋 -------- (4)
(𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛+𝑅𝑒𝑐𝑎𝑙𝑙 )
These metrics indicate the robustness and effectiveness of the XGBoost algorithm in
accurately assessing student responses. With high precision and recall rates, XGBoost
ensures both the accuracy and comprehensiveness of the grading process, highlighting
its potential as a reliable tool for automating educational evaluations.
282 K. Ramani et al.
In conclusion, this paper speaks to a critical walk in automating the grading process of
question-answer sheets through the application of advanced Natural Language Pro-
cessing and Machine Learning techniques. By employing algorithms such as Naive
Bayes classification and Ridge Regression and XGBooster, we have developed a sys-
tem capable of efficiently assessing student responses and providing valuable feedback,
thereby alleviating the burden on educators and enhancing the overall efficiency of the
educational evaluation process and experimental results reveal that XGBoost achieves
the highest efficiency. Through rigorous evaluation metrics and comprehensive imple-
mentation, this project underscores the transformative potential of technology in
streamlining traditional educational practices, paving the way for more effective learn-
ing outcomes and resource optimization in educational institutions.
In the future,Public datasets were available not too long ago, and research in the field
is open to new techniques, However, it's still not widely used in automatic grading.
Using technology like ICR to convert handwritten text into digital form can help. Then,
applying deep learning techniques for grading can improve accuracy. Additionally, us-
ing LLMs can offer students personalized feedback, making learning better. This ap-
proach could greatly enhance automatic grading systems in education
References
pp.726-743.
6. Neethu George, Sijimol PJ, Surekha Mariam Varghese, "GradingDescriptive Answer
Scripts Using Deep Learning", InternationalJournal of Innovative Technology and Ex-
ploring Engineering (IJITEE), Volume-8 Issue-5 March, 2019.
7. Alomran, M., & Chia, D. (2018). Automated Scoring System forMultiple Choice Test
with Quick Feedback. International Journal of Information and Education Technology,
8(8).
8. Mohler, M., Bunescu, R., Mihalcea, R.: Learning to grade short answer questions using
semantic similarity measures and dependency graph alignments. In: Proceedings of the
49th Annual Meeting of the Association for Computational Linguistics: Human Lan-
guage Technologies, pp. 752–762 , 2011
9. A. Vinaya Babu, "Detection of concept-drift for clustering time-changing categorical
data: An optimal method for large datasets." In Data Engineering and Communication
Technology: Proceedings of 3rd ICDECT-2K19, pp. 861-871. Springer Singapore,
2020.
10. Kuraparthi, Swaraja, Madhavi K. Reddy, C. N. Sujatha, Himabindu Valiveti, Chaitanya
Duggineni, Meenakshi Kollati, and Padmavathi Kora. "Brain Tumor Classification of
MRI Images Using Deep Convolutional Neural Network." Traitement du Signal 38, no.
4 (2021).
11. Magooda, A., Zahran, M.A., Rashwan, M., Raafat, H., Fayek, M.B.: Vector based tech-
niques for short answer grading. In: International Florida Artificial Intelligence Re-
search Society Conference Ahmed, pp. 238–243 (2016)
12. Passero, G., Haendchen Filho, A., Dazzi, R.: Avaliação do uso de métodos baseados em
se wordnet para corre¸c˜ao de quest˜oes discursivas. In: Brazilian Symposium onCom-
puters in Education (Simp´osio Brasileiro de Informática na Educa¸c˜ao-SBIE), vol. 27,
p. 1136 (2016)
13. Ziai, R., Ott, N., Meurers, D.: Short answer assessment: establishing links between re-
search strands. In: Proceedings of the Seventh Workshop on Building Educational Ap-
plications Using NLP, pp. 190–200, 2012.
14. Valenti, S., Neri, F., Cucchiarelli, A.: An overview of current research on automated
essay grading. J. Inf. Technol. Educ. 2, 319–330, 2003
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-
NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/),
which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any
medium or format, as long as you give appropriate credit to the original author(s) and the
source, provide a link to the Creative Commons license and indicate if changes were made.
The images or other third party material in this chapter are included in the chapter's
Creative Commons license, unless indicated otherwise in a credit line to the material. If material
is not included in the chapter's Creative Commons license and your intended use is not
permitted by statutory regulation or exceeds the permitted use, you will need to obtain
permission directly from the copyright holder.