Data Entry Through OCR - A Case Study of Digitizing Examination Marks from Paper Marksheets
Background
● To compare different methods for drawing a bounding box around each digit
in a multi-digit number.
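As an illustration of one candidate method, the sketch below splits a binarized number image into per-digit boxes using a vertical projection profile: any run of columns that contains ink is treated as one digit. This is only one simple baseline, not necessarily a method used in this study. The function name `digit_boxes_by_projection` and the `min_width` parameter are hypothetical, and the input is assumed to be a 2-D array in which nonzero pixels are ink.

```python
import numpy as np

def digit_boxes_by_projection(binary, min_width=1):
    """Return one (x0, y0, x1, y1) box per run of ink-bearing columns.

    binary: 2-D array, nonzero = ink (assumed already binarized).
    Boxes use half-open pixel coordinates, so x1 and y1 are exclusive.
    """
    ink = np.asarray(binary) != 0
    cols = ink.any(axis=0)          # True where a column holds any ink
    width = cols.size
    boxes = []
    x = 0
    while x < width:
        if not cols[x]:
            x += 1
            continue
        start = x
        while x < width and cols[x]:
            x += 1                  # extend the run of inked columns
        if x - start >= min_width:  # drop runs narrower than min_width
            rows = np.flatnonzero(ink[:, start:x].any(axis=1))
            boxes.append((start, int(rows[0]), x, int(rows[-1]) + 1))
    return boxes

# Tiny synthetic example: two separated blobs standing in for two digits.
image = np.zeros((5, 9), dtype=np.uint8)
image[1:4, 1:3] = 1   # first "digit"
image[0:5, 5:8] = 1   # second "digit"
boxes = digit_boxes_by_projection(image)
```

A projection split of this kind cannot separate digits that touch, because touching digits share inked columns and come back as a single box. That limitation is one reason several bounding-box methods are worth comparing.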
Expected Outcome
Methodology
● Getting image slices that contain the numbers we want to extract
● Validating existing OCR systems on these cropped images with numbers.
● Preliminary results are positive: the model shows significant improvement
over generic OCR solutions when tested on examination papers with variable
writing styles and potentially touching digits.
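The first two methodology steps (cutting out the image slices that contain the numbers, then validating an OCR system on those crops) can be sketched as follows. This is a minimal sketch under stated assumptions, not the pipeline used in the study: the cell coordinates are assumed to be known in advance, `recognize` stands for any OCR function that maps an image to a string, and the names `crop_cells` and `evaluate_ocr` are hypothetical.

```python
import numpy as np

def crop_cells(page, cells):
    """Cut rectangular slices out of a page image.

    cells: mapping of cell name -> (x0, y0, x1, y1) in half-open pixel
    coordinates. The layout is assumed to be known (hypothetical).
    """
    return {name: page[y0:y1, x0:x1]
            for name, (x0, y0, x1, y1) in cells.items()}

def evaluate_ocr(recognize, crops, truth):
    """Score an OCR callable by exact string match on each crop.

    recognize: any function image -> str (placeholder for a real OCR system).
    Returns (accuracy, errors), where errors lists (name, expected, predicted).
    """
    if not crops:
        raise ValueError("no crops to evaluate")
    correct = 0
    errors = []
    for name, image in crops.items():
        predicted = recognize(image).strip()
        if predicted == truth[name]:
            correct += 1
        else:
            errors.append((name, truth[name], predicted))
    return correct / len(crops), errors

# Usage with a stand-in page and a fake recognizer (no real OCR involved).
page = np.arange(100).reshape(10, 10)
cells = {"q1": (0, 0, 3, 2), "q2": (5, 5, 10, 10)}
crops = crop_cells(page, cells)

def fake_recognize(image):
    # Placeholder: returns a fixed answer based on the crop shape.
    return "7" if image.shape == (2, 3) else "18"

accuracy, errors = evaluate_ocr(fake_recognize, crops, {"q1": "7", "q2": "13"})
```

Exact match on the whole number is a strict criterion: a single misread digit counts the entire mark as wrong, which fits a data-entry setting where a partially correct mark is still an incorrect record.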