Handwritten Text Recognition
Handwritten Text Recognition
Recognition
Presented by: Akankshi Mody | Mohini Agarwal | Logan Liu | Shane Kok
Introduction
Creative Signatures
Handwriting recognition is a challenging problem due to huge variation in individual writing styles.
657 writers
contributed samples of their handwriting
13'353 text lines
isolated and labeled
115'320 words
isolated and labeled
Approaches
Simple Approach-1
KNN
[1] [‘o’]
[‘d’] [3]
Challenges
[‘M’]
[‘E’]
57% accuracy
Challenges-1
10 == ‘A’
Challenges-2
https://github.com/githubharald/SimpleHTR
Convolutional Neural Network
https://towardsdatascience.com/build-a-handwritten-text-recognition-system-using-tensorflow-2326a3487cd5
Recurrent Neural Network
https://towardsdatascience.com/build-a-handwritten-text-recognition-system-using-tensorflow-2326a3487cd5
Connectionist Temporal Classification Loss
https://towardsdatascience.com/intuitively-understanding-connectionist-temporal-classification-3797e43a86c
How CTC Decoding Works
Repetition Characters in
CTC Blank (-) Label
the input image
Proxy character used to indicate
the separation between characters Handles input images with
in input image characters occurring multiple
times such as ‘too’
Handling Duplicates
in the Output
18
Connection Temporal Classification Loss
https://towardsdatascience.com/intuitively-understanding-connectionist-temporal-classification-3797e43a86c
Bringing it Altogether
Shi - An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition
Running the Models
RMSProp Optimizer
Gradient Descent Optimizer
Conclusion
Lessons Learned
PRE-PROCESSING
Preprocessing Images
MODELS
■ Grey-scaled
■ Resized Dimensions
■ Fixing Kernel Striding Window Size RNN+CNN Models
■ CNN performs well on images
■ Transposed
■ RNN helps to learn better for sequential models
Future Steps
Data Augmentation
Match real world variances in handwriting by applying
random transformations Increase Input Size
Improve the model to identify and read sentences and
full passages
Text Correction
If a word is not learned in training, refer to corpus of
words
Future Business Applications
Teach Kids to
Recognize and Interpret Dyslexic
Improve Handwriting Handwriting
write another line here
entrepreneurial activities differ substantially