Research Paper On OCR
Research Paper On OCR
2, December 2016
Page | 1 ISSN-2409-6520
Journal of Information & Communication Technology-JICT Vol. 10 Issue. 2, December 2016
simpler problem because characters are usually of uniform in the image. In these situations, advance character
dimensions, and the positions of characters on the page can segmentation techniques are used.
be predicted [3]. Feature extraction: The segmented characters are then
Handwriting character recognition is a very tough job processes to extract different features. Based on these
due to different writing style of user as well as different features, the characters are recognized. Different types of
pen movements by the user for the same character. These features that can be used extracted from images are
systems can be divided into two sub-categories i.e. on-line moments etc. The extracted features should be efficiently
and off-line systems. The former is performed in real-time computable, minimize intra-class variations and maximizes
while the users are writing the character. They are less inter-class variations.
complex as they can capture the temporal or time based Character classification: This step maps the features of
information i.e. speed, velocity, number of strokes made, segmented image to different categories or classes. There
direction of writing of strokes etc. In addition, there no are different types of character classification techniques.
need for thinning techniques as the trace of the pen is few Structural classification techniques are based on features
pixels wide. The offline recognition systems operate on extracted from the structure of image and uses different
static data i.e. the input is a bitmap. Hence, it is very decision rules to classify characters. Statistical pattern
difficult to perform recognition. classification methods are based on probabilistic models
There have been many online systems available because and other statistical methods to classify the characters.
they are easier to develop, have good accuracy and can be Post processing: After classification, the results are not
incorporated for inputs in tablets and PDAs [4]. 100% correct, especially for complex languages. Post
processing techniques can be performed to improve the
Character accuracy of OCR systems. These techniques utilizes
Recognition natural language processing, geometric and linguistic
context to correct errors in OCR results. For example, post
Handwritte processor can employ a spell checker and dictionary,
Printed probabilistic models like Markov chains and n-grams to
n
improve the accuracy. The time and space complexity of a
post processor should not be very high and the application
Online of a post-processor should not engender new errors.
OCR
a. Image Acquisition
Offline
Image acquisition is the initial step of OCR that
Figure.1: Types of character recognition system
comprises obtaining a digital image and converting it into
suitable form that can be easily processed by computer.
IV.APPLICATIONS OF OCR
This can involve quantization as well as compression of
image [8]. A special case of quantization is binarization
OCR enables a large number of useful applications.
that involves only two levels of image. In most of the
During the early days, OCR has been used for mail sorting,
cases, the binary image suffices to characterize the image.
bank cheque reading and signature verification [5].
The compression itself can be lossy or loss-less. An
Besides, OCR can be used by organizations for automated
overview of various image compression techniques have
form processing in places where a huge number of data is
been provided in [9].
available in printed form. Other uses of OCR include
processing utility bills, passport validation, pen computing
b. Pre-processing
and automated number plate recognition etc [6]. Another
useful application of OCR is helping blind and visually
impaired people to read text [7]. Next to image acquisition is pre-processing that aims to
enhance the quality of image. One of the pre-processing
IV. MAJOR PHASES OF OCR techniques is thresholding that aims to binaries the image
based on some threshold value [9]. The threshold value can
The process of OCR is a composite activity comprises be set at local or global level.
different phases. These phases are as follows: Different types of filters such as averaging, min
Image acquisition: To capture the image from an external and max filters can be applied. Alternatively, different
source like scanner or a camera etc. morphological operations such as erosion, dilation,
Preprocessing: Once the image has been acquired, opening and closing can be performed.
different preprocessing steps can be performed to improve
the quality of image. Among the different preprocessing
techniques are noise removal, thresholding and extraction
image base line etc.
Character segmentation: In this step, the characters in the
image are separated such that they can be passed to
recognition engine. Among the simplest techniques are
connected component analysis and projection profiles can
be used. However in complex situations, where the
characters are overlapping /broken or some noise is present
Page | 2 ISSN-2409-6520
Journal of Information & Communication Technology-JICT Vol. 10 Issue. 2, December 2016
Table.1: Major Phases of OCR system e. Classification
Page | 3 ISSN-2409-6520
Journal of Information & Communication Technology-JICT Vol. 10 Issue. 2, December 2016
REFERENCES
[1] Satti, D.A., 2013, Offline Urdu Nastaliq OCR for
Printed Text using Analytical Approach. MS thesis
report Quaid-i-Azam University: Islamabad,
Pakistan. p. 141.
[2] Mahmoud, S.A., & Al-Badr, B., 1995, Survey and
bibliography of Arabic optical text recognition.
Signal processing, 41(1), 49-77.
[3] Bhansali, M., & Kumar, P, 2013, An Alternative
Method for Facilitating Cheque Clearance Using
Smart Phones Application. International Journal of
Application or Innovation in Engineering &
Management (IJAIEM), 2(1), 211-217.
[4] Qadri, M.T., & Asif, M, 2009, Automatic Number
Plate Recognition System for Vehicle Identification
Using Optical Character Recognition presented at
International Conference on Education Technology
and Computer, Singapore, 2009. Singapore: IEEE.
[5] Shen, H., & Coughlan, J.M, 2012, Towards A Real
Time System for Finding and Reading Signs for
Visually Impaired Users. Computers Helping People
with Special Needs. Linz, Austria: Springer
International Publishing.
[6] Bhavani, S., & Thanushkodi, K, 2010, A Survey On
Coding Algorithms In Medical Image Compression.
International Journal on Computer Science and
Engineering, 2(5), 1429-1434.
[7] Bhammar, M.B., & Mehta, K.A, 2012, Survey of
various image compression techniques. International
Journal on Darshan Institute of Engineering
Research & Emerging Technologies, 1(1), 85-90.
[8] Lazaro, J., Martín, J.L, Arias, J., Astarloa, A., &
Cuadrado, C, 2010, Neuro semantic thresholding
using OCR software for high precision OCR
applications. Image and Vision Computing, 28(4),
571-578.
[9] Lund, W.B., Kennard, D.J., & Ringger, E.K. (2013).
Combining Multiple Thresholding Binarization
Values to Improve OCR Output presented in
Document Recognition and Retrieval XX
Conference 2013, California, USA, 2013. USA:
SPIE
[10] Shaikh, N.A., & Shaikh, Z.A, 2005, A generalized
thinning algorithm for cursive and non-cursive
language scripts presented in 9th International
Multitopic Conference IEEE INMIC, Pakistan, 2005.
Pakistan: IEEE
[11] Shaikh, N.A., Shaikh, Z.A., & Ali, G, 2008,
Segmentation of Arabic text into characters for
recognition presented in International Multi Topic
Conference, IMTIC, Jamshoro, Pakistan, 2008.
Pakistan: Springer.
[12] Ciresan, D.C., Meier, U., Gambardella, L.M., &
Schmidhuber, J, 2011, Convolutional neural network
committees for handwritten character classification
presented in International Conference on Document
Analysis and Recognition, Beijing, China, 2011.
USA: IEEE.
Page | 4 ISSN-2409-6520