OCR Technical Documentation and Software Manual
OCR Technical Documentation and Software Manual
Prepared for
ICT Division
Government of the People’s Republic of Bangladesh
Table of Contents
1. INTRODUCTION.......................................................................................................................................................
1.1 PURPOSE.................................................................................................................................................................
1.2 SYSTEM REQUIREMENTS.........................................................................................................................................
1.3 INSTALLATION........................................................................................................................................................
2. USER INTERFACE AND FUNCTIONS(USER MANUAL).................................................................................
2.1 GUI.........................................................................................................................................................................
2.2 GUI FUNCTIONS.....................................................................................................................................................
3.Technical Documentation
1. Introduction
“Bangla Optical Character Recognition” (Bangla OCR) is a desktop software developed for
automatic conversion of images of printed Bangla text into machine-encoded Unicode Bangla
text.
1.1 Purpose
The purpose of this documentation is to familiarize the users with the interface, functionality and
usage guide of Bangla OCR software.
1.3 Installation
Step 1: vcredist_x86.exe install this Visual C++ redistributable (). If a newer
version of Microsoft Visual C++ 2010 Redistributable has already been installed
detected on the machine you don’t need to install the Microsoft Visual C++ 2010
Redistributable, just close the installer window.
Step 2: Run BongoOCRSetUp_1.0.exe file.
Step 3: Windows will ask your permission to run the installer. Click ‘Yes’
After the installation is finished a new window will pop-up and Click ‘Finish’
Now OCR is ready to use.
Note: If the application fails to run after installation please make sure that the Visual Studio
Redistributable 2010 is present at the system
1. Menu Bar :
OCR User Interface has a standard Menu Bar which contains all the functions of the
OCR. Below is the list of Menus, sub-menus and their corresponding usage.
File Menu
Open – open an image file
Do OCR – after opening an image this command does the ocr of the image, use it
when image contains only text and only one column
Do Batch OCR- when you open more than one file on image thumb preview and
want to do ocr all page one by one, just click open and all file you want to do ocr then all
image file will be open on image thumb preview. After that when you click Do Batch
OCR button it will start convert one by one image into editable text format and the output
will be shown on text editor. If you save the output by click save as button, all output text
will be save in one file.
Do OCR with Document Analysis – use this command when document image
contains more than one column and image. It’s try to give output as like
as given input image page format.
3. Image Menu
Analyze Page – use this command to detect image and text blocks. Image block is
masked as red color and text block as green color. If you want to
detect tables then check ‘detect tables’ from options bar
You can double click on any text block to ocr that part only.
Deskew Image – use this command if your document image text is skewed at some angle,
this operation is necessary for good accuracy of OCR. Below is the example of skewed
document.
4. Format Menu – this menu contains standard text formatting commands. Command
names are self-descriptive.
Tool Bar – OCR Interface contains a toolbar, these commands are actually frequently
used commands clustered together in a visual form, all of these commands exists in the
menu bar.
Hover over any tool icon and a tool-tip will appear which describes it’s corresponding
function.
On top-right side there’s a zoom bar, you can use this to zoom in or zoom out text in the
text editor.
Tip – After opening an Image you can hold down your left mouse button and select a text
region in the document, after selecting you can press ‘Do OCR’ to OCR that region only.
Tips: For table detection select detect table checkbox and then click “Do OCR with
Document Analysis”.
Conditions.
Each table has to contain at least 2500 pixels.
Each row of a table has to be at least 1/100th of the total document height
Each column has to be at least 1/50 of total document width.
The source image has to be straight if not you can use the deskew tool.
Tips for batch processing: Click File->open. You can select multiple files at once. Then
click File -> Do batch OCR. For batch processing the document has to be plain text with
single column.
Note: Currently due to some system restriction this application does not support Unicode
characters (Bengali or any other language) in the file names. So please make sure the
names of the files upon which OCR is being performed is named with English ASCII
characters.