Sujata Oak
Information Technology
Ramrao Adik Institute of Technology
Nerul, India
[email protected]
Abstract—The new generation of mobile phones has great hardware capability and fast processing, powerful enough to support applications that help users connect and interact with the world from their own comfort zone. This system is an OCR reading system which uses the camera application present in a smart phone combined with OCR (Optical Character Recognition). OCR is a mechanism which converts images of typed, handwritten, or printed text into machine-encoded text. The system helps the user take a picture of, or scan, a document using the phone's camera; the image is scanned, and the application reads the text written in the English language and converts the output into speech. The speech output is generated using a Text-To-Speech module. The purpose of delivering the output in the form of voice/speech is to serve the information present on the document to the visually impaired.

Keywords— Mobile Phone, Optical Character Recognition, Text to Speech, Visually Impaired.

I. INTRODUCTION

In today's world, although most information is represented electronically, the information printed on paper still has its own relevance. However, this information is not available to visually impaired people. To help them obtain this vital information, we propose a system in which the document is read electronically and converted into speech. This system helps the visually impaired become aware of the information presented on a document through speech. Our application recognizes the text captured by a mobile phone camera, displays the recognition result back on the screen of the mobile phone, and produces speech for the recognized text.

Mobile devices are becoming very popular, especially smartphones, and researchers are developing various applications for their users. The proposed system will be available to the end user in the form of an Android application that can be downloaded from application stores; developing an Android-based system therefore helps serve a greater number of people. The system helps the visually impaired read an essential document within a few clicks. The application requires an inbuilt camera in the phone. Since a visually impaired user may not be able to click a perfect picture of the document, pre-processing of the captured image will be done; all of these features are provided by the Optical Character Recognition (OCR) module. Once the image has been pre-processed, OCR reads and recognizes the text visible in the image. The final outcome of the system is audible speech which reads out the text recognized by the OCR module; to deliver this voice output, a Text-To-Speech (TTS) module is used. The application reads anything present in the English language, as well as any numerical value. Every special character, such as a comma, exclamation mark, full stop, or question mark, is taken into consideration, and the right pause is inserted whenever one of these is encountered. Thus, using OCR, the system recognizes and converts the image into text format, and Text-To-Speech further converts the recognized text into voice, which helps the visually impaired read the document and keeps them updated.

II. LITERATURE REVIEW

For text reading there are many different techniques available, such as label reading, the voice stick, the Brick Pi reader, and pen aids, but these methods perform text-to-speech by creating datasets. To address this problem, the finger reading technique was developed; it eliminates the previously created and stored datasets and provides a response by reading any text given as input in a captured image.

1. In [1], the authors suggest that MATLAB and LabVIEW are used to preprocess an image, which is then given as input; the image is segmented, and the OCR module then starts its process of text recognition. This system not only converted images to speech but also took input in text format from the user and converted it into speech, and can thereby be used by people with speech loss. Initially the system generates a bitmap in ARGB-8888 format and then passes it to the Tesseract engine for recognition.

2. In [2], the authors proposed a system that allows the user to view a virtual object in the real world using marker-based augmented reality. The user has to provide any one side of the image, which can be the left, right, top, or the
bottom view of the image. Later, to get the fully appearing virtual object, the image is placed on a 3D cube. A live video feed is given as input, from which the system generates binary images, i.e. digital images that have only two possible values for each pixel. These binary images are processed using an image processing technique to detect the AR marker. Once the AR marker is detected, its location is provided. The system then calculates the relative pose of the camera in real time. The term pose means the six-degrees-of-freedom (DOF) position, i.e. the 3D location and 3D orientation of an object. Finally, it displays the augmented image on the display screen.
4. In [5], the authors cover the major image processing techniques, such as image acquisition and processing. Here text is identified from an image using tools such as LabVIEW and the NI Vision toolkit.

5. In [6], the authors present an algorithm for extracting information from a business card using an Android mobile phone. The research used the open-source OCR software Tesseract, which helped overcome environmental conditions such as reflection, blurring, variable lighting, and scaling that appeared while clicking a picture. In this system a grayscale or color image is provided as input. The program takes .tiff and .bmp files, then converts the grayscale images into binary images. It calculates the optimal threshold separating the background and foreground pixel classes such that the combined within-class variance is minimal (equivalently, the variance between the two classes is maximal). It then finds locations having a pixel count less than a specific threshold. After each line of text is found, Tesseract examines the lines of text to find the approximate text.

6. In [7], the authors propose an algorithm which helps detect, localize, and extract text that is horizontally aligned in an image, irrespective of the background present. It uses projection profile analysis and geometric properties to segregate the text region and detect the text. After this processing, the result is sent to the OCR engine for character recognition.

7. In [8], the authors explain a system which extracts text characters from natural scenes using smart mobile devices. The algorithm designs a discriminative character descriptor and models each specific character class by designing stroke character maps that capture character structure.
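The binarization step summarized in [6] is essentially Otsu's method: choose the grayscale threshold that minimizes the combined within-class variance of the background and foreground pixel classes. A minimal pure-Python sketch of that idea follows; the pixel values below are an illustrative toy image, not data from the cited paper.

```python
def otsu_threshold(pixels):
    """Return the threshold (0-255) that minimizes the combined
    within-class variance of the two pixel classes (Otsu's method)."""
    # Build a 256-bin histogram of the grayscale values.
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)

    best_t, best_var = 0, float("inf")
    for t in range(1, 256):
        w0 = sum(hist[:t])   # background (below threshold) pixel count
        w1 = total - w0      # foreground pixel count
        if w0 == 0 or w1 == 0:
            continue
        mean0 = sum(v * hist[v] for v in range(t)) / w0
        mean1 = sum(v * hist[v] for v in range(t, 256)) / w1
        var0 = sum(hist[v] * (v - mean0) ** 2 for v in range(t)) / w0
        var1 = sum(hist[v] * (v - mean1) ** 2 for v in range(t, 256)) / w1
        within = (w0 * var0 + w1 * var1) / total  # weighted within-class variance
        if within < best_var:
            best_var, best_t = within, t

    return best_t

# Toy image: dark ink pixels around 40, light paper pixels around 200.
pixels = [38, 40, 42, 45, 41] * 20 + [198, 200, 202, 205, 199] * 80
t = otsu_threshold(pixels)
binary = [1 if p >= t else 0 for p in pixels]  # 1 = paper, 0 = ink
```

The chosen threshold falls between the two intensity clusters, so the resulting binary image cleanly separates ink from paper, which is what the subsequent line-finding step in [6] relies on.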
III. PROPOSED METHODOLOGY

System Design:

Fig.1: Proposed Block Diagram.

This application will be available to users in the form of an Android application that can be downloaded from the application stores. The modules involved in this reading system are as follows:

•Camera:
The inbuilt camera application of any smart phone is used to capture an image of the document to be read. The image will contain textual regions from which the text will be recognized.

•OCR module:
The OCR (Optical Character Recognition) module is used to preprocess the image and to detect and identify the words in the English language. After the image is captured at a standard resolution, the textual regions within the image are localized. The system is only concerned with the textual regions; complex backgrounds are not within the scope of the project. The captured image is processed, and the first step in the processing is localization of the textual regions in the image. The identified characters are converted into machine-encoded text and displayed on the screen with the help of this module. The core functionality of the system is to recognize the text from the image. The localized textual regions are used as input for this feature, and the text (alphanumeric characters: A-Z, a-z and 0-9) in these regions is recognized.

•Text To Speech module:
This module is used to give speech output of the text converted from the image. It thus helps the user hear whatever is printed or written on the scanned document.

•User Interface module:
The User Interface module provides features such as a login page and a registration page, and also displays the result.
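As described in the Introduction, the speech output should pause at special characters such as commas, full stops, exclamation marks, and question marks. A minimal sketch of how the Text To Speech module could split recognized text into utterances paired with pause lengths is shown below; the pause durations and the chunk format are illustrative assumptions, not details taken from the paper's implementation (a real engine such as Android's TextToSpeech handles pauses through its own API).

```python
import re

# Illustrative pause lengths, in seconds, for each punctuation mark.
PAUSES = {",": 0.3, ".": 0.6, "!": 0.6, "?": 0.6}

def split_for_speech(text):
    """Split recognized text at punctuation, pairing each chunk with
    the pause (in seconds) to insert after speaking it."""
    chunks = []
    # Each match is a run of non-punctuation text plus at most one
    # trailing punctuation mark.
    for part, punct in re.findall(r"([^,.!?]+)([,.!?]?)", text):
        part = part.strip()
        if part:
            chunks.append((part, PAUSES.get(punct, 0.0)))
    return chunks

print(split_for_speech("Hello, world. Ready?"))
# → [('Hello', 0.3), ('world', 0.6), ('Ready', 0.6)]
```

Each chunk would then be spoken in turn, sleeping for the indicated pause between utterances, which gives the "right pause" behavior the system aims for.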
Proposed Approach:

RESULT

Text Recognition:
The AI-based reading system for the blind converts the image file into text, displays it on the screen, and gives a voice output. For conversion and recognition of text it uses OCR, which relies on an in-built library for recognition. It successfully recognizes the English alphabet, A-Z and a-z.

Hand written Text Recognition:

Number Recognition:
The system not only recognizes alphabetic words and letters but also identifies numerical values from 0-9, which are likewise delivered in the form of speech.

CONCLUSION

The AI-based reading system using OCR is an artificial-intelligence reading system developed using a smart phone's camera combined with OCR (Optical Character Recognition). The application detects text using the camera, scans it, converts it into digital text recognized by the system, displays the recognized text, and gives speech output. To understand the dynamics of the project, a basic idea of what AI and OCR are is required. This report explains the entire working of the system, along with the minimum requirements needed to implement it. Hence, a visually impaired person can easily use this AI-based reading system as a simple, friendly application anywhere around the globe.
REFERENCES