0% found this document useful (0 votes)

15 views

Multilingual text recognition system

Multilingual text recognition system by using OCR

Uploaded by

Alfiya Sayyed

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views

Multilingual text recognition system

Multilingual text recognition system by using OCR

Uploaded by

Alfiya Sayyed

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

MULTILINGUAL

TEXT
RECOGNITION
SYSTEM BY
USING OCR
AN ADVANCED OCR SOLUTION FOR MULTIPLE
LANGUAGES
• Presented by: Ms. Alfiya Sayyed & Ms. Rutuja
Shivale
• Guided by: Prof. Amruta Navale
• Institution: Dr. D. Y. Patil Arts, Commerce and
Science College, Pimpri
• Academic Year: 2023-24
Table of Contents
01 Abstract 08 Methodology

02
04 Introduction 09 Software Requirement Analysis

0301 Objective 1006 System workflow diagram

0402 Literature Survey 07

11 Result

0503 Comparison of Existing & Proposed System 12 Conclusion

06 What is OCR? 13 Future Enhancement

07 System Architecture 14 Reference

2
ABSTRACT
• Overview:
The MLTR system aims to provide accurate
text extraction from images containing multiple
languages. This system leverages OCR techniques to
enhance accessibility and usability across different
linguistic contexts.
• Key Features:
Multilingual support: Handles text in
multiple languages.
Robust image preprocessing: Ensures
high- quality text extraction.
User- friendly interface: Simplifies the
user experience.

3
INTRODUCTION
Background:
 In today’s digital age, there is a growing need to convert
printed documents into electronic formats.
 Electronic documents enhance data security,
accessibility and ease of sharing and editing
Importance of OCR:
 OCR technology allows for the conversion of different
types of documents such as PDFs, or images into
editable and searchable data
 Traditional OCR system are often limited to single
languages, reducing their effectiveness in a multilingual
world.
Multilingual Text Recognition System:
 Our project aim to address the limitations of traditional
OCR systems by supporting text recognition in multiple
languages.
 This system enhances usability and accessibility for
users who work with documents in various languages.

4
OBJECTIVES
Developing multilingual text recognition system
Enhance Data Security
Improves Accessibility and Usability
Implement Efficient Preprocessing Techniques
Accurate Language Detection
User- Friendly Interface

5
LITERATURE SURVEY
 Background: Traditional OCR systems includes Tesseract,
Adobe Acrobat OCR, and ABBYY FineReader but this system
struggles with multilingual texts, varied fonts poor quality
images and handwritten text.

 Current Trends: Modern OCR systems use machine learning

and deep learning techniques to improve accuracy and handle
a variety of languages and fonts.

 Limitations of Existing System: Issues with multilingual

support, accuracy in noisy images, and computational
requirements.

6
COMPARISON OF EXISTING SYSTEM & PROPOSED SYSTEM

EXISTING SYSTEM

Proposed system supports multiple

Existing systems only have the functionalities such as extracting text in
capability to convert and recognize multiple languages. It also adds benefit
only the documents of English or a by providing heterogeneous characters
specific language only. That is, the recognition.
older OCR system is uni-lingual or bi-
lingual.

PROPOSED SYSTEM

7
WHAT IS OCR?
 Definition: OCR (Optical Character Recognition) technology converts different types of documents, such
as scanned papers documents, PDF files, or images captured by a digital camera, into editable and
searchable data.

 Types of OCR:
1.OCR: General character recognition
2.Optical Word Recognition: Recognizes entire words rather than individual characters.
3.Intelligent Character Recognition: Recognizes hand-printed characters.
4.Intelligent Word Recognition: Recognizes hand- printed words.

8
SYSTEM ARCHITECTURE
 Overview:
Our OCR system architecture is designed to support
multilingual text recognition, ensuring high accuracy
and efficient processing of various languages scripts.
 Components:
1. User interface: Allows users to upload images
and view extracted text and detected language.
2. Image Preprocessing Modules: Converts
uploaded images to grayscale and enhances them
for better OCR performance.
3. OCR Engine: Utilizes Tessereact OCR to extract
text from preprocessed images.
4. Languages Detection Module: Detects the
languages of the extracted text using langdetect
library
5. Output Module: Displays the extracted text and
detected languages to the user in user friendly
format.
9
METHODOLOGY
The methodology of our OCR system involves several key
steps: image preprocessing, text extraction, language detection, and text
post-processing.
Each step plays a critical role in ensuring the accuracy and efficiency of
text recognition from images.

 Image Preprocessing
• Image preprocessing is the first step, where the uploaded image
is prepared for text extraction.
• This includes converting the image to grayscale to reduce
complexity and using noise reduction techniques to enhance the
quality of the text in the image.
• These steps help in improving the accuracy of the OCR process.
 Text Extraction
• The core of the OCR system is the text extraction phase, where
Image preprocessing the preprocessed image is processed to identify and extract text.
• We use Tesseract OCR, a powerful open-source library,
configured to recognize multiple languages.
• Tesseract analyzes the text areas in the image and converts them
into machine-encoded text.
9
 Language Detection
• After extracting the text, the system detects the language using the Langdetect library.
• This step is crucial for processing documents that contain text in multiple languages.
• The detected language information helps in subsequent text processing and formatting steps.
 Text Post-Processing
• In the post-processing phase, the extracted text is cleaned and formatted.
• This involves removing unwanted characters and symbols, and structuring the text into readable format.
• The final output is a clean, editable document that retains the original text's integrity.

11
SOFTWARE REQUIREMENTS ANALYSIS

 Problem Statement:
Extracting accurate text from images containing multiple languages.

 System Requirements:
• Hardware: High- resolution camera or scanner, high performance
processor.
• Software: Python, OpenCV, Tesseract, langdetect.

 Libraries used:
• OpenCV: For image processing.
• Pytesseract: Python wrapper for tesseract OCR.
• Langdetect: Language detection library

9
System Workflow Diagram

9
RESULTS
The Multilingual Text Recognition System project successfully implemented a web application
that allows users to upload images containing text and extracts the text from them. The system supports
multiple languages, enabling users to extract text in various languages accurately.

 Key Performance Metrics:

1. Accuracy of OCR:
• OCR accuracy rate of 95% across various languages
2. Language detection accuracy:
• Language detection accuracy is 90%
3. Processing time:
• Average processing time of 3 seconds per image

9
GUI INTERFACE DESIGN

9
Examples of Successful Text Extraction and Language Detection
Example 1:

Input Image:
Extracted text:

Detected Language: Hindi

9
Example 2:

9
CONCLUSION
 Through our project, we successfully addressed the challenges associated with processing multilingual
documents, providing users with a reliable and efficient tool for extracting text from images in various
languages.

 The system ability to accurately detect and extract text in languages ranging from English and Hindi to
Arabic and beyond has far- reaching implication for data accessibility.

 By overcoming language barriers, our system empowers users to efficiently process multilingual
documents, improving productivity and reducing the risk of errors associated with manual transcription.

18
FUTURE ENHANCEMENT

 Enhanced language support: Continuously expand the language support to encompass additional
languages and dialects, catering to the diverse linguistic needs of users worldwide.

 Mobile Application Development: Develop a mobile application version of the system to enable users
to perform text extraction tasks on the go, leveraging the capabilities of smartphones and tablets.

 Integration with AI: Explore the integration of artificial intelligence (AI) techniques, such as machine
learning and deep learning, to improve language detection accuracy and optimize text extraction
algorithms.

 Improved Accuracy: Invest in research and development efforts to further enhance the accuracy of text
recognition, especially for complex scripts and low-quality images.

9
REFERENCE

For the complete reference and understanding of OCR refer jeff heaton's chapter 7 from
www.jeffheaton.com
 The IEEE standard reference paper from which we collected our problem statement is authorized by
Dana Petcu, Silviu Panica, Viorel Negru and Andrei Eckstein of Computer Science Department who
are from West University of Timisoara, Romania.

 The reference paper is also authorized by Doina Banciu from National Institute for Research and
Development in Informatics, Romania.

 A. Revathi and N. A. Modi, "Comparative Analysis of Text Extraction from Color Images using
Tesseract and OpenCV," 2021 8th International Conference on Computing for Sustainable Global
Development (INDIACom), 2021, pp. 931-936, DOI: 10.1109/INDIACom51348.2021.00167.

Nissan Titan Power Control System
100% (3)
Nissan Titan Power Control System
98 pages
ISO27k ISMS A5.9 Information Asset Checklist 2022
No ratings yet
ISO27k ISMS A5.9 Information Asset Checklist 2022
3 pages
ANN Miniproject Report
No ratings yet
ANN Miniproject Report
11 pages
analysis phase.pptx_20250108_101518_0000
No ratings yet
analysis phase.pptx_20250108_101518_0000
19 pages
OCR Assignment
No ratings yet
OCR Assignment
5 pages
Presentation 4
No ratings yet
Presentation 4
17 pages
Image_to_Audio_Content_Reader_Project
No ratings yet
Image_to_Audio_Content_Reader_Project
8 pages
Digital Library Software
No ratings yet
Digital Library Software
21 pages
Bilingual_OCR_Report
No ratings yet
Bilingual_OCR_Report
10 pages
Survey Paper Image Reader For Blind Pers
No ratings yet
Survey Paper Image Reader For Blind Pers
3 pages
Optical_character_recognition_system_using_artific
No ratings yet
Optical_character_recognition_system_using_artific
7 pages
Speech_Image_Translator_Presentation (1)
No ratings yet
Speech_Image_Translator_Presentation (1)
16 pages
FFGB
No ratings yet
FFGB
12 pages
PRE Synopsis
No ratings yet
PRE Synopsis
3 pages
OCR PPT GRP 12
No ratings yet
OCR PPT GRP 12
10 pages
1822-b.e-cse-batchno-4 (1)
No ratings yet
1822-b.e-cse-batchno-4 (1)
64 pages
Praveen2014Towards
No ratings yet
Praveen2014Towards
6 pages
10 1109@icirca48905 2020 9183326
No ratings yet
10 1109@icirca48905 2020 9183326
6 pages
Arabic Optical Character Recognition Software A Review
No ratings yet
Arabic Optical Character Recognition Software A Review
15 pages
Ocr Gtts
No ratings yet
Ocr Gtts
49 pages
Raj Synopsis12
No ratings yet
Raj Synopsis12
5 pages
Speech Recognition
No ratings yet
Speech Recognition
9 pages
5.0 Best Practices For OCR
No ratings yet
5.0 Best Practices For OCR
4 pages
APP2
No ratings yet
APP2
16 pages
Voice Recognition
No ratings yet
Voice Recognition
16 pages
Text Extraction From Digital Images With Text To Speech Conversion and Language Translation
No ratings yet
Text Extraction From Digital Images With Text To Speech Conversion and Language Translation
3 pages
Image To Speech Conversion in Multi Languages
No ratings yet
Image To Speech Conversion in Multi Languages
31 pages
THANK_YOU
No ratings yet
THANK_YOU
23 pages
Speech to Text
No ratings yet
Speech to Text
17 pages
SYSTEM. This Process Is Also Called DOCUMENT IMAGE ANALYSIS (DIA)
No ratings yet
SYSTEM. This Process Is Also Called DOCUMENT IMAGE ANALYSIS (DIA)
88 pages
Presentation ML
No ratings yet
Presentation ML
9 pages
Ocr Nanonets Tesseract
No ratings yet
Ocr Nanonets Tesseract
39 pages
Document Scanner App Synopsiss
No ratings yet
Document Scanner App Synopsiss
6 pages
Synopsis[1]
No ratings yet
Synopsis[1]
17 pages
Design Phase
No ratings yet
Design Phase
10 pages
OCR For Hindi and Sanskrit
No ratings yet
OCR For Hindi and Sanskrit
1 page
An Analysis of The Performance of Named Entity Recognition Over Ocred Documents
No ratings yet
An Analysis of The Performance of Named Entity Recognition Over Ocred Documents
2 pages
Sign Language RECOGNITION USING DEEP LEARNING
No ratings yet
Sign Language RECOGNITION USING DEEP LEARNING
28 pages
Optical Character Recognition Based Speech Synthesis: Project Report
0% (1)
Optical Character Recognition Based Speech Synthesis: Project Report
17 pages
MATHS Report
No ratings yet
MATHS Report
15 pages
Chat With Multiple PDF and Sign Letter Detection
No ratings yet
Chat With Multiple PDF and Sign Letter Detection
10 pages
Optical Character Recognition (OCR) System
No ratings yet
Optical Character Recognition (OCR) System
5 pages
Unlocking Text from Images: The Future of OCR Technology
No ratings yet
Unlocking Text from Images: The Future of OCR Technology
4 pages
Last Edited
No ratings yet
Last Edited
8 pages
9589-First Manuscript-57755-2-10-20220620 - X
No ratings yet
9589-First Manuscript-57755-2-10-20220620 - X
12 pages
Text Detector (OCR)
No ratings yet
Text Detector (OCR)
12 pages
final slide
No ratings yet
final slide
18 pages
Sign Language Converter Using Image Recognition
No ratings yet
Sign Language Converter Using Image Recognition
9 pages
ML Report
No ratings yet
ML Report
5 pages
Text To Speech Conversion Using Raspberry - PI
No ratings yet
Text To Speech Conversion Using Raspberry - PI
3 pages
Script Identification of Telugu, English and Hindi Document Image
No ratings yet
Script Identification of Telugu, English and Hindi Document Image
11 pages
11 SDD Final Chapter Summaries
No ratings yet
11 SDD Final Chapter Summaries
8 pages
Got - Towards Ocr-2
No ratings yet
Got - Towards Ocr-2
19 pages
Bofinal
No ratings yet
Bofinal
10 pages
Switching To Ocr Gcse 9 1 Computer Science From Edexcel
No ratings yet
Switching To Ocr Gcse 9 1 Computer Science From Edexcel
15 pages
Voice_Translation_App_Detailed_Presentation
No ratings yet
Voice_Translation_App_Detailed_Presentation
17 pages
doc
No ratings yet
doc
5 pages
Optical Character Recognition Project Report
No ratings yet
Optical Character Recognition Project Report
71 pages
Arkwright CGP-AI Presentation
No ratings yet
Arkwright CGP-AI Presentation
8 pages
Review of Text To Speech Conversion Methods: Poonam.S.Shetake, S.A.Patil, P. M Jadhav
No ratings yet
Review of Text To Speech Conversion Methods: Poonam.S.Shetake, S.A.Patil, P. M Jadhav
7 pages
Script Identification of Telugu, English and Hindi Document Image
No ratings yet
Script Identification of Telugu, English and Hindi Document Image
11 pages
Python The Complete Reference: Comprehensive Guide to Mastering Python Programming from Fundamentals to Advanced Techniques
From Everand
Python The Complete Reference: Comprehensive Guide to Mastering Python Programming from Fundamentals to Advanced Techniques
Aarav Joshi
No ratings yet
BCNP 5504 H
No ratings yet
BCNP 5504 H
2 pages
Emi Papaer
No ratings yet
Emi Papaer
12 pages
01-BABS Brochure 2023 Updated
No ratings yet
01-BABS Brochure 2023 Updated
4 pages
Level 3 Repair: 8-2. Block Diagram
No ratings yet
Level 3 Repair: 8-2. Block Diagram
40 pages
ISO-1207-2011 Slot Cheese Head Screws
No ratings yet
ISO-1207-2011 Slot Cheese Head Screws
14 pages
CDI 4 (Semi-Final Examination) : Last Name First Name M.I
No ratings yet
CDI 4 (Semi-Final Examination) : Last Name First Name M.I
4 pages
Munters High Temp Psych Chart
No ratings yet
Munters High Temp Psych Chart
2 pages
Delay Alignment Modulation Enabling Equalization-Free Single-Carrier Communication PDF
No ratings yet
Delay Alignment Modulation Enabling Equalization-Free Single-Carrier Communication PDF
5 pages
Combipac Boiler Operation Management
No ratings yet
Combipac Boiler Operation Management
15 pages
MK 90adptr010 23 PDF
No ratings yet
MK 90adptr010 23 PDF
154 pages
Manual Abit AB-PX5
No ratings yet
Manual Abit AB-PX5
90 pages
02 Cadfil - Price - Option - Matrix - V13
No ratings yet
02 Cadfil - Price - Option - Matrix - V13
1 page
Telecom Networks Lab I Laboratory Report Writing Format: 1 Title Page
No ratings yet
Telecom Networks Lab I Laboratory Report Writing Format: 1 Title Page
2 pages
Atulkumar Bca 5thsem A35404819038 NTCC Amity University Jharkhand
No ratings yet
Atulkumar Bca 5thsem A35404819038 NTCC Amity University Jharkhand
76 pages
Iso 18289 2014
No ratings yet
Iso 18289 2014
9 pages
Installing Usb Drivers
No ratings yet
Installing Usb Drivers
4 pages
Validus Presentation en 04-07
No ratings yet
Validus Presentation en 04-07
22 pages
Terms and Conditions of Use Halodoc
No ratings yet
Terms and Conditions of Use Halodoc
9 pages
ASS Project Report of Design of Mechatronics
No ratings yet
ASS Project Report of Design of Mechatronics
40 pages
Panasonic TH-32A300DX
100% (1)
Panasonic TH-32A300DX
3 pages
Bike Parking Project (Coding)
No ratings yet
Bike Parking Project (Coding)
6 pages
T2K4 Character Sheet English Color (Fillable)
100% (4)
T2K4 Character Sheet English Color (Fillable)
1 page
Map of The University of Twente: Witbreuksweg
No ratings yet
Map of The University of Twente: Witbreuksweg
2 pages
Areas of Discussion Current Action Recommendation: FOR: Kendi Lovellen E. Villarin, RSW
No ratings yet
Areas of Discussion Current Action Recommendation: FOR: Kendi Lovellen E. Villarin, RSW
2 pages
8_Material Inspection Request (MIR)
No ratings yet
8_Material Inspection Request (MIR)
12 pages
OsirisM SensorSpecification v1.1.0
No ratings yet
OsirisM SensorSpecification v1.1.0
46 pages
WindowsCE Hacking
No ratings yet
WindowsCE Hacking
24 pages
GSP5 Installation License Agreement
No ratings yet
GSP5 Installation License Agreement
7 pages

Multilingual text recognition system

Uploaded by

Multilingual text recognition system

Uploaded by

MULTILINGUAL

0301 Objective 1006 System workflow diagram

0402 Literature Survey 07

0503 Comparison of Existing & Proposed System 12 Conclusion

06 What is OCR? 13 Future Enhancement

07 System Architecture 14 Reference

 Current Trends: Modern OCR systems use machine learning

 Limitations of Existing System: Issues with multilingual

Proposed system supports multiple

 Key Performance Metrics:

Detected Language: Hindi

You might also like