Automatic Speech Recognition For Indian Languages: Comprehension and Analysis

The document discusses the technical challenges of automatic speech recognition for Indian languages including their diversity in phonetics and pronunciation. It covers the key components of ASR systems, research and development efforts, and issues faced including a lack of standardized speech and text corpora. The conclusion emphasizes that ASR models need adjustment to understand cultural and linguistic differences to improve accuracy and access to technology.

Uploaded by

sayalibarhate2717

Automatic Speech Recognition for Indian Languages: Comprehension and Analysis

M.Sc. Computer Science

Presented By :
Rugveda Kushare
Shreya Ghadage
Introduction

◆ Technical intricacies of ASR

◆ Specific challenges in transliterating Indian languages
◆ Ongoing efforts to overcome these challenges
◆ Future directions for advancing ASR technology
◆ Better serving the linguistic diversity of India
OVERVIEW OF AUTOMATIC SPEECH RECOGNITION

● Definition and Functionality of ASR

● Importance of ASR for Indian Languages
● Key Components of ASR Systems
DIVERSITY IN INDIAN LANGUAGES
• Variety of Languages and Dialects
• Unique Phonetics and Pronunciation
• Impact on ASR Accuracy
Word Error Rate
Word Error Rate (WER) is a metric used to evaluate the accuracy of an Automatic Speech
Recognition (ASR) system's transcription output.

Mathematically, the formula for WER is,

WER = ((S + D + I) / N) × 100

where:
• S is the number of substitutions (words in the reference that are replaced by incorrect words in the recognized text),
• D is the number of deletions (words missing in the recognized text compared to the reference text),
• I is the number of insertions (extra words present in the recognized text compared to the reference text), and
• N is the total number of words in the reference text.
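The WER definition above can be sketched in code. This is a minimal illustration, not the presenters' implementation: it assumes words are separated by whitespace and that S + D + I is taken as the minimum word-level edit distance between reference and recognized text, which is the usual convention. The function name `word_error_rate` is hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER in percent: ((S + D + I) / N) * 100.

    Assumption: S + D + I is the minimum number of word-level edits
    (Levenshtein distance) that turn the reference into the hypothesis.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing
                curr[j - 1] + 1,     # insertion: extra hypothesis word
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            )
        prev = curr

    return prev[-1] / len(ref) * 100


if __name__ == "__main__":
    # One substitution (sat -> sit) and one deletion (the), N = 6.
    print(round(word_error_rate("the cat sat on the mat", "the cat sit on mat"), 2))
```

For the example pair, S = 1, D = 1, I = 0 and N = 6, so WER = (2 / 6) × 100, roughly 33.33%. Note that WER can exceed 100% when the recognized text contains many insertions.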
RESEARCH AND DEVELOPMENT IN ASR
• Current State of ASR for Indian Languages
• Key Research Initiatives and Projects
• Challenges in Data Collection and Annotation
DATA PROCESSING

1. Audio Processing
2. Preprocessing and Text Cleaning
3. Reduction and Transliteration of Text
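The slides do not spell out what the text-cleaning step involves. As a hedged sketch, one plausible version for Indic transcripts is Unicode normalization, punctuation removal, and whitespace collapsing. The function name `clean_transcript` and the specific steps chosen are assumptions for illustration only.

```python
import re
import unicodedata


def clean_transcript(text: str) -> str:
    """Normalize a transcript line before scoring or training (illustrative)."""
    # NFC normalization: the same Indic syllable can be encoded with
    # different code point sequences, which would otherwise count as
    # different words.
    text = unicodedata.normalize("NFC", text)
    # Replace every character in a Unicode punctuation category (P*)
    # with a space; this covers ASCII punctuation and the danda (U+0964).
    text = "".join(
        " " if unicodedata.category(ch).startswith("P") else ch for ch in text
    )
    # Collapse runs of whitespace, trim, and lowercase any Latin-script text.
    return re.sub(r"\s+", " ", text).strip().lower()


if __name__ == "__main__":
    print(clean_transcript("  Hello,   World! "))
```

Applying the same cleaning to both reference and recognized text keeps formatting differences from being counted as recognition errors.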
PROCESS OF TRANSLITERATION
• Transliteration of Indic Words
• Transliteration of words from the English dictionary
• Reduction
ISSUES FACED IN ASR FOR INDIAN LANGUAGES
• Linguistic, speaker, and channel variability pose challenges for ASR
engines.
• ASR systems must adapt to unpredictable speech signals for accurate
transcription.
• The lack of standardized speech and text corpora presents challenges for Indian
languages.
• Gathering a speech corpus requires careful data collection and extraction.
• Variability in speech affects the effectiveness of ASR systems across
applications.
CONCLUSION
The research shows that making ASR work well for languages such as Hindi
and Tamil is difficult because they differ in pronunciation, dialect, and
cultural context. ASR models need to be adapted to capture these
differences, which should improve how accurately they transcribe speech.
ASR matters because it helps more people access technology and supports
the continued use of these languages. Continued research is needed to make
ASR work well for all languages.
THANK YOU
