ASR Brief History: Trends Followed at Different Point of Time

Speech recognition research began in the 1970s but significant work did not start until the 1980s. There are now multiple approaches to speech recognition, but more research is still needed to make systems more robust, effective, and reliable. Early systems used either a segment-based approach that extracted features from specific temporal landmarks or a frame-based approach using statistical analysis of short time spectral features. While segment-based approaches were initially used more, frame-based approaches using hidden Markov models have emerged as the dominant method and produced highly accurate large vocabulary speech recognizers. Current research continues to refine these methods and apply them to tasks like conversational speech recognition systems.

Uploaded by

Ram Nepali


ASR Brief History

Work on speech recognition systems began before the mid-1970s. Progress was slow at first, however, and significant research was done only in the 1980s. There are now a number of different approaches to speech recognition, but much research is still needed to make these systems more robust, effective, and reliable. Research is under way around the globe, with notable contributions from institutions such as CMU (Carnegie Mellon University), MIT, and Stanford. Dedicated researchers continue to develop the field.

Trends followed at different points in time:

Two models have been applied very frequently in automatic speech recognition (ASR): the frame-based and the segment-based approaches. Originally the segment-based approach was followed. In the past, there have been many segment-based ASR approaches which extracted feature vectors at specific temporal landmarks (Cole et al., 1983), including work during the early ARPA SUR project in the 1970s (Weinstein et al., 1975). Most of these efforts were hampered, however, by attempting to explicitly incorporate speech knowledge by heuristic means through intense knowledge engineering, and by the lack of a stochastic framework to deal with the present state of ignorance in our understanding of the human communication process and its inherent variabilities.

In this approach the speech signal is segmented into small units such as phonemes or syllables; that is, boundaries (landmarks) are detected and a chain of smaller units is formed. Different methods can then be used to recognize it. A probabilistic graph network may be used to determine the correct words. Acoustic or probabilistic landmarks form the basis for a phonetic segment network, or graph. Feature vectors are extracted both over hypothesized phonetic segments and at their boundaries for phonetic analysis. The resulting observation space (the set of all feature vectors) takes the form of an acoustic-phonetic network, or graph, whereby different paths through the graph are associated with different sets of feature vectors. This graph-based observation space is quite different from prevailing approaches, which employ a temporal sequence of observations that typically contain short-time spectral information (e.g., MFCCs). The segmental and feature-extraction characteristics of this recognizer provide a framework within which knowledge of the speech signal can be incorporated.

More recent research suggests that a frame-based approach can give better results. In this approach the recognizer does not need to decode each phonetic unit explicitly. It uses statistical analysis: a grammar selects the possible next words, and a dictionary supplies their pronunciations. A scorer uses a model such as a hidden Markov model (HMM) to calculate the acoustic probability of a particular unit of speech. The decoder selects the next set of likely states, scores the incoming features against these states, selects the state with the highest probability, and prunes low-scoring states. Over the past two decades, first-order HMMs have emerged as the dominant stochastic model for ASR (Rabiner, 1989). With a well-formed mathematical foundation, and efficient, automated training procedures which can process the ever-increasing amounts of speech data, impressive HMM-based recognizers have been created for a wide variety of increasingly difficult ASR tasks.

Several projects are being developed using these approaches. The SUMMIT speech recognizer developed at MIT has always used a segment-based framework for its acoustic-phonetic representation of the speech signal (Zue et al., 1989; Glass et al., 1996).
Similarly, another project being developed at Carnegie Mellon University is Sphinx. It is based on the frame-based approach. Its knowledge base consists of a dictionary, a language model, and an acoustic model. It uses HMMs to evaluate the acoustic probabilities for each portion of the speech signal and reports the most likely state as the result. It is being implemented entirely on the Java platform.

Over the past year and a half, a telephone-based weather information system called JUPITER [14] has been under development at MIT. It is available via a toll-free number and lets users query a relational database of current weather conditions using natural, conversational speech. Using information obtained from several different internet sites, JUPITER can provide weather forecasts for approximately 500 cities around the world for three to five days in advance, and can answer questions about a wide range of weather properties such as temperature, wind speed, humidity, precipitation, and sunrise, as well as weather advisory information.

The history of ASR is quite short, and most of the work is still research-based. Much work remains to be done to reduce the errors involved in the process.
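The frame-based decoding loop described above (extend the likely states, score each incoming feature against them, keep the best, prune the low scorers) can be sketched as a Viterbi search with beam pruning. This is only an illustrative sketch, not the code of Sphinx or any other recognizer: the two states, the quantized feature labels "lo" and "hi", all probabilities, and the names `viterbi_beam`, `prune`, and `to_log` are assumptions made up for the example. A real system would score continuous feature vectors (e.g., MFCCs) with an acoustic model rather than look up discrete labels.

```python
import math


def to_log(table):
    """Convert a dict of probabilities to log probabilities."""
    return {key: math.log(p) for key, p in table.items()}


# Hypothetical toy model: two HMM states and two quantized feature labels.
STATES = ["s1", "s2"]
LOG_START = to_log({"s1": 0.6, "s2": 0.4})
LOG_TRANS = {
    "s1": to_log({"s1": 0.7, "s2": 0.3}),
    "s2": to_log({"s1": 0.4, "s2": 0.6}),
}
LOG_EMIT = {
    "s1": to_log({"lo": 0.9, "hi": 0.1}),
    "s2": to_log({"lo": 0.2, "hi": 0.8}),
}


def prune(hyps, beam):
    """Drop hypotheses scoring more than `beam` (log units) below the best."""
    best = max(score for score, _ in hyps.values())
    return {
        state: (score, path)
        for state, (score, path) in hyps.items()
        if score >= best - beam
    }


def viterbi_beam(observations, states, log_start, log_trans, log_emit, beam=10.0):
    """Return (best state path, its log probability) for the observations."""
    # active maps each surviving state to (log score, best path ending there).
    active = {}
    for state in states:
        score = log_start[state] + log_emit[state][observations[0]]
        active[state] = (score, [state])
    active = prune(active, beam)

    for obs in observations[1:]:
        candidates = {}
        # Extend every surviving hypothesis to each possible next state.
        for prev, (prev_score, path) in active.items():
            for nxt, log_p in log_trans[prev].items():
                # Score the incoming feature against the candidate state.
                score = prev_score + log_p + log_emit[nxt][obs]
                # Keep only the best-scoring path into each state.
                if nxt not in candidates or score > candidates[nxt][0]:
                    candidates[nxt] = (score, path + [nxt])
        active = prune(candidates, beam)

    best_score, best_path = max(active.values(), key=lambda item: item[0])
    return best_path, best_score


if __name__ == "__main__":
    frames = ["lo", "lo", "hi", "hi"]
    path, log_prob = viterbi_beam(frames, STATES, LOG_START, LOG_TRANS, LOG_EMIT)
    print(path)  # ['s1', 's1', 's2', 's2']
```

Scores are kept as log probabilities so that long utterances do not underflow. A narrow `beam` makes the search faster but can discard the path that would have won later, which is the usual trade-off with pruning.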
