Speech Recognition: - Shetul Chothani

The document discusses speech recognition technology. It begins with an introduction to speech recognition, which involves a computer interpreting audible input and converting it to usable data. The document then covers the main stages of speech recognition: preprocessing, recognition, and communication. It discusses applications like dictation, voice commands, information services, education, and security systems. It concludes by discussing how speech recognition is becoming more common and will continue to grow as more applications are discovered.

Uploaded by

Sunil Pillai

SPEECH RECOGNITION
- SHETUL CHOTHANI
- PARAS KANERIYA

1. INTRODUCTION
• Speech is the most natural and common way of communication between people.
• It would seem only natural that computer development would eventually progress to the point where people would want to extend the human-computer interface to include speech.
1.1 SPEECH RECOGNITION (SR)
• Speech or voice recognition is the ability of a machine or program to recognize and carry out voice commands or to take dictation.
• It is a process by which a computer interprets audible input from a user and converts it into a usable form.
• Thus, SR is a technology that enables a computer to understand the spoken word.
2. THEORY OF SPEECH RECOGNITION
• Speech recognition involves three main stages:
• Preprocessing: taking the speech input and converting it into something the computer can use.
• Recognition: identifying what has been said.
• Communication: acting upon the translated input.
2.1 PREPROCESSING
• The first stage of the speech recognition process is preprocessing.
• In this stage, the speech data is captured and passed to the system as input.
• The amount of data passed on must be kept to a minimum.
• The inherent challenge is to remove the "bad" data, such as noise, without losing or distorting the critical data needed to identify what has been said.
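
The slides do not name a specific noise-removal method. As one simple illustration of discarding "bad" data while keeping the useful part, the sketch below drops low-energy frames (likely silence or faint background noise) from a list of audio samples. The function name `drop_quiet_frames`, the frame size, and the threshold are assumptions chosen for this example, not values from the presentation.

```python
from typing import List


def drop_quiet_frames(samples: List[float],
                      frame_size: int = 4,
                      threshold: float = 0.01) -> List[float]:
    """Keep only the frames whose mean squared amplitude reaches the threshold.

    Hypothetical helper for illustration: the signal is cut into
    non-overlapping frames, and any trailing partial frame is ignored.
    """
    kept: List[float] = []
    for start in range(0, len(samples) - frame_size + 1, frame_size):
        frame = samples[start:start + frame_size]
        energy = sum(x * x for x in frame) / frame_size
        if energy >= threshold:
            kept.extend(frame)
    return kept


if __name__ == "__main__":
    # Quiet frame, loud (speech-like) frame, quiet frame.
    signal = [0.01, -0.02, 0.01, 0.0,
              0.5, -0.6, 0.4, -0.5,
              0.0, 0.01, -0.01, 0.0]
    print(drop_quiet_frames(signal))
```

A threshold that is too high would also discard soft speech, which is exactly the trade-off the last bullet describes.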
Headphone: General Input Device For Speech Recognition

2.2 RECOGNITION
• Once preprocessing is completed, the input data moves to the recognition stage, where the primary work of speech recognition is done.
• There are two main approaches to the recognition problem: a knowledge-based approach and a data-based approach.
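
The slides do not say which algorithms the two approaches use. As a hedged illustration of what a data-based approach can look like, the sketch below compares an incoming feature sequence against stored example sequences using dynamic time warping (DTW) and picks the closest one. DTW is a technique chosen for this example, and the template words and feature values are made up.

```python
from typing import Dict, List


def dtw_distance(a: List[float], b: List[float]) -> float:
    """Dynamic time warping distance between two 1-D feature sequences."""
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] = best alignment cost of a[:i] against b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])
            cost[i][j] = step + min(cost[i - 1][j],      # stretch b
                                    cost[i][j - 1],      # stretch a
                                    cost[i - 1][j - 1])  # advance both
    return cost[n][m]


def recognize(features: List[float], templates: Dict[str, List[float]]) -> str:
    """Return the template word whose sequence is closest to the input."""
    return min(templates, key=lambda word: dtw_distance(features, templates[word]))


if __name__ == "__main__":
    # Hypothetical stored examples; real systems use richer features.
    templates = {"yes": [1.0, 3.0, 2.0], "no": [4.0, 4.0, 1.0]}
    # The same shape as "yes", spoken more slowly (first value repeated).
    print(recognize([1.0, 1.0, 3.0, 2.0], templates))
```

The point of the warping step is that the same word spoken faster or slower still lines up with its stored example.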
2.3 COMMUNICATION
• The final stage in the speech recognition process is the communication stage.
• In this stage, the software system acts upon the voice input it has received and translated.
• Thus the whole process of SR is completed.
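
To make "acts upon the voice input" concrete, here is a minimal sketch of a communication stage that maps recognized text to an action. The command phrases, handler functions, and the `act_on` name are hypothetical and only serve as an example; they are not taken from the presentation.

```python
from typing import Callable, Dict


def open_file() -> str:
    return "opening file"


def close_window() -> str:
    return "closing window"


# Hypothetical table linking recognized phrases to actions.
ACTIONS: Dict[str, Callable[[], str]] = {
    "open file": open_file,
    "close window": close_window,
}


def act_on(transcript: str) -> str:
    """Run the action registered for the recognized phrase, if there is one."""
    handler = ACTIONS.get(transcript.strip().lower())
    if handler is None:
        return "unrecognized command"
    return handler()


if __name__ == "__main__":
    print(act_on("Open File"))
    print(act_on("make coffee"))
```

Normalizing the transcript (trimming and lowercasing) before the lookup keeps small differences in the recognizer's output from breaking the match.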
Mark Lucente, an IBM researcher, moving one of the virtual objects displayed on the screen.

“Put that…” “Over there…”

3. APPLICATIONS OF SPEECH RECOGNITION SYSTEMS
• Speech recognition has applications in many fields.
• The best known are dictation/text entry, voice-command systems, information services, education, and security systems.
3.1 DICTATION/TEXT ENTRY
• First tune your audio setup, then train the software to recognize your voice. After that, you can use it anywhere you would use a mouse or keyboard for input.
• FreeSpeech 2000 from Philips, Voice Xpress from the Belgian company Lernout & Hauspie, ViaVoice from IBM, and Dragon NaturallySpeaking from Dragon Systems are among the software products available on the market.
3.2 VOICE-COMMAND SYSTEMS
• With voice-command systems, a user gives commands to the computer by talking to it, and the computer performs the corresponding actions.
• In the future, voice-command systems will allow information and communication anywhere, anytime, and in any way the user wants: in the office, at home, in the car, kitchen, design studio, college, and so on.
3.3 SECURITY SYSTEMS
• Every day, terabytes of data containing personal information are sent over the World Wide Web.
• Orders are placed and money is transferred.
• To secure such transactions, the company VeriVoice, based in Princeton, New Jersey, developed a system that uses a Netscape plug-in and the microphone installed on most multimedia computers to perform speaker verification.
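
The slides do not describe how VeriVoice's system decides whether a speaker is genuine. As a generic illustration of speaker verification, the sketch below compares a feature vector from a new voice sample against an enrolled "voiceprint" and accepts the speaker when the two are similar enough. The cosine-similarity measure, the 0.9 threshold, and the example vectors are all assumptions made for this example.

```python
import math
from typing import List


def cosine_similarity(u: List[float], v: List[float]) -> float:
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # an all-zero vector carries no voice information
    return dot / (norm_u * norm_v)


def verify_speaker(sample: List[float],
                   enrolled: List[float],
                   threshold: float = 0.9) -> bool:
    """Accept the claimed identity if the sample is close to the voiceprint."""
    return cosine_similarity(sample, enrolled) >= threshold


if __name__ == "__main__":
    enrolled = [1.0, 2.0, 3.0]                            # hypothetical voiceprint
    print(verify_speaker([1.1, 2.0, 2.9], enrolled))      # similar voice
    print(verify_speaker([3.0, -1.0, 0.5], enrolled))     # different voice
```

Note the difference from speech recognition: verification checks who is speaking, not what is being said.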
3.4 INFORMATION SERVICES
• Speech recognition can also be applied to information services, such as those offered over communication networks.
• Research is currently under way on speech recognition over the telephone.
• Instead of dialing numbers, the caller can steer the dialog by talking to an automated service system.
3.5 EDUCATION
• The use of speech technology in education offers great new possibilities.
• Learning to read and speak foreign languages, transcription of lectures, and assistance for people with learning disabilities are applications already in use.
• Software programs developed at IBM's Thomas J. Watson Research Center help children learn to read.
4.1 CONCLUSION
• In recent years, voice and speech recognition systems have become more and more common, as evidenced by Sprint's voice-activated telephone dialing system, voice-controlled Windows applications, and speech recognition devices in many commercial industries.
• The market for these products continues to grow as people discover more and more applications where they could be useful.
4.2 CONCLUSION
• For many people, speech recognition has moved in the past few years from being a novelty to becoming an important tool in their everyday lives.
• Speech recognition technology will enter our daily lives within a few years; for some, it already has.
• SR technology has the potential to let humans interact naturally with machines.
4.3 CONCLUSION
• This emerging technology is the next phase in a series of steps to convert the computer into an interactive, humanlike mind.
• Indeed, as this technology improves, people will ask their refrigerator for milk and tell their cars where to go.
• Gradual integration will take place, and someday you'll notice yourself wondering, "Gee, was that a computer I was talking to or was it a human operator?"
THANK
YOU
?
