Speaker Recognition System (SRS)

The document discusses speaker recognition systems. It begins by defining speaker recognition as the process of automatically identifying or verifying a speaker based on information in their speech waves. It then discusses the objectives of extracting, characterizing, and recognizing speaker identity from speech signals. The document outlines the basic steps of a speaker recognition system as voice recording, feature extraction, pattern matching, and decision making. It focuses on Mel Frequency Cepstral Coefficients (MFCC) for feature extraction and Gaussian Mixture Models (GMM) for pattern matching. The document also provides details about its own experimental methodology using the TIMIT database, MFCC features, and GMM modeling in Matlab.

SPEAKER RECOGNITION SYSTEM (SRS)
MD. ASAD
asadthomas@gmail.com
RIYA BHADRA
IASNLP-2015, IIIT Hyderabad
Introduction

• Speaker recognition is the process of automatically recognizing (identifying or verifying) who is speaking on the basis of the individual information contained in speech waves.
Objectives and aims
• To extract, characterize, and recognize information about a speaker's identity.

• To build a robust system that identifies and verifies a speaker accurately.

Automatically extract information transmitted in the speech signal
Applications of speaker recognition

• Uses of SR include voice dialling, banking by telephone, telephone shopping, database access services, information services, voice mail, security control for confidential information areas, and remote access to computers.

• Some systems use "anti-speaker" techniques such as cohort models.
Development of Speaker Recognition Systems

• The first speaker recognition machine, which used spectrograms of voices, was invented in the 1960s. The technique was called voiceprint analysis or visible speech.

• Since the mid-1980s the field has matured steadily: commercial applications of SR have been increasing, and many companies currently offer this technology.
Speech processing taxonomy
Principles of Speaker Recognition

Two applications:

• Speaker Identification
• Speaker Verification

There are two types of speaker recognition:

• Text dependent (restrained)
• Text independent (unrestrained)

Text-dependent recognition performs better for subjects who cooperate, but text-independent recognition is more flexible in that it can also be used for non-cooperating individuals.

• Closed set
• Open set
Speaker Recognition

• Identification or authentication using speaker recognition consists of four basic steps:

1. Voice recording
2. Feature extraction
3. Pattern matching
4. Decision (accept / reject)
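
The decision step can be sketched as follows. This is a minimal illustration, not the system described in these slides: the speaker labels, scores, and threshold are hypothetical, and a higher score is assumed to mean a better match.

```python
from typing import Dict


def identify(scores: Dict[str, float]) -> str:
    """Closed-set identification: return the enrolled speaker with the best score."""
    return max(scores, key=scores.get)


def verify(score: float, threshold: float) -> bool:
    """Verification: accept the claimed identity only if the score reaches the threshold."""
    return score >= threshold


# Hypothetical matching scores (e.g. average log-likelihoods) for one test utterance.
scores = {"spk_a": -41.2, "spk_b": -38.7, "spk_c": -45.0}

best = identify(scores)                                # identification picks one enrolled speaker
accepted = verify(scores["spk_b"], threshold=-40.0)    # verification accepts or rejects a claim
print(best, accepted)
```

Identification always returns one of the enrolled speakers, while verification can reject; an open-set system would typically combine the two by thresholding the best score.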
Feature Extraction

• Feature extraction converts the speech waveform into some type of parametric representation. This sub-process is the key part of front-end processing and is often treated as a stand-in for front-end processing as a whole.
• Features used for feature extraction include LPCCs, MFCCs, etc.
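
Before any parametric representation is computed, the waveform is usually cut into short overlapping frames. A minimal sketch of that framing step, assuming a 25 ms frame and a 10 ms hop at 16 kHz (these values and the function name are illustrative assumptions, not taken from the slides):

```python
from typing import List


def frame_signal(signal: List[float], frame_len: int, hop: int) -> List[List[float]]:
    """Split a waveform into overlapping frames; a trailing partial frame is dropped."""
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frames.append(signal[start:start + frame_len])
    return frames


# Assumed settings: 16 kHz audio, 25 ms frames (400 samples), 10 ms hop (160 samples).
signal = [0.0] * 1000                      # placeholder waveform of 1000 samples
frames = frame_signal(signal, frame_len=400, hop=160)
print(len(frames), len(frames[0]))
```

Each frame is then turned into one feature vector, so an utterance becomes a sequence of vectors.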
Pattern Matching

• Pattern matching is the actual comparison of the extracted frames with known speaker models (or templates). It results in a matching score that quantifies the similarity between the voice recording and a known speaker model. Pattern matching is often based on Hidden Markov Models (HMMs), statistical models that take into account the underlying variations and temporal changes of the acoustic pattern.
• Models used for pattern matching include VQ, NN, HMM, GMM, etc.
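
As an illustration of a matching score, here is a small sketch using the VQ approach from the list above: each speaker is represented by a codebook of template vectors, and the score is the average distance from each frame to its nearest codeword (lower means more similar). The codebooks and frames are made-up toy data.

```python
from typing import List, Sequence

Vector = Sequence[float]


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def vq_distortion(frames: List[Vector], codebook: List[Vector]) -> float:
    """Average distance from each frame to its nearest codeword in the codebook."""
    return sum(min(sq_dist(f, c) for c in codebook) for f in frames) / len(frames)


# Toy 2-dimensional features; real systems would use e.g. MFCC vectors.
frames = [[0.0, 0.0], [1.0, 1.0]]
codebook_a = [[0.0, 0.0], [1.0, 1.0]]   # hypothetical speaker A
codebook_b = [[2.0, 2.0]]               # hypothetical speaker B

print(vq_distortion(frames, codebook_a), vq_distortion(frames, codebook_b))
```

The speaker whose codebook gives the smallest distortion is the best match for the recording.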
Speaker Recognition

• Database used = TIMIT

• Feature extraction = MFCCs
• Pattern matching = GMM
• Tool used = MATLAB
WHY MFCCs?

Mel-frequency Cepstral Coefficients:

• To date, Mel-frequency cepstral coefficients (MFCCs) are the best known and most commonly used features not only for speech recognition but for speaker recognition as well. The computation of MFCCs is based on short-term analysis and is similar to the computation of cepstral coefficients. The significant difference lies in the use of critical-band filters to realize mel-frequency warping. The critical bandwidths vary with frequency according to the perception of the human ear.
• A mel is a unit of measure based on the frequency perceived by the human ear.
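
The MFCC computation for a single frame can be sketched as below. This is a dependency-free illustration under stated assumptions, not the MATLAB code used for the experiments: the mel formula 2595·log10(1 + f/700) is one common approximation, and the Hamming window, 20 triangular filters, 12 coefficients, and all function names are illustrative choices.

```python
import cmath
import math
from typing import List


def hz_to_mel(f: float) -> float:
    """One common approximation of the mel scale (an assumption, not from the slides)."""
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m: float) -> float:
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def power_spectrum(frame: List[float]) -> List[float]:
    """Naive DFT power spectrum for bins 0..N/2 (slow, but needs no libraries)."""
    n = len(frame)
    spec = []
    for k in range(n // 2 + 1):
        s = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        spec.append(abs(s) ** 2 / n)
    return spec


def mel_filterbank(n_filters: int, n_fft: int, sample_rate: int) -> List[List[float]]:
    """Triangular filters whose edges are equally spaced on the mel scale."""
    top = hz_to_mel(sample_rate / 2)
    mel_points = [top * i / (n_filters + 1) for i in range(n_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * mel_to_hz(m) / sample_rate)) for m in mel_points]
    bank = [[0.0] * (n_fft // 2 + 1) for _ in range(n_filters)]
    for j in range(1, n_filters + 1):
        left, centre, right = bins[j - 1], bins[j], bins[j + 1]
        for k in range(left, centre):          # rising edge
            bank[j - 1][k] = (k - left) / (centre - left)
        for k in range(centre, right):         # falling edge
            bank[j - 1][k] = (right - k) / (right - centre)
    return bank


def mfcc_frame(frame: List[float], sample_rate: int,
               n_filters: int = 20, n_ceps: int = 12) -> List[float]:
    """Window -> power spectrum -> mel filterbank -> log -> DCT-II (c0 skipped)."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * t / (n - 1)))
                for t, x in enumerate(frame)]
    spec = power_spectrum(windowed)
    bank = mel_filterbank(n_filters, n, sample_rate)
    log_e = [math.log(max(sum(w * p for w, p in zip(filt, spec)), 1e-10))
             for filt in bank]
    return [sum(e * math.cos(math.pi * q * (m + 0.5) / n_filters)
                for m, e in enumerate(log_e))
            for q in range(1, n_ceps + 1)]


# Toy input: one 256-sample frame of a 440 Hz tone sampled at 16 kHz.
frame = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(256)]
coeffs = mfcc_frame(frame, sample_rate=16000)
print(len(coeffs))
```

The mel warping is what distinguishes this from plain cepstral coefficients: the filters are narrow at low frequencies and wide at high frequencies, mirroring the ear's resolution.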
Introduction to GMM

• Gaussian: "Gaussian is a characteristic symmetric 'bell curve' shape that quickly falls off towards 0 (practically)"

• Mixture Model: "mixture model is a probabilistic model which assumes the underlying data to belong to a mixture distribution"
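
Putting the two definitions together, a GMM scores a feature vector x as a weighted sum of Gaussian densities, p(x) = Σ w_i · N(x; μ_i, Σ_i), with weights that sum to 1. A minimal sketch of that score follows; diagonal covariances and all function names are assumptions made for illustration.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def log_gaussian_diag(x: Vector, mean: Vector, var: Vector) -> float:
    """Log density of a Gaussian with diagonal covariance."""
    return -0.5 * sum(math.log(2 * math.pi * v) + (xi - m) ** 2 / v
                      for xi, m, v in zip(x, mean, var))


def gmm_log_likelihood(x: Vector, weights: List[float],
                       means: List[Vector], variances: List[Vector]) -> float:
    """log p(x) for the mixture, computed with the log-sum-exp trick for stability."""
    logs = [math.log(w) + log_gaussian_diag(x, m, v)
            for w, m, v in zip(weights, means, variances)]
    peak = max(logs)
    return peak + math.log(sum(math.exp(l - peak) for l in logs))


def average_log_likelihood(frames: List[Vector], weights: List[float],
                           means: List[Vector], variances: List[Vector]) -> float:
    """Utterance-level score: mean per-frame log-likelihood under one speaker's GMM."""
    return sum(gmm_log_likelihood(f, weights, means, variances) for f in frames) / len(frames)


# Toy one-dimensional model with two components (made-up parameters).
weights = [0.5, 0.5]
means = [[0.0], [4.0]]
variances = [[1.0], [1.0]]
near = average_log_likelihood([[0.1], [3.9]], weights, means, variances)
far = average_log_likelihood([[10.0], [12.0]], weights, means, variances)
print(near > far)
```

In a speaker recognition setting, one such model would be trained per speaker, and the utterance-level score plays the role of the matching score.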
Why GMM?

• The classification paradigms used in SRS during the past 20 years include VQ, NN, HMM and GMM, which stand for Vector Quantization, Neural Network, Hidden Markov Model and Gaussian Mixture Model respectively. A continuous ergodic HMM method is superior to a discrete ergodic HMM method, and a continuous ergodic HMM method is as robust as a VQ-based method when enough training data is available. However, when little data is available, the VQ-based method is more robust than a continuous HMM method.
EXPERIMENTAL METHODOLOGY

Dataset Description
• TIMIT database
• Total number of speakers = 98
• Female speakers = 48
• Male speakers = 50
• Sentences per speaker = 10
• Training data = 8 sentences for each speaker
• Testing data = 2 sentences for each speaker
Analysis Tool
• MATLAB
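
The split above can be checked with a few lines of arithmetic. In this sketch the sentence identifiers are hypothetical, and taking the first 8 sentences for training is an assumption; the slides do not say which sentences were used for which set.

```python
from typing import List, Tuple

N_FEMALE, N_MALE = 48, 50
N_SPEAKERS = N_FEMALE + N_MALE          # 98 speakers in total
SENTENCES_PER_SPEAKER = 10
N_TRAIN, N_TEST = 8, 2


def split_utterances(utterances: List[str], n_train: int = N_TRAIN) -> Tuple[List[str], List[str]]:
    """Assumed split: the first n_train sentences train the model, the rest test it."""
    return list(utterances[:n_train]), list(utterances[n_train:])


# Hypothetical sentence ids for one speaker.
sentences = [f"sent_{i:02d}" for i in range(SENTENCES_PER_SPEAKER)]
train, test = split_utterances(sentences)

total_train = N_SPEAKERS * len(train)   # training utterances over all speakers
total_test = N_SPEAKERS * len(test)     # test utterances over all speakers
print(total_train, total_test)
```

With 98 speakers this gives 784 training utterances and 196 test utterances.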
Result
Question?
