Lab 9a. Linear Predictive Coding For Speech Processing: Vocal Tract Parameters Pitch Period Voiced/Unvoiced Speech Switch

This document describes linear predictive coding (LPC) for speech processing. LPC models speech production as a time-varying digital filter excited by an impulse train or random noise. It estimates the filter coefficients using linear prediction to minimize prediction error, resulting in analysis and synthesis filters. The MATLAB LPC demo analyzes speech by windowing frames, applying an analysis filter to obtain residuals and coefficients, then synthesizes the original signal by passing residuals through the inverse synthesis filter.

Uploaded by

Hamouda Azzouz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

51 views5 pages

Lab 9a. Linear Predictive Coding For Speech Processing: Vocal Tract Parameters Pitch Period Voiced/Unvoiced Speech Switch

Uploaded by

Hamouda Azzouz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

E E 2 7 5 Lab October 27, 2007

Lab 9a. Linear Predictive Coding for Speech Processing

Impulse Train
Generator
Random
Noise
Generator
Vocal Tract
Parameters
Time-Varying
Digital Filter
Block Diagram of simplified model of speech production
Pitch Period
Voiced/Unvoiced
Speech
Switch
H(z)
Figure 1:
Sections 0.4 and 0.5 contain the Lab Experiment and Lab Report needed.
0.1 Basic Principles of Linear Predictive Analysis
The basic discrete-time model of speech production is shown above. The composite spectral eects of
radiation, vocal tract and glottal excitation are represented by a time-varying digital lter. For short
periods when parameters are considered stationary, we have a time-invariant system. The steady-state
transfer function H(z) of the lter part of the model is modeled as,
H(z) =
S(z)
U(z)
=
G
1 a
1
z
1)
a
2
z
2
a
3
z
3
. . . a
p
z
p
(1)
The vocal tract system is excited by signal u[n], which will be an impulse train for voiced speech or random
noise for unvoiced speech. Thus, the parameters of this speech model are: voiced-unvoiced classication,
pitch period for voiced speech, gain parameter G and the coecients {a
k
} of the lter. These are the
parameters that are transmitted in coded speech.
There are many methods for estimation of pitch period and voiced/unvoiced classication. They are not
discussed here and actually are not implemented in this Demo. What is implemented is a method for
determining lter coecients (lattice lter coecients, referred to as reection coecients). It is these
lter coecients that are transmitted along with a residual signal instead of the parameters referred to above.
We consider the simplied all-pole model of Figure 1 , equation (1) as the natural representation of non-
nasal voiced sounds. (For nasals and fricatives, the acoustic theory calls for both poles and zeros in the
vocal tract transfer function H(z)). Actually, if the lter order p is high enough, the all-pole model provides
a fairly good representation for almost all the sounds of the speech. The major advantage of the all-pole
model is that the gain parameter G and the lter coecients a
k
can be estimated in a straightforward and
computationally ecient way using the method of linear predictive analysis.
0.2 Linear Predicton Analysis & Synthesis Filters
We assume that speech is modeled as shown in Figure 1. The speech s(n) is related to excitation u(n) by
s[n] =
p

k=1
a
k
s[n k] + Gu[n] (2)
To obtain model coecients, we resort to the following: Assume that you are trying to predict signal s[n]
at time n from previous values at times n 1, n 2, . . . etc.. A linear predictor with prediction coecients

k
is dened as a system whose output is
s[n] =
p

k=1

k
s[n k] (3)
The transfer function of the p
th
order linear predictor of equation (3) is the polynomial
P(z) =
p

k=1

k
z
k
The prediction error e(n) is dened as
e[n] = s[n] s[n] = s[n]
p

k=1

k
s[n k] (4)
Equivalently,
E(z) = A(z)S(z)
where
A(z) = 1
p

k=1

k
z
k
Comparing equations (2) and (4) it is seen that when the speech signal obeys the model of (2) exactly, then

k
= a
k
exactly. Then e[n] = Gu[n] and E(z) = GU(z). Thus the prediction error lter A(z) will be the
inverse lter of the system H(z) of (1). That is,
E(z) = GU(z) = A(z)S(z)
Hence,
H(z) =
S(z)
U(z)
=
G
A(z)
So we have A(z), the analysis lter and H(z), the synthesis lter.
The basic problem of linear prediction analysis is to determine the set of predictor coecients coecients
k
directly from the speech signal. Because of the non-stationary nature of speech, coecients are determined
for short segments of the speech where the signal is considered approximately stationary. These are found
through a minimization of the mean-square prediction error. The resulting parameters are then assumed to
be the parameters of the system function H(z) which is then used for the synthesis of that speech segment.
The method of determining these coecients is outlined below.
0.3 Minimum Mean-Square Error and the Orthogonality Principle
We consider the linear prediction problem of equation (3) as predicting a random variable from a set of other
random variables. Given RVs (x
1
, x
2
, . . . , x
n
) we wish to nd n constants
a
1
, a
2
, a
3
, . . . , a
n
such that we form a linear estimate of a random variable s by the sum of RVs
s = a
1
x
1
+ a
2
x
2
+ . . . , +a
n
x
n
. (5)
This is typically done by assuring that the the mean-square value
P = E{|s (a
1
x
1
+ a
2
x
2
+ . . . , +x
n
)|
2
}
of the resulting error
= s s = s (a
1
x
1
+ a
2
x
2
+ . . . , +x
n
)
is minimum. We do this by setting
P
a
i
= E{2[s (a
1
x
1
+ a
2
x
2
+ . . . , +a
n
x
n
)](x
i
)} = 0 (6)
which yields the so-called Yule Walker equations:
Setting i = 1, 2, . . . , n in equation (6) we get
R
11
a
1
+ R
12
a
2
+ .... + R
1n
a
n
= R
01
R
21
a
1
+ R
22
a
2
+ .... + R
2n
a
n
= R
02
R
31
a
1
+ R
32
a
2
+ .... + R
3n
a
n
= R
03
............................................................
R
n1
a
1
+ R
n2
a
2
+ .... + R
nn
a
n
= R
0n
(7)
where
R
ji
= E{x
i
x

j
} R
0j
= E{sx

j
}
If the data x
i
are linearly independent then the determinant of the coecients R
ij
is positive. Equation
(7) is solved for the unknown coecients a
k
, k = 1, 2, . . . n (
k
on the previous page) by using the so-called
Levinson-Durbin algorithm. Accordingly, the problem essentially consists of determining, for a short segment
of speech, the matrix of correlation coecients R
i,j
and then inverting the matrix to obtain the prediction
coecients which are then transmitted. All this often has to be done in real-time.
0.4 MATLAB LPC DEMO
Run the Demo as per instructions in Lab 9.
Demo Decsription
The demo consists of two parts; analysis and synthesis. The analysis portion is found in the transmitter
section of the system.
Analysis Section:
In this simulation, the speech signal is divided into frames of size 20 ms (160 samples), with an overlap of 10
ms (80 samples). Each frame is windowed using a Hamming window. The original speech signal is passed
through an analysis lter, which is an all-zero lter. It is a so-called lattice lter with coecients referred
to as reection coecients obtained in the previous step. The output of the lter is called the residual
signal. This is what is transmitted here along with the lter coecients. Here, the analysis section output
is simply connected to the synthesis portion.
Synthesis Section:
This residual signal is passed through a synthesis lter which is the inverse of the analysis lter. The output
of the synthesis lter is the original signal.
0.5 LAB REPORT
Give a brief description of what exactly is happening in the analysis and synthesis portion of the MATLAB
LPC speech analysis and synthesis Demo. Observe the residual signal and lter coecients generated in the
Analysis section that are then transmitted to the synthesis section.
Figure 2:
Ref: MATLAB Help, Linear Predicting & Coding of Speech.
Class notes:mirchand/ee276-2003

(Thomas F. Quatieri) Discrete Time Speech Signal P (BookFi - Org) 2 PDF
100% (3)
(Thomas F. Quatieri) Discrete Time Speech Signal P (BookFi - Org) 2 PDF
800 pages
Digital Processing of Speech Signals (Rabiner & Schafer 1978) PDF
100% (2)
Digital Processing of Speech Signals (Rabiner & Schafer 1978) PDF
265 pages
Lawrence R. Rabiner, Digital Processing of Speech Signals
100% (1)
Lawrence R. Rabiner, Digital Processing of Speech Signals
527 pages
Linear Prediction of Speech: D. Markel A. H. Gray, JR
No ratings yet
Linear Prediction of Speech: D. Markel A. H. Gray, JR
299 pages
Dokumen - Pub Discrete Time Speech Signal Processing Principles and Practice Low Price Ed Lpe 013242942x 9780132429429 9788177587463 8177587463
No ratings yet
Dokumen - Pub Discrete Time Speech Signal Processing Principles and Practice Low Price Ed Lpe 013242942x 9780132429429 9788177587463 8177587463
802 pages
Codificadores de Voz
No ratings yet
Codificadores de Voz
26 pages
SP MODULE 5 PPT L4C
No ratings yet
SP MODULE 5 PPT L4C
145 pages
Test2 SP
No ratings yet
Test2 SP
43 pages
342383676
No ratings yet
342383676
94 pages
Anais Aesbr2007
No ratings yet
Anais Aesbr2007
160 pages
Linear Predict
No ratings yet
Linear Predict
14 pages
Linear Prediction: The Technique, Its Solution and Application To Speech
No ratings yet
Linear Prediction: The Technique, Its Solution and Application To Speech
20 pages
Report
No ratings yet
Report
9 pages
Banglai Namaz Shikkha
No ratings yet
Banglai Namaz Shikkha
12 pages
Gram - Linear Prediction and Optimum Linear Fil-33-52
No ratings yet
Gram - Linear Prediction and Optimum Linear Fil-33-52
20 pages
A F A E: Daptive Iltering Pplications Xplained
No ratings yet
A F A E: Daptive Iltering Pplications Xplained
15 pages
3
No ratings yet
3
12 pages
Linear Prediction: The Problem, Its Solution and Application To Speech
No ratings yet
Linear Prediction: The Problem, Its Solution and Application To Speech
22 pages
Biomodelling - Linier Prediction
No ratings yet
Biomodelling - Linier Prediction
23 pages
Pub - Digital Processing of Speech Signals PDF
No ratings yet
Pub - Digital Processing of Speech Signals PDF
265 pages
AudioProcessing[1]
No ratings yet
AudioProcessing[1]
17 pages
LPC Modeling of Vocal Tract: H (Z) G A (Z) G A Z
No ratings yet
LPC Modeling of Vocal Tract: H (Z) G A (Z) G A Z
11 pages
Module2 SSP
No ratings yet
Module2 SSP
70 pages
dịch bt
No ratings yet
dịch bt
13 pages
CELP
No ratings yet
CELP
23 pages
Why linear prediction analysis is important in speech
No ratings yet
Why linear prediction analysis is important in speech
10 pages
PDF
No ratings yet
PDF
485 pages
IOSRJEN (WWW - Iosrjen.org) IOSR Journal of Engineering
No ratings yet
IOSRJEN (WWW - Iosrjen.org) IOSR Journal of Engineering
5 pages
IJCER (WWW - Ijceronline.com) International Journal of Computational Engineering Research
No ratings yet
IJCER (WWW - Ijceronline.com) International Journal of Computational Engineering Research
6 pages
Linear Predictor: Nature of Linear Prediction
No ratings yet
Linear Predictor: Nature of Linear Prediction
9 pages
Linear Prediction Analysis: Vignans Institute of Engineering For Women
No ratings yet
Linear Prediction Analysis: Vignans Institute of Engineering For Women
20 pages
Atal 2006 LPC PDF
No ratings yet
Atal 2006 LPC PDF
5 pages
Speech Compression
No ratings yet
Speech Compression
14 pages
Implementation of Linear Predictive Coding (LPC) of Speech: Outline
No ratings yet
Implementation of Linear Predictive Coding (LPC) of Speech: Outline
15 pages
Prentice Hall - Digital Processing of Speech Signals - 1978 PDF
No ratings yet
Prentice Hall - Digital Processing of Speech Signals - 1978 PDF
265 pages
Review On ELEC333: Spring 2011 Nico & Wilber
No ratings yet
Review On ELEC333: Spring 2011 Nico & Wilber
63 pages
LPC
No ratings yet
LPC
5 pages
Prepared By: Mamatha.K.S M.Tech (S.P) 1 Sem Guided By: Mr. Satish.M.N
No ratings yet
Prepared By: Mamatha.K.S M.Tech (S.P) 1 Sem Guided By: Mr. Satish.M.N
21 pages
Linear Prediction
No ratings yet
Linear Prediction
18 pages
Speech Generation
No ratings yet
Speech Generation
11 pages
2020 11 26.09.47.56 Digital Communication Unit II 13marks
No ratings yet
2020 11 26.09.47.56 Digital Communication Unit II 13marks
23 pages
Speech Coders For Wireless Communication
No ratings yet
Speech Coders For Wireless Communication
53 pages
Estimation of Formant Frequency of Speech Signal by Linear Prediction Method and Wavelet Transform IJERTV2IS3371
No ratings yet
Estimation of Formant Frequency of Speech Signal by Linear Prediction Method and Wavelet Transform IJERTV2IS3371
6 pages
David S Undermann, Harald H Oge, Antonio Bonafonte, Helenca Duxans
No ratings yet
David S Undermann, Harald H Oge, Antonio Bonafonte, Helenca Duxans
5 pages
Musero V CAA Demurrer
No ratings yet
Musero V CAA Demurrer
20 pages
International Financial Institutions
No ratings yet
International Financial Institutions
29 pages
A Tutorial On Speech Synthesis Models
No ratings yet
A Tutorial On Speech Synthesis Models
8 pages
(eBook PDF) Organizational Behavior in Education: Leadership and School Reform 11th Editionpdf download
100% (5)
(eBook PDF) Organizational Behavior in Education: Leadership and School Reform 11th Editionpdf download
48 pages
Linear, Time-Varying System e (N), Excitation X (N), Speech Output
No ratings yet
Linear, Time-Varying System e (N), Excitation X (N), Speech Output
4 pages
Artificial Bandwidth Extension of Speech: COURSE SGN-1650 AND SGN-1656, 2010-2011
No ratings yet
Artificial Bandwidth Extension of Speech: COURSE SGN-1650 AND SGN-1656, 2010-2011
7 pages
LPC Vocoder Project
No ratings yet
LPC Vocoder Project
4 pages
EC18501 - Unit II_DM_ADM_DPCM_LPC
No ratings yet
EC18501 - Unit II_DM_ADM_DPCM_LPC
43 pages
Speech Processing Project
No ratings yet
Speech Processing Project
16 pages
APA Handbook of Clinical Psychology - Applications and Methods
100% (1)
APA Handbook of Clinical Psychology - Applications and Methods
15 pages
GITAM Integrated BTech-MTech Syllabus
No ratings yet
GITAM Integrated BTech-MTech Syllabus
186 pages
You Drink It Just Like
No ratings yet
You Drink It Just Like
22 pages
18.7 Real-World Example - Speech Synthesis: 0 and So The Interfering Sinusoid Is Filtered Out. The PSD at
No ratings yet
18.7 Real-World Example - Speech Synthesis: 0 and So The Interfering Sinusoid Is Filtered Out. The PSD at
5 pages
A Prayer For My Daughter Full
No ratings yet
A Prayer For My Daughter Full
3 pages
EE6425 Class Project: LPC 10 Speech Analysis and Synthesis Model
No ratings yet
EE6425 Class Project: LPC 10 Speech Analysis and Synthesis Model
23 pages
Linear Predictive Coding
No ratings yet
Linear Predictive Coding
22 pages
Use of Spectral Autocorrelation in Spectral Envelope Linear Prediction For Speech Recognition
No ratings yet
Use of Spectral Autocorrelation in Spectral Envelope Linear Prediction For Speech Recognition
31 pages
ps7 Fall09
No ratings yet
ps7 Fall09
2 pages
E9 261 - Speech Information Processing: Homework # 3 Due Date: May 2, 2021
No ratings yet
E9 261 - Speech Information Processing: Homework # 3 Due Date: May 2, 2021
4 pages
DSP Unit5 Applications of Multirate Signal Processing
No ratings yet
DSP Unit5 Applications of Multirate Signal Processing
19 pages
Schabanel PhDThesis
No ratings yet
Schabanel PhDThesis
244 pages
Grand Realty - Def Sandlin Answer to Complaint(165496190.1)
No ratings yet
Grand Realty - Def Sandlin Answer to Complaint(165496190.1)
14 pages
Discrete Time Processing of Speech Signa
No ratings yet
Discrete Time Processing of Speech Signa
12 pages
Speech Coding and Phoneme Classification Using Matlab and Neuralworks
No ratings yet
Speech Coding and Phoneme Classification Using Matlab and Neuralworks
4 pages
Investment Banking Interview Questions
No ratings yet
Investment Banking Interview Questions
9 pages
FINM7008 Lecture 4
No ratings yet
FINM7008 Lecture 4
33 pages
Normative Structure of Science
No ratings yet
Normative Structure of Science
8 pages
Discrete Time Systems
No ratings yet
Discrete Time Systems
11 pages
Seminar 1
No ratings yet
Seminar 1
12 pages
Variational Inference Ref Paper
No ratings yet
Variational Inference Ref Paper
13 pages
Final Test Literature 5 Types of Figurative Language
No ratings yet
Final Test Literature 5 Types of Figurative Language
15 pages
Engineering Workshop Report
No ratings yet
Engineering Workshop Report
16 pages
Iemh108 PDF
No ratings yet
Iemh108 PDF
17 pages
Stream Analysis
No ratings yet
Stream Analysis
35 pages
Ob Unit 2 Leadership 2022 Bba Sem 2
No ratings yet
Ob Unit 2 Leadership 2022 Bba Sem 2
18 pages
Poem
50% (2)
Poem
5 pages
CRC Agricultural Trading v. NLRC
No ratings yet
CRC Agricultural Trading v. NLRC
6 pages
The Enemy Board Ques. Ans. 24-25
No ratings yet
The Enemy Board Ques. Ans. 24-25
4 pages
Chemistry - On Techniques - 06.04
No ratings yet
Chemistry - On Techniques - 06.04
2 pages
Architecture Du Réseau LTE
No ratings yet
Architecture Du Réseau LTE
2 pages
Earned Value Analysis Basic Concepts: Ricardo Viana Vargas, MSC, Ipma-B, PMP
No ratings yet
Earned Value Analysis Basic Concepts: Ricardo Viana Vargas, MSC, Ipma-B, PMP
25 pages
Signal Processing - Exercices: 1 Exercice 1
No ratings yet
Signal Processing - Exercices: 1 Exercice 1
2 pages
C. A. Childress, Psy.D. Licensed Clinical Psychologist, Psy 18857
No ratings yet
C. A. Childress, Psy.D. Licensed Clinical Psychologist, Psy 18857
10 pages
Suraksha Independent Ethics Committee
No ratings yet
Suraksha Independent Ethics Committee
8 pages
RF Propagation - 07-Okumura and Hata Macroscopic Propagation Models
No ratings yet
RF Propagation - 07-Okumura and Hata Macroscopic Propagation Models
8 pages
To Kill A Mockingbird - Chapters 1-4 Quiz
No ratings yet
To Kill A Mockingbird - Chapters 1-4 Quiz
3 pages
Nanosensors
No ratings yet
Nanosensors
2 pages
RM SAMPLE EXAM For Revision 2021
100% (1)
RM SAMPLE EXAM For Revision 2021
2 pages
Theory of Approximation
From Everand
Theory of Approximation
N. I. Achieser
No ratings yet
Digital Signal and Image Processing using MATLAB, Volume 3: Advances and Applications, The Stochastic Case
From Everand
Digital Signal and Image Processing using MATLAB, Volume 3: Advances and Applications, The Stochastic Case
Gérard Blanchet
3/5 (1)
A Short Course in Discrete Mathematics
From Everand
A Short Course in Discrete Mathematics
Edward A. Bender
3/5 (1)

Lab 9a. Linear Predictive Coding For Speech Processing: Vocal Tract Parameters Pitch Period Voiced/Unvoiced Speech Switch

Uploaded by

Lab 9a. Linear Predictive Coding For Speech Processing: Vocal Tract Parameters Pitch Period Voiced/Unvoiced Speech Switch

Uploaded by

E E 2 7 5 Lab October 27, 2007

Lab 9a. Linear Predictive Coding for Speech Processing

You might also like