Speech Compression Using GSM
Speech Compression Using GSM
cc
Speech Compression
using
GSM RPE-LTP
Faiza Nawaz
Bisma Hashmi
Mehrin Kiani
Introduction to GSM
The Global System for Mobile Communications is the most
popular standard for mobile phones in the world.
www.final-yearprojects.co.cc 2
Architecture Of GSM
www.final-yearprojects.co.cc 3
What is Speech?
Speech Generation:
www.final-yearprojects.co.cc 4
GSM 6.10 Vocoder
www.final-yearprojects.co.cc 5
GSM 6.10 Vocoder
Coding scheme used by GSM 6.10 Vocoder is the Regular Pulse
Excitation - Long Term prediction - Linear Predictive Coder
(RPE-LTP)
www.final-yearprojects.co.cc 6
Linear Predictive Coder (LPC)
LPC algorithm assumes that each speech sample is a linear
combination of previous samples.
www.final-yearprojects.co.cc 7
Residual Pulse Excited (RPE) Coder
Determines if the signal is voiced or unvoiced
www.final-yearprojects.co.cc 8
GSM Compression Technologies
Full Rate
Enhanced Full Rate (EFR)
Adaptive Multi-Rate (AMR)
Half Rate
www.final-yearprojects.co.cc 9
GSM Full Rate Vocoder Using RPE-LTP
www.final-yearprojects.co.cc 10
GSM Full Rate Vocoder Using RPE-LTP
(…contd)
The input data to the RPE-LTP coder is 20ms of speech
composed of 160 samples, each with 13bit resolution.
www.final-yearprojects.co.cc 11
LPC Speech Generation
The model of speech generation can be thought of as air passing
through a set of different size cylinders.
www.final-yearprojects.co.cc 12
Short Term Analysis Stage
Uses autocorrelation to calculate a set of eight reflection
coefficients.
www.final-yearprojects.co.cc 13
Short Term Analysis Stage (…contd)
The coded LARs is then decoded back to coefficients
and used to filter the input samples.
www.final-yearprojects.co.cc 14
Long Term Prediction Stage
www.final-yearprojects.co.cc 15
Long Term Prediction Stage (…
contd)
The long-term predictor produces two parameters for
each sub window: the lag and the gain.
www.final-yearprojects.co.cc 16
Calculating Lag and Gain
LAG:
Compute resemblance by correlation.
correlation of x[n] and y[n] =
Sum of products x[n]*y[n-lag]
GAIN:
Maximum correlation divided by the energy of the
reconstructed short-term residual signal.
www.final-yearprojects.co.cc 17
Residual Pulse Encoding
www.final-yearprojects.co.cc 18
Residual Signal(…contd)
The algorithm down-samples by a factor of three,
discarding two out of three sample values.
www.final-yearprojects.co.cc 19
Speech Decoder
Decoder consists of three parts
RPE Decoding
www.final-yearprojects.co.cc 20
Speech Decoder(…contd)
www.final-yearprojects.co.cc 21
Speech Decoder (…contd)
Algorithm multiplies the 13 3-bit samples by the scaling factor and
expands them back into 40 samples, zero-padding the gaps
Noise from the excited long-term synthesis filter passes through the
tubes of the simulated vocal tract--and emerges as speech
www.final-yearprojects.co.cc 22
QUESTIONS ???
www.final-yearprojects.co.cc 23