0% found this document useful (0 votes)

72 views

Speech Compression Using GSM

This document discusses speech compression using GSM RPE-LTP. It begins by introducing GSM as the most popular standard for mobile phones, using a digital 2G system. It then describes the GSM architecture and speech generation process. The document focuses on the GSM 6.10 vocoder, which uses linear predictive coding (LPC), residual pulse excitation (RPE), and long-term prediction (LTP) to compress speech. It explains each part of the encoding and decoding process in detail.

Uploaded by

Thuy Tran Vinh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

72 views

Speech Compression Using GSM

Uploaded by

Thuy Tran Vinh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 23

www.final-yearprojects.co.

Speech Compression
using
GSM RPE-LTP
Faiza Nawaz
Bisma Hashmi
Mehrin Kiani
Introduction to GSM
 The Global System for Mobile Communications is the most
popular standard for mobile phones in the world.

 GSM service is used by over 2 billion people across more than

212 countries and territories.

 The ubiquity of the GSM standard makes international roaming

very common between mobile phone operators.

 GSM differs significantly from its predecessors in that both

signaling and speech channels are Digital call quality.
(so it is considered a second generation (2G) mobile phone
system.)

www.final-yearprojects.co.cc 2
Architecture Of GSM

www.final-yearprojects.co.cc 3
What is Speech?

 Speech Generation:

www.final-yearprojects.co.cc 4
GSM 6.10 Vocoder

 Key principle: mathematical modeling of the human vocal tract,

leading to an efficient compression method for transmitting
speech.

 A vocoder (combination of voice and coder) is used to describe

GSM systems tailored for the compression of speech.

 The sampling rate is 8000 sample/s leading to an average bit

rate for the encoded bit stream of 13 K bit/s

www.final-yearprojects.co.cc 5
GSM 6.10 Vocoder
 Coding scheme used by GSM 6.10 Vocoder is the Regular Pulse
Excitation - Long Term prediction - Linear Predictive Coder
(RPE-LTP)

 Vocoder sends three kinds of information to the receiver:

 Voiced or unvoiced signal
 (If it is voiced) The period of the excitation signal
 The parameters of the prediction filter.

www.final-yearprojects.co.cc 6
Linear Predictive Coder (LPC)
 LPC algorithm assumes that each speech sample is a linear
combination of previous samples.

 Speech is sampled, stored and analyzed.

 Coefficients calculated from the sample are transmitted and

processed in the receiver.

 Receiver accurately processes and categorizes voiced and

unvoiced sounds.

www.final-yearprojects.co.cc 7
Residual Pulse Excited (RPE) Coder
 Determines if the signal is voiced or unvoiced

 Determines the period for voiced sounds, encodes periodicity and

transmits the coefficient

 When the signal changes from voiced to unvoiced, RPE transmits a

code that stops the receiver from generating periodic pulses

 Starts generating random pulses to correspond to the noise like

nature of unvoiced

www.final-yearprojects.co.cc 8
GSM Compression Technologies

 Four compression technologies are:

 Full Rate
 Enhanced Full Rate (EFR)
 Adaptive Multi-Rate (AMR)
 Half Rate

www.final-yearprojects.co.cc 9
GSM Full Rate Vocoder Using RPE-LTP

 Described as an RPE-LTP linear predictive coder.

 Models the human vocal tract as a series of cylinders of

different widths.

 By forcing air through these cylinders, speech sounds

can be generated— the LPC coder models this with a
set of simultaneous equations.

www.final-yearprojects.co.cc 10
GSM Full Rate Vocoder Using RPE-LTP
(…contd)
 The input data to the RPE-LTP coder is 20ms of speech
composed of 160 samples, each with 13bit resolution.

 The data is first passed through a pre-emphasis filter:

 Enhances high-frequency components of the signal. (better
transmission efficiency.)
 Also removes any offset on the signal. (Simplifies computation.)

www.final-yearprojects.co.cc 11
LPC Speech Generation
 The model of speech generation can be thought of as air passing
through a set of different size cylinders.

www.final-yearprojects.co.cc 12
Short Term Analysis Stage
 Uses autocorrelation to calculate a set of eight reflection
coefficients.

 Schur recursion is used to efficiently solve the set of

equations resulting from it.

 The parameters are then converted into log-area ratios

(LARs) -- that allow better quantizing in a smaller
number of bits — the first eight parameters of the
transmission stream.

www.final-yearprojects.co.cc 13
Short Term Analysis Stage (…contd)
 The coded LARs is then decoded back to coefficients
and used to filter the input samples.

 The reason for decoding the LARs is to ensure that the

encoder uses the same information available at the
decoder to perform the filtering.

 An array of weights lpc[P] is computed such that

s[n] ~ lpc[0]*s[n--1]+lpc[1]*s[n--2]+_+lpc[P--1]*s[n--P]
(P is usually between 8 and 14, GSM uses 8.)

www.final-yearprojects.co.cc 14
Long Term Prediction Stage

 The 160 samples are split into 4 sub-windows of 40

samples each.

www.final-yearprojects.co.cc 15
Long Term Prediction Stage (…
contd)
 The long-term predictor produces two parameters for
each sub window: the lag and the gain.

 The LTP lag describes the source of the copy in time.

 The LTP gain describes the scaling factor.

www.final-yearprojects.co.cc 16
Calculating Lag and Gain

 LAG:
Compute resemblance by correlation.
correlation of x[n] and y[n] =
Sum of products x[n]*y[n-lag]
 GAIN:
Maximum correlation divided by the energy of the
reconstructed short-term residual signal.

www.final-yearprojects.co.cc 17
Residual Pulse Encoding

 To remove the long-term predictable signal from

its input, the algorithm then subtracts the scaled
40 samples.

 The residual signal is either weak or random and

consequently cheaper to encode and transmit.

www.final-yearprojects.co.cc 18
Residual Signal(…contd)
 The algorithm down-samples by a factor of three,
discarding two out of three sample values.

 Results in four evenly spaced 13-value subsequences to

choose from, starting with samples 1, 2, 3, and 4.

 The algorithm picks the sequence with the most energy.

 That leaves us with 13 3-bit sample values and a 6-bit

scaling factor that turns the PCM encoding into an
APCM

www.final-yearprojects.co.cc 19
Speech Decoder
 Decoder consists of three parts

 RPE Decoding

 LTP synthesis filter

 LPC short term synthesis filter

www.final-yearprojects.co.cc 20
Speech Decoder(…contd)

www.final-yearprojects.co.cc 21
Speech Decoder (…contd)
 Algorithm multiplies the 13 3-bit samples by the scaling factor and
expands them back into 40 samples, zero-padding the gaps

 Resulting residual pulse is fed to the long-term synthesis filter

 40-sample segment is cut from the old estimated short-term residual

signal, scaled by the LTP gain and added to the incoming pulse

 Estimated short-term residual signal passes through the short-term

synthesis filter whose reflection coefficients are calculated by the
LPC module

 Noise from the excited long-term synthesis filter passes through the
tubes of the simulated vocal tract--and emerges as speech

www.final-yearprojects.co.cc 22
QUESTIONS ???

www.final-yearprojects.co.cc 23

GSM Phy Part-2
No ratings yet
GSM Phy Part-2
13 pages
4: Speech Compression: Data Rates
No ratings yet
4: Speech Compression: Data Rates
14 pages
Speech Coders For Wireless Communication
No ratings yet
Speech Coders For Wireless Communication
53 pages
CELP
No ratings yet
CELP
23 pages
Human Speech Producing Organs: 2.4 Kbps
No ratings yet
Human Speech Producing Organs: 2.4 Kbps
108 pages
Unit2 1
No ratings yet
Unit2 1
23 pages
Speech Generation
No ratings yet
Speech Generation
11 pages
New Speech Coding Techniques: Mr. L.Ramesh Ap/Ece
No ratings yet
New Speech Coding Techniques: Mr. L.Ramesh Ap/Ece
24 pages
Audio Compression
No ratings yet
Audio Compression
81 pages
MMC Unit III-1
No ratings yet
MMC Unit III-1
122 pages
Dokumen - Tips Elec9344speech Audio Processing 4pdfspeech Signal For Digital Storage or Transmission
No ratings yet
Dokumen - Tips Elec9344speech Audio Processing 4pdfspeech Signal For Digital Storage or Transmission
87 pages
Wireless Networks Slides8
No ratings yet
Wireless Networks Slides8
23 pages
GSM Codecs
No ratings yet
GSM Codecs
6 pages
LPC Modeling: Unit 5 1.speech Compression
No ratings yet
LPC Modeling: Unit 5 1.speech Compression
13 pages
Speech Coding Techniques
No ratings yet
Speech Coding Techniques
38 pages
AN2197 - Implementing The Levinson-Durbin Algorithm On The StarCore SC140 - SC1400 Cores
No ratings yet
AN2197 - Implementing The Levinson-Durbin Algorithm On The StarCore SC140 - SC1400 Cores
24 pages
Digital Speech Processing
No ratings yet
Digital Speech Processing
18 pages
2720_Slides7
No ratings yet
2720_Slides7
18 pages
Data Transmission Over Speech Coded Voice Channels
No ratings yet
Data Transmission Over Speech Coded Voice Channels
81 pages
Speech and Audio Coding
No ratings yet
Speech and Audio Coding
16 pages
Speech Coder
No ratings yet
Speech Coder
20 pages
Speech Coding: Fundamentals and Applications: ARK Asegawa Ohnson
No ratings yet
Speech Coding: Fundamentals and Applications: ARK Asegawa Ohnson
20 pages
Speech Coding Journal
No ratings yet
Speech Coding Journal
20 pages
Speech Coding: Fundamentals and Applications: ARK Asegawa Ohnson
No ratings yet
Speech Coding: Fundamentals and Applications: ARK Asegawa Ohnson
20 pages
Unit 2 Wireless
No ratings yet
Unit 2 Wireless
159 pages
Adaptive Multi Rate Coder Using ACLP
No ratings yet
Adaptive Multi Rate Coder Using ACLP
45 pages
Vocoder
No ratings yet
Vocoder
72 pages
Speech and Audio Processing: Lecture-3
No ratings yet
Speech and Audio Processing: Lecture-3
20 pages
EE412/CS455 Principles of Digital Audio and Video
No ratings yet
EE412/CS455 Principles of Digital Audio and Video
71 pages
Speech Compression
No ratings yet
Speech Compression
15 pages
Wireless Chp7
No ratings yet
Wireless Chp7
13 pages
Low Bit Rate Speech Coding
No ratings yet
Low Bit Rate Speech Coding
165 pages
Nice
No ratings yet
Nice
15 pages
Bab 7 Multimedia Kompresi Audio
No ratings yet
Bab 7 Multimedia Kompresi Audio
52 pages
Code Excited Liner Predictive Coding
No ratings yet
Code Excited Liner Predictive Coding
9 pages
Lecture LPC
No ratings yet
Lecture LPC
7 pages
b18592958 PDF
No ratings yet
b18592958 PDF
104 pages
123
No ratings yet
123
23 pages
Speech Compression Techniques - Formant and CELP Vocoders
No ratings yet
Speech Compression Techniques - Formant and CELP Vocoders
41 pages
MELP Low Bit Rate Speech Coding Algorithm
No ratings yet
MELP Low Bit Rate Speech Coding Algorithm
5 pages
RELP
No ratings yet
RELP
13 pages
Speech Coding
100% (3)
Speech Coding
36 pages
Speech Processing Project
No ratings yet
Speech Processing Project
16 pages
Dolby Audio Coders
100% (3)
Dolby Audio Coders
17 pages
Multimedia Communications: Speech Compression
No ratings yet
Multimedia Communications: Speech Compression
26 pages
ch5.3 (Vocoders)
No ratings yet
ch5.3 (Vocoders)
23 pages
Audio and Video Compresssion
100% (1)
Audio and Video Compresssion
61 pages
Codificadores de Voz
No ratings yet
Codificadores de Voz
26 pages
5. Speech Coding Techniques
No ratings yet
5. Speech Coding Techniques
17 pages
IOSRJEN (WWW - Iosrjen.org) IOSR Journal of Engineering
No ratings yet
IOSRJEN (WWW - Iosrjen.org) IOSR Journal of Engineering
5 pages
2 - PCM & Delta Modulation
No ratings yet
2 - PCM & Delta Modulation
33 pages
Source and Channel Encoder and Decoder Modeling: S-72.333 Postgraduate Course in Radiocommunications Fall 2000
No ratings yet
Source and Channel Encoder and Decoder Modeling: S-72.333 Postgraduate Course in Radiocommunications Fall 2000
17 pages
Multi-Band Excitation Vocoder: RLE Technical Report No. 524
No ratings yet
Multi-Band Excitation Vocoder: RLE Technical Report No. 524
140 pages
dịch bt
No ratings yet
dịch bt
13 pages
Anais Aesbr2007
No ratings yet
Anais Aesbr2007
160 pages
dịch bt
No ratings yet
dịch bt
11 pages
Analog Dialogue, Volume 45, Number 4: Analog Dialogue, #4
From Everand
Analog Dialogue, Volume 45, Number 4: Analog Dialogue, #4
Analog Dialogue
No ratings yet
Pic® Micro Principles on Your Mobile
From Everand
Pic® Micro Principles on Your Mobile
Clive W. Humphris
No ratings yet
Learn the Pic® Micro on Your Smartphone
From Everand
Learn the Pic® Micro on Your Smartphone
Clive W. Humphris
No ratings yet
Pic® Micro Principles V11
From Everand
Pic® Micro Principles V11
Clive W. Humphris
No ratings yet
Emergence of SDN With IOT: Kareem Sharif PHDCSF18M501
No ratings yet
Emergence of SDN With IOT: Kareem Sharif PHDCSF18M501
31 pages
Megalith Jean Hiraga
No ratings yet
Megalith Jean Hiraga
5 pages
Basics of Uart Communication
No ratings yet
Basics of Uart Communication
7 pages
ANT-ADU4521R3v06 Datasheet (High Gain ANT)
No ratings yet
ANT-ADU4521R3v06 Datasheet (High Gain ANT)
2 pages
BJT Gummel Poon Model
No ratings yet
BJT Gummel Poon Model
11 pages
Multi Technology On Board Equipment: All-In-One
No ratings yet
Multi Technology On Board Equipment: All-In-One
2 pages
HOMEWORK TUẦN 30.3-5.4
No ratings yet
HOMEWORK TUẦN 30.3-5.4
3 pages
Training For FS SST VST VVT ATs Incharge
No ratings yet
Training For FS SST VST VVT ATs Incharge
86 pages
DH7508_Receiving_Card_Specifications-V1.0.3 (1)
No ratings yet
DH7508_Receiving_Card_Specifications-V1.0.3 (1)
7 pages
Avaya CDR Feature Description and Implementation
No ratings yet
Avaya CDR Feature Description and Implementation
20 pages
Huawei LTE RNP Introduction1
100% (1)
Huawei LTE RNP Introduction1
31 pages
Profile GF
No ratings yet
Profile GF
12 pages
Odom Echotrac MkII
No ratings yet
Odom Echotrac MkII
2 pages
Cmslab Manual - 1
No ratings yet
Cmslab Manual - 1
61 pages
Multi Pro CB Modifications
No ratings yet
Multi Pro CB Modifications
75 pages
FBT Pinouts
No ratings yet
FBT Pinouts
1 page
Enterprise VoIP Solutions With Alpine Linux - Slashroots 2011
No ratings yet
Enterprise VoIP Solutions With Alpine Linux - Slashroots 2011
47 pages
Circular No - 40 - 20-21 PDF
100% (1)
Circular No - 40 - 20-21 PDF
2 pages
Cisco 700-505 Exam Questions & Answers: Number: 700-505 Passing Score: 800 Time Limit: 120 Min File Version: 26.4
No ratings yet
Cisco 700-505 Exam Questions & Answers: Number: 700-505 Passing Score: 800 Time Limit: 120 Min File Version: 26.4
12 pages
4G Technology - : Magic Communication
No ratings yet
4G Technology - : Magic Communication
10 pages
ApolloMapping DEM Price List Product Spec
No ratings yet
ApolloMapping DEM Price List Product Spec
4 pages
Debug 2
No ratings yet
Debug 2
2 pages
MT6799 LTE-A Smartphone Application Processor Technical Brief V1.1
No ratings yet
MT6799 LTE-A Smartphone Application Processor Technical Brief V1.1
79 pages
Tugas 2 PDF
No ratings yet
Tugas 2 PDF
3 pages
Full Microwave Engineering 3rd Edition Annapurna Das Ebook All Chapters
100% (11)
Full Microwave Engineering 3rd Edition Annapurna Das Ebook All Chapters
70 pages
Maplin Electronics 1982-12
No ratings yet
Maplin Electronics 1982-12
68 pages
VeEX Product Catalog F00 FINAL PDF
No ratings yet
VeEX Product Catalog F00 FINAL PDF
12 pages
BR-4 GVF Idirect Installer Certification
No ratings yet
BR-4 GVF Idirect Installer Certification
2 pages
03 Dhi Asc2202c D
No ratings yet
03 Dhi Asc2202c D
2 pages
Zigbee Controlled Relay Long Range
No ratings yet
Zigbee Controlled Relay Long Range
2 pages

Speech Compression Using GSM

Uploaded by

Speech Compression Using GSM

Uploaded by

www.final-yearprojects.co.

 GSM service is used by over 2 billion people across more than

 The ubiquity of the GSM standard makes international roaming

 GSM differs significantly from its predecessors in that both

 Key principle: mathematical modeling of the human vocal tract,

 A vocoder (combination of voice and coder) is used to describe

 The sampling rate is 8000 sample/s leading to an average bit

 Vocoder sends three kinds of information to the receiver:

 Speech is sampled, stored and analyzed.

 Coefficients calculated from the sample are transmitted and

 Receiver accurately processes and categorizes voiced and

 Determines the period for voiced sounds, encodes periodicity and

 When the signal changes from voiced to unvoiced, RPE transmits a

 Starts generating random pulses to correspond to the noise like

 Four compression technologies are:

 Described as an RPE-LTP linear predictive coder.

 Models the human vocal tract as a series of cylinders of

 By forcing air through these cylinders, speech sounds

 The data is first passed through a pre-emphasis filter:

 Schur recursion is used to efficiently solve the set of

 The parameters are then converted into log-area ratios

 The reason for decoding the LARs is to ensure that the

 An array of weights lpc[P] is computed such that

 The 160 samples are split into 4 sub-windows of 40

 The LTP lag describes the source of the copy in time.

 The LTP gain describes the scaling factor.

 To remove the long-term predictable signal from

 The residual signal is either weak or random and

 Results in four evenly spaced 13-value subsequences to

 The algorithm picks the sequence with the most energy.

 That leaves us with 13 3-bit sample values and a 6-bit

 LTP synthesis filter

 LPC short term synthesis filter

 Resulting residual pulse is fed to the long-term synthesis filter

 40-sample segment is cut from the old estimated short-term residual

 Estimated short-term residual signal passes through the short-term

You might also like