Speech compression technique.docx

Speech compression is a technique that encodes speech signals to reduce redundancy and bandwidth requirements for transmission. The digitization of speech involves converting analog signals to digital through sampling, quantization, and coding. Various speech compression techniques include waveform coders and vocoders, which analyze and synthesize speech for efficient audio data handling.

Uploaded by sanjaylogesh14


Speech compression technique:

Speech compression is the technique of encoding a speech signal so that a small
set of speech parameters can represent the whole signal. In other words, it
eliminates the redundant features of speech and keeps only the important ones
for the next stage, speech reproduction.

The aim of speech compression is to reduce the number of bits required to
represent speech signals by removing redundant bits, so that less bandwidth
is required for transmission.

SPEECH SIGNAL DIGITIZATION

Speech signal digitization is the process of converting speech from an analog
signal to a digital signal for digital processing and transmission. The main
phases of speech signal digitization are sampling (Fig 1a) and quantization
and coding (Fig 1b).
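
The two digitization phases can be sketched in a few lines of Python. This is a minimal illustration, not the method of any particular codec: the 8 kHz sampling rate, the 8-bit uniform quantizer, the 440 Hz test tone and all function names are assumptions chosen for the example.

```python
import numpy as np

def sample_tone(freq_hz, sample_rate_hz, duration_s):
    """Sampling: evaluate a sine wave at the discrete instants n / sample_rate."""
    n = np.arange(int(round(sample_rate_hz * duration_s)))
    return np.sin(2 * np.pi * freq_hz * n / sample_rate_hz)

def quantize_uniform(x, bits):
    """Quantization and coding: map samples in [-1, 1] to integer codes 0 .. 2**bits - 1."""
    levels = 2 ** bits
    step = 2.0 / levels
    codes = np.floor((np.clip(x, -1.0, 1.0) + 1.0) / step).astype(int)
    return np.minimum(codes, levels - 1)  # x == 1.0 would otherwise overflow

def dequantize_uniform(codes, bits):
    """Decoder side: map each code back to the midpoint of its quantization interval."""
    step = 2.0 / (2 ** bits)
    return (codes + 0.5) * step - 1.0

fs = 8000    # assumed sampling rate in Hz
bits = 8     # assumed bits per sample
x = sample_tone(440.0, fs, 0.01)          # 10 ms of a test tone
codes = quantize_uniform(x, bits)
x_hat = dequantize_uniform(codes, bits)

bit_rate = fs * bits                       # bits per second before any compression
max_error = float(np.max(np.abs(x - x_hat)))
```

With these assumed values the uncompressed bit rate is 8000 × 8 = 64,000 bits per second, and the reconstruction error never exceeds half a quantization step. Compression techniques aim to go below this starting bit rate.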

HUMAN SPEECH PRODUCTION

The production of speech is a natural human ability. Fig 2 shows a conceptual
diagram of the physical model of human speech production. When we speak, air
from the lungs is pushed through the vocal tract and out of the mouth to
produce a sound. Speech compression, especially low bit rate speech
compression, exploits the nature of the human speech production mechanism.
This section briefly explains how human speech is produced.

Fig 2: Conceptual diagram of human speech production

A schematic diagram of the human speech production mechanism

Speech production is the process by which thoughts are translated into speech. This
includes the selection of words, the organization of relevant grammatical forms,
and then the articulation of the resulting sounds by the motor system using the
vocal apparatus.

Block diagram of human speech production

a) Voiced sound: For some sounds, for example the vowel sounds ‘a’, ‘i’ and
‘u’, the vocal cords vibrate (open and close) at a rate called the
fundamental frequency or pitch frequency, and the produced speech
samples show a quasi-periodic pattern.
b) Unvoiced sound: For other sounds, for example certain fricatives such as
‘s’ and ‘f’ and plosives such as ‘p’, ‘t’ and ‘k’, the vocal cords do not
vibrate and remain open during sound production.

Note: The waveform of unvoiced sound is more like noise.
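
The difference between the two sound types can be illustrated with a rough sketch. It uses synthetic frames rather than recorded speech: a quasi-periodic frame built from harmonics of an assumed 120 Hz pitch, and a white-noise frame standing in for an unvoiced sound. The zero-crossing threshold of 0.25, the 50 to 400 Hz pitch search range and the function names are assumptions for illustration only.

```python
import numpy as np

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    signs = np.sign(frame)
    signs[signs == 0] = 1
    return float(np.mean(signs[1:] != signs[:-1]))

def classify_frame(frame, zcr_threshold=0.25):
    """Noise-like frames cross zero often; quasi-periodic frames rarely do."""
    return "unvoiced" if zero_crossing_rate(frame) > zcr_threshold else "voiced"

def estimate_pitch_hz(frame, sample_rate_hz, fmin_hz=50.0, fmax_hz=400.0):
    """Pick the autocorrelation peak inside the assumed pitch range."""
    frame = frame - np.mean(frame)
    acf = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    lag_min = int(sample_rate_hz / fmax_hz)
    lag_max = int(sample_rate_hz / fmin_hz)
    best_lag = lag_min + int(np.argmax(acf[lag_min:lag_max + 1]))
    return sample_rate_hz / best_lag

fs = 8000                      # assumed sampling rate in Hz
n = np.arange(240)             # one 30 ms frame
# Synthetic voiced frame: five harmonics of a 120 Hz fundamental.
voiced = sum(np.sin(2 * np.pi * k * 120.0 * n / fs) / k for k in range(1, 6))
# Synthetic unvoiced frame: white noise from a seeded generator.
unvoiced = np.random.default_rng(0).standard_normal(240)

voiced_label = classify_frame(voiced)
unvoiced_label = classify_frame(unvoiced)
pitch_hz = estimate_pitch_hz(voiced, fs)
```

Real speech is less clean than these synthetic frames, so practical classifiers usually combine several features (for example frame energy together with the zero-crossing rate) instead of a single threshold.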

SPEECH COMPRESSION TECHNIQUES
There are two types of speech compression techniques:
1) Waveform coders
Time domain:
i) Pulse-code modulation (PCM)
ii) Adaptive differential pulse-code modulation (ADPCM)
Frequency domain:
i) Sub-band coding (SBC)
ii) Adaptive transform coding
2) Vocoders: A vocoder is a category of voice codec that analyzes and synthesizes
the human voice signal for audio data compression, multiplexing, voice
encryption, voice transformation, etc. The vocoder was originally designed to
reduce the channel bandwidth in telecommunication.
i) Linear predictive coders (LPC)
ii) Formant synthesis
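
As an illustration of the time-domain waveform coders, the sketch below shows plain differential PCM, the idea that ADPCM builds on. It is deliberately simplified: the step size is fixed, whereas ADPCM adapts it, and the predictor is just the previously reconstructed sample. The step size, the 200 Hz test tone and the function names are assumptions for the example.

```python
import numpy as np

def dpcm_encode(samples, step):
    """Quantize the difference between each sample and the decoder's prediction."""
    codes = np.empty(len(samples), dtype=int)
    prediction = 0.0
    for i, s in enumerate(samples):
        code = int(np.round((s - prediction) / step))
        codes[i] = code
        prediction += code * step   # track exactly what the decoder will rebuild
    return codes

def dpcm_decode(codes, step):
    """Rebuild the signal by accumulating the quantized differences."""
    return np.cumsum(codes * step)

fs = 8000                                   # assumed sampling rate in Hz
n = np.arange(400)
x = np.sin(2 * np.pi * 200.0 * n / fs)      # slowly varying test tone
step = 1.0 / 128                            # same resolution as 8-bit PCM on [-1, 1]

codes = dpcm_encode(x, step)
x_hat = dpcm_decode(codes, step)
max_code = int(np.max(np.abs(codes)))
max_error = float(np.max(np.abs(x - x_hat)))
```

Because neighbouring samples are similar, the difference codes in this example stay within about ±21, which fits in 6 bits, while coding the samples directly at the same step size needs 8 bits. The reconstruction error stays within half a step because the encoder predicts from the reconstructed signal, not from the original one.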

Vocoder:
A vocoder is an electronic mechanism that reduces speech signals to slowly
varying signals that can be transmitted over communication systems of limited
frequency bandwidth. Vocoders are also used in television production,
filmmaking and games, usually for the voices of robots or talking computers.

The original vocoder derived certain parameters from a speech wave, and these
parameters were then used to control a synthesizer that reproduced the speech.
To paraphrase Dudley, vocoders could bring the advantages of more secure
communications and a greater number of telephone channels in the same
frequency space.
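
The analyze-then-synthesize idea can be sketched with linear prediction, the technique behind the LPC vocoders listed earlier. This is not Dudley's design. It is a minimal sketch that assumes the autocorrelation method with the Levinson-Durbin recursion, and it uses a synthetic second-order test signal instead of real speech. All names and values are illustrative.

```python
import numpy as np

def lpc_coefficients(frame, order):
    """Autocorrelation method, Levinson-Durbin recursion.
    Returns a (a[0] = 1) so that the residual is e[n] = sum_k a[k] * x[n - k]."""
    r = np.array([np.dot(frame[:len(frame) - k], frame[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err                      # reflection coefficient
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k                  # remaining prediction error energy
    return a

def all_pole_synthesis(excitation, a):
    """Synthesizer: y[n] = excitation[n] - sum_{k>=1} a[k] * y[n - k]."""
    y = np.zeros(len(excitation))
    for n in range(len(excitation)):
        acc = excitation[n]
        for k in range(1, len(a)):
            if n - k >= 0:
                acc -= a[k] * y[n - k]
        y[n] = acc
    return y

# Synthetic test signal: white noise shaped by a known all-pole filter.
true_a = np.array([1.0, -1.3, 0.6])
noise = np.random.default_rng(0).standard_normal(4000)
x = all_pole_synthesis(noise, true_a)

a_est = lpc_coefficients(x, order=2)             # analysis: derive the parameters
residual = np.convolve(x, a_est)[:len(x)]        # what the predictor cannot explain
x_rec = all_pole_synthesis(residual, a_est)      # synthesis from parameters + residual
```

Here the residual carries much less energy than the signal, and feeding it back through the synthesizer recovers the input. A vocoder goes further and does not send the residual at all: it sends only the filter coefficients plus a few excitation parameters such as gain, pitch and a voiced/unvoiced flag, and the receiver generates its own excitation.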

