
Speech compression technique.docx

Speech compression is a technique that encodes speech signals to reduce redundancy and bandwidth requirements for transmission. The digitization of speech involves converting analog signals to digital through sampling, quantization, and coding. Various speech compression techniques include waveform coders and vocoders, which analyze and synthesize speech for efficient audio data handling.

Uploaded by

sanjaylogesh14
Copyright
© All Rights Reserved

Speech compression technique:

Speech compression is the technique of encoding a speech signal so that a
compact set of speech parameters represents the whole signal. In other words,
it removes the redundant features of speech and keeps only those needed to
reproduce the speech at the next stage.

The aim of speech compression is to reduce the number of bits required to
represent a speech signal by removing redundant bits, so that less bandwidth
is required for transmission.

SPEECH SIGNAL DIGITIZATION


Speech signal digitization is the process of converting speech from an analog
signal to a digital signal so that it can be processed and transmitted
digitally. The main phases of speech signal digitization are sampling, shown in
Fig 1a, and quantization and coding, shown in Fig 1b.
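
The three phases above can be illustrated with a minimal Python sketch. The 8 kHz sampling rate, the 8-bit uniform quantizer and the 440 Hz test tone are assumptions chosen for illustration (the source does not specify them), and all function names are hypothetical.

```python
import math

def sample(signal, duration_s, fs_hz):
    """Sampling: evaluate a continuous-time signal (a function of t in seconds) at fs_hz."""
    n = round(duration_s * fs_hz)
    return [signal(i / fs_hz) for i in range(n)]

def quantize(x, bits):
    """Quantization: map x in [-1.0, 1.0) to one of 2**bits uniform levels."""
    levels = 2 ** bits
    index = int((x + 1.0) / 2.0 * levels)
    return min(max(index, 0), levels - 1)  # clamp out-of-range inputs

def encode(index, bits):
    """Coding: write a quantization level as a fixed-length binary word."""
    return format(index, "0{}b".format(bits))

def dequantize(index, bits):
    """Decoder side: map a level back to the midpoint of its interval."""
    levels = 2 ** bits
    return (index + 0.5) / levels * 2.0 - 1.0

FS_HZ = 8000  # assumed sampling rate
BITS = 8      # assumed word length

def tone(t):
    """Assumed test signal: a 440 Hz sine standing in for speech."""
    return 0.8 * math.sin(2 * math.pi * 440 * t)

samples = sample(tone, 0.01, FS_HZ)
codes = [encode(quantize(s, BITS), BITS) for s in samples]
decoded = [dequantize(int(c, 2), BITS) for c in codes]

# Quantization error is bounded by half a step for in-range inputs.
max_error = max(abs(a - b) for a, b in zip(samples, decoded))
bit_rate = FS_HZ * BITS  # bits per second before any compression
```

Under these assumptions the uncompressed bit rate is simply the sampling rate times the word length, which is the figure a speech compression scheme then tries to reduce.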

HUMAN SPEECH PRODUCTION


Speech is produced naturally as a person exhales. Fig 2 shows a conceptual
diagram of the physical model of human speech production. When we speak, air
from the lungs is pushed through the vocal tract and out of the mouth to
produce a sound. Speech compression, especially at low bit rates, exploits the
nature of the human speech production mechanism. This section briefly explains
how human speech is produced.

Fig 2: Conceptual diagram of human speech production


A schematic diagram of the human speech production mechanism

Speech production is the process by which thoughts are translated into speech. This
includes the selection of words, the organization of relevant grammatical forms,
and then the articulation of the resulting sounds by the motor system using the
vocal apparatus.

Block diagram of human speech production

a) Voiced sound: For some sounds, for example the vowel sounds ‘a’, ‘i’ and
‘μ’, the vocal cords vibrate (open and close) at a certain rate (the
fundamental frequency, or pitch frequency), and the produced speech samples
show a quasi-periodic pattern.
b) Unvoiced sound: For other sounds, called unvoiced sounds (e.g., certain
fricatives such as ‘s’ and ‘f’, and plosives such as ‘p’, ‘t’ and ‘k’), the
vocal cords do not vibrate and remain open during sound production.

Note: The waveform of unvoiced sound is more like noise.
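
The difference between a quasi-periodic voiced frame and a noise-like unvoiced frame can be sketched in Python as follows. This is an illustration only: the frames are synthetic stand-ins (a few harmonics of an assumed 100 Hz pitch, and Gaussian noise), the 0.5 decision threshold is an assumption, and all function names are hypothetical.

```python
import math
import random

FS_HZ = 8000  # assumed sampling rate
FRAME = 400   # 50 ms frame at 8 kHz

def voiced_frame(pitch_hz=100.0):
    """Synthetic stand-in for a vowel: three harmonics of the pitch frequency."""
    harmonics = ((1, 1.0), (2, 0.5), (3, 0.25))
    return [sum(a * math.sin(2 * math.pi * k * pitch_hz * n / FS_HZ)
                for k, a in harmonics)
            for n in range(FRAME)]

def unvoiced_frame(seed=0):
    """Synthetic stand-in for a fricative: white Gaussian noise."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(FRAME)]

def zero_crossing_rate(x):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(x, x[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(x) - 1)

def autocorr_peak(x, min_lag=20, max_lag=160):
    """Return (lag, value) of the largest normalized autocorrelation in the lag range."""
    energy = sum(v * v for v in x)
    best = (min_lag, -1.0)
    for lag in range(min_lag, max_lag + 1):
        r = sum(x[n] * x[n + lag] for n in range(len(x) - lag)) / energy
        if r > best[1]:
            best = (lag, r)
    return best

def is_voiced(x, threshold=0.5):
    """Call a frame voiced if it correlates strongly with a delayed copy of itself."""
    return autocorr_peak(x)[1] > threshold

voiced = voiced_frame()
unvoiced = unvoiced_frame()
pitch_lag, _ = autocorr_peak(voiced)
pitch_estimate_hz = FS_HZ / pitch_lag  # lag of the peak gives the pitch period
```

The voiced frame repeats once per pitch period, so its autocorrelation has a strong peak at that lag, while the noise-like frame has no such peak and crosses zero far more often.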


SPEECH COMPRESSION TECHNIQUES
There are two types of speech compression techniques:
1) Waveform coders
Time domain
i) Pulse-code modulation (PCM)
ii) Adaptive differential pulse-code modulation (ADPCM)
Frequency domain
a) Sub-band coding (SBC)
b) Adaptive transform coding (ATC)
2) Vocoders: A vocoder is a category of voice codec that analyzes and
synthesizes the human voice signal for audio data compression, multiplexing,
voice encryption, voice transformation, etc. The vocoder was originally designed
to reduce the channel bandwidth required in telecommunication.
i) Linear predictive coders (LPC)
ii) Formant synthesis
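
As an illustration of the time-domain waveform coders listed above, the following Python sketch implements plain differential PCM (DPCM) with a fixed step size. It is a simplified relative of ADPCM, not ADPCM itself, since ADPCM also adapts the step size and predictor. The step size, the 4-bit code width and the test signal are assumptions, and the function names are hypothetical.

```python
import math

def dpcm_encode(samples, step=4, bits=4):
    """Encode integer PCM samples as quantized differences from a running prediction."""
    lo, hi = -(2 ** (bits - 1)), 2 ** (bits - 1) - 1
    prediction = 0
    codes = []
    for s in samples:
        diff = s - prediction
        code = max(lo, min(hi, round(diff / step)))  # quantize and clamp the difference
        codes.append(code)
        prediction += code * step  # mirror the decoder's reconstruction
    return codes

def dpcm_decode(codes, step=4):
    """Rebuild samples by accumulating the dequantized differences."""
    prediction = 0
    out = []
    for code in codes:
        prediction += code * step
        out.append(prediction)
    return out

# Assumed test input: a slowly varying 200 Hz sine sampled at 8 kHz.
samples = [round(100 * math.sin(2 * math.pi * 200 * n / 8000)) for n in range(80)]
codes = dpcm_encode(samples)
decoded = dpcm_decode(codes)
max_error = max(abs(a - b) for a, b in zip(samples, decoded))
```

Because adjacent speech samples are strongly correlated, the differences span a much smaller range than the samples themselves. Here each difference fits in 4 bits, whereas the samples would need 8 bits in plain PCM.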

Vocoder:
A vocoder is an electronic mechanism that reduces speech signals to slowly
varying signals that can be transmitted over communication systems of limited
frequency bandwidth. Vocoders are also used in television production,
filmmaking and games, usually for the voices of robots or talking computers.

The vocoder derives certain parameters from a speech wave, and these parameters
are then used to control a synthesizer that reproduces the speech. To paraphrase
Dudley, vocoders could offer more secure communications and a greater number of
telephone channels in the same frequency space.
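
The analyze-then-synthesize idea can be sketched with linear prediction, one of the vocoder types listed earlier. This is a minimal Python illustration under stated assumptions, not a complete LPC coder: the "speech" is a synthetic second-order all-pole signal with assumed coefficients, the predictor order and the pulse-train excitation are assumptions, and all function names are hypothetical.

```python
import random

def autocorrelation(x, max_lag):
    """Autocorrelation of x for lags 0..max_lag."""
    return [sum(x[n] * x[n + lag] for n in range(len(x) - lag))
            for lag in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Solve for predictor coefficients a[1..order] from autocorrelation r.

    Convention: the prediction of x[n] is sum(a[k] * x[n - k]) for k = 1..order.
    Returns (coefficients, residual prediction error energy).
    """
    a = [0.0] * (order + 1)  # a[0] is unused
    error = r[0]
    for i in range(1, order + 1):
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / error  # reflection coefficient
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        error *= 1.0 - k * k
    return a[1:], error

def lpc_analyze(frame, order):
    """Analysis: reduce a frame of samples to a few predictor coefficients."""
    return levinson_durbin(autocorrelation(frame, order), order)

def synthesize(coeffs, excitation):
    """Synthesis: drive the all-pole filter defined by coeffs with an excitation."""
    out = []
    for n, e in enumerate(excitation):
        pred = sum(c * out[n - k]
                   for k, c in enumerate(coeffs, start=1) if n - k >= 0)
        out.append(e + pred)
    return out

# Assumed "speech": noise passed through a known, stable two-pole filter.
TRUE_COEFFS = [1.3, -0.6]
rng = random.Random(1)
noise = [rng.gauss(0.0, 1.0) for _ in range(4000)]
frame = synthesize(TRUE_COEFFS, noise)

# Analysis side: only these few parameters would need to be transmitted.
coeffs, residual_energy = lpc_analyze(frame, order=2)

# Synthesis side: rebuild a signal from the parameters and a pulse-train excitation.
pulses = [1.0 if n % 80 == 0 else 0.0 for n in range(400)]
resynth = synthesize(coeffs, pulses)
```

In this sketch a 4000-sample frame is summarized by two coefficients plus a description of the excitation, which is the sense in which a vocoder trades waveform fidelity for a much lower bit rate.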
