Multimedia Systems Chapter 6
Chapter Six
Basics of Digital Audio
Audio information is crucial for multimedia presentations and, in a sense, is the
simplest type of multimedia data. However, some important differences
between audio and image information cannot be ignored. For example, while it
is customary and useful to occasionally drop a video frame from a video stream,
to facilitate viewing speed, we simply cannot do the same with sound
information or all sense will be lost from that dimension.
6.1 Digitization of Sound
What Is Sound?
Sound is a wave phenomenon like light, but it is macroscopic and involves
molecules of air being compressed and expanded under the action of some
physical device. For example,
a speaker in an audio system vibrates back and forth and produces a
longitudinal pressure wave that we perceive as sound.
Without air there is no sound - in space, for example. Since sound is a pressure
wave, it takes on continuous values, as opposed to digitized ones, which are
restricted to a finite set of discrete values. Nevertheless, if we wish to use a
digital version of sound waves, we must form digitized representations of audio
information.
A plot of a sound wave shows its one-dimensional nature: values change over time
in amplitude - the pressure increases or decreases with time. The amplitude
value is a continuous quantity. Since we are interested in working with such
data in computer storage, we must digitize the analog signals (i.e.,
continuous-valued voltages) produced by microphones.
For video, we must likewise digitize the time-dependent analog signals
produced by typical video cameras. Digitization means conversion to a stream
of numbers - preferably integers for efficiency.
Since such a graph of amplitude versus time is two-dimensional, to fully digitize
the signal shown we have to sample in each dimension - in time and in amplitude.
Sampling means measuring the quantity we are interested in, usually at evenly
spaced intervals. The first kind of sampling - using measurements only at evenly
spaced time intervals - is simply called sampling (surprisingly), and the rate at
which it is performed is called the sampling frequency. The second kind -
restricting the amplitude to a set of discrete levels - is called quantization.
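As a minimal sketch of these two steps, the following Python fragment samples a
440 Hz sine tone at 8 kHz and quantizes each sample to 8 bits; the rate, tone
frequency, and bit depth here are illustrative choices, not values fixed by the text.

```python
import math

SAMPLE_RATE = 8000   # samples per second (time-axis sampling)
BITS = 8             # bits per sample (amplitude-axis quantization)
FREQ = 440.0         # tone frequency in Hz (illustrative)

LEVELS = 2 ** BITS   # number of discrete amplitude levels

def sample_and_quantize(duration_s):
    """Sample a sine tone at SAMPLE_RATE, then quantize to BITS bits."""
    n_samples = int(duration_s * SAMPLE_RATE)
    out = []
    for n in range(n_samples):
        t = n / SAMPLE_RATE                   # evenly spaced time instants
        x = math.sin(2 * math.pi * FREQ * t)  # continuous amplitude in [-1, 1]
        # Map [-1, 1] onto integer levels 0 .. LEVELS-1 (uniform quantization).
        q = round((x + 1) / 2 * (LEVELS - 1))
        out.append(q)
    return out

print(sample_and_quantize(0.001))  # the first 8 quantized samples of the tone
```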
For audio, typical sampling rates are from 8 kHz (8,000 samples per second) to
48 kHz. The human ear can hear from about 20 Hz (a very deep rumble) up to as
much as 20 kHz; above this frequency we enter the range of ultrasound. The human
voice can reach approximately 4 kHz, and we need to bound our sampling rate
from below by at least double this frequency (see the discussion of the Nyquist
sampling rate, below). Thus we arrive at the useful range of about 8 to 40 or so
kHz.
Nyquist Theorem
Signals can be decomposed into a sum of sinusoids, if we are willing to use
enough sinusoids; appropriately weighted sinusoids can build up quite a complex
signal. Whereas frequency is an absolute measure, pitch is a perceptual,
subjective quality of sound - generally, pitch is relative.
The Nyquist theorem states that, for lossless digitization, the sampling rate
should be at least twice the maximum frequency occurring in the signal; this
minimum rate is called the Nyquist rate. If we sample more slowly, frequencies
above half the sampling rate show up as false lower frequencies, called aliases.
Note that the true frequency and its alias are located symmetrically on the
frequency axis with respect to the Nyquist frequency pertaining to the sampling
rate used. For this reason, the Nyquist frequency associated with the sampling
frequency is often called the "folding" frequency. That is to say, if the sampling
frequency is less than twice the true frequency but greater than the true
frequency itself, then the alias frequency equals the sampling frequency minus
the true frequency.
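A small numeric check of this folding relation (a sketch; the specific
frequencies are illustrative): sampling a 6 kHz tone at 8 kHz should make it
indistinguishable from a 2 kHz tone, since 8 - 6 = 2.

```python
import math

fs = 8000.0            # sampling rate (Hz); folding frequency = fs / 2 = 4000 Hz
f_true = 6000.0        # true frequency, above the folding frequency
f_alias = fs - f_true  # predicted alias = 2000 Hz, mirrored about 4000 Hz

# Sampled at the same instants, the two cosines produce identical sample values,
# so the 6 kHz tone is indistinguishable from its 2 kHz alias.
for n in range(8):
    t = n / fs
    print(round(math.cos(2 * math.pi * f_true * t), 6),
          round(math.cos(2 * math.pi * f_alias * t), 6))
```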
Coding of Audio
Quantization and transformation of data are collectively known as coding of
the data. For audio, the μ-law technique for companding audio signals is
usually combined with a simple algorithm that exploits the temporal
redundancy present in audio signals.
Differences in signals between the present and a previous time can effectively
reduce the size of signal values and, most important, concentrate the histogram
of sample values (now differences) into a much smaller range.
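The μ-law compander itself is a fixed formula, with μ = 255 in the North
American/Japanese standard. The sketch below applies the formula to a
normalized sample in [-1, 1]; it shows only the companding step, not the
subsequent quantization.

```python
import math

MU = 255.0  # mu value used in the North American / Japanese standard

def mu_law_encode(x):
    """Compress a normalized sample x in [-1, 1]: boosts small amplitudes."""
    return math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)

def mu_law_decode(y):
    """Invert the compression: expand y in [-1, 1] back to a linear sample."""
    return math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y)

x = 0.1
y = mu_law_encode(x)
print(y, mu_law_decode(y))  # a small amplitude maps to a much larger code value
```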
DPCM
Differential Pulse Code Modulation is exactly the same as Predictive Coding,
except that it incorporates a quantizer step. Quantization is as in PCM and can
be uniform or nonuniform. One scheme for analytically determining the best
set of nonuniform quantizer steps is the Lloyd-Max quantizer, named for Stuart
Lloyd and Joel Max, which is based on a least squares minimization of the
error term.
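A minimal DPCM sketch follows, assuming the simplest predictor (the previous
reconstructed sample) and a uniform quantizer; real coders use better predictors
and Lloyd-Max style nonuniform quantizers. Note that the encoder predicts from
the reconstructed signal, exactly as the decoder will, so the two stay in step.

```python
STEP = 4  # uniform quantizer step size (illustrative)

def quantize(d):
    """Uniform quantization of a difference value."""
    return STEP * round(d / STEP)

def dpcm_encode(samples):
    """Transmit quantized differences from the predicted (previous reconstructed) sample."""
    prev = 0      # predictor state, identical on the decoder side
    codes = []
    for s in samples:
        diff = quantize(s - prev)  # quantize the prediction error, not the sample
        codes.append(diff)
        prev += diff               # reconstruct exactly as the decoder will
    return codes

def dpcm_decode(codes):
    prev = 0
    out = []
    for diff in codes:
        prev += diff
        out.append(prev)
    return out

samples = [0, 3, 8, 14, 18, 19, 17, 12]
codes = dpcm_encode(samples)
print(codes, dpcm_decode(codes))
```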
DM
DM stands for Delta Modulation, a much-simplified version of DPCM often
used as a quick analog-to-digital converter.
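As a sketch, DM reduces the quantizer to a single bit per sample with a fixed
step (the step size below is illustrative): the coder outputs 1 when the input
is above its running estimate and 0 otherwise.

```python
STEP = 2  # fixed step; too small causes slope overload, too large causes granular noise

def dm_encode(samples):
    """One bit per sample: raise the estimate by STEP on a 1, lower it on a 0."""
    estimate = 0
    bits = []
    for s in samples:
        bit = 1 if s > estimate else 0
        bits.append(bit)
        estimate += STEP if bit else -STEP
    return bits

print(dm_encode([0, 1, 3, 6, 8, 8, 7, 5]))
```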
ADPCM
Adaptive DPCM takes the idea of adapting the coder to suit the input much
further. Basically, two pieces make up a DPCM coder: the quantizer and the
predictor. Above, in Adaptive DM,we adapted the quantizer step size to suit the
input. In DPCM, we can adaptively modify the quantizer, by changing the step
size as well as decision boundaries in a nonuniform quantizer.
We can carry this out in two ways: using the properties of the input signal
(called forward adaptive quantization), or using the properties of the quantized
output - if quantized errors become too large, we should change the
nonuniform Lloyd-Max quantizer (this is called backward adaptive
quantization).
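One common backward-adaptive scheme adapts the step size from the transmitted
codes themselves, so the decoder can track it without side information. The
sketch below applies this idea to delta modulation; the grow/shrink multipliers
are illustrative, not taken from any particular standard.

```python
def adaptive_dm_encode(samples, step=2.0, grow=1.5, shrink=0.66):
    """Backward-adaptive DM: widen the step after two equal bits (the signal is
    outrunning the estimate), shrink it after alternating bits (the estimate is
    oscillating). The decoder applies the same rule to the received bits."""
    estimate, prev_bit, bits = 0.0, None, []
    for s in samples:
        bit = 1 if s > estimate else 0
        bits.append(bit)
        estimate += step if bit else -step
        if prev_bit is not None:
            step *= grow if bit == prev_bit else shrink
        prev_bit = bit
    return bits

print(adaptive_dm_encode([0, 2, 6, 14, 20, 21, 20, 18]))
```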
MIDI Overview
MIDI, which dates from the early 1980s, is an acronym that stands for Musical
Instrument Digital Interface. It forms a protocol adopted by the electronic
music industry that enables computers, synthesizers, keyboards, and other
musical devices to communicate with each other. A synthesizer produces
synthetic music and is included on sound cards, using one of the two methods
discussed above. The MIDI standard is supported by most synthesizers, so
sounds created on one can be played and manipulated on another and sound
reasonably close. Computers must have a special MIDI interface, but this is
incorporated into most sound cards. The sound card must also have both DA
and AD converters. MIDI is a scripting language - it codes "events" that stand
for the production of certain sounds. Therefore, MIDI files are generally very
small. For example, a MIDI event might include values for the pitch of a single
note, its duration, and its volume.
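For instance, a Note On event on the wire is just three bytes: a status byte
(0x90 plus the channel number), a note number, and a velocity. The sketch below
only constructs those bytes; it does not talk to an actual MIDI device.

```python
def note_on(channel, note, velocity):
    """Build a 3-byte MIDI Note On message.
    channel: 0-15, note: 0-127 (60 = middle C), velocity: 0-127 (loudness)."""
    return bytes([0x90 | (channel & 0x0F), note & 0x7F, velocity & 0x7F])

def note_off(channel, note):
    """Note Off (status 0x80): ends the note started by Note On."""
    return bytes([0x80 | (channel & 0x0F), note & 0x7F, 0x40])

# Middle C on channel 0 at moderate loudness: hex bytes 90 3c 64 / 80 3c 40.
print(note_on(0, 60, 100).hex(), note_off(0, 60).hex())
```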