
Digital audio processing revisited

Juan P Bello
Digital audio processing
Microphones
• Sound is an energy disturbance that propagates through a
medium as a wave
• Commonly, the medium is air, thus the sound wave produces
variations of air pressure
• A microphone is a transducer (i.e. a device that converts
energy or information from one form to another).
• Specifically, the microphone converts air pressure into voltage
levels, thus generating an electrical signal analogous to the
mechanical one.
• The following expression notates the relationship between
voltage and pressure in a microphone, where the symbol ∝
means “is proportional to”: $v(t) \propto p(t)$
ADC
• The conversion of an analog (continuous) signal x(t) into a
discrete sequence of numbers x(n) is performed by an Analog-to-
digital Converter (ADC)
• The ADC samples the amplitude of the analog signal at regular
intervals in time, and encodes (quantizes) those values as binary
numbers.
• The regular time intervals are known as the sampling period (Ts)
and are determined by the ADC clock.
• This period defines the frequency at which the sampling will be
done, such that the sampling frequency (in Hertz) is:
$f_s = \frac{1}{T_s}$
• The accuracy of the quantization depends on the number of bits
used to encode each amplitude value from the analog signal.

ADC

• The outgoing sequence x(n) is a discrete-time signal with
quantized amplitude
• Each element of the sequence is referred to as a sample.

$\ldots, x[n-1], x[n], x[n+1], \ldots$

Discrete signals
• An example discrete signal is a real sinusoid, which can be
described as:
$x[n] = a \cos(\omega n + \phi)$
• where a is the amplitude, ω the angular frequency, and φ the
initial phase. At sample number n, the phase is equal to φ + ωn.
• A sinusoid is an example of simple harmonic motion.
• Because each cycle is completed in a constant amount of time,
the motion of the wave is periodic, i.e. there is a T > 0 that
satisfies the equation:

$f(n) = f(n + T), \quad 0 < T < \infty$

• The number of cycles completed per second is the frequency of
the wave, and the inverse of the frequency is its period.
Discrete signals

• A sine and a cosine differ only by a phase difference
of a quarter cycle (π/2)
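The sinusoid definition and the quarter-cycle relation can be checked numerically. The sketch below is only an illustration: the 1 kHz tone, the 8 kHz sampling rate, and the function name `sinusoid` are assumptions, and it uses the standard relation ω = 2πf/fs to turn a frequency in Hz into radians per sample.

```python
import math

def sinusoid(a, omega, phi, num_samples):
    """Return x[n] = a*cos(omega*n + phi) for n = 0 .. num_samples-1."""
    return [a * math.cos(omega * n + phi) for n in range(num_samples)]

# Assumed illustration values: a 1 kHz tone sampled at 8 kHz.
fs = 8000.0
f = 1000.0
omega = 2 * math.pi * f / fs          # angular frequency in radians per sample

cosine = sinusoid(1.0, omega, 0.0, 16)
# A cosine with an initial phase of -pi/2 (a quarter cycle) is a sine.
sine_from_cosine = sinusoid(1.0, omega, -math.pi / 2, 16)
sine = [math.sin(omega * n) for n in range(16)]

# With these values one cycle takes fs / f = 8 samples.
period_in_samples = round(fs / f)
```

Here `cosine[n]` and `cosine[n + period_in_samples]` agree up to rounding error, which is the periodicity condition for this particular tone.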
The sampling theorem
• Sampling is the process of converting a continuous signal into a
discrete sequence
• Our intuition tells us that we will lose information in the process
• However, this is not necessarily the case, and the sampling
theorem formalizes this fact
• It states that “in order to be able to reconstruct a bandlimited
signal, the sampling frequency must be at least twice the highest
frequency of the signal being sampled” (Nyquist, 1928)

The Nyquist frequency

Aliasing
• What happens when fs < 2B, where B is the highest frequency in the signal?
• There is another, lower-frequency signal that shares samples with
the original signal (an alias).

• Related to the wagon-wheel effect:

http://www.michaelbach.de/ot/mot_strob/index.html
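A small numerical sketch makes the shared samples concrete. The values are assumptions chosen for illustration: a 5 kHz cosine sampled at 8 kHz (below the required 10 kHz) produces the same samples as a 3 kHz cosine. The helper names are hypothetical.

```python
import math

def sample_cosine(freq_hz, fs_hz, num_samples):
    """Sample cos(2*pi*f*t) at t = n / fs for n = 0 .. num_samples-1."""
    return [math.cos(2 * math.pi * freq_hz * n / fs_hz) for n in range(num_samples)]

def alias_frequency(freq_hz, fs_hz):
    """Frequency in [0, fs/2] whose cosine shares its samples with freq_hz."""
    folded = freq_hz % fs_hz
    return min(folded, fs_hz - folded)

fs = 8000.0                                   # assumed sampling rate
original = sample_cosine(5000.0, fs, 32)      # 5 kHz exceeds fs / 2 = 4 kHz
alias = sample_cosine(alias_frequency(5000.0, fs), fs, 32)   # 3 kHz
```

The two lists agree sample by sample (up to rounding), so after sampling the 5 kHz tone cannot be told apart from its 3 kHz alias.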

Anti-aliasing
[Figure: low-pass filter (LPF)]
Hearing frequency range

• Human hearing is widely accepted to lie in the 20 Hz to 20 kHz range
• This is the main reason why the standard sampling frequencies are
44.1 kHz and 48 kHz
• In digital synthesis we then have to be careful not to exceed the
Nyquist frequency
Loudness

• dB = 10 * log10(level / reference level), for levels of intensity or power

• Reference level = 0 dB = $10^{-12}$ watts per square meter (threshold of
hearing)
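As a worked example of the decibel formula, the sketch below converts an intensity in watts per square meter to dB relative to the threshold of hearing. The function and constant names are hypothetical.

```python
import math

REFERENCE_INTENSITY = 1e-12   # W/m^2, threshold of hearing (0 dB)

def intensity_to_db(intensity):
    """Level in dB of an intensity (W/m^2) relative to the reference level."""
    return 10.0 * math.log10(intensity / REFERENCE_INTENSITY)

threshold_db = intensity_to_db(1e-12)   # the reference itself: 0 dB
one_watt_db = intensity_to_db(1.0)      # 12 powers of ten above it: about 120 dB
doubling_db = intensity_to_db(2e-12)    # doubling the intensity adds about 3 dB
```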
Dynamic Range
• Threshold of hearing is ~0dB and threshold of pain is ~125dB
• Dynamic range of a system: difference between the loudest and
softest sound that a system can produce (measured in dB)
• For a linearly encoded PCM stream it is roughly 6 dB per bit: number of bits * 6
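The 6 dB per bit rule can be derived from the ratio between the largest and smallest amplitude steps in an N-bit word, which is 2^N. Because this is an amplitude ratio rather than a power ratio, the sketch uses 20 * log10 (power is proportional to amplitude squared). The function name is an assumption.

```python
import math

def dynamic_range_db(bits):
    """Approximate dynamic range of linear PCM with the given wordlength."""
    return 20.0 * math.log10(2 ** bits)

per_bit = dynamic_range_db(1)     # about 6.02 dB for each added bit
cd_range = dynamic_range_db(16)   # about 96 dB
dvd_range = dynamic_range_db(24)  # about 144 dB
```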

Quantization noise
• Quantization noise is the distortion produced by rounding real signal
amplitude values during the ADC process to the values “allowed” by the
bit resolution of each sample.
• The difference in level between the intended signal and the noise arising
from quantization is the signal-to-quantization-noise ratio (SQNR)
• This depends on the quantization accuracy (# of bits) and the signal itself.

• Example: a sound with progressively worsening quantization noise:
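The dependence of the SQNR on the number of bits can also be estimated numerically. The following is a rough sketch under stated assumptions: a uniform rounding quantizer over [-1, 1), a near full-scale sine test tone with an arbitrarily chosen frequency, and hypothetical function names. It is not a model of any particular converter.

```python
import math

def quantize(x, bits):
    """Round x to the nearest level of a uniform quantizer with 2**bits levels."""
    step = 2.0 / (2 ** bits)
    code = math.floor(x / step + 0.5)
    # Clip to the signed integer range representable with `bits` bits.
    code = max(-(2 ** (bits - 1)), min(2 ** (bits - 1) - 1, code))
    return code * step

def sqnr_db(bits, num_samples=20000):
    """Estimate the SQNR (dB) for a near full-scale sine test tone."""
    signal_power = 0.0
    noise_power = 0.0
    for n in range(num_samples):
        x = 0.99 * math.sin(0.1234567 * n)   # assumed test signal
        error = quantize(x, bits) - x
        signal_power += x * x
        noise_power += error * error
    return 10.0 * math.log10(signal_power / noise_power)

sqnr_8 = sqnr_db(8)
sqnr_16 = sqnr_db(16)
```

For this test signal the estimate grows by roughly 6 dB for each added bit, in line with the dynamic range rule; a different signal would give a different absolute figure.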

Low-level quantization noise
• Sounds just above silence are degraded most severely by the
quantization noise, because all of the variation is captured by the
least significant bit.

• This is known as low-level QN, i.e. a square wave produced by 1-bit
variations triggered when the signal has a very low amplitude.
• This noise can be critical, as square waves are rich in odd
harmonics, which can even extend beyond the Nyquist frequency,
producing aliasing.

• Solutions to this problem include:

1. Increasing the bit resolution (the level of noise is “inversely
proportional” to the number of bits per sample)
2. Adding dither, i.e. low-energy analog noise added prior to the AD
conversion, hence randomizing the quantization noise. Low-level
uncorrelated wide-band noise (amplitude typically LSB/2) is less
intrusive than square wave noise.
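The effect of dither on a very low-level signal can be sketched with a simulation. The assumptions here are loud ones: amplitudes are expressed in units of one LSB, the "signal" is a constant level of 0.3 LSB standing in for a sound just above silence, and the dither is uniform noise of amplitude LSB/2 generated in software rather than analog noise added before a real converter.

```python
import math
import random

def quantize_lsb(x):
    """Round an amplitude expressed in LSB units to the nearest integer code."""
    return math.floor(x + 0.5)

random.seed(0)            # fixed seed so the run is repeatable
level = 0.3               # assumed input level, well below one LSB
num_samples = 20000

# Without dither every sample rounds to 0: the signal is lost entirely.
plain = [quantize_lsb(level) for _ in range(num_samples)]

# With dither the output toggles between 0 and 1 at random, and its
# average tracks the input level instead of sticking at a fixed code.
dithered = [quantize_lsb(level + random.uniform(-0.5, 0.5))
            for _ in range(num_samples)]

plain_mean = sum(plain) / num_samples
dithered_mean = sum(dithered) / num_samples
```

The dithered output is noisier sample by sample, but the error is no longer a deterministic function of the signal, which is the trade-off described in item 2 above.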
Dithering

[Figure: image example with three panels: original, 8 colors without dither, 8 colors with dither]
Oversampling
• If the desired sampling rate is X, oversampling will perform the
analog-to-digital conversion at some faster rate, such as 2X.

• The technique can be used to minimize aliasing, reduce noise, and
increase accuracy beyond that provided by the wordlength.
• It spreads the (uniformly distributed) quantization noise over a wider
frequency range, so less of it falls below the Nyquist frequency of the
desired rate.
• When the final filtering is performed, the residual quantization noise in
the audible signal will be lower: with uniformly distributed noise, 4X
oversampling yields a 6 dB reduction, i.e. 3 dB per doubling of the rate
(9 dB for 8X, 12 dB for 16X oversampling)
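The size of the reduction can be worked out under a simple model. Assuming the quantization noise is spread uniformly over the sampled band and no noise shaping is used, oversampling by a factor M leaves 1/M of the noise power inside the original band, a reduction of 10 * log10(M) dB. The function name below is hypothetical; converters that use noise shaping achieve larger reductions than this model predicts.

```python
import math

def inband_noise_reduction_db(oversampling_factor):
    """In-band noise reduction for plain oversampling with uniform noise."""
    return 10.0 * math.log10(oversampling_factor)

reduction_2x = inband_noise_reduction_db(2)     # about 3 dB
reduction_4x = inband_noise_reduction_db(4)     # about 6 dB
reduction_16x = inband_noise_reduction_db(16)   # about 12 dB
```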
Storage Requirements

Type   Wordlength   Sampling rate (Hz)   SQNR     Bytes/minute/channel
CD     16 bits      44100                96 dB    5,292,000
CD     16 bits      48000                96 dB    5,760,000
DVD    24 bits      88200                144 dB   15,876,000
DVD    24 bits      96000                144 dB   17,280,000
DVD    24 bits      192000               144 dB   34,560,000

Storage requirement = fs * wordlength * duration * channels
(with the wordlength in bytes and the duration in seconds, the result is in bytes)
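The storage formula translates directly into code. In this sketch the function name and argument names are assumptions; the wordlength is given in bits and converted to bytes, so a 16-bit sample occupies 2 bytes and a 24-bit sample 3 bytes.

```python
def storage_bytes(fs_hz, wordlength_bits, duration_s, channels):
    """Uncompressed PCM storage in bytes: fs * wordlength * duration * channels."""
    bytes_per_sample = wordlength_bits // 8
    return fs_hz * bytes_per_sample * duration_s * channels

# One minute of 16-bit audio at 44.1 kHz.
mono_minute = storage_bytes(44100, 16, 60, 1)     # 5,292,000 bytes
stereo_minute = storage_bytes(44100, 16, 60, 2)   # 10,584,000 bytes
```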

DAC and Imaging
• Just as we used an ADC to go from x(t) to x(n), we can turn a
discrete sequence into a continuous voltage-level signal using a
Digital-to-analog converter (DAC).
• However, the quantized nature of the digital signal produces a
“Zero-Order Hold” effect that distorts the converted signal,
introducing abrupt step changes.
• This distortion is known as imaging.
• To avoid this, we use a low-pass filter after the DAC, which
smooths out those fast changes.
• The filter, known as an anti-imaging filter (AKA smoothing or
reconstruction filter), discards signal components above the
Nyquist frequency, thus performing a simple interpolation
between the sampled values.
Digital Recording and Playback

• This is not only storage, this is our digital system!
• That system is supposed to process the signal somehow
• Still, we do not know anything about our system
Digital systems
• The digital system can be seen as an algorithm that operates on
the discrete input sequence x(n)
• The output of such a system is the sequence y(n)
• The simplest of such systems are known as Linear Time-invariant
(LTI) systems
• As the name indicates they must be time-invariant: i.e. their
behavior does not change over time; and linear: they fulfill the
following condition:
if $x(n) = A \cdot x_1(n) + B \cdot x_2(n)$
then $y(n) = A \cdot y_1(n) + B \cdot y_2(n)$
• for any constants A and B, where y_i(n) is the system's output for the
input x_i(n), thus satisfying the superposition and scaling
properties
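The linearity condition can be tested numerically on a candidate system. The sketch below is illustrative only: the two example systems, the test signals, and the function names are assumptions, and passing the check for one pair of inputs does not prove linearity, although failing it does disprove it.

```python
def two_point_average(x):
    """y(n) = 0.5*x(n) + 0.5*x(n-1), with x(-1) taken as 0."""
    return [0.5 * x[n] + 0.5 * (x[n - 1] if n > 0 else 0.0)
            for n in range(len(x))]

def square(x):
    """y(n) = x(n)**2, a system that is not linear."""
    return [v * v for v in x]

def is_linear(system, x1, x2, A, B, tol=1e-12):
    """Compare the response to A*x1 + B*x2 with A*y1 + B*y2."""
    combined = system([A * a + B * b for a, b in zip(x1, x2)])
    y1, y2 = system(x1), system(x2)
    expected = [A * a + B * b for a, b in zip(y1, y2)]
    return all(abs(c - e) <= tol for c, e in zip(combined, expected))

x1 = [1.0, 2.0, 3.0, 4.0]
x2 = [0.5, -1.0, 0.0, 2.0]

average_passes = is_linear(two_point_average, x1, x2, 2.0, -3.0)
square_passes = is_linear(square, x1, x2, 2.0, -3.0)
```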
Impulse response
• The input/output relations of an LTI system can be characterized
using a test signal
• A commonly used test signal is the unit impulse, defined as:

$\delta(n) = \begin{cases} 1 & n = 0 \\ 0 & \text{elsewhere} \end{cases}$
• If we apply a unit impulse to a digital system we obtain y(n) = h(n),
the impulse response of the system.
• A digital system can be completely characterized by its impulse
response
Discrete convolution
• Since we know the impulse response h(n) of a given system, we
can calculate its response to ANY input signal x(n) by convolving
the input with its impulse response:
$y(n) = x(n) * h(n) = \sum_{m=-\infty}^{\infty} x(m) \cdot h(n - m)$
• A convolution represents the amount of overlap between x(n) and
a reversed and temporally-shifted version of h(n)

http://mathworld.wolfram.com/Convolution.html
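The claim that the impulse response fully characterizes an LTI system can be checked on a small example. In the sketch below, a three-point averaging system is assumed as the system under test: its impulse response is measured by feeding it a unit impulse, and the response to another input is then computed both directly and by convolution. All names and signal values are illustrative.

```python
def three_point_average(x):
    """Assumed LTI system: y(n) = (x(n) + x(n-1) + x(n-2)) / 3."""
    def at(i):
        return x[i] if i >= 0 else 0.0   # signal is zero before n = 0
    return [(at(n) + at(n - 1) + at(n - 2)) / 3.0 for n in range(len(x))]

def convolve(x, h):
    """y(n) = sum over m of x(m) * h(n - m), for finite-length sequences."""
    y = [0.0] * (len(x) + len(h) - 1)
    for m, xm in enumerate(x):
        for k, hk in enumerate(h):
            y[m + k] += xm * hk          # here n = m + k, so h(n - m) = h(k)
    return y

# Measure the impulse response by applying a unit impulse.
impulse = [1.0, 0.0, 0.0, 0.0, 0.0]
h = three_point_average(impulse)         # 1/3, 1/3, 1/3, 0, 0

# The response to any other input follows from h alone.
x = [3.0, 6.0, 9.0, 0.0, 0.0]
direct = three_point_average(x)
via_convolution = convolve(x, h)[:len(x)]
```

Both routes give the same output sequence (up to rounding), without the convolution ever looking inside the system.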
Basic systems
• A 2-sample delay can be described by the relation: y(n) = x(n-2)

• A gain of a is represented as: y(n) = ax(n)

• The addition (mixing) of two inputs is: $y(n) = a_1 x_1(n) + a_2 x_2(n)$

Basic systems
• By combining the previous systems we can obtain a typical digital
system:
$y(n) = \frac{1}{3} x(n) + \frac{1}{3} x(n-1) + \frac{1}{3} x(n-2)$
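The same combination can be written as code by composing the three basic systems. This is a minimal sketch with hypothetical function names; signals are plain lists and are assumed to be zero before n = 0.

```python
def delay(x, k):
    """y(n) = x(n - k), assuming the signal is zero before n = 0."""
    return [0.0] * k + x[:len(x) - k]

def gain(x, a):
    """y(n) = a * x(n)."""
    return [a * v for v in x]

def mix(*signals):
    """Sample-by-sample sum of any number of equal-length signals."""
    return [sum(values) for values in zip(*signals)]

def three_point_average(x):
    """y(n) = x(n)/3 + x(n-1)/3 + x(n-2)/3, built from delay, gain and mix."""
    third = 1.0 / 3.0
    return mix(gain(x, third),
               gain(delay(x, 1), third),
               gain(delay(x, 2), third))

y = three_point_average([3.0, 6.0, 9.0, 12.0])   # approximately 1, 3, 6, 9
```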
Transfer function
• However, the temporal relations between input and output are not
all we can use to describe the system
• The frequency-domain behavior of a digital system specifies which
input frequencies will be passed, rejected or emphasized.
• This behavior can be described using the transfer function H(z)
and the frequency response H(f) (that will be discussed later)
• The transfer function is obtained by calculating the Z-transform:

$X(z) = \sum_{n=-\infty}^{\infty} x(n) \cdot z^{-n}$

• of the impulse response h(n), as:

$H(z) = \sum_{n=-\infty}^{\infty} h(n) \cdot z^{-n}$
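For a finite impulse response the sum can be evaluated directly. The sketch below assumes the three-point averaging impulse response h = [1/3, 1/3, 1/3] and evaluates H(z) at points on the unit circle, z = e^{j2πf/fs}, which is the standard way to read off how strongly a given frequency is passed. The function names and the 48 kHz rate are assumptions.

```python
import cmath
import math

def transfer_function(h, z):
    """H(z) = sum over n of h(n) * z**(-n), for a finite causal h."""
    return sum(hn * z ** (-n) for n, hn in enumerate(h))

h = [1.0 / 3.0, 1.0 / 3.0, 1.0 / 3.0]    # assumed impulse response

def magnitude_at(freq_hz, fs_hz):
    """|H(z)| evaluated on the unit circle at the given frequency."""
    z = cmath.exp(2j * math.pi * freq_hz / fs_hz)
    return abs(transfer_function(h, z))

dc_gain = magnitude_at(0.0, 48000.0)         # constant input passes unchanged
null_gain = magnitude_at(16000.0, 48000.0)   # fs / 3 is rejected by this filter
```

The averaging system therefore behaves as a low-pass filter: it passes slow variations and cancels a component at one third of the sampling rate.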
Causality and stability
• Some common Z-transforms:

x(n)                  X(z)
$x(n - M)$            $z^{-M} \cdot X(z)$
$\delta(n)$           $1$
$\delta(n - M)$       $z^{-M}$

• Finally, to be realizable, digital systems must be:

1. Causal: the system cannot react to an input before it is received
2. Stable: the sum of the absolute values of h(n) has to be finite
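The stability condition can be illustrated with two assumed impulse responses, h(n) = 0.5^n and h(n) = 1.1^n for n ≥ 0 (both causal, since they are zero for n < 0). A finite partial sum cannot prove convergence, so the sketch only shows the trend: the first sum settles near 2, while the second keeps growing.

```python
def absolute_sum(h):
    """Sum of the absolute values of a finite stretch of an impulse response."""
    return sum(abs(v) for v in h)

decaying = [0.5 ** n for n in range(200)]   # assumed stable example
growing = [1.1 ** n for n in range(200)]    # assumed unstable example

stable_sum = absolute_sum(decaying)         # approaches 1 / (1 - 0.5) = 2
unstable_sum = absolute_sum(growing)        # grows without bound as n increases
```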
Basic Systems in MSP
• MSP is a set of extensions to Max that provide for audio analysis,
processing and synthesis
• All MSP objects end with a tilde ‘~’ to indicate audio-rate
processing. This is because the tilde vaguely resembles a sine wave.

[Figure: three MSP patches captioned “Send any discrete sequence to the DAC”, “Mix” and “Change gain”. Objects shown: startwindow and stop messages (turn audio on/off), adc~, cycle~ 440, +~, *~ 0.2, *~ 0.5, *~ 0.4 (multiply by a number < 1.0 to attenuate), dac~]
Basic systems in MSP
[Figure: MSP delay-line patch. Signal in: adc~; store in delay line: tapin~ 1000; read out with 100 ms delay: tapout~ 100; read out with 200 ms delay: tapout~ 200; output: dac~]

• A tapin~ object saves some amount of its input signal in a
buffer whose size is specified by the object’s argument (here
1000 milliseconds).
• Any tapout~ objects connected to the outlet of a tapin~ share
that same buffer, reading samples out after a delay.
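The shared-buffer idea behind tapin~ and tapout~ can be sketched outside Max as a circular buffer with several read taps. This is a Python analogue written for illustration, not MSP code: the class name `DelayLine` and its methods are hypothetical, and delays are given in samples (a delay in milliseconds would be converted as ms * fs / 1000).

```python
class DelayLine:
    """Circular buffer with one write position and any number of read taps."""

    def __init__(self, size):
        self.buffer = [0.0] * size   # like tapin~, the size bounds the delay
        self.write_index = 0

    def write(self, sample):
        """Store the newest sample, overwriting the oldest one."""
        self.buffer[self.write_index] = sample
        self.write_index = (self.write_index + 1) % len(self.buffer)

    def read(self, delay):
        """Return the sample written `delay` samples before the newest one.

        Valid for 0 <= delay < size.
        """
        return self.buffer[(self.write_index - 1 - delay) % len(self.buffer)]

line = DelayLine(8)
tap_short, tap_long = [], []
for sample in [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]:
    line.write(sample)
    tap_short.append(line.read(2))   # both taps read from the same buffer
    tap_long.append(line.read(3))
```

Each tap returns zeros until enough samples have been written, then replays the input shifted by its own delay.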
Useful References
• Zölzer, U. (Ed). “DAFX: Digital Audio Effects”. John Wiley and Sons (2002)
• Chapter 1: Zölzer, U. “Introduction”.

• Pohlmann, K. “Principles of Digital Audio”. McGraw-Hill, Inc. (1995)

• Roads, C. “The Computer Music Tutorial”. MIT Press (1996)
