0% found this document useful (0 votes)

66 views

Aist2010 03 Analysis

The document discusses Fourier analysis and its applications in audio signal processing. It introduces the Fourier transform which decomposes a signal into its constituent sinusoids. The discrete Fourier transform (DFT) is used to analyze digital audio signals by summing a finite number of sinusoids. The short-time Fourier transform (STFT) further breaks the analysis into frames to show how frequencies change over time in a spectrogram. Window functions are used to avoid spectral leakage when applying the Fourier transform to short segments.

Uploaded by

wingkitcwk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

66 views

Aist2010 03 Analysis

Uploaded by

wingkitcwk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

AUDIO ANALYSIS AND

VISUALIZATION
AIST2010 Lecture 3
Fourier Analysis Spectral Visualization MATLAB Programming

OUTLINE
AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 2
SUMMATION OF WAVES
Any continuous function, e.g. audio signal, can be expressed as a sum
of (infinite many) sinusoidal waves
Proved by French scientist and mathematician Jean Baptiste Fourier (1768–
1830)
Each sinusoidal wave has their
own amplitude and frequency

Image from: Fund. of Music Processing, p.70

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 3

FUNDAMENTAL FREQUENCY AND HARMONICS Image from: https://en.wikipedia.org/wiki/Harmonic

In particular, some waveforms sound ”better”

Only/mainly frequency components in relationship
of integer multiples
The GCD is often called the fundamental frequency
𝑓" , and the others are harmonics 𝑓#
𝑓# = 𝑘𝑓"
Harmonics are sometimes called partials and
overtones too, but may be numbered differently!
Read: https://en.wikipedia.org/wiki/Harmonic#Partials,_overtones,_and_harmonics Harmonics of multiple relationship

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 4

Image from: Comp. Music Instruments, p.63

OTHER WAVEFORMS
Sawtooth wave
Sum of all harmonics, with each
decreasing in amplitude
Square wave
Sum of odd harmonics
Triangle wave Decomposing a square wave
Sum of odd harmonics, with a negative sign for alternating odd harmonics, and
each decreasing in amplitude
Some more animations of the square wave decomposition here: http://bilimneguzellan.net/fuyye-serisi/

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 5

MATHEMATICAL REPRESENTATION
Sometimes you may see this as well:
𝑔 𝑡 ≔ 𝐴 cos(2𝜋𝑓𝑡 + 𝜑)
A sinusoidal wave can be represented as…
𝑔 𝑡 ≔ 𝐴 sin(2𝜋𝑓𝑡 + 𝜑)
Sometimes you may
where see this: 𝜔 = 2𝜋𝑓
A = amplitude, i.e. loudness of the sound
f = frequency (in Hz), i.e. pitch of the sound
Note: period T = 1/ f, in seconds
ϕ = phase (in radians, where 2π rad=360°), i.e. relative position of an
oscillation within its cycle
Note: A phase shift by ϕ+2π has the same effect as a phase shift by ϕ

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 6

FOURIER ANALYSIS
Transformation from time domain (amplitude vs. time) into frequency
domain (magnitude vs. frequency) 𝑒 @9 = cos 𝑡 + 𝑖 sin 𝑡

ℱ 𝑔 𝑡 = 𝑔7 𝑓 = ∫9∈ℝ 𝑔 𝑡 𝑒 =>?@A9 𝑑𝑡
You may view it as counting the occurrence of frequencies in the waveform
Yet, this function for continuous 𝑓 and 𝑡 cannot be applied to digital
signals!

Read: https://betterexplained.com/articles/an-interactive-guide-to-the-fourier-transform/

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 7

Video from: https://youtu.be/spUNpyF58BY

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 8

DISCRETE FOURIER TRANSFORM (DFT)
Since the input values (samples) are equally spaced, the Fourier
Transform for sound samples is discrete @9
𝑒 = cos 𝑡 + 𝑖 sin 𝑡
Sum of finite series of sinusoidal waves
From a sequence of 𝑁 (complex) samples
𝑥F ≔ 𝑥" , 𝑥H , … , 𝑥J=H
into a sequence of 𝑁 complex numbers
NOPQR
=
𝑋# ≔ ∑J=H
FM" 𝑥F 𝑒 S

for 0 ≤ 𝑘 ≤ 𝑁 − 1
Image from: https://blog.revolutionanalytics.com/2014/01/the-fourier-transform-explained-in-one-sentence.html
AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 9
DISCRETE FOURIER TRANSFORM (DFT)
NOPQR
=
𝑋# ≔ ∑J=H 𝑥
FM" F 𝑒 S

The Xk series is called the DFT coefficients (of N frequency bins)

Magnitude 𝑋# = 𝑅𝑒(𝑋# )> + 𝐼𝑚(𝑋# )> 𝑒 @9 = cos 𝑡 + 𝑖 sin 𝑡
Phase 𝑎𝑟𝑔 𝑋# = arctan(𝐼𝑚 𝑋# /𝑅𝑒 𝑋# )
#
Bin frequency 𝑓# = 𝑓b c
J

DFT is a very popular tool for digital signal processing

Usually implemented as Fast Fourier Transform (FFT)
Ordinary DFT is 𝑂(𝑁 > ) while FFT is 𝑂(𝑁 log 𝑁)
Luckily, you can often use FFT simply as a black box in programming
libraries, without understanding the math behind!
AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 10
SHORT-TIME FOURIER TRANSFORM (STFT)
DFT can only show the general “histogram” of frequencies
The appearance of frequencies in the whole analyzed sound
STFT breaks the process into multiple DFT/FFT in time segments
Analysis frames

amp.
Waveform
The result is a spectrogram DFT DFT DFT DFT DFT DFT
(time domain)
Magnitude vs. frequency vs. time time
frame
Magnitude often represented in
DFT DFT DFT DFT DFT DFT
freq.
the colour dimension Spectrogram
coeff coeff coeff coeff coeff coeff (freq. domain)
time

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 11

WINDOW FUNCTION
For a step-by-step Fourier analysis, a window function is needed
The value is 1 only for a short time, and 0 otherwise
Image from:
https://commons.wikimedia.org/wiki/File:
Mplwp_window-functions-symmetric.svg

The shape of the window function will

affect frequency responses
E.g. The sharp edges of a rectangular window
will result in high frequency components
Usual choices to avoid spectral leakage:
Hamming Window, Hann Window, …
Comparison of windows
AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 12
FREQUENCY BINS
Usual window size: powers of two to facilitate FFT, e.g. 1024, …
Often with an overlap of 50% to compensate loss of data by windowing
1024-point FFT = 1024 time samples = 1024 frequency bins
The more samples in the window, Overlap Windows
the higher the frequency resolution

amp.
DFT DFT DFT DFT DFT DFT DFT
The results fall into frequency bins of smaller range
time
The higher the frequency resolution, DFT DFT DFT DFT DFT DFT DFT

freq.
the lower the time resolution coeff coeff coeff coeff coeff coeff coeff
Basically it is a trade-off between time and frequency time
Hop size
AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 13
HOW TO READ THE PLOTS?

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 14

HOW TO READ THE PLOTS?

15
INVERSE OF THE FOURIER TRANSFORM
Rebuilding audio signal from the Fourier analysis data
Inverse DFT
From frequency domain back to time domain
Can easily be expressed in terms of the DFT
J=H J=H
=
>?@#F 1 >?@#F
𝑋# = g 𝑥F 𝑒 J 𝑥F = g 𝑋# 𝑒 J
𝑁
FM" #M"
Inverse STFT
Overlap-add (OLA) method
AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 16
CEPSTRUM
What would happen for a
Fourier transform in the
frequency domain?
Cepstrum: the patterns found in the
spectrum
Quefrency: a measure of time
related to the sampling rate in
time domain
Lifter: a filter in the cepstrum
(quefrency) domain
AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 17
Image from: https://sethares.engr.wisc.edu/vocoders/phasevocoder.html

PHASE VOCODER METHOD

A special kind of FFT analysis is the Phase
Vocoder method
Phase information is used to compensate the
inadequate frequency resolution
Mimicking the analog method using “filter banks”
Possible for spectral edits and resynthesis
Especially good for analysis of harmonic sounds
Using an appropriate frequency bin size to fit
harmonics

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 18

ANALYSIS ON THE HARMONIC SERIES
For harmonic sounds (e.g. musical
instruments), a series of peaks of
integer multiples can be found in the
spectrum 𝑓# = 𝑘𝑓"
Timbre: tone colour
The difference between musical
instruments, or human voice
Pn Gt Hp Xy
Piano
# sig. har. = 6 Guitar
# sig. har. = 6 Harp
# sig. har. = 1 Xylophone
# sig. har. = 3
Bandwidth = 8
Density of Sig. Har. = 0.75
Bandwidth = 9
Density of Sig. Har. = 0.67
Bandwidth = 1
Density of Sig. Har. = 1
Bandwidth = 7
Density of Sig. Har. = 0.43 Integer multiples of f0

1 3 5 7 9 11 13 15 17 19 1 3 5 7 9 11 13 15 17 19 1 3 5 7 9 11 13 15 17 19 1 3 5 7 9 11 13 15 17 19 AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 19

ALTERNATIVES TO STFT
Image from: http://ataspinar.com/2018/12/21/a-guide-for-using-
the-wavelet-transform-in-machine-learning/

STFT has drawbacks such as the resolution

constraints of time vs. frequency
There are alternatives, such as
Wavelet Transform (WT)
Constant-Q Transform (CQT)
The main aim is to reduce frequency
resolution at higher frequencies
Frequency bins gets larger in the high end Time series and various transforms

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 20

LECTURE REVIEW
The lecture is half-way done… with these discussed:
How can sounds be represented mathematically
The transform between time and frequency domain
Continuous vs. Discrete transforms
Different settings of FFT
Further possible analysis based on FFT

In the next half of this lecture, we will learn basic MATLAB

programming!
AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 21
READ FURTHER
Chapter 7, “Frequency-Domain Techniques”, Computer Music
Instruments
Chapter 2, “Fourier Analysis of Signals”, Fundamentals of Music
Processing

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 22

Miniature Concerto For Piano & Orchestra
100% (2)
Miniature Concerto For Piano & Orchestra
22 pages
SF Service Cloud Cheatsheet Web PDF
100% (1)
SF Service Cloud Cheatsheet Web PDF
2 pages
Spectral Modeling and Signal Processing Intro421
100% (2)
Spectral Modeling and Signal Processing Intro421
35 pages
FFT Analysis in Practice
No ratings yet
FFT Analysis in Practice
86 pages
Chapter 3 PDF
No ratings yet
Chapter 3 PDF
121 pages
FFT Spectral Analysis
No ratings yet
FFT Spectral Analysis
47 pages
Seewave Analysis
No ratings yet
Seewave Analysis
17 pages
FFT Spectral Analysis
No ratings yet
FFT Spectral Analysis
69 pages
FFT Research
No ratings yet
FFT Research
8 pages
Lab4 2011
No ratings yet
Lab4 2011
6 pages
l4n JN Uhbh Hiunun Hbinun
No ratings yet
l4n JN Uhbh Hiunun Hbinun
36 pages
FFT Analyzer - OnoSokki - Notes
No ratings yet
FFT Analyzer - OnoSokki - Notes
30 pages
Fourier Notes
No ratings yet
Fourier Notes
38 pages
CMP4101_4_ Frequency Domain Signal Processing -Part II
No ratings yet
CMP4101_4_ Frequency Domain Signal Processing -Part II
80 pages
Frequency Response and Continuous-Time Fourier Transform
No ratings yet
Frequency Response and Continuous-Time Fourier Transform
25 pages
1 FFT and Spectrogram: 1.1 Fourier Transform For Finite Duration Signals
No ratings yet
1 FFT and Spectrogram: 1.1 Fourier Transform For Finite Duration Signals
3 pages
FFT Theory
No ratings yet
FFT Theory
29 pages
L6 Fourier
No ratings yet
L6 Fourier
38 pages
Eng 6 Audio Signals: Bevan Baas, Andre Knoesen
No ratings yet
Eng 6 Audio Signals: Bevan Baas, Andre Knoesen
30 pages
Fourier Series Expansion of Periodic Signal: (With Period of T)
No ratings yet
Fourier Series Expansion of Periodic Signal: (With Period of T)
45 pages
Fourier Notes
No ratings yet
Fourier Notes
14 pages
Guide to the Basic Concepts and Techniques of Spectral Music Joshua Fineberg Part 4
No ratings yet
Guide to the Basic Concepts and Techniques of Spectral Music Joshua Fineberg Part 4
6 pages
Speech Signal Processing: A Handbook of Phonetic Science
No ratings yet
Speech Signal Processing: A Handbook of Phonetic Science
24 pages
4 Acoustic Signal Analysis
No ratings yet
4 Acoustic Signal Analysis
25 pages
Course Reader: Nontechnical Introduction To The Fourier Transform
No ratings yet
Course Reader: Nontechnical Introduction To The Fourier Transform
7 pages
MIT5 35F12 FTLectureBishof
No ratings yet
MIT5 35F12 FTLectureBishof
14 pages
Objectives:: Fast Fourier Transform
No ratings yet
Objectives:: Fast Fourier Transform
14 pages
Introduction To Signal Processing: Professor Mike Brennan
No ratings yet
Introduction To Signal Processing: Professor Mike Brennan
40 pages
Signal Spectra: Why Is The Spectral View Useful?
No ratings yet
Signal Spectra: Why Is The Spectral View Useful?
5 pages
Frequency Space
No ratings yet
Frequency Space
71 pages
Moving Into The Frequency Domain: Time Spatial Fourier Transform (FT)
No ratings yet
Moving Into The Frequency Domain: Time Spatial Fourier Transform (FT)
71 pages
Lab9
No ratings yet
Lab9
14 pages
Image Enhance Frequency Domain Student
No ratings yet
Image Enhance Frequency Domain Student
21 pages
CHP 3 Fourier Transform 2024 v1
No ratings yet
CHP 3 Fourier Transform 2024 v1
26 pages
Igital Ignal Rocessing: Balochistan University of Information Technology, Engineering & Management Sciences-Quetta
No ratings yet
Igital Ignal Rocessing: Balochistan University of Information Technology, Engineering & Management Sciences-Quetta
10 pages
Analysis of Audio Signal Using Various T Ef70b0cd
No ratings yet
Analysis of Audio Signal Using Various T Ef70b0cd
13 pages
Fourier Transform Applications
No ratings yet
Fourier Transform Applications
18 pages
Lecture 4 Slides DFT Sampling Theorem
No ratings yet
Lecture 4 Slides DFT Sampling Theorem
32 pages
Lecture8 Fouriertransforms
No ratings yet
Lecture8 Fouriertransforms
9 pages
FourierLecture06 Transforms Version4
No ratings yet
FourierLecture06 Transforms Version4
36 pages
Notes On Fourier Transforms: PHYS 332: Junior Physics Laboratory II
No ratings yet
Notes On Fourier Transforms: PHYS 332: Junior Physics Laboratory II
6 pages
Matlab Activity-1
No ratings yet
Matlab Activity-1
2 pages
M8 - Discrete Time Fourier Transform
No ratings yet
M8 - Discrete Time Fourier Transform
30 pages
FFT Wavelet NthOctave e
No ratings yet
FFT Wavelet NthOctave e
10 pages
Signal Processing
No ratings yet
Signal Processing
21 pages
ECE 410 Digital Signal Processing D. Munson University of Illinois
No ratings yet
ECE 410 Digital Signal Processing D. Munson University of Illinois
10 pages
NI Tutorial 4844 en Understanding FFT and Windowing
No ratings yet
NI Tutorial 4844 en Understanding FFT and Windowing
11 pages
12 Fourier T Xen
No ratings yet
12 Fourier T Xen
129 pages
Unit 5
No ratings yet
Unit 5
78 pages
Analysisof Speech Signal 29 TH October 2018
No ratings yet
Analysisof Speech Signal 29 TH October 2018
16 pages
Clase 3 Bioingenieria PDF
No ratings yet
Clase 3 Bioingenieria PDF
25 pages
Matlab Activity
No ratings yet
Matlab Activity
2 pages
Understanding FFTs and Windowing
No ratings yet
Understanding FFTs and Windowing
15 pages
12 Fourier T Xen
No ratings yet
12 Fourier T Xen
128 pages
Course Notes v17
No ratings yet
Course Notes v17
82 pages
Math IA Consolidated
No ratings yet
Math IA Consolidated
24 pages
Hall 2018 Time Frequency Decomposition
No ratings yet
Hall 2018 Time Frequency Decomposition
3 pages
Linear Algebra, Signal Processing, And Wavelets - A Unified Approach_ MATLAB Version (Instructor's Solution Manual) (Solutions)
No ratings yet
Linear Algebra, Signal Processing, And Wavelets - A Unified Approach_ MATLAB Version (Instructor's Solution Manual) (Solutions)
209 pages
Play Guitar: Exploration and Analysis of Harmonic Possibilities
From Everand
Play Guitar: Exploration and Analysis of Harmonic Possibilities
Kevin Kriescher
No ratings yet
Filter Bank: Insights into Computer Vision's Filter Bank Techniques
From Everand
Filter Bank: Insights into Computer Vision's Filter Bank Techniques
Fouad Sabry
No ratings yet
Digital Signal Processing for Audio Applications: Volume 1 - Formulae
From Everand
Digital Signal Processing for Audio Applications: Volume 1 - Formulae
Anton R Kamenov
No ratings yet
The Music Producer's Guide To Distortion: The Music Producer's Guide
From Everand
The Music Producer's Guide To Distortion: The Music Producer's Guide
Ashley Hewitt
No ratings yet
Note05-Arena Modeling
No ratings yet
Note05-Arena Modeling
67 pages
The Corrupting Influence of Variability: Dr. Jeff Hong Ielm Dept, Hkust Fall 2010
No ratings yet
The Corrupting Influence of Variability: Dr. Jeff Hong Ielm Dept, Hkust Fall 2010
32 pages
Introduction To Computer Simulation: Dr. Jeff Hong Ielm Dept, Hkust Fall 2010
No ratings yet
Introduction To Computer Simulation: Dr. Jeff Hong Ielm Dept, Hkust Fall 2010
29 pages
Variability Basics: Dr. Jeff Hong Ielm Dept, Hkust Fall 2010
No ratings yet
Variability Basics: Dr. Jeff Hong Ielm Dept, Hkust Fall 2010
28 pages
Introductions To EEMT512: Dr. Jeff Hong Ielm Dept, Hkust Fall 2010
No ratings yet
Introductions To EEMT512: Dr. Jeff Hong Ielm Dept, Hkust Fall 2010
4 pages
Note01-Basic Dynamics
No ratings yet
Note01-Basic Dynamics
29 pages
Introduction To Computer Music: AIST2010 Lecture 1
No ratings yet
Introduction To Computer Music: AIST2010 Lecture 1
15 pages
Lonely Planet - Cantonese Phrasebook PDF
No ratings yet
Lonely Planet - Cantonese Phrasebook PDF
160 pages
Surface Pro PDF
No ratings yet
Surface Pro PDF
3 pages
Music in Real Life Vs Music in Digital World: AIST2010 Lecture 2
No ratings yet
Music in Real Life Vs Music in Digital World: AIST2010 Lecture 2
35 pages
Interpretation of Argentine Tango For Strings, Bandoneon and Piano
No ratings yet
Interpretation of Argentine Tango For Strings, Bandoneon and Piano
5 pages
Dana Design Catalog Mexico - Vietnam PDF
No ratings yet
Dana Design Catalog Mexico - Vietnam PDF
57 pages
Artigo VoxPlot
No ratings yet
Artigo VoxPlot
10 pages
Overview of Digital Audio Steganography Techniques
No ratings yet
Overview of Digital Audio Steganography Techniques
5 pages
Debabala Swain Machine Learning and Information 2020
No ratings yet
Debabala Swain Machine Learning and Information 2020
533 pages
Final Report On Speech Recognition Project
No ratings yet
Final Report On Speech Recognition Project
32 pages
Analysis Introduction MAXMSP JITTER
100% (2)
Analysis Introduction MAXMSP JITTER
11 pages
Rajesh Thesis
No ratings yet
Rajesh Thesis
86 pages
Speaker Diarization
No ratings yet
Speaker Diarization
47 pages
3.2 Automatic Speech Recognition.pptx
No ratings yet
3.2 Automatic Speech Recognition.pptx
151 pages
A Deep Learning Framework For Audio Deepfake Detection
No ratings yet
A Deep Learning Framework For Audio Deepfake Detection
12 pages
Sok: A Study of The Security On Voice Processing Systems
No ratings yet
Sok: A Study of The Security On Voice Processing Systems
10 pages
Project Report
No ratings yet
Project Report
106 pages
Digital Speech Processing- Synthesis, And Recognition by Sadaoki Furui
No ratings yet
Digital Speech Processing- Synthesis, And Recognition by Sadaoki Furui
42 pages
Voicemorphing (Finel)
100% (1)
Voicemorphing (Finel)
20 pages
John W. Tukey's Work On Time Series and Spectrum Analysis
No ratings yet
John W. Tukey's Work On Time Series and Spectrum Analysis
25 pages
A Practical Handbook of Speech Coders
No ratings yet
A Practical Handbook of Speech Coders
15 pages
VTU CBCS2015SCHEME Ecsyll8aem
0% (1)
VTU CBCS2015SCHEME Ecsyll8aem
14 pages
Lectures 1 Rabiner Speech Processing
No ratings yet
Lectures 1 Rabiner Speech Processing
77 pages
Pitch Detection of Voice Signals
No ratings yet
Pitch Detection of Voice Signals
24 pages
Music Genre Classification
No ratings yet
Music Genre Classification
33 pages
Audio Steg
No ratings yet
Audio Steg
26 pages
Gender Recognition Using Fast Fourier Transform With Ann
No ratings yet
Gender Recognition Using Fast Fourier Transform With Ann
6 pages
Homomorphic Filtering and Speech Processing Using Cepstrum Analysis
100% (2)
Homomorphic Filtering and Speech Processing Using Cepstrum Analysis
22 pages
An Automatic Speaker Recognition System
100% (1)
An Automatic Speaker Recognition System
11 pages
A Statistical Pattern Recognition Paradigm For Vibration-Based Structural Health Monitoring
No ratings yet
A Statistical Pattern Recognition Paradigm For Vibration-Based Structural Health Monitoring
10 pages
Speech Signal Processing
100% (2)
Speech Signal Processing
173 pages
Research Article: Dance Evaluation Based On Movement and Neural Network
No ratings yet
Research Article: Dance Evaluation Based On Movement and Neural Network
7 pages
Meyer 2011
No ratings yet
Meyer 2011
16 pages
Cepstrum Analysis
No ratings yet
Cepstrum Analysis
13 pages
Emotion Recognition Using Speech Features by K. Sreenivasa Rao, Shashidhar G. Koolagudi (Auth.)
No ratings yet
Emotion Recognition Using Speech Features by K. Sreenivasa Rao, Shashidhar G. Koolagudi (Auth.)
133 pages
Unit 3
No ratings yet
Unit 3
44 pages

Aist2010 03 Analysis

Uploaded by

Aist2010 03 Analysis

Uploaded by

AUDIO ANALYSIS AND

Image from: Fund. of Music Processing, p.70

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 3

In particular, some waveforms sound ”better”

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 4

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 5

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 6

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 7

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 8

­The Xk series is called the DFT coefficients (of N frequency bins)

DFT is a very popular tool for digital signal processing

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 11

The shape of the window function will

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 14

PHASE VOCODER METHOD

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 18

1 3 5 7 9 11 13 15 17 19 1 3 5 7 9 11 13 15 17 19 1 3 5 7 9 11 13 15 17 19 1 3 5 7 9 11 13 15 17 19 AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 19

STFT has drawbacks such as the resolution

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 20

In the next half of this lecture, we will learn basic MATLAB

AIST2010 L3 — AUDIO ANALYSIS AND VISUALIZATION 22

You might also like

The Xk series is called the DFT coefficients (of N frequency bins)