EC39201_Expt4_Lab Report_Grp-24

The experiment investigates the significance of low-frequency temporal cues in speech recognition, demonstrating that speech can be recognized with minimal spectral information. By manipulating audio signals with band-limited noise and bandpass filters, it was found that increasing the number of frequency bands enhances voice clarity. The results indicate that temporal cues play a crucial role in speech perception, especially in the low-frequency range.


Digital Signal Processing Lab

Experiment IV: Speech Recognition with Primarily Temporal Cues

12 October 2022

Group-24
KR Rahul(20EC30021)
Rahul Singh(20EC30037)
AIM

The aim of this experiment is to understand the relative importance of the low-frequency
temporal structure of speech versus its spectral content in speech perception.

THEORY

Nearly perfect speech recognition has been observed under conditions of greatly
reduced spectral information. A temporal envelope was extracted from each of several
wide frequency bands of the speech signal and used to modulate noise of the same
bandwidth. This operation preserved the temporal envelope cues of each band but gave
the listener only severely degraded information about the spectral energy distribution.
Recognition of consonants, vowels, and words in simple sentences improved
significantly as the number of bands increased, and high recognition performance was
achieved with as few as three bands of modulated noise. Representing dynamic temporal
patterns in just a few broad spectral regions is therefore sufficient to recognize speech.

Speech recognition was long thought to require frequency-specific (spectral) cues. For
example, spectral energy peaks in speech reflect the resonance properties of the vocal
tract and carry acoustic information about how the speech was produced. However,
attempts to identify acoustic cues that reliably convey phoneme identity across different
listening conditions and different speakers have met with limited success. Studies using
amplitude compression and spectral reduction demonstrate the robustness of speech
recognition under those manipulations, but the resulting stimuli still had very complex
spectro-temporal properties. Even removing spectral cues from speech entirely yields
stimuli that contain a surprising amount of information about consonant identity. In this
experiment, amplitude and timing cues were retained while the amount of spectral
information was systematically varied. This combination allowed us not only to
parametrically assess the role of spectral detail in speech recognition, independent of
temporal cues, but also to simulate cochlear implant stimulation patterns.

Spectral information was removed from the speech by replacing frequency-specific
information with band-limited noise over a wide range of frequencies. The acoustic
signal was divided into several frequency bands, and the amplitude envelope was
extracted from each band.

Random noise was modulated by the envelope signal and spectrally limited by the
same bandpass filter used for the original analysis band. Thus, timing and amplitude
cues were retained in each spectral band, but spectral detail within each band was
removed. All bands were then summed and presented to the listener.
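The report's implementation is in MATLAB (see CODE below); as an illustrative cross-check, the same noise-vocoding pipeline can be sketched in Python/NumPy. The band-edge formula f_k = 90·64^(k/N), the 240 Hz envelope smoothing cutoff, and the second-order Butterworth filters are taken from the report; SciPy's `butter`, `lfilter`, and `hilbert` stand in for their MATLAB counterparts.

```python
import numpy as np
from scipy.signal import butter, lfilter, hilbert

def noise_vocode(x, fs, n_bands):
    """Replace spectral detail with band-limited noise, keeping envelopes."""
    rng = np.random.default_rng(0)
    noise = rng.standard_normal(len(x))          # wideband noise carrier
    out = np.zeros(len(x))
    for k in range(n_bands):
        f1 = 90.0 * 64.0 ** (k / n_bands)        # lower band edge (Hz)
        f2 = 90.0 * 64.0 ** ((k + 1) / n_bands)  # upper band edge (Hz)
        b, a = butter(2, [f1, f2], btype="bandpass", fs=fs)
        band = lfilter(b, a, x)                  # analysis band of the speech
        env = np.abs(hilbert(band))              # amplitude envelope (Hilbert)
        bl, al = butter(2, 240.0, btype="low", fs=fs)
        env = lfilter(bl, al, env)               # smooth the envelope (<240 Hz)
        out += lfilter(b, a, env * noise)        # modulated noise, re-band-limited
    return out
```

The sample rate must exceed twice the top band edge (2 × 5760 Hz) for the bandpass design to be valid; e.g. `y = noise_vocode(x, 16000, 8)` for a 16 kHz signal.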
CODE:
clc;
clear;

info = audioinfo('fivewo.wav');
[x, Fs] = audioread('fivewo.wav');
t = 0:seconds(1/Fs):seconds(info.Duration);
t = t(1:end-1);
subplot(3,1,1);
plot(t, x);
xlabel('Time');
ylabel('Audio Data');
title('Plot of the Given Audio');
disp(info);

n = 100;                              % noise attenuation factor
noise = (1/n)*wgn(length(x), 1, 1);   % white Gaussian noise carrier
subplot(3,1,2);
plot(t, noise);
xlabel('Time');
ylabel('Noise');
title('Plot of the White Gaussian Noise');

z = 0;
N = 2;                                % number of bands
for i = 1:N
    % Logarithmically spaced band edges: f_k = 90*64^(k/N)
    f1 = 90*64^((i-1)/N);
    f2 = 90*64^(i/N);
    % Bandpass analysis filter (cutoffs normalized by the Nyquist rate Fs/2)
    [B, A] = butter(2, [f1 f2]/(Fs/2), 'bandpass');
    y = filter(B, A, x);
    % Amplitude envelope of the band via the Hilbert transform,
    % smoothed with a 240 Hz low-pass filter
    y_hilb = hilbert(y);
    [B_, A_] = butter(2, 240/(Fs/2), 'low');
    y_env = filter(B_, A_, abs(y_hilb));
    % Modulate the noise with the envelope, then re-limit it to the same band
    mult = filter(B, A, y_env.*noise);
    z = z + mult;
end

subplot(3,1,3);
plot(t, z);
xlabel('Time');
ylabel('Final Audio');
title('Plot of Final Audio');

sound(n*z, Fs);
audiowrite('final.wav', z, Fs);
● For N=2 Bandpass Filters:

● For N=8 Bandpass Filters:


● For N=16 Bandpass Filters:

DISCUSSION

1. For N = 1 or 2, where N is the number of bands, the voice in the provided audio
file could not be recognized, although the rhythmic pattern could still be followed.
The voice became recognizable from 3 bands onwards.
2. The voice is clearly recognizable with 8 and 16 bands; the clarity of the voice
with 16 bands is similar to that with 8 bands.
3. As bands are added, the time-domain representation comes increasingly close
to the original signal, and the frequency-domain representation begins to
resemble the actual spectrum of the audio.
4. The spacing between bands is crucial for the decoding of the audio signal.
5. Most of the energy of the human voice lies in the low-frequency range. As we
apply narrower-bandwidth filters with increasingly closely spaced bands, we
recover more of the information carried by the speech. Thus, increasing the
number of bands improves the clarity of the output voice.
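On point 4 above: the band edges in this experiment follow f_k = 90·64^(k/N), so they are logarithmically spaced between 90 Hz and 90·64 = 5760 Hz regardless of N; increasing N only narrows each band. A quick sketch (Python for illustration; the report's code is MATLAB):

```python
def band_edges(n_bands):
    # Logarithmic edges: constant ratio 64**(1/n_bands) between neighbours
    return [90.0 * 64.0 ** (k / n_bands) for k in range(n_bands + 1)]

print(band_edges(2))  # [90.0, 720.0, 5760.0]
```

For N = 8 the ratio between consecutive edges is 64^(1/8) ≈ 1.68, so every band spans the same fraction of an octave, matching the roughly logarithmic frequency resolution of the ear.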
