0% found this document useful (0 votes)

4 views

MTP 1

The document provides an overview of sound, including its components such as amplitude, frequency, and tempo, as well as techniques for audio processing like noise reduction and reverb addition. It discusses methods for detecting musical onsets and integrating instruments into audio tracks, emphasizing the importance of rhythm and clarity in music. Future work includes developing online platforms for audio processing and real-time systems for live performances.

Uploaded by

reheyi2494

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

MTP 1

Uploaded by

reheyi2494

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

What is Sound?

• It is a combination of various waveform signals aka sinusoids of

different time intervals, frequencies, tempo and amplitude.
• ‘Amplitude’ represent ‘volume’ and frequency represent pitch of the
sound.
• ‘Tempo’ or speed of sound can be defined as rate of change in
frequency.
• Frequency is the number of cycles of a wave or a periodic event that
occur within a specific unit of time.
• If a sound wave has a frequency of 1000 Hz, it means there are 1000
wave cycles every second.
Sound with constant frequency (beep sound) has no
meaning for tempo.

Constant Frequency, amplitude and tempo.

Sinusoidally varying frequency; Constant amplitude and

tempo.
Experiments:
Sinusoidally varying frequency and tempo; Constant
amplitude.

Sinusoidally varying frequency and tempo; Linearly

increasing amplitude

Bike Revving
Key Components of
Sound
• Attack: The initial burst of sound when a note
starts, often sharp and loud.
• Transient: The short, sudden increase in sound
that marks the beginning of the note.
• Decay: The gradual fading out of the sound after
the attack.

• References: Juan Pablo Bello, Laurent Daudet, Samer Abdallah, Chris Duxbury, Mike Davies, and Mark
B. Sandler, Senior Member, IEEE
Convert Convert Audio to an Array

Normalize Normalize the Signal Amplitude (-1 to 1)

Onset Segment Segment into Overlapping Frames (30ms-50ms)

Detection
Calculate Calculate Energy for Each Frame (Squared Amplitude)

Detect Detect Sudden Changes in Energy to Identify Peaks

Waveform &
Energy of Audio
Onsets of Audio
Duplicate Duplicate the original audio array to create a new array.

Reduce Reduce the amplitude of the duplicate array (e.g., scale it by 0.5).

Enhancing
Audio with Shift Shift the duplicate array to the right by 0.1 seconds.

Reverb Padding Add padding to the right-shifted and original arrays to ensure they
have the same length.

Combine Combine the two audio arrays into a single merged audio track.
Enhancing Audio with Reverb
Results of Merging Audio File
Noise Reduction

Audio with Noise

Audio after Noise Removal

Extract Extract the noise profile from the audio.

Connvert the audio signal from the time domain to the frequency
Convert domain.

Noise Calculate Calculate the power (magnitude squared) of both the noise and

Reduction
audio across all frequencies.

Subtract the noise power from the audio power at each

Subtract frequency.

Reconstruct the cleaned audio by converting it back from the

Reconstruct frequency domain to the time domain.
Frequency Domain
Extract Extract the noise profile from the audio.

Connvert the audio signal from the time domain to the frequency
Convert domain.

Noise Calculate Calculate the power (magnitude squared) of both the noise and

Reduction
audio across all frequencies.

Subtract the noise power from the audio power at each

Subtract frequency.

Reconstruct the cleaned audio by converting it back from the

Reconstruct frequency domain to the time domain.
Spectrogram
Added Instrument on each Onset.

Added instruments at specific onsets, based on

a 90% amplitude threshold.

Instruments Avoided overlap of instruments by tracking the

end time of instrument.
Sound Integration
Added 4 Instruments in a Loop

Added 4 Instruments A/C to Amplitude

Instrument on Each
Onset

• Detected Onset
• Generated Drum Sound
• Added drum sound to Onset
Added instruments at
specific onsets, based on a
90% amplitude threshold.

• Detected Onset
• Generated Drum Sound
• Added the drum sound to
onsets with the top 90%
amplitude.
Avoided overlap of
instruments

• Detected Onset
• Used Jhanjh instrument
recorded sound
• Tracked the onset start time and
end time of each instrument.
• This ensured clear separation
and precise timing between
different instrumental sounds.
Added 4 Instruments
in a Loop

• Intruments as: Tabla, Ghungroo,

Damru, Jhanjh
• Tracked the onset start time and
end time of each instrument.
• This ensured clear separation
and precise timing between
different instrumental sounds.
Added 4 Instruments
A/C to Amplitude

• Initially, I stored the amplitude of

each onset and sorted the data
in ascending order.
• The data was then divided into
four segments.
• Each segment was assigned to a
different instrument, ensuring
balanced distribution.
Ankur Satyam

Experimented With Friends Audio

Detecting Mukhada in Audio File

Importance:
Mukhada: A repeating Identifying Mukhada
melodic phrase in a helps in rhythmic
song. enhancement and
remixing.
Detecting Mukhada
Manually extracted the Mukhada segment from the song.

Modified Cross-correlation is used to measure the similarity between two signals.

Identify where the Mukhada appears in the song by comparing it to the full
audio.
Iterates over possible shifts, calculating a similarity score.

Experimented with different values of "skip" to optimize the detection.

High
Downsampling
2000

• Computationally
efficient but less
accurate.
• It took 1-2 hours to
compute.
Moderate
Downsampling
500
• Improved accuracy at a
slight increase in
computational cost
compared to the first.
• It took 3-4 hours to
compute.
Low
Downsampling
10
• Computationally
expensive, but most
accurate similarity.
• It took 26-28 hours to
compute.
Conclusion
Onset Detection: Accurate identification of musical onsets helps in rhythm analysis and adding
effects precisely.

Reverb Addition: Adding reverb creates a sense of space and depth, enriching the overall listening
experience.

Noise Reduction: Effective noise reduction techniques improve audio clarity without
compromising quality.

Instrument Integration: Adding extra sounds, like tabla, ghungroo, damru & jhan, helps improve
the rhythm and feel of the music.

Allows for the precise identification of key recurring themes (Mukhda) within a song.
Future Work

Detecting Periodicity in Music: Developing methods to find patterns in a

song and use this to add beats at regular intervals.

Online Audio Processing Platform: Create a website where users can

upload their recorded songs and choose options like noise removal, adding
reverb, or integrating beats and instruments.

Real-Time Model for Live Performances: Develop a real-time system that

can generate instruments beats during live music performances.
References
• Juan Pablo Bello, Laurent Daudet, Samer Abdallah, Chris Duxbury,
Mike Davies, and Mark B. Sandler, Senior Member, IEEE
• D. Sinha, S. Saeed, and A. Ferreira, “A novel automatic noise
removal technique for audio and speech signals,”
• Dattorro jon, “Effect design, part 1: reverberator and other filters,”
journal of the audio engineering society,

Audio Processes by David Creasey
100% (4)
Audio Processes by David Creasey
741 pages
Tryptophant Audio Production
No ratings yet
Tryptophant Audio Production
57 pages
Estimating Tempo, Swing and Beat Locations in Audio Recordings
No ratings yet
Estimating Tempo, Swing and Beat Locations in Audio Recordings
4 pages
Cross-Correlation As A Measure For Cross-Modal Analysis of Music and Floor Data
No ratings yet
Cross-Correlation As A Measure For Cross-Modal Analysis of Music and Floor Data
5 pages
OpenFrameworks Lections: Interactive Sound
No ratings yet
OpenFrameworks Lections: Interactive Sound
35 pages
AI-Based Vocal Judging Application
No ratings yet
AI-Based Vocal Judging Application
8 pages
spearfinal05
No ratings yet
spearfinal05
4 pages
EchoNest Analyze Documentation
No ratings yet
EchoNest Analyze Documentation
7 pages
Electronic Music Handbook-2019 PDF
No ratings yet
Electronic Music Handbook-2019 PDF
87 pages
06516351
No ratings yet
06516351
6 pages
SMS Software Manual
No ratings yet
SMS Software Manual
18 pages
PCS Lab 3
No ratings yet
PCS Lab 3
8 pages
Signals Report
No ratings yet
Signals Report
12 pages
Recording, Eq &amp FX Keywords
No ratings yet
Recording, Eq &amp FX Keywords
7 pages
A Comparative Study of Analogue and Digital Mixing Techniques
No ratings yet
A Comparative Study of Analogue and Digital Mixing Techniques
99 pages
Notes - 1.2.1 - Multimedia - Sound
No ratings yet
Notes - 1.2.1 - Multimedia - Sound
6 pages
Audio Processes (Part)
No ratings yet
Audio Processes (Part)
37 pages
Lab 6 - Shazam Part II
No ratings yet
Lab 6 - Shazam Part II
5 pages
Emilia ResearchWork
No ratings yet
Emilia ResearchWork
114 pages
Automatic Tuning System For Polyphonic Sound
No ratings yet
Automatic Tuning System For Polyphonic Sound
11 pages
Melody Transcription EC304 Signal Processing: Project Project Report
No ratings yet
Melody Transcription EC304 Signal Processing: Project Project Report
16 pages
Ronatay-Santos-Signals-Final-Output
No ratings yet
Ronatay-Santos-Signals-Final-Output
8 pages
First Research Paper
No ratings yet
First Research Paper
15 pages
PHD Tristan
No ratings yet
PHD Tristan
137 pages
2 - 1 - Lesson Overview (8-55)
No ratings yet
2 - 1 - Lesson Overview (8-55)
5 pages
Week2 - Fourier Series - The Math Behind The Music - V1
No ratings yet
Week2 - Fourier Series - The Math Behind The Music - V1
5 pages
C# Beat Detection Technical Writeup
No ratings yet
C# Beat Detection Technical Writeup
4 pages
How Does Chromaprint Work
No ratings yet
How Does Chromaprint Work
4 pages
Basic Features of Audio Signals (音訊的基本特徵) : Jyh-Shing Roger Jang (張智星) MIR Lab, CS Dept, Tsing Hua Univ. Hsinchu, Taiwan
No ratings yet
Basic Features of Audio Signals (音訊的基本特徵) : Jyh-Shing Roger Jang (張智星) MIR Lab, CS Dept, Tsing Hua Univ. Hsinchu, Taiwan
18 pages
MIT21M 380S12 Lec01 PDF
No ratings yet
MIT21M 380S12 Lec01 PDF
10 pages
Enhancing Orchestration Technique Via Spectrally Based Linear Algebra Methods
No ratings yet
Enhancing Orchestration Technique Via Spectrally Based Linear Algebra Methods
11 pages
Imm 6321
No ratings yet
Imm 6321
88 pages
Qiaozhan Gao Report ReportFinal
No ratings yet
Qiaozhan Gao Report ReportFinal
6 pages
06b Yamamoto
No ratings yet
06b Yamamoto
1 page
5 Basics of Digital Audio (1)
No ratings yet
5 Basics of Digital Audio (1)
29 pages
4 LAB EXERCISE: Synthesis of Musical Notes: Laboratory 4 Moses Abu
100% (2)
4 LAB EXERCISE: Synthesis of Musical Notes: Laboratory 4 Moses Abu
3 pages
Musical Instrument Identi Cation With Feature Selection Using Evolutionary Methods Loughran Thesis
No ratings yet
Musical Instrument Identi Cation With Feature Selection Using Evolutionary Methods Loughran Thesis
281 pages
Visualizing Sounds
No ratings yet
Visualizing Sounds
13 pages
Audio Data Analysis Using Machine Learning and Deep
No ratings yet
Audio Data Analysis Using Machine Learning and Deep
74 pages
Shazam Princeton ELE201
No ratings yet
Shazam Princeton ELE201
7 pages
FFT Research
No ratings yet
FFT Research
8 pages
Introsounds 2 2
No ratings yet
Introsounds 2 2
33 pages
Thesis Fitz
No ratings yet
Thesis Fitz
206 pages
Multimedia System: Chapter Five: Basics of Digital Audio
No ratings yet
Multimedia System: Chapter Five: Basics of Digital Audio
42 pages
The Beat Spectrum: A New Approach To Rhythm Analysis: 2. Previous Work
No ratings yet
The Beat Spectrum: A New Approach To Rhythm Analysis: 2. Previous Work
4 pages
Royal Procession
No ratings yet
Royal Procession
17 pages
04 Digital Audio - Nuts and Bolts
No ratings yet
04 Digital Audio - Nuts and Bolts
52 pages
MPM12 Rhythm PDF
No ratings yet
MPM12 Rhythm PDF
34 pages
Chapter4 Sound
No ratings yet
Chapter4 Sound
39 pages
As Music Technology Exam Revision Guide
No ratings yet
As Music Technology Exam Revision Guide
12 pages
Musical Signal Processing
No ratings yet
Musical Signal Processing
19 pages
Content-Based Classification of Musical Instrument Timbres: Agostini Longari Pollastri
100% (1)
Content-Based Classification of Musical Instrument Timbres: Agostini Longari Pollastri
8 pages
Mixng Analysis
No ratings yet
Mixng Analysis
11 pages
Guide to the Basic Concepts and Techniques of Spectral Music Joshua Fineberg Part 4
No ratings yet
Guide to the Basic Concepts and Techniques of Spectral Music Joshua Fineberg Part 4
6 pages
A Digital Audio Primer: Waveforms
No ratings yet
A Digital Audio Primer: Waveforms
4 pages
Music database retrieval based on spectral similarity.
No ratings yet
Music database retrieval based on spectral similarity.
9 pages
The Impulse Response Bible
From Everand
The Impulse Response Bible
Past To Future
No ratings yet
So,You Want To Be An Audio Engineer: A Complete Beginners Guide For Selecting Audio Gear
From Everand
So,You Want To Be An Audio Engineer: A Complete Beginners Guide For Selecting Audio Gear
Kevin Parker
5/5 (1)
Sound Design and Mixing in Reason
From Everand
Sound Design and Mixing in Reason
Andrew Eisele
3/5 (2)
ABCs of Audio Recording
From Everand
ABCs of Audio Recording
Jon Bellona
No ratings yet
2011 Pre-Calc Slides Section 7.2
No ratings yet
2011 Pre-Calc Slides Section 7.2
19 pages
Eduardo Moreno Olivera: Storage Assistant
No ratings yet
Eduardo Moreno Olivera: Storage Assistant
3 pages
Cuadernillo Nivel 1 2024
No ratings yet
Cuadernillo Nivel 1 2024
66 pages
Motivating ESL Learners To Overcome Speech Anxiety
No ratings yet
Motivating ESL Learners To Overcome Speech Anxiety
5 pages
Lesson 1 and 2
No ratings yet
Lesson 1 and 2
6 pages
Student Manual Religion 121-122
No ratings yet
Student Manual Religion 121-122
439 pages
A. Match The Functions With The Correct Sentence.: Already, Just, Never, Not Yet, Still
No ratings yet
A. Match The Functions With The Correct Sentence.: Already, Just, Never, Not Yet, Still
2 pages
(Ebook) Intonation Phonology 2th Edition by D. Robert Ladd (full name:Dwight Robert Ladd Jr) ISBN 9780511808814, 051180881X download pdf
100% (1)
(Ebook) Intonation Phonology 2th Edition by D. Robert Ladd (full name:Dwight Robert Ladd Jr) ISBN 9780511808814, 051180881X download pdf
81 pages
[FREE PDF sample] Oral Poetry and Somali Nationalism The Case of Sayid Mahammad Abdille Hasan African Studies 1st Edition Said S. Samatar ebooks
100% (6)
[FREE PDF sample] Oral Poetry and Somali Nationalism The Case of Sayid Mahammad Abdille Hasan African Studies 1st Edition Said S. Samatar ebooks
32 pages
Ruaumoko-2D Examples: 1 Earthquake Response
100% (1)
Ruaumoko-2D Examples: 1 Earthquake Response
30 pages
Vijeo Citect Non Equipment Tag List
No ratings yet
Vijeo Citect Non Equipment Tag List
5 pages
Engineering Mathematics-I KAS103T
No ratings yet
Engineering Mathematics-I KAS103T
4 pages
SS1C Reviewer
No ratings yet
SS1C Reviewer
26 pages
SSC CGL Tier 2 Maths Paper 13 Sep 2019 46
No ratings yet
SSC CGL Tier 2 Maths Paper 13 Sep 2019 46
26 pages
Interview Fred D'augiar
No ratings yet
Interview Fred D'augiar
9 pages
Lecture-3 - Architecture of Distributed Systems F23
No ratings yet
Lecture-3 - Architecture of Distributed Systems F23
20 pages
IT2301-Java Programming QB
No ratings yet
IT2301-Java Programming QB
10 pages
Quiz Ks3 Drama Skills and Vocab
No ratings yet
Quiz Ks3 Drama Skills and Vocab
5 pages
Downloads - CS608 - Lecture 1A - CS608VBNETIntro - Part I of IV
No ratings yet
Downloads - CS608 - Lecture 1A - CS608VBNETIntro - Part I of IV
66 pages
Exam 14-15
No ratings yet
Exam 14-15
16 pages
Structure of English
No ratings yet
Structure of English
30 pages
Speed Up Reading & Noting
No ratings yet
Speed Up Reading & Noting
4 pages
Che715 s21 01
No ratings yet
Che715 s21 01
27 pages
Jeff Bezos and The End of PowerPoint As We Know It
No ratings yet
Jeff Bezos and The End of PowerPoint As We Know It
9 pages
XI Revision Worksheet9 Clauses and Sentence Reordering
No ratings yet
XI Revision Worksheet9 Clauses and Sentence Reordering
4 pages
Rhythm and Music of Kuttiyattam
No ratings yet
Rhythm and Music of Kuttiyattam
12 pages
Pearson Teaching Activities Jamboree A PDF
No ratings yet
Pearson Teaching Activities Jamboree A PDF
99 pages
01DDT22F2044 Varshaan A/L Puaneswaran: SESI 2 2022/2023
No ratings yet
01DDT22F2044 Varshaan A/L Puaneswaran: SESI 2 2022/2023
5 pages
Modal Verbs
No ratings yet
Modal Verbs
2 pages
Introduction To Computer Applications
No ratings yet
Introduction To Computer Applications
7 pages

MTP 1

Uploaded by

MTP 1

Uploaded by

What is Sound?

• It is a combination of various waveform signals aka sinusoids of

Constant Frequency, amplitude and tempo.

Sinusoidally varying frequency; Constant amplitude and

Sinusoidally varying frequency and tempo; Linearly

Normalize Normalize the Signal Amplitude (-1 to 1)

Onset Segment Segment into Overlapping Frames (30ms-50ms)

Detect Detect Sudden Changes in Energy to Identify Peaks

Audio with Noise

Audio after Noise Removal

Subtract the noise power from the audio power at each

Reconstruct the cleaned audio by converting it back from the

Subtract the noise power from the audio power at each

Reconstruct the cleaned audio by converting it back from the

Added instruments at specific onsets, based on

Instruments Avoided overlap of instruments by tracking the

Added 4 Instruments A/C to Amplitude

• Intruments as: Tabla, Ghungroo,

• Initially, I stored the amplitude of

Experimented With Friends Audio

Modified Cross-correlation is used to measure the similarity between two signals.

Experimented with different values of "skip" to optimize the detection.

Detecting Periodicity in Music: Developing methods to find patterns in a

Online Audio Processing Platform: Create a website where users can

Real-Time Model for Live Performances: Develop a real-time system that

You might also like