MTP 1
MTP 1
Bike Revving
Key Components of
Sound
• Attack: The initial burst of sound when a note
starts, often sharp and loud.
• Transient: The short, sudden increase in sound
that marks the beginning of the note.
• Decay: The gradual fading out of the sound after
the attack.
• References: Juan Pablo Bello, Laurent Daudet, Samer Abdallah, Chris Duxbury, Mike Davies, and Mark
B. Sandler, Senior Member, IEEE
Convert Convert Audio to an Array
Detection
Calculate Calculate Energy for Each Frame (Squared Amplitude)
Reduce Reduce the amplitude of the duplicate array (e.g., scale it by 0.5).
Enhancing
Audio with Shift Shift the duplicate array to the right by 0.1 seconds.
Reverb Padding Add padding to the right-shifted and original arrays to ensure they
have the same length.
Combine Combine the two audio arrays into a single merged audio track.
Enhancing Audio with Reverb
Results of Merging Audio File
Noise Reduction
Connvert the audio signal from the time domain to the frequency
Convert domain.
Noise Calculate Calculate the power (magnitude squared) of both the noise and
Reduction
audio across all frequencies.
Connvert the audio signal from the time domain to the frequency
Convert domain.
Noise Calculate Calculate the power (magnitude squared) of both the noise and
Reduction
audio across all frequencies.
• Detected Onset
• Generated Drum Sound
• Added drum sound to Onset
Added instruments at
specific onsets, based on a
90% amplitude threshold.
• Detected Onset
• Generated Drum Sound
• Added the drum sound to
onsets with the top 90%
amplitude.
Avoided overlap of
instruments
• Detected Onset
• Used Jhanjh instrument
recorded sound
• Tracked the onset start time and
end time of each instrument.
• This ensured clear separation
and precise timing between
different instrumental sounds.
Added 4 Instruments
in a Loop
Importance:
Mukhada: A repeating Identifying Mukhada
melodic phrase in a helps in rhythmic
song. enhancement and
remixing.
Detecting Mukhada
Manually extracted the Mukhada segment from the song.
Identify where the Mukhada appears in the song by comparing it to the full
audio.
Iterates over possible shifts, calculating a similarity score.
• Computationally
efficient but less
accurate.
• It took 1-2 hours to
compute.
Moderate
Downsampling
500
• Improved accuracy at a
slight increase in
computational cost
compared to the first.
• It took 3-4 hours to
compute.
Low
Downsampling
10
• Computationally
expensive, but most
accurate similarity.
• It took 26-28 hours to
compute.
Conclusion
Onset Detection: Accurate identification of musical onsets helps in rhythm analysis and adding
effects precisely.
Reverb Addition: Adding reverb creates a sense of space and depth, enriching the overall listening
experience.
Noise Reduction: Effective noise reduction techniques improve audio clarity without
compromising quality.
Instrument Integration: Adding extra sounds, like tabla, ghungroo, damru & jhan, helps improve
the rhythm and feel of the music.
Allows for the precise identification of key recurring themes (Mukhda) within a song.
Future Work