
The emergence of deep learning: new opportunities for music and audio technologies

Dorien Herremans∗ & Ching-Hua Chuan†

∗ Singapore University of Technology and Design
† University of Miami

There has been tremendous interest in deep learning across many fields of
study. Recently, these techniques have gained popularity in the field of music.
Projects such as Magenta (the Google Brain team's music generation project), Jukedeck, and IBM Watson Beat testify to their potential. Due to this rising
interest in using deep neural networks to tackle tasks in the domain of audio and
music, the guest editors organized the first International Workshop on Music
and Audio as part of the International Joint Conference on Neural Networks
in Alaska in 2017. The current NCAA issue on “Deep learning for music and
audio” was born out of the workshop.
While humans can rely on their intuitive understanding of musical patterns and the relationships between them, it remains a challenging task for computers to capture and quantify musical structures. Recently, researchers have attempted to use deep learning models to learn features and relationships that allow us to accomplish tasks such as music transcription, audio feature extraction, emotion recognition, music recommendation, and automated music generation. With this special issue, we aim to present a collection of research that advances the state of the art in machine intelligence for music and audio. This enables us to critically review and discuss cutting-edge research so as to identify grand challenges, effective methodologies, and potential new applications. The current issue therefore contains a wide variety of manuscripts that touch upon a number of important topics of particular interest to the field of music and audio technology, including:

• deep learning for computational music research;
• modeling hierarchical and long-term music structures using deep learning;
• modeling ambiguity and preference in music;
• applications of deep networks for music and audio, such as audio transcription, voice separation, music generation, music recommendation, etc.;
• novel architectures designed to represent music and audio.

We present a selection of papers on state-of-the-art approaches, current challenges, and future directions in deep learning for music and audio. Novel approaches are explored in various applications, including chord labeling, voice separation, and music generation. For instance, Koops et al. discuss how to model ambiguity and individual preferences when performing automatic chord labeling from audio, using a merged representation in a dense deep neural network. Singing voice separation in audio recordings is tackled by Lin et al., who use an ideal binary mask to train a deep convolutional neural network. With regard to music generation, Hadjeres and Nielsen propose a new network architecture for generating (harmonized) soprano parts of chorales, which incorporates user constraints in a recurrent neural network. In addition, Dean and Forth examine the use of neural networks to generate music in a rather unexplored style (post-tonal improvisation) and obtain promising initial results. Oore et al. show that recurrent neural networks are able to generate expressive music; their system received positive feedback from musicians. For readers who are new to music generation and deep learning, Briot and Pachet's paper provides an introductory overview of the problem, approaches, and remaining challenges. Finally, the question of using CNNs for audio style transfer is examined by Shahrin and Wyse. While this problem remains hard, the authors show that the network learns meaningful features, as audio texture is revealed in the Gram matrices.
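The role of the Gram matrix in this kind of texture and style analysis can be illustrated with a short sketch. The snippet below is a minimal, hypothetical example (not the authors' implementation): it computes the Gram matrix of the feature maps of one convolutional layer applied to a spectrogram-like input; the layer shapes and the random stand-in activations are assumptions made purely for illustration.

```python
import numpy as np

def gram_matrix(features: np.ndarray) -> np.ndarray:
    """Compute the normalized Gram matrix of a feature map.

    features: array of shape (channels, height, width), e.g. the activations of
    one convolutional layer applied to a (log-)spectrogram.
    Returns a (channels, channels) array whose entries are inner products
    between pairs of channel activations -- the texture statistics used in
    Gatys-style style transfer.
    """
    c, h, w = features.shape
    flat = features.reshape(c, h * w)   # one row per channel, flattened over time-frequency
    return flat @ flat.T / (h * w)      # channel-by-channel correlations, normalized

# Toy usage: random "activations" standing in for a CNN layer applied to audio.
if __name__ == "__main__":
    rng = np.random.default_rng(0)
    fake_activations = rng.standard_normal((64, 128, 431))  # 64 channels, 128 freq bins, 431 frames
    G = gram_matrix(fake_activations)
    print(G.shape)  # (64, 64)
```

Because the spatial (time-frequency) dimensions are summed out, the Gram matrix discards where events happen and keeps only which features co-occur, which is why it captures texture rather than structure.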
In addition to applications, a number of papers in this special issue also examine the meaningful concepts that deep networks can learn from music and audio, compare the performance of different architectures for feature learning, and investigate the impact of challenging scenarios in acoustic signals. Chuan et al. show that musical concepts such as key and chords can be captured by statistical learning methods such as word2vec, a technique commonly used in the field of natural language processing. Convolutional neural networks for audio emotion recognition are explored by Wieser et al., who find that these networks can learn meaningful features related to certain emotions. Deng et al. propose a novel deep Time-Frequency LSTM for audio restoration, in which temporal and spectral dynamics are explicitly captured, allowing for more effective restoration of low-bitrate audio. Dörfler et al. show that the design of the audio filter and the time-frequency resolution can affect the accuracy of convolutional neural networks when used as classifiers. Kiskin et al. focus on the detection of low signal-to-noise-ratio acoustic events (e.g., detecting the presence of mosquitoes in audio recordings) with convolutional neural networks and other machine learning techniques, using acoustic features extracted by different transforms. Finally, the effect of different deep architectures and multiple learning sources on a model's ability to learn efficient musical representations is examined by Kim et al.
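To make the word2vec idea mentioned above concrete, the following toy sketch (a hypothetical example using the gensim library, not Chuan et al.'s actual setup or corpus) treats chord symbols as "words" and chord progressions as "sentences", so that chords occurring in similar tonal contexts end up with similar embeddings.

```python
# Hypothetical toy example: learn chord embeddings with word2vec.
# Requires gensim >= 4.0; the tiny corpus below is made up for illustration.
from gensim.models import Word2Vec

# A tiny hand-made corpus of chord progressions; a real experiment would use
# thousands of progressions extracted from symbolic or audio data.
progressions = [
    ["C", "Am", "F", "G7", "C"],
    ["C", "F", "G7", "C"],
    ["Am", "Dm", "G7", "C"],
    ["F", "G7", "Em", "Am"],
] * 50  # repeat so the toy model sees enough co-occurrences

model = Word2Vec(
    sentences=progressions,
    vector_size=16,  # small embedding, enough for a toy vocabulary
    window=2,        # context: neighboring chords in the progression
    min_count=1,
    sg=1,            # skip-gram
    epochs=50,
)

# Chords that play similar tonal roles should have similar vectors.
print(model.wv.most_similar("G7", topn=3))
```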
We hope the readers will enjoy the manuscripts in this special issue. Our thanks go out to all of the authors, reviewers, the editor-in-chief, and the editorial
office of NCAA for their support. Exciting times are ahead for the field of audio
and music technologies.

Preprint of: Herremans, D., & Chuan, C.-H. (2019). The emergence of deep learning: new opportunities for music and audio technologies. Editorial, Special Issue on Deep Learning for Music and Audio. Neural Computing and Applications. Springer. DOI: 10.1007/s00521-019-04166-0
