Piece Identification in Classical Piano Music Without Reference Scores
[Figure: block diagram of the identification pipeline. Recoverable labels: "Audio Snippet of an Unseen Performance of a Piece"; "Music Transcription Algorithm" (Transcribe Query); "Query"; "Tempo-invariant Symbolic Fingerprinter"; "Reference Scores"; "Query Results"; "Name of the Piece, corresponding to the Query".]
Table 2. Results of the baseline approach. The results are based on 3 700 queries for each query length.

Table 3. Results on the reference database based on multiple recordings (the top five results according to the web source) to represent each piece. The results are based on 3 700 queries for each query length.

                        Query Length
                        2 s       5 s       10 s
Recall at Rank 1        0.76      0.87      0.91
Recall at Rank 5        0.84      0.94      0.97
Recall at Rank 10       0.86      0.95      0.98
Mean Reciprocal Rank    0.80      0.90      0.94
Mean Query Time         0.82 s    2.85 s    6.08 s

Table 4. Results on the reference database based on multiple recordings (the top fifteen results according to the web source) to represent each piece. The results are based on 3 700 queries for each query length.

This is the percentage of queries which have the correct corresponding piece in the first k retrieval results. In our experiments we look at the recall at ranks 1, 5 and 10. In addition, we also report the Mean Reciprocal Rank (MRR).

\[ \mathrm{MRR} = \frac{1}{|Q|} \sum_{i=1}^{|Q|} \frac{1}{\mathrm{rank}_i} \tag{1} \]

Here, rank_i refers to the rank position of the correct result for the ith query.

The mean query times (i.e. the mean time it takes to process a single query) given in the tables are based on a desktop computer on a single core (footnote 5). If needed, the computation could easily be sped up by multi-threading the query process.
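The two evaluation measures described in this section, recall at rank k and the Mean Reciprocal Rank, can be illustrated with a minimal Python sketch. The function names and the example ranks are hypothetical and are not taken from the paper. The sketch also assumes that a query whose correct piece is never retrieved is represented by None and contributes a reciprocal rank of 0, a convention the text does not specify.

```python
from typing import Optional, Sequence


def recall_at_k(ranks: Sequence[Optional[int]], k: int) -> float:
    """Fraction of queries whose correct piece is among the first k results.

    ranks[i] is the 1-based rank of the correct piece for query i,
    or None if it was not retrieved (assumed convention).
    """
    if not ranks:
        raise ValueError("at least one query is required")
    hits = sum(1 for r in ranks if r is not None and r <= k)
    return hits / len(ranks)


def mean_reciprocal_rank(ranks: Sequence[Optional[int]]) -> float:
    """Mean of 1 / rank_i over all queries, as in Eq. (1).

    Queries without a retrieved correct piece contribute 0 (assumption).
    """
    if not ranks:
        raise ValueError("at least one query is required")
    total = sum(1.0 / r for r in ranks if r is not None)
    return total / len(ranks)


# Made-up ranks for five queries; the third query found no correct piece.
example_ranks = [1, 2, None, 1, 4]
print(recall_at_k(example_ranks, 1))
print(recall_at_k(example_ranks, 5))
print(mean_reciprocal_rank(example_ranks))
```

For these made-up ranks, two of five queries are correct at rank 1 and four of five within rank 5, so MRR sits between the two recall values, which matches the pattern of the figures reported in the tables.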