Abstract
The “winning” system in the 2013 MIREX Latin Genre Classification Task was a deep neural network trained with simple features. An explanation for its winning performance has yet to be found. In previous work, we built similar systems using the BALLROOM music dataset, and found their performances to be greatly affected by slightly changing the tempo of the music of a test recording. In the MIREX task, however, systems are trained and tested using the Latin Music Dataset (LMD), which is 4.5 times larger than BALLROOM, and which does not seem to show as strong a relationship between tempo and label as BALLROOM. In this paper, we reproduce the “winning” deep learning system using LMD, and measure the effects of time dilation on its performance. We find that tempo changes of at most ±6 % greatly diminish and improve its performance. Interpreted with the low-level nature of the input features, this supports the conclusion that the system is exploiting some low-level absolute time characteristics to reproduce ground truth in LMD.
Original language | English |
---|---|
Title of host publication | Proceedings of the 5th International Conference Mathematics and Computation in Music, MCM 2015 |
Editors | Tom Collins, David Meredith, Anja Volk |
Publisher | Springer |
Publication date | 2015 |
Pages | 335-346 |
ISBN (Print) | 978-3-319-20602-8 |
ISBN (Electronic) | 978-3-319-20603-5 |
DOIs | |
Publication status | Published - 2015 |
Event | 5th International Conference on Mathematics and Computation in Music (MCM 2015) - London, United Kingdom Duration: 22 Jun 2015 → 25 Jun 2015 Conference number: 5 http://mcm2015.qmul.ac.uk/ |
Conference
Conference | 5th International Conference on Mathematics and Computation in Music (MCM 2015) |
---|---|
Number | 5 |
Country/Territory | United Kingdom |
City | London |
Period | 22/06/2015 → 25/06/2015 |
Internet address |
Series | Lecture Notes in Computer Science |
---|---|
Volume | 9110 |
ISSN | 0302-9743 |
Keywords
- Machine music listening
- Genre
- Deep Learning
- Evaluation