Temporal feature integration for music genre classification

Publication: Research - peer-reviewJournal article – Annual report year: 2007

Standard

Temporal feature integration for music genre classification. / Meng, Anders; Ahrendt, Peter; Larsen, Jan; Hansen, Lars Kai.

In: I E E E Transactions on Audio, Speech and Language Processing, Vol. 15, No. 5, 2007, p. 1654-1664.

Publication: Research - peer-reviewJournal article – Annual report year: 2007

Harvard

APA

CBE

MLA

Vancouver

Author

Meng, Anders; Ahrendt, Peter; Larsen, Jan; Hansen, Lars Kai / Temporal feature integration for music genre classification.

In: I E E E Transactions on Audio, Speech and Language Processing, Vol. 15, No. 5, 2007, p. 1654-1664.

Publication: Research - peer-reviewJournal article – Annual report year: 2007

Bibtex

@article{ca97ac4935fe401fab225bf96940d245,
title = "Temporal feature integration for music genre classification",
publisher = "I E E E",
author = "Anders Meng and Peter Ahrendt and Jan Larsen and Hansen, {Lars Kai}",
note = "Copyright: 2007 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE",
year = "2007",
doi = "10.1109/TASL.2007.899293",
volume = "15",
number = "5",
pages = "1654--1664",
journal = "I E E E Transactions on Audio, Speech and Language Processing",
issn = "1558-7916",

}

RIS

TY - JOUR

T1 - Temporal feature integration for music genre classification

A1 - Meng,Anders

A1 - Ahrendt,Peter

A1 - Larsen,Jan

A1 - Hansen,Lars Kai

AU - Meng,Anders

AU - Ahrendt,Peter

AU - Larsen,Jan

AU - Hansen,Lars Kai

PB - I E E E

PY - 2007

Y1 - 2007

N2 - Temporal feature integration is the process of combining all the feature vectors in a time window into a single feature vector in order to capture the relevant temporal information in the window. The mean and variance along the temporal dimension are often used for temporal feature integration, but they capture neither the temporal dynamics nor dependencies among the individual feature dimensions. Here, a multivariate autoregressive feature model is proposed to solve this problem for music genre classification. This model gives two different feature sets, the diagonal autoregressive (DAR) and multivariate autoregressive (MAR) features which are compared against the baseline mean-variance as well as two other temporal feature integration techniques. Reproducibility in performance ranking of temporal feature integration methods were demonstrated using two data sets with five and eleven music genres, and by using four different classification schemes. The methods were further compared to human performance. The proposed MAR features perform better than the other features at the cost of increased computational complexity.

AB - Temporal feature integration is the process of combining all the feature vectors in a time window into a single feature vector in order to capture the relevant temporal information in the window. The mean and variance along the temporal dimension are often used for temporal feature integration, but they capture neither the temporal dynamics nor dependencies among the individual feature dimensions. Here, a multivariate autoregressive feature model is proposed to solve this problem for music genre classification. This model gives two different feature sets, the diagonal autoregressive (DAR) and multivariate autoregressive (MAR) features which are compared against the baseline mean-variance as well as two other temporal feature integration techniques. Reproducibility in performance ranking of temporal feature integration methods were demonstrated using two data sets with five and eleven music genres, and by using four different classification schemes. The methods were further compared to human performance. The proposed MAR features perform better than the other features at the cost of increased computational complexity.

U2 - 10.1109/TASL.2007.899293

DO - 10.1109/TASL.2007.899293

JO - I E E E Transactions on Audio, Speech and Language Processing

JF - I E E E Transactions on Audio, Speech and Language Processing

SN - 1558-7916

IS - 5

VL - 15

SP - 1654

EP - 1664

ER -