Mel Frequency Cepstral Coefficients: An Evaluation of Robustness of MP3 Encoded Music

Sigurdur Sigurdsson, Kaare Brandt Petersen, Tue Lehn-Schiøler

    Research output: Chapter in Book/Report/Conference proceeding; Article in proceedings; Research; peer-review

    Abstract

    In large MP3 databases, files are typically generated with different parameter settings, i.e., bit rates and sampling rates. This is of concern for MIR applications, as encoding differences can potentially confound meta-data estimation and similarity evaluation. In this paper we discuss the influence of MP3 coding on the Mel frequency cepstral coefficients (MFCCs). The main result is that the widely used subset of the MFCCs is robust at bit rates equal to or higher than 128 kbit/s, for the implementations we have investigated. However, for lower bit rates, e.g., 64 kbit/s, the implementation of the Mel filter bank becomes an issue.
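
    The abstract notes that the implementation of the Mel filter bank matters at low bit rates. As a rough illustration of how two implementations can differ, the sketch below lays out the band edges of a triangular Mel filter bank. It is not the authors' code: the 2595*log10(1 + f/700) Mel formula is one common convention, and the function names, filter counts, frequency ranges, and the cutoff_hz parameter are all hypothetical choices made for this example.

```python
import math


def hz_to_mel(f_hz):
    # One common Hz-to-mel convention (assumed here; implementations vary).
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m):
    # Inverse of hz_to_mel above.
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_band_edges(n_filters, f_min_hz, f_max_hz):
    # Return n_filters + 2 edge frequencies in Hz, equally spaced on the
    # mel scale. Triangular filter k spans edges[k] .. edges[k + 2].
    m_lo = hz_to_mel(f_min_hz)
    m_hi = hz_to_mel(f_max_hz)
    step = (m_hi - m_lo) / (n_filters + 1)
    return [mel_to_hz(m_lo + i * step) for i in range(n_filters + 2)]


def filters_above_cutoff(edges, cutoff_hz):
    # Count filters whose upper edge lies above a given frequency,
    # e.g. a hypothetical low-pass cutoff applied by an encoder.
    return sum(1 for k in range(len(edges) - 2) if edges[k + 2] > cutoff_hz)


if __name__ == "__main__":
    # Two hypothetical filter bank configurations, for illustration only.
    narrow = mel_band_edges(20, 0.0, 11025.0)
    wide = mel_band_edges(40, 0.0, 22050.0)
    cutoff_hz = 16000.0  # hypothetical value, not taken from the paper
    print("narrow bank, filters above cutoff:", filters_above_cutoff(narrow, cutoff_hz))
    print("wide bank, filters above cutoff:", filters_above_cutoff(wide, cutoff_hz))
```

    Under these assumptions, a filter bank that extends to higher frequencies has more filters covering any band that a low bit rate encoder attenuates, which is one plausible way the filter bank design could interact with the encoding. Whether this is the mechanism behind the reported result is not stated in the abstract.
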
    Original language: English
    Title of host publication: Proceedings of the Seventh International Conference on Music Information Retrieval (ISMIR)
    Publication date: 2006
    Publication status: Published - 2006
    Event: Seventh International Conference on Music Information Retrieval (ISMIR)
    Duration: 1 Jan 2006 → …

    Conference

    Conference: Seventh International Conference on Music Information Retrieval (ISMIR)
    Period: 01/01/2006 → …

    Keywords

    • MP3
    • robustness
    • MFCC
    • Mel frequency cepstral coefficients

    Cite this

    Sigurdsson, S., Petersen, K. B., & Lehn-Schiøler, T. (2006). Mel Frequency Cepstral Coefficients: An Evaluation of Robustness of MP3 Encoded Music. In Proceedings of the Seventh International Conference on Music Information Retrieval (ISMIR).