Low Complexity Bayesian Single Channel Source Separation

Thomas Beierholm, Brian Dam Pedersen, Ole Winther

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearch

    278 Downloads (Pure)

    Abstract

    We propose a simple Bayesian model for performing single channel speech separation using factorized source priors in a sliding window linearly transformed domain. Using a one dimensional mixture of Gaussians to model each band source leads to fast tractable inference for the source signals. Simulations with separation of a male and a female speaker using priors trained on the same speakers show comparable performance with the blind separation approach of G.-J. Jang and T.-W. Lee (see NIPS, vol.15, 2003) with a SNR improvement of 4.9 dB for both the male and female speaker. Mixing coefficients can be estimated quite precisely using ML-II, but the estimation is quite sensitive to the accuracy of the priors as opposed to the source separation quality for known mixing coefficients, which is quite insensitive to the accuracy of the priors. Finally, we discuss how to improve our approach while keeping the complexity low using machine learning and CASA (computational auditory scene analysis) approaches (Jang and Lee, 2003; Roweis, S.T., 2001; Wang, D.L. and Brown, G.J., 1999; Hu, G. and Wang, D., 2003).
    Original languageEnglish
    Title of host publicationProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
    VolumeVolume 5
    PublisherIEEE
    Publication date2004
    Pages529-532
    ISBN (Print)07-80-38484-9
    DOIs
    Publication statusPublished - 2004
    EventIEEE International Conference on Acoustics, Speech, and Signal Processing 2004 - Montreal, Quebec, Canada
    Duration: 17 May 200421 May 2004

    Conference

    ConferenceIEEE International Conference on Acoustics, Speech, and Signal Processing 2004
    CountryCanada
    CityMontreal, Quebec
    Period17/05/200421/05/2004

    Bibliographical note

    Copyright 2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

    Fingerprint Dive into the research topics of 'Low Complexity Bayesian Single Channel Source Separation'. Together they form a unique fingerprint.

    Cite this