Blind estimation of the number of speech source in reverberant multisource scenarios based on binaural signals

Tobias May, Steven van de Par

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    283 Downloads (Pure)

    Abstract

    In this paper we present a new approach for estimating the number of active speech sources in the presence of interfering noise sources and reverberation. First, a binaural front-end is used to detect the spatial positions of all active sound sources, resulting in a binary mask for each candidate position. Then, each candidate position is characterized by a set of features. In addition to exploiting the overall
    spectral shape, a new set of mask-based features is proposed which aims at characterizing the pattern of the estimated binary mask. The decision stage for detecting a speech source is based on a support vector machine (SVM) classifier. A systematic analysis shows that the proposed algorithm is able to blindly determine the number and the corresponding spatial positions of speech sources in multisource scenarios and generalizes well to unknown acoustic conditions
    Original languageEnglish
    Title of host publicationProceedings of IWAENC 2012
    Number of pages4
    Publication date2012
    Publication statusPublished - 2012
    EventInternational Workshop on Acoustic Signal Enhancement - RWTH Aachen University, Aachen, Germany
    Duration: 4 Sept 20126 Sept 2012

    Conference

    ConferenceInternational Workshop on Acoustic Signal Enhancement
    LocationRWTH Aachen University
    Country/TerritoryGermany
    CityAachen
    Period04/09/201206/09/2012

    Keywords

    • Binaural processing
    • Binary mask
    • computational computational

    Fingerprint

    Dive into the research topics of 'Blind estimation of the number of speech source in reverberant multisource scenarios based on binaural signals'. Together they form a unique fingerprint.

    Cite this