ROBUST LOCALISATION OF MULTIPLE SPEAKERS EXPLOITING HEAD MOVEMENTS AND MULTI-CONDITIONAL TRAINING OF BINAURAL CUES

Tobias May, Ning Ma, Guy Brown

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    547 Downloads (Pure)

    Abstract

    This paper addresses the problem of localising multiple competing speakers in the presence of room reverberation, where sound sources can be positioned at any azimuth on the horizontal plane. To reduce the amount of front-back confusions which can occur due to the sim- ilarity of interaural time differences (ITDs) and interaural level dif- ferences (ILDs) in the front and rear hemifield, a machine hearing system is presented which combines supervised learning of binaural cues using multi-conditional training (MCT) with a head movement strategy. A systematic evaluation showed that this approach substan- tially reduced the amount of front-back confusions in challenging acoustic scenarios. Moreover, the system was able to generalise to a variety of different acoustic conditions not seen during training.
    Original languageEnglish
    Title of host publicationProceedings of IEEE International Conference on Acoustics, Speech and Signal Processing
    Number of pages5
    PublisherIEEE
    Publication date2015
    Publication statusPublished - 2015
    Event2015 IEEE International Conference on Acoustics, Speech and Signal Processing - Brisbane, Australia
    Duration: 19 Apr 201524 Apr 2015
    Conference number: 40
    https://icassp2015.org/

    Conference

    Conference2015 IEEE International Conference on Acoustics, Speech and Signal Processing
    Number40
    Country/TerritoryAustralia
    CityBrisbane
    Period19/04/201524/04/2015
    Internet address

    Keywords

    • Binaural sound source localisation
    • Head movements
    • Multi-conditional training
    • Generalisation

    Fingerprint

    Dive into the research topics of 'ROBUST LOCALISATION OF MULTIPLE SPEAKERS EXPLOITING HEAD MOVEMENTS AND MULTI-CONDITIONAL TRAINING OF BINAURAL CUES'. Together they form a unique fingerprint.

    Cite this