ROBUST LOCALISATION OF MULTIPLE SPEAKERS EXPLOITING HEAD MOVEMENTS AND MULTI-CONDITIONAL TRAINING OF BINAURAL CUES

Tobias May, Ning Ma, Guy Brown

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

377 Downloads (Pure)

Abstract

This paper addresses the problem of localising multiple competing speakers in the presence of room reverberation, where sound sources can be positioned at any azimuth on the horizontal plane. To reduce the amount of front-back confusions which can occur due to the sim- ilarity of interaural time differences (ITDs) and interaural level dif- ferences (ILDs) in the front and rear hemifield, a machine hearing system is presented which combines supervised learning of binaural cues using multi-conditional training (MCT) with a head movement strategy. A systematic evaluation showed that this approach substan- tially reduced the amount of front-back confusions in challenging acoustic scenarios. Moreover, the system was able to generalise to a variety of different acoustic conditions not seen during training.
Original languageEnglish
Title of host publicationProceedings of IEEE International Conference on Acoustics, Speech and Signal Processing
Number of pages5
PublisherIEEE
Publication date2015
Publication statusPublished - 2015
Event40th IEEE International Conference on Acoustics, Speech and Signal Processing - Brisbane, Australia
Duration: 19 Apr 201524 Apr 2015

Conference

Conference40th IEEE International Conference on Acoustics, Speech and Signal Processing
CountryAustralia
CityBrisbane
Period19/04/201524/04/2015

Keywords

  • Binaural sound source localisation
  • Head movements
  • Multi-conditional training
  • Generalisation

Fingerprint

Dive into the research topics of 'ROBUST LOCALISATION OF MULTIPLE SPEAKERS EXPLOITING HEAD MOVEMENTS AND MULTI-CONDITIONAL TRAINING OF BINAURAL CUES'. Together they form a unique fingerprint.

Cite this