Abstract
This paper addresses the problem of localising multiple competing speakers in the presence of room reverberation, where sound sources can be positioned at any azimuth in the horizontal plane. To reduce the number of front-back confusions, which can occur because interaural time differences (ITDs) and interaural level differences (ILDs) are similar in the front and rear hemifields, a machine hearing system is presented that combines supervised learning of binaural cues using multi-conditional training (MCT) with a head movement strategy. A systematic evaluation showed that this approach substantially reduced the number of front-back confusions in challenging acoustic scenarios. Moreover, the system was able to generalise to a variety of acoustic conditions not seen during training.
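
The front-back ambiguity and the benefit of a head movement can be illustrated with a toy model. The sketch below is not the system described in the paper (which learns binaural cues with MCT); it assumes a far-field, two-sensor sine-law ITD model with no head shadow, no ILDs and no reverberation. The constants and the function names (`itd`, `candidates_from_itd`, `resolve_with_head_turn`) are hypothetical and chosen only for illustration.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s (assumed value)
EAR_DISTANCE = 0.18     # m, assumed spacing between the two sensors


def itd(source_az_deg, head_az_deg=0.0):
    """ITD in seconds under the assumed sine law, for a given head orientation."""
    rel = math.radians(source_az_deg - head_az_deg)
    return EAR_DISTANCE / SPEED_OF_SOUND * math.sin(rel)


def ang_dist(a_deg, b_deg):
    """Smallest absolute difference between two angles, in degrees."""
    return abs((a_deg - b_deg + 180.0) % 360.0 - 180.0)


def candidates_from_itd(observed_itd, head_az_deg=0.0):
    """Return the front and rear world-frame azimuths consistent with one ITD."""
    s = observed_itd * SPEED_OF_SOUND / EAR_DISTANCE
    s = max(-1.0, min(1.0, s))          # guard against rounding outside [-1, 1]
    front = math.degrees(math.asin(s))  # head-relative front candidate
    back = 180.0 - front                # its mirror in the rear hemifield
    return [(front + head_az_deg) % 360.0, (back + head_az_deg) % 360.0]


def resolve_with_head_turn(true_az_deg, turn_deg=20.0):
    """Keep the candidate that stays consistent after rotating the head."""
    before = candidates_from_itd(itd(true_az_deg, 0.0), 0.0)
    after = candidates_from_itd(itd(true_az_deg, turn_deg), turn_deg)
    return min(before, key=lambda a: min(ang_dist(a, b) for b in after))


if __name__ == "__main__":
    # A source at 30 degrees and its rear mirror at 150 degrees share one ITD.
    print(itd(30.0), itd(150.0))
    # After a head turn only the true azimuth remains a candidate.
    print(resolve_with_head_turn(30.0), resolve_with_head_turn(150.0))
```

In this toy model a static head yields two candidate azimuths for every ITD, while a rotation shifts the phantom candidate and leaves the true source direction fixed in world coordinates, which is the intuition behind combining cue-based localisation with head movements.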
Original language | English |
---|---|
Title of host publication | Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing |
Number of pages | 5 |
Publisher | IEEE |
Publication date | 2015 |
Publication status | Published - 2015 |
Event | 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, Brisbane, Australia. Duration: 19 Apr 2015 → 24 Apr 2015. Conference number: 40. https://icassp2015.org/ |
Conference
Conference | 2015 IEEE International Conference on Acoustics, Speech and Signal Processing |
---|---|
Number | 40 |
Country/Territory | Australia |
City | Brisbane |
Period | 19/04/2015 → 24/04/2015 |
Internet address | https://icassp2015.org/ |
Keywords
- Binaural sound source localisation
- Head movements
- Multi-conditional training
- Generalisation