Auditory spatial analysis in reverberant multi-talker environments with congruent and incongruent audio-visual room information

Axel Ahrens*, Kasper Duemose Lund

*Corresponding author for this work

Research output: Contribution to journal, Journal article, Research, peer-reviewed

Abstract

In a multi-talker situation, listeners have the challenge of identifying a target speech source out of a mixture of interfering background noises. In the current study, it was investigated how listeners analyze audio-visual scenes with varying complexity in terms of number of talkers and reverberation. The visual information of the room was either congruent with the acoustic room or incongruent. The listeners' task was to locate an ongoing speech source in a mixture of other speech sources. The three-dimensional audio-visual scenarios were presented using a loudspeaker array and virtual reality glasses. It was shown that room reverberation, as well as the number of talkers in a scene, influence the ability to analyze an auditory scene in terms of accuracy and response time. Incongruent visual information of the room did not affect this ability. When few talkers were presented simultaneously, listeners were able to detect a target talker quickly and accurately even in adverse room acoustical conditions. Reverberation started to affect the response time when four or more talkers were presented. The number of talkers became a significant factor for five or more simultaneous talkers.

Original language: English
Journal: Journal of the Acoustical Society of America
Volume: 152
Issue number: 3
Pages (from-to): 1586-1594
ISSN: 0001-4966
Publication status: Published - 2022
