Projects per year
Abstract
Young listeners with a healthy auditory system are capable of understanding speech in the presence of one or several interfering voices, even in the most challenging listening scenarios. This ability is crucial for daily social life, but it is compromised in older listeners affected by sensorineural hearing loss, who often experience difficulties understanding speech in complex auditory environments, even when hearing-aid solutions are provided. Investigating auditory scenarios with multiple speakers is thus essential for revealing phenomena that can inspire the development of hearing-loss compensation strategies.
Among many auditory cues, the fundamental frequency (F0) and its differences between competing voices provide useful information that aids target speech intelligibility, which can be successfully utilized by normal-hearing (NH) listeners, whereas older hearing-impaired (HI) listeners have limited access to it. Evidence for the efficacy of F0 information has been obtained by means of laboratory simulations of competing-talker scenarios, with highly constrained speech stimuli that are not truly representative of the characteristics and the variety of realistic speech and therefore do not guarantee the reproducibility of the results in the diversity of auditory situations encountered in daily life.
This thesis aims to expand the available knowledge on the role of F0-related cues in competing-talker scenarios by using naturalistic everyday speech stimuli with a wide variability of F0 characteristics that is typical of realistic voices. The effects of differences in average F0 and F0 dynamic range between competing voices on speech perception were investigated in NH and HI listeners. For NH listeners, the measured effects of these cues on speech intelligibility were small or negligible. The average F0 difference between competing voices was found to only provide a speech-intelligibility benefit when energetic cues were limited or absent, which occurs especially when the competing voices have unrealistically similar syntactical structure and F0 trajectories. The effect on speech intelligibility induced by the difference in F0-dynamic-range between competing voices was found to be negligible. However, it was shown that the presence of a relatively large F0 dynamic range in at least one of the two competing sentences improved speech intelligibility, regardless of the difference in F0 dynamic range between sentences. For HI listeners, the inability to utilize these F0 cues was confirmed: compared to NH listeners, the benefit induced by an average F0 separation between competing voices was smaller and no significant effect of F0-dynamic-range of the sentences nor of their difference was observed. Finally, an analysis of the F0 properties of speech recordings from naturalistic dialogues was presented, providing a reference for the F0 properties of realistic speech and describing the changes in F0 properties that talkers produce in the presence of communication barriers such as background noise and hearing impairment.
This thesis contributes to the body of literature on the role of F0-related cues in communication scenarios, by proposing new methodologies that focus on the realism of the speech materials and on the numerical control of the experimental method. Overall, the results of this thesis suggest that in realistic competing-talker scenarios, the F0-related cues contribute to a holistic picture of the auditory scene that involves many auditory cues. In such scenarios, especially the F0 dynamic range of the individual competing sentences can affect speech intelligibility.
Among many auditory cues, the fundamental frequency (F0) and its differences between competing voices provide useful information that aids target speech intelligibility, which can be successfully utilized by normal-hearing (NH) listeners, whereas older hearing-impaired (HI) listeners have limited access to it. Evidence for the efficacy of F0 information has been obtained by means of laboratory simulations of competing-talker scenarios, with highly constrained speech stimuli that are not truly representative of the characteristics and the variety of realistic speech and therefore do not guarantee the reproducibility of the results in the diversity of auditory situations encountered in daily life.
This thesis aims to expand the available knowledge on the role of F0-related cues in competing-talker scenarios by using naturalistic everyday speech stimuli with a wide variability of F0 characteristics that is typical of realistic voices. The effects of differences in average F0 and F0 dynamic range between competing voices on speech perception were investigated in NH and HI listeners. For NH listeners, the measured effects of these cues on speech intelligibility were small or negligible. The average F0 difference between competing voices was found to only provide a speech-intelligibility benefit when energetic cues were limited or absent, which occurs especially when the competing voices have unrealistically similar syntactical structure and F0 trajectories. The effect on speech intelligibility induced by the difference in F0-dynamic-range between competing voices was found to be negligible. However, it was shown that the presence of a relatively large F0 dynamic range in at least one of the two competing sentences improved speech intelligibility, regardless of the difference in F0 dynamic range between sentences. For HI listeners, the inability to utilize these F0 cues was confirmed: compared to NH listeners, the benefit induced by an average F0 separation between competing voices was smaller and no significant effect of F0-dynamic-range of the sentences nor of their difference was observed. Finally, an analysis of the F0 properties of speech recordings from naturalistic dialogues was presented, providing a reference for the F0 properties of realistic speech and describing the changes in F0 properties that talkers produce in the presence of communication barriers such as background noise and hearing impairment.
This thesis contributes to the body of literature on the role of F0-related cues in communication scenarios, by proposing new methodologies that focus on the realism of the speech materials and on the numerical control of the experimental method. Overall, the results of this thesis suggest that in realistic competing-talker scenarios, the F0-related cues contribute to a holistic picture of the auditory scene that involves many auditory cues. In such scenarios, especially the F0 dynamic range of the individual competing sentences can affect speech intelligibility.
Original language | English |
---|
Publisher | DTU Health Technology |
---|---|
Number of pages | 151 |
Publication status | Published - 2022 |
Series | Contributions to Hearing Research |
---|---|
Volume | 56 |
Fingerprint
Dive into the research topics of 'Assessing the effects of fundamental-frequency dynamics on the intelligibility of competing voices'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Characterizing consequences of hearing impairment and hearing-aid on speech perception in competing-talker scenarios
Mesiano, P. A. (PhD Student), Dau, T. (Main Supervisor), Relaño-Iborra, H. (Supervisor) & Zaar, J. (Supervisor)
01/06/2018 → 30/09/2022
Project: PhD