Predicting the effect of spectral subtraction on the speech recognition threshold based on the signal-to-noise ratio in the envelope domain

Søren Jørgensen, Torsten Dau

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Abstract

Digital noise reduction strategies are important in technical devices such as hearing aids and mobile phones. One well-described noise reduction scheme is the spectral subtraction algorithm. Many versions of the spectral subtraction scheme have been presented in the literature, but the methods have rarely been evaluated perceptually in terms of speech intelligibility. This study analyzed the effects of the spectral subtraction strategy proposed by Berouti at al. [ICASSP 4 (1979), 208-211] on the speech recognition threshold (SRT) obtained with sentences presented in stationary speech-shaped noise. The SRT was measured in five normal-hearing listeners in six conditions of spectral subtraction. The results showed an increase of the SRT after processing, i.e. a decreased speech intelligibility, in contrast to what is predicted by the Speech Transmission Index (STI). Here, another approach is proposed, denoted the speech-based envelope power spectrum model (sEPSM) which predicts the intelligibility based on the signal-to-noise ratio in the envelope domain. In contrast to the STI, the sEPSM is sensitive to the increased amount of the noise envelope power as a consequence of the spectral subtraction operation, which leads to a decreased speech intelligibility in this model, in quantitative agreement with the experimental data.
Original languageEnglish
Title of host publicationForum Acusticum 2011
Publication date2011
Publication statusPublished - 2011
EventForum Acusticum 2011 - Aalborg, Denmark
Duration: 26 Jun 20111 Jul 2011
http://www.fa2011.org/

Conference

ConferenceForum Acusticum 2011
Country/TerritoryDenmark
CityAalborg
Period26/06/201101/07/2011
Internet address

Fingerprint

Dive into the research topics of 'Predicting the effect of spectral subtraction on the speech recognition threshold based on the signal-to-noise ratio in the envelope domain'. Together they form a unique fingerprint.

Cite this