A Method for Conversational Signal-to-Noise Ratio Estimation in Real-World Sound Scenarios

Research output: Contribution to journalConference abstract in journalResearchpeer-review

134 Downloads (Pure)

Abstract

The analysis of conversational signal-to-noise ratios (SNRs) measured in real-world scenarios can provide vital insight into people's communicative strategies and difficulties and guide development of hearing devices. However, measuring SNRs accurately and realistically is challenging in typical recording conditions, where only a mixture of sound sources is captured. This study introduces a novel method for realistic in situ SNR estimation, where the speech signal of a person in natural conversation is captured by a cheek-mounted microphone, adjusted for free-field conditions, and convolved with a measured impulse response to estimate the clean speech component at the receiver. A microphone near the receiver computes the noise-only component by applying a voice activity detector. Obtained SNR values are analyzed using in situ recordings of a real-world workspace meeting. It is shown that the temporal resolution is increased, and fluctuations in the speech level are more accurately tracked compared to a typical spectral-subtraction-derived method. The application of the proposed SNR estimation method may be valuable for compensation procedures in hearing instruments that take conversational dynamics into account.

Original languageEnglish
JournalAcoustical Society of America. Journal
Volume145
Pages (from-to)1873
ISSN0001-4966
DOIs
Publication statusPublished - 2019
Event177th Meeting of the Acoustical Society of America - The Galt House, Louisville, United States
Duration: 13 May 201917 May 2019
Conference number: 177
https://acousticalsociety.org/program-of-177th-meeting/

Conference

Conference177th Meeting of the Acoustical Society of America
Number177
LocationThe Galt House
CountryUnited States
CityLouisville
Period13/05/201917/05/2019
Internet address

Cite this

@article{2d79677471fa449e88ab928a73629ed1,
title = "A Method for Conversational Signal-to-Noise Ratio Estimation in Real-World Sound Scenarios",
abstract = "The analysis of conversational signal-to-noise ratios (SNRs) measured in real-world scenarios can provide vital insight into people's communicative strategies and difficulties and guide development of hearing devices. However, measuring SNRs accurately and realistically is challenging in typical recording conditions, where only a mixture of sound sources is captured. This study introduces a novel method for realistic in situ SNR estimation, where the speech signal of a person in natural conversation is captured by a cheek-mounted microphone, adjusted for free-field conditions, and convolved with a measured impulse response to estimate the clean speech component at the receiver. A microphone near the receiver computes the noise-only component by applying a voice activity detector. Obtained SNR values are analyzed using in situ recordings of a real-world workspace meeting. It is shown that the temporal resolution is increased, and fluctuations in the speech level are more accurately tracked compared to a typical spectral-subtraction-derived method. The application of the proposed SNR estimation method may be valuable for compensation procedures in hearing instruments that take conversational dynamics into account.",
author = "Naim Mansour and Marton Marschall and Tobias May and Adam Westermann and Torsten Dau",
year = "2019",
doi = "10.1121/1.5101769",
language = "English",
volume = "145",
pages = "1873",
journal = "Acoustical Society of America. Journal",
issn = "0001-4966",
publisher = "A I P Publishing LLC",

}

A Method for Conversational Signal-to-Noise Ratio Estimation in Real-World Sound Scenarios. / Mansour, Naim; Marschall, Marton; May, Tobias; Westermann, Adam; Dau, Torsten.

In: Acoustical Society of America. Journal, Vol. 145, 2019, p. 1873.

Research output: Contribution to journalConference abstract in journalResearchpeer-review

TY - ABST

T1 - A Method for Conversational Signal-to-Noise Ratio Estimation in Real-World Sound Scenarios

AU - Mansour, Naim

AU - Marschall, Marton

AU - May, Tobias

AU - Westermann, Adam

AU - Dau, Torsten

PY - 2019

Y1 - 2019

N2 - The analysis of conversational signal-to-noise ratios (SNRs) measured in real-world scenarios can provide vital insight into people's communicative strategies and difficulties and guide development of hearing devices. However, measuring SNRs accurately and realistically is challenging in typical recording conditions, where only a mixture of sound sources is captured. This study introduces a novel method for realistic in situ SNR estimation, where the speech signal of a person in natural conversation is captured by a cheek-mounted microphone, adjusted for free-field conditions, and convolved with a measured impulse response to estimate the clean speech component at the receiver. A microphone near the receiver computes the noise-only component by applying a voice activity detector. Obtained SNR values are analyzed using in situ recordings of a real-world workspace meeting. It is shown that the temporal resolution is increased, and fluctuations in the speech level are more accurately tracked compared to a typical spectral-subtraction-derived method. The application of the proposed SNR estimation method may be valuable for compensation procedures in hearing instruments that take conversational dynamics into account.

AB - The analysis of conversational signal-to-noise ratios (SNRs) measured in real-world scenarios can provide vital insight into people's communicative strategies and difficulties and guide development of hearing devices. However, measuring SNRs accurately and realistically is challenging in typical recording conditions, where only a mixture of sound sources is captured. This study introduces a novel method for realistic in situ SNR estimation, where the speech signal of a person in natural conversation is captured by a cheek-mounted microphone, adjusted for free-field conditions, and convolved with a measured impulse response to estimate the clean speech component at the receiver. A microphone near the receiver computes the noise-only component by applying a voice activity detector. Obtained SNR values are analyzed using in situ recordings of a real-world workspace meeting. It is shown that the temporal resolution is increased, and fluctuations in the speech level are more accurately tracked compared to a typical spectral-subtraction-derived method. The application of the proposed SNR estimation method may be valuable for compensation procedures in hearing instruments that take conversational dynamics into account.

U2 - 10.1121/1.5101769

DO - 10.1121/1.5101769

M3 - Conference abstract in journal

VL - 145

SP - 1873

JO - Acoustical Society of America. Journal

JF - Acoustical Society of America. Journal

SN - 0001-4966

ER -