The role of temporal resolution in modulation-based speech segregation

Tobias May, Thomas Bentsen, Torsten Dau

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    347 Downloads (Pure)

    Abstract

    This study is concerned with the challenge of automatically segregating a target speech signal from interfering background noise. A computational speech segregation system is presented which exploits logarithmically-scaled amplitude modulation spectrogram (AMS) features to distinguish between speech and noise activity on the basis of individual time-frequency (T-F) units. One important parameter of the segregation system is the window duration of the analysis-synthesis stage, which determines the lower limit of modulation frequencies that can be represented but also the temporal acuity with which the segregation system can manipulate individual T-F units. To clarify the consequences of this trade-off on modulation-based speech segregation performance, the influence of the window duration was systematically investigated
    Original languageEnglish
    Title of host publicationProceedings of Interspeech 2015
    Number of pages5
    Publication date2015
    Publication statusPublished - 2015
    EventINTERSPEECH 2015 : Speech beyond Speech - Dresden, Germany
    Duration: 6 Sept 201510 Sept 2015

    Conference

    ConferenceINTERSPEECH 2015
    Country/TerritoryGermany
    CityDresden
    Period06/09/201510/09/2015

    Keywords

    • Speech segregation
    • Ideal binary mask
    • Amplitude modulation spectrogram features
    • Temporal resolution

    Fingerprint

    Dive into the research topics of 'The role of temporal resolution in modulation-based speech segregation'. Together they form a unique fingerprint.

    Cite this