Abstract
This paper addresses the problem of speech segregation by es-
timating the ideal binary mask (IBM) from noisy speech. Two
methods will be compared, one supervised learning approach
that incorporates
a priori
knowledge about the feature distri-
bution observed during training. The second method solely
relies on a frame-based speech presence probability (SPP) es-
timation, and therefore, does not depend on the acoustic con-
dition seen during training. We investigate the influence of
mismatches between the acoustic conditions used for training
and testing on the IBM estimation performance and discuss
the advantages of both approaches.
Original language | English |
---|---|
Title of host publication | Proceedings of IWAENC 2014 |
Number of pages | 5 |
Publisher | IEEE |
Publication date | 2014 |
ISBN (Print) | 9781479968084 |
DOIs | |
Publication status | Published - 2014 |
Event | The International Workshop on Acoustic Signal Enhancement - Juan les Pins, France Duration: 8 Sept 2014 → 11 Sept 2014 |
Conference
Conference | The International Workshop on Acoustic Signal Enhancement |
---|---|
Country/Territory | France |
City | Juan les Pins |
Period | 08/09/2014 → 11/09/2014 |
Keywords
- Ideal binary mask
- Speech segregation
- Generalization
- Speech presence probability