Abstract
Although the field of automatic speaker recognition (ASR) has been the subject of extensive research over the past decades, the lack of robustness against background noise has remained a major challenge. This paper describes a noise-robust speaker recognition system that combines missing data (MD) recognition with the adaptation of speaker models using a universal background model (UBM). For MD recognition, the identification of reliable and unreliable feature components is required. For this purpose, the signal-to-noise ratio (SNR) based mask estimation performance of various state-of-the art noise estimation techniques and noise reduction schemes is compared. Speaker recognition experiments show that the usage of a UBM in combination with missing data recognition yields substantial improvements in recognition performance, especially in the presence of highly non-stationary background noise at low SNRs.
| Original language | English |
|---|---|
| Journal | I E E E Transactions on Audio, Speech and Language Processing |
| Volume | 20 |
| Issue number | 1 |
| Pages (from-to) | 108-121 |
| ISSN | 1558-7916 |
| DOIs | |
| Publication status | Published - 2012 |
| Externally published | Yes |
Fingerprint
Dive into the research topics of 'Noise-Robust Speaker Recognition Combining Missing Data Techniques and Universal Background Modeling'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver