Fishing for meaningful units in connected speech

Publication: Research - peer-reviewArticle in proceedings – Annual report year: 2009

Standard

Fishing for meaningful units in connected speech. / Henrichsen, Peter Juel; Christiansen, Thomas Ulrich.

Proceedings of ISAAR 2009. ed. / Jörg Buchholz; Torsten Dau; Jakob Christensen-Dalsgaard; Torben Poulsen. 2009.

Publication: Research - peer-reviewArticle in proceedings – Annual report year: 2009

Harvard

Henrichsen, PJ & Christiansen, TU 2009, 'Fishing for meaningful units in connected speech'. in J Buchholz, T Dau, J Christensen-Dalsgaard & T Poulsen (eds), Proceedings of ISAAR 2009.

APA

Henrichsen, P. J., & Christiansen, T. U. (2009). Fishing for meaningful units in connected speech. In J. Buchholz, T. Dau, J. Christensen-Dalsgaard, & T. Poulsen (Eds.), Proceedings of ISAAR 2009.

CBE

Henrichsen PJ, Christiansen TU. 2009. Fishing for meaningful units in connected speech. Buchholz J, Dau T, Christensen-Dalsgaard J, Poulsen T, editors. In Proceedings of ISAAR 2009.

MLA

Henrichsen, Peter Juel and Thomas Ulrich Christiansen "Fishing for meaningful units in connected speech"., Buchholz, Jörg and Dau, Torsten Christensen-Dalsgaard, Jakob Poulsen, Torben (ed.). Proceedings of ISAAR 2009. 2009.

Vancouver

Henrichsen PJ, Christiansen TU. Fishing for meaningful units in connected speech. In Buchholz J, Dau T, Christensen-Dalsgaard J, Poulsen T, editors, Proceedings of ISAAR 2009. 2009.

Author

Henrichsen, Peter Juel; Christiansen, Thomas Ulrich / Fishing for meaningful units in connected speech.

Proceedings of ISAAR 2009. ed. / Jörg Buchholz; Torsten Dau; Jakob Christensen-Dalsgaard; Torben Poulsen. 2009.

Publication: Research - peer-reviewArticle in proceedings – Annual report year: 2009

Bibtex

@inbook{a504d2a11a3b4dda8513b221d718466c,
title = "Fishing for meaningful units in connected speech",
author = "Henrichsen, {Peter Juel} and Christiansen, {Thomas Ulrich}",
year = "2009",
editor = "Jörg Buchholz and Torsten Dau and Jakob Christensen-Dalsgaard and Torben Poulsen",
isbn = "87-990013-2-2",
booktitle = "Proceedings of ISAAR 2009",

}

RIS

TY - GEN

T1 - Fishing for meaningful units in connected speech

A1 - Henrichsen,Peter Juel

A1 - Christiansen,Thomas Ulrich

AU - Henrichsen,Peter Juel

AU - Christiansen,Thomas Ulrich

PY - 2009

Y1 - 2009

N2 - In many branches of spoken language analysis including ASR, the set of smallest meaningful units of speech is taken to coincide with the set of phones or phonemes. However, fishing for phones is difficult, error-prone, and computationally expensive. We present an experiment, based on machine learning, with an alternative approach. Instead of stipulating a basic set of target units, the determination of the set is considered to be part of the learning task. Given 18 recordings of Danish talkers performing a simple lab task, our algorithm produced a set of acoustically well-defined units sufficient for identifying all the major semantic elements (be they parts of words, words or several words), relevant to the task. As the sound encoding used was very simple – fundamental frequency (F0), Harmonicity-to-Noise-Ratio (HNR), and Intensity samples only – the computational complexity involved was far lower than for phonemic recognition. Our findings show that it is possible to automatically characterize a linguistic message, without detailed spectral information or presumptions about the target units. Further, fishing for simple meaningful cues and enhancing these selectively would potentially be a more effective way of achieving intelligibility transfer, which is the end goal for speech transducing technologies.

AB - In many branches of spoken language analysis including ASR, the set of smallest meaningful units of speech is taken to coincide with the set of phones or phonemes. However, fishing for phones is difficult, error-prone, and computationally expensive. We present an experiment, based on machine learning, with an alternative approach. Instead of stipulating a basic set of target units, the determination of the set is considered to be part of the learning task. Given 18 recordings of Danish talkers performing a simple lab task, our algorithm produced a set of acoustically well-defined units sufficient for identifying all the major semantic elements (be they parts of words, words or several words), relevant to the task. As the sound encoding used was very simple – fundamental frequency (F0), Harmonicity-to-Noise-Ratio (HNR), and Intensity samples only – the computational complexity involved was far lower than for phonemic recognition. Our findings show that it is possible to automatically characterize a linguistic message, without detailed spectral information or presumptions about the target units. Further, fishing for simple meaningful cues and enhancing these selectively would potentially be a more effective way of achieving intelligibility transfer, which is the end goal for speech transducing technologies.

SN - 87-990013-2-2

BT - Proceedings of ISAAR 2009

T2 - Proceedings of ISAAR 2009

A2 - Poulsen,Torben

ED - Poulsen,Torben

ER -