Korean articulatory speech synthesis using physical vocal tract model

Huynh Van Luong, Jong-Myon Kim, Cheol Hong Kim

Research output: Contribution to journal › Conference abstract in journal › Research › peer-review

Abstract

Artificial vocal tract models can support second-language learning and the therapy of speech disorders. Phonetic education and research can also benefit from articulatory speech synthesis. Articulatory speech synthesis models are built on the source‐filter model of the human vocal tract. In this study, we generated a Korean articulatory speech synthesis model using Artisynth [Fels et al., ISSP, 419–426 (2006)], a 3‐D biomechanical open‐source simulation platform. Korean has 10 basic vowel phonemes and 11 complex vowels, some rounded and some unrounded, such as ∕eu∕, ∕yeo∕, and ∕wae∕. To synthesize these vowels, we created a new physical vocal tract model whose components interconnect to form a complete, integrated biomechanical system. The model supports recording Korean vowel sounds and analyzing them linguistically with the linear prediction model. As a result, the parameters of the glottis and of the controllable vocal tract filter are evaluated automatically. The acoustic quality of the synthesizer for Korean vowels is comparable with that of existing commercial speech synthesis systems such as the concatenation synthesizers of [Donovan (1996)] and [Hamza (2000)].
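The abstract does not give implementation details, so the following is only a minimal sketch of the two ideas it names: source‐filter synthesis and linear prediction analysis. It is not the authors' Artisynth model. The function names, the 8 kHz sampling rate, the two formant values (700 Hz and 1200 Hz), their bandwidths, and the prediction order of 4 are all illustrative assumptions. An impulse train stands in for the glottal source, an all-pole filter stands in for the vocal tract, and the autocorrelation method (Levinson-Durbin recursion) estimates the filter coefficients from the resulting signal.

```python
import numpy as np

def formants_to_allpole(formants_hz, bandwidths_hz, fs):
    """Build A(z) for an all-pole filter with one resonance per formant."""
    a = np.array([1.0])
    for f, bw in zip(formants_hz, bandwidths_hz):
        r = np.exp(-np.pi * bw / fs)      # pole radius from bandwidth
        theta = 2.0 * np.pi * f / fs      # pole angle from centre frequency
        a = np.convolve(a, [1.0, -2.0 * r * np.cos(theta), r * r])
    return a

def all_pole_filter(excitation, a):
    """Filter the source through 1 / A(z), A(z) = 1 + a[1] z^-1 + ... + a[p] z^-p."""
    p = len(a) - 1
    y = np.zeros(len(excitation))
    for n in range(len(excitation)):
        acc = excitation[n]
        for k in range(1, min(p, n) + 1):
            acc -= a[k] * y[n - k]
        y[n] = acc
    return y

def impulse_train(f0_hz, fs, n_samples):
    """Idealised voiced source: one unit impulse per pitch period."""
    src = np.zeros(n_samples)
    src[:: int(round(fs / f0_hz))] = 1.0
    return src

def lpc_autocorrelation(signal, order):
    """Estimate A(z) with the autocorrelation method (Levinson-Durbin)."""
    n = len(signal)
    r = np.array([np.dot(signal[: n - k], signal[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err                    # reflection coefficient
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k                # remaining prediction error energy
    return a, err

def allpole_to_formants(a, fs):
    """Read resonance frequencies (Hz) from the upper-half-plane poles of 1 / A(z)."""
    roots = np.roots(a)
    roots = roots[np.imag(roots) > 0]
    return np.sort(np.angle(roots) * fs / (2.0 * np.pi))

if __name__ == "__main__":
    fs = 8000                             # assumed sampling rate (Hz)
    a_true = formants_to_allpole([700.0, 1200.0], [80.0, 90.0], fs)
    vowel = all_pole_filter(impulse_train(120.0, fs, 4000), a_true)
    a_est, _ = lpc_autocorrelation(vowel * np.hamming(len(vowel)), order=4)
    print("estimated formants (Hz):", allpole_to_formants(a_est, fs))
```

In this sketch, the estimated coefficients play the role of the vocal tract filter parameters, and the residual energy returned alongside them is a rough proxy for the source strength. How the authors actually derive glottis and filter parameters from their model is not stated in the abstract.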
Original language: English
Journal: Journal of the Acoustical Society of America
Volume: 125
Issue number: 4
Pages (from-to): 2498
Number of pages: 1
ISSN: 0001-4966
Publication status: Published - 2009
Externally published: Yes
Event: 157th Meeting of the Acoustical Society of America - Portland, Oregon, United States
Duration: 18 May 2009 to 22 May 2009
Conference number: 157

Conference

Conference: 157th Meeting of the Acoustical Society of America
Number: 157
Country/Territory: United States
City: Portland, Oregon
Period: 18/05/2009 to 22/05/2009
