Sense Meets Nonsense: a dual-layer Danish speech corpus for perception studies

Thomas Ulrich Christiansen, Peter Juel Henrichsen

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    439 Downloads (Pure)

    Abstract

    In this paper, we present the newly established Danish speech corpus PiTu. The corpus consists of recordings of 28 native Danish talkers (14 female and 14 male) each reproducing (i) a series of nonsense syllables, and (ii) a set of authentic natural language sentences. The speech corpus is tailored for investigating the relationship between early stages of the speech perceptual process and later stages. We present our considerations involved in preparing the experimental set-up, producing the anechoic recordings, compiling the data, and exploring the materials in linguistic research. We report on a small pilot experiment demonstrating how PiTu and similar speech corpora can be used in studies of prosody as a function of semantic content. The experiment addresses the issue of whether the governing principles of Danish prosody assignment is mainly talker-specific or mainly content-typical (under the specific experimental conditions).
    Original languageEnglish
    Title of host publication8th International Conference on Language Resources and Evaluation
    Number of pages6
    Publication date2012
    Publication statusPublished - 2012
    EventLREC 2012 Istanbul - Istanbul, Turkey
    Duration: 21 May 201227 May 2012

    Conference

    ConferenceLREC 2012 Istanbul
    Country/TerritoryTurkey
    CityIstanbul
    Period21/05/201227/05/2012

    Bibliographical note

    The corpus is available at http://amtoolbox.sourceforge.net/pitu/

    Keywords

    • Speech corpus
    • Danish language
    • Nonsense syllables
    • Prosodic structure
    • Corpus-based spoken language analysis

    Fingerprint

    Dive into the research topics of 'Sense Meets Nonsense: a dual-layer Danish speech corpus for perception studies'. Together they form a unique fingerprint.

    Cite this