Sense Meets Nonsense: a dual-layer Danish speech corpus for perception studies

Publication: Research - peer-review; Article in proceedings – Annual report year: 2012



In this paper, we present the newly established Danish speech corpus PiTu. The corpus consists of recordings of 28 native Danish talkers (14 female and 14 male), each reproducing (i) a series of nonsense syllables and (ii) a set of authentic natural language sentences. The speech corpus is tailored for investigating the relationship between early and later stages of the speech perceptual process. We present the considerations involved in preparing the experimental set-up, producing the anechoic recordings, compiling the data, and exploring the materials in linguistic research. We report on a small pilot experiment demonstrating how PiTu and similar speech corpora can be used in studies of prosody as a function of semantic content. The experiment addresses the issue of whether the governing principles of Danish prosody assignment are mainly talker-specific or mainly content-typical (under the specific experimental conditions).
Original language: English
Title of host publication: 8th International Conference on Language Resources and Evaluation
Number of pages: 6
Publication date: 2012
State: Published - 2012


Conference: LREC 2012 Istanbul

Bibliographical note

The corpus is available at


  • Speech corpus, Danish language, Nonsense syllables, Prosodic structure, Corpus-based spoken language analysis

ID: 7644726