Cognitive componets of speech at different time scales

Ling Feng, Lars Kai Hansen

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    251 Downloads (Pure)

    Abstract

    Cognitive component analysis (COCA) is defined as unsupervised grouping of data leading to a group structure well aligned with that resulting from human cognitive activity. We focus here on speech at different time scales looking for possible hidden ‘cognitive structure’. Statistical regularities have earlier been revealed at multiple time scales corresponding to: phoneme, gender, height and speaker identity. We here show that the same simple unsupervised learning algorithm can detect these cues. Our basic features are 25-dimensional short time Mel-frequency weighted cepstral coefficients, assumed to model the basic representation of the human auditory system. The basic features are aggregated in time to obtain features at longer time scales. Simple energy based filtering is used to achieve a sparse representation. Our hypothesis is now basically ecological: We hypothesize that features that are essentially independent in a reasonable ensemble can be efficiently coded using a sparse independent component representation. The representations are indeed shown to be very similar between supervised learning (invoking cognitive activity) and unsupervised learning (statistical regularities), hence lending additional support to our cognitive component hypothesis.
    Original languageEnglish
    Title of host publicationTwenty-Ninth Meeting of the Cognitive Science Society (CogSci'07)
    Publication date2007
    Pages983-988
    ISBN (Print)978-0-9768318-3-9
    Publication statusPublished - 2007
    Event29th Annual Conference of the Cognitive Science Society - Nashville, United States
    Duration: 1 Aug 20074 Aug 2007
    Conference number: 29

    Conference

    Conference29th Annual Conference of the Cognitive Science Society
    Number29
    Country/TerritoryUnited States
    CityNashville
    Period01/08/200704/08/2007

    Keywords

    • Time Scales
    • Unsupervised Learning
    • Energy Based Sparsification
    • Cognitive component analysis
    • Statistical Regularity
    • Supervised Learning

    Fingerprint

    Dive into the research topics of 'Cognitive componets of speech at different time scales'. Together they form a unique fingerprint.

    Cite this