Temporal analysis of text data using latent variable models

Lasse Lohilahti Mølgaard, Jan Larsen, Cyril Goutte

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    811 Downloads (Pure)

    Abstract

    Detecting and tracking of temporal data is an important task in multiple applications. In this paper we study temporal text mining methods for Music Information Retrieval. We compare two ways of detecting the temporal latent semantics of a corpus extracted from Wikipedia, using a stepwise Probabilistic Latent Semantic Analysis (PLSA) approach and a global multiway PLSA method. The analysis indicates that the global analysis method is able to identify relevant trends which are difficult to get using a step-by-step approach. Furthermore we show that inspection of PLSA models with different number of factors may reveal the stability of temporal clusters making it possible to choose the relevant number of factors.
    Original languageEnglish
    Title of host publication2009 IEEE International Workshop on MACHINE LEARNING FOR SIGNAL PROCESSING. : Formerly the IEEE Workshop on Neural Networks for Signal Processing
    PublisherIEEE
    Publication date2009
    ISBN (Print)978-1-4244-4947-7
    DOIs
    Publication statusPublished - 2009
    Event2009 IEEE International Workshop on Machine Learning for Signal Processing - Grenoble, France
    Duration: 1 Sept 20094 Sept 2009
    Conference number: 19
    https://ieeexplore.ieee.org/xpl/conhome/5290615/proceeding

    Workshop

    Workshop2009 IEEE International Workshop on Machine Learning for Signal Processing
    Number19
    Country/TerritoryFrance
    CityGrenoble
    Period01/09/200904/09/2009
    Internet address

    Bibliographical note

    Copyright 2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

    Fingerprint

    Dive into the research topics of 'Temporal analysis of text data using latent variable models'. Together they form a unique fingerprint.

    Cite this