ClusterSignificance: A bioconductor package facilitating statistical analysis of class cluster separations in dimensionality reduced data

Jason T. Serviss, Jesper R. Gådin, Per Eriksson, Lasse Folkersen, Dan Grandér*

*Corresponding author for this work

    Research output: Contribution to journalJournal articleResearchpeer-review

    Abstract

    Summary Multi-dimensional data generated via high-throughput experiments is increasingly used in conjunction with dimensionality reduction methods to ascertain if resulting separations of the data correspond with known classes. This is particularly useful to determine if a subset of the variables, e.g. genes in a specific pathway, alone can separate samples into these established classes. Despite this, the evaluation of class separations is often subjective and performed via visualization. Here we present the ClusterSignificance package; a set of tools designed to assess the statistical significance of class separations downstream of dimensionality reduction algorithms. In addition, we demonstrate the design and utility of the ClusterSignificance package and utilize it to determine the importance of long non-coding RNA expression in the identity of multiple hematological malignancies.

    Original languageEnglish
    JournalBioinformatics
    Volume33
    Issue number19
    Pages (from-to)3126-3128
    ISSN1367-4803
    DOIs
    Publication statusPublished - 2017

    Cite this