Modelling of multi-block data

Agnar Høskuldsson, K. Svinning

    Research output: Contribution to journalJournal articleResearchpeer-review

    Abstract

    Here is presented a unified approach to modelling multi-block regression data. The starting point is a partition of the data X into L data blocks, X = (X-1, X-2,...X-L), and the data Y into M data-blocks, Y = (Y-1, Y-2,...,Y-M). The methods of linear regression, X -> Y, are extended to the case of a linear relationship between each X-i and Y-j. X-i -> Y-j. A modelling strategy is used to decide if the residual X-i should take part in the modelling of one or more Y(j)s. At each step the procedure of finding score vectors is based on well-defined optimisation procedures. The principle of optimisation is based on that the score vectors should give the sizes of the resulting Y(j)s loading vectors as large as possible. The partition of X and Y are independent of each other. The choice of Y-j can be X-j, Y-i = X-i, thus including the possibility of modelling X -> X-i,i=1,...,L. It is shown how these methods can be extended to a network of data blocks. Examples of the optimisation procedures in a network are shown. The examples chosen are the ones that are useful to work within industrial production environments. The methods are illustrated by simulated data and data from cement production.
    Original languageEnglish
    JournalJournal of Chemometrics
    Volume20
    Issue number8-10
    Pages (from-to)376-385
    ISSN0886-9383
    Publication statusPublished - 2006

    Keywords

    • multi-block data
    • network of data blocks
    • regression in networks
    • regression
    • optimisation of loadings

    Fingerprint Dive into the research topics of 'Modelling of multi-block data'. Together they form a unique fingerprint.

    Cite this