Abstract
A unified approach to modelling multi-block regression data is presented. The starting point is a partition of the data X into L data blocks, X = (X_1, X_2, ..., X_L), and of the data Y into M data blocks, Y = (Y_1, Y_2, ..., Y_M). The methods of linear regression, X -> Y, are extended to the case of a linear relationship between each X_i and Y_j, X_i -> Y_j. A modelling strategy is used to decide whether the residual X_i should take part in the modelling of one or more of the Y_j. At each step, the score vectors are found by well-defined optimisation procedures. The optimisation principle is that the score vectors should make the resulting loading vectors of the Y_j as large as possible. The partitions of X and Y are independent of each other. A Y block can be chosen to be an X block, Y_i = X_i, thus including the possibility of modelling X -> X_i, i = 1, ..., L. It is shown how these methods can be extended to a network of data blocks. Examples of the optimisation procedures in a network are shown. The examples chosen are ones that are useful in industrial production environments. The methods are illustrated by simulated data and by data from cement production.
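To make the optimisation principle concrete, the following minimal sketch treats a single pair of blocks. It assumes one common reading of "loading vectors as large as possible": choose a unit-length weight vector w so that the score t = Xw maximises the norm of Y^T t, which is the usual PLS-type criterion. The abstract does not state the exact normalisation or the deflation scheme, so the function names (`first_component`, `deflate`), the simulated data, and the deflation step are illustrative assumptions, not the authors' algorithm.

```python
import numpy as np

def first_component(X, Y):
    """Return (w, t, q) for one X block and one Y block.

    Assumption: w is the unit vector maximising |Y^T X w|, i.e. the
    dominant left singular vector of X^T Y (a PLS-type criterion).
    """
    U, s, Vt = np.linalg.svd(X.T @ Y, full_matrices=False)
    w = U[:, 0]                # unit-length weight vector
    t = X @ w                  # score vector
    q = Y.T @ t / (t @ t)      # Y loading vector for this score
    return w, t, q

def deflate(X, t):
    """Remove the part of X explained by score t (assumed deflation step)."""
    p = X.T @ t / (t @ t)      # X loading vector
    return X - np.outer(t, p)

# Simulated data: one X block and one Y block that depends linearly on it.
rng = np.random.default_rng(0)
n = 50
X1 = rng.normal(size=(n, 4))
Y1 = X1 @ rng.normal(size=(4, 2)) + 0.1 * rng.normal(size=(n, 2))

w, t, q = first_component(X1, Y1)
X1_res = deflate(X1, t)        # residual X block, available for further Y blocks
```

In a multi-block setting, the residual `X1_res` would be the candidate for modelling further Y blocks; how that decision is made is part of the paper's modelling strategy and is not reproduced here.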
Original language | English |
---|---|
Journal | Journal of Chemometrics |
Volume | 20 |
Issue number | 8-10 |
Pages (from-to) | 376-385 |
ISSN | 0886-9383 |
Publication status | Published - 2006 |
Keywords
- multi-block data
- network of data blocks
- regression in networks
- regression
- optimisation of loadings