On the structure of dynamic principal component analysis used in statistical process monitoring

Erik Vanhatalo, Murat Kulahci, Bjarne Bergquist

Research output: Contribution to journalJournal articleResearchpeer-review

369 Downloads (Pure)

Abstract

When principal component analysis (PCA) is used for statistical process monitoring it relies on the assumption that data are time independent. However, industrial data will often exhibit serial correlation. Dynamic PCA (DPCA) has been suggested as a remedy for high-dimensional and time-dependent data. In DPCA the input matrix is augmented by adding time-lagged values of the variables. In building a DPCA model the analyst needs to decide on (1) the number of lags to add, and (2) given a specific lag structure, how many principal components to retain. In this article we propose a new analyst driven method to determine the maximum number of lags in DPCA with a foundation in multivariate time series analysis. The method is based on the behavior of the eigenvalues of the lagged autocorrelation and partial autocorrelation matrices. Given a specific lag structure we also propose a method for determining the number of principal components to retain. The number of retained principal components is determined by visual inspection of the serial correlation in the squared prediction error statistic, Q (SPE), together with the cumulative explained variance of the model. The methods are illustrated using simulated vector autoregressive and moving average data, and tested on Tennessee Eastman process data.
Original languageEnglish
JournalChemometrics and Intelligent Laboratory Systems
Volume167
Pages (from-to)1-11
Number of pages11
ISSN0169-7439
DOIs
Publication statusPublished - 2017

Keywords

  • Dynamic principal component analysis
  • Vector autoregressive process
  • Vector moving average process
  • Autocorrelation
  • Simulation
  • Tennessee Eastman process simulator

Fingerprint

Dive into the research topics of 'On the structure of dynamic principal component analysis used in statistical process monitoring'. Together they form a unique fingerprint.

Cite this