Feature selection in jump models

Peter Nystrup*, Petter N. Kolm, Erik Lindström

*Corresponding author for this work

Research output: Contribution to journalJournal articleResearchpeer-review

1 Downloads (Pure)

Abstract

Jump models switch infrequently between states to fit a sequence of data while taking the ordering of the data into account We propose a new framework for joint feature selection, parameter and state-sequence estimation in jump models. Feature selection is necessary in high-dimensional settings where the number of features is large compared to the number of observations and the underlying states differ only with respect to a subset of the features. We develop and implement a coordinate descent algorithm that alternates between selecting the features and estimating the model parameters and state sequence, which scales to large data sets with large numbers of (noisy) features. We demonstrate the usefulness of the proposed framework by comparing it with a number of other methods on both simulated and real data in the form of financial returns, protein sequences, and text. By leveraging information embedded in the ordering of the data, the resulting sparse jump model outperforms all other methods considered and is remarkably robust to noise.
Original languageEnglish
Article number115558
JournalExpert Systems with Applications
Volume184
Number of pages14
ISSN0957-4174
DOIs
Publication statusPublished - 2021

Keywords

  • High-dimensional
  • Sequential data
  • Time series
  • Clustering
  • Unsupervised learning
  • Regime switching

Fingerprint

Dive into the research topics of 'Feature selection in jump models'. Together they form a unique fingerprint.

Cite this