Abstract
Time Series Representation Learning (TSRL) focuses on generating informative representations for various Time Series (TS) modeling tasks. Traditional Self-Supervised Learning (SSL) methods in TSRL fall into four main categories: reconstructive, adversarial, contrastive, and predictive, each with a common challenge of sensitivity to noise and intricate data nuances. Recently, diffusion-based methods have shown advanced generative capabilities. However, they primarily target specific application scenarios like imputation and forecasting, leaving a gap in leveraging diffusion models for generic TSRL. Our work, Time Series Diffusion Embedding (TSDE), bridges this gap as the first diffusion-based SSL TSRL approach. TSDE segments TS data into observed and masked parts using an Imputation-Interpolation-Forecasting (IIF) mask. It applies a trainable embedding function, featuring dual-orthogonal Transformer encoders with a crossover mechanism, to the observed part. We train a reverse diffusion process conditioned on the embeddings, designed to predict noise added to the masked part. Extensive experiments demonstrate TSDE's superiority in imputation, interpolation, forecasting, anomaly detection, classification, and clustering. We also conduct an ablation study, present embedding visualizations, and compare inference speed, further substantiating TSDE's efficiency and validity in learning representations of TS data.
Original language | English |
---|---|
Title of host publication | Proceedings of KDD ’24 |
Number of pages | 23 |
Publisher | Association for Computing Machinery |
Publication date | 2024 |
Pages | 2560-2571 |
ISBN (Electronic) | 979-8-4007-0490-1/24/08 |
DOIs | |
Publication status | Published - 2024 |
Event | ACM KDD 2024 - Barcelona, Spain Duration: 25 Aug 2024 → 29 Aug 2024 |
Conference
Conference | ACM KDD 2024 |
---|---|
Country/Territory | Spain |
City | Barcelona |
Period | 25/08/2024 → 29/08/2024 |
Keywords
- Multivariate time series
- Diffusion model
- Representation learning
- Self-supervised learning
- Imputation
- Interpolation
- Forecasting
- Anomaly detection
- Clustering
- Classification
- Time series modeling