SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals

  • Rahul Thapa*
  • , Bryan He
  • , Magnus Ruud Kjær
  • , Hyatt Moore
  • , Gauri Ganjoo
  • , Emmanuel Mignot
  • , James Zou
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

93 Downloads (Orbit)

Abstract

Sleep is a complex physiological process evaluated through various modalities recording electrical brain, cardiac, and respiratory activities. We curate a large polysomnography dataset from over 14,000 participants comprising over 100,000 hours of multi-modal sleep recordings. Leveraging this extensive dataset, we developed SleepFM, the first multi-modal foundation model for sleep analysis. We show that a novel leave-one-out approach for contrastive learning significantly improves downstream task performance compared to representations from standard pairwise contrastive learning. A logistic regression model trained on SleepFM’s learned embeddings outperforms an end-to-end trained convolutional neural network (CNN) on sleep stage classification (macro AUROC 0.88 vs 0.72 and macro AUPRC 0.72 vs 0.48) and sleep disordered breathing detection (AUROC 0.85 vs 0.69 and AUPRC 0.77 vs 0.61). Notably, the learned embeddings achieve 48% top-1 average accuracy in retrieving modality clip pairs from 90,000 candidates. This work demonstrates the value of holistic multi-modal sleep modeling to fully capture the richness of sleep recordings. SleepFM is open source and available at https://github.com/rthapa84/sleepfmcodebase.

Original languageEnglish
Title of host publicationProceedings of the 41st International Conference on Machine Learning
Volume235
PublisherProceedings of Machine Learning Research
Publication date2024
Pages48019-48037
Article number1961
Publication statusPublished - 2024
Event41st International Conference on Machine Learning - Vienna, Austria
Duration: 21 Jul 202427 Jul 2024

Conference

Conference41st International Conference on Machine Learning
Country/TerritoryAustria
CityVienna
Period21/07/202427/07/2024
SeriesProceedings of Machine Learning Research
ISSN2640-3498

Fingerprint

Dive into the research topics of 'SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals'. Together they form a unique fingerprint.

Cite this