Abstract
The issue of missing data in supervised learning has been largely overlooked, especially in the deep learning community. We investigate strategies to adapt neural architectures for handling missing values. Here, we focus on regression and classification problems where the features are assumed to be missing at random. Of particular interest are schemes that allow reusing as-is a neural discriminative architecture. To address supervised deep learning with missing values, we propose to marginalize over missing values in a joint model of covariates and outcomes. Thereby, we leverage both the flexibility of deep generative models to describe the distribution of the covariates and the power of purely discriminative models to make predictions. More precisely, a deep latent variable model can be learned jointly with the discriminative model, using importance-weighted variational inference, essentially using importance sampling to mimick averaging over multiple imputations. In low-capacity regimes, or when the discriminative model has a strong inductive bias, we find that our hybrid generative/discriminative approach generally outperforms single imputations methods.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of 2022 International Conference on Learning Representations |
| Number of pages | 30 |
| Publication date | 2022 |
| Publication status | Published - 2022 |
| Event | 10th International Conference on Learning Representations - Virtual event Duration: 25 Apr 2022 → 29 Apr 2022 Conference number: 10 https://iclr.cc/Conferences/2022 |
Conference
| Conference | 10th International Conference on Learning Representations |
|---|---|
| Number | 10 |
| Location | Virtual event |
| Period | 25/04/2022 → 29/04/2022 |
| Internet address |
Fingerprint
Dive into the research topics of 'How to deal with missing data in supervised deep learning?'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver