Handling Missing Values in Healthcare Settings
In statistical modeling, it is common to encounter missing values, particularly in healthcare situations where vital sign measurements may be sporadic. This can lead to difficulties in accurately evaluating a patient’s health and making informed decisions about their treatment. One possible solution is to use imputation methods to fill in the missing values and enable counterfactual predictions in healthcare settings. To address this issue, we have conducted a study examining existing approaches to missing value imputation in healthcare settings, with a specific focus on time series data. We provide an overview of both classical (e.g. MICE, MissForest, Gaussian Processes), and deep learning-based (e.g. Recurrent Neural Networks, Generative Adversarial Networks, Autoencoders) imputation methods, assess their strengths and limitations within the context of time series data, and highlight potential areas for further investigation.
Model | Year | GP | RNN | CNN | TF | GAN | ODE | AE | Indicator | Datasets |
---|---|---|---|---|---|---|---|---|---|---|
MTGP | 2015 | y | TBI, MIMIC-II | |||||||
GASF, GADF, MTF | 2015 | y | Gun Point, CBF, Swedish Leaf, ECG, 7 Misc | |||||||
LSTM | 2016 | y | y | PICU at Children’s Hospital LA | ||||||
MTGP | 2017 | y | Duke University Hospital | |||||||
M-RNN | 2017 | y | MIMIC-III, Deterioration, UNOS-Heart, UNOS-Lung, UK Biobank | |||||||
BRITS | 2018 | y | y | PhysioNet, Beijing Air Quality, Human Activity | ||||||
GRU-D | 2018 | y | y | PhysioNet, MIMIC-III, Gesture | ||||||
GAIN | 2018 | y | Breast Cancer, Spam, Letter, Credit, News | |||||||
GP-VAE | 2019 | y | y | y | PhysioNet, Healing MNIST, SPRITES | |||||
T-CGAN | 2019 | y | Starlight Curves, Power Demand, ECG200 | |||||||
Imp-GAIN | 2019 | y | Insomnia | |||||||
Latent ODE | 2019 | y | y | PhysioNet, MuJoCo, Human Activity | ||||||
VaDER | 2019 | y | y | y | ADNI, PPMI | |||||
TKAE | 2019 | y | y | PhysioNet, ECG, EHR | ||||||
ODE-GRU-D | 2020 | y | y | y | PhysioNet | |||||
RBM | 2020 | y | Acute Abdomen Taiwan | |||||||
Multitask LSTM | 2020 | y | PhysioNet | |||||||
HeartImp | 2020 | y | Garmin, Fitbit | |||||||
GRU-DF | 2020 | y | CLIMB (Multiple Sclerosis) | |||||||
TAME | 2020 | y | y | MMIC-III, DACMI | ||||||
P-BiGAN | 2020 | y | y | MIMIC-III | ||||||
Deep AE | 2021 | y | Ischemic Heart Disease Taiwan | |||||||
Deep Recurrent AD | 2021 | y | y | TADPOLE (ADNI) | ||||||
MTSIT | 2022 | y | PhysioNet, Beijing Air Quality | |||||||
AJ-RNN | 2022 | y | PhysioNet, UCR Time Series |
@article{septiandri2023missing,
title = {Handling Missing Values in Healthcare Settings},
author = {Septiandri, Ali Akbar and Jendoubi, Takoua and De la O, Alejandro Díaz},
year = 2023,
month = {September},
url = {https://aliakbars.id/posts/2023/09/missing-values}
}