Directly Modeling Missing Data in Sequences with RNNs: Improved Classification of Clinical Time Series

Zachary C Lipton; David Kale; Randall Wetzel

Directly Modeling Missing Data in Sequences with RNNs: Improved Classification of Clinical Time Series

Zachary C Lipton, David Kale, Randall Wetzel

Proceedings of the 1st Machine Learning for Healthcare Conference, PMLR 56:253-270, 2016.

Abstract

We demonstrate a simple strategy to cope with missing data in sequential inputs, addressing the task of multilabel classification of diagnoses given clinical time series. Collected from the intensive care unit (ICU) of a major urban medical center, our data consists of multivariate time series of observations. The data is irregularly sampled, leading to missingness patterns in re-sampled sequences. In this work, we show the remarkable ability of RNNs to make effective use of binary indicators to directly model missing data, improving AUC and F1significantly. However, while RNNs can learn arbitrary functions of the missing data and observations, linear models can only learn substitution values. For linear models and MLPs, we show an alternative strategy to capture this signal. Additionally, we evaluate LSTMs, MLPs, and linear models trained on missingness patterns only, showing that for several diseases, what tests are run can be more predictive than the results themselves.

Cite this Paper

BibTeX


@InProceedings{pmlr-v56-Lipton16,
  title = 	 {Directly Modeling Missing Data in Sequences with RNNs: Improved Classification of Clinical Time Series},
  author = 	 {Lipton, Zachary C and Kale, David and Wetzel, Randall},
  booktitle = 	 {Proceedings of the 1st Machine Learning for Healthcare Conference},
  pages = 	 {253--270},
  year = 	 {2016},
  editor = 	 {Doshi-Velez, Finale and Fackler, Jim and Kale, David and Wallace, Byron and Wiens, Jenna},
  volume = 	 {56},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Northeastern University, Boston, MA, USA},
  month = 	 {18--19 Aug},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v56/Lipton16.pdf},
  url = 	 {https://proceedings.mlr.press/v56/Lipton16.html},
  abstract = 	 {We demonstrate a simple strategy to cope with missing data in sequential inputs, addressing the task of multilabel classification of diagnoses given clinical time series. Collected from the intensive care unit (ICU) of a major urban medical center, our data consists of multivariate time series of observations. The data is irregularly sampled, leading to missingness patterns in re-sampled sequences. In this work, we show the remarkable ability of RNNs to make effective use of binary indicators to directly model missing data, improving AUC and F1significantly. However, while RNNs can learn arbitrary functions of the missing data and observations, linear models can only learn substitution values. For linear models and MLPs, we show an alternative strategy to capture this signal. Additionally, we evaluate LSTMs, MLPs, and linear models trained on missingness patterns only, showing that for several diseases, what tests are run can be more predictive than the results themselves.}
}

Endnote

%0 Conference Paper
%T Directly Modeling Missing Data in Sequences with RNNs: Improved Classification of Clinical Time Series
%A Zachary C Lipton
%A David Kale
%A Randall Wetzel
%B Proceedings of the 1st Machine Learning for Healthcare Conference
%C Proceedings of Machine Learning Research
%D 2016
%E Finale Doshi-Velez
%E Jim Fackler
%E David Kale
%E Byron Wallace
%E Jenna Wiens	
%F pmlr-v56-Lipton16
%I PMLR
%P 253--270
%U https://proceedings.mlr.press/v56/Lipton16.html
%V 56
%X We demonstrate a simple strategy to cope with missing data in sequential inputs, addressing the task of multilabel classification of diagnoses given clinical time series. Collected from the intensive care unit (ICU) of a major urban medical center, our data consists of multivariate time series of observations. The data is irregularly sampled, leading to missingness patterns in re-sampled sequences. In this work, we show the remarkable ability of RNNs to make effective use of binary indicators to directly model missing data, improving AUC and F1significantly. However, while RNNs can learn arbitrary functions of the missing data and observations, linear models can only learn substitution values. For linear models and MLPs, we show an alternative strategy to capture this signal. Additionally, we evaluate LSTMs, MLPs, and linear models trained on missingness patterns only, showing that for several diseases, what tests are run can be more predictive than the results themselves.

RIS


TY  - CPAPER
TI  - Directly Modeling Missing Data in Sequences with RNNs: Improved Classification of Clinical Time Series
AU  - Zachary C Lipton
AU  - David Kale
AU  - Randall Wetzel
BT  - Proceedings of the 1st Machine Learning for Healthcare Conference
DA  - 2016/12/10
ED  - Finale Doshi-Velez
ED  - Jim Fackler
ED  - David Kale
ED  - Byron Wallace
ED  - Jenna Wiens	
ID  - pmlr-v56-Lipton16
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 56
SP  - 253
EP  - 270
L1  - http://proceedings.mlr.press/v56/Lipton16.pdf
UR  - https://proceedings.mlr.press/v56/Lipton16.html
AB  - We demonstrate a simple strategy to cope with missing data in sequential inputs, addressing the task of multilabel classification of diagnoses given clinical time series. Collected from the intensive care unit (ICU) of a major urban medical center, our data consists of multivariate time series of observations. The data is irregularly sampled, leading to missingness patterns in re-sampled sequences. In this work, we show the remarkable ability of RNNs to make effective use of binary indicators to directly model missing data, improving AUC and F1significantly. However, while RNNs can learn arbitrary functions of the missing data and observations, linear models can only learn substitution values. For linear models and MLPs, we show an alternative strategy to capture this signal. Additionally, we evaluate LSTMs, MLPs, and linear models trained on missingness patterns only, showing that for several diseases, what tests are run can be more predictive than the results themselves.
ER  -

APA


Lipton, Z.C., Kale, D. & Wetzel, R.. (2016). Directly Modeling Missing Data in Sequences with RNNs: Improved Classification of Clinical Time Series. Proceedings of the 1st Machine Learning for Healthcare Conference, in Proceedings of Machine Learning Research 56:253-270 Available from https://proceedings.mlr.press/v56/Lipton16.html.

Related Material

Download PDF