Few-Shot Learning with Semi-Supervised Transformers for Electronic Health Records

Raphael Poulain; Mehak Gupta; Rahmatollah Beheshti

Few-Shot Learning with Semi-Supervised Transformers for Electronic Health Records

Raphael Poulain, Mehak Gupta, Rahmatollah Beheshti

Proceedings of the 7th Machine Learning for Healthcare Conference, PMLR 182:853-873, 2022.

Abstract

With the growing availability of Electronic Health Records (EHRs), many deep learning methods have been developed to leverage such datasets in medical prediction tasks. Notably, transformer-based architectures have proven to be highly effective for EHRs. Transformer-based architectures are generally very effective in “transferring” the acquired knowledge from very large datasets to smaller target datasets through their comprehensive “pre-training” process. However, to work efficiently, they still rely on the target datasets for the downstream tasks, and if the target dataset is (very) small, the performance of downstream models can degrade rapidly. In biomedical applications, it is common to only have access to small datasets, for instance, when studying rare diseases, invasive procedures, or using restrictive cohort selection processes. In this study, we present CEHR-GAN-BERT, a semi-supervised transformer-based architecture that leverages both in and out-of-cohort patients to learn better patient representations in the context of few-shot learning. The proposed method opens new learning opportunities where only a few hundred samples are available. We extensively evaluate our method on four prediction tasks and three public datasets showing the ability of our model to achieve improvements upwards of 5% on all performance metrics (including AUROC and F1 Score) on the tasks that use less than 200 annotated patients during the training process.

Cite this Paper

BibTeX


@InProceedings{pmlr-v182-poulain22a,
  title = 	 {Few-Shot Learning with Semi-Supervised Transformers for Electronic Health Records},
  author =       {Poulain, Raphael and Gupta, Mehak and Beheshti, Rahmatollah},
  booktitle = 	 {Proceedings of the 7th Machine Learning for Healthcare Conference},
  pages = 	 {853--873},
  year = 	 {2022},
  editor = 	 {Lipton, Zachary and Ranganath, Rajesh and Sendak, Mark and Sjoding, Michael and Yeung, Serena},
  volume = 	 {182},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {05--06 Aug},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v182/poulain22a/poulain22a.pdf},
  url = 	 {https://proceedings.mlr.press/v182/poulain22a.html},
  abstract = 	 {With the growing availability of Electronic Health Records (EHRs), many deep learning methods have been developed to leverage such datasets in medical prediction tasks. Notably, transformer-based architectures have proven to be highly effective for EHRs. Transformer-based architectures are generally very effective in “transferring” the acquired knowledge from very large datasets to smaller target datasets through their comprehensive “pre-training” process. However, to work efficiently, they still rely on the target datasets for the downstream tasks, and if the target dataset is (very) small, the performance of downstream models can degrade rapidly. In biomedical applications, it is common to only have access to small datasets, for instance, when studying rare diseases, invasive procedures, or using restrictive cohort selection processes. In this study, we present CEHR-GAN-BERT, a semi-supervised transformer-based architecture that leverages both in and out-of-cohort patients to learn better patient representations in the context of few-shot learning. The proposed method opens new learning opportunities where only a few hundred samples are available. We extensively evaluate our method on four prediction tasks and three public datasets showing the ability of our model to achieve improvements upwards of 5% on all performance metrics (including AUROC and F1 Score) on the tasks that use less than 200 annotated patients during the training process.}
}

Endnote

%0 Conference Paper
%T Few-Shot Learning with Semi-Supervised Transformers for Electronic Health Records
%A Raphael Poulain
%A Mehak Gupta
%A Rahmatollah Beheshti
%B Proceedings of the 7th Machine Learning for Healthcare Conference
%C Proceedings of Machine Learning Research
%D 2022
%E Zachary Lipton
%E Rajesh Ranganath
%E Mark Sendak
%E Michael Sjoding
%E Serena Yeung	
%F pmlr-v182-poulain22a
%I PMLR
%P 853--873
%U https://proceedings.mlr.press/v182/poulain22a.html
%V 182
%X With the growing availability of Electronic Health Records (EHRs), many deep learning methods have been developed to leverage such datasets in medical prediction tasks. Notably, transformer-based architectures have proven to be highly effective for EHRs. Transformer-based architectures are generally very effective in “transferring” the acquired knowledge from very large datasets to smaller target datasets through their comprehensive “pre-training” process. However, to work efficiently, they still rely on the target datasets for the downstream tasks, and if the target dataset is (very) small, the performance of downstream models can degrade rapidly. In biomedical applications, it is common to only have access to small datasets, for instance, when studying rare diseases, invasive procedures, or using restrictive cohort selection processes. In this study, we present CEHR-GAN-BERT, a semi-supervised transformer-based architecture that leverages both in and out-of-cohort patients to learn better patient representations in the context of few-shot learning. The proposed method opens new learning opportunities where only a few hundred samples are available. We extensively evaluate our method on four prediction tasks and three public datasets showing the ability of our model to achieve improvements upwards of 5% on all performance metrics (including AUROC and F1 Score) on the tasks that use less than 200 annotated patients during the training process.

APA


Poulain, R., Gupta, M. & Beheshti, R.. (2022). Few-Shot Learning with Semi-Supervised Transformers for Electronic Health Records. Proceedings of the 7th Machine Learning for Healthcare Conference, in Proceedings of Machine Learning Research 182:853-873 Available from https://proceedings.mlr.press/v182/poulain22a.html.

Related Material

Download PDF