Survival Seq2Seq: A Survival Model based on Sequence to Sequence Architecture

Ebrahim Pourjafari, Navid Ziaei, Mohammad R. Rezaei, Amir Sameizadeh, Mohammad Shafiee, Mohammad Alavinia, Mansour Abolghasemian, Nick Sajadi
Proceedings of the 7th Machine Learning for Healthcare Conference, PMLR 182:79-100, 2022.

Abstract

This paper introduces a novel non-parametric deep model for estimating time-to-event (survival analysis) in presence of censored data and competing risks. The model is designed based on the sequence-to-sequence (Seq2Seq) architecture, therefore we name it Survival Seq2Seq. The first recurrent neural network (RNN) layer of the encoder of our model is made up of Gated Recurrent Unit with Decay (GRU-D) cells. These cells have the ability to effectively impute not-missing-at-random values of longitudinal datasets with very high missing rates, such as electronic health records (EHRs). The decoder of Survival Seq2Seq generates a probability distribution function (PDF) for each competing risk without assuming any prior distribution for the risks. Taking advantage of RNN cells, the decoder is able to generate smooth and virtually spike-free PDFs. This is beyond the capability of existing non-parametric deep models for survival analysis. Training results on synthetic and medical datasets prove that Survival Seq2Seq surpasses other existing deep survival models in terms of the accuracy of predictions and the quality of generated PDFs.

Cite this Paper


BibTeX
@InProceedings{pmlr-v182-pourjafari22a, title = {Survival Seq2Seq: A Survival Model based on Sequence to Sequence Architecture}, author = {Pourjafari, Ebrahim and Ziaei, Navid and Rezaei, Mohammad R. and Sameizadeh, Amir and Shafiee, Mohammad and Alavinia, Mohammad and Abolghasemian, Mansour and Sajadi, Nick}, booktitle = {Proceedings of the 7th Machine Learning for Healthcare Conference}, pages = {79--100}, year = {2022}, editor = {Lipton, Zachary and Ranganath, Rajesh and Sendak, Mark and Sjoding, Michael and Yeung, Serena}, volume = {182}, series = {Proceedings of Machine Learning Research}, month = {05--06 Aug}, publisher = {PMLR}, pdf = {https://proceedings.mlr.press/v182/pourjafari22a/pourjafari22a.pdf}, url = {https://proceedings.mlr.press/v182/pourjafari22a.html}, abstract = {This paper introduces a novel non-parametric deep model for estimating time-to-event (survival analysis) in presence of censored data and competing risks. The model is designed based on the sequence-to-sequence (Seq2Seq) architecture, therefore we name it Survival Seq2Seq. The first recurrent neural network (RNN) layer of the encoder of our model is made up of Gated Recurrent Unit with Decay (GRU-D) cells. These cells have the ability to effectively impute not-missing-at-random values of longitudinal datasets with very high missing rates, such as electronic health records (EHRs). The decoder of Survival Seq2Seq generates a probability distribution function (PDF) for each competing risk without assuming any prior distribution for the risks. Taking advantage of RNN cells, the decoder is able to generate smooth and virtually spike-free PDFs. This is beyond the capability of existing non-parametric deep models for survival analysis. Training results on synthetic and medical datasets prove that Survival Seq2Seq surpasses other existing deep survival models in terms of the accuracy of predictions and the quality of generated PDFs.} }
Endnote
%0 Conference Paper %T Survival Seq2Seq: A Survival Model based on Sequence to Sequence Architecture %A Ebrahim Pourjafari %A Navid Ziaei %A Mohammad R. Rezaei %A Amir Sameizadeh %A Mohammad Shafiee %A Mohammad Alavinia %A Mansour Abolghasemian %A Nick Sajadi %B Proceedings of the 7th Machine Learning for Healthcare Conference %C Proceedings of Machine Learning Research %D 2022 %E Zachary Lipton %E Rajesh Ranganath %E Mark Sendak %E Michael Sjoding %E Serena Yeung %F pmlr-v182-pourjafari22a %I PMLR %P 79--100 %U https://proceedings.mlr.press/v182/pourjafari22a.html %V 182 %X This paper introduces a novel non-parametric deep model for estimating time-to-event (survival analysis) in presence of censored data and competing risks. The model is designed based on the sequence-to-sequence (Seq2Seq) architecture, therefore we name it Survival Seq2Seq. The first recurrent neural network (RNN) layer of the encoder of our model is made up of Gated Recurrent Unit with Decay (GRU-D) cells. These cells have the ability to effectively impute not-missing-at-random values of longitudinal datasets with very high missing rates, such as electronic health records (EHRs). The decoder of Survival Seq2Seq generates a probability distribution function (PDF) for each competing risk without assuming any prior distribution for the risks. Taking advantage of RNN cells, the decoder is able to generate smooth and virtually spike-free PDFs. This is beyond the capability of existing non-parametric deep models for survival analysis. Training results on synthetic and medical datasets prove that Survival Seq2Seq surpasses other existing deep survival models in terms of the accuracy of predictions and the quality of generated PDFs.
APA
Pourjafari, E., Ziaei, N., Rezaei, M.R., Sameizadeh, A., Shafiee, M., Alavinia, M., Abolghasemian, M. & Sajadi, N.. (2022). Survival Seq2Seq: A Survival Model based on Sequence to Sequence Architecture. Proceedings of the 7th Machine Learning for Healthcare Conference, in Proceedings of Machine Learning Research 182:79-100 Available from https://proceedings.mlr.press/v182/pourjafari22a.html.

Related Material