GRU-M: A Joint Impute and Learn Approach for Sequential Prediction under Missing Data

Soumen Pachal, Avinash Achar, Nancy Bhutani, Akash Das
Proceedings of the 16th Asian Conference on Machine Learning, PMLR 260:797-812, 2025.

Abstract

Sequential Prediction in presence of missing data is an old research problem. Classically, researchers have tackled this by imputing data first and then building predictive models. This 2-stage process is typically prone to errors and to circumvent this, researchers have provided a variety of techniques which employ a joint impute and learn approach before prediction. Among these, Recurrent Neural Networks (RNNs) have been very popular given their natural ability to tackle sequential data efficiently. Existing state-of-art approaches either (i)do not impute (ii) do not completely factor the available information around a gap, (iii)ignore position information within a gap and so on. Our approach intelligently addresses these gaps by proposing a novel architecture which jointly imputes and learns by taking into account (i)information from either end of the gap (ii)proximity to the left/right-end of a gap (iii)the length of the gap. In context of this work, prediction means either sequence classification or forecasting. In this paper, we demonstrate the utility of the proposed architecture on forecasting tasks. We benchmark against a range of state-of-art baselines and in scenarios where data is either (a)naturally missing or (b)synthetically masked.

Cite this Paper


BibTeX
@InProceedings{pmlr-v260-pachal25a, title = {{GRU-M}: {A} Joint Impute and Learn Approach for Sequential Prediction under Missing Data}, author = {Pachal, Soumen and Achar, Avinash and Bhutani, Nancy and Das, Akash}, booktitle = {Proceedings of the 16th Asian Conference on Machine Learning}, pages = {797--812}, year = {2025}, editor = {Nguyen, Vu and Lin, Hsuan-Tien}, volume = {260}, series = {Proceedings of Machine Learning Research}, month = {05--08 Dec}, publisher = {PMLR}, pdf = {https://raw.githubusercontent.com/mlresearch/v260/main/assets/pachal25a/pachal25a.pdf}, url = {https://proceedings.mlr.press/v260/pachal25a.html}, abstract = {Sequential Prediction in presence of missing data is an old research problem. Classically, researchers have tackled this by imputing data first and then building predictive models. This 2-stage process is typically prone to errors and to circumvent this, researchers have provided a variety of techniques which employ a joint impute and learn approach before prediction. Among these, Recurrent Neural Networks (RNNs) have been very popular given their natural ability to tackle sequential data efficiently. Existing state-of-art approaches either (i)do not impute (ii) do not completely factor the available information around a gap, (iii)ignore position information within a gap and so on. Our approach intelligently addresses these gaps by proposing a novel architecture which jointly imputes and learns by taking into account (i)information from either end of the gap (ii)proximity to the left/right-end of a gap (iii)the length of the gap. In context of this work, prediction means either sequence classification or forecasting. In this paper, we demonstrate the utility of the proposed architecture on forecasting tasks. We benchmark against a range of state-of-art baselines and in scenarios where data is either (a)naturally missing or (b)synthetically masked.} }
Endnote
%0 Conference Paper %T GRU-M: A Joint Impute and Learn Approach for Sequential Prediction under Missing Data %A Soumen Pachal %A Avinash Achar %A Nancy Bhutani %A Akash Das %B Proceedings of the 16th Asian Conference on Machine Learning %C Proceedings of Machine Learning Research %D 2025 %E Vu Nguyen %E Hsuan-Tien Lin %F pmlr-v260-pachal25a %I PMLR %P 797--812 %U https://proceedings.mlr.press/v260/pachal25a.html %V 260 %X Sequential Prediction in presence of missing data is an old research problem. Classically, researchers have tackled this by imputing data first and then building predictive models. This 2-stage process is typically prone to errors and to circumvent this, researchers have provided a variety of techniques which employ a joint impute and learn approach before prediction. Among these, Recurrent Neural Networks (RNNs) have been very popular given their natural ability to tackle sequential data efficiently. Existing state-of-art approaches either (i)do not impute (ii) do not completely factor the available information around a gap, (iii)ignore position information within a gap and so on. Our approach intelligently addresses these gaps by proposing a novel architecture which jointly imputes and learns by taking into account (i)information from either end of the gap (ii)proximity to the left/right-end of a gap (iii)the length of the gap. In context of this work, prediction means either sequence classification or forecasting. In this paper, we demonstrate the utility of the proposed architecture on forecasting tasks. We benchmark against a range of state-of-art baselines and in scenarios where data is either (a)naturally missing or (b)synthetically masked.
APA
Pachal, S., Achar, A., Bhutani, N. & Das, A.. (2025). GRU-M: A Joint Impute and Learn Approach for Sequential Prediction under Missing Data. Proceedings of the 16th Asian Conference on Machine Learning, in Proceedings of Machine Learning Research 260:797-812 Available from https://proceedings.mlr.press/v260/pachal25a.html.

Related Material