Encode-Decoder-based GAN for Estimating Counterfactual Outcomes under Sequential Selection Bias and Combinatorial Explosion

Yoshiyuki Norimatsu, Masaaki Imaizumi
Proceedings of the Fourth Conference on Causal Learning and Reasoning, PMLR 275:451-489, 2025.

Abstract

Estimating counterfactual outcomes of time-varying treatment types and associated dosages is important for addressing medical problems. This task becomes more challenging when both the treatment type and dosage assignment are biased due to the presence of time-varying confounders, as compared to estimating outcomes for treatment types alone. Specifically, the setup yields the following two obstacles: first, treatment types and dosages are selected sequentially, causing observed outcomes to be biased at each time step, leading to $2 \times \tau$ biases for a $\tau$-step-ahead prediction (sequential selection bias); second, the number of treatment trajectories increases exponentially with $\tau$ (combinatorial explosion). In this paper, we introduce Encoder-Decoder Time-SCIGAN (EDTS), which combines a longitudinal encoder-decoder transformer with a Generative Adversarial Network (GAN) for estimating counterfactuals. The encoder-decoder architecture predicts outcomes for one-step- and multi-step-ahead predictions separately, while the GAN generates counterfactual outcomes that cannot be distinguished from observed outcomes by the discriminators to handle sequential selection bias. To address combinatorial explosion, we propose a novel discrimination method, Sequential Counterfactual Discrimination (SCD) for EDTS discriminators. Our evaluation of synthetic and semi-synthetic datasets demonstrate that EDTS outperforms the current baselines. To the best of our knowledge, this is the first study to propose an architecture for estimating counterfactual outcomes of both time-varying treatment types and dosages. Implementation is available at \url{https://github.com/ynorimat/EDTS}.

Cite this Paper


BibTeX
@InProceedings{pmlr-v275-norimatsu25a, title = {Encode-Decoder-based GAN for Estimating Counterfactual Outcomes under Sequential Selection Bias and Combinatorial Explosion}, author = {Norimatsu, Yoshiyuki and Imaizumi, Masaaki}, booktitle = {Proceedings of the Fourth Conference on Causal Learning and Reasoning}, pages = {451--489}, year = {2025}, editor = {Huang, Biwei and Drton, Mathias}, volume = {275}, series = {Proceedings of Machine Learning Research}, month = {07--09 May}, publisher = {PMLR}, pdf = {https://raw.githubusercontent.com/mlresearch/v275/main/assets/norimatsu25a/norimatsu25a.pdf}, url = {https://proceedings.mlr.press/v275/norimatsu25a.html}, abstract = {Estimating counterfactual outcomes of time-varying treatment types and associated dosages is important for addressing medical problems. This task becomes more challenging when both the treatment type and dosage assignment are biased due to the presence of time-varying confounders, as compared to estimating outcomes for treatment types alone. Specifically, the setup yields the following two obstacles: first, treatment types and dosages are selected sequentially, causing observed outcomes to be biased at each time step, leading to $2 \times \tau$ biases for a $\tau$-step-ahead prediction (sequential selection bias); second, the number of treatment trajectories increases exponentially with $\tau$ (combinatorial explosion). In this paper, we introduce Encoder-Decoder Time-SCIGAN (EDTS), which combines a longitudinal encoder-decoder transformer with a Generative Adversarial Network (GAN) for estimating counterfactuals. The encoder-decoder architecture predicts outcomes for one-step- and multi-step-ahead predictions separately, while the GAN generates counterfactual outcomes that cannot be distinguished from observed outcomes by the discriminators to handle sequential selection bias. To address combinatorial explosion, we propose a novel discrimination method, Sequential Counterfactual Discrimination (SCD) for EDTS discriminators. Our evaluation of synthetic and semi-synthetic datasets demonstrate that EDTS outperforms the current baselines. To the best of our knowledge, this is the first study to propose an architecture for estimating counterfactual outcomes of both time-varying treatment types and dosages. Implementation is available at \url{https://github.com/ynorimat/EDTS}.} }
Endnote
%0 Conference Paper %T Encode-Decoder-based GAN for Estimating Counterfactual Outcomes under Sequential Selection Bias and Combinatorial Explosion %A Yoshiyuki Norimatsu %A Masaaki Imaizumi %B Proceedings of the Fourth Conference on Causal Learning and Reasoning %C Proceedings of Machine Learning Research %D 2025 %E Biwei Huang %E Mathias Drton %F pmlr-v275-norimatsu25a %I PMLR %P 451--489 %U https://proceedings.mlr.press/v275/norimatsu25a.html %V 275 %X Estimating counterfactual outcomes of time-varying treatment types and associated dosages is important for addressing medical problems. This task becomes more challenging when both the treatment type and dosage assignment are biased due to the presence of time-varying confounders, as compared to estimating outcomes for treatment types alone. Specifically, the setup yields the following two obstacles: first, treatment types and dosages are selected sequentially, causing observed outcomes to be biased at each time step, leading to $2 \times \tau$ biases for a $\tau$-step-ahead prediction (sequential selection bias); second, the number of treatment trajectories increases exponentially with $\tau$ (combinatorial explosion). In this paper, we introduce Encoder-Decoder Time-SCIGAN (EDTS), which combines a longitudinal encoder-decoder transformer with a Generative Adversarial Network (GAN) for estimating counterfactuals. The encoder-decoder architecture predicts outcomes for one-step- and multi-step-ahead predictions separately, while the GAN generates counterfactual outcomes that cannot be distinguished from observed outcomes by the discriminators to handle sequential selection bias. To address combinatorial explosion, we propose a novel discrimination method, Sequential Counterfactual Discrimination (SCD) for EDTS discriminators. Our evaluation of synthetic and semi-synthetic datasets demonstrate that EDTS outperforms the current baselines. To the best of our knowledge, this is the first study to propose an architecture for estimating counterfactual outcomes of both time-varying treatment types and dosages. Implementation is available at \url{https://github.com/ynorimat/EDTS}.
APA
Norimatsu, Y. & Imaizumi, M.. (2025). Encode-Decoder-based GAN for Estimating Counterfactual Outcomes under Sequential Selection Bias and Combinatorial Explosion. Proceedings of the Fourth Conference on Causal Learning and Reasoning, in Proceedings of Machine Learning Research 275:451-489 Available from https://proceedings.mlr.press/v275/norimatsu25a.html.

Related Material