Diffusion Causal Models for Counterfactual Estimation

Pedro Sanchez, Sotirios A. Tsaftaris
Proceedings of the First Conference on Causal Learning and Reasoning, PMLR 177:647-668, 2022.

Abstract

We consider the task of counterfactual estimation from observational imaging data given a known causal structure. In particular, quantifying the causal effect of interventions for high-dimensional data with neural networks remains an open challenge. Herein we propose Diff-SCM, a deep structural causal model that builds on recent advances of generative energy-based models. In our setting, inference is performed by iteratively sampling gradients of the marginal and conditional distributions entailed by the causal model. Counterfactual estimation is achieved by firstly inferring latent variables with deterministic forward diffusion, then intervening on a reverse diffusion process using the gradients of an anti-causal predictor w.r.t the input. Furthermore, we propose a metric for evaluating the generated counterfactuals. We find that Diff-SCM produces more realistic and minimal counterfactuals than baselines on MNIST data and can also be applied to ImageNet data. Code is available https://github.com/vios-s/Diff-SCM.

Cite this Paper


BibTeX
@InProceedings{pmlr-v177-sanchez22a, title = {Diffusion Causal Models for Counterfactual Estimation}, author = {Sanchez, Pedro and Tsaftaris, Sotirios A.}, booktitle = {Proceedings of the First Conference on Causal Learning and Reasoning}, pages = {647--668}, year = {2022}, editor = {Schölkopf, Bernhard and Uhler, Caroline and Zhang, Kun}, volume = {177}, series = {Proceedings of Machine Learning Research}, month = {11--13 Apr}, publisher = {PMLR}, pdf = {https://proceedings.mlr.press/v177/sanchez22a/sanchez22a.pdf}, url = {https://proceedings.mlr.press/v177/sanchez22a.html}, abstract = {We consider the task of counterfactual estimation from observational imaging data given a known causal structure. In particular, quantifying the causal effect of interventions for high-dimensional data with neural networks remains an open challenge. Herein we propose Diff-SCM, a deep structural causal model that builds on recent advances of generative energy-based models. In our setting, inference is performed by iteratively sampling gradients of the marginal and conditional distributions entailed by the causal model. Counterfactual estimation is achieved by firstly inferring latent variables with deterministic forward diffusion, then intervening on a reverse diffusion process using the gradients of an anti-causal predictor w.r.t the input. Furthermore, we propose a metric for evaluating the generated counterfactuals. We find that Diff-SCM produces more realistic and minimal counterfactuals than baselines on MNIST data and can also be applied to ImageNet data. Code is available https://github.com/vios-s/Diff-SCM.} }
Endnote
%0 Conference Paper %T Diffusion Causal Models for Counterfactual Estimation %A Pedro Sanchez %A Sotirios A. Tsaftaris %B Proceedings of the First Conference on Causal Learning and Reasoning %C Proceedings of Machine Learning Research %D 2022 %E Bernhard Schölkopf %E Caroline Uhler %E Kun Zhang %F pmlr-v177-sanchez22a %I PMLR %P 647--668 %U https://proceedings.mlr.press/v177/sanchez22a.html %V 177 %X We consider the task of counterfactual estimation from observational imaging data given a known causal structure. In particular, quantifying the causal effect of interventions for high-dimensional data with neural networks remains an open challenge. Herein we propose Diff-SCM, a deep structural causal model that builds on recent advances of generative energy-based models. In our setting, inference is performed by iteratively sampling gradients of the marginal and conditional distributions entailed by the causal model. Counterfactual estimation is achieved by firstly inferring latent variables with deterministic forward diffusion, then intervening on a reverse diffusion process using the gradients of an anti-causal predictor w.r.t the input. Furthermore, we propose a metric for evaluating the generated counterfactuals. We find that Diff-SCM produces more realistic and minimal counterfactuals than baselines on MNIST data and can also be applied to ImageNet data. Code is available https://github.com/vios-s/Diff-SCM.
APA
Sanchez, P. & Tsaftaris, S.A.. (2022). Diffusion Causal Models for Counterfactual Estimation. Proceedings of the First Conference on Causal Learning and Reasoning, in Proceedings of Machine Learning Research 177:647-668 Available from https://proceedings.mlr.press/v177/sanchez22a.html.

Related Material