Learning Causal Markov Boundaries with Mixed Observational and Experimental Data

Konstantina Lelova; Gregory F. Cooper; Sofia Triantafillou

Proceedings of The 12th International Conference on Probabilistic Graphical Models, PMLR 246:312-326, 2024.

Abstract

A frequent goal in healthcare is to estimate personalized causal effects in order to select the best treatment for a patient from observational or experimental (RCT) data (or both), where "best" is defined in terms of maximizing the expectation of the desired outcome. The first task in estimating personalized effects is selecting the optimal set of personalization covariates (causal feature selection). This set of covariates is the Markov Boundary of the outcome in the experimental distribution, also known as the Interventional Markov Boundary (IMB), and can be identified from RCT data using methods for finding Markov Boundaries. However, most RCT data are very limited in sample size and do not work well with these methods. In this work, we develop methods that combine limited experimental and large observational data to identify the IMB, and improve the estimation of conditional (personalized) causal effects. These methods extend recent results (Triantafillou et al., 2021), which were limited to discrete data, to mixed data with binary and ordinal outcomes. The methods are based on Bayesian regression models. In simulated data, we show that our methods identify the correct IMB and improve causal effect estimation.

Cite this Paper

BibTeX


@InProceedings{pmlr-v246-lelova24a,
  title = 	 {Learning Causal Markov Boundaries with Mixed Observational and Experimental Data},
  author =       {Lelova, Konstantina and Cooper, Gregory F. and Triantafillou, Sofia},
  booktitle = 	 {Proceedings of The 12th International Conference on Probabilistic Graphical Models},
  pages = 	 {312--326},
  year = 	 {2024},
  editor = 	 {Kwisthout, Johan and Renooij, Silja},
  volume = 	 {246},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {11--13 Sep},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v246/main/assets/lelova24a/lelova24a.pdf},
  url = 	 {https://proceedings.mlr.press/v246/lelova24a.html},
  abstract = 	 {A frequent goal in healthcare is to estimate personalized causal effects in order to select the best treatment for a patient from observational or experimental (RCT) data (or both), where "best" is defined in terms of maximizing the expectation of the desired outcome. The first task in estimating personalized effects is selecting the optimal set of personalization covariates (causal feature selection). This set of covariates is the Markov Boundary of the outcome in the experimental distribution, also known as the Interventional Markov Boundary (IMB), and can be identified from RCT data using methods for finding Markov Boundaries. However, most RCT data are very limited in sample size and do not work well with these methods. In this work, we develop methods that combine limited experimental and large observational data to identify the IMB, and improve the estimation of conditional (personalized) causal effects. These methods extend recent results (Triantafillou et al., 2021), which were limited to discrete data, to mixed data with binary and ordinal outcomes. The methods are based on Bayesian regression models. In simulated data, we show that our methods identify the correct IMB and improve causal effect estimation.}
}

Endnote

%0 Conference Paper
%T Learning Causal Markov Boundaries with Mixed Observational and Experimental Data
%A Konstantina Lelova
%A Gregory F. Cooper
%A Sofia Triantafillou
%B Proceedings of The 12th International Conference on Probabilistic Graphical Models
%C Proceedings of Machine Learning Research
%D 2024
%E Johan Kwisthout
%E Silja Renooij
%F pmlr-v246-lelova24a
%I PMLR
%P 312--326
%U https://proceedings.mlr.press/v246/lelova24a.html
%V 246
%X A frequent goal in healthcare is to estimate personalized causal effects in order to select the best treatment for a patient from observational or experimental (RCT) data (or both), where "best" is defined in terms of maximizing the expectation of the desired outcome. The first task in estimating personalized effects is selecting the optimal set of personalization covariates (causal feature selection). This set of covariates is the Markov Boundary of the outcome in the experimental distribution, also known as the Interventional Markov Boundary (IMB), and can be identified from RCT data using methods for finding Markov Boundaries. However, most RCT data are very limited in sample size and do not work well with these methods. In this work, we develop methods that combine limited experimental and large observational data to identify the IMB, and improve the estimation of conditional (personalized) causal effects. These methods extend recent results (Triantafillou et al., 2021), which were limited to discrete data, to mixed data with binary and ordinal outcomes. The methods are based on Bayesian regression models. In simulated data, we show that our methods identify the correct IMB and improve causal effect estimation.

APA


Lelova, K., Cooper, G.F. & Triantafillou, S. (2024). Learning Causal Markov Boundaries with Mixed Observational and Experimental Data. Proceedings of The 12th International Conference on Probabilistic Graphical Models, in Proceedings of Machine Learning Research 246:312-326. Available from https://proceedings.mlr.press/v246/lelova24a.html.
