Automating the Selection of Proxy Variables of Unmeasured Confounders

Feng Xie; Zhengming Chen; Shanshan Luo; Wang Miao; Ruichu Cai; Zhi Geng

Automating the Selection of Proxy Variables of Unmeasured Confounders

Feng Xie, Zhengming Chen, Shanshan Luo, Wang Miao, Ruichu Cai, Zhi Geng

Proceedings of the 41st International Conference on Machine Learning, PMLR 235:54430-54459, 2024.

Abstract

Recently, interest has grown in the use of proxy variables of unobserved confounding for inferring the causal effect in the presence of unmeasured confounders from observational data. One difficulty inhibiting the practical use is finding valid proxy variables of unobserved confounding to a target causal effect of interest. These proxy variables are typically justified by background knowledge. In this paper, we investigate the estimation of causal effects among multiple treatments and a single outcome, all of which are affected by unmeasured confounders, within a linear causal model, without prior knowledge of the validity of proxy variables. To be more specific, we first extend the existing proxy variable estimator, originally addressing a single unmeasured confounder, to accommodate scenarios where multiple unmeasured confounders exist between the treatments and the outcome. Subsequently, we present two different sets of precise identifiability conditions for selecting valid proxy variables of unmeasured confounders, based on the second-order statistics and higher-order statistics of the data, respectively. Moreover, we propose two data-driven methods for the selection of proxy variables and for the unbiased estimation of causal effects. Theoretical analysis demonstrates the correctness of our proposed algorithms. Experimental results on both synthetic and real-world data show the effectiveness of the proposed approach.

Cite this Paper

BibTeX

@InProceedings{pmlr-v235-xie24b,
  title = 	 {Automating the Selection of Proxy Variables of Unmeasured Confounders},
  author =       {Xie, Feng and Chen, Zhengming and Luo, Shanshan and Miao, Wang and Cai, Ruichu and Geng, Zhi},
  booktitle = 	 {Proceedings of the 41st International Conference on Machine Learning},
  pages = 	 {54430--54459},
  year = 	 {2024},
  editor = 	 {Salakhutdinov, Ruslan and Kolter, Zico and Heller, Katherine and Weller, Adrian and Oliver, Nuria and Scarlett, Jonathan and Berkenkamp, Felix},
  volume = 	 {235},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {21--27 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v235/main/assets/xie24b/xie24b.pdf},
  url = 	 {https://proceedings.mlr.press/v235/xie24b.html},
  abstract = 	 {Recently, interest has grown in the use of proxy variables of unobserved confounding for inferring the causal effect in the presence of unmeasured confounders from observational data. One difficulty inhibiting the practical use is finding valid proxy variables of unobserved confounding to a target causal effect of interest. These proxy variables are typically justified by background knowledge. In this paper, we investigate the estimation of causal effects among multiple treatments and a single outcome, all of which are affected by unmeasured confounders, within a linear causal model, without prior knowledge of the validity of proxy variables. To be more specific, we first extend the existing proxy variable estimator, originally addressing a single unmeasured confounder, to accommodate scenarios where multiple unmeasured confounders exist between the treatments and the outcome. Subsequently, we present two different sets of precise identifiability conditions for selecting valid proxy variables of unmeasured confounders, based on the second-order statistics and higher-order statistics of the data, respectively. Moreover, we propose two data-driven methods for the selection of proxy variables and for the unbiased estimation of causal effects. Theoretical analysis demonstrates the correctness of our proposed algorithms. Experimental results on both synthetic and real-world data show the effectiveness of the proposed approach.}
}

Endnote

%0 Conference Paper
%T Automating the Selection of Proxy Variables of Unmeasured Confounders
%A Feng Xie
%A Zhengming Chen
%A Shanshan Luo
%A Wang Miao
%A Ruichu Cai
%A Zhi Geng
%B Proceedings of the 41st International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2024
%E Ruslan Salakhutdinov
%E Zico Kolter
%E Katherine Heller
%E Adrian Weller
%E Nuria Oliver
%E Jonathan Scarlett
%E Felix Berkenkamp	
%F pmlr-v235-xie24b
%I PMLR
%P 54430--54459
%U https://proceedings.mlr.press/v235/xie24b.html
%V 235
%X Recently, interest has grown in the use of proxy variables of unobserved confounding for inferring the causal effect in the presence of unmeasured confounders from observational data. One difficulty inhibiting the practical use is finding valid proxy variables of unobserved confounding to a target causal effect of interest. These proxy variables are typically justified by background knowledge. In this paper, we investigate the estimation of causal effects among multiple treatments and a single outcome, all of which are affected by unmeasured confounders, within a linear causal model, without prior knowledge of the validity of proxy variables. To be more specific, we first extend the existing proxy variable estimator, originally addressing a single unmeasured confounder, to accommodate scenarios where multiple unmeasured confounders exist between the treatments and the outcome. Subsequently, we present two different sets of precise identifiability conditions for selecting valid proxy variables of unmeasured confounders, based on the second-order statistics and higher-order statistics of the data, respectively. Moreover, we propose two data-driven methods for the selection of proxy variables and for the unbiased estimation of causal effects. Theoretical analysis demonstrates the correctness of our proposed algorithms. Experimental results on both synthetic and real-world data show the effectiveness of the proposed approach.

APA

Xie, F., Chen, Z., Luo, S., Miao, W., Cai, R. & Geng, Z.. (2024). Automating the Selection of Proxy Variables of Unmeasured Confounders. Proceedings of the 41st International Conference on Machine Learning, in Proceedings of Machine Learning Research 235:54430-54459 Available from https://proceedings.mlr.press/v235/xie24b.html.

Automating the Selection of Proxy Variables of Unmeasured Confounders

Abstract

Cite this Paper

Related Material