Causal Discovery via Conditional Independence Testing with Proxy Variables

Mingzhou Liu, Xinwei Sun, Yu Qiao, Yizhou Wang
Proceedings of the 41st International Conference on Machine Learning, PMLR 235:31866-31889, 2024.

Abstract

Distinguishing causal connections from correlations is important in many scenarios. However, the presence of unobserved variables, such as the latent confounder, can introduce bias in conditional independence testing commonly employed in constraint-based causal discovery for identifying causal relations. To address this issue, existing methods introduced proxy variables to adjust for the bias caused by unobserveness. However, these methods were either limited to categorical variables or relied on strong parametric assumptions for identification. In this paper, we propose a novel hypothesis-testing procedure that can effectively examine the existence of the causal relationship over continuous variables, without any parametric constraint. Our procedure is based on discretization, which under completeness conditions, is able to asymptotically establish a linear equation whose coefficient vector is identifiable under the causal null hypothesis. Based on this, we introduce our test statistic and demonstrate its asymptotic level and power. We validate the effectiveness of our procedure using both synthetic and real-world data.

Cite this Paper


BibTeX
@InProceedings{pmlr-v235-liu24bc, title = {Causal Discovery via Conditional Independence Testing with Proxy Variables}, author = {Liu, Mingzhou and Sun, Xinwei and Qiao, Yu and Wang, Yizhou}, booktitle = {Proceedings of the 41st International Conference on Machine Learning}, pages = {31866--31889}, year = {2024}, editor = {Salakhutdinov, Ruslan and Kolter, Zico and Heller, Katherine and Weller, Adrian and Oliver, Nuria and Scarlett, Jonathan and Berkenkamp, Felix}, volume = {235}, series = {Proceedings of Machine Learning Research}, month = {21--27 Jul}, publisher = {PMLR}, pdf = {https://raw.githubusercontent.com/mlresearch/v235/main/assets/liu24bc/liu24bc.pdf}, url = {https://proceedings.mlr.press/v235/liu24bc.html}, abstract = {Distinguishing causal connections from correlations is important in many scenarios. However, the presence of unobserved variables, such as the latent confounder, can introduce bias in conditional independence testing commonly employed in constraint-based causal discovery for identifying causal relations. To address this issue, existing methods introduced proxy variables to adjust for the bias caused by unobserveness. However, these methods were either limited to categorical variables or relied on strong parametric assumptions for identification. In this paper, we propose a novel hypothesis-testing procedure that can effectively examine the existence of the causal relationship over continuous variables, without any parametric constraint. Our procedure is based on discretization, which under completeness conditions, is able to asymptotically establish a linear equation whose coefficient vector is identifiable under the causal null hypothesis. Based on this, we introduce our test statistic and demonstrate its asymptotic level and power. We validate the effectiveness of our procedure using both synthetic and real-world data.} }
Endnote
%0 Conference Paper %T Causal Discovery via Conditional Independence Testing with Proxy Variables %A Mingzhou Liu %A Xinwei Sun %A Yu Qiao %A Yizhou Wang %B Proceedings of the 41st International Conference on Machine Learning %C Proceedings of Machine Learning Research %D 2024 %E Ruslan Salakhutdinov %E Zico Kolter %E Katherine Heller %E Adrian Weller %E Nuria Oliver %E Jonathan Scarlett %E Felix Berkenkamp %F pmlr-v235-liu24bc %I PMLR %P 31866--31889 %U https://proceedings.mlr.press/v235/liu24bc.html %V 235 %X Distinguishing causal connections from correlations is important in many scenarios. However, the presence of unobserved variables, such as the latent confounder, can introduce bias in conditional independence testing commonly employed in constraint-based causal discovery for identifying causal relations. To address this issue, existing methods introduced proxy variables to adjust for the bias caused by unobserveness. However, these methods were either limited to categorical variables or relied on strong parametric assumptions for identification. In this paper, we propose a novel hypothesis-testing procedure that can effectively examine the existence of the causal relationship over continuous variables, without any parametric constraint. Our procedure is based on discretization, which under completeness conditions, is able to asymptotically establish a linear equation whose coefficient vector is identifiable under the causal null hypothesis. Based on this, we introduce our test statistic and demonstrate its asymptotic level and power. We validate the effectiveness of our procedure using both synthetic and real-world data.
APA
Liu, M., Sun, X., Qiao, Y. & Wang, Y.. (2024). Causal Discovery via Conditional Independence Testing with Proxy Variables. Proceedings of the 41st International Conference on Machine Learning, in Proceedings of Machine Learning Research 235:31866-31889 Available from https://proceedings.mlr.press/v235/liu24bc.html.

Related Material