Analysis of Two-Stage Rollout Designs with Clustering for Causal Inference under Network Interference

Mayleen Cortez-Rodriguez, Matthew Eichhorn, Christina Yu
Proceedings of The 28th International Conference on Artificial Intelligence and Statistics, PMLR 258:3970-3978, 2025.

Abstract

Estimating causal effects under interference is pertinent to many real-world settings. Recent work with low-order potential outcomes models uses a rollout design to obtain unbiased estimators that require no interference network information. However, the required extrapolation can lead to prohibitively high variance. To address this, we propose a two-stage experiment that selects a sub-population in the first stage and restricts treatment rollout to this sub-population in the second stage. We explore the role of clustering in the first stage by analyzing the bias and variance of a polynomial interpolation-style estimator under this experimental design. Bias increases with the number of edges cut in the clustering of the interference network, but variance depends on qualities of the clustering that relate to homophily and covariate balance. There is a tension between clustering objectives that minimize the number of cut edges versus those that maximize covariate balance across clusters. Through simulations, we explore {a bias-variance} trade-off and compare the performance of the estimator under different clustering strategies.

Cite this Paper


BibTeX
@InProceedings{pmlr-v258-cortez-rodriguez25a, title = {Analysis of Two-Stage Rollout Designs with Clustering for Causal Inference under Network Interference}, author = {Cortez-Rodriguez, Mayleen and Eichhorn, Matthew and Yu, Christina}, booktitle = {Proceedings of The 28th International Conference on Artificial Intelligence and Statistics}, pages = {3970--3978}, year = {2025}, editor = {Li, Yingzhen and Mandt, Stephan and Agrawal, Shipra and Khan, Emtiyaz}, volume = {258}, series = {Proceedings of Machine Learning Research}, month = {03--05 May}, publisher = {PMLR}, pdf = {https://raw.githubusercontent.com/mlresearch/v258/main/assets/cortez-rodriguez25a/cortez-rodriguez25a.pdf}, url = {https://proceedings.mlr.press/v258/cortez-rodriguez25a.html}, abstract = {Estimating causal effects under interference is pertinent to many real-world settings. Recent work with low-order potential outcomes models uses a rollout design to obtain unbiased estimators that require no interference network information. However, the required extrapolation can lead to prohibitively high variance. To address this, we propose a two-stage experiment that selects a sub-population in the first stage and restricts treatment rollout to this sub-population in the second stage. We explore the role of clustering in the first stage by analyzing the bias and variance of a polynomial interpolation-style estimator under this experimental design. Bias increases with the number of edges cut in the clustering of the interference network, but variance depends on qualities of the clustering that relate to homophily and covariate balance. There is a tension between clustering objectives that minimize the number of cut edges versus those that maximize covariate balance across clusters. Through simulations, we explore {a bias-variance} trade-off and compare the performance of the estimator under different clustering strategies.} }
Endnote
%0 Conference Paper %T Analysis of Two-Stage Rollout Designs with Clustering for Causal Inference under Network Interference %A Mayleen Cortez-Rodriguez %A Matthew Eichhorn %A Christina Yu %B Proceedings of The 28th International Conference on Artificial Intelligence and Statistics %C Proceedings of Machine Learning Research %D 2025 %E Yingzhen Li %E Stephan Mandt %E Shipra Agrawal %E Emtiyaz Khan %F pmlr-v258-cortez-rodriguez25a %I PMLR %P 3970--3978 %U https://proceedings.mlr.press/v258/cortez-rodriguez25a.html %V 258 %X Estimating causal effects under interference is pertinent to many real-world settings. Recent work with low-order potential outcomes models uses a rollout design to obtain unbiased estimators that require no interference network information. However, the required extrapolation can lead to prohibitively high variance. To address this, we propose a two-stage experiment that selects a sub-population in the first stage and restricts treatment rollout to this sub-population in the second stage. We explore the role of clustering in the first stage by analyzing the bias and variance of a polynomial interpolation-style estimator under this experimental design. Bias increases with the number of edges cut in the clustering of the interference network, but variance depends on qualities of the clustering that relate to homophily and covariate balance. There is a tension between clustering objectives that minimize the number of cut edges versus those that maximize covariate balance across clusters. Through simulations, we explore {a bias-variance} trade-off and compare the performance of the estimator under different clustering strategies.
APA
Cortez-Rodriguez, M., Eichhorn, M. & Yu, C.. (2025). Analysis of Two-Stage Rollout Designs with Clustering for Causal Inference under Network Interference. Proceedings of The 28th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 258:3970-3978 Available from https://proceedings.mlr.press/v258/cortez-rodriguez25a.html.

Related Material