Causal Inference out of Control: Estimating Performativity without Treatment Randomization

Gary Cheng, Moritz Hardt, Celestine Mendler-Dünner
Proceedings of the 41st International Conference on Machine Learning, PMLR 235:8077-8103, 2024.

Abstract

Regulators and academics are increasingly interested in the causal effect that algorithmic actions of a digital platform have on user consumption. In pursuit of estimating this effect from observational data, we identify a set of assumptions that permit causal identifiability without assuming randomized platform actions. Our results are applicable to platforms that rely on machine-learning-powered predictions and leverage knowledge from historical data. The key novelty of our approach is to explicitly model the dynamics of consumption over time, exploiting the repeated interaction of digital platforms with their participants to prove our identifiability results. By viewing the platform as a controller acting on a dynamical system, we can show that exogenous variation in consumption and appropriately responsive algorithmic control actions are sufficient for identifying the causal effect of interest. We complement our claims with an analysis of ready-to-use finite sample estimators and empirical investigations. More broadly, our results deriving identifiability conditions tailored to digital platform settings illustrate a fruitful interplay of control theory and causal inference.

Cite this Paper


BibTeX
@InProceedings{pmlr-v235-cheng24d, title = {Causal Inference out of Control: Estimating Performativity without Treatment Randomization}, author = {Cheng, Gary and Hardt, Moritz and Mendler-D\"{u}nner, Celestine}, booktitle = {Proceedings of the 41st International Conference on Machine Learning}, pages = {8077--8103}, year = {2024}, editor = {Salakhutdinov, Ruslan and Kolter, Zico and Heller, Katherine and Weller, Adrian and Oliver, Nuria and Scarlett, Jonathan and Berkenkamp, Felix}, volume = {235}, series = {Proceedings of Machine Learning Research}, month = {21--27 Jul}, publisher = {PMLR}, pdf = {https://raw.githubusercontent.com/mlresearch/v235/main/assets/cheng24d/cheng24d.pdf}, url = {https://proceedings.mlr.press/v235/cheng24d.html}, abstract = {Regulators and academics are increasingly interested in the causal effect that algorithmic actions of a digital platform have on user consumption. In pursuit of estimating this effect from observational data, we identify a set of assumptions that permit causal identifiability without assuming randomized platform actions. Our results are applicable to platforms that rely on machine-learning-powered predictions and leverage knowledge from historical data. The key novelty of our approach is to explicitly model the dynamics of consumption over time, exploiting the repeated interaction of digital platforms with their participants to prove our identifiability results. By viewing the platform as a controller acting on a dynamical system, we can show that exogenous variation in consumption and appropriately responsive algorithmic control actions are sufficient for identifying the causal effect of interest. We complement our claims with an analysis of ready-to-use finite sample estimators and empirical investigations. More broadly, our results deriving identifiability conditions tailored to digital platform settings illustrate a fruitful interplay of control theory and causal inference.} }
Endnote
%0 Conference Paper %T Causal Inference out of Control: Estimating Performativity without Treatment Randomization %A Gary Cheng %A Moritz Hardt %A Celestine Mendler-Dünner %B Proceedings of the 41st International Conference on Machine Learning %C Proceedings of Machine Learning Research %D 2024 %E Ruslan Salakhutdinov %E Zico Kolter %E Katherine Heller %E Adrian Weller %E Nuria Oliver %E Jonathan Scarlett %E Felix Berkenkamp %F pmlr-v235-cheng24d %I PMLR %P 8077--8103 %U https://proceedings.mlr.press/v235/cheng24d.html %V 235 %X Regulators and academics are increasingly interested in the causal effect that algorithmic actions of a digital platform have on user consumption. In pursuit of estimating this effect from observational data, we identify a set of assumptions that permit causal identifiability without assuming randomized platform actions. Our results are applicable to platforms that rely on machine-learning-powered predictions and leverage knowledge from historical data. The key novelty of our approach is to explicitly model the dynamics of consumption over time, exploiting the repeated interaction of digital platforms with their participants to prove our identifiability results. By viewing the platform as a controller acting on a dynamical system, we can show that exogenous variation in consumption and appropriately responsive algorithmic control actions are sufficient for identifying the causal effect of interest. We complement our claims with an analysis of ready-to-use finite sample estimators and empirical investigations. More broadly, our results deriving identifiability conditions tailored to digital platform settings illustrate a fruitful interplay of control theory and causal inference.
APA
Cheng, G., Hardt, M. & Mendler-Dünner, C.. (2024). Causal Inference out of Control: Estimating Performativity without Treatment Randomization. Proceedings of the 41st International Conference on Machine Learning, in Proceedings of Machine Learning Research 235:8077-8103 Available from https://proceedings.mlr.press/v235/cheng24d.html.

Related Material