Compositional Models for Estimating Causal Effects

Purva Pruthi, David Jensen
Proceedings of the Fourth Conference on Causal Learning and Reasoning, PMLR 275:1365-1404, 2025.

Abstract

Many real-world systems can be usefully represented as sets of interacting components. Examples include computational systems, such as query processors and compilers; natural systems, such as cells and ecosystems; and social systems, such as families and organizations. However, current approaches to estimating *potential outcomes* and *causal effects* typically treat such systems as single units, represent them with a fixed set of variables, and assume a homogeneous data-generating process. In this work, we study a *compositional* approach for estimating individual-level potential outcomes and causal effects in structured systems, where each unit is represented by an *instance-specific* composition of multiple heterogeneous components. The compositional approach decomposes unit-level causal queries into more fine-grained queries, explicitly modeling how unit-level interventions affect component-level outcomes to generate a unit’s outcome. We demonstrate this approach using modular neural network architectures and show that it provides benefits for causal effect estimation from observational data, such as accurate causal effect estimation for structured units, increased sample efficiency, improved overlap between treatment and control groups, and compositional generalization to units with unseen combinations of components. Remarkably, our results show that compositional modeling can improve the accuracy of causal estimation even when component-level outcomes are unobserved. We also create and use a set of real-world evaluation environments for the empirical evaluation of compositional approaches for causal effect estimation and demonstrate the role of composition structure, varying amounts of component-level data access, and component heterogeneity in the performance of compositional models as compared to the non-compositional approaches.

Cite this Paper


BibTeX
@InProceedings{pmlr-v275-pruthi25a, title = {Compositional Models for Estimating Causal Effects}, author = {Pruthi, Purva and Jensen, David}, booktitle = {Proceedings of the Fourth Conference on Causal Learning and Reasoning}, pages = {1365--1404}, year = {2025}, editor = {Huang, Biwei and Drton, Mathias}, volume = {275}, series = {Proceedings of Machine Learning Research}, month = {07--09 May}, publisher = {PMLR}, pdf = {https://raw.githubusercontent.com/mlresearch/v275/main/assets/pruthi25a/pruthi25a.pdf}, url = {https://proceedings.mlr.press/v275/pruthi25a.html}, abstract = {Many real-world systems can be usefully represented as sets of interacting components. Examples include computational systems, such as query processors and compilers; natural systems, such as cells and ecosystems; and social systems, such as families and organizations. However, current approaches to estimating *potential outcomes* and *causal effects* typically treat such systems as single units, represent them with a fixed set of variables, and assume a homogeneous data-generating process. In this work, we study a *compositional* approach for estimating individual-level potential outcomes and causal effects in structured systems, where each unit is represented by an *instance-specific* composition of multiple heterogeneous components. The compositional approach decomposes unit-level causal queries into more fine-grained queries, explicitly modeling how unit-level interventions affect component-level outcomes to generate a unit’s outcome. We demonstrate this approach using modular neural network architectures and show that it provides benefits for causal effect estimation from observational data, such as accurate causal effect estimation for structured units, increased sample efficiency, improved overlap between treatment and control groups, and compositional generalization to units with unseen combinations of components. Remarkably, our results show that compositional modeling can improve the accuracy of causal estimation even when component-level outcomes are unobserved. We also create and use a set of real-world evaluation environments for the empirical evaluation of compositional approaches for causal effect estimation and demonstrate the role of composition structure, varying amounts of component-level data access, and component heterogeneity in the performance of compositional models as compared to the non-compositional approaches.} }
Endnote
%0 Conference Paper %T Compositional Models for Estimating Causal Effects %A Purva Pruthi %A David Jensen %B Proceedings of the Fourth Conference on Causal Learning and Reasoning %C Proceedings of Machine Learning Research %D 2025 %E Biwei Huang %E Mathias Drton %F pmlr-v275-pruthi25a %I PMLR %P 1365--1404 %U https://proceedings.mlr.press/v275/pruthi25a.html %V 275 %X Many real-world systems can be usefully represented as sets of interacting components. Examples include computational systems, such as query processors and compilers; natural systems, such as cells and ecosystems; and social systems, such as families and organizations. However, current approaches to estimating *potential outcomes* and *causal effects* typically treat such systems as single units, represent them with a fixed set of variables, and assume a homogeneous data-generating process. In this work, we study a *compositional* approach for estimating individual-level potential outcomes and causal effects in structured systems, where each unit is represented by an *instance-specific* composition of multiple heterogeneous components. The compositional approach decomposes unit-level causal queries into more fine-grained queries, explicitly modeling how unit-level interventions affect component-level outcomes to generate a unit’s outcome. We demonstrate this approach using modular neural network architectures and show that it provides benefits for causal effect estimation from observational data, such as accurate causal effect estimation for structured units, increased sample efficiency, improved overlap between treatment and control groups, and compositional generalization to units with unseen combinations of components. Remarkably, our results show that compositional modeling can improve the accuracy of causal estimation even when component-level outcomes are unobserved. We also create and use a set of real-world evaluation environments for the empirical evaluation of compositional approaches for causal effect estimation and demonstrate the role of composition structure, varying amounts of component-level data access, and component heterogeneity in the performance of compositional models as compared to the non-compositional approaches.
APA
Pruthi, P. & Jensen, D.. (2025). Compositional Models for Estimating Causal Effects. Proceedings of the Fourth Conference on Causal Learning and Reasoning, in Proceedings of Machine Learning Research 275:1365-1404 Available from https://proceedings.mlr.press/v275/pruthi25a.html.

Related Material