Characterization of Overlap in Observational Studies

Michael Oberst; Fredrik Johansson; Dennis Wei; Tian Gao; Gabriel Brat; David Sontag; Kush Varshney

Characterization of Overlap in Observational Studies

Michael Oberst, Fredrik Johansson, Dennis Wei, Tian Gao, Gabriel Brat, David Sontag, Kush Varshney

Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, PMLR 108:788-798, 2020.

Abstract

Overlap between treatment groups is required for non-parametric estimation of causal effects. If a subgroup of subjects always receives the same intervention, we cannot estimate the effect of intervention changes on that subgroup without further assumptions. When overlap does not hold globally, characterizing local regions of overlap can inform the relevance of causal conclusions for new subjects, and can help guide additional data collection. To have impact, these descriptions must be interpretable for downstream users who are not machine learning experts, such as policy makers. We formalize overlap estimation as a problem of finding minimum volume sets subject to coverage constraints and reduce this problem to binary classification with Boolean rule classifiers. We then generalize this method to estimate overlap in off-policy policy evaluation. In several real-world applications, we demonstrate that these rules have comparable accuracy to black-box estimators and provide intuitive and informative explanations that can inform policy making.

Cite this Paper

BibTeX

@InProceedings{pmlr-v108-oberst20a,
  title = 	 {Characterization of Overlap in Observational Studies},
  author =       {Oberst, Michael and Johansson, Fredrik and Wei, Dennis and Gao, Tian and Brat, Gabriel and Sontag, David and Varshney, Kush},
  booktitle = 	 {Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics},
  pages = 	 {788--798},
  year = 	 {2020},
  editor = 	 {Chiappa, Silvia and Calandra, Roberto},
  volume = 	 {108},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {26--28 Aug},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v108/oberst20a/oberst20a.pdf},
  url = 	 {https://proceedings.mlr.press/v108/oberst20a.html},
  abstract = 	 {Overlap between treatment groups is required for non-parametric estimation of causal effects.  If a subgroup of subjects always receives the same intervention, we cannot estimate the effect of intervention changes on that subgroup without further assumptions.  When overlap does not hold globally, characterizing local regions of overlap can inform the relevance of causal conclusions for new subjects, and can help guide additional data collection. To have impact, these descriptions must be interpretable for downstream users who are not machine learning experts, such as policy makers.  We formalize overlap estimation as a problem of finding minimum volume sets subject to coverage constraints and reduce this problem to binary classification with Boolean rule classifiers. We then generalize this method to estimate overlap in off-policy policy evaluation. In several real-world applications, we demonstrate that these rules have comparable accuracy to black-box estimators and provide intuitive and informative explanations that can inform policy making.}
}

Endnote

%0 Conference Paper
%T Characterization of Overlap in Observational Studies
%A Michael Oberst
%A Fredrik Johansson
%A Dennis Wei
%A Tian Gao
%A Gabriel Brat
%A David Sontag
%A Kush Varshney
%B Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2020
%E Silvia Chiappa
%E Roberto Calandra	
%F pmlr-v108-oberst20a
%I PMLR
%P 788--798
%U https://proceedings.mlr.press/v108/oberst20a.html
%V 108
%X Overlap between treatment groups is required for non-parametric estimation of causal effects.  If a subgroup of subjects always receives the same intervention, we cannot estimate the effect of intervention changes on that subgroup without further assumptions.  When overlap does not hold globally, characterizing local regions of overlap can inform the relevance of causal conclusions for new subjects, and can help guide additional data collection. To have impact, these descriptions must be interpretable for downstream users who are not machine learning experts, such as policy makers.  We formalize overlap estimation as a problem of finding minimum volume sets subject to coverage constraints and reduce this problem to binary classification with Boolean rule classifiers. We then generalize this method to estimate overlap in off-policy policy evaluation. In several real-world applications, we demonstrate that these rules have comparable accuracy to black-box estimators and provide intuitive and informative explanations that can inform policy making.

APA

Oberst, M., Johansson, F., Wei, D., Gao, T., Brat, G., Sontag, D. & Varshney, K.. (2020). Characterization of Overlap in Observational Studies. Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 108:788-798 Available from https://proceedings.mlr.press/v108/oberst20a.html.

Characterization of Overlap in Observational Studies

Abstract

Cite this Paper

Related Material