Causal Inference and Uplift Modelling: A Review of the Literature

Pierre Gutierrez, Jean-Yves Gérardy
Proceedings of The 3rd International Conference on Predictive Applications and APIs, PMLR 67:1-13, 2017.

Abstract

Uplift modeling refers to the set of techniques used to model the incremental impact of an action or treatment on a customer outcome. Uplift modeling is therefore both a Causal Inference problem and a Machine Learning one. The literature on uplift is split into 3 main approaches - the Two-Model approach, the Class Transformation approach and modeling uplift directly. Unfortunately, in the absence of a common framework of causal inference and notation, it can be quite difficult to assess those three methods. In this paper, we use the Rubin (1974) model of causal inference and its modern “econometrics” notation to provide a clear comparison of the three approaches and generalize one of them. To our knowledge, this is the first paper that provides a unified review of the uplift literature. Moreover, our paper contributes to the literature by showing that, in the limit, minimizing the Mean Square Error (MSE) formula with respect to a causal effect estimator is equivalent to minimizing the MSE in which the unobserved treatment effect is replaced by a modified target variable. Finally, we hope that our paper will be of use to researchers interested in applying Machine Learning techniques to causal inference problems in a business context as well as in other fields: medicine, sociology or economics.

Cite this Paper


BibTeX
@InProceedings{pmlr-v67-gutierrez17a, title = {Causal Inference and Uplift Modelling: A Review of the Literature}, author = {Gutierrez, Pierre and Gérardy, Jean-Yves}, booktitle = {Proceedings of The 3rd International Conference on Predictive Applications and APIs}, pages = {1--13}, year = {2017}, editor = {Hardgrove, Claire and Dorard, Louis and Thompson, Keiran and Douetteau, Florian}, volume = {67}, series = {Proceedings of Machine Learning Research}, month = {11--12 Oct}, publisher = {PMLR}, pdf = {http://proceedings.mlr.press/v67/gutierrez17a/gutierrez17a.pdf}, url = {https://proceedings.mlr.press/v67/gutierrez17a.html}, abstract = {Uplift modeling refers to the set of techniques used to model the incremental impact of an action or treatment on a customer outcome. Uplift modeling is therefore both a Causal Inference problem and a Machine Learning one. The literature on uplift is split into 3 main approaches - the Two-Model approach, the Class Transformation approach and modeling uplift directly. Unfortunately, in the absence of a common framework of causal inference and notation, it can be quite difficult to assess those three methods. In this paper, we use the Rubin (1974) model of causal inference and its modern “econometrics” notation to provide a clear comparison of the three approaches and generalize one of them. To our knowledge, this is the first paper that provides a unified review of the uplift literature. Moreover, our paper contributes to the literature by showing that, in the limit, minimizing the Mean Square Error (MSE) formula with respect to a causal effect estimator is equivalent to minimizing the MSE in which the unobserved treatment effect is replaced by a modified target variable. Finally, we hope that our paper will be of use to researchers interested in applying Machine Learning techniques to causal inference problems in a business context as well as in other fields: medicine, sociology or economics.} }
Endnote
%0 Conference Paper %T Causal Inference and Uplift Modelling: A Review of the Literature %A Pierre Gutierrez %A Jean-Yves Gérardy %B Proceedings of The 3rd International Conference on Predictive Applications and APIs %C Proceedings of Machine Learning Research %D 2017 %E Claire Hardgrove %E Louis Dorard %E Keiran Thompson %E Florian Douetteau %F pmlr-v67-gutierrez17a %I PMLR %P 1--13 %U https://proceedings.mlr.press/v67/gutierrez17a.html %V 67 %X Uplift modeling refers to the set of techniques used to model the incremental impact of an action or treatment on a customer outcome. Uplift modeling is therefore both a Causal Inference problem and a Machine Learning one. The literature on uplift is split into 3 main approaches - the Two-Model approach, the Class Transformation approach and modeling uplift directly. Unfortunately, in the absence of a common framework of causal inference and notation, it can be quite difficult to assess those three methods. In this paper, we use the Rubin (1974) model of causal inference and its modern “econometrics” notation to provide a clear comparison of the three approaches and generalize one of them. To our knowledge, this is the first paper that provides a unified review of the uplift literature. Moreover, our paper contributes to the literature by showing that, in the limit, minimizing the Mean Square Error (MSE) formula with respect to a causal effect estimator is equivalent to minimizing the MSE in which the unobserved treatment effect is replaced by a modified target variable. Finally, we hope that our paper will be of use to researchers interested in applying Machine Learning techniques to causal inference problems in a business context as well as in other fields: medicine, sociology or economics.
APA
Gutierrez, P. & Gérardy, J.. (2017). Causal Inference and Uplift Modelling: A Review of the Literature. Proceedings of The 3rd International Conference on Predictive Applications and APIs, in Proceedings of Machine Learning Research 67:1-13 Available from https://proceedings.mlr.press/v67/gutierrez17a.html.

Related Material