Full Law Identification in Graphical Models of Missing Data: Completeness Results

Razieh Nabi, Rohit Bhattacharya, Ilya Shpitser
Proceedings of the 37th International Conference on Machine Learning, PMLR 119:7153-7163, 2020.

Abstract

Missing data has the potential to affect analyses conducted in all fields of scientific study including healthcare, economics, and the social sciences. Several approaches to unbiased inference in the presence of non-ignorable missingness rely on the specification of the target distribution and its missingness process as a probability distribution that factorizes with respect to a directed acyclic graph. In this paper, we address the longstanding question of the characterization of models that are identifiable within this class of missing data distributions. We provide the first completeness result in this field of study – necessary and sufficient graphical conditions under which, the full data distribution can be recovered from the observed data distribution. We then simultaneously address issues that may arise due to the presence of both missing data and unmeasured confounding, by extending these graphical conditions and proofs of completeness, to settings where some variables are not just missing, but completely unobserved.

Cite this Paper


BibTeX
@InProceedings{pmlr-v119-nabi20a, title = {Full Law Identification in Graphical Models of Missing Data: Completeness Results}, author = {Nabi, Razieh and Bhattacharya, Rohit and Shpitser, Ilya}, booktitle = {Proceedings of the 37th International Conference on Machine Learning}, pages = {7153--7163}, year = {2020}, editor = {III, Hal Daumé and Singh, Aarti}, volume = {119}, series = {Proceedings of Machine Learning Research}, month = {13--18 Jul}, publisher = {PMLR}, pdf = {http://proceedings.mlr.press/v119/nabi20a/nabi20a.pdf}, url = {https://proceedings.mlr.press/v119/nabi20a.html}, abstract = {Missing data has the potential to affect analyses conducted in all fields of scientific study including healthcare, economics, and the social sciences. Several approaches to unbiased inference in the presence of non-ignorable missingness rely on the specification of the target distribution and its missingness process as a probability distribution that factorizes with respect to a directed acyclic graph. In this paper, we address the longstanding question of the characterization of models that are identifiable within this class of missing data distributions. We provide the first completeness result in this field of study – necessary and sufficient graphical conditions under which, the full data distribution can be recovered from the observed data distribution. We then simultaneously address issues that may arise due to the presence of both missing data and unmeasured confounding, by extending these graphical conditions and proofs of completeness, to settings where some variables are not just missing, but completely unobserved.} }
Endnote
%0 Conference Paper %T Full Law Identification in Graphical Models of Missing Data: Completeness Results %A Razieh Nabi %A Rohit Bhattacharya %A Ilya Shpitser %B Proceedings of the 37th International Conference on Machine Learning %C Proceedings of Machine Learning Research %D 2020 %E Hal Daumé III %E Aarti Singh %F pmlr-v119-nabi20a %I PMLR %P 7153--7163 %U https://proceedings.mlr.press/v119/nabi20a.html %V 119 %X Missing data has the potential to affect analyses conducted in all fields of scientific study including healthcare, economics, and the social sciences. Several approaches to unbiased inference in the presence of non-ignorable missingness rely on the specification of the target distribution and its missingness process as a probability distribution that factorizes with respect to a directed acyclic graph. In this paper, we address the longstanding question of the characterization of models that are identifiable within this class of missing data distributions. We provide the first completeness result in this field of study – necessary and sufficient graphical conditions under which, the full data distribution can be recovered from the observed data distribution. We then simultaneously address issues that may arise due to the presence of both missing data and unmeasured confounding, by extending these graphical conditions and proofs of completeness, to settings where some variables are not just missing, but completely unobserved.
APA
Nabi, R., Bhattacharya, R. & Shpitser, I.. (2020). Full Law Identification in Graphical Models of Missing Data: Completeness Results. Proceedings of the 37th International Conference on Machine Learning, in Proceedings of Machine Learning Research 119:7153-7163 Available from https://proceedings.mlr.press/v119/nabi20a.html.

Related Material