Domain Generalization using Causal Matching

Divyat Mahajan; Shruti Tople; Amit Sharma

Domain Generalization using Causal Matching

Divyat Mahajan, Shruti Tople, Amit Sharma

Proceedings of the 38th International Conference on Machine Learning, PMLR 139:7313-7324, 2021.

Abstract

In the domain generalization literature, a common objective is to learn representations independent of the domain after conditioning on the class label. We show that this objective is not sufficient: there exist counter-examples where a model fails to generalize to unseen domains even after satisfying class-conditional domain invariance. We formalize this observation through a structural causal model and show the importance of modeling within-class variations for generalization. Specifically, classes contain objects that characterize specific causal features, and domains can be interpreted as interventions on these objects that change non-causal features. We highlight an alternative condition: inputs across domains should have the same representation if they are derived from the same object. Based on this objective, we propose matching-based algorithms when base objects are observed (e.g., through data augmentation) and approximate the objective when objects are not observed (MatchDG). Our simple matching-based algorithms are competitive to prior work on out-of-domain accuracy for rotated MNIST, Fashion-MNIST, PACS, and Chest-Xray datasets. Our method MatchDG also recovers ground-truth object matches: on MNIST and Fashion-MNIST, top-10 matches from MatchDG have over 50% overlap with ground-truth matches.

Cite this Paper

BibTeX


@InProceedings{pmlr-v139-mahajan21b,
  title = 	 {Domain Generalization using Causal Matching},
  author =       {Mahajan, Divyat and Tople, Shruti and Sharma, Amit},
  booktitle = 	 {Proceedings of the 38th International Conference on Machine Learning},
  pages = 	 {7313--7324},
  year = 	 {2021},
  editor = 	 {Meila, Marina and Zhang, Tong},
  volume = 	 {139},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {18--24 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v139/mahajan21b/mahajan21b.pdf},
  url = 	 {https://proceedings.mlr.press/v139/mahajan21b.html},
  abstract = 	 {In the domain generalization literature, a common objective is to learn representations independent of the domain after conditioning on the class label. We show that this objective is not sufficient: there exist counter-examples where a model fails to generalize to unseen domains even after satisfying class-conditional domain invariance. We formalize this observation through a structural causal model and show the importance of modeling within-class variations for generalization. Specifically, classes contain objects that characterize specific causal features, and domains can be interpreted as interventions on these objects that change non-causal features. We highlight an alternative condition: inputs across domains should have the same representation if they are derived from the same object. Based on this objective, we propose matching-based algorithms when base objects are observed (e.g., through data augmentation) and approximate the objective when objects are not observed (MatchDG). Our simple matching-based algorithms are competitive to prior work on out-of-domain accuracy for rotated MNIST, Fashion-MNIST, PACS, and Chest-Xray datasets. Our method MatchDG also recovers ground-truth object matches: on MNIST and Fashion-MNIST, top-10 matches from MatchDG have over 50% overlap with ground-truth matches.}
}

Endnote

%0 Conference Paper
%T Domain Generalization using Causal Matching
%A Divyat Mahajan
%A Shruti Tople
%A Amit Sharma
%B Proceedings of the 38th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2021
%E Marina Meila
%E Tong Zhang	
%F pmlr-v139-mahajan21b
%I PMLR
%P 7313--7324
%U https://proceedings.mlr.press/v139/mahajan21b.html
%V 139
%X In the domain generalization literature, a common objective is to learn representations independent of the domain after conditioning on the class label. We show that this objective is not sufficient: there exist counter-examples where a model fails to generalize to unseen domains even after satisfying class-conditional domain invariance. We formalize this observation through a structural causal model and show the importance of modeling within-class variations for generalization. Specifically, classes contain objects that characterize specific causal features, and domains can be interpreted as interventions on these objects that change non-causal features. We highlight an alternative condition: inputs across domains should have the same representation if they are derived from the same object. Based on this objective, we propose matching-based algorithms when base objects are observed (e.g., through data augmentation) and approximate the objective when objects are not observed (MatchDG). Our simple matching-based algorithms are competitive to prior work on out-of-domain accuracy for rotated MNIST, Fashion-MNIST, PACS, and Chest-Xray datasets. Our method MatchDG also recovers ground-truth object matches: on MNIST and Fashion-MNIST, top-10 matches from MatchDG have over 50% overlap with ground-truth matches.

APA


Mahajan, D., Tople, S. & Sharma, A.. (2021). Domain Generalization using Causal Matching. Proceedings of the 38th International Conference on Machine Learning, in Proceedings of Machine Learning Research 139:7313-7324 Available from https://proceedings.mlr.press/v139/mahajan21b.html.

Domain Generalization using Causal Matching

Abstract

Cite this Paper

Related Material