Invariant Causal Prediction for Block MDPs

Amy Zhang; Clare Lyle; Shagun Sodhani; Angelos Filos; Marta Kwiatkowska; Joelle Pineau; Yarin Gal; Doina Precup

Invariant Causal Prediction for Block MDPs

Amy Zhang, Clare Lyle, Shagun Sodhani, Angelos Filos, Marta Kwiatkowska, Joelle Pineau, Yarin Gal, Doina Precup

Proceedings of the 37th International Conference on Machine Learning, PMLR 119:11214-11224, 2020.

Abstract

Generalization across environments is critical to the successful application of reinforcement learning (RL) algorithms to real-world challenges. In this work we propose a method for learning state abstractions which generalize to novel observation distributions in the multi-environment RL setting. We prove that for certain classes of environments, this approach outputs, with high probability, a state abstraction corresponding to the causal feature set with respect to the return. We give empirical evidence that analogous methods for the nonlinear setting can also attain improved generalization over single- and multi-task baselines. Lastly, we provide bounds on model generalization error in the multi-environment setting, in the process showing a connection between causal variable identification and the state abstraction framework for MDPs.

Cite this Paper

BibTeX

@InProceedings{pmlr-v119-zhang20t,
  title = 	 {Invariant Causal Prediction for Block {MDP}s},
  author =       {Zhang, Amy and Lyle, Clare and Sodhani, Shagun and Filos, Angelos and Kwiatkowska, Marta and Pineau, Joelle and Gal, Yarin and Precup, Doina},
  booktitle = 	 {Proceedings of the 37th International Conference on Machine Learning},
  pages = 	 {11214--11224},
  year = 	 {2020},
  editor = 	 {III, Hal Daumé and Singh, Aarti},
  volume = 	 {119},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {13--18 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v119/zhang20t/zhang20t.pdf},
  url = 	 {https://proceedings.mlr.press/v119/zhang20t.html},
  abstract = 	 {Generalization across environments is critical to the successful application of reinforcement learning (RL) algorithms to real-world challenges. In this work we propose a method for learning state abstractions which generalize to novel observation distributions in the multi-environment RL setting. We prove that for certain classes of environments, this approach outputs, with high probability, a state abstraction corresponding to the causal feature set with respect to the return. We give empirical evidence that analogous methods for the nonlinear setting can also attain improved generalization over single- and multi-task baselines. Lastly, we provide bounds on model generalization error in the multi-environment setting, in the process showing a connection between causal variable identification and the state abstraction framework for MDPs.}
}

Endnote

%0 Conference Paper
%T Invariant Causal Prediction for Block MDPs
%A Amy Zhang
%A Clare Lyle
%A Shagun Sodhani
%A Angelos Filos
%A Marta Kwiatkowska
%A Joelle Pineau
%A Yarin Gal
%A Doina Precup
%B Proceedings of the 37th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2020
%E Hal Daumé III
%E Aarti Singh	
%F pmlr-v119-zhang20t
%I PMLR
%P 11214--11224
%U https://proceedings.mlr.press/v119/zhang20t.html
%V 119
%X Generalization across environments is critical to the successful application of reinforcement learning (RL) algorithms to real-world challenges. In this work we propose a method for learning state abstractions which generalize to novel observation distributions in the multi-environment RL setting. We prove that for certain classes of environments, this approach outputs, with high probability, a state abstraction corresponding to the causal feature set with respect to the return. We give empirical evidence that analogous methods for the nonlinear setting can also attain improved generalization over single- and multi-task baselines. Lastly, we provide bounds on model generalization error in the multi-environment setting, in the process showing a connection between causal variable identification and the state abstraction framework for MDPs.

APA

Zhang, A., Lyle, C., Sodhani, S., Filos, A., Kwiatkowska, M., Pineau, J., Gal, Y. & Precup, D.. (2020). Invariant Causal Prediction for Block MDPs. Proceedings of the 37th International Conference on Machine Learning, in Proceedings of Machine Learning Research 119:11214-11224 Available from https://proceedings.mlr.press/v119/zhang20t.html.

Invariant Causal Prediction for Block MDPs

Abstract

Cite this Paper

Related Material