Equivariant Reinforcement Learning under Partial Observability

Hai Huu Nguyen; Andrea Baisero; David Klee; Dian Wang; Robert Platt; Christopher Amato

Equivariant Reinforcement Learning under Partial Observability

Hai Huu Nguyen, Andrea Baisero, David Klee, Dian Wang, Robert Platt, Christopher Amato

Proceedings of The 7th Conference on Robot Learning, PMLR 229:3309-3320, 2023.

Abstract

Incorporating inductive biases is a promising approach for tackling challenging robot learning domains with sample-efficient solutions. This paper identifies partially observable domains where symmetries can be a useful inductive bias for efficient learning. Specifically, by encoding the equivariance regarding specific group symmetries into the neural networks, our actor-critic reinforcement learning agents can reuse solutions in the past for related scenarios. Consequently, our equivariant agents outperform non-equivariant approaches significantly in terms of sample efficiency and final performance, demonstrated through experiments on a range of robotic tasks in simulation and real hardware.

Cite this Paper

BibTeX


@InProceedings{pmlr-v229-nguyen23a,
  title = 	 {Equivariant Reinforcement Learning under Partial Observability},
  author =       {Nguyen, Hai Huu and Baisero, Andrea and Klee, David and Wang, Dian and Platt, Robert and Amato, Christopher},
  booktitle = 	 {Proceedings of The 7th Conference on Robot Learning},
  pages = 	 {3309--3320},
  year = 	 {2023},
  editor = 	 {Tan, Jie and Toussaint, Marc and Darvish, Kourosh},
  volume = 	 {229},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {06--09 Nov},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v229/nguyen23a/nguyen23a.pdf},
  url = 	 {https://proceedings.mlr.press/v229/nguyen23a.html},
  abstract = 	 {Incorporating inductive biases is a promising approach for tackling challenging robot learning domains with sample-efficient solutions. This paper identifies partially observable domains where symmetries can be a useful inductive bias for efficient learning. Specifically, by encoding the equivariance regarding specific group symmetries into the neural networks, our actor-critic reinforcement learning agents can reuse solutions in the past for related scenarios. Consequently, our equivariant agents outperform non-equivariant approaches significantly in terms of sample efficiency and final performance, demonstrated through experiments on a range of robotic tasks in simulation and real hardware.}
}

Endnote

%0 Conference Paper
%T Equivariant Reinforcement Learning under Partial Observability
%A Hai Huu Nguyen
%A Andrea Baisero
%A David Klee
%A Dian Wang
%A Robert Platt
%A Christopher Amato
%B Proceedings of The 7th Conference on Robot Learning
%C Proceedings of Machine Learning Research
%D 2023
%E Jie Tan
%E Marc Toussaint
%E Kourosh Darvish	
%F pmlr-v229-nguyen23a
%I PMLR
%P 3309--3320
%U https://proceedings.mlr.press/v229/nguyen23a.html
%V 229
%X Incorporating inductive biases is a promising approach for tackling challenging robot learning domains with sample-efficient solutions. This paper identifies partially observable domains where symmetries can be a useful inductive bias for efficient learning. Specifically, by encoding the equivariance regarding specific group symmetries into the neural networks, our actor-critic reinforcement learning agents can reuse solutions in the past for related scenarios. Consequently, our equivariant agents outperform non-equivariant approaches significantly in terms of sample efficiency and final performance, demonstrated through experiments on a range of robotic tasks in simulation and real hardware.

APA


Nguyen, H.H., Baisero, A., Klee, D., Wang, D., Platt, R. & Amato, C.. (2023). Equivariant Reinforcement Learning under Partial Observability. Proceedings of The 7th Conference on Robot Learning, in Proceedings of Machine Learning Research 229:3309-3320 Available from https://proceedings.mlr.press/v229/nguyen23a.html.

Equivariant Reinforcement Learning under Partial Observability

Abstract

Cite this Paper

Related Material