Probing Transfer in Deep Reinforcement Learning without Task Engineering

Andrei Alex Rusu; Sebastian Flennerhag; Dushyant Rao; Razvan Pascanu; Raia Hadsell

Proceedings of The 1st Conference on Lifelong Learning Agents, PMLR 199:1231-1254, 2022.

Abstract

We evaluate the use of original game curricula supported by the Atari 2600 console as a heterogeneous transfer benchmark for deep reinforcement learning agents. Game designers created curricula using combinations of several discrete modifications to the basic versions of games such as Space Invaders, Breakout and Freeway, making them progressively more challenging for human players. By formally organising these modifications into several factors of variation, we are able to show that Analyses of Variance (ANOVA) are a potent tool for studying the effects of human-relevant domain changes on the learning and transfer performance of a deep reinforcement learning agent. Since no manual task engineering is needed on our part, leveraging the original multi-factorial design avoids the pitfalls of unintentionally biasing the experimental setup. We find that game design factors have a large and statistically significant impact on an agent’s ability to learn, and so do their combinatorial interactions. Furthermore, we show that zero-shot transfer from the basic games to their respective variations is possible, but the variance in performance is also largely explained by interactions between factors. As such, we argue that Atari game curricula offer a challenging benchmark for transfer learning in RL, that can help the community better understand the generalisation capabilities of RL agents along dimensions which meaningfully impact human generalisation performance. As a start, we report that value-function finetuning of regularly trained agents achieves positive transfer in a majority of cases, but significant headroom for algorithmic innovation remains. We conclude with the observation that selective transfer from multiple variants could further improve performance.
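
The abstract does not spell out the ANOVA computation itself. As a rough illustration of the idea only, the sketch below decomposes the variance of scores in a balanced, fully crossed two-factor design into main effects, an interaction term, and a residual. The function name `two_way_anova`, the factor labels, and the toy scores are all hypothetical and are not taken from the paper; the authors' actual analysis may differ.

```python
from itertools import product
from statistics import mean


def two_way_anova(cells):
    """Sum-of-squares decomposition for a balanced two-factor design.

    `cells` maps (level_a, level_b) -> list of scores. Every combination
    of levels must be present with the same number of scores.
    (Hypothetical helper for illustration; not the paper's code.)
    """
    levels_a = sorted({a for a, _ in cells})
    levels_b = sorted({b for _, b in cells})
    n = len(next(iter(cells.values())))  # replicates per cell
    for key in product(levels_a, levels_b):
        if len(cells.get(key, [])) != n:
            raise ValueError("design must be balanced and fully crossed")

    grand = mean(x for xs in cells.values() for x in xs)
    mean_a = {a: mean(x for b in levels_b for x in cells[(a, b)]) for a in levels_a}
    mean_b = {b: mean(x for a in levels_a for x in cells[(a, b)]) for b in levels_b}
    mean_ab = {key: mean(xs) for key, xs in cells.items()}

    # Main effects: spread of each factor's marginal means around the grand mean.
    ss_a = n * len(levels_b) * sum((mean_a[a] - grand) ** 2 for a in levels_a)
    ss_b = n * len(levels_a) * sum((mean_b[b] - grand) ** 2 for b in levels_b)
    # Interaction: what the cell means show beyond the two additive main effects.
    ss_ab = n * sum(
        (mean_ab[(a, b)] - mean_a[a] - mean_b[b] + grand) ** 2
        for a, b in product(levels_a, levels_b)
    )
    # Residual: spread of individual scores within each cell.
    ss_res = sum((x - mean_ab[key]) ** 2 for key, xs in cells.items() for x in xs)
    return {"A": ss_a, "B": ss_b, "A:B": ss_ab, "residual": ss_res}


# Synthetic scores (made up): factor A = game mode, factor B = difficulty switch.
toy_scores = {
    (0, 0): [10.0, 12.0],
    (0, 1): [9.0, 11.0],
    (1, 0): [8.0, 10.0],
    (1, 1): [2.0, 4.0],
}
ss = two_way_anova(toy_scores)
total = sum(ss.values())
for term, value in ss.items():
    print(f"{term}: {value / total:.2f} of total variance")
```

A large share of variance in the `A:B` term would indicate that the two factors cannot be understood independently, which is the kind of effect the abstract attributes to interactions between game design factors. A full analysis would also compute F statistics and p-values, which this sketch omits.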

Cite this Paper

BibTeX


@InProceedings{pmlr-v199-rusu22a,
  title = 	 {Probing Transfer in Deep Reinforcement Learning without Task Engineering},
  author =       {Rusu, Andrei Alex and Flennerhag, Sebastian and Rao, Dushyant and Pascanu, Razvan and Hadsell, Raia},
  booktitle = 	 {Proceedings of The 1st Conference on Lifelong Learning Agents},
  pages = 	 {1231--1254},
  year = 	 {2022},
  editor = 	 {Chandar, Sarath and Pascanu, Razvan and Precup, Doina},
  volume = 	 {199},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {22--24 Aug},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v199/rusu22a/rusu22a.pdf},
  url = 	 {https://proceedings.mlr.press/v199/rusu22a.html},
  abstract = 	 {We evaluate the use of original game curricula supported by the Atari 2600 console as a heterogeneous transfer benchmark for deep reinforcement learning agents. Game designers created curricula using combinations of several discrete modifications to the basic versions of games such as Space Invaders, Breakout and Freeway, making them progressively more challenging for human players. By formally organising these modifications into several factors of variation, we are able to show that Analyses of Variance (ANOVA) are a potent tool for studying the effects of human-relevant domain changes on the learning and transfer performance of a deep reinforcement learning agent. Since no manual task engineering is needed on our part, leveraging the original multi-factorial design avoids the pitfalls of unintentionally biasing the experimental setup. We find that game design factors have a large and statistically significant impact on an agent’s ability to learn, and so do their combinatorial interactions. Furthermore, we show that zero-shot transfer from the basic games to their respective variations is possible, but the variance in performance is also largely explained by interactions between factors. As such, we argue that Atari game curricula offer a challenging benchmark for transfer learning in RL, that can help the community better understand the generalisation capabilities of RL agents along dimensions which meaningfully impact human generalisation performance. As a start, we report that value-function finetuning of regularly trained agents achieves positive transfer in a majority of cases, but significant headroom for algorithmic innovation remains. We conclude with the observation that selective transfer from multiple variants could further improve performance.}
}

Endnote

%0 Conference Paper
%T Probing Transfer in Deep Reinforcement Learning without Task Engineering
%A Andrei Alex Rusu
%A Sebastian Flennerhag
%A Dushyant Rao
%A Razvan Pascanu
%A Raia Hadsell
%B Proceedings of The 1st Conference on Lifelong Learning Agents
%C Proceedings of Machine Learning Research
%D 2022
%E Sarath Chandar
%E Razvan Pascanu
%E Doina Precup
%F pmlr-v199-rusu22a
%I PMLR
%P 1231--1254
%U https://proceedings.mlr.press/v199/rusu22a.html
%V 199
%X We evaluate the use of original game curricula supported by the Atari 2600 console as a heterogeneous transfer benchmark for deep reinforcement learning agents. Game designers created curricula using combinations of several discrete modifications to the basic versions of games such as Space Invaders, Breakout and Freeway, making them progressively more challenging for human players. By formally organising these modifications into several factors of variation, we are able to show that Analyses of Variance (ANOVA) are a potent tool for studying the effects of human-relevant domain changes on the learning and transfer performance of a deep reinforcement learning agent. Since no manual task engineering is needed on our part, leveraging the original multi-factorial design avoids the pitfalls of unintentionally biasing the experimental setup. We find that game design factors have a large and statistically significant impact on an agent’s ability to learn, and so do their combinatorial interactions. Furthermore, we show that zero-shot transfer from the basic games to their respective variations is possible, but the variance in performance is also largely explained by interactions between factors. As such, we argue that Atari game curricula offer a challenging benchmark for transfer learning in RL, that can help the community better understand the generalisation capabilities of RL agents along dimensions which meaningfully impact human generalisation performance. As a start, we report that value-function finetuning of regularly trained agents achieves positive transfer in a majority of cases, but significant headroom for algorithmic innovation remains. We conclude with the observation that selective transfer from multiple variants could further improve performance.

APA


Rusu, A.A., Flennerhag, S., Rao, D., Pascanu, R. & Hadsell, R. (2022). Probing Transfer in Deep Reinforcement Learning without Task Engineering. Proceedings of The 1st Conference on Lifelong Learning Agents, in Proceedings of Machine Learning Research 199:1231-1254. Available from https://proceedings.mlr.press/v199/rusu22a.html.
