Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

Shariq Iqbal; Christian A Schroeder De Witt; Bei Peng; Wendelin Boehmer; Shimon Whiteson; Fei Sha

Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

Shariq Iqbal, Christian A Schroeder De Witt, Bei Peng, Wendelin Boehmer, Shimon Whiteson, Fei Sha

Proceedings of the 38th International Conference on Machine Learning, PMLR 139:4596-4606, 2021.

Abstract

Multi-agent settings in the real world often involve tasks with varying types and quantities of agents and non-agent entities; however, common patterns of behavior often emerge among these agents/entities. Our method aims to leverage these commonalities by asking the question: “What is the expected utility of each agent when only considering a randomly selected sub-group of its observed entities?” By posing this counterfactual question, we can recognize state-action trajectories within sub-groups of entities that we may have encountered in another task and use what we learned in that task to inform our prediction in the current one. We then reconstruct a prediction of the full returns as a combination of factors considering these disjoint groups of entities and train this “randomly factorized" value function as an auxiliary objective for value-based multi-agent reinforcement learning. By doing so, our model can recognize and leverage similarities across tasks to improve learning efficiency in a multi-task setting. Our approach, Randomized Entity-wise Factorization for Imagined Learning (REFIL), outperforms all strong baselines by a significant margin in challenging multi-task StarCraft micromanagement settings.

Cite this Paper

BibTeX

@InProceedings{pmlr-v139-iqbal21a,
  title = 	 {Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning},
  author =       {Iqbal, Shariq and De Witt, Christian A Schroeder and Peng, Bei and Boehmer, Wendelin and Whiteson, Shimon and Sha, Fei},
  booktitle = 	 {Proceedings of the 38th International Conference on Machine Learning},
  pages = 	 {4596--4606},
  year = 	 {2021},
  editor = 	 {Meila, Marina and Zhang, Tong},
  volume = 	 {139},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {18--24 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v139/iqbal21a/iqbal21a.pdf},
  url = 	 {https://proceedings.mlr.press/v139/iqbal21a.html},
  abstract = 	 {Multi-agent settings in the real world often involve tasks with varying types and quantities of agents and non-agent entities; however, common patterns of behavior often emerge among these agents/entities. Our method aims to leverage these commonalities by asking the question: “What is the expected utility of each agent when only considering a randomly selected sub-group of its observed entities?” By posing this counterfactual question, we can recognize state-action trajectories within sub-groups of entities that we may have encountered in another task and use what we learned in that task to inform our prediction in the current one. We then reconstruct a prediction of the full returns as a combination of factors considering these disjoint groups of entities and train this “randomly factorized" value function as an auxiliary objective for value-based multi-agent reinforcement learning. By doing so, our model can recognize and leverage similarities across tasks to improve learning efficiency in a multi-task setting. Our approach, Randomized Entity-wise Factorization for Imagined Learning (REFIL), outperforms all strong baselines by a significant margin in challenging multi-task StarCraft micromanagement settings.}
}

Endnote

%0 Conference Paper
%T Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning
%A Shariq Iqbal
%A Christian A Schroeder De Witt
%A Bei Peng
%A Wendelin Boehmer
%A Shimon Whiteson
%A Fei Sha
%B Proceedings of the 38th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2021
%E Marina Meila
%E Tong Zhang	
%F pmlr-v139-iqbal21a
%I PMLR
%P 4596--4606
%U https://proceedings.mlr.press/v139/iqbal21a.html
%V 139
%X Multi-agent settings in the real world often involve tasks with varying types and quantities of agents and non-agent entities; however, common patterns of behavior often emerge among these agents/entities. Our method aims to leverage these commonalities by asking the question: “What is the expected utility of each agent when only considering a randomly selected sub-group of its observed entities?” By posing this counterfactual question, we can recognize state-action trajectories within sub-groups of entities that we may have encountered in another task and use what we learned in that task to inform our prediction in the current one. We then reconstruct a prediction of the full returns as a combination of factors considering these disjoint groups of entities and train this “randomly factorized" value function as an auxiliary objective for value-based multi-agent reinforcement learning. By doing so, our model can recognize and leverage similarities across tasks to improve learning efficiency in a multi-task setting. Our approach, Randomized Entity-wise Factorization for Imagined Learning (REFIL), outperforms all strong baselines by a significant margin in challenging multi-task StarCraft micromanagement settings.

APA

Iqbal, S., De Witt, C.A.S., Peng, B., Boehmer, W., Whiteson, S. & Sha, F.. (2021). Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning. Proceedings of the 38th International Conference on Machine Learning, in Proceedings of Machine Learning Research 139:4596-4606 Available from https://proceedings.mlr.press/v139/iqbal21a.html.

Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

Abstract

Cite this Paper

Related Material