Convergence rates for localized actor-critic in networked Markov potential games

Zhaoyi Zhou; Zaiwei Chen; Yiheng Lin; Adam Wierman

Convergence rates for localized actor-critic in networked Markov potential games

Zhaoyi Zhou, Zaiwei Chen, Yiheng Lin, Adam Wierman

Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence, PMLR 216:2563-2573, 2023.

Abstract

We introduce a class of networked Markov potential games where agents are associated with nodes in a network. Each agent has its own local potential function, and the reward of each agent depends only on the states and actions of agents within a neighborhood. In this context, we propose a localized actor-critic algorithm. The algorithm is scalable since each agent uses only local information and does not need access to the global state. Further, the algorithm overcomes the curse of dimensionality through the use of function approximation. Our main results provide finite-sample guarantees up to a localization error and a function approximation error. Specifically, we achieve an $\tilde{\mathcal{O}}(\tilde{\epsilon}^{-4})$ sample complexity measured by the averaged Nash regret. This is the first finite-sample bound for multi-agent competitive games that does not depend on the number of agents.

Cite this Paper

BibTeX

@InProceedings{pmlr-v216-zhou23b,
  title = 	 {Convergence rates for localized actor-critic in networked {M}arkov potential games},
  author =       {Zhou, Zhaoyi and Chen, Zaiwei and Lin, Yiheng and Wierman, Adam},
  booktitle = 	 {Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence},
  pages = 	 {2563--2573},
  year = 	 {2023},
  editor = 	 {Evans, Robin J. and Shpitser, Ilya},
  volume = 	 {216},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {31 Jul--04 Aug},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v216/zhou23b/zhou23b.pdf},
  url = 	 {https://proceedings.mlr.press/v216/zhou23b.html},
  abstract = 	 {We introduce a class of networked Markov potential games where agents are associated with nodes in a network. Each agent has its own local potential function, and the reward of each agent depends only on the states and actions of agents within a neighborhood.  In this context, we propose a localized actor-critic algorithm.  The algorithm is scalable since each agent uses only local information and does not need access to the global state.  Further, the algorithm overcomes the curse of dimensionality through the use of function approximation.  Our main results provide finite-sample guarantees up to a localization error and a function approximation error. Specifically, we achieve an $\tilde{\mathcal{O}}(\tilde{\epsilon}^{-4})$ sample complexity measured by the averaged Nash regret. This is the first finite-sample bound for multi-agent competitive games  that does not depend on the number of agents.}
}

Endnote

%0 Conference Paper
%T Convergence rates for localized actor-critic in networked Markov potential games
%A Zhaoyi Zhou
%A Zaiwei Chen
%A Yiheng Lin
%A Adam Wierman
%B Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence
%C Proceedings of Machine Learning Research
%D 2023
%E Robin J. Evans
%E Ilya Shpitser	
%F pmlr-v216-zhou23b
%I PMLR
%P 2563--2573
%U https://proceedings.mlr.press/v216/zhou23b.html
%V 216
%X We introduce a class of networked Markov potential games where agents are associated with nodes in a network. Each agent has its own local potential function, and the reward of each agent depends only on the states and actions of agents within a neighborhood.  In this context, we propose a localized actor-critic algorithm.  The algorithm is scalable since each agent uses only local information and does not need access to the global state.  Further, the algorithm overcomes the curse of dimensionality through the use of function approximation.  Our main results provide finite-sample guarantees up to a localization error and a function approximation error. Specifically, we achieve an $\tilde{\mathcal{O}}(\tilde{\epsilon}^{-4})$ sample complexity measured by the averaged Nash regret. This is the first finite-sample bound for multi-agent competitive games  that does not depend on the number of agents.

APA

Zhou, Z., Chen, Z., Lin, Y. & Wierman, A.. (2023). Convergence rates for localized actor-critic in networked Markov potential games. Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence, in Proceedings of Machine Learning Research 216:2563-2573 Available from https://proceedings.mlr.press/v216/zhou23b.html.

Convergence rates for localized actor-critic in networked Markov potential games

Abstract

Cite this Paper

Related Material