Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game Approach

Johan Peralez, Aurélien Delage, Olivier Buffet, Jilles Steeve Dibangoye
Proceedings of the 41st International Conference on Machine Learning, PMLR 235:40414-40438, 2024.

Abstract

A recent theory shows that a multi-player decentralized partially observable Markov decision process can be transformed into an equivalent single-player game, enabling the application of Bellman’s principle of optimality to solve the single-player game by breaking it down into single-stage subgames. However, this approach entangles the decision variables of all players at each single-stage subgame, resulting in backups with a double-exponential complexity. This paper demonstrates how to disentangle these decision variables while maintaining optimality under hierarchical information sharing, a prominent management style in our society. To achieve this, we apply the principle of optimality to solve any single-stage subgame by breaking it down further into smaller subgames, enabling us to make single-player decisions at a time. Our approach reveals that extensive-form games always exist with solutions to a single-stage subgame, significantly reducing time complexity. Our experimental results show that the algorithms leveraging these findings can scale up to much larger multi-player games without compromising optimality.

Cite this Paper


BibTeX
@InProceedings{pmlr-v235-peralez24a, title = {Solving Hierarchical Information-Sharing Dec-{POMDP}s: An Extensive-Form Game Approach}, author = {Peralez, Johan and Delage, Aur\'{e}lien and Buffet, Olivier and Dibangoye, Jilles Steeve}, booktitle = {Proceedings of the 41st International Conference on Machine Learning}, pages = {40414--40438}, year = {2024}, editor = {Salakhutdinov, Ruslan and Kolter, Zico and Heller, Katherine and Weller, Adrian and Oliver, Nuria and Scarlett, Jonathan and Berkenkamp, Felix}, volume = {235}, series = {Proceedings of Machine Learning Research}, month = {21--27 Jul}, publisher = {PMLR}, pdf = {https://raw.githubusercontent.com/mlresearch/v235/main/assets/peralez24a/peralez24a.pdf}, url = {https://proceedings.mlr.press/v235/peralez24a.html}, abstract = {A recent theory shows that a multi-player decentralized partially observable Markov decision process can be transformed into an equivalent single-player game, enabling the application of Bellman’s principle of optimality to solve the single-player game by breaking it down into single-stage subgames. However, this approach entangles the decision variables of all players at each single-stage subgame, resulting in backups with a double-exponential complexity. This paper demonstrates how to disentangle these decision variables while maintaining optimality under hierarchical information sharing, a prominent management style in our society. To achieve this, we apply the principle of optimality to solve any single-stage subgame by breaking it down further into smaller subgames, enabling us to make single-player decisions at a time. Our approach reveals that extensive-form games always exist with solutions to a single-stage subgame, significantly reducing time complexity. Our experimental results show that the algorithms leveraging these findings can scale up to much larger multi-player games without compromising optimality.} }
Endnote
%0 Conference Paper %T Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game Approach %A Johan Peralez %A Aurélien Delage %A Olivier Buffet %A Jilles Steeve Dibangoye %B Proceedings of the 41st International Conference on Machine Learning %C Proceedings of Machine Learning Research %D 2024 %E Ruslan Salakhutdinov %E Zico Kolter %E Katherine Heller %E Adrian Weller %E Nuria Oliver %E Jonathan Scarlett %E Felix Berkenkamp %F pmlr-v235-peralez24a %I PMLR %P 40414--40438 %U https://proceedings.mlr.press/v235/peralez24a.html %V 235 %X A recent theory shows that a multi-player decentralized partially observable Markov decision process can be transformed into an equivalent single-player game, enabling the application of Bellman’s principle of optimality to solve the single-player game by breaking it down into single-stage subgames. However, this approach entangles the decision variables of all players at each single-stage subgame, resulting in backups with a double-exponential complexity. This paper demonstrates how to disentangle these decision variables while maintaining optimality under hierarchical information sharing, a prominent management style in our society. To achieve this, we apply the principle of optimality to solve any single-stage subgame by breaking it down further into smaller subgames, enabling us to make single-player decisions at a time. Our approach reveals that extensive-form games always exist with solutions to a single-stage subgame, significantly reducing time complexity. Our experimental results show that the algorithms leveraging these findings can scale up to much larger multi-player games without compromising optimality.
APA
Peralez, J., Delage, A., Buffet, O. & Dibangoye, J.S.. (2024). Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game Approach. Proceedings of the 41st International Conference on Machine Learning, in Proceedings of Machine Learning Research 235:40414-40438 Available from https://proceedings.mlr.press/v235/peralez24a.html.

Related Material