Speeding up planning in Markov decision processes via automatically constructed abstractions

Alejandro Isaza, Csaba Szepesvári, Vadim Bulitko, Russell Greiner
Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence, PMLR R6:306-314, 2008.

Abstract

In this paper, we consider planning in stochastic shortest path (SSP) problems, a subclass of Markov Decision Problems (MDP). We focus on medium-size problems whose state space can be fully enumerated. This problem has numerous important applications, such as navigation and planning under uncertainty. We propose a new approach for constructing a multi-level hierarchy of progressively simpler abstractions of the original problem. Once computed, the hierarchy can be used to speed up planning by first finding a policy for the most abstract level and then recursively refining it into a solution to the original problem. This approach is fully automated and delivers a speed-up of two orders of magnitude over a state-of-the-art MDP solver on sample problems while returning near-optimal solutions. We also prove theoretical bounds on the loss of solution optimality resulting from the use of abstractions.

Cite this Paper


BibTeX
@InProceedings{pmlr-vR6-isaza08a, title = {Speeding up planning in Markov decision processes via automatically constructed abstractions}, author = {Isaza, Alejandro and Szepesv\'{a}ri, Csaba and Bulitko, Vadim and Greiner, Russell}, booktitle = {Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence}, pages = {306--314}, year = {2008}, editor = {McAllester, David A. and Myllymäki, Petri}, volume = {R6}, series = {Proceedings of Machine Learning Research}, month = {09--12 Jul}, publisher = {PMLR}, pdf = {https://raw.githubusercontent.com/mlresearch/r6/main/assets/isaza08a/isaza08a.pdf}, url = {https://proceedings.mlr.press/r6/isaza08a.html}, abstract = {In this paper, we consider planning in stochastic shortest path (SSP) problems, a subclass of Markov Decision Problems (MDP). We focus on medium-size problems whose state space can be fully enumerated. This problem has numerous important applications, such as navigation and planning under uncertainty. We propose a new approach for constructing a multi-level hierarchy of progressively simpler abstractions of the original problem. Once computed, the hierarchy can be used to speed up planning by first finding a policy for the most abstract level and then recursively refining it into a solution to the original problem. This approach is fully automated and delivers a speed-up of two orders of magnitude over a state-of-the-art MDP solver on sample problems while returning near-optimal solutions. We also prove theoretical bounds on the loss of solution optimality resulting from the use of abstractions.}, note = {Reissued by PMLR on 09 October 2024.} }
Endnote
%0 Conference Paper %T Speeding up planning in Markov decision processes via automatically constructed abstractions %A Alejandro Isaza %A Csaba Szepesvári %A Vadim Bulitko %A Russell Greiner %B Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence %C Proceedings of Machine Learning Research %D 2008 %E David A. McAllester %E Petri Myllymäki %F pmlr-vR6-isaza08a %I PMLR %P 306--314 %U https://proceedings.mlr.press/r6/isaza08a.html %V R6 %X In this paper, we consider planning in stochastic shortest path (SSP) problems, a subclass of Markov Decision Problems (MDP). We focus on medium-size problems whose state space can be fully enumerated. This problem has numerous important applications, such as navigation and planning under uncertainty. We propose a new approach for constructing a multi-level hierarchy of progressively simpler abstractions of the original problem. Once computed, the hierarchy can be used to speed up planning by first finding a policy for the most abstract level and then recursively refining it into a solution to the original problem. This approach is fully automated and delivers a speed-up of two orders of magnitude over a state-of-the-art MDP solver on sample problems while returning near-optimal solutions. We also prove theoretical bounds on the loss of solution optimality resulting from the use of abstractions. %Z Reissued by PMLR on 09 October 2024.
APA
Isaza, A., Szepesvári, C., Bulitko, V. & Greiner, R.. (2008). Speeding up planning in Markov decision processes via automatically constructed abstractions. Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence, in Proceedings of Machine Learning Research R6:306-314 Available from https://proceedings.mlr.press/r6/isaza08a.html. Reissued by PMLR on 09 October 2024.

Related Material