The Sample-Complexity of General Reinforcement Learning

Tor Lattimore, Marcus Hutter, Peter Sunehag
Proceedings of the 30th International Conference on Machine Learning, PMLR 28(3):28-36, 2013.

Abstract

We study the sample-complexity of reinforcement learning in a general setting without assuming ergodicity or finiteness of the environment. Instead, we define a topology on the space of environments and show that if an environment class is compact with respect to this topology then finite sample-complexity bounds are possible and give an algorithm achieving these bounds. We also show the existence of environment classes that are non-compact where finite sample-complexity bounds are not achievable. A lower bound is presented that matches the upper bound except for logarithmic factors.

Cite this Paper


BibTeX
@InProceedings{pmlr-v28-lattimore13,
  title     = {The Sample-Complexity of General Reinforcement Learning},
  author    = {Lattimore, Tor and Hutter, Marcus and Sunehag, Peter},
  booktitle = {Proceedings of the 30th International Conference on Machine Learning},
  pages     = {28--36},
  year      = {2013},
  editor    = {Dasgupta, Sanjoy and McAllester, David},
  volume    = {28},
  number    = {3},
  series    = {Proceedings of Machine Learning Research},
  address   = {Atlanta, Georgia, USA},
  month     = {17--19 Jun},
  publisher = {PMLR},
  pdf       = {http://proceedings.mlr.press/v28/lattimore13.pdf},
  url       = {https://proceedings.mlr.press/v28/lattimore13.html},
  abstract  = {We study the sample-complexity of reinforcement learning in a general setting without assuming ergodicity or finiteness of the environment. Instead, we define a topology on the space of environments and show that if an environment class is compact with respect to this topology then finite sample-complexity bounds are possible and give an algorithm achieving these bounds. We also show the existence of environment classes that are non-compact where finite sample-complexity bounds are not achievable. A lower bound is presented that matches the upper bound except for logarithmic factors.}
}
Endnote
%0 Conference Paper
%T The Sample-Complexity of General Reinforcement Learning
%A Tor Lattimore
%A Marcus Hutter
%A Peter Sunehag
%B Proceedings of the 30th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2013
%E Sanjoy Dasgupta
%E David McAllester
%F pmlr-v28-lattimore13
%I PMLR
%P 28--36
%U https://proceedings.mlr.press/v28/lattimore13.html
%V 28
%N 3
%X We study the sample-complexity of reinforcement learning in a general setting without assuming ergodicity or finiteness of the environment. Instead, we define a topology on the space of environments and show that if an environment class is compact with respect to this topology then finite sample-complexity bounds are possible and give an algorithm achieving these bounds. We also show the existence of environment classes that are non-compact where finite sample-complexity bounds are not achievable. A lower bound is presented that matches the upper bound except for logarithmic factors.
RIS
TY  - CPAPER
TI  - The Sample-Complexity of General Reinforcement Learning
AU  - Tor Lattimore
AU  - Marcus Hutter
AU  - Peter Sunehag
BT  - Proceedings of the 30th International Conference on Machine Learning
DA  - 2013/05/26
ED  - Sanjoy Dasgupta
ED  - David McAllester
ID  - pmlr-v28-lattimore13
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 28
IS  - 3
SP  - 28
EP  - 36
L1  - http://proceedings.mlr.press/v28/lattimore13.pdf
UR  - https://proceedings.mlr.press/v28/lattimore13.html
AB  - We study the sample-complexity of reinforcement learning in a general setting without assuming ergodicity or finiteness of the environment. Instead, we define a topology on the space of environments and show that if an environment class is compact with respect to this topology then finite sample-complexity bounds are possible and give an algorithm achieving these bounds. We also show the existence of environment classes that are non-compact where finite sample-complexity bounds are not achievable. A lower bound is presented that matches the upper bound except for logarithmic factors.
ER  -
APA
Lattimore, T., Hutter, M. & Sunehag, P. (2013). The Sample-Complexity of General Reinforcement Learning. Proceedings of the 30th International Conference on Machine Learning, in Proceedings of Machine Learning Research 28(3):28-36. Available from https://proceedings.mlr.press/v28/lattimore13.html.
