The Sample-Complexity of General Reinforcement Learning

Tor Lattimore; Marcus Hutter; Peter Sunehag

Proceedings of the 30th International Conference on Machine Learning, PMLR 28(3):28-36, 2013.

Abstract

We study the sample-complexity of reinforcement learning in a general setting without assuming ergodicity or finiteness of the environment. Instead, we define a topology on the space of environments and show that if an environment class is compact with respect to this topology, then finite sample-complexity bounds are possible, and we give an algorithm achieving these bounds. We also show the existence of non-compact environment classes for which finite sample-complexity bounds are not achievable. A lower bound is presented that matches the upper bound except for logarithmic factors.

Cite this Paper

BibTeX


@InProceedings{pmlr-v28-lattimore13,
  title = 	 {The Sample-Complexity of General Reinforcement Learning},
  author = 	 {Lattimore, Tor and Hutter, Marcus and Sunehag, Peter},
  booktitle = 	 {Proceedings of the 30th International Conference on Machine Learning},
  pages = 	 {28--36},
  year = 	 {2013},
  editor = 	 {Dasgupta, Sanjoy and McAllester, David},
  volume = 	 {28},
  number =       {3},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Atlanta, Georgia, USA},
  month = 	 {17--19 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v28/lattimore13.pdf},
  url = 	 {https://proceedings.mlr.press/v28/lattimore13.html},
  abstract = 	 {We study the sample-complexity of reinforcement learning in a general setting without assuming ergodicity or finiteness of the environment. Instead, we define a topology on the space of environments and show that if an environment class is compact with respect to this topology then finite sample-complexity bounds are possible and give an algorithm achieving these bounds. We also show the existence of environment classes that are non-compact where finite sample-complexity bounds are not achievable. A lower bound is presented that matches the upper bound except for logarithmic factors.}
}

Endnote

%0 Conference Paper
%T The Sample-Complexity of General Reinforcement Learning
%A Tor Lattimore
%A Marcus Hutter
%A Peter Sunehag
%B Proceedings of the 30th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2013
%E Sanjoy Dasgupta
%E David McAllester
%F pmlr-v28-lattimore13
%I PMLR
%P 28--36
%U https://proceedings.mlr.press/v28/lattimore13.html
%V 28
%N 3
%X We study the sample-complexity of reinforcement learning in a general setting without assuming ergodicity or finiteness of the environment. Instead, we define a topology on the space of environments and show that if an environment class is compact with respect to this topology then finite sample-complexity bounds are possible and give an algorithm achieving these bounds. We also show the existence of environment classes that are non-compact where finite sample-complexity bounds are not achievable. A lower bound is presented that matches the upper bound except for logarithmic factors.

RIS


TY  - CPAPER
TI  - The Sample-Complexity of General Reinforcement Learning
AU  - Tor Lattimore
AU  - Marcus Hutter
AU  - Peter Sunehag
BT  - Proceedings of the 30th International Conference on Machine Learning
DA  - 2013/05/26
ED  - Sanjoy Dasgupta
ED  - David McAllester
ID  - pmlr-v28-lattimore13
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 28
IS  - 3
SP  - 28
EP  - 36
L1  - http://proceedings.mlr.press/v28/lattimore13.pdf
UR  - https://proceedings.mlr.press/v28/lattimore13.html
AB  - We study the sample-complexity of reinforcement learning in a general setting without assuming ergodicity or finiteness of the environment. Instead, we define a topology on the space of environments and show that if an environment class is compact with respect to this topology then finite sample-complexity bounds are possible and give an algorithm achieving these bounds. We also show the existence of environment classes that are non-compact where finite sample-complexity bounds are not achievable. A lower bound is presented that matches the upper bound except for logarithmic factors.
ER  -

APA


Lattimore, T., Hutter, M. & Sunehag, P. (2013). The Sample-Complexity of General Reinforcement Learning. Proceedings of the 30th International Conference on Machine Learning, in Proceedings of Machine Learning Research 28(3):28-36. Available from https://proceedings.mlr.press/v28/lattimore13.html.
