On the Existence of a Complexity in Fixed Budget Bandit Identification

Rémy Degenne

On the Existence of a Complexity in Fixed Budget Bandit Identification

Rémy Degenne

Proceedings of Thirty Sixth Conference on Learning Theory, PMLR 195:1131-1154, 2023.

Abstract

In fixed budget bandit identification, an algorithm sequentially observes samples from several distributions up to a given final time.It then answers a query about the set of distributions. A good algorithm will have a small probability of error.While that probability decreases exponentially with the final time, the best attainable rate is not known precisely for most identification tasks.We show that if a fixed budget task admits a complexity, defined as a lower bound on the probability of error which is attained by the same algorithm on all bandit problems, then that complexity is determined by the best non-adaptive sampling procedure for that problem.We show that there is no such complexity for several fixed budget identification tasks including Bernoulli best arm identification with two arms: there is no single algorithm that attains everywhere the best possible rate.

Cite this Paper

BibTeX

@InProceedings{pmlr-v195-degenne23a,
  title = 	 {On the Existence of a Complexity in Fixed Budget Bandit Identification},
  author =       {Degenne, R{\'e}my},
  booktitle = 	 {Proceedings of Thirty Sixth Conference on Learning Theory},
  pages = 	 {1131--1154},
  year = 	 {2023},
  editor = 	 {Neu, Gergely and Rosasco, Lorenzo},
  volume = 	 {195},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {12--15 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v195/degenne23a/degenne23a.pdf},
  url = 	 {https://proceedings.mlr.press/v195/degenne23a.html},
  abstract = 	 {In fixed budget bandit identification, an algorithm sequentially observes samples from several distributions up to a given final time.It then answers a query about the set of distributions. A good algorithm will have a small probability of error.While that probability decreases exponentially with the final time, the best attainable rate is not known precisely for most identification tasks.We show that if a fixed budget task admits a complexity, defined as a lower bound on the probability of error which is attained by the same algorithm on all bandit problems, then that complexity is determined by the best non-adaptive sampling procedure for that problem.We show that there is no such complexity for several fixed budget identification tasks including Bernoulli best arm identification with two arms: there is no single algorithm that attains everywhere the best possible rate.}
}

Endnote

%0 Conference Paper
%T On the Existence of a Complexity in Fixed Budget Bandit Identification
%A Rémy Degenne
%B Proceedings of Thirty Sixth Conference on Learning Theory
%C Proceedings of Machine Learning Research
%D 2023
%E Gergely Neu
%E Lorenzo Rosasco	
%F pmlr-v195-degenne23a
%I PMLR
%P 1131--1154
%U https://proceedings.mlr.press/v195/degenne23a.html
%V 195
%X In fixed budget bandit identification, an algorithm sequentially observes samples from several distributions up to a given final time.It then answers a query about the set of distributions. A good algorithm will have a small probability of error.While that probability decreases exponentially with the final time, the best attainable rate is not known precisely for most identification tasks.We show that if a fixed budget task admits a complexity, defined as a lower bound on the probability of error which is attained by the same algorithm on all bandit problems, then that complexity is determined by the best non-adaptive sampling procedure for that problem.We show that there is no such complexity for several fixed budget identification tasks including Bernoulli best arm identification with two arms: there is no single algorithm that attains everywhere the best possible rate.

APA

Degenne, R.. (2023). On the Existence of a Complexity in Fixed Budget Bandit Identification. Proceedings of Thirty Sixth Conference on Learning Theory, in Proceedings of Machine Learning Research 195:1131-1154 Available from https://proceedings.mlr.press/v195/degenne23a.html.

Related Material

Download PDF