Simple Bayesian Algorithms for Best Arm Identification

Daniel Russo

29th Annual Conference on Learning Theory, PMLR 49:1417-1418, 2016.

Abstract

This paper considers the optimal adaptive allocation of measurement effort for identifying the best among a finite set of options or designs. An experimenter sequentially chooses designs to measure and observes noisy signals of their quality with the goal of confidently identifying the best design after a small number of measurements. I propose three simple Bayesian algorithms for adaptively allocating measurement effort. One is Top-Two Probability sampling, which computes the two designs with the highest posterior probability of being optimal, and then randomizes to select among these two. One is a variant of top-two sampling which considers not only the probability a design is optimal, but the expected amount by which its quality exceeds that of other designs. The final algorithm is a modified version of Thompson sampling that is tailored for identifying the best design. I prove that these simple algorithms satisfy a strong optimality property. In a frequentist setting where the true quality of the designs is fixed, one hopes the posterior definitively identifies the optimal design, in the sense that the posterior probability assigned to the event that some other design is optimal converges to zero as measurements are collected. I show that under the proposed algorithms this convergence occurs at an exponential rate, and the corresponding exponent is the best possible among all allocation rules.
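
The Top-Two Probability sampling rule described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation. It assumes Bernoulli outcomes with independent Beta(1, 1) priors, estimates each design's posterior probability of being optimal by Monte Carlo, and uses a hypothetical parameter `beta` for the probability of measuring the top-ranked design. All function names are illustrative.

```python
import random


def optimality_probabilities(successes, failures, n_samples=500, rng=random):
    """Monte Carlo estimate of each design's posterior probability of being best.

    Assumes Bernoulli outcomes and independent Beta(1, 1) priors
    (an illustrative modelling choice, not taken from the paper).
    """
    k = len(successes)
    wins = [0] * k
    for _ in range(n_samples):
        # One joint posterior draw of all design qualities.
        draws = [rng.betavariate(1 + successes[i], 1 + failures[i]) for i in range(k)]
        wins[max(range(k), key=draws.__getitem__)] += 1
    return [w / n_samples for w in wins]


def top_two_probability_choice(successes, failures, beta=0.5, rng=random):
    """Choose the next design to measure.

    Ranks designs by estimated probability of being optimal, then picks the
    top one with probability beta and the runner-up otherwise. The name and
    default value of beta are assumptions made for this sketch.
    """
    probs = optimality_probabilities(successes, failures, rng=rng)
    ranked = sorted(range(len(probs)), key=probs.__getitem__, reverse=True)
    best, runner_up = ranked[0], ranked[1]
    return best if rng.random() < beta else runner_up


def run(true_means, n_rounds=200, seed=0):
    """Simulate the allocation rule on designs with fixed (frequentist) means."""
    rng = random.Random(seed)
    k = len(true_means)
    successes, failures = [0] * k, [0] * k
    for _ in range(n_rounds):
        arm = top_two_probability_choice(successes, failures, rng=rng)
        if rng.random() < true_means[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
    return successes, failures, optimality_probabilities(successes, failures, rng=rng)
```

Calling `run([0.2, 0.5, 0.8])` returns the per-design success and failure counts together with the final estimated optimality probabilities. Under the abstract's claim, the probability mass on suboptimal designs should shrink as measurements accumulate, although this toy simulation does not verify the rate.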

Cite this Paper

BibTeX


@InProceedings{pmlr-v49-russo16,
  title = 	 {Simple Bayesian Algorithms for Best Arm Identification},
  author = 	 {Russo, Daniel},
  booktitle = 	 {29th Annual Conference on Learning Theory},
  pages = 	 {1417--1418},
  year = 	 {2016},
  editor = 	 {Feldman, Vitaly and Rakhlin, Alexander and Shamir, Ohad},
  volume = 	 {49},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Columbia University, New York, New York, USA},
  month = 	 {23--26 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v49/russo16.pdf},
  url = 	 {https://proceedings.mlr.press/v49/russo16.html},
  abstract = 	 {This paper considers the optimal adaptive allocation of measurement effort for identifying the best among a finite set of options or designs. An experimenter sequentially chooses designs to measure and observes noisy signals of their quality with the goal of confidently identifying the best design after a small number of measurements. I propose three simple Bayesian algorithms for adaptively allocating measurement effort. One is Top-Two Probability sampling, which computes the two designs with the highest posterior probability of being optimal, and then randomizes to select among these two. One is a variant of top-two sampling which considers not only the probability a design is optimal, but the expected amount by which its quality exceeds that of other designs. The final algorithm is a modified version of Thompson sampling that is tailored for identifying the best design. I prove that these simple algorithms satisfy a strong optimality property. In a frequentist setting where the true quality of the designs is fixed, one hopes the posterior definitively identifies the optimal design, in the sense that the posterior probability assigned to the event that some other design is optimal converges to zero as measurements are collected. I show that under the proposed algorithms this convergence occurs at an \emph{exponential} rate, and the corresponding exponent is the best possible among all allocation rules.}
}

Endnote

%0 Conference Paper
%T Simple Bayesian Algorithms for Best Arm Identification
%A Daniel Russo
%B 29th Annual Conference on Learning Theory
%C Proceedings of Machine Learning Research
%D 2016
%E Vitaly Feldman
%E Alexander Rakhlin
%E Ohad Shamir
%F pmlr-v49-russo16
%I PMLR
%P 1417--1418
%U https://proceedings.mlr.press/v49/russo16.html
%V 49
%X This paper considers the optimal adaptive allocation of measurement effort for identifying the best among a finite set of options or designs. An experimenter sequentially chooses designs to measure and observes noisy signals of their quality with the goal of confidently identifying the best design after a small number of measurements. I propose three simple Bayesian algorithms for adaptively allocating measurement effort. One is Top-Two Probability sampling, which computes the two designs with the highest posterior probability of being optimal, and then randomizes to select among these two. One is a variant of top-two sampling which considers not only the probability a design is optimal, but the expected amount by which its quality exceeds that of other designs. The final algorithm is a modified version of Thompson sampling that is tailored for identifying the best design. I prove that these simple algorithms satisfy a strong optimality property. In a frequentist setting where the true quality of the designs is fixed, one hopes the posterior definitively identifies the optimal design, in the sense that the posterior probability assigned to the event that some other design is optimal converges to zero as measurements are collected. I show that under the proposed algorithms this convergence occurs at an exponential rate, and the corresponding exponent is the best possible among all allocation rules.

RIS


TY  - CPAPER
TI  - Simple Bayesian Algorithms for Best Arm Identification
AU  - Daniel Russo
BT  - 29th Annual Conference on Learning Theory
DA  - 2016/06/06
ED  - Vitaly Feldman
ED  - Alexander Rakhlin
ED  - Ohad Shamir
ID  - pmlr-v49-russo16
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 49
SP  - 1417
EP  - 1418
L1  - http://proceedings.mlr.press/v49/russo16.pdf
UR  - https://proceedings.mlr.press/v49/russo16.html
AB  - This paper considers the optimal adaptive allocation of measurement effort for identifying the best among a finite set of options or designs. An experimenter sequentially chooses designs to measure and observes noisy signals of their quality with the goal of confidently identifying the best design after a small number of measurements. I propose three simple Bayesian algorithms for adaptively allocating measurement effort. One is Top-Two Probability sampling, which computes the two designs with the highest posterior probability of being optimal, and then randomizes to select among these two. One is a variant of top-two sampling which considers not only the probability a design is optimal, but the expected amount by which its quality exceeds that of other designs. The final algorithm is a modified version of Thompson sampling that is tailored for identifying the best design. I prove that these simple algorithms satisfy a strong optimality property. In a frequentist setting where the true quality of the designs is fixed, one hopes the posterior definitively identifies the optimal design, in the sense that the posterior probability assigned to the event that some other design is optimal converges to zero as measurements are collected. I show that under the proposed algorithms this convergence occurs at an exponential rate, and the corresponding exponent is the best possible among all allocation rules.
ER  -

APA


Russo, D. (2016). Simple Bayesian Algorithms for Best Arm Identification. 29th Annual Conference on Learning Theory, in Proceedings of Machine Learning Research 49:1417-1418. Available from https://proceedings.mlr.press/v49/russo16.html.
