Open Problem: Regret Bounds for Noise-Free Kernel-Based Bandits

Sattar Vakili

Open Problem: Regret Bounds for Noise-Free Kernel-Based Bandits

Sattar Vakili

Proceedings of Thirty Fifth Conference on Learning Theory, PMLR 178:5624-5629, 2022.

Abstract

Kernel-based bandit is an extensively studied black-box optimization problem, in which the objective function is assumed to live in a known reproducing kernel Hilbert space. While nearly optimal regret bounds (up to logarithmic factors) are established in the noisy setting, surprisingly, less is known about the noise-free setting (when the exact values of the underlying function is accessible without observation noise). We discuss several upper bounds on regret; none of which seem order optimal, and provide a conjecture on the order optimal regret bound.

Cite this Paper

BibTeX


@InProceedings{pmlr-v178-open-problem-vakili22a,
  title = 	 {Open Problem: Regret Bounds for Noise-Free Kernel-Based Bandits},
  author =       {Vakili, Sattar},
  booktitle = 	 {Proceedings of Thirty Fifth Conference on Learning Theory},
  pages = 	 {5624--5629},
  year = 	 {2022},
  editor = 	 {Loh, Po-Ling and Raginsky, Maxim},
  volume = 	 {178},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {02--05 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v178/open-problem-vakili22a/open-problem-vakili22a.pdf},
  url = 	 {https://proceedings.mlr.press/v178/open-problem-vakili22a.html},
  abstract = 	 {Kernel-based bandit is an extensively studied black-box optimization problem, in which the objective function is assumed to live in a known reproducing kernel Hilbert space. While nearly optimal regret bounds (up to logarithmic factors) are established in the noisy setting, surprisingly, less is known about the noise-free setting (when the exact values of the underlying function is accessible without observation noise). We discuss several upper bounds on regret; none of which seem order optimal, and provide a conjecture on the order optimal regret bound.}
}

Endnote

%0 Conference Paper
%T Open Problem: Regret Bounds for Noise-Free Kernel-Based Bandits
%A Sattar Vakili
%B Proceedings of Thirty Fifth Conference on Learning Theory
%C Proceedings of Machine Learning Research
%D 2022
%E Po-Ling Loh
%E Maxim Raginsky	
%F pmlr-v178-open-problem-vakili22a
%I PMLR
%P 5624--5629
%U https://proceedings.mlr.press/v178/open-problem-vakili22a.html
%V 178
%X Kernel-based bandit is an extensively studied black-box optimization problem, in which the objective function is assumed to live in a known reproducing kernel Hilbert space. While nearly optimal regret bounds (up to logarithmic factors) are established in the noisy setting, surprisingly, less is known about the noise-free setting (when the exact values of the underlying function is accessible without observation noise). We discuss several upper bounds on regret; none of which seem order optimal, and provide a conjecture on the order optimal regret bound.

APA


Vakili, S.. (2022). Open Problem: Regret Bounds for Noise-Free Kernel-Based Bandits. Proceedings of Thirty Fifth Conference on Learning Theory, in Proceedings of Machine Learning Research 178:5624-5629 Available from https://proceedings.mlr.press/v178/open-problem-vakili22a.html.

Related Material

Download PDF