Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo

Haoyang Zheng; Wei Deng; Christian Moya; Guang Lin

Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo

Haoyang Zheng, Wei Deng, Christian Moya, Guang Lin

Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, PMLR 238:2611-2619, 2024.

Abstract

Approximate Thompson sampling with Langevin Monte Carlo broadens its reach from Gaussian posterior sampling to encompass more general smooth posteriors. However, it still encounters scalability issues in high-dimensional problems when demanding high accuracy. To address this, we propose an approximate Thompson sampling strategy, utilizing underdamped Langevin Monte Carlo, where the latter is the go-to workhorse for simulations of high-dimensional posteriors. Based on the standard smoothness and log-concavity conditions, we study the accelerated posterior concentration and sampling using a specific potential function. This design improves the sample complexity for realizing logarithmic regrets from

$\mathcal{\tilde O}(d)$ to

$\mathcal{\tilde O}(\sqrt{d})$ . The scalability and robustness of our algorithm are also empirically validated through synthetic experiments in high-dimensional bandit problems.

Cite this Paper

BibTeX

@InProceedings{pmlr-v238-zheng24b,
  title = 	 {Accelerating Approximate {T}hompson Sampling with Underdamped {L}angevin {M}onte {C}arlo},
  author =       {Zheng, Haoyang and Deng, Wei and Moya, Christian and Lin, Guang},
  booktitle = 	 {Proceedings of The 27th International Conference on Artificial Intelligence and Statistics},
  pages = 	 {2611--2619},
  year = 	 {2024},
  editor = 	 {Dasgupta, Sanjoy and Mandt, Stephan and Li, Yingzhen},
  volume = 	 {238},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {02--04 May},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v238/zheng24b/zheng24b.pdf},
  url = 	 {https://proceedings.mlr.press/v238/zheng24b.html},
  abstract = 	 {Approximate Thompson sampling with Langevin Monte Carlo broadens its reach from Gaussian posterior sampling to encompass more general smooth posteriors. However, it still encounters scalability issues in high-dimensional problems when demanding high accuracy. To address this, we propose an approximate Thompson sampling strategy, utilizing underdamped Langevin Monte Carlo, where the latter is the go-to workhorse for simulations of high-dimensional posteriors. Based on the standard smoothness and log-concavity conditions, we study the accelerated posterior concentration and sampling using a specific potential function. This design improves the sample complexity for realizing logarithmic regrets from $\mathcal{\tilde O}(d)$ to $\mathcal{\tilde O}(\sqrt{d})$. The scalability and robustness of our algorithm are also empirically validated through synthetic experiments in high-dimensional bandit problems.}
}

Endnote

%0 Conference Paper
%T Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo
%A Haoyang Zheng
%A Wei Deng
%A Christian Moya
%A Guang Lin
%B Proceedings of The 27th International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2024
%E Sanjoy Dasgupta
%E Stephan Mandt
%E Yingzhen Li	
%F pmlr-v238-zheng24b
%I PMLR
%P 2611--2619
%U https://proceedings.mlr.press/v238/zheng24b.html
%V 238
%X Approximate Thompson sampling with Langevin Monte Carlo broadens its reach from Gaussian posterior sampling to encompass more general smooth posteriors. However, it still encounters scalability issues in high-dimensional problems when demanding high accuracy. To address this, we propose an approximate Thompson sampling strategy, utilizing underdamped Langevin Monte Carlo, where the latter is the go-to workhorse for simulations of high-dimensional posteriors. Based on the standard smoothness and log-concavity conditions, we study the accelerated posterior concentration and sampling using a specific potential function. This design improves the sample complexity for realizing logarithmic regrets from $\mathcal{\tilde O}(d)$ to $\mathcal{\tilde O}(\sqrt{d})$. The scalability and robustness of our algorithm are also empirically validated through synthetic experiments in high-dimensional bandit problems.

APA

Zheng, H., Deng, W., Moya, C. & Lin, G.. (2024). Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo. Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 238:2611-2619 Available from https://proceedings.mlr.press/v238/zheng24b.html.

Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo

Abstract

Cite this Paper

Related Material