No-Regret Algorithms for Safe Bayesian Optimization with Monotonicity Constraints

Arpan Losalka; Jonathan Scarlett

Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, PMLR 238:3232-3240, 2024.

Abstract

We consider the problem of sequentially maximizing an unknown function $f$ over a set of actions of the form $(s, x)$, where the selected actions must satisfy a safety constraint with respect to an unknown safety function $g$. We model $f$ and $g$ as lying in a reproducing kernel Hilbert space (RKHS), which facilitates the use of Gaussian process methods. While existing works for this setting have provided algorithms that are guaranteed to identify a near-optimal safe action, the problem of attaining low cumulative regret has remained largely unexplored, with a key challenge being that expanding the safe region can incur high regret. To address this challenge, we show that if $g$ is monotone with respect to just the single variable $s$ (with no such constraint on $f$), sublinear regret becomes achievable with our proposed algorithm. In addition, we show that a modified version of our algorithm is able to attain sublinear regret (for suitably defined notions of regret) for the task of finding a near-optimal $s$ corresponding to every $x$, as opposed to only finding the global safe optimum. Our findings are supported with empirical evaluations on various objective and safety functions.

Cite this Paper

BibTeX

@InProceedings{pmlr-v238-losalka24a,
  title = 	 {No-Regret Algorithms for Safe {B}ayesian Optimization with Monotonicity Constraints},
  author =       {Losalka, Arpan and Scarlett, Jonathan},
  booktitle = 	 {Proceedings of The 27th International Conference on Artificial Intelligence and Statistics},
  pages = 	 {3232--3240},
  year = 	 {2024},
  editor = 	 {Dasgupta, Sanjoy and Mandt, Stephan and Li, Yingzhen},
  volume = 	 {238},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {02--04 May},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v238/losalka24a/losalka24a.pdf},
  url = 	 {https://proceedings.mlr.press/v238/losalka24a.html},
  abstract = 	 {We consider the problem of sequentially maximizing an unknown function $f$ over a set of actions of the form $(s, x)$, where the selected actions must satisfy a safety constraint with respect to an unknown safety function $g$. We model $f$ and $g$ as lying in a reproducing kernel Hilbert space (RKHS), which facilitates the use of Gaussian process methods. While existing works for this setting have provided algorithms that are guaranteed to identify a near-optimal safe action, the problem of attaining low cumulative regret has remained largely unexplored, with a key challenge being that expanding the safe region can incur high regret. To address this challenge, we show that if $g$ is monotone with respect to just the single variable $s$ (with no such constraint on $f$), sublinear regret becomes achievable with our proposed algorithm. In addition, we show that a modified version of our algorithm is able to attain sublinear regret (for suitably defined notions of regret) for the task of finding a near-optimal $s$ corresponding to every $x$, as opposed to only finding the global safe optimum. Our findings are supported with empirical evaluations on various objective and safety functions.}
}

Endnote

%0 Conference Paper
%T No-Regret Algorithms for Safe Bayesian Optimization with Monotonicity Constraints
%A Arpan Losalka
%A Jonathan Scarlett
%B Proceedings of The 27th International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2024
%E Sanjoy Dasgupta
%E Stephan Mandt
%E Yingzhen Li
%F pmlr-v238-losalka24a
%I PMLR
%P 3232--3240
%U https://proceedings.mlr.press/v238/losalka24a.html
%V 238
%X We consider the problem of sequentially maximizing an unknown function $f$ over a set of actions of the form $(s, x)$, where the selected actions must satisfy a safety constraint with respect to an unknown safety function $g$. We model $f$ and $g$ as lying in a reproducing kernel Hilbert space (RKHS), which facilitates the use of Gaussian process methods. While existing works for this setting have provided algorithms that are guaranteed to identify a near-optimal safe action, the problem of attaining low cumulative regret has remained largely unexplored, with a key challenge being that expanding the safe region can incur high regret. To address this challenge, we show that if $g$ is monotone with respect to just the single variable $s$ (with no such constraint on $f$), sublinear regret becomes achievable with our proposed algorithm. In addition, we show that a modified version of our algorithm is able to attain sublinear regret (for suitably defined notions of regret) for the task of finding a near-optimal $s$ corresponding to every $x$, as opposed to only finding the global safe optimum. Our findings are supported with empirical evaluations on various objective and safety functions.

APA

Losalka, A. & Scarlett, J. (2024). No-Regret Algorithms for Safe Bayesian Optimization with Monotonicity Constraints. Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 238:3232-3240. Available from https://proceedings.mlr.press/v238/losalka24a.html.
