New Projection-free Algorithms for Online Convex Optimization with Adaptive Regret Guarantees

Dan Garber, Ben Kretzu
Proceedings of Thirty Fifth Conference on Learning Theory, PMLR 178:2326-2359, 2022.

Abstract

We present new efficient \emph{projection-free} algorithms for online convex optimization (OCO), where by projection-free we refer to algorithms that avoid computing orthogonal projections onto the feasible set, and instead rely on different, and potentially much more efficient, oracles. While most state-of-the-art projection-free algorithms are based on the \emph{follow-the-leader} framework, our algorithms are fundamentally different and are based on the \emph{online gradient descent} algorithm with a novel and efficient approach to computing so-called \emph{infeasible projections}. As a consequence, we obtain the first projection-free algorithms which naturally yield \emph{adaptive regret} guarantees, i.e., regret bounds that hold w.r.t. any sub-interval of the sequence. Concretely, when assuming the availability of a linear optimization oracle (LOO) for the feasible set, on a sequence of length $T$, our algorithms guarantee $O(T^{3/4})$ adaptive regret and $O(T^{3/4})$ adaptive expected regret, for the full-information and bandit settings, respectively, using only $O(T)$ calls to the LOO. These bounds match the current state-of-the-art regret bounds for LOO-based projection-free OCO, which are \emph{not adaptive}. We also consider a new natural setting in which the feasible set is accessible through a separation oracle. We present algorithms which, using overall $O(T)$ calls to the separation oracle, guarantee $O(\sqrt{T})$ adaptive regret and $O(T^{3/4})$ adaptive expected regret for the full-information and bandit settings, respectively.
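For concreteness, adaptive regret here is the standard notion of Hazan and Seshadhri (2009): the worst-case regret over any contiguous sub-interval of the horizon. In the notation below (ours, not necessarily the paper's), $f_1,\dots,f_T$ are the loss functions, $x_1,\dots,x_T$ the iterates, and $\mathcal{K}$ the feasible set:

\[
\mathrm{AdaptiveRegret}(T) \;=\; \max_{1 \le s \le e \le T}\left( \sum_{t=s}^{e} f_t(x_t) \;-\; \min_{x \in \mathcal{K}} \sum_{t=s}^{e} f_t(x) \right).
\]

To make the algorithmic template concrete, the following is a minimal Python sketch of online gradient descent in which each exact orthogonal projection is replaced by an approximate projection computed using only LOO calls, here via a few Frank-Wolfe iterations on the squared distance to the post-gradient-step point. This illustrates the general LOO-based idea, not the paper's actual infeasible-projection routine: the feasible set (the unit Euclidean ball), the helper names (linear_opt_oracle, fw_approx_projection, ogd_with_loo), the step sizes, and the stopping tolerance are all our assumptions; the paper's analysis uses a carefully budgeted variant to obtain $O(T^{3/4})$ adaptive regret with $O(T)$ total LOO calls.

import numpy as np

def linear_opt_oracle(g):
    # LOO for the unit Euclidean ball (a stand-in feasible set K):
    # returns argmin_{x in K} <g, x>.
    n = np.linalg.norm(g)
    return -g / n if n > 0 else np.zeros_like(g)

def fw_approx_projection(y, x0, loo, eps=1e-3, max_iters=50):
    # Frank-Wolfe on f(x) = 0.5 * ||x - y||^2 over K, using only LOO calls.
    # Starting from a feasible x0, every iterate stays in K; we stop once
    # the Frank-Wolfe gap (an upper bound on f(x) - min f) drops below eps.
    x = x0
    for t in range(max_iters):
        grad = x - y
        v = loo(grad)                      # one LOO call per iteration
        gap = grad @ (x - v)
        if gap <= eps:
            break
        x = x + (2.0 / (t + 2)) * (v - x)  # classic FW step size
    return x

def ogd_with_loo(gradients, x0, loo, D=2.0, G=1.0):
    # Online gradient descent where each exact projection is replaced by
    # the LOO-based approximate projection above, warm-started at the
    # previous (feasible) iterate.
    x, iterates = x0, []
    for t, g in enumerate(gradients, start=1):
        iterates.append(x)
        eta = D / (G * np.sqrt(t))         # standard OGD step size
        y = x - eta * g                    # gradient step; y may leave K
        x = fw_approx_projection(y, x, loo)
    return iterates

# Example: T rounds of linear losses f_t(x) = <g_t, x> in R^5.
rng = np.random.default_rng(0)
T, d = 1000, 5
grads = [rng.standard_normal(d) for _ in range(T)]
xs = ogd_with_loo(grads, np.zeros(d), linear_opt_oracle)

The design point this sketch highlights is the one the abstract emphasizes: each round costs only a handful of linear optimizations rather than an exact projection, which is what makes the approach efficient on sets where projections are expensive but linear optimization is cheap.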

Cite this Paper


BibTeX
@InProceedings{pmlr-v178-garber22a,
  title     = {New Projection-free Algorithms for Online Convex Optimization with Adaptive Regret Guarantees},
  author    = {Garber, Dan and Kretzu, Ben},
  booktitle = {Proceedings of Thirty Fifth Conference on Learning Theory},
  pages     = {2326--2359},
  year      = {2022},
  editor    = {Loh, Po-Ling and Raginsky, Maxim},
  volume    = {178},
  series    = {Proceedings of Machine Learning Research},
  month     = {02--05 Jul},
  publisher = {PMLR},
  pdf       = {https://proceedings.mlr.press/v178/garber22a/garber22a.pdf},
  url       = {https://proceedings.mlr.press/v178/garber22a.html}
}
Endnote
%0 Conference Paper
%T New Projection-free Algorithms for Online Convex Optimization with Adaptive Regret Guarantees
%A Dan Garber
%A Ben Kretzu
%B Proceedings of Thirty Fifth Conference on Learning Theory
%C Proceedings of Machine Learning Research
%D 2022
%E Po-Ling Loh
%E Maxim Raginsky
%F pmlr-v178-garber22a
%I PMLR
%P 2326--2359
%U https://proceedings.mlr.press/v178/garber22a.html
%V 178
APA
Garber, D. & Kretzu, B. (2022). New Projection-free Algorithms for Online Convex Optimization with Adaptive Regret Guarantees. Proceedings of Thirty Fifth Conference on Learning Theory, in Proceedings of Machine Learning Research 178:2326-2359. Available from https://proceedings.mlr.press/v178/garber22a.html.