The Hedge Algorithm on a Continuum

Walid Krichene; Maximilian Balandat; Claire Tomlin; Alexandre Bayen

The Hedge Algorithm on a Continuum

Walid Krichene, Maximilian Balandat, Claire Tomlin, Alexandre Bayen

Proceedings of the 32nd International Conference on Machine Learning, PMLR 37:824-832, 2015.

Abstract

We consider an online optimization problem on a subset S of R^n (not necessarily convex), in which a decision maker chooses, at each iteration t, a probability distribution x^(t) over S, and seeks to minimize a cumulative expected loss, where each loss is a Lipschitz function revealed at the end of iteration t. Building on previous work, we propose a generalized Hedge algorithm and show a O(\sqrtt \log t) bound on the regret when the losses are uniformly Lipschitz and S is uniformly fat (a weaker condition than convexity). Finally, we propose a generalization to the dual averaging method on the set of Lebesgue-continuous distributions over S.

Cite this Paper

BibTeX


@InProceedings{pmlr-v37-krichene15,
  title = 	 {The Hedge Algorithm on a Continuum},
  author = 	 {Krichene, Walid and Balandat, Maximilian and Tomlin, Claire and Bayen, Alexandre},
  booktitle = 	 {Proceedings of the 32nd International Conference on Machine Learning},
  pages = 	 {824--832},
  year = 	 {2015},
  editor = 	 {Bach, Francis and Blei, David},
  volume = 	 {37},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Lille, France},
  month = 	 {07--09 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v37/krichene15.pdf},
  url = 	 {https://proceedings.mlr.press/v37/krichene15.html},
  abstract = 	 {We consider an online optimization problem on a subset S of R^n (not necessarily convex), in which a decision maker chooses, at each iteration t, a probability distribution x^(t) over S, and seeks to minimize a cumulative expected loss, where each loss is a Lipschitz function revealed at the end of iteration t. Building on previous work, we propose a generalized Hedge algorithm and show a O(\sqrtt \log t) bound on the regret when the losses are uniformly Lipschitz and S is uniformly fat (a weaker condition than convexity). Finally, we propose a generalization to the dual averaging method on the set of Lebesgue-continuous distributions over S.}
}

Endnote

%0 Conference Paper
%T The Hedge Algorithm on a Continuum
%A Walid Krichene
%A Maximilian Balandat
%A Claire Tomlin
%A Alexandre Bayen
%B Proceedings of the 32nd International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2015
%E Francis Bach
%E David Blei	
%F pmlr-v37-krichene15
%I PMLR
%P 824--832
%U https://proceedings.mlr.press/v37/krichene15.html
%V 37
%X We consider an online optimization problem on a subset S of R^n (not necessarily convex), in which a decision maker chooses, at each iteration t, a probability distribution x^(t) over S, and seeks to minimize a cumulative expected loss, where each loss is a Lipschitz function revealed at the end of iteration t. Building on previous work, we propose a generalized Hedge algorithm and show a O(\sqrtt \log t) bound on the regret when the losses are uniformly Lipschitz and S is uniformly fat (a weaker condition than convexity). Finally, we propose a generalization to the dual averaging method on the set of Lebesgue-continuous distributions over S.

RIS


TY  - CPAPER
TI  - The Hedge Algorithm on a Continuum
AU  - Walid Krichene
AU  - Maximilian Balandat
AU  - Claire Tomlin
AU  - Alexandre Bayen
BT  - Proceedings of the 32nd International Conference on Machine Learning
DA  - 2015/06/01
ED  - Francis Bach
ED  - David Blei	
ID  - pmlr-v37-krichene15
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 37
SP  - 824
EP  - 832
L1  - http://proceedings.mlr.press/v37/krichene15.pdf
UR  - https://proceedings.mlr.press/v37/krichene15.html
AB  - We consider an online optimization problem on a subset S of R^n (not necessarily convex), in which a decision maker chooses, at each iteration t, a probability distribution x^(t) over S, and seeks to minimize a cumulative expected loss, where each loss is a Lipschitz function revealed at the end of iteration t. Building on previous work, we propose a generalized Hedge algorithm and show a O(\sqrtt \log t) bound on the regret when the losses are uniformly Lipschitz and S is uniformly fat (a weaker condition than convexity). Finally, we propose a generalization to the dual averaging method on the set of Lebesgue-continuous distributions over S.
ER  -

APA


Krichene, W., Balandat, M., Tomlin, C. & Bayen, A.. (2015). The Hedge Algorithm on a Continuum. Proceedings of the 32nd International Conference on Machine Learning, in Proceedings of Machine Learning Research 37:824-832 Available from https://proceedings.mlr.press/v37/krichene15.html.

The Hedge Algorithm on a Continuum

Abstract

Cite this Paper

Related Material