Prediction by random-walk perturbation

Luc Devroye; Gábor Lugosi; Gergely Neu

Prediction by random-walk perturbation

Luc Devroye, Gábor Lugosi, Gergely Neu

Proceedings of the 26th Annual Conference on Learning Theory, PMLR 30:460-473, 2013.

Abstract

We propose a version of the follow-the-perturbed-leader online prediction algorithm in which the cumulative losses are perturbed by independent symmetric random walks. The forecaster is shown to achieve an expected regret of the optimal order O(\sqrtn \log N) where n is the time horizon and N is the number of experts. More importantly, it is shown that the forecaster changes its prediction at most O(\sqrtn \log N) times, in expectation. We also extend the analysis to online combinatorial optimization and show that even in this more general setting, the forecaster rarely switches between experts while having a regret of near-optimal order.

Cite this Paper

BibTeX


@InProceedings{pmlr-v30-Devroye13,
  title = 	 {Prediction by random-walk perturbation},
  author = 	 {Devroye, Luc and Lugosi, Gábor and Neu, Gergely},
  booktitle = 	 {Proceedings of the 26th Annual Conference on Learning Theory},
  pages = 	 {460--473},
  year = 	 {2013},
  editor = 	 {Shalev-Shwartz, Shai and Steinwart, Ingo},
  volume = 	 {30},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Princeton, NJ, USA},
  month = 	 {12--14 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v30/Devroye13.pdf},
  url = 	 {https://proceedings.mlr.press/v30/Devroye13.html},
  abstract = 	 {We propose a version of the follow-the-perturbed-leader online prediction algorithm in which the cumulative losses are perturbed by independent symmetric random walks. The forecaster is shown to achieve an expected regret of the optimal order O(\sqrtn \log N) where n is the time horizon and N is the number of experts. More importantly, it is shown that the forecaster changes its prediction at most O(\sqrtn \log N) times, in expectation. We also extend the analysis to online combinatorial optimization and show that even in this more general setting, the forecaster rarely switches between experts while having a regret of near-optimal order. }
}

Endnote

%0 Conference Paper
%T Prediction by random-walk perturbation
%A Luc Devroye
%A Gábor Lugosi
%A Gergely Neu
%B Proceedings of the 26th Annual Conference on Learning Theory
%C Proceedings of Machine Learning Research
%D 2013
%E Shai Shalev-Shwartz
%E Ingo Steinwart	
%F pmlr-v30-Devroye13
%I PMLR
%P 460--473
%U https://proceedings.mlr.press/v30/Devroye13.html
%V 30
%X We propose a version of the follow-the-perturbed-leader online prediction algorithm in which the cumulative losses are perturbed by independent symmetric random walks. The forecaster is shown to achieve an expected regret of the optimal order O(\sqrtn \log N) where n is the time horizon and N is the number of experts. More importantly, it is shown that the forecaster changes its prediction at most O(\sqrtn \log N) times, in expectation. We also extend the analysis to online combinatorial optimization and show that even in this more general setting, the forecaster rarely switches between experts while having a regret of near-optimal order.

RIS


TY  - CPAPER
TI  - Prediction by random-walk perturbation
AU  - Luc Devroye
AU  - Gábor Lugosi
AU  - Gergely Neu
BT  - Proceedings of the 26th Annual Conference on Learning Theory
DA  - 2013/06/13
ED  - Shai Shalev-Shwartz
ED  - Ingo Steinwart	
ID  - pmlr-v30-Devroye13
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 30
SP  - 460
EP  - 473
L1  - http://proceedings.mlr.press/v30/Devroye13.pdf
UR  - https://proceedings.mlr.press/v30/Devroye13.html
AB  - We propose a version of the follow-the-perturbed-leader online prediction algorithm in which the cumulative losses are perturbed by independent symmetric random walks. The forecaster is shown to achieve an expected regret of the optimal order O(\sqrtn \log N) where n is the time horizon and N is the number of experts. More importantly, it is shown that the forecaster changes its prediction at most O(\sqrtn \log N) times, in expectation. We also extend the analysis to online combinatorial optimization and show that even in this more general setting, the forecaster rarely switches between experts while having a regret of near-optimal order. 
ER  -

APA


Devroye, L., Lugosi, G. & Neu, G.. (2013). Prediction by random-walk perturbation. Proceedings of the 26th Annual Conference on Learning Theory, in Proceedings of Machine Learning Research 30:460-473 Available from https://proceedings.mlr.press/v30/Devroye13.html.

Related Material

Download PDF