Hierarchies of Relaxations for Online Prediction Problems with Evolving Constraints

Alexander Rakhlin; Karthik Sridharan

Proceedings of The 28th Conference on Learning Theory, PMLR 40:1457-1479, 2015.

Abstract

We study online prediction where the regret of the algorithm is measured against a benchmark defined via evolving constraints. This framework captures online prediction on graphs, as well as other prediction problems with combinatorial structure. A key aspect here is that finding the optimal benchmark predictor (even in hindsight, given all the data) might be computationally hard due to the combinatorial nature of the constraints. Despite this, we provide polynomial-time prediction algorithms that achieve low regret against combinatorial benchmark sets. We do so by building improper learning algorithms based on two ideas that work together. The first is to alleviate part of the computational burden through random playout, and the second is to employ Lasserre semidefinite hierarchies to approximate the resulting integer program. Interestingly, for our prediction algorithms, we only need to compute the values of the semidefinite programs and not the rounded solutions. However, the integrality gap for the Lasserre hierarchy does enter the generic regret bound in terms of the Rademacher complexity of the benchmark set. This establishes a trade-off between the computation time and the regret bound of the algorithm.
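For readers unfamiliar with the term, regret against a benchmark set is commonly written as below. This is a generic sketch rather than the paper's own notation: the horizon $T$, loss $\ell$, predictions $\hat{y}_t$, outcomes $y_t$, and benchmark set $\mathcal{F}$ are all assumed symbols, and the abstract does not spell out the exact form of the evolving constraints that define $\mathcal{F}$.

```latex
% Generic regret against a benchmark set F (notation assumed, not taken from the paper):
% cumulative loss of the algorithm minus that of the best benchmark predictor in hindsight.
\[
  \mathrm{Reg}_T
  \;=\;
  \sum_{t=1}^{T} \ell(\hat{y}_t, y_t)
  \;-\;
  \inf_{f \in \mathcal{F}} \sum_{t=1}^{T} \ell(f_t, y_t)
\]
```

In this reading, the computational difficulty mentioned in the abstract sits in the infimum over $\mathcal{F}$, which may be a hard combinatorial optimization problem even when all outcomes are known.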

Cite this Paper

BibTeX


@InProceedings{pmlr-v40-Rakhlin15,
  title = 	 {Hierarchies of Relaxations for Online Prediction Problems with Evolving Constraints},
  author = 	 {Rakhlin, Alexander and Sridharan, Karthik},
  booktitle = 	 {Proceedings of The 28th Conference on Learning Theory},
  pages = 	 {1457--1479},
  year = 	 {2015},
  editor = 	 {Grünwald, Peter and Hazan, Elad and Kale, Satyen},
  volume = 	 {40},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Paris, France},
  month = 	 {03--06 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v40/Rakhlin15.pdf},
  url = 	 {https://proceedings.mlr.press/v40/Rakhlin15.html},
  abstract = 	 {We study online prediction where regret of the algorithm is measured against a benchmark defined via evolving constraints. This framework captures online prediction on graphs, as well as other prediction problems with combinatorial structure. A key aspect here is that finding the optimal benchmark predictor (even in hindsight, given all the data) might be computationally hard due to the combinatorial nature of the constraints. Despite this, we provide polynomial-time prediction algorithms that achieve low regret against combinatorial benchmark sets. We do so by building improper learning algorithms based on two ideas that work together. The first is to alleviate part of the computational burden through random playout, and the second is to employ Lasserre semidefinite hierarchies to approximate the resulting integer program. Interestingly, for our prediction algorithms, we only need to compute the values of the semidefinite programs and not the rounded solutions. However, the integrality gap for Lasserre hierarchy does enter the generic regret bound in terms of Rademacher complexity of the benchmark set. This establishes a trade-off between the computation time and the regret bound of the algorithm.}
}

Endnote

%0 Conference Paper
%T Hierarchies of Relaxations for Online Prediction Problems with Evolving Constraints
%A Alexander Rakhlin
%A Karthik Sridharan
%B Proceedings of The 28th Conference on Learning Theory
%C Proceedings of Machine Learning Research
%D 2015
%E Peter Grünwald
%E Elad Hazan
%E Satyen Kale
%F pmlr-v40-Rakhlin15
%I PMLR
%P 1457--1479
%U https://proceedings.mlr.press/v40/Rakhlin15.html
%V 40
%X We study online prediction where regret of the algorithm is measured against a benchmark defined via evolving constraints. This framework captures online prediction on graphs, as well as other prediction problems with combinatorial structure. A key aspect here is that finding the optimal benchmark predictor (even in hindsight, given all the data) might be computationally hard due to the combinatorial nature of the constraints. Despite this, we provide polynomial-time prediction algorithms that achieve low regret against combinatorial benchmark sets. We do so by building improper learning algorithms based on two ideas that work together. The first is to alleviate part of the computational burden through random playout, and the second is to employ Lasserre semidefinite hierarchies to approximate the resulting integer program. Interestingly, for our prediction algorithms, we only need to compute the values of the semidefinite programs and not the rounded solutions. However, the integrality gap for Lasserre hierarchy does enter the generic regret bound in terms of Rademacher complexity of the benchmark set. This establishes a trade-off between the computation time and the regret bound of the algorithm.

RIS


TY  - CPAPER
TI  - Hierarchies of Relaxations for Online Prediction Problems with Evolving Constraints
AU  - Alexander Rakhlin
AU  - Karthik Sridharan
BT  - Proceedings of The 28th Conference on Learning Theory
DA  - 2015/06/26
ED  - Peter Grünwald
ED  - Elad Hazan
ED  - Satyen Kale
ID  - pmlr-v40-Rakhlin15
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 40
SP  - 1457
EP  - 1479
L1  - http://proceedings.mlr.press/v40/Rakhlin15.pdf
UR  - https://proceedings.mlr.press/v40/Rakhlin15.html
AB  - We study online prediction where regret of the algorithm is measured against a benchmark defined via evolving constraints. This framework captures online prediction on graphs, as well as other prediction problems with combinatorial structure. A key aspect here is that finding the optimal benchmark predictor (even in hindsight, given all the data) might be computationally hard due to the combinatorial nature of the constraints. Despite this, we provide polynomial-time prediction algorithms that achieve low regret against combinatorial benchmark sets. We do so by building improper learning algorithms based on two ideas that work together. The first is to alleviate part of the computational burden through random playout, and the second is to employ Lasserre semidefinite hierarchies to approximate the resulting integer program. Interestingly, for our prediction algorithms, we only need to compute the values of the semidefinite programs and not the rounded solutions. However, the integrality gap for Lasserre hierarchy does enter the generic regret bound in terms of Rademacher complexity of the benchmark set. This establishes a trade-off between the computation time and the regret bound of the algorithm.
ER  -

APA


Rakhlin, A. & Sridharan, K. (2015). Hierarchies of Relaxations for Online Prediction Problems with Evolving Constraints. Proceedings of The 28th Conference on Learning Theory, in Proceedings of Machine Learning Research 40:1457-1479. Available from https://proceedings.mlr.press/v40/Rakhlin15.html.
