Online learning in repeated auctions

Jonathan Weed; Vianney Perchet; Philippe Rigollet

Online learning in repeated auctions

Jonathan Weed, Vianney Perchet, Philippe Rigollet

29th Annual Conference on Learning Theory, PMLR 49:1562-1583, 2016.

Abstract

Motivated by online advertising auctions, we consider repeated Vickrey auctions where goods of unknown value are sold sequentially and bidders only learn (potentially noisy) information about a good’s value once it is purchased. We adopt an online learning approach with bandit feedback to model this problem and derive bidding strategies for two models: stochastic and adversarial. In the stochastic model, the observed values of the goods are random variables centered around the true value of the good. In this case, logarithmic regret is achievable when competing against well behaved adversaries. In the adversarial model, the goods need not be identical. Comparing our performance against that of the best fixed bid in hindsight, we show that sublinear regret is also achievable in this case. For both the stochastic and adversarial models, we prove matching minimax lower bounds showing our strategies to be optimal up to lower-order terms. To our knowledge, this is the first complete set of strategies for bidders participating in auctions of this type.

Cite this Paper

BibTeX


@InProceedings{pmlr-v49-weed16,
  title = 	 {Online learning in repeated auctions},
  author = 	 {Weed, Jonathan and Perchet, Vianney and Rigollet, Philippe},
  booktitle = 	 {29th Annual Conference on Learning Theory},
  pages = 	 {1562--1583},
  year = 	 {2016},
  editor = 	 {Feldman, Vitaly and Rakhlin, Alexander and Shamir, Ohad},
  volume = 	 {49},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Columbia University, New York, New York, USA},
  month = 	 {23--26 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v49/weed16.pdf},
  url = 	 {https://proceedings.mlr.press/v49/weed16.html},
  abstract = 	 {Motivated by online advertising auctions, we consider repeated Vickrey auctions where goods of unknown value are sold sequentially and bidders only learn (potentially noisy) information about a good’s value once it is purchased. We adopt an online learning approach with bandit feedback to  model this problem and derive bidding strategies for two models: stochastic and adversarial. In the stochastic model, the observed values of the goods are  random variables centered around the true value of the good. In this case, logarithmic regret is achievable when competing against well behaved adversaries. In the adversarial model, the goods need not be identical. Comparing our performance against that of the best fixed bid in hindsight, we show that sublinear regret is also achievable in this case. For both the stochastic and adversarial models, we prove matching minimax lower bounds showing our strategies to be optimal up to lower-order terms. To our knowledge, this is the first complete set of strategies for bidders participating in auctions of this type.}
}

Endnote

%0 Conference Paper
%T Online learning in repeated auctions
%A Jonathan Weed
%A Vianney Perchet
%A Philippe Rigollet
%B 29th Annual Conference on Learning Theory
%C Proceedings of Machine Learning Research
%D 2016
%E Vitaly Feldman
%E Alexander Rakhlin
%E Ohad Shamir	
%F pmlr-v49-weed16
%I PMLR
%P 1562--1583
%U https://proceedings.mlr.press/v49/weed16.html
%V 49
%X Motivated by online advertising auctions, we consider repeated Vickrey auctions where goods of unknown value are sold sequentially and bidders only learn (potentially noisy) information about a good’s value once it is purchased. We adopt an online learning approach with bandit feedback to  model this problem and derive bidding strategies for two models: stochastic and adversarial. In the stochastic model, the observed values of the goods are  random variables centered around the true value of the good. In this case, logarithmic regret is achievable when competing against well behaved adversaries. In the adversarial model, the goods need not be identical. Comparing our performance against that of the best fixed bid in hindsight, we show that sublinear regret is also achievable in this case. For both the stochastic and adversarial models, we prove matching minimax lower bounds showing our strategies to be optimal up to lower-order terms. To our knowledge, this is the first complete set of strategies for bidders participating in auctions of this type.

RIS


TY  - CPAPER
TI  - Online learning in repeated auctions
AU  - Jonathan Weed
AU  - Vianney Perchet
AU  - Philippe Rigollet
BT  - 29th Annual Conference on Learning Theory
DA  - 2016/06/06
ED  - Vitaly Feldman
ED  - Alexander Rakhlin
ED  - Ohad Shamir	
ID  - pmlr-v49-weed16
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 49
SP  - 1562
EP  - 1583
L1  - http://proceedings.mlr.press/v49/weed16.pdf
UR  - https://proceedings.mlr.press/v49/weed16.html
AB  - Motivated by online advertising auctions, we consider repeated Vickrey auctions where goods of unknown value are sold sequentially and bidders only learn (potentially noisy) information about a good’s value once it is purchased. We adopt an online learning approach with bandit feedback to  model this problem and derive bidding strategies for two models: stochastic and adversarial. In the stochastic model, the observed values of the goods are  random variables centered around the true value of the good. In this case, logarithmic regret is achievable when competing against well behaved adversaries. In the adversarial model, the goods need not be identical. Comparing our performance against that of the best fixed bid in hindsight, we show that sublinear regret is also achievable in this case. For both the stochastic and adversarial models, we prove matching minimax lower bounds showing our strategies to be optimal up to lower-order terms. To our knowledge, this is the first complete set of strategies for bidders participating in auctions of this type.
ER  -

APA


Weed, J., Perchet, V. & Rigollet, P.. (2016). Online learning in repeated auctions. 29th Annual Conference on Learning Theory, in Proceedings of Machine Learning Research 49:1562-1583 Available from https://proceedings.mlr.press/v49/weed16.html.

Related Material

Download PDF