Distributed Clustering of Linear Bandits in Peer to Peer Networks

Nathan Korda; Balazs Szorenyi; Shuai Li

Distributed Clustering of Linear Bandits in Peer to Peer Networks

Nathan Korda, Balazs Szorenyi, Shuai Li

Proceedings of The 33rd International Conference on Machine Learning, PMLR 48:1301-1309, 2016.

Abstract

We provide two distributed confidence ball algorithms for solving linear bandit problems in peer to peer networks with limited communication capabilities. For the first, we assume that all the peers are solving the same linear bandit problem, and prove that our algorithm achieves the optimal asymptotic regret rate of any centralised algorithm that can instantly communicate information between the peers. For the second, we assume that there are clusters of peers solving the same bandit problem within each cluster, and we prove that our algorithm discovers these clusters, while achieving the optimal asymptotic regret rate within each one. Through experiments on several real-world datasets, we demonstrate the performance of proposed algorithms compared to the state-of-the-art.

Cite this Paper

BibTeX


@InProceedings{pmlr-v48-korda16,
  title = 	 {Distributed Clustering of Linear Bandits in Peer to Peer Networks},
  author = 	 {Korda, Nathan and Szorenyi, Balazs and Li, Shuai},
  booktitle = 	 {Proceedings of The 33rd International Conference on Machine Learning},
  pages = 	 {1301--1309},
  year = 	 {2016},
  editor = 	 {Balcan, Maria Florina and Weinberger, Kilian Q.},
  volume = 	 {48},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {New York, New York, USA},
  month = 	 {20--22 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v48/korda16.pdf},
  url = 	 {https://proceedings.mlr.press/v48/korda16.html},
  abstract = 	 {We provide two distributed confidence ball algorithms for solving linear bandit problems in peer to peer networks with limited communication capabilities. For the first, we assume that all the peers are solving the same linear bandit problem, and prove that our algorithm achieves the optimal asymptotic regret rate of any centralised algorithm that can instantly communicate information between the peers. For the second, we assume that there are clusters of peers solving the same bandit problem within each cluster, and we prove that our algorithm discovers these clusters, while achieving the optimal asymptotic regret rate within each one. Through experiments on several real-world datasets, we demonstrate the performance of proposed algorithms compared to the state-of-the-art.}
}

Endnote

%0 Conference Paper
%T Distributed Clustering of Linear Bandits in Peer to Peer Networks
%A Nathan Korda
%A Balazs Szorenyi
%A Shuai Li
%B Proceedings of The 33rd International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2016
%E Maria Florina Balcan
%E Kilian Q. Weinberger	
%F pmlr-v48-korda16
%I PMLR
%P 1301--1309
%U https://proceedings.mlr.press/v48/korda16.html
%V 48
%X We provide two distributed confidence ball algorithms for solving linear bandit problems in peer to peer networks with limited communication capabilities. For the first, we assume that all the peers are solving the same linear bandit problem, and prove that our algorithm achieves the optimal asymptotic regret rate of any centralised algorithm that can instantly communicate information between the peers. For the second, we assume that there are clusters of peers solving the same bandit problem within each cluster, and we prove that our algorithm discovers these clusters, while achieving the optimal asymptotic regret rate within each one. Through experiments on several real-world datasets, we demonstrate the performance of proposed algorithms compared to the state-of-the-art.

RIS


TY  - CPAPER
TI  - Distributed Clustering of Linear Bandits in Peer to Peer Networks
AU  - Nathan Korda
AU  - Balazs Szorenyi
AU  - Shuai Li
BT  - Proceedings of The 33rd International Conference on Machine Learning
DA  - 2016/06/11
ED  - Maria Florina Balcan
ED  - Kilian Q. Weinberger	
ID  - pmlr-v48-korda16
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 48
SP  - 1301
EP  - 1309
L1  - http://proceedings.mlr.press/v48/korda16.pdf
UR  - https://proceedings.mlr.press/v48/korda16.html
AB  - We provide two distributed confidence ball algorithms for solving linear bandit problems in peer to peer networks with limited communication capabilities. For the first, we assume that all the peers are solving the same linear bandit problem, and prove that our algorithm achieves the optimal asymptotic regret rate of any centralised algorithm that can instantly communicate information between the peers. For the second, we assume that there are clusters of peers solving the same bandit problem within each cluster, and we prove that our algorithm discovers these clusters, while achieving the optimal asymptotic regret rate within each one. Through experiments on several real-world datasets, we demonstrate the performance of proposed algorithms compared to the state-of-the-art.
ER  -

APA


Korda, N., Szorenyi, B. & Li, S.. (2016). Distributed Clustering of Linear Bandits in Peer to Peer Networks. Proceedings of The 33rd International Conference on Machine Learning, in Proceedings of Machine Learning Research 48:1301-1309 Available from https://proceedings.mlr.press/v48/korda16.html.

Distributed Clustering of Linear Bandits in Peer to Peer Networks

Abstract

Cite this Paper

Related Material