Trading-Off Payments and Accuracy in Online Classification with Paid Stochastic Experts

Dirk Van Der Hoeven, Ciara Pike-Burke, Hao Qiu, Nicolò Cesa-Bianchi
Proceedings of the 40th International Conference on Machine Learning, PMLR 202:34809-34830, 2023.

Abstract

We investigate online classification with paid stochastic experts. Here, before making their prediction, each expert must be paid. The amount that we pay each expert directly influences the accuracy of their prediction through some unknown Lipschitz “productivity” function. In each round, the learner must decide how much to pay each expert and then make a prediction. They incur a cost equal to a weighted sum of the prediction error and upfront payments for all experts. We introduce an online learning algorithm whose total cost after $T$ rounds exceeds that of a predictor which knows the productivity of all experts in advance by at most $\mathcal{O}\big(K^2(\ln T)\sqrt{T}\big)$ where $K$ is the number of experts. In order to achieve this result, we combine Lipschitz bandits and online classification with surrogate losses. These tools allow us to improve upon the bound of order $T^{2/3}$ one would obtain in the standard Lipschitz bandit setting. Our algorithm is empirically evaluated on synthetic data.

Cite this Paper


BibTeX
@InProceedings{pmlr-v202-van-der-hoeven23a, title = {Trading-Off Payments and Accuracy in Online Classification with Paid Stochastic Experts}, author = {Van Der Hoeven, Dirk and Pike-Burke, Ciara and Qiu, Hao and Cesa-Bianchi, Nicol\`{o}}, booktitle = {Proceedings of the 40th International Conference on Machine Learning}, pages = {34809--34830}, year = {2023}, editor = {Krause, Andreas and Brunskill, Emma and Cho, Kyunghyun and Engelhardt, Barbara and Sabato, Sivan and Scarlett, Jonathan}, volume = {202}, series = {Proceedings of Machine Learning Research}, month = {23--29 Jul}, publisher = {PMLR}, pdf = {https://proceedings.mlr.press/v202/van-der-hoeven23a/van-der-hoeven23a.pdf}, url = {https://proceedings.mlr.press/v202/van-der-hoeven23a.html}, abstract = {We investigate online classification with paid stochastic experts. Here, before making their prediction, each expert must be paid. The amount that we pay each expert directly influences the accuracy of their prediction through some unknown Lipschitz “productivity” function. In each round, the learner must decide how much to pay each expert and then make a prediction. They incur a cost equal to a weighted sum of the prediction error and upfront payments for all experts. We introduce an online learning algorithm whose total cost after $T$ rounds exceeds that of a predictor which knows the productivity of all experts in advance by at most $\mathcal{O}\big(K^2(\ln T)\sqrt{T}\big)$ where $K$ is the number of experts. In order to achieve this result, we combine Lipschitz bandits and online classification with surrogate losses. These tools allow us to improve upon the bound of order $T^{2/3}$ one would obtain in the standard Lipschitz bandit setting. Our algorithm is empirically evaluated on synthetic data.} }
Endnote
%0 Conference Paper %T Trading-Off Payments and Accuracy in Online Classification with Paid Stochastic Experts %A Dirk Van Der Hoeven %A Ciara Pike-Burke %A Hao Qiu %A Nicolò Cesa-Bianchi %B Proceedings of the 40th International Conference on Machine Learning %C Proceedings of Machine Learning Research %D 2023 %E Andreas Krause %E Emma Brunskill %E Kyunghyun Cho %E Barbara Engelhardt %E Sivan Sabato %E Jonathan Scarlett %F pmlr-v202-van-der-hoeven23a %I PMLR %P 34809--34830 %U https://proceedings.mlr.press/v202/van-der-hoeven23a.html %V 202 %X We investigate online classification with paid stochastic experts. Here, before making their prediction, each expert must be paid. The amount that we pay each expert directly influences the accuracy of their prediction through some unknown Lipschitz “productivity” function. In each round, the learner must decide how much to pay each expert and then make a prediction. They incur a cost equal to a weighted sum of the prediction error and upfront payments for all experts. We introduce an online learning algorithm whose total cost after $T$ rounds exceeds that of a predictor which knows the productivity of all experts in advance by at most $\mathcal{O}\big(K^2(\ln T)\sqrt{T}\big)$ where $K$ is the number of experts. In order to achieve this result, we combine Lipschitz bandits and online classification with surrogate losses. These tools allow us to improve upon the bound of order $T^{2/3}$ one would obtain in the standard Lipschitz bandit setting. Our algorithm is empirically evaluated on synthetic data.
APA
Van Der Hoeven, D., Pike-Burke, C., Qiu, H. & Cesa-Bianchi, N.. (2023). Trading-Off Payments and Accuracy in Online Classification with Paid Stochastic Experts. Proceedings of the 40th International Conference on Machine Learning, in Proceedings of Machine Learning Research 202:34809-34830 Available from https://proceedings.mlr.press/v202/van-der-hoeven23a.html.

Related Material