Linear Contextual Bandits With Interference

Yang Xu, Wenbin Lu, Rui Song
Proceedings of the 42nd International Conference on Machine Learning, PMLR 267:69545-69582, 2025.

Abstract

Interference, a key concept in causal inference, extends the reward modeling process by accounting for the impact of one unit’s actions on the rewards of others. In contextual bandit (CB) settings where multiple units are present in the same round, interference can significantly affect the estimation of expected rewards for different arms, thereby influencing the decision-making process. Although some prior work has explored multi-agent and adversarial bandits in interference-aware settings, how to model interference in CB remains significantly underexplored. In this paper, we introduce a systematic framework to address interference in Linear CB (LinCB), bridging the gap between causal inference and online decision-making. We propose a series of algorithms that explicitly quantify the interference effect in the reward modeling process and provide comprehensive theoretical guarantees, including sublinear regret bounds, finite sample upper bounds, and asymptotic properties. The effectiveness of our approach is demonstrated through simulations and synthetic data generated based on the MovieLens dataset.
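
To make the idea of interference in a linear reward model concrete, the sketch below shows one plausible construction, not the paper's actual algorithm: a LinUCB-style ridge-regression learner run on a feature vector that augments each unit's own context-action features with a mean-field summary of the other units' arm choices. The names (augment_features, InterferenceLinUCB) and the specific augmentation are illustrative assumptions.

```python
import numpy as np

def augment_features(x_i, a_i, a_others, n_arms):
    """Hypothetical feature map: the unit's own context-action block plus an
    interaction between its context and the empirical arm frequencies of the
    other units (one assumed way to encode interference)."""
    d = len(x_i)
    own = np.zeros(d * n_arms)
    own[a_i * d:(a_i + 1) * d] = x_i                     # own context-action block
    freq = np.bincount(np.asarray(a_others, dtype=int),  # others' arm frequencies
                       minlength=n_arms) / max(len(a_others), 1)
    return np.concatenate([own, np.kron(freq, x_i)])     # context x others'-action interaction

class InterferenceLinUCB:
    """LinUCB-style learner over interference-augmented features; a minimal
    sketch under the assumptions above, not the paper's proposed method."""
    def __init__(self, dim, alpha=1.0, lam=1.0):
        self.A = lam * np.eye(dim)   # regularized Gram matrix
        self.b = np.zeros(dim)       # accumulated reward-weighted features
        self.alpha = alpha           # exploration bonus scale

    def ucb(self, phi):
        A_inv = np.linalg.inv(self.A)
        theta = A_inv @ self.b
        return phi @ theta + self.alpha * np.sqrt(phi @ A_inv @ phi)

    def update(self, phi, reward):
        self.A += np.outer(phi, phi)
        self.b += reward * phi
```

In this toy setup, each unit would score candidate arms given a working guess of the other units' actions and update on the realized augmented feature and reward; the paper's algorithms address this joint decision problem and its regret and asymptotic guarantees rigorously, which the sketch does not attempt.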

Cite this Paper


BibTeX
@InProceedings{pmlr-v267-xu25v,
  title     = {Linear Contextual Bandits With Interference},
  author    = {Xu, Yang and Lu, Wenbin and Song, Rui},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  pages     = {69545--69582},
  year      = {2025},
  editor    = {Singh, Aarti and Fazel, Maryam and Hsu, Daniel and Lacoste-Julien, Simon and Berkenkamp, Felix and Maharaj, Tegan and Wagstaff, Kiri and Zhu, Jerry},
  volume    = {267},
  series    = {Proceedings of Machine Learning Research},
  month     = {13--19 Jul},
  publisher = {PMLR},
  pdf       = {https://raw.githubusercontent.com/mlresearch/v267/main/assets/xu25v/xu25v.pdf},
  url       = {https://proceedings.mlr.press/v267/xu25v.html},
  abstract  = {Interference, a key concept in causal inference, extends the reward modeling process by accounting for the impact of one unit’s actions on the rewards of others. In contextual bandit (CB) settings where multiple units are present in the same round, interference can significantly affect the estimation of expected rewards for different arms, thereby influencing the decision-making process. Although some prior work has explored multi-agent and adversarial bandits in interference-aware settings, how to model interference in CB remains significantly underexplored. In this paper, we introduce a systematic framework to address interference in Linear CB (LinCB), bridging the gap between causal inference and online decision-making. We propose a series of algorithms that explicitly quantify the interference effect in the reward modeling process and provide comprehensive theoretical guarantees, including sublinear regret bounds, finite sample upper bounds, and asymptotic properties. The effectiveness of our approach is demonstrated through simulations and synthetic data generated based on the MovieLens dataset.}
}
Endnote
%0 Conference Paper
%T Linear Contextual Bandits With Interference
%A Yang Xu
%A Wenbin Lu
%A Rui Song
%B Proceedings of the 42nd International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2025
%E Aarti Singh
%E Maryam Fazel
%E Daniel Hsu
%E Simon Lacoste-Julien
%E Felix Berkenkamp
%E Tegan Maharaj
%E Kiri Wagstaff
%E Jerry Zhu
%F pmlr-v267-xu25v
%I PMLR
%P 69545--69582
%U https://proceedings.mlr.press/v267/xu25v.html
%V 267
%X Interference, a key concept in causal inference, extends the reward modeling process by accounting for the impact of one unit’s actions on the rewards of others. In contextual bandit (CB) settings where multiple units are present in the same round, interference can significantly affect the estimation of expected rewards for different arms, thereby influencing the decision-making process. Although some prior work has explored multi-agent and adversarial bandits in interference-aware settings, how to model interference in CB remains significantly underexplored. In this paper, we introduce a systematic framework to address interference in Linear CB (LinCB), bridging the gap between causal inference and online decision-making. We propose a series of algorithms that explicitly quantify the interference effect in the reward modeling process and provide comprehensive theoretical guarantees, including sublinear regret bounds, finite sample upper bounds, and asymptotic properties. The effectiveness of our approach is demonstrated through simulations and synthetic data generated based on the MovieLens dataset.
APA
Xu, Y., Lu, W. & Song, R. (2025). Linear Contextual Bandits With Interference. Proceedings of the 42nd International Conference on Machine Learning, in Proceedings of Machine Learning Research 267:69545-69582. Available from https://proceedings.mlr.press/v267/xu25v.html.