Causally Abstracted Multi-armed Bandits

Fabio Massimo Zennaro; Nicholas Bishop; Joel Dyer; Yorgos Felekis; Anisoara Calinescu; Michael Wooldridge; Theodoros Damoulas

Proceedings of the Fortieth Conference on Uncertainty in Artificial Intelligence, PMLR 244:4109-4139, 2024.

Abstract

Multi-armed bandits (MAB) and causal MABs (CMAB) are established frameworks for decision-making problems. The majority of prior work typically studies and solves individual MAB and CMAB in isolation for a given problem and associated data. However, decision-makers are often faced with multiple related problems and multi-scale observations where joint formulations are needed in order to efficiently exploit the problem structures and data dependencies. Transfer learning for CMABs addresses the situation where models are defined on identical variables, although causal connections may differ. In this work, we extend transfer learning to setups involving CMABs defined on potentially different variables, with varying degrees of granularity, and related via an abstraction map. Formally, we introduce the problem of causally abstracted MABs (CAMABs) by relying on the theory of causal abstraction in order to express a rigorous abstraction map. We propose algorithms to learn in a CAMAB, and study their regret. We illustrate the limitations and the strengths of our algorithms on a real-world scenario related to online advertising.

Cite this Paper

BibTeX

@InProceedings{pmlr-v244-zennaro24a,
  title = 	 {Causally Abstracted Multi-armed Bandits},
  author =       {Zennaro, Fabio Massimo and Bishop, Nicholas and Dyer, Joel and Felekis, Yorgos and Calinescu, Anisoara and Wooldridge, Michael and Damoulas, Theodoros},
  booktitle = 	 {Proceedings of the Fortieth Conference on Uncertainty in Artificial Intelligence},
  pages = 	 {4109--4139},
  year = 	 {2024},
  editor = 	 {Kiyavash, Negar and Mooij, Joris M.},
  volume = 	 {244},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {15--19 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v244/main/assets/zennaro24a/zennaro24a.pdf},
  url = 	 {https://proceedings.mlr.press/v244/zennaro24a.html},
  abstract = 	 {Multi-armed bandits (MAB) and causal MABs (CMAB) are established frameworks for decision-making problems. The majority of prior work typically studies and solves individual MAB and CMAB in isolation for a given problem and associated data. However, decision-makers are often faced with multiple related problems and multi-scale observations where joint formulations are needed in order to efficiently exploit the problem structures and data dependencies. Transfer learning for CMABs addresses the situation where models are defined on identical variables, although causal connections may differ. In this work, we extend transfer learning to setups involving CMABs defined on potentially different variables, with varying degrees of granularity, and related via an abstraction map. Formally, we introduce the problem of causally abstracted MABs (CAMABs) by relying on the theory of causal abstraction in order to express a rigorous abstraction map. We propose algorithms to learn in a CAMAB, and study their regret. We illustrate the limitations and the strengths of our algorithms on a real-world scenario related to online advertising.}
}

Endnote

%0 Conference Paper
%T Causally Abstracted Multi-armed Bandits
%A Fabio Massimo Zennaro
%A Nicholas Bishop
%A Joel Dyer
%A Yorgos Felekis
%A Anisoara Calinescu
%A Michael Wooldridge
%A Theodoros Damoulas
%B Proceedings of the Fortieth Conference on Uncertainty in Artificial Intelligence
%C Proceedings of Machine Learning Research
%D 2024
%E Negar Kiyavash
%E Joris M. Mooij
%F pmlr-v244-zennaro24a
%I PMLR
%P 4109--4139
%U https://proceedings.mlr.press/v244/zennaro24a.html
%V 244
%X Multi-armed bandits (MAB) and causal MABs (CMAB) are established frameworks for decision-making problems. The majority of prior work typically studies and solves individual MAB and CMAB in isolation for a given problem and associated data. However, decision-makers are often faced with multiple related problems and multi-scale observations where joint formulations are needed in order to efficiently exploit the problem structures and data dependencies. Transfer learning for CMABs addresses the situation where models are defined on identical variables, although causal connections may differ. In this work, we extend transfer learning to setups involving CMABs defined on potentially different variables, with varying degrees of granularity, and related via an abstraction map. Formally, we introduce the problem of causally abstracted MABs (CAMABs) by relying on the theory of causal abstraction in order to express a rigorous abstraction map. We propose algorithms to learn in a CAMAB, and study their regret. We illustrate the limitations and the strengths of our algorithms on a real-world scenario related to online advertising.

APA

Zennaro, F.M., Bishop, N., Dyer, J., Felekis, Y., Calinescu, A., Wooldridge, M. & Damoulas, T. (2024). Causally Abstracted Multi-armed Bandits. Proceedings of the Fortieth Conference on Uncertainty in Artificial Intelligence, in Proceedings of Machine Learning Research 244:4109-4139. Available from https://proceedings.mlr.press/v244/zennaro24a.html.
