Adaptive Batch Sizes for Active Learning: A Probabilistic Numerics Approach

Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Xingchen Wan, Vu Nguyen, Harald Oberhauser, Michael A. Osborne
Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, PMLR 238:496-504, 2024.

Abstract

Active learning parallelization is widely used, but typically relies on fixing the batch size throughout experimentation. This fixed approach is inefficient because of a dynamic trade-off between cost and speed—larger batches are more costly, smaller batches lead to slower wall-clock run-times—and the trade-off may change over the run (larger batches are often preferable earlier). To address this trade-off, we propose a novel Probabilistic Numerics framework that adaptively changes batch sizes. By framing batch selection as a quadrature task, our integration-error-aware algorithm facilitates the automatic tuning of batch sizes to meet predefined quadrature precision objectives, akin to how typical optimizers terminate based on convergence thresholds. This approach obviates the necessity for exhaustive searches across all potential batch sizes. We also extend this to scenarios with constrained active learning and constrained optimization, interpreting constraint violations as reductions in the precision requirement, to subsequently adapt batch construction. Through extensive experiments, we demonstrate that our approach significantly enhances learning efficiency and flexibility in diverse Bayesian batch active learning and Bayesian optimization applications.
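
To make the core idea concrete, below is a minimal, hypothetical sketch (NumPy only; not the authors' code or the paper's exact algorithm) of integration-error-aware batch construction: a batch is grown greedily under a Gaussian-process surrogate and stops as soon as the estimated quadrature variance falls below a user-set precision, so the batch size adapts instead of being fixed. The RBF kernel, the use of the empirical candidate measure as the integration measure, the greedy variance-minimizing selection rule, and all names and parameters (`adaptive_batch`, `precision`, `max_batch`) are illustrative assumptions.

```python
# Hedged sketch: adaptive batch sizing via a quadrature precision threshold.
# Assumptions: RBF kernel, empirical candidate measure, greedy selection.
import numpy as np


def rbf(a, b, ls=0.3):
    """Squared-exponential kernel matrix between rows of a and b."""
    d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / ls ** 2)


def adaptive_batch(candidates, precision=1e-2, max_batch=20, jitter=1e-8):
    """Greedily grow a batch until the posterior variance of the integral
    estimate (quadrature error proxy under a GP surrogate, with the
    integration measure approximated by the candidate pool) meets `precision`.
    Returns the selected candidate indices and the achieved variance."""
    M = len(candidates)
    K_cand = rbf(candidates, candidates)
    z = K_cand.mean(axis=1)   # empirical kernel-mean embedding at each candidate
    c = K_cand.mean()         # prior variance of the integral estimate
    selected, var = [], c
    while var > precision and len(selected) < max_batch:
        best_idx, best_var = None, np.inf
        for j in range(M):
            if j in selected:
                continue
            S = selected + [j]
            K_SS = K_cand[np.ix_(S, S)] + jitter * np.eye(len(S))
            z_S = z[S]
            # Posterior variance of the integral after observing points S.
            v = c - z_S @ np.linalg.solve(K_SS, z_S)
            if v < best_var:
                best_idx, best_var = j, v
        selected.append(best_idx)
        var = best_var
    return selected, var


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    pool = rng.uniform(0, 1, size=(200, 2))   # candidate pool in [0, 1]^2
    batch, err = adaptive_batch(pool, precision=1e-2)
    print(f"batch size {len(batch)}, estimated quadrature variance {err:.2e}")
```

The stopping rule plays the role described in the abstract: rather than searching over all possible batch sizes, the batch simply terminates once the precision objective is met, analogous to an optimizer halting at a convergence threshold.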

Cite this Paper


BibTeX
@InProceedings{pmlr-v238-adachi24b,
  title     = {Adaptive Batch Sizes for Active Learning: A Probabilistic Numerics Approach},
  author    = {Adachi, Masaki and Hayakawa, Satoshi and J\o{}rgensen, Martin and Wan, Xingchen and Nguyen, Vu and Oberhauser, Harald and Osborne, Michael A.},
  booktitle = {Proceedings of The 27th International Conference on Artificial Intelligence and Statistics},
  pages     = {496--504},
  year      = {2024},
  editor    = {Dasgupta, Sanjoy and Mandt, Stephan and Li, Yingzhen},
  volume    = {238},
  series    = {Proceedings of Machine Learning Research},
  month     = {02--04 May},
  publisher = {PMLR},
  pdf       = {https://proceedings.mlr.press/v238/adachi24b/adachi24b.pdf},
  url       = {https://proceedings.mlr.press/v238/adachi24b.html},
  abstract  = {Active learning parallelization is widely used, but typically relies on fixing the batch size throughout experimentation. This fixed approach is inefficient because of a dynamic trade-off between cost and speed—larger batches are more costly, smaller batches lead to slower wall-clock run-times—and the trade-off may change over the run (larger batches are often preferable earlier). To address this trade-off, we propose a novel Probabilistic Numerics framework that adaptively changes batch sizes. By framing batch selection as a quadrature task, our integration-error-aware algorithm facilitates the automatic tuning of batch sizes to meet predefined quadrature precision objectives, akin to how typical optimizers terminate based on convergence thresholds. This approach obviates the necessity for exhaustive searches across all potential batch sizes. We also extend this to scenarios with constrained active learning and constrained optimization, interpreting constraint violations as reductions in the precision requirement, to subsequently adapt batch construction. Through extensive experiments, we demonstrate that our approach significantly enhances learning efficiency and flexibility in diverse Bayesian batch active learning and Bayesian optimization applications.}
}
Endnote
%0 Conference Paper
%T Adaptive Batch Sizes for Active Learning: A Probabilistic Numerics Approach
%A Masaki Adachi
%A Satoshi Hayakawa
%A Martin Jørgensen
%A Xingchen Wan
%A Vu Nguyen
%A Harald Oberhauser
%A Michael A. Osborne
%B Proceedings of The 27th International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2024
%E Sanjoy Dasgupta
%E Stephan Mandt
%E Yingzhen Li
%F pmlr-v238-adachi24b
%I PMLR
%P 496--504
%U https://proceedings.mlr.press/v238/adachi24b.html
%V 238
%X Active learning parallelization is widely used, but typically relies on fixing the batch size throughout experimentation. This fixed approach is inefficient because of a dynamic trade-off between cost and speed—larger batches are more costly, smaller batches lead to slower wall-clock run-times—and the trade-off may change over the run (larger batches are often preferable earlier). To address this trade-off, we propose a novel Probabilistic Numerics framework that adaptively changes batch sizes. By framing batch selection as a quadrature task, our integration-error-aware algorithm facilitates the automatic tuning of batch sizes to meet predefined quadrature precision objectives, akin to how typical optimizers terminate based on convergence thresholds. This approach obviates the necessity for exhaustive searches across all potential batch sizes. We also extend this to scenarios with constrained active learning and constrained optimization, interpreting constraint violations as reductions in the precision requirement, to subsequently adapt batch construction. Through extensive experiments, we demonstrate that our approach significantly enhances learning efficiency and flexibility in diverse Bayesian batch active learning and Bayesian optimization applications.
APA
Adachi, M., Hayakawa, S., Jørgensen, M., Wan, X., Nguyen, V., Oberhauser, H. & Osborne, M. A. (2024). Adaptive Batch Sizes for Active Learning: A Probabilistic Numerics Approach. Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 238:496-504. Available from https://proceedings.mlr.press/v238/adachi24b.html.
