CATBench: A Compiler Autotuning Benchmarking Suite for Black-box Optimization

Jacob O Tørring, Carl Hvarfner, Luigi Nardi, Magnus Själander
Proceedings of the Fourth International Conference on Automated Machine Learning, PMLR 293:24/1-20, 2025.

Abstract

Bayesian optimization is a powerful method for automating the tuning of compilers. The complex landscape of autotuning provides a myriad of rarely considered structural challenges for black-box optimizers, and the lack of standardized benchmarks has limited the study of Bayesian optimization within the domain. To address this, we present CATBench, a comprehensive benchmarking suite that captures the complexities of compiler autotuning, ranging from discrete, conditional, and permutation parameter types to known and unknown binary constraints, as well as both multi-fidelity and multi-objective evaluations. The benchmarks in CATBench span a range of machine learning-oriented computations, from tensor algebra to image processing and clustering, and use state-of-the-art compilers, such as TACO and RISE/ELEVATE. CATBench offers a unified interface for evaluating Bayesian optimization algorithms, promoting reproducibility and innovation through an easy-to-use, fully containerized setup of both surrogate and real-world compiler optimization tasks. We validate CATBench on several state-of-the-art algorithms, revealing their strengths and weaknesses and demonstrating the suite’s potential for advancing both Bayesian optimization and compiler autotuning research.
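To make the structural challenges named in the abstract concrete, the following self-contained Python sketch shows what a black-box evaluation loop over a compiler-autotuning search space with discrete, conditional, and permutation parameters and an unknown binary constraint might look like. It is purely illustrative: the parameter names, toy objective, and random-search driver are assumptions for exposition, not CATBench's documented API.

import random
import itertools

# Illustrative stand-in for a CATBench-style search space (hypothetical,
# not CATBench's actual interface): discrete, conditional, and
# permutation parameters over compiler scheduling choices.
SPACE = {
    "tile_size":  [8, 16, 32, 64],                       # discrete
    "vectorize":  [False, True],                         # conditional parent
    "unroll":     [1, 2, 4, 8],                          # active only if vectorize
    "loop_order": list(itertools.permutations("ijk")),   # permutation
}

def sample_config(rng):
    cfg = {name: rng.choice(values) for name, values in SPACE.items()}
    if not cfg["vectorize"]:
        cfg["unroll"] = 1  # conditional parameter collapses when inactive
    return cfg

def query(cfg):
    """Toy surrogate objective: runtime in ms, or None when a hidden
    (unknown) binary constraint makes the configuration fail."""
    if cfg["tile_size"] == 64 and cfg["vectorize"]:
        return None  # e.g. the kernel fails to compile for this combination
    base = 100.0 / cfg["tile_size"] + 5.0 / cfg["unroll"]
    penalty = 2.0 * cfg["loop_order"].index("k")  # order-sensitive cost
    return base + penalty

rng = random.Random(0)
best = (float("inf"), None)
for _ in range(50):  # random-search baseline over the black box
    cfg = sample_config(rng)
    runtime = query(cfg)
    if runtime is not None and runtime < best[0]:
        best = (runtime, cfg)
print(f"best runtime {best[0]:.2f} ms with {best[1]}")

A Bayesian optimizer would replace the random sampler with a model-guided suggestion step; the None returns correspond to the unknown binary constraints the suite is designed to exercise.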

Cite this Paper

BibTeX
@InProceedings{pmlr-v293-torring25a,
  title     = {CATBench: A Compiler Autotuning Benchmarking Suite for Black-box Optimization},
  author    = {T{\o}rring, Jacob O and Hvarfner, Carl and Nardi, Luigi and Sj\"alander, Magnus},
  booktitle = {Proceedings of the Fourth International Conference on Automated Machine Learning},
  pages     = {24/1--20},
  year      = {2025},
  editor    = {Akoglu, Leman and Doerr, Carola and van Rijn, Jan N. and Garnett, Roman and Gardner, Jacob R.},
  volume    = {293},
  series    = {Proceedings of Machine Learning Research},
  month     = {08--11 Sep},
  publisher = {PMLR},
  pdf       = {https://raw.githubusercontent.com/mlresearch/v293/main/assets/torring25a/torring25a.pdf},
  url       = {https://proceedings.mlr.press/v293/torring25a.html}
}
Endnote
%0 Conference Paper
%T CATBench: A Compiler Autotuning Benchmarking Suite for Black-box Optimization
%A Jacob O Tørring
%A Carl Hvarfner
%A Luigi Nardi
%A Magnus Själander
%B Proceedings of the Fourth International Conference on Automated Machine Learning
%C Proceedings of Machine Learning Research
%D 2025
%E Leman Akoglu
%E Carola Doerr
%E Jan N. van Rijn
%E Roman Garnett
%E Jacob R. Gardner
%F pmlr-v293-torring25a
%I PMLR
%P 24/1--20
%U https://proceedings.mlr.press/v293/torring25a.html
%V 293
APA
Tørring, J.O., Hvarfner, C., Nardi, L. & Själander, M. (2025). CATBench: A Compiler Autotuning Benchmarking Suite for Black-box Optimization. Proceedings of the Fourth International Conference on Automated Machine Learning, in Proceedings of Machine Learning Research 293:24/1-20. Available from https://proceedings.mlr.press/v293/torring25a.html.
