Hyperparameter Transfer Learning with Adaptive Complexity

Samuel Horváth; Aaron Klein; Peter Richtarik; Cedric Archambeau

Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, PMLR 130:1378-1386, 2021.

Abstract

Bayesian optimization (BO) is a data-efficient approach to automatically tune the hyperparameters of machine learning models. In practice, one frequently has to solve similar hyperparameter tuning problems sequentially. For example, one might have to tune a type of neural network learned across a series of different classification problems. Recent work on multi-task BO exploits knowledge gained from previous hyperparameter tuning tasks to speed up a new tuning task. However, previous approaches do not account for the fact that BO is a sequential decision-making procedure. Hence, there is in general a mismatch between the number of evaluations collected in the current tuning task and the number of evaluations accumulated in all previously completed tasks. In this work, we enable multi-task BO to compensate for this mismatch, such that the transfer learning procedure is able to handle different data regimes in a principled way. We propose a new multi-task BO method that learns a set of ordered, non-linear basis functions of increasing complexity via nested drop-out and automatic relevance determination. Experiments on a variety of hyperparameter tuning problems show that our method improves the sample efficiency of recently published multi-task BO methods.
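The abstract does not spell out how the ordered basis functions are obtained, so the following is only a minimal sketch of the general nested drop-out idea, not the authors' implementation. It assumes the common formulation in which a truncation index is drawn from a geometric distribution and every basis function after that index is zeroed, so earlier functions are trained more often than later ones. The one-layer tanh feature map, the function names, and the parameter p are hypothetical and chosen purely for illustration; automatic relevance determination is not shown.

```python
import numpy as np


def nested_dropout_mask(num_basis, rng, p=0.3):
    """Sample a nested drop-out mask: keep the first b units, drop the rest.

    b is drawn from a geometric distribution and clipped to num_basis,
    so low-index units are kept far more often than high-index units.
    """
    b = min(int(rng.geometric(p)), num_basis)
    mask = np.zeros(num_basis)
    mask[:b] = 1.0
    return mask


def basis_features(x, weights, biases):
    """Hypothetical one-layer tanh feature map: (n, d) -> (n, num_basis)."""
    return np.tanh(x @ weights + biases)


rng = np.random.default_rng(0)
num_basis = 8

# Toy inputs standing in for hyperparameter configurations (assumption).
x = rng.uniform(size=(5, 2))
weights = rng.normal(size=(2, num_basis))
biases = rng.normal(size=num_basis)

phi = basis_features(x, weights, biases)    # all basis functions
mask = nested_dropout_mask(num_basis, rng)  # prefix of ones, then zeros
phi_masked = phi * mask                     # only the first b functions survive
```

Because the mask is always a prefix of ones, truncating the feature matrix to its first k columns gives a smaller model that was itself seen during training, which is one way to match model complexity to the amount of data available.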

Cite this Paper

BibTeX


@InProceedings{pmlr-v130-horvath21a,
  title = 	 {Hyperparameter Transfer Learning with Adaptive Complexity},
  author =       {Horv{\'a}th, Samuel and Klein, Aaron and Richtarik, Peter and Archambeau, Cedric},
  booktitle = 	 {Proceedings of The 24th International Conference on Artificial Intelligence and Statistics},
  pages = 	 {1378--1386},
  year = 	 {2021},
  editor = 	 {Banerjee, Arindam and Fukumizu, Kenji},
  volume = 	 {130},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {13--15 Apr},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v130/horvath21a/horvath21a.pdf},
  url = 	 {https://proceedings.mlr.press/v130/horvath21a.html},
  abstract = 	 { Bayesian optimization (BO) is a data-efficient approach to automatically tune the hyperparameters of machine learning models. In practice, one frequently has to solve similar hyperparameter tuning problems sequentially. For example, one might have to tune a type of neural network learned across a series of different classification problems. Recent work on multi-task BO exploits knowledge gained from previous hyperparameter tuning tasks to speed up a new tuning task. However, previous approaches do not account for the fact that BO is a sequential decision making procedure. Hence, there is in general a mismatch between the number of evaluations collected in the current tuning task compared to the number of evaluations accumulated in all previously completed tasks. In this work, we enable multi-task BO to compensate for this mismatch, such that the transfer learning procedure is able to handle different data regimes in a principled way. We propose a new multi-task BO method that learns a set of ordered, non-linear basis functions of increasing complexity via nested drop-out and automatic relevance determination. Experiments on a variety of hyperparameter tuning problems show that our method improves the sample efficiency of recently published multi-task BO methods. }
}

Endnote

%0 Conference Paper
%T Hyperparameter Transfer Learning with Adaptive Complexity
%A Samuel Horváth
%A Aaron Klein
%A Peter Richtarik
%A Cedric Archambeau
%B Proceedings of The 24th International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2021
%E Arindam Banerjee
%E Kenji Fukumizu
%F pmlr-v130-horvath21a
%I PMLR
%P 1378--1386
%U https://proceedings.mlr.press/v130/horvath21a.html
%V 130
%X  Bayesian optimization (BO) is a data-efficient approach to automatically tune the hyperparameters of machine learning models. In practice, one frequently has to solve similar hyperparameter tuning problems sequentially. For example, one might have to tune a type of neural network learned across a series of different classification problems. Recent work on multi-task BO exploits knowledge gained from previous hyperparameter tuning tasks to speed up a new tuning task. However, previous approaches do not account for the fact that BO is a sequential decision making procedure. Hence, there is in general a mismatch between the number of evaluations collected in the current tuning task compared to the number of evaluations accumulated in all previously completed tasks. In this work, we enable multi-task BO to compensate for this mismatch, such that the transfer learning procedure is able to handle different data regimes in a principled way. We propose a new multi-task BO method that learns a set of ordered, non-linear basis functions of increasing complexity via nested drop-out and automatic relevance determination. Experiments on a variety of hyperparameter tuning problems show that our method improves the sample efficiency of recently published multi-task BO methods.

APA


Horváth, S., Klein, A., Richtarik, P. & Archambeau, C. (2021). Hyperparameter Transfer Learning with Adaptive Complexity. Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 130:1378-1386. Available from https://proceedings.mlr.press/v130/horvath21a.html.
