Neural Architecture Search without Training

Joe Mellor, Jack Turner, Amos Storkey, Elliot J Crowley
Proceedings of the 38th International Conference on Machine Learning, PMLR 139:7588-7598, 2021.

Abstract

The time and effort involved in hand-designing deep neural networks is immense. This has prompted the development of Neural Architecture Search (NAS) techniques to automate this design. However, NAS algorithms tend to be slow and expensive; they need to train vast numbers of candidate networks to inform the search process. This could be alleviated if we could partially predict a network’s trained accuracy from its initial state. In this work, we examine the overlap of activations between datapoints in untrained networks and motivate how this can give a measure which is usefully indicative of a network’s trained performance. We incorporate this measure into a simple algorithm that allows us to search for powerful networks without any training in a matter of seconds on a single GPU, and verify its effectiveness on NAS-Bench-101, NAS-Bench-201, NATS-Bench, and Network Design Spaces. Our approach can be readily combined with more expensive search methods; we examine a simple adaptation of regularised evolutionary search. Code for reproducing our experiments is available at https://github.com/BayesWatch/nas-without-training.
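The abstract only sketches the idea; the released code at the repository above is the reference implementation. As an illustration only, the following is a minimal, hypothetical Python/PyTorch sketch of one way an activation-overlap score for an untrained network could be computed and used to rank candidates: binarise the ReLU activation pattern of each datapoint in a small minibatch, build a kernel counting how often pairs of datapoints agree, and score the network by that kernel's log-determinant (higher meaning less overlap). The toy MLP candidates, batch size, and layer widths are assumptions made for the example, not anything specified by the paper.

import torch
import torch.nn as nn

def activation_overlap_score(net, x):
    """Score an untrained network from the binary codes of its ReLU activations."""
    codes = []

    def hook(module, inputs, output):
        # Binary code per datapoint: which units fire (1) or stay inactive (0).
        codes.append((output.flatten(1) > 0).float())

    handles = [m.register_forward_hook(hook)
               for m in net.modules() if isinstance(m, nn.ReLU)]
    with torch.no_grad():
        net(x)
    for h in handles:
        h.remove()

    c = torch.cat(codes, dim=1)                   # (batch, total ReLU units)
    # Kernel of agreements: shared active units plus shared inactive units.
    k = c @ c.t() + (1.0 - c) @ (1.0 - c).t()
    # Log-determinant of the kernel; higher suggests less activation overlap.
    return torch.linalg.slogdet(k)[1].item()

# Toy usage: score a few randomly initialised candidates, no training involved,
# and keep the highest-scoring one.
x = torch.randn(16, 16)                           # small random minibatch (assumed size)
candidates = [
    nn.Sequential(nn.Linear(16, w), nn.ReLU(),
                  nn.Linear(w, w), nn.ReLU(),
                  nn.Linear(w, 10))
    for w in (32, 64, 128)                        # illustrative widths, not from the paper
]
scores = [activation_overlap_score(net, x) for net in candidates]
best = max(range(len(candidates)), key=lambda i: scores[i])
print("scores:", scores, "best candidate index:", best)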

Cite this Paper


BibTeX
@InProceedings{pmlr-v139-mellor21a,
  title     = {Neural Architecture Search without Training},
  author    = {Mellor, Joe and Turner, Jack and Storkey, Amos and Crowley, Elliot J},
  booktitle = {Proceedings of the 38th International Conference on Machine Learning},
  pages     = {7588--7598},
  year      = {2021},
  editor    = {Meila, Marina and Zhang, Tong},
  volume    = {139},
  series    = {Proceedings of Machine Learning Research},
  month     = {18--24 Jul},
  publisher = {PMLR},
  pdf       = {http://proceedings.mlr.press/v139/mellor21a/mellor21a.pdf},
  url       = {https://proceedings.mlr.press/v139/mellor21a.html},
  abstract  = {The time and effort involved in hand-designing deep neural networks is immense. This has prompted the development of Neural Architecture Search (NAS) techniques to automate this design. However, NAS algorithms tend to be slow and expensive; they need to train vast numbers of candidate networks to inform the search process. This could be alleviated if we could partially predict a network’s trained accuracy from its initial state. In this work, we examine the overlap of activations between datapoints in untrained networks and motivate how this can give a measure which is usefully indicative of a network’s trained performance. We incorporate this measure into a simple algorithm that allows us to search for powerful networks without any training in a matter of seconds on a single GPU, and verify its effectiveness on NAS-Bench-101, NAS-Bench-201, NATS-Bench, and Network Design Spaces. Our approach can be readily combined with more expensive search methods; we examine a simple adaptation of regularised evolutionary search. Code for reproducing our experiments is available at https://github.com/BayesWatch/nas-without-training.}
}
Endnote
%0 Conference Paper
%T Neural Architecture Search without Training
%A Joe Mellor
%A Jack Turner
%A Amos Storkey
%A Elliot J Crowley
%B Proceedings of the 38th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2021
%E Marina Meila
%E Tong Zhang
%F pmlr-v139-mellor21a
%I PMLR
%P 7588--7598
%U https://proceedings.mlr.press/v139/mellor21a.html
%V 139
%X The time and effort involved in hand-designing deep neural networks is immense. This has prompted the development of Neural Architecture Search (NAS) techniques to automate this design. However, NAS algorithms tend to be slow and expensive; they need to train vast numbers of candidate networks to inform the search process. This could be alleviated if we could partially predict a network’s trained accuracy from its initial state. In this work, we examine the overlap of activations between datapoints in untrained networks and motivate how this can give a measure which is usefully indicative of a network’s trained performance. We incorporate this measure into a simple algorithm that allows us to search for powerful networks without any training in a matter of seconds on a single GPU, and verify its effectiveness on NAS-Bench-101, NAS-Bench-201, NATS-Bench, and Network Design Spaces. Our approach can be readily combined with more expensive search methods; we examine a simple adaptation of regularised evolutionary search. Code for reproducing our experiments is available at https://github.com/BayesWatch/nas-without-training.
APA
Mellor, J., Turner, J., Storkey, A. & Crowley, E.J. (2021). Neural Architecture Search without Training. Proceedings of the 38th International Conference on Machine Learning, in Proceedings of Machine Learning Research 139:7588-7598. Available from https://proceedings.mlr.press/v139/mellor21a.html.
