Nondeterminism and Instability in Neural Network Optimization

Cecilia Summers; Michael J. Dinneen

Proceedings of the 38th International Conference on Machine Learning, PMLR 139:9913-9922, 2021.

Abstract

Nondeterminism in neural network optimization produces uncertainty in performance, making small improvements difficult to discern from run-to-run variability. While uncertainty can be reduced by training multiple model copies, doing so is time-consuming, costly, and harms reproducibility. In this work, we establish an experimental protocol for understanding the effect of optimization nondeterminism on model diversity, allowing us to isolate the effects of a variety of sources of nondeterminism. Surprisingly, we find that all sources of nondeterminism have similar effects on measures of model diversity. To explain this intriguing fact, we identify the instability of model training, taken as an end-to-end procedure, as the key determinant. We show that even one-bit changes in initial parameters result in models converging to vastly different values. Last, we propose two approaches for reducing the effects of instability on run-to-run variability.

Cite this Paper

BibTeX

@InProceedings{pmlr-v139-summers21a,
  title = 	 {Nondeterminism and Instability in Neural Network Optimization},
  author =       {Summers, Cecilia and Dinneen, Michael J.},
  booktitle = 	 {Proceedings of the 38th International Conference on Machine Learning},
  pages = 	 {9913--9922},
  year = 	 {2021},
  editor = 	 {Meila, Marina and Zhang, Tong},
  volume = 	 {139},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {18--24 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v139/summers21a/summers21a.pdf},
  url = 	 {https://proceedings.mlr.press/v139/summers21a.html},
  abstract = 	 {Nondeterminism in neural network optimization produces uncertainty in performance, making small improvements difficult to discern from run-to-run variability. While uncertainty can be reduced by training multiple model copies, doing so is time-consuming, costly, and harms reproducibility. In this work, we establish an experimental protocol for understanding the effect of optimization nondeterminism on model diversity, allowing us to isolate the effects of a variety of sources of nondeterminism. Surprisingly, we find that all sources of nondeterminism have similar effects on measures of model diversity. To explain this intriguing fact, we identify the instability of model training, taken as an end-to-end procedure, as the key determinant. We show that even one-bit changes in initial parameters result in models converging to vastly different values. Last, we propose two approaches for reducing the effects of instability on run-to-run variability.}
}

Endnote

%0 Conference Paper
%T Nondeterminism and Instability in Neural Network Optimization
%A Cecilia Summers
%A Michael J. Dinneen
%B Proceedings of the 38th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2021
%E Marina Meila
%E Tong Zhang
%F pmlr-v139-summers21a
%I PMLR
%P 9913--9922
%U https://proceedings.mlr.press/v139/summers21a.html
%V 139
%X Nondeterminism in neural network optimization produces uncertainty in performance, making small improvements difficult to discern from run-to-run variability. While uncertainty can be reduced by training multiple model copies, doing so is time-consuming, costly, and harms reproducibility. In this work, we establish an experimental protocol for understanding the effect of optimization nondeterminism on model diversity, allowing us to isolate the effects of a variety of sources of nondeterminism. Surprisingly, we find that all sources of nondeterminism have similar effects on measures of model diversity. To explain this intriguing fact, we identify the instability of model training, taken as an end-to-end procedure, as the key determinant. We show that even one-bit changes in initial parameters result in models converging to vastly different values. Last, we propose two approaches for reducing the effects of instability on run-to-run variability.

APA

Summers, C. & Dinneen, M.J. (2021). Nondeterminism and Instability in Neural Network Optimization. Proceedings of the 38th International Conference on Machine Learning, in Proceedings of Machine Learning Research 139:9913-9922. Available from https://proceedings.mlr.press/v139/summers21a.html.
