Dropout Inference in Bayesian Neural Networks with Alpha-divergences

Yingzhen Li; Yarin Gal

Dropout Inference in Bayesian Neural Networks with Alpha-divergences

Yingzhen Li, Yarin Gal

Proceedings of the 34th International Conference on Machine Learning, PMLR 70:2052-2061, 2017.

Abstract

To obtain uncertainty estimates with real-world Bayesian deep learning models, practical inference approximations are needed. Dropout variational inference (VI) for example has been used for machine vision and medical applications, but VI can severely underestimates model uncertainty. Alpha-divergences are alternative divergences to VI’s KL objective, which are able to avoid VI’s uncertainty underestimation. But these are hard to use in practice: existing techniques can only use Gaussian approximating distributions, and require existing models to be changed radically, thus are of limited use for practitioners. We propose a re-parametrisation of the alpha-divergence objectives, deriving a simple inference technique which, together with dropout, can be easily implemented with existing models by simply changing the loss of the model. We demonstrate improved uncertainty estimates and accuracy compared to VI in dropout networks. We study our model’s epistemic uncertainty far away from the data using adversarial images, showing that these can be distinguished from non-adversarial images by examining our model’s uncertainty.

Cite this Paper

BibTeX


@InProceedings{pmlr-v70-li17a,
  title = 	 {Dropout Inference in {B}ayesian Neural Networks with Alpha-divergences},
  author =       {Yingzhen Li and Yarin Gal},
  booktitle = 	 {Proceedings of the 34th International Conference on Machine Learning},
  pages = 	 {2052--2061},
  year = 	 {2017},
  editor = 	 {Precup, Doina and Teh, Yee Whye},
  volume = 	 {70},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {06--11 Aug},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v70/li17a/li17a.pdf},
  url = 	 {https://proceedings.mlr.press/v70/li17a.html},
  abstract = 	 {To obtain uncertainty estimates with real-world Bayesian deep learning models, practical inference approximations are needed. Dropout variational inference (VI) for example has been used for machine vision and medical applications, but VI can severely underestimates model uncertainty. Alpha-divergences are alternative divergences to VI’s KL objective, which are able to avoid VI’s uncertainty underestimation. But these are hard to use in practice: existing techniques can only use Gaussian approximating distributions, and require existing models to be changed radically, thus are of limited use for practitioners. We propose a re-parametrisation of the alpha-divergence objectives, deriving a simple inference technique which, together with dropout, can be easily implemented with existing models by simply changing the loss of the model. We demonstrate improved uncertainty estimates and accuracy compared to VI in dropout networks. We study our model’s epistemic uncertainty far away from the data using adversarial images, showing that these can be distinguished from non-adversarial images by examining our model’s uncertainty.}
}

Endnote

%0 Conference Paper
%T Dropout Inference in Bayesian Neural Networks with Alpha-divergences
%A Yingzhen Li
%A Yarin Gal
%B Proceedings of the 34th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2017
%E Doina Precup
%E Yee Whye Teh	
%F pmlr-v70-li17a
%I PMLR
%P 2052--2061
%U https://proceedings.mlr.press/v70/li17a.html
%V 70
%X To obtain uncertainty estimates with real-world Bayesian deep learning models, practical inference approximations are needed. Dropout variational inference (VI) for example has been used for machine vision and medical applications, but VI can severely underestimates model uncertainty. Alpha-divergences are alternative divergences to VI’s KL objective, which are able to avoid VI’s uncertainty underestimation. But these are hard to use in practice: existing techniques can only use Gaussian approximating distributions, and require existing models to be changed radically, thus are of limited use for practitioners. We propose a re-parametrisation of the alpha-divergence objectives, deriving a simple inference technique which, together with dropout, can be easily implemented with existing models by simply changing the loss of the model. We demonstrate improved uncertainty estimates and accuracy compared to VI in dropout networks. We study our model’s epistemic uncertainty far away from the data using adversarial images, showing that these can be distinguished from non-adversarial images by examining our model’s uncertainty.

APA


Li, Y. & Gal, Y.. (2017). Dropout Inference in Bayesian Neural Networks with Alpha-divergences. Proceedings of the 34th International Conference on Machine Learning, in Proceedings of Machine Learning Research 70:2052-2061 Available from https://proceedings.mlr.press/v70/li17a.html.

Dropout Inference in Bayesian Neural Networks with Alpha-divergences

Abstract

Cite this Paper

Related Material