Amortized variance reduction for doubly stochastic objective

Ayman Boustati; Sattar Vakili; James Hensman; ST John

Amortized variance reduction for doubly stochastic objective

Ayman Boustati, Sattar Vakili, James Hensman, ST John

Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI), PMLR 124:61-70, 2020.

Abstract

Approximate inference in complex probabilistic models such as deep Gaussian processes requires the optimisation of doubly stochastic objective functions. These objectives incorporate randomness both from mini-batch subsampling of the data and from Monte Carlo estimation of expectations. If the gradient variance is high, the stochastic optimisation problem becomes difficult with a slow rate of convergence. Control variates can be used to reduce the variance, but past approaches do not take into account how mini-batch stochasticity affects sampling stochasticity, resulting in sub-optimal variance reduction. We propose a new approach in which we use a recognition network to cheaply approximate the optimal control variate for each mini-batch, with no additional model gradient computations. We illustrate the properties of this proposal and test its performance on logistic regression and deep Gaussian processes.

Cite this Paper

BibTeX

@InProceedings{pmlr-v124-boustati20a,
  title = 	 {Amortized variance reduction for doubly stochastic objective},
  author =       {Boustati, Ayman and Vakili, Sattar and Hensman, James and John, ST},
  booktitle = 	 {Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI)},
  pages = 	 {61--70},
  year = 	 {2020},
  editor = 	 {Peters, Jonas and Sontag, David},
  volume = 	 {124},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {03--06 Aug},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v124/boustati20a/boustati20a.pdf},
  url = 	 {https://proceedings.mlr.press/v124/boustati20a.html},
  abstract = 	 {Approximate inference in complex probabilistic models such as deep Gaussian processes requires the optimisation of doubly stochastic objective functions. These objectives incorporate randomness both from mini-batch subsampling of the data and from Monte Carlo estimation of expectations. If the gradient variance is high, the stochastic optimisation problem becomes difficult with a slow rate of convergence. Control variates can be used to reduce the variance, but past approaches do not take into account how mini-batch stochasticity affects sampling stochasticity, resulting in sub-optimal variance reduction. We propose a new approach in which we use a recognition network to cheaply approximate the optimal control variate for each mini-batch, with no additional model gradient computations. We illustrate the properties of this proposal and test its performance on logistic regression and deep Gaussian processes.}
}

Endnote

%0 Conference Paper
%T Amortized variance reduction for doubly stochastic objective
%A Ayman Boustati
%A Sattar Vakili
%A James Hensman
%A ST John
%B Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI)
%C Proceedings of Machine Learning Research
%D 2020
%E Jonas Peters
%E David Sontag	
%F pmlr-v124-boustati20a
%I PMLR
%P 61--70
%U https://proceedings.mlr.press/v124/boustati20a.html
%V 124
%X Approximate inference in complex probabilistic models such as deep Gaussian processes requires the optimisation of doubly stochastic objective functions. These objectives incorporate randomness both from mini-batch subsampling of the data and from Monte Carlo estimation of expectations. If the gradient variance is high, the stochastic optimisation problem becomes difficult with a slow rate of convergence. Control variates can be used to reduce the variance, but past approaches do not take into account how mini-batch stochasticity affects sampling stochasticity, resulting in sub-optimal variance reduction. We propose a new approach in which we use a recognition network to cheaply approximate the optimal control variate for each mini-batch, with no additional model gradient computations. We illustrate the properties of this proposal and test its performance on logistic regression and deep Gaussian processes.

APA

Boustati, A., Vakili, S., Hensman, J. & John, S.. (2020). Amortized variance reduction for doubly stochastic objective. Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI), in Proceedings of Machine Learning Research 124:61-70 Available from https://proceedings.mlr.press/v124/boustati20a.html.

Amortized variance reduction for doubly stochastic objective

Abstract

Cite this Paper

Related Material