Stochastic Quasi-Newton Langevin Monte Carlo

Umut Simsekli; Roland Badeau; Taylan Cemgil; Gaël Richard

Stochastic Quasi-Newton Langevin Monte Carlo

Umut Simsekli, Roland Badeau, Taylan Cemgil, Gaël Richard

Proceedings of The 33rd International Conference on Machine Learning, PMLR 48:642-651, 2016.

Abstract

Recently, Stochastic Gradient Markov Chain Monte Carlo (SG-MCMC) methods have been proposed for scaling up Monte Carlo computations to large data problems. Whilst these approaches have proven useful in many applications, vanilla SG-MCMC might suffer from poor mixing rates when random variables exhibit strong couplings under the target densities or big scale differences. In this study, we propose a novel SG-MCMC method that takes the local geometry into account by using ideas from Quasi-Newton optimization methods. These second order methods directly approximate the inverse Hessian by using a limited history of samples and their gradients. Our method uses dense approximations of the inverse Hessian while keeping the time and memory complexities linear with the dimension of the problem. We provide a formal theoretical analysis where we show that the proposed method is asymptotically unbiased and consistent with the posterior expectations. We illustrate the effectiveness of the approach on both synthetic and real datasets. Our experiments on two challenging applications show that our method achieves fast convergence rates similar to Riemannian approaches while at the same time having low computational requirements similar to diagonal preconditioning approaches.

Cite this Paper

BibTeX


@InProceedings{pmlr-v48-simsekli16,
  title = 	 {Stochastic Quasi-Newton Langevin Monte Carlo},
  author = 	 {Simsekli, Umut and Badeau, Roland and Cemgil, Taylan and Richard, Gaël},
  booktitle = 	 {Proceedings of The 33rd International Conference on Machine Learning},
  pages = 	 {642--651},
  year = 	 {2016},
  editor = 	 {Balcan, Maria Florina and Weinberger, Kilian Q.},
  volume = 	 {48},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {New York, New York, USA},
  month = 	 {20--22 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v48/simsekli16.pdf},
  url = 	 {https://proceedings.mlr.press/v48/simsekli16.html},
  abstract = 	 {Recently, Stochastic Gradient Markov Chain Monte Carlo (SG-MCMC) methods have been proposed for scaling up Monte Carlo computations to large data problems. Whilst these approaches have proven useful in many applications, vanilla SG-MCMC might suffer from poor mixing rates when random variables exhibit strong couplings under the target densities or big scale differences. In this study, we propose a novel SG-MCMC method that takes the local geometry into account by using ideas from Quasi-Newton optimization methods. These second order methods directly approximate the inverse Hessian by using a limited history of samples and their gradients. Our method uses dense approximations of the inverse Hessian while keeping the time and memory complexities linear with the dimension of the problem. We provide a formal theoretical analysis where we show that the proposed method is asymptotically unbiased and consistent with the posterior expectations. We illustrate the effectiveness of the approach on both synthetic and real datasets. Our experiments on two challenging applications show that our method achieves fast convergence rates similar to Riemannian approaches while at the same time having low computational requirements similar to diagonal preconditioning approaches.}
}

Endnote

%0 Conference Paper
%T Stochastic Quasi-Newton Langevin Monte Carlo
%A Umut Simsekli
%A Roland Badeau
%A Taylan Cemgil
%A Gaël Richard
%B Proceedings of The 33rd International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2016
%E Maria Florina Balcan
%E Kilian Q. Weinberger	
%F pmlr-v48-simsekli16
%I PMLR
%P 642--651
%U https://proceedings.mlr.press/v48/simsekli16.html
%V 48
%X Recently, Stochastic Gradient Markov Chain Monte Carlo (SG-MCMC) methods have been proposed for scaling up Monte Carlo computations to large data problems. Whilst these approaches have proven useful in many applications, vanilla SG-MCMC might suffer from poor mixing rates when random variables exhibit strong couplings under the target densities or big scale differences. In this study, we propose a novel SG-MCMC method that takes the local geometry into account by using ideas from Quasi-Newton optimization methods. These second order methods directly approximate the inverse Hessian by using a limited history of samples and their gradients. Our method uses dense approximations of the inverse Hessian while keeping the time and memory complexities linear with the dimension of the problem. We provide a formal theoretical analysis where we show that the proposed method is asymptotically unbiased and consistent with the posterior expectations. We illustrate the effectiveness of the approach on both synthetic and real datasets. Our experiments on two challenging applications show that our method achieves fast convergence rates similar to Riemannian approaches while at the same time having low computational requirements similar to diagonal preconditioning approaches.

RIS


TY  - CPAPER
TI  - Stochastic Quasi-Newton Langevin Monte Carlo
AU  - Umut Simsekli
AU  - Roland Badeau
AU  - Taylan Cemgil
AU  - Gaël Richard
BT  - Proceedings of The 33rd International Conference on Machine Learning
DA  - 2016/06/11
ED  - Maria Florina Balcan
ED  - Kilian Q. Weinberger	
ID  - pmlr-v48-simsekli16
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 48
SP  - 642
EP  - 651
L1  - http://proceedings.mlr.press/v48/simsekli16.pdf
UR  - https://proceedings.mlr.press/v48/simsekli16.html
AB  - Recently, Stochastic Gradient Markov Chain Monte Carlo (SG-MCMC) methods have been proposed for scaling up Monte Carlo computations to large data problems. Whilst these approaches have proven useful in many applications, vanilla SG-MCMC might suffer from poor mixing rates when random variables exhibit strong couplings under the target densities or big scale differences. In this study, we propose a novel SG-MCMC method that takes the local geometry into account by using ideas from Quasi-Newton optimization methods. These second order methods directly approximate the inverse Hessian by using a limited history of samples and their gradients. Our method uses dense approximations of the inverse Hessian while keeping the time and memory complexities linear with the dimension of the problem. We provide a formal theoretical analysis where we show that the proposed method is asymptotically unbiased and consistent with the posterior expectations. We illustrate the effectiveness of the approach on both synthetic and real datasets. Our experiments on two challenging applications show that our method achieves fast convergence rates similar to Riemannian approaches while at the same time having low computational requirements similar to diagonal preconditioning approaches.
ER  -

APA


Simsekli, U., Badeau, R., Cemgil, T. & Richard, G.. (2016). Stochastic Quasi-Newton Langevin Monte Carlo. Proceedings of The 33rd International Conference on Machine Learning, in Proceedings of Machine Learning Research 48:642-651 Available from https://proceedings.mlr.press/v48/simsekli16.html.

Stochastic Quasi-Newton Langevin Monte Carlo

Abstract

Cite this Paper

Related Material