SDNA: Stochastic Dual Newton Ascent for Empirical Risk Minimization

Zheng Qu, Peter Richtarik, Martin Takac, Olivier Fercoq
Proceedings of The 33rd International Conference on Machine Learning, PMLR 48:1823-1832, 2016.

Abstract

We propose a new algorithm for minimizing regularized empirical loss: Stochastic Dual Newton Ascent (SDNA). Our method is dual in nature: in each iteration we update a random subset of the dual variables. However, unlike existing methods such as stochastic dual coordinate ascent, SDNA is capable of utilizing all local curvature information contained in the examples, which leads to striking improvements in both theory and practice – sometimes by orders of magnitude. In the special case when an L2-regularizer is used in the primal, the dual problem is a concave quadratic maximization problem plus a separable term. In this regime, SDNA in each step solves a proximal subproblem involving a random principal submatrix of the Hessian of the quadratic function; whence the name of the method.

Cite this Paper


BibTeX
@InProceedings{pmlr-v48-qub16, title = {SDNA: Stochastic Dual Newton Ascent for Empirical Risk Minimization}, author = {Qu, Zheng and Richtarik, Peter and Takac, Martin and Fercoq, Olivier}, booktitle = {Proceedings of The 33rd International Conference on Machine Learning}, pages = {1823--1832}, year = {2016}, editor = {Balcan, Maria Florina and Weinberger, Kilian Q.}, volume = {48}, series = {Proceedings of Machine Learning Research}, address = {New York, New York, USA}, month = {20--22 Jun}, publisher = {PMLR}, pdf = {http://proceedings.mlr.press/v48/qub16.pdf}, url = {https://proceedings.mlr.press/v48/qub16.html}, abstract = {We propose a new algorithm for minimizing regularized empirical loss: Stochastic Dual Newton Ascent (SDNA). Our method is dual in nature: in each iteration we update a random subset of the dual variables. However, unlike existing methods such as stochastic dual coordinate ascent, SDNA is capable of utilizing all local curvature information contained in the examples, which leads to striking improvements in both theory and practice – sometimes by orders of magnitude. In the special case when an L2-regularizer is used in the primal, the dual problem is a concave quadratic maximization problem plus a separable term. In this regime, SDNA in each step solves a proximal subproblem involving a random principal submatrix of the Hessian of the quadratic function; whence the name of the method.} }
Endnote
%0 Conference Paper %T SDNA: Stochastic Dual Newton Ascent for Empirical Risk Minimization %A Zheng Qu %A Peter Richtarik %A Martin Takac %A Olivier Fercoq %B Proceedings of The 33rd International Conference on Machine Learning %C Proceedings of Machine Learning Research %D 2016 %E Maria Florina Balcan %E Kilian Q. Weinberger %F pmlr-v48-qub16 %I PMLR %P 1823--1832 %U https://proceedings.mlr.press/v48/qub16.html %V 48 %X We propose a new algorithm for minimizing regularized empirical loss: Stochastic Dual Newton Ascent (SDNA). Our method is dual in nature: in each iteration we update a random subset of the dual variables. However, unlike existing methods such as stochastic dual coordinate ascent, SDNA is capable of utilizing all local curvature information contained in the examples, which leads to striking improvements in both theory and practice – sometimes by orders of magnitude. In the special case when an L2-regularizer is used in the primal, the dual problem is a concave quadratic maximization problem plus a separable term. In this regime, SDNA in each step solves a proximal subproblem involving a random principal submatrix of the Hessian of the quadratic function; whence the name of the method.
RIS
TY - CPAPER TI - SDNA: Stochastic Dual Newton Ascent for Empirical Risk Minimization AU - Zheng Qu AU - Peter Richtarik AU - Martin Takac AU - Olivier Fercoq BT - Proceedings of The 33rd International Conference on Machine Learning DA - 2016/06/11 ED - Maria Florina Balcan ED - Kilian Q. Weinberger ID - pmlr-v48-qub16 PB - PMLR DP - Proceedings of Machine Learning Research VL - 48 SP - 1823 EP - 1832 L1 - http://proceedings.mlr.press/v48/qub16.pdf UR - https://proceedings.mlr.press/v48/qub16.html AB - We propose a new algorithm for minimizing regularized empirical loss: Stochastic Dual Newton Ascent (SDNA). Our method is dual in nature: in each iteration we update a random subset of the dual variables. However, unlike existing methods such as stochastic dual coordinate ascent, SDNA is capable of utilizing all local curvature information contained in the examples, which leads to striking improvements in both theory and practice – sometimes by orders of magnitude. In the special case when an L2-regularizer is used in the primal, the dual problem is a concave quadratic maximization problem plus a separable term. In this regime, SDNA in each step solves a proximal subproblem involving a random principal submatrix of the Hessian of the quadratic function; whence the name of the method. ER -
APA
Qu, Z., Richtarik, P., Takac, M. & Fercoq, O.. (2016). SDNA: Stochastic Dual Newton Ascent for Empirical Risk Minimization. Proceedings of The 33rd International Conference on Machine Learning, in Proceedings of Machine Learning Research 48:1823-1832 Available from https://proceedings.mlr.press/v48/qub16.html.

Related Material