Robust Training in High Dimensions via Block Coordinate Geometric Median Descent

Anish Acharya; Abolfazl Hashemi; Prateek Jain; Sujay Sanghavi; Inderjit S. Dhillon; Ufuk Topcu

Robust Training in High Dimensions via Block Coordinate Geometric Median Descent

Anish Acharya, Abolfazl Hashemi, Prateek Jain, Sujay Sanghavi, Inderjit S. Dhillon, Ufuk Topcu

Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, PMLR 151:11145-11168, 2022.

Abstract

Geometric median (GM) is a classical method in statistics for achieving robust estimation of the uncorrupted data; under gross corruption, it achieves the optimal breakdown point of 1/2. However, its computational complexity makes it infeasible for robustifying stochastic gradient descent (SGD) in high-dimensional optimization problems. In this paper, we show that by applying GM to only a judiciously chosen block of coordinates at a time and using a memory mechanism, one can retain the breakdown point of 1/2 for smooth non-convex problems, with non-asymptotic convergence rates comparable to the SGD with GM while resulting in significant speedup in training. We further validate the run-time and robustness of our approach empirically on several popular deep learning tasks. Code available at: https://github.com/anishacharya/BGMD

Cite this Paper

BibTeX


@InProceedings{pmlr-v151-acharya22a,
  title = 	 { Robust Training in High Dimensions via Block Coordinate Geometric Median Descent },
  author =       {Acharya, Anish and Hashemi, Abolfazl and Jain, Prateek and Sanghavi, Sujay and Dhillon, Inderjit S. and Topcu, Ufuk},
  booktitle = 	 {Proceedings of The 25th International Conference on Artificial Intelligence and Statistics},
  pages = 	 {11145--11168},
  year = 	 {2022},
  editor = 	 {Camps-Valls, Gustau and Ruiz, Francisco J. R. and Valera, Isabel},
  volume = 	 {151},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {28--30 Mar},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v151/acharya22a/acharya22a.pdf},
  url = 	 {https://proceedings.mlr.press/v151/acharya22a.html},
  abstract = 	 { Geometric median (GM) is a classical method in statistics for achieving robust estimation of the uncorrupted data; under gross corruption, it achieves the optimal breakdown point of 1/2. However, its computational complexity makes it infeasible for robustifying stochastic gradient descent (SGD) in high-dimensional optimization problems. In this paper, we show that by applying GM to only a judiciously chosen block of coordinates at a time and using a memory mechanism, one can retain the breakdown point of 1/2 for smooth non-convex problems, with non-asymptotic convergence rates comparable to the SGD with GM while resulting in significant speedup in training. We further validate the run-time and robustness of our approach empirically on several popular deep learning tasks. Code available at: https://github.com/anishacharya/BGMD }
}

Endnote

%0 Conference Paper
%T  Robust Training in High Dimensions via Block Coordinate Geometric Median Descent 
%A Anish Acharya
%A Abolfazl Hashemi
%A Prateek Jain
%A Sujay Sanghavi
%A Inderjit S. Dhillon
%A Ufuk Topcu
%B Proceedings of The 25th International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2022
%E Gustau Camps-Valls
%E Francisco J. R. Ruiz
%E Isabel Valera	
%F pmlr-v151-acharya22a
%I PMLR
%P 11145--11168
%U https://proceedings.mlr.press/v151/acharya22a.html
%V 151
%X  Geometric median (GM) is a classical method in statistics for achieving robust estimation of the uncorrupted data; under gross corruption, it achieves the optimal breakdown point of 1/2. However, its computational complexity makes it infeasible for robustifying stochastic gradient descent (SGD) in high-dimensional optimization problems. In this paper, we show that by applying GM to only a judiciously chosen block of coordinates at a time and using a memory mechanism, one can retain the breakdown point of 1/2 for smooth non-convex problems, with non-asymptotic convergence rates comparable to the SGD with GM while resulting in significant speedup in training. We further validate the run-time and robustness of our approach empirically on several popular deep learning tasks. Code available at: https://github.com/anishacharya/BGMD

APA


Acharya, A., Hashemi, A., Jain, P., Sanghavi, S., Dhillon, I.S. & Topcu, U.. (2022).  Robust Training in High Dimensions via Block Coordinate Geometric Median Descent . Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 151:11145-11168 Available from https://proceedings.mlr.press/v151/acharya22a.html.

Robust Training in High Dimensions via Block Coordinate Geometric Median Descent

Abstract

Cite this Paper

Related Material