Screening Data Points in Empirical Risk Minimization via Ellipsoidal Regions and Safe Loss Functions

Grégoire Mialon; Julien Mairal; Alexandre d’Aspremont

Screening Data Points in Empirical Risk Minimization via Ellipsoidal Regions and Safe Loss Functions

Grégoire Mialon, Julien Mairal, Alexandre d’Aspremont

Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, PMLR 108:3610-3620, 2020.

Abstract

We design simple screening tests to automatically discard data samples in empirical risk minimization withoutlosing optimization guarantees. We derive loss functions that produce dual objectives with a sparse solution. We also show how to regularize convex losses to ensure such a dual sparsity-inducing property, andpropose a general method to design screening tests for classification or regression based on ellipsoidal approximations of the optimal set. In addition to producing computational gains, our approach also allows us to compress a dataset into a subset of representative points.

Cite this Paper

BibTeX

@InProceedings{pmlr-v108-mialon20a,
  title = 	 {Screening Data Points in Empirical Risk Minimization via Ellipsoidal Regions and Safe Loss Functions},
  author =       {Mialon, Gr\'egoire and Mairal, Julien and d'Aspremont, Alexandre},
  booktitle = 	 {Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics},
  pages = 	 {3610--3620},
  year = 	 {2020},
  editor = 	 {Chiappa, Silvia and Calandra, Roberto},
  volume = 	 {108},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {26--28 Aug},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v108/mialon20a/mialon20a.pdf},
  url = 	 {https://proceedings.mlr.press/v108/mialon20a.html},
  abstract = 	 {We design simple screening tests to automatically discard data samples in empirical risk minimization withoutlosing optimization guarantees. We derive loss functions that produce dual objectives with a sparse solution. We also show how to regularize convex losses to ensure such a dual sparsity-inducing property, andpropose a general method to design screening tests for classification or regression based on ellipsoidal approximations of the optimal set. In addition to producing computational gains, our approach also allows us to compress a dataset into a subset of representative points.}
}

Endnote

%0 Conference Paper
%T Screening Data Points in Empirical Risk Minimization via Ellipsoidal Regions and Safe Loss Functions
%A Grégoire Mialon
%A Julien Mairal
%A Alexandre d’Aspremont
%B Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2020
%E Silvia Chiappa
%E Roberto Calandra	
%F pmlr-v108-mialon20a
%I PMLR
%P 3610--3620
%U https://proceedings.mlr.press/v108/mialon20a.html
%V 108
%X We design simple screening tests to automatically discard data samples in empirical risk minimization withoutlosing optimization guarantees. We derive loss functions that produce dual objectives with a sparse solution. We also show how to regularize convex losses to ensure such a dual sparsity-inducing property, andpropose a general method to design screening tests for classification or regression based on ellipsoidal approximations of the optimal set. In addition to producing computational gains, our approach also allows us to compress a dataset into a subset of representative points.

APA

Mialon, G., Mairal, J. & d’Aspremont, A.. (2020). Screening Data Points in Empirical Risk Minimization via Ellipsoidal Regions and Safe Loss Functions. Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 108:3610-3620 Available from https://proceedings.mlr.press/v108/mialon20a.html.

Screening Data Points in Empirical Risk Minimization via Ellipsoidal Regions and Safe Loss Functions

Abstract

Cite this Paper

Related Material