Multiclass Learnability and the ERM principle
Proceedings of the 24th Annual Conference on Learning Theory, PMLR 19:207-232, 2011.
Abstract
Multiclass learning is an area of growing practical relevance, for which the currently available theory is still far from providing satisfactory understanding. We study the learnability of multiclass prediction, and derive upper and lower bounds on the sample complexity of multiclass hypothesis classes in different learning models: batch/online, realizable/unrealizable, full information/bandit feedback. Our analysis reveals a surprising phenomenon: in the multiclass setting, in sharp contrast to binary classification, not all Empirical Risk Minimization (ERM) algorithms are equally successful. We show that there exist hypothesis classes for which some ERM learners have lower sample complexity than others. Furthermore, there are classes that are learnable by some ERM learners, while other ERM learners will fail to learn them. We propose a principle for designing good ERM learners, and use this principle to prove tight bounds on the sample complexity of learning symmetric multiclass hypothesis classes (that is, classes that are invariant under any permutation of label names). We demonstrate the relevance of the theory by analyzing the sample complexity of two widely used hypothesis classes: generalized linear multiclass models and reduction trees. We also obtain some practically relevant conclusions.