Improved Generalization Bounds for Robust Learning

Idan Attias; Aryeh Kontorovich; Yishay Mansour

Proceedings of the 30th International Conference on Algorithmic Learning Theory, PMLR 98:162-183, 2019.

Abstract

We consider a model of robust learning in an adversarial environment. The learner gets uncorrupted training data with access to possible corruptions that may be effected by the adversary during testing. The learner’s goal is to build a robust classifier that would be tested on future adversarial examples. We use a zero-sum game between the learner and the adversary as our game-theoretic framework. The adversary is limited to $k$ possible corruptions for each input. Our model is closely related to the adversarial examples model of Schmidt et al. (2018) and Madry et al. (2017). Our main results consist of generalization bounds for binary and multi-class classification, as well as the real-valued case (regression). For the binary classification setting, we tighten the generalization bound of Feige, Mansour, and Schapire (2015) and also handle an infinite hypothesis class $H$. The sample complexity is improved from $O(\frac{1}{\epsilon^4}\log(\frac{|H|}{\delta}))$ to $O(\frac{1}{\epsilon^2}(k\log(k)VC(H)+\log\frac{1}{\delta}))$. Additionally, we extend the algorithm and generalization bound from the binary to the multi-class and real-valued cases. Along the way, we obtain results on the fat-shattering dimension and Rademacher complexity of $k$-fold maxima over function classes; these may be of independent interest. For binary classification, the algorithm of Feige et al. (2015) uses a regret minimization algorithm and an ERM oracle as a black box; we adapt it for the multi-class and regression settings. The algorithm provides us with near-optimal policies for the players on a given training sample.
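
As a rough illustration of the setting in the abstract, the sketch below scores a single deterministic classifier against an adversary that may pick any one of $k$ allowed corruptions of each test input. It is not the paper's algorithm: the names `robust_zero_one_loss`, `empirical_robust_risk`, and `threshold`, the toy data, and the choice of a worst-case 0-1 loss are assumptions made for this example, and the paper's game additionally allows mixed strategies for both players.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical types for this sketch: an input is a float, a label is an int.
Classifier = Callable[[float], int]
Example = Tuple[Sequence[float], int]  # (the k allowed corruptions, true label)


def robust_zero_one_loss(h: Classifier, corruptions: Sequence[float], y: int) -> int:
    """Worst-case 0-1 loss of h over the allowed corruptions of one input.

    Assumption: the adversary picks the corruption that hurts h the most.
    """
    return max(int(h(z) != y) for z in corruptions)


def empirical_robust_risk(h: Classifier, sample: Sequence[Example]) -> float:
    """Average worst-case loss of h over a sample of (corruptions, label) pairs."""
    return sum(robust_zero_one_loss(h, zs, y) for zs, y in sample) / len(sample)


def threshold(z: float) -> int:
    """Toy classifier: predict 1 for non-negative inputs, 0 otherwise."""
    return 1 if z >= 0.0 else 0


# Toy sample with k = 3 corruptions per input (made-up numbers).
sample = [
    ([0.5, 0.2, 0.9], 1),     # every corruption is classified correctly
    ([0.1, -0.1, 0.3], 1),    # the corruption -0.1 flips the prediction
    ([-0.7, -0.4, -0.2], 0),  # every corruption is classified correctly
    ([-0.3, 0.0, -0.5], 0),   # the corruption 0.0 flips the prediction
]

risk = empirical_robust_risk(threshold, sample)
```

Here the adversary succeeds on two of the four inputs, so `risk` evaluates to 0.5. The inner maximum over $k$ corruptions is the kind of $k$-fold maximum whose complexity the abstract refers to.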

Cite this Paper

BibTeX


@InProceedings{pmlr-v98-attias19a,
  title = 	 {Improved Generalization Bounds for Robust Learning},
  author =       {Attias, Idan and Kontorovich, Aryeh and Mansour, Yishay},
  booktitle = 	 {Proceedings of the 30th International Conference on Algorithmic Learning Theory},
  pages = 	 {162--183},
  year = 	 {2019},
  editor = 	 {Garivier, Aurélien and Kale, Satyen},
  volume = 	 {98},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {22--24 Mar},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v98/attias19a/attias19a.pdf},
  url = 	 {https://proceedings.mlr.press/v98/attias19a.html},
  abstract = 	 {We consider a model of robust learning in an adversarial
 environment. The learner gets uncorrupted training data with access
 to possible corruptions that may be effected by the adversary during
 testing. The learner’s goal is to build a robust classifier that would be
 tested on future adversarial examples. We use a zero-sum game
 between the learner and the adversary as our game-theoretic
 framework. The adversary is limited to $k$ possible corruptions for
 each input. Our model is closely related to the adversarial examples
 model of Schmidt et al. (2018) and Madry et al. (2017).
 Our main results consist of generalization bounds for binary and
 multi-class classification, as well as the real-valued case (regression).
 For the binary classification setting, we tighten the generalization bound of
 Feige, Mansour, and Schapire (2015) and also handle an infinite hypothesis class $H$.
 The sample complexity is improved from
 $O(\frac{1}{\epsilon^4}\log(\frac{|H|}{\delta}))$ to $O(\frac{1}{\epsilon^2}(k\log(k)VC(H)+\log\frac{1}{\delta}))$.
 Additionally, we extend the algorithm and generalization bound from the binary
 to the multi-class and real-valued cases. Along the way, we obtain results on the fat-shattering dimension
 and Rademacher complexity of $k$-fold maxima over function classes; these may be of independent interest.
 For binary classification, the algorithm of Feige et al. (2015) uses a regret minimization algorithm
 and an ERM oracle as a black box; we adapt it for the multi-class and regression settings.
 The algorithm provides us with near-optimal policies for the players on a given training sample.}
}

Endnote

%0 Conference Paper
%T Improved Generalization Bounds for Robust Learning
%A Idan Attias
%A Aryeh Kontorovich
%A Yishay Mansour
%B Proceedings of the 30th International Conference on Algorithmic Learning Theory
%C Proceedings of Machine Learning Research
%D 2019
%E Aurélien Garivier
%E Satyen Kale
%F pmlr-v98-attias19a
%I PMLR
%P 162--183
%U https://proceedings.mlr.press/v98/attias19a.html
%V 98
%X We consider a model of robust learning in an adversarial
 environment. The learner gets uncorrupted training data with access
 to possible corruptions that may be effected by the adversary during
 testing. The learner’s goal is to build a robust classifier that would be
 tested on future adversarial examples. We use a zero-sum game
 between the learner and the adversary as our game-theoretic
 framework. The adversary is limited to $k$ possible corruptions for
 each input. Our model is closely related to the adversarial examples
 model of Schmidt et al. (2018) and Madry et al. (2017).
 Our main results consist of generalization bounds for binary and
 multi-class classification, as well as the real-valued case (regression).
 For the binary classification setting, we tighten the generalization bound of
 Feige, Mansour, and Schapire (2015) and also handle an infinite hypothesis class $H$.
 The sample complexity is improved from
 $O(\frac{1}{\epsilon^4}\log(\frac{|H|}{\delta}))$ to $O(\frac{1}{\epsilon^2}(k\log(k)VC(H)+\log\frac{1}{\delta}))$.
 Additionally, we extend the algorithm and generalization bound from the binary
 to the multi-class and real-valued cases. Along the way, we obtain results on the fat-shattering dimension
 and Rademacher complexity of $k$-fold maxima over function classes; these may be of independent interest.
 For binary classification, the algorithm of Feige et al. (2015) uses a regret minimization algorithm
 and an ERM oracle as a black box; we adapt it for the multi-class and regression settings.
 The algorithm provides us with near-optimal policies for the players on a given training sample.

APA


Attias, I., Kontorovich, A. & Mansour, Y. (2019). Improved Generalization Bounds for Robust Learning. Proceedings of the 30th International Conference on Algorithmic Learning Theory, in Proceedings of Machine Learning Research 98:162-183. Available from https://proceedings.mlr.press/v98/attias19a.html.
