Provable Robustness of Adversarial Training for Learning Halfspaces with Noise

Difan Zou; Spencer Frei; Quanquan Gu

Provable Robustness of Adversarial Training for Learning Halfspaces with Noise

Difan Zou, Spencer Frei, Quanquan Gu

Proceedings of the 38th International Conference on Machine Learning, PMLR 139:13002-13011, 2021.

Abstract

We analyze the properties of adversarial training for learning adversarially robust halfspaces in the presence of agnostic label noise. Denoting

$\mathsf{OPT}_{p,r}$ as the best classification error achieved by a halfspace that is robust to perturbations of

$\ell^{p}$ balls of radius

$r$ , we show that adversarial training on the standard binary cross-entropy loss yields adversarially robust halfspaces up to classification error

$\tilde O(\sqrt{\mathsf{OPT}_{2,r}})$ for

$p=2$ , and

$\tilde O(d^{1/4} \sqrt{\mathsf{OPT}_{\infty, r}})$ when

$p=\infty$ . Our results hold for distributions satisfying anti-concentration properties enjoyed by log-concave isotropic distributions among others. We additionally show that if one instead uses a non-convex sigmoidal loss, adversarial training yields halfspaces with an improved robust classification error of

$O(\mathsf{OPT}_{2,r})$ for

$p=2$ , and

$O(d^{1/4} \mathsf{OPT}_{\infty, r})$ when

$p=\infty$ . To the best of our knowledge, this is the first work showing that adversarial training provably yields robust classifiers in the presence of noise.

Cite this Paper

BibTeX


@InProceedings{pmlr-v139-zou21a,
  title = 	 {Provable Robustness of Adversarial Training for Learning Halfspaces with Noise},
  author =       {Zou, Difan and Frei, Spencer and Gu, Quanquan},
  booktitle = 	 {Proceedings of the 38th International Conference on Machine Learning},
  pages = 	 {13002--13011},
  year = 	 {2021},
  editor = 	 {Meila, Marina and Zhang, Tong},
  volume = 	 {139},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {18--24 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v139/zou21a/zou21a.pdf},
  url = 	 {https://proceedings.mlr.press/v139/zou21a.html},
  abstract = 	 {We analyze the properties of adversarial training for learning adversarially robust halfspaces in the presence of agnostic label noise. Denoting $\mathsf{OPT}_{p,r}$ as the best classification error achieved by a halfspace that is robust to perturbations of $\ell^{p}$ balls of radius $r$, we show that adversarial training on the standard binary cross-entropy loss yields adversarially robust halfspaces up to classification error $\tilde O(\sqrt{\mathsf{OPT}_{2,r}})$ for $p=2$, and $\tilde O(d^{1/4} \sqrt{\mathsf{OPT}_{\infty, r}})$ when $p=\infty$. Our results hold for distributions satisfying anti-concentration properties enjoyed by log-concave isotropic distributions among others. We additionally show that if one instead uses a non-convex sigmoidal loss, adversarial training yields halfspaces with an improved robust classification error of $O(\mathsf{OPT}_{2,r})$ for $p=2$, and $O(d^{1/4} \mathsf{OPT}_{\infty, r})$ when $p=\infty$. To the best of our knowledge, this is the first work showing that adversarial training provably yields robust classifiers in the presence of noise.}
}

Endnote

%0 Conference Paper
%T Provable Robustness of Adversarial Training for Learning Halfspaces with Noise
%A Difan Zou
%A Spencer Frei
%A Quanquan Gu
%B Proceedings of the 38th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2021
%E Marina Meila
%E Tong Zhang	
%F pmlr-v139-zou21a
%I PMLR
%P 13002--13011
%U https://proceedings.mlr.press/v139/zou21a.html
%V 139
%X We analyze the properties of adversarial training for learning adversarially robust halfspaces in the presence of agnostic label noise. Denoting $\mathsf{OPT}_{p,r}$ as the best classification error achieved by a halfspace that is robust to perturbations of $\ell^{p}$ balls of radius $r$, we show that adversarial training on the standard binary cross-entropy loss yields adversarially robust halfspaces up to classification error $\tilde O(\sqrt{\mathsf{OPT}_{2,r}})$ for $p=2$, and $\tilde O(d^{1/4} \sqrt{\mathsf{OPT}_{\infty, r}})$ when $p=\infty$. Our results hold for distributions satisfying anti-concentration properties enjoyed by log-concave isotropic distributions among others. We additionally show that if one instead uses a non-convex sigmoidal loss, adversarial training yields halfspaces with an improved robust classification error of $O(\mathsf{OPT}_{2,r})$ for $p=2$, and $O(d^{1/4} \mathsf{OPT}_{\infty, r})$ when $p=\infty$. To the best of our knowledge, this is the first work showing that adversarial training provably yields robust classifiers in the presence of noise.

APA


Zou, D., Frei, S. & Gu, Q.. (2021). Provable Robustness of Adversarial Training for Learning Halfspaces with Noise. Proceedings of the 38th International Conference on Machine Learning, in Proceedings of Machine Learning Research 139:13002-13011 Available from https://proceedings.mlr.press/v139/zou21a.html.

Provable Robustness of Adversarial Training for Learning Halfspaces with Noise

Abstract

Cite this Paper

Related Material