A margin-based multiclass generalization bound via geometric complexity

Michael Munn; Benoit Dherin; Javier Gonzalvo

A margin-based multiclass generalization bound via geometric complexity

Michael Munn, Benoit Dherin, Javier Gonzalvo

Proceedings of 2nd Annual Workshop on Topology, Algebra, and Geometry in Machine Learning (TAG-ML), PMLR 221:189-205, 2023.

Abstract

There has been considerable effort to better understand the generalization capabilities of deep neural networks both as a means to unlock a theoretical understanding of their success as well as providing directions for further improvements. In this paper we investigate margin-based multiclass generalization bounds for neural networks which rely on a recent complexity measure, the geometric complexity, developed for neural networks and which measures the variability of the model function. We derive a new upper bound on the generalization error which scale with the margin-normalized geometric complexity of the network and which hold for a broad family of data distributions and model classes. Our generalization bound is empirically investigated for a ResNet-18 model trained with SGD on the CIFAR-10 and CIFAR-100 datasets with both original and random labels.

Cite this Paper

BibTeX


@InProceedings{pmlr-v221-munn23a,
  title = 	 {A margin-based multiclass generalization bound via geometric complexity},
  author =       {Munn, Michael and Dherin, Benoit and Gonzalvo, Javier},
  booktitle = 	 {Proceedings of 2nd Annual Workshop on Topology, Algebra, and Geometry in Machine Learning (TAG-ML)},
  pages = 	 {189--205},
  year = 	 {2023},
  editor = 	 {Doster, Timothy and Emerson, Tegan and Kvinge, Henry and Miolane, Nina and Papillon, Mathilde and Rieck, Bastian and Sanborn, Sophia},
  volume = 	 {221},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {28 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v221/munn23a/munn23a.pdf},
  url = 	 {https://proceedings.mlr.press/v221/munn23a.html},
  abstract = 	 {There has been considerable effort to better understand the generalization capabilities of deep neural networks both as a means to unlock a theoretical understanding of their success as well as providing directions for further improvements. In this paper we investigate margin-based multiclass generalization bounds for neural networks which rely on a recent complexity measure, the geometric complexity,  developed for neural networks and which measures the variability of the model function. We derive a new upper bound on the generalization error which scale with the margin-normalized geometric complexity of the network and which hold for a broad family of data distributions and model classes. Our generalization bound is empirically investigated for a ResNet-18 model trained with SGD on the CIFAR-10 and CIFAR-100 datasets with both original and random labels.}
}

Endnote

%0 Conference Paper
%T A margin-based multiclass generalization bound via geometric complexity
%A Michael Munn
%A Benoit Dherin
%A Javier Gonzalvo
%B Proceedings of 2nd Annual Workshop on Topology, Algebra, and Geometry in Machine Learning (TAG-ML)
%C Proceedings of Machine Learning Research
%D 2023
%E Timothy Doster
%E Tegan Emerson
%E Henry Kvinge
%E Nina Miolane
%E Mathilde Papillon
%E Bastian Rieck
%E Sophia Sanborn	
%F pmlr-v221-munn23a
%I PMLR
%P 189--205
%U https://proceedings.mlr.press/v221/munn23a.html
%V 221
%X There has been considerable effort to better understand the generalization capabilities of deep neural networks both as a means to unlock a theoretical understanding of their success as well as providing directions for further improvements. In this paper we investigate margin-based multiclass generalization bounds for neural networks which rely on a recent complexity measure, the geometric complexity,  developed for neural networks and which measures the variability of the model function. We derive a new upper bound on the generalization error which scale with the margin-normalized geometric complexity of the network and which hold for a broad family of data distributions and model classes. Our generalization bound is empirically investigated for a ResNet-18 model trained with SGD on the CIFAR-10 and CIFAR-100 datasets with both original and random labels.

APA


Munn, M., Dherin, B. & Gonzalvo, J.. (2023). A margin-based multiclass generalization bound via geometric complexity. Proceedings of 2nd Annual Workshop on Topology, Algebra, and Geometry in Machine Learning (TAG-ML), in Proceedings of Machine Learning Research 221:189-205 Available from https://proceedings.mlr.press/v221/munn23a.html.

A margin-based multiclass generalization bound via geometric complexity

Abstract

Cite this Paper

Related Material