Fast and Multiphase Rates for Nearest Neighbor Classifiers

Pengkun Yang, Jingzhao Zhang
Proceedings of Thirty Eighth Conference on Learning Theory, PMLR 291:6014-6015, 2025.

Abstract

We study the scaling of classification error rates with respect to the size of the training dataset. In contrast to classical results where rates are minimax optimal for a problem class, this work starts with the empirical observation that, even for a fixed data distribution, the error scaling can have diverse rates across different ranges of sample size. To understand when and why the error rate is non-uniform, we theoretically analyze nearest neighbor classifiers. We show that an error scaling law can have fine-grained rates: in the early phase, the test error depends polynomially on the data dimension and decreases fast; whereas in the later phase, the error depends exponentially on the data dimension and decreases slowly. Our analysis highlights the complexity of the data distribution in determining the test error. When the data are distributed benignly, we show that the generalization error of the nearest neighbor classifier can depend polynomially, instead of exponentially, on the data dimension.
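
For readers who want to see this kind of non-uniform scaling empirically, the following Python sketch (not from the paper; the two-class Gaussian distribution, dimension d = 10, and the sample sizes are illustrative choices) estimates the test error of a 1-nearest-neighbor classifier as the training set grows. Plotting the printed errors against n on a log-log scale makes changes in the decay rate visible.

import numpy as np

rng = np.random.default_rng(0)
d = 10  # data dimension (illustrative choice, not from the paper)

def sample(n):
    """Draw n labeled points: label ~ Bernoulli(1/2), X ~ N(label * mu, I_d)."""
    y = rng.integers(0, 2, size=n)
    mu = np.zeros(d)
    mu[0] = 2.0
    x = rng.standard_normal((n, d)) + y[:, None] * mu
    return x, y

def one_nn_error(x_train, y_train, x_test, y_test):
    """Test error of the 1-nearest-neighbor rule under Euclidean distance."""
    # Pairwise squared distances via ||a||^2 + ||b||^2 - 2 a.b
    d2 = ((x_test ** 2).sum(1)[:, None]
          + (x_train ** 2).sum(1)[None, :]
          - 2.0 * x_test @ x_train.T)
    y_pred = y_train[d2.argmin(axis=1)]
    return float((y_pred != y_test).mean())

x_test, y_test = sample(2000)
for n in [10, 30, 100, 300, 1000, 3000]:
    # Average over 20 independent training sets of size n.
    errs = [one_nn_error(*sample(n), x_test, y_test) for _ in range(20)]
    print(f"n = {n:5d}   mean 1-NN test error = {np.mean(errs):.3f}")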

Cite this Paper


BibTeX
@InProceedings{pmlr-v291-yang25a,
  title     = {Fast and Multiphase Rates for Nearest Neighbor Classifiers},
  author    = {Yang, Pengkun and Zhang, Jingzhao},
  booktitle = {Proceedings of Thirty Eighth Conference on Learning Theory},
  pages     = {6014--6015},
  year      = {2025},
  editor    = {Haghtalab, Nika and Moitra, Ankur},
  volume    = {291},
  series    = {Proceedings of Machine Learning Research},
  month     = {30 Jun--04 Jul},
  publisher = {PMLR},
  pdf       = {https://raw.githubusercontent.com/mlresearch/v291/main/assets/yang25a/yang25a.pdf},
  url       = {https://proceedings.mlr.press/v291/yang25a.html}
}
Endnote
%0 Conference Paper
%T Fast and Multiphase Rates for Nearest Neighbor Classifiers
%A Pengkun Yang
%A Jingzhao Zhang
%B Proceedings of Thirty Eighth Conference on Learning Theory
%C Proceedings of Machine Learning Research
%D 2025
%E Nika Haghtalab
%E Ankur Moitra
%F pmlr-v291-yang25a
%I PMLR
%P 6014--6015
%U https://proceedings.mlr.press/v291/yang25a.html
%V 291
APA
Yang, P. & Zhang, J. (2025). Fast and Multiphase Rates for Nearest Neighbor Classifiers. Proceedings of Thirty Eighth Conference on Learning Theory, in Proceedings of Machine Learning Research 291:6014-6015. Available from https://proceedings.mlr.press/v291/yang25a.html.