Breaking The Dimension Dependence in Sparse Distribution Estimation under Communication Constraints

Wei-Ning Chen; Peter Kairouz; Ayfer Ozgur

Breaking The Dimension Dependence in Sparse Distribution Estimation under Communication Constraints

Wei-Ning Chen, Peter Kairouz, Ayfer Ozgur

Proceedings of Thirty Fourth Conference on Learning Theory, PMLR 134:1028-1059, 2021.

Abstract

We consider the problem of estimating a

$d$ -dimensional

$s$ -sparse discrete distribution from its samples observed under a

$b$ -bit communication constraint. The best-known previous result on

$\ell_2$ estimation error for this problem is

$O\left( \frac{s\log\left( {d}/{s}\right)}{n2^b}\right)$ . Surprisingly, we show that when sample size

$n$ exceeds a minimum threshold

$n^*(s, d, b)$ , we can achieve an

$\ell_2$ estimation error of

$O\left( \frac{s}{n2^b}\right)$ . This implies that when

$n>n^*(s, d, b)$ the convergence rate does not depend on the ambient dimension

$d$ and is the same as knowing the support of the distribution beforehand. We next ask the question: ``what is the minimum

$n^*(s, d, b)$ that allows dimension-free convergence?'. To upper bound

$n^*(s, d, b)$ , we develop novel localization schemes to accurately and efficiently localize the unknown support. For the non-interactive setting, we show that

$n^*(s, d, b) = O\left( \min \left( {d^2\log^2 d}/{2^b}, {s^4\log^2 d}/{2^b}\right) \right)$ . Moreover, we connect the problem with non-adaptive group testing and obtain a polynomial-time estimation scheme when

$n = \tilde{\Omega}\left({s^4\log^4 d}/{2^b}\right)$ . This group testing based scheme is adaptive to the sparsity parameter

$s$ , and hence can be applied without knowing it. For the interactive setting, we propose a novel tree-based estimation scheme and show that the minimum sample-size needed to achieve dimension-free convergence can be further reduced to

$n^*(s, d, b) = \tilde{O}\left( {s^2\log^2 d}/{2^b} \right)$ .

Cite this Paper

BibTeX


@InProceedings{pmlr-v134-chen21a,
  title = 	 {Breaking The Dimension Dependence in Sparse Distribution Estimation under Communication Constraints},
  author =       {Chen, Wei-Ning and Kairouz, Peter and Ozgur, Ayfer},
  booktitle = 	 {Proceedings of Thirty Fourth Conference on Learning Theory},
  pages = 	 {1028--1059},
  year = 	 {2021},
  editor = 	 {Belkin, Mikhail and Kpotufe, Samory},
  volume = 	 {134},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {15--19 Aug},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v134/chen21a/chen21a.pdf},
  url = 	 {https://proceedings.mlr.press/v134/chen21a.html},
  abstract = 	 {We consider the problem of estimating a $d$-dimensional $s$-sparse discrete distribution from its samples observed under a $b$-bit communication constraint. The best-known previous result on $\ell_2$ estimation error for this problem is $O\left( \frac{s\log\left( {d}/{s}\right)}{n2^b}\right)$. Surprisingly, we show that when sample size $n$ exceeds a minimum threshold $n^*(s, d, b)$, we can achieve an $\ell_2$ estimation error of $O\left( \frac{s}{n2^b}\right)$. This implies that when $n>n^*(s, d, b)$ the convergence rate does not depend on the ambient dimension $d$ and is the same as knowing the support of the distribution beforehand.
We next ask the question: ``what is the minimum $n^*(s, d, b)$ that allows dimension-free convergence?'.  To upper bound $n^*(s, d, b)$, we develop novel localization schemes to accurately and efficiently localize the unknown support. For the non-interactive setting, we show that $n^*(s, d, b) = O\left( \min \left( {d^2\log^2 d}/{2^b}, {s^4\log^2 d}/{2^b}\right) \right)$. Moreover,  we connect the problem with non-adaptive group testing and obtain a polynomial-time estimation scheme when $n = \tilde{\Omega}\left({s^4\log^4 d}/{2^b}\right)$. This group testing based scheme is adaptive to the sparsity parameter $s$, and hence can be applied without knowing it. For the interactive setting, we propose a novel tree-based estimation scheme and show that the minimum sample-size needed to achieve dimension-free convergence can be further reduced to $n^*(s, d, b) = \tilde{O}\left( {s^2\log^2 d}/{2^b} \right)$.}
}

Endnote

%0 Conference Paper
%T Breaking The Dimension Dependence in Sparse Distribution Estimation under Communication Constraints
%A Wei-Ning Chen
%A Peter Kairouz
%A Ayfer Ozgur
%B Proceedings of Thirty Fourth Conference on Learning Theory
%C Proceedings of Machine Learning Research
%D 2021
%E Mikhail Belkin
%E Samory Kpotufe	
%F pmlr-v134-chen21a
%I PMLR
%P 1028--1059
%U https://proceedings.mlr.press/v134/chen21a.html
%V 134
%X We consider the problem of estimating a $d$-dimensional $s$-sparse discrete distribution from its samples observed under a $b$-bit communication constraint. The best-known previous result on $\ell_2$ estimation error for this problem is $O\left( \frac{s\log\left( {d}/{s}\right)}{n2^b}\right)$. Surprisingly, we show that when sample size $n$ exceeds a minimum threshold $n^*(s, d, b)$, we can achieve an $\ell_2$ estimation error of $O\left( \frac{s}{n2^b}\right)$. This implies that when $n>n^*(s, d, b)$ the convergence rate does not depend on the ambient dimension $d$ and is the same as knowing the support of the distribution beforehand.
We next ask the question: ``what is the minimum $n^*(s, d, b)$ that allows dimension-free convergence?'.  To upper bound $n^*(s, d, b)$, we develop novel localization schemes to accurately and efficiently localize the unknown support. For the non-interactive setting, we show that $n^*(s, d, b) = O\left( \min \left( {d^2\log^2 d}/{2^b}, {s^4\log^2 d}/{2^b}\right) \right)$. Moreover,  we connect the problem with non-adaptive group testing and obtain a polynomial-time estimation scheme when $n = \tilde{\Omega}\left({s^4\log^4 d}/{2^b}\right)$. This group testing based scheme is adaptive to the sparsity parameter $s$, and hence can be applied without knowing it. For the interactive setting, we propose a novel tree-based estimation scheme and show that the minimum sample-size needed to achieve dimension-free convergence can be further reduced to $n^*(s, d, b) = \tilde{O}\left( {s^2\log^2 d}/{2^b} \right)$.

APA


Chen, W., Kairouz, P. & Ozgur, A.. (2021). Breaking The Dimension Dependence in Sparse Distribution Estimation under Communication Constraints. Proceedings of Thirty Fourth Conference on Learning Theory, in Proceedings of Machine Learning Research 134:1028-1059 Available from https://proceedings.mlr.press/v134/chen21a.html.

Related Material

Download PDF