Near-optimal sample complexity bounds for learning Latent $k-$polytopes and applications to Ad-Mixtures

Chiranjib Bhattacharyya; Ravindran Kannan

Near-optimal sample complexity bounds for learning Latent $k-$ polytopes and applications to Ad-Mixtures

Chiranjib Bhattacharyya, Ravindran Kannan

Proceedings of the 37th International Conference on Machine Learning, PMLR 119:854-863, 2020.

Abstract

Deriving Optimal bounds on Sample Complexity of Latent Variable models is an active area of research. Recently such bounds were obtained for Mixture of Gaussians \cite{HSNCAY18}, no such results are known for Ad-mixtures, a generalization of Mixture distributions. In this paper we show that

$O^*(dk/m)$ samples are sufficient to learn each of

$k-$ topic vectors of LDA, a popular Ad-mixture model, with vocabulary size

$d$ and

$m\in \Omega(1)$ words per document, to any constant error in

$L_1$ norm. The result is a corollary of the major contribution of this paper: the first sample complexity upper bound for the problem (introduced in \cite{BK20}) of learning the vertices of a Latent

$k-$ Polytope in

$\RR^d$ , given perturbed points from it. The bound,

$O^*(dk/\beta)$ , is optimal and linear in number of parameters. It applies to many stochastic models including a broad class Ad-mixtures. To demonstrate the generality of the approach we specialize the setting to Mixed Membership Stochastic Block Models(MMSB) and show for the first time that if an MMSB has

$k$ blocks, the sample complexity is

$O^*(k^2)$ under usual assumptions.

Cite this Paper

BibTeX


@InProceedings{pmlr-v119-bhattacharyya20b,
  title = 	 {Near-optimal sample complexity bounds for learning Latent $k-$polytopes and applications to Ad-Mixtures},
  author =       {Bhattacharyya, Chiranjib and Kannan, Ravindran},
  booktitle = 	 {Proceedings of the 37th International Conference on Machine Learning},
  pages = 	 {854--863},
  year = 	 {2020},
  editor = 	 {III, Hal Daumé and Singh, Aarti},
  volume = 	 {119},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {13--18 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v119/bhattacharyya20b/bhattacharyya20b.pdf},
  url = 	 {https://proceedings.mlr.press/v119/bhattacharyya20b.html},
  abstract = 	 {Deriving Optimal bounds on Sample Complexity of Latent Variable models is an active area of research. Recently such bounds were obtained for Mixture of Gaussians \cite{HSNCAY18}, no such results are known for Ad-mixtures, a generalization of Mixture distributions. In this paper we show that $O^*(dk/m)$ samples are sufficient to learn each of $k-$ topic vectors of LDA, a popular Ad-mixture model, with vocabulary size $d$ and $m\in \Omega(1)$ words per document, to any constant error in $L_1$ norm. The result is a corollary of the major contribution of this paper: the first sample complexity upper bound for the problem (introduced in \cite{BK20}) of learning the vertices of a Latent $k-$ Polytope in $\RR^d$, given perturbed points from it. The bound, $O^*(dk/\beta)$, is optimal and linear in number of parameters. It applies to many stochastic models including a broad class Ad-mixtures. To demonstrate the generality of the approach we specialize the setting to Mixed Membership Stochastic Block Models(MMSB) and show for the first time that if an MMSB has $k$ blocks, the sample complexity is $O^*(k^2)$ under usual assumptions.}
}

Endnote

%0 Conference Paper
%T Near-optimal sample complexity bounds for learning Latent $k-$polytopes and applications to Ad-Mixtures
%A Chiranjib Bhattacharyya
%A Ravindran Kannan
%B Proceedings of the 37th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2020
%E Hal Daumé III
%E Aarti Singh	
%F pmlr-v119-bhattacharyya20b
%I PMLR
%P 854--863
%U https://proceedings.mlr.press/v119/bhattacharyya20b.html
%V 119
%X Deriving Optimal bounds on Sample Complexity of Latent Variable models is an active area of research. Recently such bounds were obtained for Mixture of Gaussians \cite{HSNCAY18}, no such results are known for Ad-mixtures, a generalization of Mixture distributions. In this paper we show that $O^*(dk/m)$ samples are sufficient to learn each of $k-$ topic vectors of LDA, a popular Ad-mixture model, with vocabulary size $d$ and $m\in \Omega(1)$ words per document, to any constant error in $L_1$ norm. The result is a corollary of the major contribution of this paper: the first sample complexity upper bound for the problem (introduced in \cite{BK20}) of learning the vertices of a Latent $k-$ Polytope in $\RR^d$, given perturbed points from it. The bound, $O^*(dk/\beta)$, is optimal and linear in number of parameters. It applies to many stochastic models including a broad class Ad-mixtures. To demonstrate the generality of the approach we specialize the setting to Mixed Membership Stochastic Block Models(MMSB) and show for the first time that if an MMSB has $k$ blocks, the sample complexity is $O^*(k^2)$ under usual assumptions.

APA


Bhattacharyya, C. & Kannan, R.. (2020). Near-optimal sample complexity bounds for learning Latent $k-$polytopes and applications to Ad-Mixtures. Proceedings of the 37th International Conference on Machine Learning, in Proceedings of Machine Learning Research 119:854-863 Available from https://proceedings.mlr.press/v119/bhattacharyya20b.html.

Near-optimal sample complexity bounds for learning Latent k−k-polytopes and applications to Ad-Mixtures

Abstract

Cite this Paper

Related Material

Near-optimal sample complexity bounds for learning Latent $k-$ polytopes and applications to Ad-Mixtures