[edit]
Learning Mixtures of Gaussians with Censored Data
Proceedings of the 40th International Conference on Machine Learning, PMLR 202:33396-33415, 2023.
Abstract
We study the problem of learning mixtures of Gaussians with censored data. Statistical learning with censored data is a classical problem, with numerous practical applications, however, finite-sample guarantees for even simple latent variable models such as Gaussian mixtures are missing. Formally, we are given censored data from a mixture of univariate Gaussians ∑ki=1wiN(μi,σ2), i.e. the sample is observed only if it lies inside a set S. The goal is to learn the weights wi and the means μi. We propose an algorithm that takes only 1εO(k) samples to estimate the weights wi and the means μi within ε error.