Large Covariance Matrix Estimation With Nonnegative Correlations

Yixin Yan; QIAO YANG; Ziping Zhao

Large Covariance Matrix Estimation With Nonnegative Correlations

Yixin Yan, QIAO YANG, Ziping Zhao

Proceedings of The 28th International Conference on Artificial Intelligence and Statistics, PMLR 258:3502-3510, 2025.

Abstract

Covariance matrix estimation is a fundamental problem in multivariate data analysis. In many situations, it is often observed that variables exhibit a positive linear dependency, indicating a positive linear correlation. This paper tackles the challenge of estimating covariance matrices with positive correlations in high-dimensional settings. We propose a positive definite thresholding covariance estimation problem that includes nonconvex sparsity penalties and nonnegative correlation constraints. To address this problem, we introduce a multistage adaptive estimation algorithm based on majorization-minimization (MM). This algorithm progressively refines the estimates by solving a weighted $\ell_{1}$-regularized problem at each stage. Additionally, we present a comprehensive theoretical analysis that characterizes the estimation error associated with the estimates generated by the MM algorithm. The analysis reveals that the error comprises two components: the optimization error and the statistical error. The optimization error decreases to zero at a linear rate, allowing the proposed estimator to eventually reach the oracle statistical rate under mild conditions. Furthermore, we explore various extensions based on the proposed estimation technique. Our theoretical findings are supported by extensive numerical experiments conducted on both synthetic and real-world datasets. Furthermore, we demonstrate that the proposed estimation technique can be expanded to the correlation matrix estimation scenario. Our theoretical findings are corroborated through extensive numerical experiments on both synthetic data and real-world datasets.

Cite this Paper

BibTeX

@InProceedings{pmlr-v258-yan25b,
  title = 	 {Large Covariance Matrix Estimation With Nonnegative Correlations},
  author =       {Yan, Yixin and YANG, QIAO and Zhao, Ziping},
  booktitle = 	 {Proceedings of The 28th International Conference on Artificial Intelligence and Statistics},
  pages = 	 {3502--3510},
  year = 	 {2025},
  editor = 	 {Li, Yingzhen and Mandt, Stephan and Agrawal, Shipra and Khan, Emtiyaz},
  volume = 	 {258},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {03--05 May},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v258/main/assets/yan25b/yan25b.pdf},
  url = 	 {https://proceedings.mlr.press/v258/yan25b.html},
  abstract = 	 {Covariance matrix estimation is a fundamental problem in multivariate data analysis.  In many situations, it is often observed that variables exhibit a positive linear dependency, indicating a positive linear correlation. This paper tackles the challenge of estimating covariance matrices with positive correlations in high-dimensional settings. We propose a positive definite thresholding covariance estimation problem that includes nonconvex sparsity penalties and nonnegative correlation constraints. To address this problem, we introduce a multistage adaptive estimation algorithm based on majorization-minimization (MM). This algorithm progressively refines the estimates by solving a weighted $\ell_{1}$-regularized problem at each stage. Additionally, we present a comprehensive theoretical analysis that characterizes the estimation error associated with the estimates generated by the MM algorithm. The analysis reveals that the error comprises two components: the optimization error and the statistical error. The optimization error decreases to zero at a linear rate, allowing the proposed estimator to eventually reach the oracle statistical rate under mild conditions. Furthermore, we explore various extensions based on the proposed estimation technique. Our theoretical findings are supported by extensive numerical experiments conducted on both synthetic and real-world datasets. Furthermore, we demonstrate that the proposed estimation technique can be expanded to the correlation matrix estimation scenario. Our theoretical findings are corroborated through extensive numerical experiments on both synthetic data and real-world datasets.}
}

Endnote

%0 Conference Paper
%T Large Covariance Matrix Estimation With Nonnegative Correlations
%A Yixin Yan
%A QIAO YANG
%A Ziping Zhao
%B Proceedings of The 28th International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2025
%E Yingzhen Li
%E Stephan Mandt
%E Shipra Agrawal
%E Emtiyaz Khan	
%F pmlr-v258-yan25b
%I PMLR
%P 3502--3510
%U https://proceedings.mlr.press/v258/yan25b.html
%V 258
%X Covariance matrix estimation is a fundamental problem in multivariate data analysis.  In many situations, it is often observed that variables exhibit a positive linear dependency, indicating a positive linear correlation. This paper tackles the challenge of estimating covariance matrices with positive correlations in high-dimensional settings. We propose a positive definite thresholding covariance estimation problem that includes nonconvex sparsity penalties and nonnegative correlation constraints. To address this problem, we introduce a multistage adaptive estimation algorithm based on majorization-minimization (MM). This algorithm progressively refines the estimates by solving a weighted $\ell_{1}$-regularized problem at each stage. Additionally, we present a comprehensive theoretical analysis that characterizes the estimation error associated with the estimates generated by the MM algorithm. The analysis reveals that the error comprises two components: the optimization error and the statistical error. The optimization error decreases to zero at a linear rate, allowing the proposed estimator to eventually reach the oracle statistical rate under mild conditions. Furthermore, we explore various extensions based on the proposed estimation technique. Our theoretical findings are supported by extensive numerical experiments conducted on both synthetic and real-world datasets. Furthermore, we demonstrate that the proposed estimation technique can be expanded to the correlation matrix estimation scenario. Our theoretical findings are corroborated through extensive numerical experiments on both synthetic data and real-world datasets.

APA

Yan, Y., YANG, Q. & Zhao, Z.. (2025). Large Covariance Matrix Estimation With Nonnegative Correlations. Proceedings of The 28th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 258:3502-3510 Available from https://proceedings.mlr.press/v258/yan25b.html.

Large Covariance Matrix Estimation With Nonnegative Correlations

Abstract

Cite this Paper

Related Material