Average case analysis of Lasso under ultra sparse conditions

Koki Okajima; Xiangming Meng; Takashi Takahashi; Yoshiyuki Kabashima

Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, PMLR 206:11317-11330, 2023.

Abstract

We analyze the performance of the least absolute shrinkage and selection operator (Lasso) for the linear model when the number of regressors $N$ grows larger keeping the true support size $d$ finite, i.e., the ultra-sparse case. The result is based on a novel treatment of the non-rigorous replica method in statistical physics, which has been applied only to problem settings where $N$, $d$ and the number of observations $M$ tend to infinity at the same rate. Our analysis makes it possible to assess the average performance of Lasso with Gaussian sensing matrices without assumptions on the scaling of $N$ and $M$, the noise distribution, and the profile of the true signal. Under mild conditions on the noise distribution, the analysis also offers a lower bound on the sample complexity necessary for partial and perfect support recovery when $M$ diverges as $M = O(\log N)$. The obtained bound for perfect support recovery is a generalization of that given in previous literature, which only considers the case of Gaussian noise and diverging $d$. Extensive numerical experiments strongly support our analysis.
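To make the setting in the abstract concrete, the following is a minimal sketch of an ultra-sparse Lasso experiment: a Gaussian sensing matrix, a true signal with a finite support of size $d$, and a Lasso estimate whose support can be compared with the true one. It is not the authors' code. The solver (plain proximal gradient, ISTA), the unit-variance matrix entries, the Gaussian noise, the unit-amplitude signal, the regularization heuristic, and all function and parameter names are assumptions made for illustration only.

```python
import numpy as np


def soft_threshold(z, t):
    """Elementwise soft-thresholding, the proximal operator of t * ||.||_1."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)


def lasso_ista(A, y, lam, n_iter=500):
    """Approximately minimize 0.5 * ||y - A x||_2^2 + lam * ||x||_1 with ISTA."""
    # Step size 1/L, where L is the largest eigenvalue of A^T A
    # (the Lipschitz constant of the gradient of the quadratic term).
    step = 1.0 / np.linalg.norm(A, 2) ** 2
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        grad = A.T @ (A @ x - y)
        x = soft_threshold(x - step * grad, step * lam)
    return x


def simulate(N=500, d=3, M=60, noise_std=0.1, lam=None, seed=0):
    """Draw one ultra-sparse instance (assumed setup) and run Lasso on it."""
    rng = np.random.default_rng(seed)
    A = rng.standard_normal((M, N))      # Gaussian sensing matrix (assumed scaling)
    x0 = np.zeros(N)
    x0[:d] = 1.0                         # true support = first d coordinates
    y = A @ x0 + noise_std * rng.standard_normal(M)
    if lam is None:
        # Heuristic choice of the regularization strength, not from the paper.
        lam = 2.0 * noise_std * np.sqrt(2.0 * M * np.log(N))
    x_hat = lasso_ista(A, y, lam)
    return x0, x_hat, A, y, lam


if __name__ == "__main__":
    x0, x_hat, A, y, lam = simulate()
    true_support = set(np.flatnonzero(x0))
    est_support = set(np.flatnonzero(x_hat))
    print("true support:", sorted(true_support))
    print("estimated support:", sorted(est_support))
    print("perfect recovery:", true_support == est_support)
```

Repeating `simulate` over many seeds and over a grid of $M$ values at fixed $d$ would give an empirical recovery rate that could be compared against a sample-complexity bound; the specific values of $N$, $d$, $M$ and the noise level above are arbitrary.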

Cite this Paper

BibTeX


@InProceedings{pmlr-v206-okajima23a,
  title = 	 {Average case analysis of Lasso under ultra sparse conditions},
  author =       {Okajima, Koki and Meng, Xiangming and Takahashi, Takashi and Kabashima, Yoshiyuki},
  booktitle = 	 {Proceedings of The 26th International Conference on Artificial Intelligence and Statistics},
  pages = 	 {11317--11330},
  year = 	 {2023},
  editor = 	 {Ruiz, Francisco and Dy, Jennifer and van de Meent, Jan-Willem},
  volume = 	 {206},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {25--27 Apr},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v206/okajima23a/okajima23a.pdf},
  url = 	 {https://proceedings.mlr.press/v206/okajima23a.html},
  abstract = 	 {We analyze the performance of the least absolute shrinkage and selection operator (Lasso) for the linear model when the number of regressors $N$ grows larger keeping the true support size $d$ finite, i.e., the ultra-sparse case. The result is based on a novel treatment of the non-rigorous replica method in statistical physics, which has been applied only to problem settings where $N$, $d$ and the number of observations $M$ tend to infinity at the same rate. Our analysis makes it possible to assess the average performance of Lasso with Gaussian sensing matrices without assumptions on the scaling of $N$ and $M$, the noise distribution, and the profile of the true signal. Under mild conditions on the noise distribution, the analysis also offers a lower bound on the sample complexity necessary for partial and perfect support recovery when $M$ diverges as $M = O(\log N)$. The obtained bound for perfect support recovery is a generalization of that given in previous literature, which only considers the case of Gaussian noise and diverging $d$. Extensive numerical experiments strongly support our analysis.}
}

Endnote

%0 Conference Paper
%T Average case analysis of Lasso under ultra sparse conditions
%A Koki Okajima
%A Xiangming Meng
%A Takashi Takahashi
%A Yoshiyuki Kabashima
%B Proceedings of The 26th International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2023
%E Francisco Ruiz
%E Jennifer Dy
%E Jan-Willem van de Meent
%F pmlr-v206-okajima23a
%I PMLR
%P 11317--11330
%U https://proceedings.mlr.press/v206/okajima23a.html
%V 206
%X We analyze the performance of the least absolute shrinkage and selection operator (Lasso) for the linear model when the number of regressors $N$ grows larger keeping the true support size $d$ finite, i.e., the ultra-sparse case. The result is based on a novel treatment of the non-rigorous replica method in statistical physics, which has been applied only to problem settings where $N$, $d$ and the number of observations $M$ tend to infinity at the same rate. Our analysis makes it possible to assess the average performance of Lasso with Gaussian sensing matrices without assumptions on the scaling of $N$ and $M$, the noise distribution, and the profile of the true signal. Under mild conditions on the noise distribution, the analysis also offers a lower bound on the sample complexity necessary for partial and perfect support recovery when $M$ diverges as $M = O(\log N)$. The obtained bound for perfect support recovery is a generalization of that given in previous literature, which only considers the case of Gaussian noise and diverging $d$. Extensive numerical experiments strongly support our analysis.

APA


Okajima, K., Meng, X., Takahashi, T., & Kabashima, Y. (2023). Average case analysis of Lasso under ultra sparse conditions. Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 206:11317-11330. Available from https://proceedings.mlr.press/v206/okajima23a.html.
