Low-Rank Spectral Learning

Alex Kulesza; N. Raj Rao; Satinder Singh

Low-Rank Spectral Learning

Alex Kulesza, N. Raj Rao, Satinder Singh

Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, PMLR 33:522-530, 2014.

Abstract

Spectral learning methods have recently been proposed as alternatives to slow, non-convex optimization algorithms like EM for a variety of probabilistic models in which hidden information must be inferred by the learner. These methods are typically controlled by a rank hyperparameter that sets the complexity of the model; when the model rank matches the true rank of the process generating the data, the resulting predictions are provably consistent and admit finite sample convergence bounds. However, in practice we usually do not know the true rank, and, in any event, from a computational and statistical standpoint it is likely to be prohibitively large. It is therefore of great practical interest to understand the behavior of low-rank spectral learning, where the model rank is less than the true rank. Counterintuitively, we show that even when the singular values omitted by lowering the rank are arbitrarily small, the resulting prediction errors can in fact be arbitrarily large. We identify two distinct possible causes for this bad behavior, and illustrate them with simple examples. We then show that these two causes are essentially complete: assuming that they do not occur, we can prove that the prediction error is bounded in terms of the magnitudes of the omitted singular values. We argue that the assumptions necessary for this result are relatively realistic, making low-rank spectral learning a viable option for many applications.

Cite this Paper

BibTeX


@InProceedings{pmlr-v33-kulesza14,
  title = 	 {{Low-Rank Spectral Learning}},
  author = 	 {Kulesza, Alex and Rao, N. Raj and Singh, Satinder},
  booktitle = 	 {Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics},
  pages = 	 {522--530},
  year = 	 {2014},
  editor = 	 {Kaski, Samuel and Corander, Jukka},
  volume = 	 {33},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Reykjavik, Iceland},
  month = 	 {22--25 Apr},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v33/kulesza14.pdf},
  url = 	 {https://proceedings.mlr.press/v33/kulesza14.html},
  abstract = 	 {Spectral learning methods have recently been proposed as  alternatives to slow, non-convex optimization algorithms like EM for  a variety of probabilistic models in which hidden information must  be inferred by the learner.  These methods are typically controlled  by a rank hyperparameter that sets the complexity of the model; when  the model rank matches the true rank of the process generating the  data, the resulting predictions are provably consistent and admit  finite sample convergence bounds.  However, in practice we usually  do not know the true rank, and, in any event, from a computational  and statistical standpoint it is likely to be prohibitively large.  It is therefore of great practical interest to understand the  behavior of low-rank spectral learning, where the model rank is   less than the true rank.  Counterintuitively, we show that even   when the singular values omitted by lowering the rank are   arbitrarily small, the resulting prediction errors can in fact be  arbitrarily large.  We identify two distinct possible causes for  this bad behavior, and illustrate them with simple examples.  We  then show that these two causes are essentially complete: assuming  that they do not occur, we can prove that the prediction error is  bounded in terms of the magnitudes of the omitted singular values.  We argue that the assumptions necessary for this result are  relatively realistic, making low-rank spectral learning a viable  option for many applications.}
}

Endnote

%0 Conference Paper
%T Low-Rank Spectral Learning
%A Alex Kulesza
%A N. Raj Rao
%A Satinder Singh
%B Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2014
%E Samuel Kaski
%E Jukka Corander	
%F pmlr-v33-kulesza14
%I PMLR
%P 522--530
%U https://proceedings.mlr.press/v33/kulesza14.html
%V 33
%X Spectral learning methods have recently been proposed as  alternatives to slow, non-convex optimization algorithms like EM for  a variety of probabilistic models in which hidden information must  be inferred by the learner.  These methods are typically controlled  by a rank hyperparameter that sets the complexity of the model; when  the model rank matches the true rank of the process generating the  data, the resulting predictions are provably consistent and admit  finite sample convergence bounds.  However, in practice we usually  do not know the true rank, and, in any event, from a computational  and statistical standpoint it is likely to be prohibitively large.  It is therefore of great practical interest to understand the  behavior of low-rank spectral learning, where the model rank is   less than the true rank.  Counterintuitively, we show that even   when the singular values omitted by lowering the rank are   arbitrarily small, the resulting prediction errors can in fact be  arbitrarily large.  We identify two distinct possible causes for  this bad behavior, and illustrate them with simple examples.  We  then show that these two causes are essentially complete: assuming  that they do not occur, we can prove that the prediction error is  bounded in terms of the magnitudes of the omitted singular values.  We argue that the assumptions necessary for this result are  relatively realistic, making low-rank spectral learning a viable  option for many applications.

RIS


TY  - CPAPER
TI  - Low-Rank Spectral Learning
AU  - Alex Kulesza
AU  - N. Raj Rao
AU  - Satinder Singh
BT  - Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics
DA  - 2014/04/02
ED  - Samuel Kaski
ED  - Jukka Corander	
ID  - pmlr-v33-kulesza14
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 33
SP  - 522
EP  - 530
L1  - http://proceedings.mlr.press/v33/kulesza14.pdf
UR  - https://proceedings.mlr.press/v33/kulesza14.html
AB  - Spectral learning methods have recently been proposed as  alternatives to slow, non-convex optimization algorithms like EM for  a variety of probabilistic models in which hidden information must  be inferred by the learner.  These methods are typically controlled  by a rank hyperparameter that sets the complexity of the model; when  the model rank matches the true rank of the process generating the  data, the resulting predictions are provably consistent and admit  finite sample convergence bounds.  However, in practice we usually  do not know the true rank, and, in any event, from a computational  and statistical standpoint it is likely to be prohibitively large.  It is therefore of great practical interest to understand the  behavior of low-rank spectral learning, where the model rank is   less than the true rank.  Counterintuitively, we show that even   when the singular values omitted by lowering the rank are   arbitrarily small, the resulting prediction errors can in fact be  arbitrarily large.  We identify two distinct possible causes for  this bad behavior, and illustrate them with simple examples.  We  then show that these two causes are essentially complete: assuming  that they do not occur, we can prove that the prediction error is  bounded in terms of the magnitudes of the omitted singular values.  We argue that the assumptions necessary for this result are  relatively realistic, making low-rank spectral learning a viable  option for many applications.
ER  -

APA


Kulesza, A., Rao, N.R. & Singh, S.. (2014). Low-Rank Spectral Learning. Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 33:522-530 Available from https://proceedings.mlr.press/v33/kulesza14.html.

Related Material

Download PDF