Random Projections as Regularizers: Learning a Linear Discriminant Ensemble from Fewer Observations than Dimensions

Robert Durrant; Ata Kaban

Proceedings of the 5th Asian Conference on Machine Learning, PMLR 29:17-32, 2013.

Abstract

We examine the performance of an ensemble of randomly projected Fisher Linear Discriminant (FLD) classifiers, focusing on the case where there are fewer training observations than data dimensions. Our ensemble is learned from a sequence of randomly projected representations of the original high-dimensional data, so with this approach the data can be collected, stored and processed in this compressed form. The specific form and simplicity of this ensemble permit a direct and much more detailed analysis than the generic tools used in previous work. In particular, we are able to derive the exact form of the generalization error of our ensemble, conditional on the training set, and based on this we give theoretical guarantees that directly link the performance of the ensemble to that of the corresponding linear discriminant learned in the full data space. To the best of our knowledge, these are the first theoretical results to prove such an explicit link for any classifier and classifier ensemble pair. Furthermore, we show that the randomly projected ensemble is equivalent to applying a sophisticated regularization scheme to the linear discriminant learned in the original data space, and that this prevents overfitting in conditions of small sample size, where pseudo-inverse FLD learned in the data space is provably poor.
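
The ensemble described in the abstract can be illustrated with a short sketch. The abstract does not give the construction details, so the following are assumptions made only for illustration: two classes labelled 0 and 1, Gaussian random projection matrices, a pooled within-class covariance estimate, and an ensemble decision formed by averaging the members' discriminant functions. The function names and parameters are hypothetical and are not taken from the paper.

```python
import numpy as np


def fit_rp_fld_ensemble(X, y, k, n_members, rng):
    """Fit FLD in each of n_members random k-dimensional projections.

    Assumptions (not from the abstract): labels are 0/1, projections are
    Gaussian, and k is smaller than the rank of the covariance estimate
    so that each projected covariance matrix is invertible.
    Returns a weight vector w and offset b in the original data space.
    """
    mu0 = X[y == 0].mean(axis=0)
    mu1 = X[y == 1].mean(axis=0)
    # Centre each class on its own mean to form the pooled estimate.
    Xc = np.vstack([X[y == 0] - mu0, X[y == 1] - mu1])
    d = X.shape[1]
    w = np.zeros(d)
    for _ in range(n_members):
        R = rng.standard_normal((k, d))      # random projection, k x d
        Z = Xc @ R.T                         # centred data after projection
        S = Z.T @ Z / (len(X) - 2)           # k x k projected covariance
        # FLD direction in the projected space, mapped back to d dimensions.
        w += R.T @ np.linalg.solve(S, R @ (mu1 - mu0))
    w /= n_members                           # average the member discriminants
    b = -w @ (mu0 + mu1) / 2                 # threshold at the class midpoint
    return w, b


def predict_rp_fld(w, b, X):
    """Assign class 1 where the averaged discriminant is positive."""
    return (X @ w + b > 0).astype(int)
```

Because every member is linear in the input, the averaged ensemble collapses to a single linear discriminant `w` in the original data space, even though each member only ever inverts a `k x k` matrix. With, say, 40 training points in 200 dimensions, choosing `k = 10` keeps each projected covariance estimate invertible, whereas the full 200 x 200 estimate is singular.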

Cite this Paper

BibTeX


@InProceedings{pmlr-v29-Durrant13,
  title = 	 {Random Projections as Regularizers: Learning a Linear Discriminant Ensemble from Fewer Observations than Dimensions},
  author = 	 {Durrant, Robert and Kaban, Ata},
  booktitle = 	 {Proceedings of the 5th Asian Conference on Machine Learning},
  pages = 	 {17--32},
  year = 	 {2013},
  editor = 	 {Ong, Cheng Soon and Ho, Tu Bao},
  volume = 	 {29},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Australian National University, Canberra, Australia},
  month = 	 {13--15 Nov},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v29/Durrant13.pdf},
  url = 	 {https://proceedings.mlr.press/v29/Durrant13.html},
}

Endnote

%0 Conference Paper
%T Random Projections as Regularizers: Learning a Linear Discriminant Ensemble from Fewer Observations than Dimensions
%A Robert Durrant
%A Ata Kaban
%B Proceedings of the 5th Asian Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2013
%E Cheng Soon Ong
%E Tu Bao Ho
%F pmlr-v29-Durrant13
%I PMLR
%P 17--32
%U https://proceedings.mlr.press/v29/Durrant13.html
%V 29

RIS


TY  - CPAPER
TI  - Random Projections as Regularizers: Learning a Linear Discriminant Ensemble from Fewer Observations than Dimensions
AU  - Robert Durrant
AU  - Ata Kaban
BT  - Proceedings of the 5th Asian Conference on Machine Learning
DA  - 2013/10/21
ED  - Cheng Soon Ong
ED  - Tu Bao Ho
ID  - pmlr-v29-Durrant13
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 29
SP  - 17
EP  - 32
L1  - http://proceedings.mlr.press/v29/Durrant13.pdf
UR  - https://proceedings.mlr.press/v29/Durrant13.html
ER  -

APA


Durrant, R. & Kaban, A. (2013). Random Projections as Regularizers: Learning a Linear Discriminant Ensemble from Fewer Observations than Dimensions. Proceedings of the 5th Asian Conference on Machine Learning, in Proceedings of Machine Learning Research 29:17-32. Available from https://proceedings.mlr.press/v29/Durrant13.html.
