An Information Geometry Approach for Distance Metric Learning

Shijun Wang; Rong Jin

An Information Geometry Approach for Distance Metric Learning

Shijun Wang, Rong Jin

Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, PMLR 5:591-598, 2009.

Abstract

Metric learning is an important problem in machine learning and pattern recognition. In this paper, we propose a framework for metric learning based on information geometry. The key idea is to construct two kernel matrices for the given training data: one is based on the distance metric and the other is based on the assigned class labels. Inspired by the idea of information geometry, we relate these two kernel matrices to two Gaussian distributions, and the difference between the two kernel matrices is then computed by the Kullback-Leibler (KL) divergence between the two Gaussian distributions. The optimal distance metric is then found by minimizing the divergence between the two distributions. Based on this idea, we present two metric learning algorithms, one for linear distance metric and the other for nonlinear distance with the introduction of a kernel function. Unlike many existing algorithms for metric learning that require solving a non-trivial optimization problem and therefore are computationally expensive when the data dimension is high, the proposed algorithms have a closed-form solution and are computationally more efficient. Extensive experiments with data classification and face recognition show that the proposed algorithms are comparable to or better than the state-of-the-art algorithms for metric learning.

Cite this Paper

BibTeX


@InProceedings{pmlr-v5-wang09c,
  title = 	 {An Information Geometry Approach for Distance Metric Learning},
  author = 	 {Wang, Shijun and Jin, Rong},
  booktitle = 	 {Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics},
  pages = 	 {591--598},
  year = 	 {2009},
  editor = 	 {van Dyk, David and Welling, Max},
  volume = 	 {5},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Hilton Clearwater Beach Resort, Clearwater Beach, Florida USA},
  month = 	 {16--18 Apr},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v5/wang09c/wang09c.pdf},
  url = 	 {https://proceedings.mlr.press/v5/wang09c.html},
  abstract = 	 {Metric learning is an important problem in machine learning and  pattern recognition. In this paper, we propose a framework for  metric learning based on information geometry. The key idea is to  construct two kernel matrices for the given training data: one is  based on the distance metric and the other is based on the assigned  class labels. Inspired by the idea of information geometry, we  relate these two kernel matrices to two Gaussian distributions, and  the difference between the two kernel matrices is then computed by  the Kullback-Leibler (KL) divergence between the two Gaussian  distributions. The optimal distance metric is then found by  minimizing the divergence between the two distributions. Based on  this idea, we present two metric learning algorithms, one for linear  distance metric and the other for nonlinear distance with the  introduction of a kernel function. Unlike many existing algorithms  for metric learning that require solving a non-trivial optimization  problem and therefore are computationally expensive when the data  dimension is high, the proposed algorithms have a closed-form  solution and are computationally more efficient. Extensive  experiments with data classification and face recognition show that  the proposed algorithms are comparable to or better than the  state-of-the-art algorithms for metric learning.}
}

Endnote

%0 Conference Paper
%T An Information Geometry Approach for Distance Metric Learning
%A Shijun Wang
%A Rong Jin
%B Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2009
%E David van Dyk
%E Max Welling	
%F pmlr-v5-wang09c
%I PMLR
%P 591--598
%U https://proceedings.mlr.press/v5/wang09c.html
%V 5
%X Metric learning is an important problem in machine learning and  pattern recognition. In this paper, we propose a framework for  metric learning based on information geometry. The key idea is to  construct two kernel matrices for the given training data: one is  based on the distance metric and the other is based on the assigned  class labels. Inspired by the idea of information geometry, we  relate these two kernel matrices to two Gaussian distributions, and  the difference between the two kernel matrices is then computed by  the Kullback-Leibler (KL) divergence between the two Gaussian  distributions. The optimal distance metric is then found by  minimizing the divergence between the two distributions. Based on  this idea, we present two metric learning algorithms, one for linear  distance metric and the other for nonlinear distance with the  introduction of a kernel function. Unlike many existing algorithms  for metric learning that require solving a non-trivial optimization  problem and therefore are computationally expensive when the data  dimension is high, the proposed algorithms have a closed-form  solution and are computationally more efficient. Extensive  experiments with data classification and face recognition show that  the proposed algorithms are comparable to or better than the  state-of-the-art algorithms for metric learning.

RIS


TY  - CPAPER
TI  - An Information Geometry Approach for Distance Metric Learning
AU  - Shijun Wang
AU  - Rong Jin
BT  - Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics
DA  - 2009/04/15
ED  - David van Dyk
ED  - Max Welling	
ID  - pmlr-v5-wang09c
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 5
SP  - 591
EP  - 598
L1  - http://proceedings.mlr.press/v5/wang09c/wang09c.pdf
UR  - https://proceedings.mlr.press/v5/wang09c.html
AB  - Metric learning is an important problem in machine learning and  pattern recognition. In this paper, we propose a framework for  metric learning based on information geometry. The key idea is to  construct two kernel matrices for the given training data: one is  based on the distance metric and the other is based on the assigned  class labels. Inspired by the idea of information geometry, we  relate these two kernel matrices to two Gaussian distributions, and  the difference between the two kernel matrices is then computed by  the Kullback-Leibler (KL) divergence between the two Gaussian  distributions. The optimal distance metric is then found by  minimizing the divergence between the two distributions. Based on  this idea, we present two metric learning algorithms, one for linear  distance metric and the other for nonlinear distance with the  introduction of a kernel function. Unlike many existing algorithms  for metric learning that require solving a non-trivial optimization  problem and therefore are computationally expensive when the data  dimension is high, the proposed algorithms have a closed-form  solution and are computationally more efficient. Extensive  experiments with data classification and face recognition show that  the proposed algorithms are comparable to or better than the  state-of-the-art algorithms for metric learning.
ER  -

APA


Wang, S. & Jin, R.. (2009). An Information Geometry Approach for Distance Metric Learning. Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 5:591-598 Available from https://proceedings.mlr.press/v5/wang09c.html.

Related Material

Download PDF