A New Robust Partial p-Wasserstein-Based Metric for Comparing Distributions

Sharath Raghvendra; Pouyan Shirzadian; Kaiyi Zhang

A New Robust Partial p-Wasserstein-Based Metric for Comparing Distributions

Sharath Raghvendra, Pouyan Shirzadian, Kaiyi Zhang

Proceedings of the 41st International Conference on Machine Learning, PMLR 235:41867-41885, 2024.

Abstract

The

$2$ -Wasserstein distance is sensitive to minor geometric differences between distributions, making it a very powerful dissimilarity metric. However, due to this sensitivity, a small outlier mass can also cause a significant increase in the

$2$ -Wasserstein distance between two similar distributions. Similarly, sampling discrepancy can cause the empirical

$2$ -Wasserstein distance on

$n$ samples in

$\mathbb{R}^2$ to converge to the true distance at a rate of

$n^{-1/4}$ , which is significantly slower than the rate of

$n^{-1/2}$ for

$1$ -Wasserstein distance. We introduce a new family of distances parameterized by

$k \ge 0$ , called

$k$ -RPW that is based on computing the partial

$2$ -Wasserstein distance. We show that (1)

$k$ -RPW satisfies the metric properties, (2)

$k$ -RPW is robust to small outlier mass while retaining the sensitivity of

$2$ -Wasserstein distance to minor geometric differences, and (3) when

$k$ is a constant,

$k$ -RPW distance between empirical distributions on

$n$ samples in

$\mathbb{R}^2$ converges to the true distance at a rate of

$n^{-1/3}$ , which is faster than the convergence rate of

$n^{-1/4}$ for the

$2$ -Wasserstein distance. Using the partial

$p$ -Wasserstein distance, we extend our distance to any

$p \in [1,\infty]$ . By setting parameters

$k$ or

$p$ appropriately, we can reduce our distance to the total variation,

$p$ -Wasserstein, and the Lévy-Prokhorov distances. Experiments show that our distance function achieves higher accuracy in comparison to the

$1$ -Wasserstein,

$2$ -Wasserstein, and TV distances for image retrieval tasks on noisy real-world data sets.

Cite this Paper

BibTeX


@InProceedings{pmlr-v235-raghvendra24a,
  title = 	 {A New Robust Partial p-{W}asserstein-Based Metric for Comparing Distributions},
  author =       {Raghvendra, Sharath and Shirzadian, Pouyan and Zhang, Kaiyi},
  booktitle = 	 {Proceedings of the 41st International Conference on Machine Learning},
  pages = 	 {41867--41885},
  year = 	 {2024},
  editor = 	 {Salakhutdinov, Ruslan and Kolter, Zico and Heller, Katherine and Weller, Adrian and Oliver, Nuria and Scarlett, Jonathan and Berkenkamp, Felix},
  volume = 	 {235},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {21--27 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v235/main/assets/raghvendra24a/raghvendra24a.pdf},
  url = 	 {https://proceedings.mlr.press/v235/raghvendra24a.html},
  abstract = 	 {The $2$-Wasserstein distance is sensitive to minor geometric differences between distributions, making it a very powerful dissimilarity metric. However, due to this sensitivity, a small outlier mass can also cause a significant increase in the $2$-Wasserstein distance between two similar distributions. Similarly, sampling discrepancy can cause the empirical $2$-Wasserstein distance on $n$ samples in $\mathbb{R}^2$ to converge to the true distance at a rate of $n^{-1/4}$, which is significantly slower than the rate of $n^{-1/2}$ for $1$-Wasserstein distance. We introduce a new family of distances parameterized by $k \ge 0$, called $k$-RPW that is based on computing the partial $2$-Wasserstein distance. We show that (1) $k$-RPW satisfies the metric properties, (2) $k$-RPW is robust to small outlier mass while retaining the sensitivity of $2$-Wasserstein distance to minor geometric differences, and (3) when $k$ is a constant, $k$-RPW distance between empirical distributions on $n$ samples in $\mathbb{R}^2$ converges to the true distance at a rate of $n^{-1/3}$, which is faster than the convergence rate of $n^{-1/4}$ for the $2$-Wasserstein distance. Using the partial $p$-Wasserstein distance, we extend our distance to any $p \in [1,\infty]$. By setting parameters $k$ or $p$ appropriately, we can reduce our distance to the total variation, $p$-Wasserstein, and the Lévy-Prokhorov distances. Experiments show that our distance function achieves higher accuracy in comparison to the $1$-Wasserstein, $2$-Wasserstein, and TV distances for image retrieval tasks on noisy real-world data sets.}
}

Endnote

%0 Conference Paper
%T A New Robust Partial p-Wasserstein-Based Metric for Comparing Distributions
%A Sharath Raghvendra
%A Pouyan Shirzadian
%A Kaiyi Zhang
%B Proceedings of the 41st International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2024
%E Ruslan Salakhutdinov
%E Zico Kolter
%E Katherine Heller
%E Adrian Weller
%E Nuria Oliver
%E Jonathan Scarlett
%E Felix Berkenkamp	
%F pmlr-v235-raghvendra24a
%I PMLR
%P 41867--41885
%U https://proceedings.mlr.press/v235/raghvendra24a.html
%V 235
%X The $2$-Wasserstein distance is sensitive to minor geometric differences between distributions, making it a very powerful dissimilarity metric. However, due to this sensitivity, a small outlier mass can also cause a significant increase in the $2$-Wasserstein distance between two similar distributions. Similarly, sampling discrepancy can cause the empirical $2$-Wasserstein distance on $n$ samples in $\mathbb{R}^2$ to converge to the true distance at a rate of $n^{-1/4}$, which is significantly slower than the rate of $n^{-1/2}$ for $1$-Wasserstein distance. We introduce a new family of distances parameterized by $k \ge 0$, called $k$-RPW that is based on computing the partial $2$-Wasserstein distance. We show that (1) $k$-RPW satisfies the metric properties, (2) $k$-RPW is robust to small outlier mass while retaining the sensitivity of $2$-Wasserstein distance to minor geometric differences, and (3) when $k$ is a constant, $k$-RPW distance between empirical distributions on $n$ samples in $\mathbb{R}^2$ converges to the true distance at a rate of $n^{-1/3}$, which is faster than the convergence rate of $n^{-1/4}$ for the $2$-Wasserstein distance. Using the partial $p$-Wasserstein distance, we extend our distance to any $p \in [1,\infty]$. By setting parameters $k$ or $p$ appropriately, we can reduce our distance to the total variation, $p$-Wasserstein, and the Lévy-Prokhorov distances. Experiments show that our distance function achieves higher accuracy in comparison to the $1$-Wasserstein, $2$-Wasserstein, and TV distances for image retrieval tasks on noisy real-world data sets.

APA


Raghvendra, S., Shirzadian, P. & Zhang, K.. (2024). A New Robust Partial p-Wasserstein-Based Metric for Comparing Distributions. Proceedings of the 41st International Conference on Machine Learning, in Proceedings of Machine Learning Research 235:41867-41885 Available from https://proceedings.mlr.press/v235/raghvendra24a.html.

A New Robust Partial p-Wasserstein-Based Metric for Comparing Distributions

Abstract

Cite this Paper

Related Material