Fair k-Centers via Maximum Matching

Matthew Jones; Huy Nguyen; Thy Nguyen

Fair k-Centers via Maximum Matching

Matthew Jones, Huy Nguyen, Thy Nguyen

Proceedings of the 37th International Conference on Machine Learning, PMLR 119:4940-4949, 2020.

Abstract

The field of algorithms has seen a push for fairness, or the removal of inherent bias, in recent history. In data summarization, where a much smaller subset of a data set is chosen to represent the whole of the data, fairness can be introduced by guaranteeing each "demographic group" a specific portion of the representative subset. Specifically, this paper examines this fair variant of the k-centers problem, where a subset of the data with cardinality k is chosen to minimize distance to the rest of the data. Previous papers working on this problem presented both a 3-approximation algorithm with a super-linear runtime and a linear-time algorithm whose approximation factor is exponential in the number of demographic groups. This paper combines the best of each algorithm by presenting a linear-time algorithm with a guaranteed 3-approximation factor and provides empirical evidence of both the algorithm’s runtime and effectiveness.

Cite this Paper

BibTeX

@InProceedings{pmlr-v119-jones20a,
  title = 	 {Fair k-Centers via Maximum Matching},
  author =       {Jones, Matthew and Nguyen, Huy and Nguyen, Thy},
  booktitle = 	 {Proceedings of the 37th International Conference on Machine Learning},
  pages = 	 {4940--4949},
  year = 	 {2020},
  editor = 	 {III, Hal Daumé and Singh, Aarti},
  volume = 	 {119},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {13--18 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v119/jones20a/jones20a.pdf},
  url = 	 {https://proceedings.mlr.press/v119/jones20a.html},
  abstract = 	 {The field of algorithms has seen a push for fairness, or the removal of inherent bias, in recent history. In data summarization, where a much smaller subset of a data set is chosen to represent the whole of the data, fairness can be introduced by guaranteeing each "demographic group" a specific portion of the representative subset. Specifically, this paper examines this fair variant of the k-centers problem, where a subset of the data with cardinality k is chosen to minimize distance to the rest of the data. Previous papers working on this problem presented both a 3-approximation algorithm with a super-linear runtime and a linear-time algorithm whose approximation factor is exponential in the number of demographic groups. This paper combines the best of each algorithm by presenting a linear-time algorithm with a guaranteed 3-approximation factor and provides empirical evidence of both the algorithm’s runtime and effectiveness.}
}

Endnote

%0 Conference Paper
%T Fair k-Centers via Maximum Matching
%A Matthew Jones
%A Huy Nguyen
%A Thy Nguyen
%B Proceedings of the 37th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2020
%E Hal Daumé III
%E Aarti Singh	
%F pmlr-v119-jones20a
%I PMLR
%P 4940--4949
%U https://proceedings.mlr.press/v119/jones20a.html
%V 119
%X The field of algorithms has seen a push for fairness, or the removal of inherent bias, in recent history. In data summarization, where a much smaller subset of a data set is chosen to represent the whole of the data, fairness can be introduced by guaranteeing each "demographic group" a specific portion of the representative subset. Specifically, this paper examines this fair variant of the k-centers problem, where a subset of the data with cardinality k is chosen to minimize distance to the rest of the data. Previous papers working on this problem presented both a 3-approximation algorithm with a super-linear runtime and a linear-time algorithm whose approximation factor is exponential in the number of demographic groups. This paper combines the best of each algorithm by presenting a linear-time algorithm with a guaranteed 3-approximation factor and provides empirical evidence of both the algorithm’s runtime and effectiveness.

APA

Jones, M., Nguyen, H. & Nguyen, T.. (2020). Fair k-Centers via Maximum Matching. Proceedings of the 37th International Conference on Machine Learning, in Proceedings of Machine Learning Research 119:4940-4949 Available from https://proceedings.mlr.press/v119/jones20a.html.

Fair k-Centers via Maximum Matching

Abstract

Cite this Paper

Related Material