Fast k-means with accurate bounds

James Newling; Francois Fleuret

Fast k-means with accurate bounds

James Newling, Francois Fleuret

Proceedings of The 33rd International Conference on Machine Learning, PMLR 48:936-944, 2016.

Abstract

We propose a novel accelerated exact k-means algorithm, which outperforms the current state-of-the-art low-dimensional algorithm in 18 of 22 experiments, running up to 3 times faster. We also propose a general improvement of existing state-of-the-art accelerated exact k-means algorithms through better estimates of the distance bounds used to reduce the number of distance calculations, obtaining speedups in 36 of 44 experiments, of up to 1.8 times. We have conducted experiments with our own implementations of existing methods to ensure homogeneous evaluation of performance, and we show that our implementations perform as well or better than existing available implementations. Finally, we propose simplified variants of standard approaches and show that they are faster than their fully-fledged counterparts in 59 of 62 experiments.

Cite this Paper

BibTeX


@InProceedings{pmlr-v48-newling16,
  title = 	 {Fast k-means with accurate bounds},
  author = 	 {Newling, James and Fleuret, Francois},
  booktitle = 	 {Proceedings of The 33rd International Conference on Machine Learning},
  pages = 	 {936--944},
  year = 	 {2016},
  editor = 	 {Balcan, Maria Florina and Weinberger, Kilian Q.},
  volume = 	 {48},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {New York, New York, USA},
  month = 	 {20--22 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v48/newling16.pdf},
  url = 	 {https://proceedings.mlr.press/v48/newling16.html},
  abstract = 	 {We propose a novel accelerated exact k-means algorithm, which outperforms the current state-of-the-art low-dimensional algorithm in 18 of 22 experiments, running up to 3 times faster. We also propose a general improvement of existing state-of-the-art accelerated exact k-means algorithms through better estimates of the distance bounds used to reduce the number of distance calculations, obtaining speedups in 36 of 44 experiments, of up to 1.8 times. We have conducted experiments with our own implementations of existing methods to ensure homogeneous evaluation of performance, and we show that our implementations perform as well or better than existing available implementations. Finally, we propose simplified variants of standard approaches and show that they are faster than their fully-fledged counterparts in 59 of 62 experiments.}
}

Endnote

%0 Conference Paper
%T Fast k-means with accurate bounds
%A James Newling
%A Francois Fleuret
%B Proceedings of The 33rd International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2016
%E Maria Florina Balcan
%E Kilian Q. Weinberger	
%F pmlr-v48-newling16
%I PMLR
%P 936--944
%U https://proceedings.mlr.press/v48/newling16.html
%V 48
%X We propose a novel accelerated exact k-means algorithm, which outperforms the current state-of-the-art low-dimensional algorithm in 18 of 22 experiments, running up to 3 times faster. We also propose a general improvement of existing state-of-the-art accelerated exact k-means algorithms through better estimates of the distance bounds used to reduce the number of distance calculations, obtaining speedups in 36 of 44 experiments, of up to 1.8 times. We have conducted experiments with our own implementations of existing methods to ensure homogeneous evaluation of performance, and we show that our implementations perform as well or better than existing available implementations. Finally, we propose simplified variants of standard approaches and show that they are faster than their fully-fledged counterparts in 59 of 62 experiments.

RIS


TY  - CPAPER
TI  - Fast k-means with accurate bounds
AU  - James Newling
AU  - Francois Fleuret
BT  - Proceedings of The 33rd International Conference on Machine Learning
DA  - 2016/06/11
ED  - Maria Florina Balcan
ED  - Kilian Q. Weinberger	
ID  - pmlr-v48-newling16
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 48
SP  - 936
EP  - 944
L1  - http://proceedings.mlr.press/v48/newling16.pdf
UR  - https://proceedings.mlr.press/v48/newling16.html
AB  - We propose a novel accelerated exact k-means algorithm, which outperforms the current state-of-the-art low-dimensional algorithm in 18 of 22 experiments, running up to 3 times faster. We also propose a general improvement of existing state-of-the-art accelerated exact k-means algorithms through better estimates of the distance bounds used to reduce the number of distance calculations, obtaining speedups in 36 of 44 experiments, of up to 1.8 times. We have conducted experiments with our own implementations of existing methods to ensure homogeneous evaluation of performance, and we show that our implementations perform as well or better than existing available implementations. Finally, we propose simplified variants of standard approaches and show that they are faster than their fully-fledged counterparts in 59 of 62 experiments.
ER  -

APA


Newling, J. & Fleuret, F.. (2016). Fast k-means with accurate bounds. Proceedings of The 33rd International Conference on Machine Learning, in Proceedings of Machine Learning Research 48:936-944 Available from https://proceedings.mlr.press/v48/newling16.html.

Fast k-means with accurate bounds

Abstract

Cite this Paper

Related Material