Bipartite Correlation Clustering: Maximizing Agreements

Megasthenis Asteris; Anastasios Kyrillidis; Dimitris Papailiopoulos; Alexandros Dimakis

Bipartite Correlation Clustering: Maximizing Agreements

Megasthenis Asteris, Anastasios Kyrillidis, Dimitris Papailiopoulos, Alexandros Dimakis

Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, PMLR 51:121-129, 2016.

Abstract

In Bipartite Correlation Clustering (BCC) we are given a complete bipartite graph G with ’+’ and ’-’ edges, and we seek a vertex clustering that maximizes the number of agreements: the number of all ’+’ edges within clusters plus all ’-’ edges cut across clusters. BCC is known to be NP-hard [5]. We present a novel approximation algorithm for k-BCC, a variant of BCC with an upper bound k on the number of clusters. Our algorithm outputs a k-clustering that provably achieves a number of agreements within a multiplicative (1-δ)-factor from the optimal, for any desired accuracy δ. It relies on solving a combinatorially constrained bilinear maximization on the bi-adjacency matrix of G. It runs in time exponential in k and 1/δ, but linear in the size of the input. Further, we show that, in the (unconstrained) BCC setting, an (1-δ)-approximation can be achieved by O(1/δ) clusters regardless of the size of the graph. In turn, our k-BCC algorithm implies an Efficient PTAS for the BCC objective of maximizing agreements.

Cite this Paper

BibTeX


@InProceedings{pmlr-v51-asteris16,
  title = 	 {Bipartite Correlation Clustering: Maximizing Agreements},
  author = 	 {Asteris, Megasthenis and Kyrillidis, Anastasios and Papailiopoulos, Dimitris and Dimakis, Alexandros},
  booktitle = 	 {Proceedings of the 19th International Conference on Artificial Intelligence and Statistics},
  pages = 	 {121--129},
  year = 	 {2016},
  editor = 	 {Gretton, Arthur and Robert, Christian C.},
  volume = 	 {51},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Cadiz, Spain},
  month = 	 {09--11 May},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v51/asteris16.pdf},
  url = 	 {https://proceedings.mlr.press/v51/asteris16.html},
  abstract = 	 {In Bipartite Correlation Clustering (BCC)  we are given a complete bipartite graph G with ’+’ and ’-’ edges, and we seek a vertex clustering that maximizes the number of agreements: the number of all ’+’ edges within clusters plus all ’-’ edges cut across clusters. BCC is known to be NP-hard [5].  We present a novel approximation algorithm for k-BCC, a variant of BCC with an upper bound k on the number of clusters. Our algorithm outputs a k-clustering that provably achieves a number of agreements within a multiplicative (1-δ)-factor from the optimal, for any desired accuracy δ. It relies on solving a combinatorially constrained bilinear maximization on the bi-adjacency matrix of G. It runs in time exponential in k and 1/δ, but linear in the size of the input.  Further, we show that, in the (unconstrained) BCC setting, an (1-δ)-approximation can be achieved by O(1/δ) clusters regardless of the size of the graph. In turn, our k-BCC algorithm implies an Efficient PTAS for the BCC objective of maximizing agreements.}
}

Endnote

%0 Conference Paper
%T Bipartite Correlation Clustering: Maximizing Agreements
%A Megasthenis Asteris
%A Anastasios Kyrillidis
%A Dimitris Papailiopoulos
%A Alexandros Dimakis
%B Proceedings of the 19th International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2016
%E Arthur Gretton
%E Christian C. Robert	
%F pmlr-v51-asteris16
%I PMLR
%P 121--129
%U https://proceedings.mlr.press/v51/asteris16.html
%V 51
%X In Bipartite Correlation Clustering (BCC)  we are given a complete bipartite graph G with ’+’ and ’-’ edges, and we seek a vertex clustering that maximizes the number of agreements: the number of all ’+’ edges within clusters plus all ’-’ edges cut across clusters. BCC is known to be NP-hard [5].  We present a novel approximation algorithm for k-BCC, a variant of BCC with an upper bound k on the number of clusters. Our algorithm outputs a k-clustering that provably achieves a number of agreements within a multiplicative (1-δ)-factor from the optimal, for any desired accuracy δ. It relies on solving a combinatorially constrained bilinear maximization on the bi-adjacency matrix of G. It runs in time exponential in k and 1/δ, but linear in the size of the input.  Further, we show that, in the (unconstrained) BCC setting, an (1-δ)-approximation can be achieved by O(1/δ) clusters regardless of the size of the graph. In turn, our k-BCC algorithm implies an Efficient PTAS for the BCC objective of maximizing agreements.

RIS


TY  - CPAPER
TI  - Bipartite Correlation Clustering: Maximizing Agreements
AU  - Megasthenis Asteris
AU  - Anastasios Kyrillidis
AU  - Dimitris Papailiopoulos
AU  - Alexandros Dimakis
BT  - Proceedings of the 19th International Conference on Artificial Intelligence and Statistics
DA  - 2016/05/02
ED  - Arthur Gretton
ED  - Christian C. Robert	
ID  - pmlr-v51-asteris16
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 51
SP  - 121
EP  - 129
L1  - http://proceedings.mlr.press/v51/asteris16.pdf
UR  - https://proceedings.mlr.press/v51/asteris16.html
AB  - In Bipartite Correlation Clustering (BCC)  we are given a complete bipartite graph G with ’+’ and ’-’ edges, and we seek a vertex clustering that maximizes the number of agreements: the number of all ’+’ edges within clusters plus all ’-’ edges cut across clusters. BCC is known to be NP-hard [5].  We present a novel approximation algorithm for k-BCC, a variant of BCC with an upper bound k on the number of clusters. Our algorithm outputs a k-clustering that provably achieves a number of agreements within a multiplicative (1-δ)-factor from the optimal, for any desired accuracy δ. It relies on solving a combinatorially constrained bilinear maximization on the bi-adjacency matrix of G. It runs in time exponential in k and 1/δ, but linear in the size of the input.  Further, we show that, in the (unconstrained) BCC setting, an (1-δ)-approximation can be achieved by O(1/δ) clusters regardless of the size of the graph. In turn, our k-BCC algorithm implies an Efficient PTAS for the BCC objective of maximizing agreements.
ER  -

APA


Asteris, M., Kyrillidis, A., Papailiopoulos, D. & Dimakis, A.. (2016). Bipartite Correlation Clustering: Maximizing Agreements. Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 51:121-129 Available from https://proceedings.mlr.press/v51/asteris16.html.

Bipartite Correlation Clustering: Maximizing Agreements

Abstract

Cite this Paper

Related Material