Why Steiner-tree type algorithms work for community detection

Mung Chiang; Henry Lam; Zhenming Liu; Vincent Poor

Why Steiner-tree type algorithms work for community detection

Mung Chiang, Henry Lam, Zhenming Liu, Vincent Poor

Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics, PMLR 31:187-195, 2013.

Abstract

We consider the problem of reconstructing a specific connected community S ⊂V in a graph G = (V, E), where each node v is associated with a signal whose strength grows with the likelihood that v belongs to S. This problem appears in social or protein interaction network, the latter also referred to as the signaling pathway reconstruction problem. We study this community reconstruction problem under several natural generative models, and make the following two contributions. First, in the context of social networks, where the signals are modeled as bounded-supported random variables, we design an efficient algorithm for recovering most members in S with well-controlled false positive overhead, by utilizing the network structure for a large family of “homogeneous” generative models. This positive result is complemented by an information theoretic lower bound for the case where the network structure is unknown or the network is heterogeneous. Second, we consider the case in which the graph represents the protein interaction network, in which it is customary to consider signals that have unbounded support, we generalize our first contribution to give the first theoretical justification of why existing Steiner-tree type heuristics work well in practice.

Cite this Paper

BibTeX


@InProceedings{pmlr-v31-chiang13a,
  title = 	 {Why Steiner-tree type algorithms work for community detection},
  author = 	 {Chiang, Mung and Lam, Henry and Liu, Zhenming and Poor, Vincent},
  booktitle = 	 {Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics},
  pages = 	 {187--195},
  year = 	 {2013},
  editor = 	 {Carvalho, Carlos M. and Ravikumar, Pradeep},
  volume = 	 {31},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Scottsdale, Arizona, USA},
  month = 	 {29 Apr--01 May},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v31/chiang13a.pdf},
  url = 	 {https://proceedings.mlr.press/v31/chiang13a.html},
  abstract = 	 {We consider the problem of reconstructing a specific connected community S ⊂V in a graph G = (V, E), where each node v is associated with a signal whose strength grows with the likelihood that v belongs to S. This problem appears in social or protein interaction network, the latter also referred to as the signaling pathway reconstruction problem.  We study this community reconstruction problem under several natural generative models, and make the following two contributions. First, in the context of social networks, where the signals are modeled as bounded-supported random variables, we design an efficient algorithm for recovering most members in S with well-controlled false positive overhead, by utilizing the network structure for a large family of “homogeneous” generative models. This positive result is complemented by an information theoretic lower bound for the case where the network structure is unknown or the network is heterogeneous. Second, we consider the case in which the graph represents the protein interaction network, in which it is customary to consider signals that have unbounded support, we generalize our first contribution to give the first theoretical justification of why existing Steiner-tree type heuristics work well in practice.}
}

Endnote

%0 Conference Paper
%T Why Steiner-tree type algorithms work for community detection
%A Mung Chiang
%A Henry Lam
%A Zhenming Liu
%A Vincent Poor
%B Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2013
%E Carlos M. Carvalho
%E Pradeep Ravikumar	
%F pmlr-v31-chiang13a
%I PMLR
%P 187--195
%U https://proceedings.mlr.press/v31/chiang13a.html
%V 31
%X We consider the problem of reconstructing a specific connected community S ⊂V in a graph G = (V, E), where each node v is associated with a signal whose strength grows with the likelihood that v belongs to S. This problem appears in social or protein interaction network, the latter also referred to as the signaling pathway reconstruction problem.  We study this community reconstruction problem under several natural generative models, and make the following two contributions. First, in the context of social networks, where the signals are modeled as bounded-supported random variables, we design an efficient algorithm for recovering most members in S with well-controlled false positive overhead, by utilizing the network structure for a large family of “homogeneous” generative models. This positive result is complemented by an information theoretic lower bound for the case where the network structure is unknown or the network is heterogeneous. Second, we consider the case in which the graph represents the protein interaction network, in which it is customary to consider signals that have unbounded support, we generalize our first contribution to give the first theoretical justification of why existing Steiner-tree type heuristics work well in practice.

RIS


TY  - CPAPER
TI  - Why Steiner-tree type algorithms work for community detection
AU  - Mung Chiang
AU  - Henry Lam
AU  - Zhenming Liu
AU  - Vincent Poor
BT  - Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics
DA  - 2013/04/29
ED  - Carlos M. Carvalho
ED  - Pradeep Ravikumar	
ID  - pmlr-v31-chiang13a
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 31
SP  - 187
EP  - 195
L1  - http://proceedings.mlr.press/v31/chiang13a.pdf
UR  - https://proceedings.mlr.press/v31/chiang13a.html
AB  - We consider the problem of reconstructing a specific connected community S ⊂V in a graph G = (V, E), where each node v is associated with a signal whose strength grows with the likelihood that v belongs to S. This problem appears in social or protein interaction network, the latter also referred to as the signaling pathway reconstruction problem.  We study this community reconstruction problem under several natural generative models, and make the following two contributions. First, in the context of social networks, where the signals are modeled as bounded-supported random variables, we design an efficient algorithm for recovering most members in S with well-controlled false positive overhead, by utilizing the network structure for a large family of “homogeneous” generative models. This positive result is complemented by an information theoretic lower bound for the case where the network structure is unknown or the network is heterogeneous. Second, we consider the case in which the graph represents the protein interaction network, in which it is customary to consider signals that have unbounded support, we generalize our first contribution to give the first theoretical justification of why existing Steiner-tree type heuristics work well in practice.
ER  -

APA


Chiang, M., Lam, H., Liu, Z. & Poor, V.. (2013). Why Steiner-tree type algorithms work for community detection. Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 31:187-195 Available from https://proceedings.mlr.press/v31/chiang13a.html.

Why Steiner-tree type algorithms work for community detection

Abstract

Cite this Paper

Related Material