A Tensor Approach to Learning Mixed Membership Community Models
Abstract
Community detection is the task of detecting hidden communities from observed interactions. Guaranteed community detection has so far been mostly limited to models with non-overlapping communities such as the stochastic block model. In this paper, we remove this restriction, and provide guaranteed community detection for a family of probabilistic network models with overlapping communities, termed as the mixed membership Dirichlet model, first introduced by Airoldi et al. (2008). This model allows for nodes to have fractional memberships in multiple communities and assumes that the community memberships are drawn from a Dirichlet distribution. Moreover, it contains the stochastic block model as a special case. We propose a unified approach to learning these models via a tensor spectral decomposition method. Our estimator is based on low-order moment tensor of the observed network, consisting of 33-star counts. Our learning method is fast and is based on simple linear algebraic operations, e.g., singular value decomposition and tensor power iterations. We provide guaranteed recovery of community memberships and model parameters and present a careful finite sample analysis of our learning method. As an important special case, our results match the best known scaling requirements for the (homogeneous) stochastic block model.
Additional Information
© 2014 Anima Anandkumar, Rong Ge, Daniel Hsu, Sham Kakade. Submitted 7/13; Revised 11/13; Published 10/00. We thank the JMLR Action Editor Nathan Srebro and the anonymous reviewers for comments which significantly improved this manuscript. We thank Jure Leskovec for helpful discussions regarding various community models. Part of this work was done when AA, RG, and DH were at MSR New England. AA is supported in part by the Microsoft faculty fellowship, NSF Career award CCF-1254106, NSF Award CCF-1219234 and the ARO YIP Award W911NF-13-1-0084.Attached Files
Published - anandkumar14a.pdf
Submitted - 1302.2684.pdf
Files
Name | Size | Download all |
---|---|---|
md5:d9ec5025c79cad18082bee278bf14c3a
|
625.4 kB | Preview Download |
md5:c847af811997a976cf570b5ab5cc5b74
|
691.1 kB | Preview Download |
Additional details
- Eprint ID
- 81871
- Resolver ID
- CaltechAUTHORS:20170927-093022023
- Microsoft Research
- NSF
- CCF-1254106
- NSF
- CCF-1219234
- Army Research Office (ARO)
- W911NF-13-1-0084
- Created
-
2017-09-27Created from EPrint's datestamp field
- Updated
-
2023-06-02Created from EPrint's last_modified field