Tensor Decompositions for Learning Latent Variable Models
Abstract
This work considers a computationally and statistically efficient parameter estimation method for a wide class of latent variable models---including Gaussian mixture models, hidden Markov models, and latent Dirichlet allocation---which exploits a certain tensor structure in their low-order observable moments (typically, of second- and third-order). Specifically, parameter estimation is reduced to the problem of extracting a certain (orthogonal) decomposition of a symmetric tensor derived from the moments; this decomposition can be viewed as a natural generalization of the singular value decomposition for matrices. Although tensor decompositions are generally intractable to compute, the decomposition of these specially structured tensors can be efficiently obtained by a variety of approaches, including power iterations and maximization approaches (similar to the case of matrices). A detailed analysis of a robust tensor power method is provided, establishing an analogue of Wedin's perturbation theorem for the singular vectors of matrices. This implies a robust and computationally tractable estimation approach for several popular latent variable models.
Additional Information
© 2014 Animashree Anandkumar, Rong Ge, Daniel Hsu, Sham M. Kakade, and Matus Telgarsky. Submitted 2/13; Revised 3/14; Published 8/14. We thank Boaz Barak, Dean Foster, Jon Kelner, and Greg Valiant for helpful discussions. We are also grateful to Hanzhang Hu, Drew Bagnell, and Martial Hebert for alerting us of an issue with Theorem 4.2 and suggesting a simple fix. This work was completed while DH was a postdoctoral researcher at Microsoft Research New England, and partly while AA, RG, and MT were visiting the same lab. AA is supported in part by the NSF Award CCF-1219234, AFOSR Award FA9550-10-1-0310 and the ARO Award W911NF-12-1-0404.Attached Files
Published - anandkumar14b.pdf
Submitted - 1210.7559.pdf
Files
Name | Size | Download all |
---|---|---|
md5:3e2d77e6dfae8c138c44da9222ba71d5
|
515.6 kB | Preview Download |
md5:8d97a04c80fe97eaafdc87b43c0e62a6
|
585.6 kB | Preview Download |
Additional details
- Eprint ID
- 81881
- Resolver ID
- CaltechAUTHORS:20170927-134735763
- Microsoft Research
- NSF
- CCF-1219234
- Air Force Office of Scientific Research (AFOSR)
- FA9550-10-1-0310
- Army Research Office (ARO)
- W911NF-12-1-0404
- Created
-
2017-09-27Created from EPrint's datestamp field
- Updated
-
2023-06-02Created from EPrint's last_modified field