High-Dimensional Covariance Decomposition into Sparse Markov and Independence Models
- Creators
- Janzamin, Majid
- Anandkumar, Animashree
Abstract
Fitting high-dimensional data involves a delicate tradeoff between faithful representation and the use of sparse models. Too often, sparsity assumptions on the fitted model are too restrictive to provide a faithful representation of the observed data. In this paper, we present a novel framework incorporating sparsity in different domains. We decompose the observed covariance matrix into a sparse Gaussian Markov model (with a sparse precision matrix) and a sparse independence model (with a sparse covariance matrix). Our framework incorporates sparse covariance and sparse precision estimation as special cases and thus introduces a richer class of high-dimensional models. We posit the observed data as generated from a linear combination of a sparse Gaussian Markov model (with a sparse precision matrix) and a sparse Gaussian independence model (with a sparse covariance matrix). We characterize sufficient conditions for identifiability of the two models, viz., Markov and independence models. We propose an efficient decomposition method based on a modification of the popular ℓ_1-penalized maximum- likelihood estimator (ℓ_1-MLE). We establish that our estimator is consistent in both the domains, i.e., it successfully recovers the supports of both Markov and independence models, when the number of samples n scales as n=Ω(d^2log p), where p is the number of variables and d is the maximum node degree in the Markov model. Our experiments validate these results and also demonstrate that our models have better inference accuracy under simple algorithms such as loopy belief propagation.
Additional Information
© 2014 Majid Janzamin and Animashree Anandkumar. We thank Karthik Mohan for helpful discussions on running experiments. We also acknowledge useful discussions with Max Welling, Babak Hassibi and Martin Wainwright. We also thank Bin Yu and the JMLR reviewers for valuable comments that have significantly improved the manuscript. M. Janzamin is supported by NSF Award CCF-1219234 and ARO Award W911NF-12-1-0404. A. Anandkumar is supported in part by Microsoft Faculty Fellowship, NSF Career award CCF-1254106, NSF Award CCF-1219234, AFOSR Award FA9550-10-1-0310, and ARO Award W911NF-12-1-0404.Attached Files
Published - p1549-janzamin.pdf
Submitted - 1211.0919.pdf
Files
Name | Size | Download all |
---|---|---|
md5:ef2ad2ee39d7b610814c92d58177fe05
|
518.6 kB | Preview Download |
md5:9b5ed217f38a180e8cbb5b1051f99c79
|
1.5 MB | Preview Download |
Additional details
- Eprint ID
- 81883
- Resolver ID
- CaltechAUTHORS:20170927-142820777
- NSF
- CCF-1219234
- Army Research Office (ARO)
- W911NF-12-1-0404
- Microsoft Research
- NSF
- CCF-1254106
- Air Force Office of Scientific Research (AFOSR)
- FA9550-10-1-0310
- Created
-
2017-09-27Created from EPrint's datestamp field
- Updated
-
2023-06-02Created from EPrint's last_modified field