Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions

Creators: Halko, N.; Martinsson, P. G.; Tropp, J. A.

Style

An error occurred while generating the citation.

Abstract

Low-rank matrix approximations, such as the truncated singular value decomposition and the rank-revealing QR decomposition, play a central role in data analysis and scientific computing. This work surveys and extends recent research which demonstrates that randomization offers a powerful tool for performing low-rank matrix approximation. These techniques exploit modern computational architectures more fully than classical methods and open the possibility of dealing with truly massive data sets. This paper presents a modular framework for constructing randomized algorithms that compute partial matrix decompositions. These methods use random sampling to identify a subspace that captures most of the action of a matrix. The input matrix is then compressed—either explicitly or implicitly—to this subspace, and the reduced matrix is manipulated deterministically to obtain the desired low-rank factorization. In many cases, this approach beats its classical competitors in terms of accuracy, robustness, and/or speed. These claims are supported by extensive numerical experiments and a detailed error analysis. The specific benefits of randomized techniques depend on the computational environment. Consider the model problem of finding the k dominant components of the singular value decomposition of an m × n matrix. (i) For a dense input matrix, randomized algorithms require O(mn log(k)) floating-point operations (flops) in contrast to O(mnk) for classical algorithms. (ii) For a sparse input matrix, the flop count matches classical Krylov subspace methods, but the randomized approach is more robust and can easily be reorganized to exploit multiprocessor architectures. (iii) For a matrix that is too large to fit in fast memory, the randomized techniques require only a constant number of passes over the data, as opposed to O(k) passes for classical algorithms. In fact, it is sometimes possible to perform matrix approximation with a single pass over the data.

Additional Information

© 2011 Society for Industrial and Applied Mathematics. Received by the editors September 21, 2009; accepted for publication (in revised form) December 2, 2010; published electronically May 5, 2011. The authors have benefited from valuable discussions with many researchers, among them Inderjit Dhillon, Petros Drineas, Ming Gu, Edo Liberty, Michael Mahoney, Vladimir Rokhlin, Yoel Shkolnisky, and Arthur Szlam. In particular, we would like to thank Mark Tygert for his insightful remarks on early drafts of this paper. The example in section 7.2 was provided by Fran¸cois Meyer of the University of Colorado at Boulder. The example in section 7.3 comes from the FERET database of facial images collected under the FERET program, sponsored by the DoD Counterdrug Technology Development Program Office. The work reported was initiated during the program Mathematics of Knowledge and Search Engines held at IPAM in the fall of 2007. Finally, we would like to thank the anonymous referees, whose thoughtful remarks have helped us to improve the manuscript dramatically.

Attached Files

Published - Halko2011p16112SIAM_Review.pdf

Submitted - 0909.4061.pdf

Files

0909.4061.pdf

Files (2.6 MB)

Name	Size	Download all
0909.4061.pdf md5:50ec63f39d2b9e8d3c4a1201594cabca	1.3 MB	Preview Download
Halko2011p16112SIAM_Review.pdf md5:681d918f89701ff590789dbdd1e4c830	1.3 MB	Preview Download

Additional details

	All versions	This version
Views	57	57
Downloads	143	143
Data volume	228.8 MB	228.8 MB