Interpretable factor models of single-cell RNA-seq via variational autoencoders
Abstract
Motivation: Single-cell RNA-seq makes possible the investigation of variability in gene expression among cells, and dependence of variation on cell type. Statistical inference methods for such analyses must be scalable, and ideally interpretable. Results: We present an approach based on a modification of a recently published highly scalable variational autoencoder framework that provides interpretability without sacrificing much accuracy. We demonstrate that our approach enables identification of gene programs in massive datasets. Our strategy, namely the learning of factor models with the auto-encoding variational Bayes framework, is not domain specific and may be useful for other applications. Availability and implementation: The factor model is available in the scVI package hosted at https://github.com/YosefLab/scVI/.
Additional Information
© 2020 The Author(s). Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. Received: 13 September 2019; Revision received: 03 February 2020; Accepted: 20 February 2020; Published: 16 March 2020. We thank Eduardo da Veiga Beltrame and Romain Lopez for helpful feedback on the manuscript. Sina Booeshaghi provided useful comments on the LDVAE software. Additionally, we thank the users of scVI who provided helpful discussion about the implementation on Github. Funding: This work was supported by the National Institutes of Health [U19MH114830 to V.S. and L.P.]; and Error! Hyperlink reference not valid. [CZF2019-002454 to A.G. and N.Y.]. Conflict of Interest: none declared.Attached Files
Published - btaa169.pdf
Submitted - 737601.full.pdf
Supplemental Material - btaa169_supplementary_data.zip
Files
Additional details
- PMCID
- PMC7267837
- Eprint ID
- 97957
- Resolver ID
- CaltechAUTHORS:20190816-135915873
- NIH
- U19MH114830
- Chan Zuckerberg Foundation
- CZF2019-002454
- Created
-
2019-08-16Created from EPrint's datestamp field
- Updated
-
2022-02-15Created from EPrint's last_modified field
- Caltech groups
- Division of Biology and Biological Engineering (BBE)