Droplet scRNA-seq is not zero-inflated
- Creators
- Svensson, Valentine
Abstract
Potential users of single-cell RNA-sequencing (scRNA-seq) often encounter a choice between high-throughput droplet-based methods and high-sensitivity plate-based methods. There is a widespread belief that scRNA-seq will often fail to generate measurements for some genes from some cells owing to technical molecular inefficiencies. It is believed that this causes data to have an overabundance of zero values compared to what is expected from random sampling and that this effect is particularly pronounced in droplet-based methods. Here I present an investigation of published data for technical controls in droplet-based scRNA-seq experiments that demonstrates that the number of zero values in the data is consistent with common distributional models of molecule sampling counts. Thus, any additional zero values in biological data likely result from biological variation or may reflect variation in gene abundance among cell types or cell states.
Additional Information
© 2020 Springer Nature Limited. Published 14 January 2020. I thank E. da Veiga Beltrame for feedback on the manuscript, G. Eraslan for making fast code for fitting negative binomial models available, and the scientific community on Twitter for suggesting writing up this analysis as a manuscript. V.S. was funded in part by the EMBL International PhD Programme and NIH U19MH114830. The author declares no competing interests.Attached Files
Submitted - 582064.full.pdf
Supplemental Material - 41587_2019_379_MOESM1_ESM.pdf
Files
Name | Size | Download all |
---|---|---|
md5:6e707cdb4891d5121a0bc29d5fd93ebb
|
430.9 kB | Preview Download |
md5:eeb9bd7c1c1fa409cc69235344bf0d51
|
1.3 MB | Preview Download |
Additional details
- Eprint ID
- 93970
- Resolver ID
- CaltechAUTHORS:20190319-111610182
- European Molecular Biology Laboratory (EMBL)
- U19MH114830
- NIH
- Created
-
2019-03-19Created from EPrint's datestamp field
- Updated
-
2023-06-01Created from EPrint's last_modified field