Published August 1, 2021
| Submitted + Published
Journal Article
Open
Normalization of single-cell RNA-seq counts by log(x+1)* or log(1+x)*
- Creators
-
Booeshaghi, A. Sina
-
Pachter, Lior
Chicago
Abstract
Single-cell RNA-seq technologies have been successfully employed over the past decade to generate many high resolution cell atlases. These have proved invaluable in recent efforts aimed at understanding the cell type specificity of host genes involved in SARS-CoV-2 infections. While single-cell atlases are based on well-sampled highly-expressed genes, many of the genes of interest for understanding SARS-CoV-2 can be expressed at very low levels. Common assumptions underlying standard single-cell analyses don't hold when examining low-expressed genes, with the result that standard workflows can produce misleading results.
Additional Information
© The Author(s) 2021. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. Received: 14 October 2020; Revision received: 23 December 2020; Editorial decision: 29 January 2021; Accepted: 01 March 2021; Published: 02 March 2021. We thank Charles Herring, Michael Hoffman, Johan Gustafsson, Harold Pimentel, Jeffrey Spence, and Valentine Svensson for helpful comments. A.S.B. and L.P. were partially funded by NIH U19MH114830. Data Availability Statement. Data and code that reproduce the results in this paper are available here: https://github.com/pachterlab/BP_2021_2. Conflict of Interest: none declared.Attached Files
Published - btab085.pdf
Submitted - 2020.05.19.100214v3.full.pdf
Files
btab085.pdf
Files
(530.8 kB)
Name | Size | Download all |
---|---|---|
md5:c03eb11c55b1b270fa744311a6492db7
|
214.1 kB | Preview Download |
md5:8b5570bb08df7be10a46d48a66d34934
|
316.7 kB | Preview Download |
Additional details
- Alternative title
- log(x+1)* and log(1+x)*
- PMCID
- PMC7989636
- Eprint ID
- 103346
- DOI
- 10.1093/bioinformatics/btab085
- Resolver ID
- CaltechAUTHORS:20200520-084505912
- NIH
- U19MH114830
- Created
-
2020-05-20Created from EPrint's datestamp field
- Updated
-
2023-07-20Created from EPrint's last_modified field
- Caltech groups
- COVID-19, Division of Biology and Biological Engineering (BBE)