Improving RNA-Seq expression estimates by correcting for fragment bias
Abstract
The biochemistry of RNA-Seq library preparation results in cDNA fragments that are not uniformly distributed within the transcripts they represent. This non-uniformity must be accounted for when estimating expression levels, and we show how to perform the needed corrections using a likelihood-based approach. We find improvements in expression estimates, as measured by correlation with independently performed qRT-PCR, and show that correcting for bias improves the replicability of results across libraries and sequencing technologies.
Additional Information
© 2011 Roberts et al.; licensee BioMed Central Ltd. This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Received: 4 December 2010. Accepted: 16 March 2011. Published: 16 March 2011.
We thank Joshua Levin and Mitchell Guttman for their help with the NanoString experiment. Anat Caspi was instrumental in helping us obtain the SOLiD data. Adam Roberts was supported by an NSF graduate research fellowship.
Authors' contributions: AR, CT and LP developed the bias correction approach. AR implemented the improvements to the Cufflinks software. JLR provided reagents and guidance. JD performed the NanoString experiment. AR performed the analysis. AR and LP wrote the paper. All authors read and approved the final manuscript.
Attached Files
Published - art_3A10.1186_2Fgb-2011-12-3-r22.pdf
Supplemental Material - 13059_2010_2498_MOESM10_ESM.pdf
Supplemental Material - 13059_2010_2498_MOESM11_ESM.pdf
Supplemental Material - 13059_2010_2498_MOESM12_ESM.pdf
Supplemental Material - 13059_2010_2498_MOESM13_ESM.pdf
Supplemental Material - 13059_2010_2498_MOESM14_ESM.tiff
Supplemental Material - 13059_2010_2498_MOESM15_ESM.pdf
Supplemental Material - 13059_2010_2498_MOESM1_ESM.PDF
Supplemental Material - 13059_2010_2498_MOESM2_ESM.tgz
Supplemental Material - 13059_2010_2498_MOESM3_ESM.PDF
Supplemental Material - 13059_2010_2498_MOESM4_ESM.py
Supplemental Material - 13059_2010_2498_MOESM5_ESM.pdf
Supplemental Material - 13059_2010_2498_MOESM6_ESM.pdf
Supplemental Material - 13059_2010_2498_MOESM7_ESM.pdf
Supplemental Material - 13059_2010_2498_MOESM8_ESM.pdf
Supplemental Material - 13059_2010_2498_MOESM9_ESM.pdf
Files
Checksum | Size |
---|---|
md5:3a766563121820af814a4fbb7b8ec888 | 1.8 MB |
md5:249476ea04cbc5b4760549a44a9b86b3 | 304.2 kB |
md5:7f1a507594848897483f3d1af842b4e6 | 104.0 kB |
md5:84b1f5e3750ca61e84c6f252ca11ca7c | 421.3 kB |
md5:b14177e0a6afe581bf19d55724f245ae | 169.2 kB |
md5:2cb26ee5a1f3c2061f65e914c121c4d8 | 99.9 kB |
md5:bbfc17984722d45a86f496c65aa48cde | 105.9 kB |
md5:69af72a206ba6c1be6268c62ab888e25 | 694.2 kB |
md5:e2795ef0357ea1395b45d68039ef7e12 | 1.6 MB |
md5:48615d88841f78dd8153b6e8c3f053bc | 735.2 kB |
md5:742b9703805aec73ca88c44b817f694d | 137.5 kB |
md5:f87b909233d19703c11041f02637b319 | 3.2 kB |
md5:239e6c6489d8eceecbcb8268083eb5e2 | 123.9 kB |
md5:da2e7fc3ac45bc6612e4732ae4340619 | 195.5 kB |
md5:b10e268cf890c196e0627f7cee474982 | 92.7 kB |
md5:835cfd7715d5d2aef67146fb09dbcbd8 | 1.5 MB |
Additional details
- PMCID: PMC3129672
- Eprint ID: 74779
- Resolver ID: CaltechAUTHORS:20170306-105110860
- Funding: NSF Graduate Research Fellowship
- Created: 2017-03-06 (from EPrint's datestamp field)
- Updated: 2023-10-24 (from EPrint's last_modified field)