Models for transcript quantification from RNA-Seq
- Creators
-
Pachter, Lior
Abstract
RNA-Seq is rapidly becoming the standard technology for transcriptome analysis. Fundamental to many of the applications of RNA-Seq is the quantification problem, which is the accurate measurement of relative transcript abundances from the sequenced reads. We focus on this problem, and review many recently published models that are used to estimate the relative abundances. In addition to describing the models and the different approaches to inference, we also explain how methods are related to each other. A key result is that we show how inference with many of the models results in identical estimates of relative abundances, even though model formulations can be very different. In fact, we are able to show how a single general model captures many of the elements of previously published methods. We also review the applications of RNA-Seq models to differential analysis, and explain why accurate relative transcript abundance estimates are crucial for downstream analyses.
Additional Information
Submitted on 19 Apr 2011 (v1), last revised 13 May 2011 (this version, v2). I thank Adam Roberts and Cole Trapnell for many helpful discussions that clarified our understanding of RNA-Seq and that led to many of the remarks in this paper. Meromit Singer, during the course of many discussions with me, questioned the relevance and interpretation of alternative likelihood formulations for methyl-Seq models; those conversations led to similar questions about RNA-Seq models, and finally to the comments on the equivalence between multinomial and Poisson models for RNA-Seq discussed in Section 4 and Appendix I. Colin Dewey provided many helpful suggestions and comments after reviewing a preliminary version of the manuscript. Finally, thanks to Sharon Aviran, Nicolas Bray, Ingileif Hallgrímsdόttir, Valerie Hower, Aaron Kleinman, Megan Owen, Harold Pimentel, Atif Rahman, Adam Roberts, Meromit Singer and Cole Trapnell for valuable comments and insights during the writing of this paper.Attached Files
Submitted - 1104.3889.pdf
Files
Name | Size | Download all |
---|---|---|
md5:c71b27651aa7be7b435de6725311e0ee
|
933.6 kB | Preview Download |
Additional details
- Eprint ID
- 74848
- Resolver ID
- CaltechAUTHORS:20170307-112830219
- Created
-
2017-03-07Created from EPrint's datestamp field
- Updated
-
2023-06-02Created from EPrint's last_modified field