CGAL: computing genome assembly likelihoods
- Creators
-
Rahman, Atif
-
Pachter, Lior
Abstract
Assembly algorithms have been extensively benchmarked using simulated data so that results can be compared to ground truth. However, in de novo assembly, only crude metrics such as contig number and size are typically used to evaluate assembly quality. We present CGAL, a novel likelihood-based approach to assembly assessment in the absence of a ground truth. We show that likelihood is more accurate than other metrics currently used for evaluating assemblies, and describe its application to the optimization and comparison of assembly algorithms. Our methods are implemented in software that is freely available at http://bio.math.berkeley.edu/cgal/.
Additional Information
© 2013 Rahman and Pachter, licensee Springer. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Received: 23 August 2012. Accepted: 29 January 2013. Published: 29 January 2013. We thank Michael Eisen, Aaron Kleinman, Harold Pimentel and Adam Roberts for helpful conversations in the development of the likelihood-based approach for assembly evaluation. LP was funded in part by NIH R21 HG006583. AR was funded in part by Fulbright Science & Technology Fellowship 15093630. Authors' contributions: AR and LP conceived the project and developed the methodology. AR implemented the method in the CGAL software and obtained the results of the paper. AR and LP wrote the manuscript. All authors read and approved the final manuscript. The authors have no competing interests.Attached Files
Published - art_3A10.1186_2Fgb-2013-14-1-r8.pdf
Supplemental Material - 13059_2012_3037_MOESM10_ESM.eps
Supplemental Material - 13059_2012_3037_MOESM11_ESM.eps
Supplemental Material - 13059_2012_3037_MOESM1_ESM.PDF
Supplemental Material - 13059_2012_3037_MOESM2_ESM.eps
Supplemental Material - 13059_2012_3037_MOESM3_ESM.eps
Supplemental Material - 13059_2012_3037_MOESM4_ESM.eps
Supplemental Material - 13059_2012_3037_MOESM5_ESM.eps
Supplemental Material - 13059_2012_3037_MOESM6_ESM.eps
Supplemental Material - 13059_2012_3037_MOESM7_ESM.eps
Supplemental Material - 13059_2012_3037_MOESM8_ESM.eps
Supplemental Material - 13059_2012_3037_MOESM9_ESM.eps
Files
Name | Size | Download all |
---|---|---|
md5:cadeef966e806b9f7bcd70dc1ad64709
|
7.8 kB | Download |
md5:54537ea1aa33a5abef5a88547a831f27
|
5.1 kB | Download |
md5:63ef41e56eb4a1ee366212abed82a3e1
|
4.6 kB | Download |
md5:6e4af3c4bc3567b9d70e8a8a59137dea
|
7.9 kB | Download |
md5:60961307fa99892e7bb035fec0f390f7
|
8.5 kB | Download |
md5:ca6d2187db6d63832ab348248fded0b1
|
5.2 kB | Download |
md5:10bd09fa7e4a9787185b5e921babd38b
|
685.4 kB | Preview Download |
md5:5702fe28a5742b337fde2f9f3998871b
|
6.6 kB | Download |
md5:c93352a07ce4014f7b977b68e35e2ff9
|
8.9 kB | Download |
md5:5e4bd40635d7b1c1850d04c1894c9ea2
|
7.3 kB | Download |
md5:5346a9910b045978e81b0670db9513e1
|
7.7 kB | Download |
md5:8629a08b9bf45cbc3cfedad4d2490da2
|
163.4 kB | Preview Download |
Additional details
- PMCID
- PMC3663106
- Eprint ID
- 74740
- Resolver ID
- CaltechAUTHORS:20170303-155431520
- NIH
- R21 HG006583
- Fulbright Foundation
- 15093630
- Created
-
2017-03-06Created from EPrint's datestamp field
- Updated
-
2021-11-11Created from EPrint's last_modified field