Genome annotation by high-throughput 5' RNA end determination
Abstract
Complete gene identification and annotation, including alternative transcripts, remains a challenge in understanding genome organization. Such annotation can be achieved by a combination of computational analysis and experimental confirmation. Here, we describe a high-throughput technique, trans-spliced exon coupled RNA end determination (TEC-RED), that identifies 5' ends of expressed genes in nematodes. TEC-RED can distinguish coding regions from regulatory regions and identify genes as well as their alternative transcripts that have different 5' ends. Application of TEC-RED to approximate to 10% of the Caenorhabditis elegans genome yielded tags 75% of which experimentally verified predicted 5'-RNA ends and 25% of which provided previously unknown information about 5'-RNA ends, including the identification of 99 previously unknown genes and 32 previously unknown operons. This technique will be applicable in any organisms that have a trans-splicing reaction from spliced leader RNA. We also describe an efficient sequential method for concatenating short sequence tags for any serial analysis of gene expression-like techniques.
Additional Information
© 2004 by the National Academy of Sciences. Communicated by Philip P. Green, University of Washington School of Medicine, Seattle, WA, December 16, 2003 (received for review November 20, 2003). Published online before print February 2, 2004, 10.1073/pnas.0308384100 We thank S. Gharib for DNA preparation, T. Blumenthal for discussions about operons, and C. Bastiani, T. Blumenthal, Y. Kee, E. Schwarz, and S. Vernooy for careful reading of the manuscript. This work was supported by the Howard Hughes Medical Institute, with which P.W.S. is an Investigator and B.J.H. and H.-M.M. were Associates, by a W. M. Keck Foundation/California Institute of Technology discovery award (to B.J.H. and P.W.S.), and by National Human Genome Research Institute/National Institutes of Health Genome Scholar and Faculty Transition Award K22HG02907-01 (to B.J.H.).Attached Files
Published - HWApnas04b.pdf
Files
Name | Size | Download all |
---|---|---|
md5:2b63f533a71e0ea8cc928769b36821ef
|
773.3 kB | Preview Download |
Additional details
- PMCID
- PMC341809
- Eprint ID
- 1337
- Resolver ID
- CaltechAUTHORS:HWApnas04b
- Howard Hughes Medical Institute (HHMI)
- W. M. Keck Foundation
- Caltech
- National Human Genome Research Institute
- NIH
- K22HG02907-01
- Created
-
2006-01-11Created from EPrint's datestamp field
- Updated
-
2023-06-01Created from EPrint's last_modified field