Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published December 10, 2013 | Supplemental Material + Published
Journal Article Open

Characterization of the human ESC transcriptome by hybrid sequencing

Abstract

Although transcriptional and posttranscriptional events are detected in RNA-Seq data from second-generation sequencing, fulllength mRNA isoforms are not captured. On the other hand, thirdgeneration sequencing, which yields much longer reads, has current limitations of lower raw accuracy and throughput. Here, we combine second-generation sequencing and third-generation sequencing with a custom-designed method for isoform identification and quantification to generate a high-confidence isoform dataset for human embryonic stem cells (hESCs). We report 8,084 RefSeq-annotated isoforms detected as full-length and an additional 5,459 isoforms predicted through statistical inference. Over one-third of these are novel isoforms, including 273 RNAs from gene loci that have not previously been identified. Further characterization of the novel loci indicates that a subset is expressed in pluripotent cells but not in diverse fetal and adult tissues; moreover, their reduced expression perturbs the network of pluripotency- associated genes. Results suggest that gene identification, even in well-characterized human cell lines and tissues, is likely far from complete.

Additional Information

© 2013 National Academy of Sciences. Freely available online through the PNAS open access option. Contributed by Wing Hung Wong, November 5, 2013 (sent for review August 6, 2013). We thank Dr. Arnold Ludwig Hayer, Dr. Tobias Meyer, Dr. Laughing Bear Torrez, and Dr. Diana Cepeda for help in designing siRNA experiments and for technical help in performing the knockdown experiments. K.F.A. and W.H.W. were supported by the National Human Genome Research Institute (R01HG005717). B.A.W. was partially supported by the California Institute of Technology Beckman Center for Functional Genomics. The Illumina short-read data were generated as part of the Encode project (National Human Genome Research Institute Grants U54 HG004576 and U54 HG006998 to Barbara Wold). V.S., J.D.D., and R.A.R.-P. were supported by the National Heart, Lung, and Blood Institute (U01HL100397) and the National Institute of Child Health and Human Development (U54 HD068158). Author contributions: K.F.A. and W.H.W. designed research; K.F.A., V.S., P.T.A., J.D.D., H.V.B., R.A.R.-P., and J.G.U. performed research; K.F.A., L.L., B.A.W., and E.E.S. contributed new reagents/analytic tools; K.F.A. analyzed data; and K.F.A. and W.H.W. wrote the paper. The authors declare no conflict of interest.

Attached Files

Published - PNAS-2013-Au-E4821-30.pdf

Supplemental Material - sapp.pdf

Files

sapp.pdf
Files (6.3 MB)
Name Size Download all
md5:1ef4a53b4dbfc73fa8547c3bc82a6f24
3.5 MB Preview Download
md5:ca7d73ae0cc095d5f3114978c5e0283c
2.7 MB Preview Download

Additional details

Created:
August 22, 2023
Modified:
October 25, 2023