Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published March 12, 2009 | Supplemental Material + Accepted Version
Journal Article Open

Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals

Abstract

There is growing recognition that mammalian cells produce many thousands of large intergenic transcripts. However, the functional significance of these transcripts has been particularly controversial. Although there are some well-characterized examples, most (>95%) show little evidence of evolutionary conservation and have been suggested to represent transcriptional noise. Here we report a new approach to identifying large non-coding RNAs using chromatin-state maps to discover discrete transcriptional units intervening known protein-coding loci. Our approach identified ~1,600 large multi-exonic RNAs across four mouse cell types. In sharp contrast to previous collections, these large intervening non-coding RNAs (lincRNAs) show strong purifying selection in their genomic loci, exonic sequences and promoter regions, with greater than 95% showing clear evolutionary conservation. We also developed a functional genomics approach that assigns putative functions to each lincRNA, demonstrating a diverse range of roles for lincRNAs in processes from embryonic stem cell pluripotency to cell proliferation. We obtained independent functional validation for the predictions for over 100 lincRNAs, using cell-based assays. In particular, we demonstrate that specific lincRNAs are transcriptionally regulated by key transcription factors in these processes such as p53, NFκB, Sox2, Oct4 (also known as Pou5f1) and Nanog. Together, these results define a unique collection of functional lincRNAs that are highly conserved and implicated in diverse biological processes.

Additional Information

© 2009 Macmillan Publishers Limited. Received 5 August 2008; Accepted 25 November 2008; Published online 1 February 2009. We would like to thank our colleagues at the Broad Institute, especially J. P. Mesirov for discussions and statistical insights, X. Xie for statistical help with conservation analyses, J. Robinson for visualization help, M. Ku, E. Mendenhall and X. Zhang for help generating ChIP samples, and N. Novershtern and A. Levy for providing transcription factor lists. M. Guttman is a Vertex scholar, I.A. acknowledges the support of the Human Frontier Science Program Organization. This work was funded by Beth Israel Deaconess Medical Center, National Human Genome Research Institute, and the Broad Institute of MIT and Harvard. Author Contributions: J.L.R., E.S.L., A.R. and M. Guttman conceived and designed experiments. The manuscript was written by M. Guttman, A.R., J.L.R. and E.S.L. J.L.R., I.A., C.F., D.F., M.H., B.W.C., J.P.C. and M. Guttman performed molecular biology experiments. All data analyses were performed by M. Guttman in conjunction with M. Garber (conservation analyses), M.F.L. (codon substitution frequency), T.S.M. (ChlP-seq data), O.Z. (motif analysis) and M.N.C. (lincRNA genomic location analysis). Reagents were provided by M. Garber (pre-published conservation analysis tools); T.J. and D.F. (p53 wild-type and knockout MEFs); N.H., A.R. and I.A. (dendritic cell stimulated time course); B.E.B. (ChlP data); R.J., B.W.C. and J.P.C. (luciferase assays); and M.K. and M.F.L. (codon substitution frequency code). Microarray data have been deposited in the Gene Expression Omnibus (GEO) under accession number GSE13765.

Attached Files

Accepted Version - nihms106797.pdf

Supplemental Material - nature07672-s1.pdf

Supplemental Material - nature07672-s10.xls

Supplemental Material - nature07672-s11.xls

Supplemental Material - nature07672-s12.xls

Supplemental Material - nature07672-s13.txt

Supplemental Material - nature07672-s14.xls

Supplemental Material - nature07672-s15.xls

Supplemental Material - nature07672-s16.xls

Supplemental Material - nature07672-s17.xls

Supplemental Material - nature07672-s18.xls

Supplemental Material - nature07672-s19.xls

Supplemental Material - nature07672-s2.pdf

Supplemental Material - nature07672-s3.xls

Supplemental Material - nature07672-s4.xls

Supplemental Material - nature07672-s5.doc

Supplemental Material - nature07672-s6.xls

Supplemental Material - nature07672-s7.xls

Supplemental Material - nature07672-s8.xls

Supplemental Material - nature07672-s9.xls

Files

nature07672-s1.pdf
Files (13.7 MB)
Name Size Download all
md5:4f7c39f318705b0e47351e2209d21191
856.1 kB Download
md5:beb305676fdbb581c89d7c480d2959f9
2.1 MB Preview Download
md5:c4772b75f84bbc2ac6449e9a424e4599
24.1 kB Download
md5:bdd7beb810875313affc10deba65faab
430.1 kB Download
md5:d508cad3b32d27c3da645a74aa51885e
2.1 MB Preview Download
md5:fddec43be90e3ac8f55c2f300156f21a
178.7 kB Download
md5:de1662dbc01004a68f70b030c0144e74
17.9 kB Download
md5:e51245b7bdb67fc02f558c8bd8c00ec3
36.9 kB Download
md5:5fd10bc65800ea4fb579a868a29afeca
125.4 kB Download
md5:8f640dba90439770cdeea8deb0ed9602
29.2 kB Download
md5:1d384e44a15f0f91a0699a8a4b912bec
355.8 kB Download
md5:112d0271faab8935aab7908d8c826325
23.6 kB Download
md5:409b2f42aafb3694dc4e23cc77abf2ef
649.2 kB Download
md5:fcd157b7bb1a5cc2997cceae07a5f007
23.0 kB Download
md5:6c8dae57ff826dba1dc211f1ff25cfc6
27.1 kB Download
md5:769be27a5dc9daa15f1fcbc7ac1a467e
6.4 MB Preview Download
md5:9203184dba3732a5a1864e401666f066
110.1 kB Download
md5:bc8756edc00e9995111b96eefca136b2
151.2 kB Preview Download
md5:3af51f8aa3e22c12c318541698352b8b
31.7 kB Download
md5:75462592c7e84b9d962a77bd54e76401
28.2 kB Download

Additional details

Created:
August 20, 2023
Modified:
October 23, 2023