Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals
- Creators
- Guttman, Mitchell
- Amit, Ido
- Garber, Manuel
- French, Courtney
- Lin, Michael F.
- Feldser, David
- Huarte, Maite
- Zuk, Or
- Carey, Bryce W.
- Cassady, John P.
- Cabili, Moran N.
- Jaenisch, Rudolf
- Mikkelsen, Tarjei S.
- Jacks, Tyler
- Hacohen, Nir
- Bernstein, Bradley E.
- Kellis, Manolis
- Regev, Aviv
- Rinn, John L.
- Lander, Eric S.
Abstract
There is growing recognition that mammalian cells produce many thousands of large intergenic transcripts. However, the functional significance of these transcripts has been particularly controversial. Although there are some well-characterized examples, most (>95%) show little evidence of evolutionary conservation and have been suggested to represent transcriptional noise. Here we report a new approach to identifying large non-coding RNAs using chromatin-state maps to discover discrete transcriptional units intervening known protein-coding loci. Our approach identified ~1,600 large multi-exonic RNAs across four mouse cell types. In sharp contrast to previous collections, these large intervening non-coding RNAs (lincRNAs) show strong purifying selection in their genomic loci, exonic sequences and promoter regions, with greater than 95% showing clear evolutionary conservation. We also developed a functional genomics approach that assigns putative functions to each lincRNA, demonstrating a diverse range of roles for lincRNAs in processes from embryonic stem cell pluripotency to cell proliferation. We obtained independent functional validation for the predictions for over 100 lincRNAs, using cell-based assays. In particular, we demonstrate that specific lincRNAs are transcriptionally regulated by key transcription factors in these processes such as p53, NFκB, Sox2, Oct4 (also known as Pou5f1) and Nanog. Together, these results define a unique collection of functional lincRNAs that are highly conserved and implicated in diverse biological processes.
Additional Information
© 2009 Macmillan Publishers Limited. Received 5 August 2008; Accepted 25 November 2008; Published online 1 February 2009. We would like to thank our colleagues at the Broad Institute, especially J. P. Mesirov for discussions and statistical insights, X. Xie for statistical help with conservation analyses, J. Robinson for visualization help, M. Ku, E. Mendenhall and X. Zhang for help generating ChIP samples, and N. Novershtern and A. Levy for providing transcription factor lists. M. Guttman is a Vertex scholar, I.A. acknowledges the support of the Human Frontier Science Program Organization. This work was funded by Beth Israel Deaconess Medical Center, National Human Genome Research Institute, and the Broad Institute of MIT and Harvard. Author Contributions: J.L.R., E.S.L., A.R. and M. Guttman conceived and designed experiments. The manuscript was written by M. Guttman, A.R., J.L.R. and E.S.L. J.L.R., I.A., C.F., D.F., M.H., B.W.C., J.P.C. and M. Guttman performed molecular biology experiments. All data analyses were performed by M. Guttman in conjunction with M. Garber (conservation analyses), M.F.L. (codon substitution frequency), T.S.M. (ChlP-seq data), O.Z. (motif analysis) and M.N.C. (lincRNA genomic location analysis). Reagents were provided by M. Garber (pre-published conservation analysis tools); T.J. and D.F. (p53 wild-type and knockout MEFs); N.H., A.R. and I.A. (dendritic cell stimulated time course); B.E.B. (ChlP data); R.J., B.W.C. and J.P.C. (luciferase assays); and M.K. and M.F.L. (codon substitution frequency code). Microarray data have been deposited in the Gene Expression Omnibus (GEO) under accession number GSE13765.Attached Files
Accepted Version - nihms106797.pdf
Supplemental Material - nature07672-s1.pdf
Supplemental Material - nature07672-s10.xls
Supplemental Material - nature07672-s11.xls
Supplemental Material - nature07672-s12.xls
Supplemental Material - nature07672-s13.txt
Supplemental Material - nature07672-s14.xls
Supplemental Material - nature07672-s15.xls
Supplemental Material - nature07672-s16.xls
Supplemental Material - nature07672-s17.xls
Supplemental Material - nature07672-s18.xls
Supplemental Material - nature07672-s19.xls
Supplemental Material - nature07672-s2.pdf
Supplemental Material - nature07672-s3.xls
Supplemental Material - nature07672-s4.xls
Supplemental Material - nature07672-s5.doc
Supplemental Material - nature07672-s6.xls
Supplemental Material - nature07672-s7.xls
Supplemental Material - nature07672-s8.xls
Supplemental Material - nature07672-s9.xls
Files
Name | Size | Download all |
---|---|---|
md5:4f7c39f318705b0e47351e2209d21191
|
856.1 kB | Download |
md5:beb305676fdbb581c89d7c480d2959f9
|
2.1 MB | Preview Download |
md5:c4772b75f84bbc2ac6449e9a424e4599
|
24.1 kB | Download |
md5:bdd7beb810875313affc10deba65faab
|
430.1 kB | Download |
md5:d508cad3b32d27c3da645a74aa51885e
|
2.1 MB | Preview Download |
md5:fddec43be90e3ac8f55c2f300156f21a
|
178.7 kB | Download |
md5:de1662dbc01004a68f70b030c0144e74
|
17.9 kB | Download |
md5:e51245b7bdb67fc02f558c8bd8c00ec3
|
36.9 kB | Download |
md5:5fd10bc65800ea4fb579a868a29afeca
|
125.4 kB | Download |
md5:8f640dba90439770cdeea8deb0ed9602
|
29.2 kB | Download |
md5:1d384e44a15f0f91a0699a8a4b912bec
|
355.8 kB | Download |
md5:112d0271faab8935aab7908d8c826325
|
23.6 kB | Download |
md5:409b2f42aafb3694dc4e23cc77abf2ef
|
649.2 kB | Download |
md5:fcd157b7bb1a5cc2997cceae07a5f007
|
23.0 kB | Download |
md5:6c8dae57ff826dba1dc211f1ff25cfc6
|
27.1 kB | Download |
md5:769be27a5dc9daa15f1fcbc7ac1a467e
|
6.4 MB | Preview Download |
md5:9203184dba3732a5a1864e401666f066
|
110.1 kB | Download |
md5:bc8756edc00e9995111b96eefca136b2
|
151.2 kB | Preview Download |
md5:3af51f8aa3e22c12c318541698352b8b
|
31.7 kB | Download |
md5:75462592c7e84b9d962a77bd54e76401
|
28.2 kB | Download |
Additional details
- PMCID
- PMC2754849
- Eprint ID
- 72212
- Resolver ID
- CaltechAUTHORS:20161121-144950442
- Vertex Scholarship
- Human Frontier Science Program
- Beth Israel Deaconess Medical Center
- National Human Genome Research Institute
- Broad Institute of MIT and Harvard
- NIH
- Created
-
2016-11-21Created from EPrint's datestamp field
- Updated
-
2021-11-11Created from EPrint's last_modified field