RNA editing in the human ENCODE RNA-seq data
Abstract
RNA-seq data can be mined for sequence differences relative to the reference genome to identify both genomic SNPs and RNA editing events. We analyzed the long, polyA-selected, unstranded, deeply sequenced RNA-seq data from the ENCODE Project across 14 human cell lines for candidate RNA editing events. On average, 43% of the RNA sequencing variants that are not in dbSNP and are within gene boundaries are A-to-G(I) RNA editing candidates. The vast majority of A-to-G(I) edits are located in introns and 3′ UTRs, with only 123 located in protein-coding sequence. In contrast, the majority of non–A-to-G variants (60%–80%) map near exon boundaries and have the characteristics of splice-mapping artifacts. After filtering out all candidates with evidence of private genomic variation using genome resequencing or ChIP-seq data, we find that up to 85% of the high-confidence RNA variants are A-to-G(I) editing candidates. Genes with A-to-G(I) edits are enriched in Gene Ontology terms involving cell division, viral defense, and translation. The distribution and character of the remaining non–A-to-G variants closely resemble known SNPs. We find no reproducible A-to-G(I) edits that result in nonsynonymous substitutions in all three lymphoblastoid cell lines in our study, unlike RNA editing in the brain. Given that only a fraction of sites are reproducibly edited in multiple cell lines and that we find a stronger association of editing and specific genes suggests that the editing of the transcript is more important than the editing of any individual site.
Additional Information
© 2012, Published by Cold Spring Harbor Laboratory Press. Freely available online through the Genome Research Open Access option. Received November 16, 2011; accepted in revised form May 1, 2012. We thank Wendy Lee and Alicia Rogers for assistance. The work of A.M. and E.P. on this manuscript was supported by the UC Irvine Center for Complex Biological Systems and U.S. National Institutes of Health (NIH) P50 GM076516, and the work of B.W. and B.J.W. was supported by the Beckman Foundation, the Donald Bren Endowment, and NIH grant U54 HG004576.Attached Files
Published - Genome_Res.-2012-Park-1626-33.pdf
Supplemental Material - SuppFigsS1_S12.pdf
Supplemental Material - SupplementalTable1.txt
Supplemental Material - SupplementalTable10.txt
Supplemental Material - SupplementalTable11.txt
Supplemental Material - SupplementalTable12.txt
Supplemental Material - SupplementalTable13.txt
Supplemental Material - SupplementalTable14.txt
Supplemental Material - SupplementalTable15.txt
Supplemental Material - SupplementalTable16.txt
Supplemental Material - SupplementalTable17.txt
Supplemental Material - SupplementalTable18.txt
Supplemental Material - SupplementalTable19.txt
Supplemental Material - SupplementalTable2.txt
Supplemental Material - SupplementalTable3.txt
Supplemental Material - SupplementalTable4.txt
Supplemental Material - SupplementalTable5.txt
Supplemental Material - SupplementalTable6.txt
Supplemental Material - SupplementalTable7.txt
Supplemental Material - SupplementalTable8.txt
Supplemental Material - SupplementalTable9.txt
Supplemental Material - SupplementaryTable20.xlsx
Files
Name | Size | Download all |
---|---|---|
md5:aadb5bb685b2bca26393141da032131b
|
1.1 MB | Preview Download |
md5:fea9848118335db0e17cde7b2494719a
|
515.5 kB | Preview Download |
md5:e30e80655949810142f7a7350022d93a
|
8.9 kB | Preview Download |
md5:b5a740a546d29b2978488bcad23c0405
|
22.1 kB | Preview Download |
md5:f7b77c43ece7f2fce9e65022b2f6ccd0
|
694 Bytes | Preview Download |
md5:cc13e5cf990b938a73b24474450cdd8c
|
20.6 kB | Preview Download |
md5:b852a62c1ed3ee0c80f41dba20d6cb1e
|
63.3 kB | Download |
md5:4915c4322bc5904ebb46c68d643e9350
|
22.3 kB | Preview Download |
md5:4cab4ae03c9181de17e11bc2dc6c4668
|
50.0 kB | Preview Download |
md5:ff4e5a78fd5f4c195454617243698d45
|
9.0 kB | Preview Download |
md5:c630c4f2ffbd30d87a771ed8d1a91bc7
|
52.3 kB | Preview Download |
md5:517afc0871146efb811838f8feed210a
|
814.2 kB | Preview Download |
md5:2a2cddf321ca9432309e79ba20a5f696
|
868 Bytes | Preview Download |
md5:5bd97805a5f35931de6cd0f9c6ec0480
|
2.0 kB | Preview Download |
md5:b7469e0ffa770ee633a5e3d2bb031e79
|
23.4 kB | Preview Download |
md5:fa70036318d5ad09e440e351450f3459
|
973 Bytes | Preview Download |
md5:3d931548841b64ca283a74f70442a3e4
|
47.7 kB | Preview Download |
md5:ac58ae771236736f9d61ab719439e362
|
31.7 kB | Preview Download |
md5:6cfc1882eae25a7b8b914ab3fa805f34
|
29.2 kB | Preview Download |
md5:d4836545527158a5e2873322de262aeb
|
22.7 kB | Preview Download |
md5:493e4b12d6621825c514f9e1ede6ba08
|
25.0 kB | Preview Download |
md5:467e0bc21eed362d4e88b69347420b44
|
18.2 kB | Preview Download |
Additional details
- PMCID
- PMC3431480
- Eprint ID
- 34515
- Resolver ID
- CaltechAUTHORS:20120927-130942776
- University of California, Irvine
- NIH
- P50 GM076516
- Army Research Office (ARO)
- Donald Bren Endowment
- NIH
- U54 HG004576
- Created
-
2012-09-27Created from EPrint's datestamp field
- Updated
-
2021-11-09Created from EPrint's last_modified field