Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published April 29, 2014 | Supplemental Material + Published
Journal Article Open

Defining functional DNA elements in the human genome

Abstract

With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease.

Additional Information

© 2014 National Academy of Sciences. Freely available online through the PNAS open access option. Edited by Robert Haselkorn, University of Chicago, Chicago, IL, and approved January 29, 2014 (received for review October 16, 2013). M.K., B.W., M.P.S., B.E.B., and R.C.H. contributed equally to this work. A.K., G.K.M., and L.D.W. contributed equally to this work. Author contributions: M.K., B.W., M.P.S., B.E.B., and R.C.H. designed research; M.K., B.W., M.P.S., B.E.B., A.K., G.K.M., L.D.W., and R.C.H. performed research; A.K., G.K.M., and L.D.W. contributed computational analysis and tools; M.K., B.W., M.P.S., B.E.B., E.B., G.E.C., J.D., I.D., L.L.E., P.J.F., E.A.F., M.G., M.C.G., D.M.G., T.R.G., E.D.G., R.G., T.H., J.K., J.D.L., R.M.M., M.J.P., B.R., J.A.S., Z.W., K.P.W., and R.C.H. contributed to manuscript discussions and ideas; and M.K., B.W., M.P.S., B.E.B., and R.C.H. wrote the paper. The authors declare no conflict of interest. This article is a PNAS Direct Submission. Data deposition: In addition to data already released via the ENCODE Data Coordinating Center, the erythroblast DNase-seq data reported in this paper have been deposited in the Gene Expression Omnibus (GEO) database, www.ncbi.nlm.nih.gov/geo (accession nos. GSE55579, GSM1339559, and GSM1339560). Authored by members of the ENCODE Consortium. This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1318948111/-/DCSupplemental.

Attached Files

Published - PNAS-2014-Kellis-6131-8.pdf

Supplemental Material - pnas.201318948SI.pdf

Files

PNAS-2014-Kellis-6131-8.pdf
Files (1.3 MB)
Name Size Download all
md5:78ae761a9f5bb214625e16f54288ff3a
885.1 kB Preview Download
md5:025afd96fcff6c0d892942bfd43ffe51
385.3 kB Preview Download

Additional details

Created:
August 20, 2023
Modified:
October 26, 2023