MetMap Enables Genome-Scale Methyltyping for Determining Methylation States in Populations
Abstract
The ability to assay genome-scale methylation patterns using high-throughput sequencing makes it possible to carry out association studies to determine the relationship between epigenetic variation and phenotype. While bisulfite sequencing can determine a methylome at high resolution, cost inhibits its use in comparative and population studies. MethylSeq, based on sequencing of fragment ends produced by a methylation-sensitive restriction enzyme, is a method for methyltyping (survey of methylation states) and is a site-specific and cost-effective alternative to whole-genome bisulfite sequencing. Despite its advantages, the use of MethylSeq has been restricted by biases in MethylSeq data that complicate the determination of methyltypes. Here we introduce a statistical method, MetMap, that produces corrected site-specific methylation states from MethylSeq experiments and annotates unmethylated islands across the genome. MetMap integrates genome sequence information with experimental data, in a statistically sound and cohesive Bayesian Network. It infers the extent of methylation at individual CGs and across regions, and serves as a framework for comparative methylation analysis within and among species. We validated MetMap's inferences with direct bisulfite sequencing, showing that the methylation status of sites and islands is accurately inferred. We used MetMap to analyze MethylSeq data from four human neutrophil samples, identifying novel, highly unmethylated islands that are invisible to sequence-based annotation strategies. The combination of MethylSeq and MetMap is a powerful and cost-effective tool for determining genome-scale methyltypes suitable for comparative and association studies.
Additional Information
© 2010 Singer et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Received: April 1, 2010; Accepted: July 15, 2010; Published: August 19, 2010. This work was supported by the NIH grants HL084474 (DB), ES016581 (DM), CA115768 (DM). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Meromit Singer and Lior Pachter received no funding for this work. We thank Lu Zhang from Illumina, Inc., Hayward, CA for the MethylSeq data used in this manuscript, the ENCODE project for generation of the FAIRE datasets, Sriram Sankararaman for many enlightening discussions and careful feedback on the manuscript, and Cole Trapnell for critical reading of the manuscript. Author Contributions: Conceived and designed the experiments: MS DB DIKM LP. Performed the experiments: MS JD AS GPS DIKM. Analyzed the data: MS DB JD AS DIKM LP. Contributed reagents/materials/analysis tools: GPS. Wrote the paper: MS DB DIKM LP. The authors have declared that no competing interests exist.Attached Files
Published - journal.pcbi.1000888.PDF
Supplemental Material - journal.pcbi.1000888.s001.PDF
Supplemental Material - journal.pcbi.1000888.s002.PDF
Supplemental Material - journal.pcbi.1000888.s003.PDF
Supplemental Material - journal.pcbi.1000888.s004.PDF
Supplemental Material - journal.pcbi.1000888.s005.PDF
Files
Name | Size | Download all |
---|---|---|
md5:7e7c67a655cdd7bca676893598f4e957
|
21.2 kB | Preview Download |
md5:c55dffea91c75ae3ea3c3446e56c420c
|
1.0 MB | Preview Download |
md5:89c2cad9fe2664aaf7f362506be04e5c
|
451.7 kB | Preview Download |
md5:572c7a7d7ea4cc290428e4b47836a723
|
91.9 kB | Preview Download |
md5:827cee12beb119bd20701a536e2278aa
|
243.7 kB | Preview Download |
md5:1c6d836370e4daf092f3d32749fad434
|
3.5 MB | Preview Download |
Additional details
- PMCID
- PMC2924245
- Eprint ID
- 74788
- Resolver ID
- CaltechAUTHORS:20170306-121310345
- NIH
- HL084474
- NIH
- ES016581
- NIH
- CA115768
- Created
-
2017-03-06Created from EPrint's datestamp field
- Updated
-
2021-11-11Created from EPrint's last_modified field