Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published February 2017 | Published + Supplemental Material
Journal Article Open

Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression

Abstract

Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C.

Additional Information

© 2017 Poole et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Received: April 10, 2016; Accepted: January 4, 2017; Published: February 7, 2017. This work was supported by grant U24CA143835 from the National Cancer Institute, http://www.cancer.gov/, and grant P50GM076547 from The Center For Systems Biology, http://www.centerforsystemsbiology.org/. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Availability: In order to facilitate exploration of our data, including mutation clusters and pathway associations, we have created an interactive graphical website: m2c.systemsbiology.net. The multiscale mutation clustering algorithm has also been made publicly available: https://github.com/IlyaLab/M2C. All significant results from our pipeline (including those from other methods) can be found as a multi-tabbed excel document S1 Tables. These same tables are also available as TSV's from our website. Detailed descriptions of the tables and the data they contain are in S1 Table Descriptions. Data Availability: All Metadata and analyses are included as supplemental information. TCGA-related data can be downloaded from: http://ezid.cdlib.org/id/doi:10.7908/C1K64H78 or http://gdac.broadinstitute.org/runs/analyses__2014_10_17/data/. Drug response data are available from GDSC: http://www.cancerrxgene.org/downloads. Author Contributions Conceptualization: WP IS TAK BB. Data curation: WP KL. Formal analysis: WP TAK BB. Funding acquisition: IS. Investigation: WP TAK BB. Methodology: WP TAK BB. Project administration: IS. Resources: TAK BB IS. Software: WP KL TAK BB. Supervision: IS TAK BB. Validation: WP TAK BB. Visualization: WP KL TAK BB. Writing – original draft: WP KL IS TAK BB. Writing – review & editing: WP KL IS TAK BB. The authors have declared that no competing interests exist.

Attached Files

Published - journal.pcbi.1005347.pdf

Supplemental Material - journal.pcbi.1005347.s001.tif

Supplemental Material - journal.pcbi.1005347.s002.tif

Supplemental Material - journal.pcbi.1005347.s003.tif

Supplemental Material - journal.pcbi.1005347.s004.tif

Supplemental Material - journal.pcbi.1005347.s005.tif

Supplemental Material - journal.pcbi.1005347.s006.xlsx

Supplemental Material - journal.pcbi.1005347.s007.docx

Supplemental Material - journal.pcbi.1005347.s008.docx

Files

journal.pcbi.1005347.pdf
Files (23.8 MB)
Name Size Download all
md5:cb440d10fbade817b9b8c31513ab5623
4.7 MB Preview Download
md5:14b3b79bce7f6d14d2dcd8ee893753d7
7.2 MB Download
md5:ec4ce9dad06c9d3949ae9f45e855a1a8
3.2 MB Preview Download
md5:159d59fa889f831c3615e4e41a9d70dd
27.7 kB Download
md5:7439a5e7bff35c3bb6fa28b6c5317d26
1.9 MB Preview Download
md5:40737af52c0ea1aa1778de00a4ce2fd4
32.5 kB Download
md5:ed2ef3054d6223ec58828a67e67c69ca
1.9 MB Preview Download
md5:f9369e7d48ec84017a6caf404572a41a
2.2 MB Preview Download
md5:7cea804e224bf71e0eef2c7a9775caa6
2.6 MB Preview Download

Additional details

Created:
August 19, 2023
Modified:
October 25, 2023