Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published April 15, 2022 | Published + Submitted + Supplemental Material
Journal Article Open

MAVE-NN: learning genotype-phenotype maps from multiplex assays of variant effect

Abstract

Multiplex assays of variant effect (MAVEs) are a family of methods that includes deep mutational scanning experiments on proteins and massively parallel reporter assays on gene regulatory sequences. Despite their increasing popularity, a general strategy for inferring quantitative models of genotype-phenotype maps from MAVE data is lacking. Here we introduce MAVE-NN, a neural-network-based Python package that implements a broadly applicable information-theoretic framework for learning genotype-phenotype maps—including biophysically interpretable models—from MAVE datasets. We demonstrate MAVE-NN in multiple biological contexts, and highlight the ability of our approach to deconvolve mutational effects from otherwise confounding experimental nonlinearities and noise.

Additional Information

© The Author(s) 2022. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. Received 16 July 2021; Accepted 24 March 2022; Published 15 April 2022. This work was supported by NIH grant R35GM133777 (awarded to JBK), NIH Grant R35GM133613 (awarded to DMM), an Alfred P. Sloan Research Fellowship (awarded to DMM), a grant from the CSHL/Northwell Health partnership, and funding from the Simons Center for Quantitative Biology at Cold Spring Harbor Laboratory. Review history: The review history is available as Additional file 2. Peer review information: Tim Sands was the primary editor of this article and managed its editorial process and peer review in collaboration with the rest of the editorial team. Contributions: AT, WTI, DMM, and JBK conceived the project. AT and JBK wrote the software with assistance from AP and MK. WTI and JBK wrote a preliminary version of the software. AT, MK, and JBK performed the data analysis. JBK, AT, and DMM wrote the manuscript with contributions from MK and AP. All author(s) read and approved the final manuscript. Ethics approval and consent to participate: Not applicable. The authors declare that they have no competing interests.

Attached Files

Published - 13059_2022_Article_2661.pdf

Submitted - 2020.07.14.201475v4.full.pdf

Supplemental Material - 13059_2022_2661_MOESM1_ESM.pdf

Supplemental Material - 13059_2022_2661_MOESM2_ESM.docx

Files

13059_2022_2661_MOESM1_ESM.pdf
Files (9.7 MB)
Name Size Download all
md5:aac7add80eab12cef6be7584eb0cd737
1.3 MB Preview Download
md5:8f37e4e62ab538342b054bdc76cc165f
3.1 MB Preview Download
md5:7544a2df76d526dfb7e282b7b7aee5e4
5.3 MB Preview Download
md5:815c2ec4c0ad0e7a5c5435d6f053b8d1
36.5 kB Download

Additional details

Created:
August 22, 2023
Modified:
October 23, 2023