Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published October 2012 | Published + Supplemental Material
Journal Article Open

Random Field Model Reveals Structure of the Protein Recombinational Landscape

Abstract

We are interested in how intragenic recombination contributes to the evolution of proteins and how this mechanism complements and enhances the diversity generated by random mutation. Experiments have revealed that proteins are highly tolerant to recombination with homologous sequences (mutation by recombination is conservative); more surprisingly, they have also shown that homologous sequence fragments make largely additive contributions to biophysical properties such as stability. Here, we develop a random field model to describe the statistical features of the subset of protein space accessible by recombination, which we refer to as the recombinational landscape. This model shows quantitative agreement with experimental results compiled from eight libraries of proteins that were generated by recombining gene fragments from homologous proteins. The model reveals a recombinational landscape that is highly enriched in functional sequences, with properties dominated by a large-scale additive structure. It also quantifies the relative contributions of parent sequence identity, crossover locations, and protein fold to the tolerance of proteins to recombination. Intragenic recombination explores a unique subset of sequence space that promotes rapid molecular diversification and functional adaptation.

Additional Information

© 2012 Romero, Arnold. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Received June 5, 2012; Accepted August 3, 2012; Published October 4, 2012. Funding: The authors acknowledge support from the National Institutes of Health, ARRA (grant R01 GM068664) for funding the theoretical and P450 chimera work, and the U.S. Army Research Office, Institute for Collaborative Biotechnologies (grant W911NF-09-D-0001) for funding design and construction of the cellulase libraries. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. We thank Zhen-Gang Wang and M. Shum for helpful discussions, and D. A. Drummond for feedback on the manuscript. Author Contributions: Analyzed the data: PAR FHA. Wrote the paper: PAR FHA.

Attached Files

Published - journal.pcbi.1002713.pdf

Supplemental Material - Figure_S1.tiff

Supplemental Material - Text_S1.pdf

Files

journal.pcbi.1002713.pdf
Files (1.7 MB)
Name Size Download all
md5:9c9dc0280aa6bd1274bb7411f2aa4a86
829.6 kB Preview Download
md5:ce89ee357647d69b1eb72589229069c2
748.1 kB Preview Download
md5:372a0a971b7019141979c1743f70b5ae
99.9 kB Preview Download

Additional details

Created:
August 19, 2023
Modified:
October 23, 2023