Random Field Model Reveals Structure of the Protein Recombinational Landscape
- Creators
-
Romero, Philip A.
-
Arnold, Frances H.
Abstract
We are interested in how intragenic recombination contributes to the evolution of proteins and how this mechanism complements and enhances the diversity generated by random mutation. Experiments have revealed that proteins are highly tolerant to recombination with homologous sequences (mutation by recombination is conservative); more surprisingly, they have also shown that homologous sequence fragments make largely additive contributions to biophysical properties such as stability. Here, we develop a random field model to describe the statistical features of the subset of protein space accessible by recombination, which we refer to as the recombinational landscape. This model shows quantitative agreement with experimental results compiled from eight libraries of proteins that were generated by recombining gene fragments from homologous proteins. The model reveals a recombinational landscape that is highly enriched in functional sequences, with properties dominated by a large-scale additive structure. It also quantifies the relative contributions of parent sequence identity, crossover locations, and protein fold to the tolerance of proteins to recombination. Intragenic recombination explores a unique subset of sequence space that promotes rapid molecular diversification and functional adaptation.
Additional Information
© 2012 Romero, Arnold. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Received June 5, 2012; Accepted August 3, 2012; Published October 4, 2012. Funding: The authors acknowledge support from the National Institutes of Health, ARRA (grant R01 GM068664) for funding the theoretical and P450 chimera work, and the U.S. Army Research Office, Institute for Collaborative Biotechnologies (grant W911NF-09-D-0001) for funding design and construction of the cellulase libraries. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. We thank Zhen-Gang Wang and M. Shum for helpful discussions, and D. A. Drummond for feedback on the manuscript. Author Contributions: Analyzed the data: PAR FHA. Wrote the paper: PAR FHA.Attached Files
Published - journal.pcbi.1002713.pdf
Supplemental Material - Figure_S1.tiff
Supplemental Material - Text_S1.pdf
Files
Additional details
- PMCID
- PMC3464211
- Eprint ID
- 37014
- Resolver ID
- CaltechAUTHORS:20130220-094932746
- NIH
- R01 GM068664-01
- Army Research Office (ARO)
- W911NF-09-D-0001
- Created
-
2013-02-22Created from EPrint's datestamp field
- Updated
-
2021-11-09Created from EPrint's last_modified field