Improved prediction of site-rates from structure with averaging across homologs
(2024) In Protein Science 33(7).- Abstract
Variation in mutation rates at sites in proteins can largely be understood by the constraint that proteins must fold into stable structures. Models that calculate site-specific rates based on protein structure and a thermodynamic stability model have shown a significant but modest ability to predict empirical site-specific rates calculated from sequence. Models that use detailed atomistic models of protein energetics do not outperform simpler approaches using packing density. We demonstrate that a fundamental reason for this is that empirical site-specific rates are the result of the average effect of many different microenvironments in a phylogeny. By analyzing the results of evolutionary dynamics simulations, we show how averaging... (More)
Variation in mutation rates at sites in proteins can largely be understood by the constraint that proteins must fold into stable structures. Models that calculate site-specific rates based on protein structure and a thermodynamic stability model have shown a significant but modest ability to predict empirical site-specific rates calculated from sequence. Models that use detailed atomistic models of protein energetics do not outperform simpler approaches using packing density. We demonstrate that a fundamental reason for this is that empirical site-specific rates are the result of the average effect of many different microenvironments in a phylogeny. By analyzing the results of evolutionary dynamics simulations, we show how averaging site-specific rates across many extant protein structures can lead to correct recovery of site-rate prediction. This result is also demonstrated in natural protein sequences and experimental structures. Using predicted structures, we demonstrate that atomistic models can improve upon contact density metrics in predicting site-specific rates from a structure. The results give fundamental insights into the factors governing the distribution of site-specific rates in protein families.
(Less)
- author
- Norn, Christoffer LU ; Oliveira, Fábio LU and André, Ingemar LU
- organization
- publishing date
- 2024-07
- type
- Contribution to journal
- publication status
- published
- subject
- keywords
- evolutionary dynamics, evolutionary rates in proteins, protein stability, structure prediction
- in
- Protein Science
- volume
- 33
- issue
- 7
- article number
- e5086
- publisher
- The Protein Society
- external identifiers
-
- pmid:38923241
- scopus:85196713112
- ISSN
- 0961-8368
- DOI
- 10.1002/pro.5086
- language
- English
- LU publication?
- yes
- id
- 95b9e2dd-7ebc-4673-9137-e21201e96250
- date added to LUP
- 2024-08-14 13:37:42
- date last changed
- 2024-10-09 21:59:50
@article{95b9e2dd-7ebc-4673-9137-e21201e96250, abstract = {{<p>Variation in mutation rates at sites in proteins can largely be understood by the constraint that proteins must fold into stable structures. Models that calculate site-specific rates based on protein structure and a thermodynamic stability model have shown a significant but modest ability to predict empirical site-specific rates calculated from sequence. Models that use detailed atomistic models of protein energetics do not outperform simpler approaches using packing density. We demonstrate that a fundamental reason for this is that empirical site-specific rates are the result of the average effect of many different microenvironments in a phylogeny. By analyzing the results of evolutionary dynamics simulations, we show how averaging site-specific rates across many extant protein structures can lead to correct recovery of site-rate prediction. This result is also demonstrated in natural protein sequences and experimental structures. Using predicted structures, we demonstrate that atomistic models can improve upon contact density metrics in predicting site-specific rates from a structure. The results give fundamental insights into the factors governing the distribution of site-specific rates in protein families.</p>}}, author = {{Norn, Christoffer and Oliveira, Fábio and André, Ingemar}}, issn = {{0961-8368}}, keywords = {{evolutionary dynamics; evolutionary rates in proteins; protein stability; structure prediction}}, language = {{eng}}, number = {{7}}, publisher = {{The Protein Society}}, series = {{Protein Science}}, title = {{Improved prediction of site-rates from structure with averaging across homologs}}, url = {{http://dx.doi.org/10.1002/pro.5086}}, doi = {{10.1002/pro.5086}}, volume = {{33}}, year = {{2024}}, }