Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Improved prediction of site-rates from structure with averaging across homologs

Norn, Christoffer LU ; Oliveira, Fábio LU and André, Ingemar LU orcid (2024) In Protein Science 33(7).
Abstract

Variation in mutation rates at sites in proteins can largely be understood by the constraint that proteins must fold into stable structures. Models that calculate site-specific rates based on protein structure and a thermodynamic stability model have shown a significant but modest ability to predict empirical site-specific rates calculated from sequence. Models that use detailed atomistic models of protein energetics do not outperform simpler approaches using packing density. We demonstrate that a fundamental reason for this is that empirical site-specific rates are the result of the average effect of many different microenvironments in a phylogeny. By analyzing the results of evolutionary dynamics simulations, we show how averaging... (More)

Variation in mutation rates at sites in proteins can largely be understood by the constraint that proteins must fold into stable structures. Models that calculate site-specific rates based on protein structure and a thermodynamic stability model have shown a significant but modest ability to predict empirical site-specific rates calculated from sequence. Models that use detailed atomistic models of protein energetics do not outperform simpler approaches using packing density. We demonstrate that a fundamental reason for this is that empirical site-specific rates are the result of the average effect of many different microenvironments in a phylogeny. By analyzing the results of evolutionary dynamics simulations, we show how averaging site-specific rates across many extant protein structures can lead to correct recovery of site-rate prediction. This result is also demonstrated in natural protein sequences and experimental structures. Using predicted structures, we demonstrate that atomistic models can improve upon contact density metrics in predicting site-specific rates from a structure. The results give fundamental insights into the factors governing the distribution of site-specific rates in protein families.

(Less)
Please use this url to cite or link to this publication:
author
; and
organization
publishing date
type
Contribution to journal
publication status
published
subject
keywords
evolutionary dynamics, evolutionary rates in proteins, protein stability, structure prediction
in
Protein Science
volume
33
issue
7
article number
e5086
publisher
The Protein Society
external identifiers
  • scopus:85196713112
  • pmid:38923241
ISSN
0961-8368
DOI
10.1002/pro.5086
language
English
LU publication?
yes
id
95b9e2dd-7ebc-4673-9137-e21201e96250
date added to LUP
2024-08-14 13:37:42
date last changed
2024-08-14 13:38:07
@article{95b9e2dd-7ebc-4673-9137-e21201e96250,
  abstract     = {{<p>Variation in mutation rates at sites in proteins can largely be understood by the constraint that proteins must fold into stable structures. Models that calculate site-specific rates based on protein structure and a thermodynamic stability model have shown a significant but modest ability to predict empirical site-specific rates calculated from sequence. Models that use detailed atomistic models of protein energetics do not outperform simpler approaches using packing density. We demonstrate that a fundamental reason for this is that empirical site-specific rates are the result of the average effect of many different microenvironments in a phylogeny. By analyzing the results of evolutionary dynamics simulations, we show how averaging site-specific rates across many extant protein structures can lead to correct recovery of site-rate prediction. This result is also demonstrated in natural protein sequences and experimental structures. Using predicted structures, we demonstrate that atomistic models can improve upon contact density metrics in predicting site-specific rates from a structure. The results give fundamental insights into the factors governing the distribution of site-specific rates in protein families.</p>}},
  author       = {{Norn, Christoffer and Oliveira, Fábio and André, Ingemar}},
  issn         = {{0961-8368}},
  keywords     = {{evolutionary dynamics; evolutionary rates in proteins; protein stability; structure prediction}},
  language     = {{eng}},
  number       = {{7}},
  publisher    = {{The Protein Society}},
  series       = {{Protein Science}},
  title        = {{Improved prediction of site-rates from structure with averaging across homologs}},
  url          = {{http://dx.doi.org/10.1002/pro.5086}},
  doi          = {{10.1002/pro.5086}},
  volume       = {{33}},
  year         = {{2024}},
}