TY - JOUR
T1 - Structural heterogeneity and precision of implications drawn from cryo-electron microscopy structures
T2 - SARS-CoV-2 spike-protein mutations as a test case
AU - Mehra, Rukmankesh
AU - Kepp, Kasper P.
PY - 2022
Y1 - 2022
N2 - Protein structures may be used to draw functional implications at the residue level, but how sensitive are these implications to the exact structure used? Calculation of the effects of SARS-CoV-2 S-protein mutations based on experimental cryo-electron microscopy structures have been abundant during the pandemic. To understand the precision of such estimates, we studied three distinct methods to estimate stability changes for all possible mutations in 23 different S-protein structures (3.69 million ΔΔG values in total) and explored how random and systematic errors can be remedied by structure-averaged mutation group comparisons. We show that computational estimates have low precision, due to method and structure heterogeneity making results for single mutations uninformative. However, structure-averaged differences in mean effects for groups of substitutions can yield significant results. Illustrating this protocol, functionally important natural mutations, despite individual variations, average to a smaller stability impact compared to other possible mutations, independent of conformational state (open, closed). In summary, we document substantial issues with precision in structure-based protein modeling and recommend sensitivity tests to quantify these effects, but also suggest partial solutions to the problem in the form of structure-averaged "ensemble" estimates for groups of residues when multiple structures are available.
AB - Protein structures may be used to draw functional implications at the residue level, but how sensitive are these implications to the exact structure used? Calculation of the effects of SARS-CoV-2 S-protein mutations based on experimental cryo-electron microscopy structures have been abundant during the pandemic. To understand the precision of such estimates, we studied three distinct methods to estimate stability changes for all possible mutations in 23 different S-protein structures (3.69 million ΔΔG values in total) and explored how random and systematic errors can be remedied by structure-averaged mutation group comparisons. We show that computational estimates have low precision, due to method and structure heterogeneity making results for single mutations uninformative. However, structure-averaged differences in mean effects for groups of substitutions can yield significant results. Illustrating this protocol, functionally important natural mutations, despite individual variations, average to a smaller stability impact compared to other possible mutations, independent of conformational state (open, closed). In summary, we document substantial issues with precision in structure-based protein modeling and recommend sensitivity tests to quantify these effects, but also suggest partial solutions to the problem in the form of structure-averaged "ensemble" estimates for groups of residues when multiple structures are available.
KW - Cryo-electron microscopy
KW - Structural heterogeneity
KW - Mutations
KW - Computer models
KW - Spike protein
U2 - 10.1007/s00249-022-01619-8
DO - 10.1007/s00249-022-01619-8
M3 - Journal article
C2 - 36167828
SN - 0175-7571
VL - 51
SP - 555
EP - 568
JO - European Biophysics Journal
JF - European Biophysics Journal
ER -