Back

Adaptive and Spandrel-like Constraints at Functional Sites in Protein Folds

Poley-Gil, M.; Fernandez-Martin, M.; Banka, A.; Heinzinger, M.; Rost, B.; Valencia, A.; Parra, R. G.

2026-02-11 bioinformatics
10.64898/2026.02.09.704872 bioRxiv
Show abstract

Understanding the relationships among amino acid sequences, structures and functions in proteins and how they evolve, remains a central challenge in molecular biology. It is still unclear which sequence elements differentially contribute to structural integrity or molecular function. Even more, there are ongoing debates on whether protein folds emerge as a result of evolution or as a consequence of physical laws. The energy landscapes theory states that proteins are minimally frustrated systems, i.e. they fold by minimising their energetic conflicts. However, some local frustration, believed to be selected for functional reasons, remains in the native state of proteins. Here, we combine reverse folding and structure prediction methods with sequence and local frustration analysis to address the aforementioned ideas. We found that reverse folding techniques are unable to erase evolutionary conserved frustration from certain residues, even when detrimental for structural integrity. We propose that certain frustration hotspots behave like architectural spandrels, not directly shaped by selection but emerging from physical constraints in protein folds which evolution can later co-opt for function. Our results provide a new perspective revealing how sequence variation and functional specificity could evolve from evolutionary, structural and biophysical constraints.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 0.5%
23.1%
2
The Journal of Physical Chemistry B
158 papers in training set
Top 0.1%
10.3%
3
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.5%
6.5%
4
eLife
5422 papers in training set
Top 21%
4.1%
5
Biophysical Journal
545 papers in training set
Top 1%
4.1%
6
Molecular Biology and Evolution
488 papers in training set
Top 1%
3.8%
50% of probability mass above
7
Nature Communications
4913 papers in training set
Top 38%
3.7%
8
Journal of Molecular Biology
217 papers in training set
Top 0.5%
3.7%
9
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.2%
3.3%
10
Scientific Reports
3102 papers in training set
Top 42%
3.0%
11
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 25%
2.7%
12
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
2.1%
13
PLOS ONE
4510 papers in training set
Top 51%
1.8%
14
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.5%
1.7%
15
Protein Science
221 papers in training set
Top 1.0%
1.5%
16
Journal of Structural Biology
58 papers in training set
Top 0.8%
1.5%
17
The Journal of Physical Chemistry Letters
58 papers in training set
Top 1.0%
1.4%
18
iScience
1063 papers in training set
Top 24%
1.0%
19
Structure
175 papers in training set
Top 3%
1.0%
20
Cell Systems
167 papers in training set
Top 10%
1.0%
21
Physical Biology
43 papers in training set
Top 2%
0.9%
22
Communications Biology
886 papers in training set
Top 18%
0.9%
23
Biomolecules
95 papers in training set
Top 1%
0.9%
24
PLOS Genetics
756 papers in training set
Top 14%
0.8%
25
Nucleic Acids Research
1128 papers in training set
Top 16%
0.8%
26
Current Research in Structural Biology
11 papers in training set
Top 0.1%
0.8%
27
International Journal of Molecular Sciences
453 papers in training set
Top 16%
0.7%
28
Genetics
225 papers in training set
Top 4%
0.7%
29
Frontiers in Molecular Biosciences
100 papers in training set
Top 5%
0.7%
30
National Science Review
22 papers in training set
Top 2%
0.7%