Back

Reverse-engineering amyloid strains with generative protein design

Gadhe, L.; Konstantoulea, K.; Mazumder, A.; Chen, J.; Joachimiak, L. A.; Louros, N. N.

2026-05-11 biophysics
10.64898/2026.05.08.723915 bioRxiv
Show abstract

Amyloid fibrils are intrinsically polymorphic protein assemblies that form distinct structural strains linked to diverse biological and pathological outcomes. Yet, the principles governing how sequence encodes diverse fibril architectures, and the extent to which a given fold constrains underlying amino-acid sequence compatibility, remain poorly understood. Here, we apply generative protein design to directly interrogate the sequence-structure relationship of defined fibril architectures, using -synuclein (S), a protein known to form highly polymorphic amyloid fibrils, as a model system. Sampling sequence space under structural constraints reveals a continuous compatibility manifold in which diverse sequences encode a common amyloid architecture. De novo designed sequences assemble into fibrils, often with enhanced aggregation efficiency relative to S. A subset exhibits strain-like behaviour, including similar morphologies, efficient cross-templating, and induction of S cellular propagation, thereby functionally validating structural compatibility with the native fibril fold. Energetic analysis shows that stability is achieved through distinct but compensatory interactions, supporting a non-unique mapping between sequence and structure. Together, our results define a continuous and constrained compatibility landscape underlying amyloid strains, providing a framework for understanding the determinants of polymorphism and establishing generative protein design as a strategy to access this space, interrogate amyloid sequence-structure relationships, and engineer fibrillar protein assemblies and functional biomaterials.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 4%
21.6%
2
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 4%
13.8%
3
Science
429 papers in training set
Top 4%
8.1%
4
Nature Physics
39 papers in training set
Top 0.3%
6.1%
5
Nature Materials
21 papers in training set
Top 0.1%
4.7%
50% of probability mass above
6
Journal of the American Chemical Society
199 papers in training set
Top 2%
4.1%
7
Cell Systems
167 papers in training set
Top 4%
3.4%
8
ACS Nano
99 papers in training set
Top 1%
3.4%
9
Nature Chemistry
34 papers in training set
Top 0.2%
2.8%
10
Advanced Science
249 papers in training set
Top 8%
2.5%
11
Science Advances
1098 papers in training set
Top 14%
2.0%
12
eLife
5422 papers in training set
Top 37%
2.0%
13
Cell
370 papers in training set
Top 11%
1.8%
14
Nature Nanotechnology
30 papers in training set
Top 0.6%
1.8%
15
Nature
575 papers in training set
Top 11%
1.7%
16
Chemical Science
71 papers in training set
Top 0.9%
1.7%
17
Nature Structural & Molecular Biology
218 papers in training set
Top 3%
1.7%
18
Nature Biotechnology
147 papers in training set
Top 6%
1.3%
19
Nature Chemical Biology
104 papers in training set
Top 3%
1.2%
20
ACS Central Science
66 papers in training set
Top 2%
0.9%
21
Developmental Cell
168 papers in training set
Top 11%
0.9%
22
Neuron
282 papers in training set
Top 8%
0.9%
23
Nucleic Acids Research
1128 papers in training set
Top 17%
0.8%
24
Cell Reports
1338 papers in training set
Top 35%
0.7%
25
Angewandte Chemie International Edition
81 papers in training set
Top 4%
0.6%