Back

Symmetric Self-play Online Preference Optimization for Protein Inverse Folding

Zeng, W.; Li, X.; Zou, H.; Dou, Y.; Zhao, X.; Peng, S.

2026-03-30 bioinformatics
10.64898/2026.03.26.714453 bioRxiv
Show abstract

Multi-objective reinforcement learning based on predicted structure feedback has been introduced into protein inverse folding. However, existing methods typically rely on a single model to optimize multiple structural objectives via a scalarized reward, which can bias the optimization toward dominant objectives and limit the exploration of diverse solutions. Here, we propose a online Symmetric Self-play Preference Optimization (SSP) framework that decouples the optimization of multiple structural objectives by training separate preference models with distinct reward signals, while enabling interaction through a shared sampling pool. This design allows the models to explore diverse optimization trajectories without enforcing a single dominant direction. Extensive experiments on both natural and de novo binder backbone inverse folding tasks demonstrate that SSP consistently improves sequence design self-consistency compared to single-model and existing baselines. Further analysis shows that different structural objectives are only partially aligned and induce distinct optimization directions, as evidenced by metric correlation and white-box analyses. This supports the effectiveness of decoupling objectives to enable higher design quality in protein design.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 8%
17.7%
2
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.6%
9.8%
3
Briefings in Bioinformatics
326 papers in training set
Top 1.0%
6.2%
4
Cell Systems
167 papers in training set
Top 2%
6.1%
5
Bioinformatics
1061 papers in training set
Top 4%
6.1%
6
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.2%
4.2%
50% of probability mass above
7
Nature Machine Intelligence
61 papers in training set
Top 0.8%
3.8%
8
PLOS Computational Biology
1633 papers in training set
Top 9%
3.8%
9
Communications Biology
886 papers in training set
Top 2%
3.5%
10
Advanced Science
249 papers in training set
Top 6%
3.5%
11
The Journal of Physical Chemistry B
158 papers in training set
Top 0.9%
2.0%
12
Nucleic Acids Research
1128 papers in training set
Top 10%
1.8%
13
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 34%
1.6%
14
Protein Science
221 papers in training set
Top 1.0%
1.6%
15
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.6%
16
Journal of Cheminformatics
25 papers in training set
Top 0.4%
1.4%
17
Communications Chemistry
39 papers in training set
Top 0.4%
1.4%
18
Scientific Reports
3102 papers in training set
Top 67%
1.2%
19
eLife
5422 papers in training set
Top 50%
1.2%
20
Bioinformatics Advances
184 papers in training set
Top 4%
1.1%
21
National Science Review
22 papers in training set
Top 2%
0.8%
22
Cell Reports Methods
141 papers in training set
Top 5%
0.8%
23
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
24
ACS Synthetic Biology
256 papers in training set
Top 3%
0.7%
25
The Journal of Physical Chemistry Letters
58 papers in training set
Top 2%
0.7%
26
Nature Biotechnology
147 papers in training set
Top 8%
0.7%
27
Structure
175 papers in training set
Top 3%
0.7%
28
Journal of Molecular Biology
217 papers in training set
Top 4%
0.7%
29
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 1%
0.7%
30
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 7%
0.6%