Back

Accurate Macromolecular Complex Modeling for Cryo-EM with CryoZeta

Zhang, Z.; Li, S.; Farheen, F.; Kagaya, Y.; Liu, B.; Ibtehaz, N.; Terashi, G.; Nakamura, T.; Zhu, H.; Khan, K.; Zhang, Y.; Kihara, D.

2026-02-16 bioinformatics
10.64898/2026.02.13.705846 bioRxiv
Show abstract

Cryogenic electron microscopy (cryo-EM) has become a widely used technique for determining the three-dimensional structures of biological macromolecules. Despite its advantages, building accurate structural models from cryo-EM data remains challenging, particularly at non-atomic resolutions. Here, we present CryoZeta, a de novo structure modeling program that leverages a diffusion-based generative deep neural network to integrate cryo-EM map density features with a biomolecular structure prediction pipeline similar to Alphafold3. By jointly leveraging sequence information and density-based features, CryoZeta generates highly accurate structural models that are consistent with the experimental map density. Evaluated on benchmark datasets covering protein complexes, protein-nucleic acid assemblies, and nucleic acid-only systems at resolutions up to 10 [A], CryoZeta consistently outperforms existing cryo-EM modeling methods in atomic accuracy. These results highlight the benefits of directly incorporating cryo-EM density into modern structure prediction pipelines and establish the method as a robust tool for automated, high-fidelity modeling from cryo-EM maps.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 4%
22.1%
2
Journal of Structural Biology
58 papers in training set
Top 0.1%
12.1%
3
Structure
175 papers in training set
Top 0.2%
9.9%
4
Communications Biology
886 papers in training set
Top 0.1%
8.2%
50% of probability mass above
5
Nature Methods
336 papers in training set
Top 1%
7.0%
6
Bioinformatics
1061 papers in training set
Top 5%
3.9%
7
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 24%
2.8%
8
Nature Biotechnology
147 papers in training set
Top 3%
2.7%
9
Science
429 papers in training set
Top 11%
2.5%
10
Advanced Science
249 papers in training set
Top 9%
2.0%
11
Nature Computational Science
50 papers in training set
Top 0.7%
1.7%
12
Scientific Reports
3102 papers in training set
Top 60%
1.7%
13
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.5%
14
Cell Systems
167 papers in training set
Top 9%
1.3%
15
Cell Reports Methods
141 papers in training set
Top 3%
1.2%
16
The Innovation
12 papers in training set
Top 0.7%
0.9%
17
Nature Machine Intelligence
61 papers in training set
Top 3%
0.9%
18
Science Advances
1098 papers in training set
Top 27%
0.9%
19
eLife
5422 papers in training set
Top 54%
0.9%
20
Cell Research
49 papers in training set
Top 2%
0.8%
21
Communications Chemistry
39 papers in training set
Top 1%
0.8%
22
Nano Letters
63 papers in training set
Top 3%
0.7%
23
Acta Crystallographica Section D Structural Biology
54 papers in training set
Top 0.4%
0.7%
24
Computational and Structural Biotechnology Journal
216 papers in training set
Top 10%
0.7%
25
Protein Science
221 papers in training set
Top 2%
0.7%
26
Biophysical Journal
545 papers in training set
Top 6%
0.6%
27
The American Journal of Human Genetics
206 papers in training set
Top 4%
0.6%
28
PLOS ONE
4510 papers in training set
Top 72%
0.6%