Back

A Hierarchical Robust Linear Model for Cryo-EM Map Analysis

Tu, I.-P.; Zheng, S.-C.; Lien, Y.-H.; Lin, S. H.; Lin, P.-C.; Chang, W.-H.

2025-07-14 bioinformatics
10.1101/2025.07.10.664269 bioRxiv
Show abstract

Cryo-electron microscopy (cryo-EM) has become a pivotal tool for determining the atomic structures of biological macromolecules. In this study, we introduce a robust hierarchical linear (RHL) model to estimate key atom-specific parameters: the amplitude and width of Gaussian functions, which are typically simplified using uniform widths and amplitudes scaled by atomic number in cryo-EM map related studies. Our RHL framework incorporates minimum density power divergence estimation (MDPDE) to account for heteroscedasticity and enhance robustness against outliers. Through both simulation studies and real data analysis, we demonstrate that the proposed method effectively reduces the influence of outliers and yields reliable parameter estimates. When applied to cryo-EM data of human apoferritin (PDB ID: 6Z6U; EMDB ID: 11103), our model reveals that the estimated Gaussian parameters are stable across most amino acids, with nitrogen atoms consistently displaying lower amplitude and width values than predicted by conventional Gaussian modeling. These results underscore the need for a systematic analysis of paired cryo-EM maps and atomic models from the EMDB and PDB to gain deeper insights into atom-specific features embedded in cryo-EM data.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Journal of Structural Biology
58 papers in training set
Top 0.1%
22.1%
2
Communications Biology
886 papers in training set
Top 0.1%
14.1%
3
Bioinformatics
1061 papers in training set
Top 4%
7.0%
4
Briefings in Bioinformatics
326 papers in training set
Top 0.9%
6.3%
5
PLOS Computational Biology
1633 papers in training set
Top 10%
3.6%
50% of probability mass above
6
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
3.5%
7
Nature Communications
4913 papers in training set
Top 43%
3.0%
8
Scientific Reports
3102 papers in training set
Top 44%
2.7%
9
Biophysical Journal
545 papers in training set
Top 2%
2.6%
10
Advanced Science
249 papers in training set
Top 9%
2.0%
11
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
1.7%
12
IUCrJ
29 papers in training set
Top 0.2%
1.5%
13
Acta Crystallographica Section D Structural Biology
54 papers in training set
Top 0.2%
1.5%
14
NeuroImage
813 papers in training set
Top 4%
1.5%
15
The Journal of Physical Chemistry B
158 papers in training set
Top 1%
1.5%
16
PLOS ONE
4510 papers in training set
Top 59%
1.3%
17
Frontiers in Molecular Biosciences
100 papers in training set
Top 3%
1.2%
18
The Innovation
12 papers in training set
Top 0.6%
1.2%
19
BMC Bioinformatics
383 papers in training set
Top 6%
1.1%
20
Nature Computational Science
50 papers in training set
Top 1%
1.1%
21
Communications Chemistry
39 papers in training set
Top 0.8%
0.9%
22
The Journal of Physical Chemistry Letters
58 papers in training set
Top 1%
0.8%
23
Structure
175 papers in training set
Top 3%
0.7%
24
Protein Science
221 papers in training set
Top 2%
0.7%
25
eLife
5422 papers in training set
Top 60%
0.7%
26
International Journal of Molecular Sciences
453 papers in training set
Top 17%
0.7%
27
Nature Methods
336 papers in training set
Top 6%
0.7%
28
Cell Reports Methods
141 papers in training set
Top 6%
0.6%
29
Nano Letters
63 papers in training set
Top 3%
0.6%
30
Viruses
318 papers in training set
Top 6%
0.6%