Back

Identification of Low Population States in Cryo-EM Using Deep Learning

Fraser, A.; Prokhorov, N. S.; Miller, J. M.; Knyazhanskaya, E. S.; Leiman, P. G.

2021-11-07 biophysics
10.1101/2021.11.06.467553 bioRxiv
Show abstract

Cryo-EM has made extraordinary headway towards becoming a semi-automated, high-throughput structure determination technique. In the general workflow, high-to-medium population states are grouped into two- and three-dimensional classes, from which structures can be obtained with near-atomic resolution and subsequently analyzed to interpret function. However, low population states, which are also functionally important, are often discarded. Here, we describe a technique whereby low population states can be efficiently identified with minimal human effort via a deep convolutional neural network classifier. We use this deep learning classifier to describe a transient, low population state of bacteriophage A511 in the midst of infecting its bacterial host. This method can be used to further automate data collection and identify other functionally important low population states.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Structure
175 papers in training set
Top 0.1%
27.6%
2
Nature Communications
4913 papers in training set
Top 18%
10.1%
3
Acta Crystallographica Section D Structural Biology
54 papers in training set
Top 0.1%
10.1%
4
Communications Biology
886 papers in training set
Top 0.2%
6.8%
50% of probability mass above
5
Journal of Structural Biology
58 papers in training set
Top 0.2%
6.8%
6
IUCrJ
29 papers in training set
Top 0.1%
6.3%
7
Journal of Structural Biology: X
15 papers in training set
Top 0.1%
3.9%
8
eLife
5422 papers in training set
Top 26%
3.6%
9
Nature Methods
336 papers in training set
Top 4%
1.9%
10
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 29%
1.9%
11
Scientific Reports
3102 papers in training set
Top 58%
1.7%
12
Scientific Data
174 papers in training set
Top 1%
1.7%
13
Protein Science
221 papers in training set
Top 1%
1.1%
14
Science
429 papers in training set
Top 17%
0.9%
15
PLOS ONE
4510 papers in training set
Top 64%
0.9%
16
The Innovation
12 papers in training set
Top 1.0%
0.7%
17
Cell
370 papers in training set
Top 18%
0.7%
18
Communications Chemistry
39 papers in training set
Top 2%
0.6%
19
Computational and Structural Biotechnology Journal
216 papers in training set
Top 11%
0.6%
20
Cell Research
49 papers in training set
Top 3%
0.6%
21
Science Advances
1098 papers in training set
Top 33%
0.6%
22
Bioinformatics
1061 papers in training set
Top 10%
0.6%