Back

Prediction and Characterization of Disorder-Order Transition Regions in Proteins by Deep Learning

Yan, Z.; Omori, S.; Yamada, K. D.; Nishi, H.; Kinoshita, K.

2021-06-11 bioinformatics
10.1101/2021.06.11.448022 bioRxiv
Show abstract

The biological functions of proteins are traditionally thought to depend on well-defined three-dimensional structures, but many experimental studies have shown that disordered regions lacking fixed three-dimensional structures also have crucial biological roles. In some of these regions, disorder-order transitions are also involved in various biological processes, such as protein-protein interaction and ligand binding. Therefore, it is crucial to study disordered regions and structural transitions for further understanding of protein functions and folding. Owing to the costs and time requirements of experimental identification of natively disordered or transitional regions, the development of effective computational methods is a key research goal. In this study, we used overall residue dependencies and deep representation learning for prediction and reused the obtained disordered regions for the prediction of disorder-order transitions. Two similar and related prediction tasks were combined. Firstly, we developed a novel deep learning method, Res-BiLstm, for residue-wise disordered region prediction. Our method outperformed other predictors with respect to almost all criteria, as evaluated using an independent test set. For disorder-order transition prediction, we proposed a transfer learning method, Res-BiLstm-NN, with an acceptable but unbalanced performance, yielding reasonable results. To grasp underlining biophysical principles of disorder-order transitions, we performed qualitative analyses on the obtained results and discovered that most transitions have strong disordered or ordered preferences, and more transitions are consistent with the ordered state than the disordered state, different from conventional wisdom. To the best of our knowledge, this is the first sizable-scale study of transition prediction. Availabilityhttps://github.com/Yanzziang/Transition_Disorder_Prediction Contactkengo@ecei.tohoku.ac.jp

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Briefings in Bioinformatics
326 papers in training set
Top 0.2%
12.7%
2
Computational Biology and Chemistry
23 papers in training set
Top 0.1%
8.5%
3
Bioinformatics
1061 papers in training set
Top 3%
8.5%
4
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.2%
8.5%
5
PLOS Computational Biology
1633 papers in training set
Top 7%
4.9%
6
Journal of Chemical Information and Modeling
207 papers in training set
Top 1.0%
4.9%
7
Scientific Reports
3102 papers in training set
Top 27%
4.4%
50% of probability mass above
8
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.4%
2.6%
9
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.3%
2.5%
10
Protein Science
221 papers in training set
Top 0.6%
2.1%
11
The Journal of Physical Chemistry B
158 papers in training set
Top 0.9%
1.9%
12
Frontiers in Molecular Biosciences
100 papers in training set
Top 1%
1.9%
13
Journal of Molecular Biology
217 papers in training set
Top 1%
1.8%
14
BMC Bioinformatics
383 papers in training set
Top 4%
1.7%
15
Biomolecules
95 papers in training set
Top 0.6%
1.5%
16
International Journal of Molecular Sciences
453 papers in training set
Top 9%
1.4%
17
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.2%
18
Communications Biology
886 papers in training set
Top 14%
1.2%
19
Quantitative Biology
11 papers in training set
Top 0.4%
1.1%
20
Journal of Computational Chemistry
11 papers in training set
Top 0.1%
0.9%
21
Frontiers in Genetics
197 papers in training set
Top 8%
0.9%
22
PeerJ
261 papers in training set
Top 12%
0.9%
23
ACS Omega
90 papers in training set
Top 3%
0.9%
24
PLOS ONE
4510 papers in training set
Top 66%
0.8%
25
Biophysics and Physicobiology
10 papers in training set
Top 0.2%
0.7%
26
Frontiers in Immunology
586 papers in training set
Top 9%
0.7%
27
The Journal of Physical Chemistry Letters
58 papers in training set
Top 2%
0.7%
28
Molecules
37 papers in training set
Top 2%
0.7%
29
eLife
5422 papers in training set
Top 63%
0.5%
30
iScience
1063 papers in training set
Top 40%
0.5%