iHyd-ProSite: A novel Computational Approach for Identifying Hydroxylation Sites in Proline Via Mathematical Modeling

Mahmood, M. K.

2020-03-03 bioinformatics
10.1101/2020.03.03.974717 bioRxiv

Post-translational modifications (PTMs) of proteins play a vital role in many cellular functions. A PTM arises when a functional group is covalently attached to a protein. A number of PTMs have been identified that are closely linked with diseases such as cancer and neurological disorders. Hydroxylation is one such PTM; it modifies proline residues within a polypeptide sequence. Defective hydroxylation of proline, as occurs when ascorbic acid is deficient in humans, produces scurvy and many other major health issues. Predicting hydroxylation sites at proline residues remains a challenging problem. Experimental identification of hydroxyproline sites is difficult, expensive, and time-consuming. The diversity of protein sequences motivates the development of a computational tool that identifies hydroxylated sites quickly and with high prediction accuracy. In this work, a novel in silico predictor is developed through rigorous mathematical modeling to identify which proline sites are hydroxylated and which are not. The performance of the predictor was verified on the benchmark dataset using three validation tests: the self-consistency test, the cross-validation test, and the jackknife test. Jackknife results were compared with those of previous methods, and the proposed tool was more accurate than the existing predictors. The scheme is therefore a highly useful alternative to previous predictors.
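
The three validation tests named in the abstract differ only in which samples are used for training and which for testing. The sketch below illustrates that difference in Python. Everything in it is an assumption made for illustration: the abstract does not describe the predictor's features or classifier, so a one-dimensional toy score per proline site and a nearest-centroid rule stand in for them, and the data are invented.

```python
# Illustrative only: toy data and a stand-in classifier, not the paper's method.

def predict(train, x):
    """Nearest-centroid rule on a 1-D score; labels are 1 (hydroxylated) or 0."""
    centroids = {}
    for label in sorted({lbl for _, lbl in train}):
        values = [v for v, lbl in train if lbl == label]
        centroids[label] = sum(values) / len(values)
    return min(centroids, key=lambda lbl: abs(centroids[lbl] - x))

def accuracy(hits):
    return sum(hits) / len(hits)

def self_consistency_test(data):
    # Train and test on the same full dataset.
    return accuracy([predict(data, x) == y for x, y in data])

def k_fold_cross_validation(data, k):
    # Hold out every k-th sample in turn and train on the remainder.
    hits = []
    for fold in range(k):
        test = data[fold::k]
        train = [s for i, s in enumerate(data) if i % k != fold]
        hits.extend(predict(train, x) == y for x, y in test)
    return accuracy(hits)

def jackknife_test(data):
    # Leave exactly one sample out each time (leave-one-out).
    hits = []
    for i, (x, y) in enumerate(data):
        train = data[:i] + data[i + 1:]
        hits.append(predict(train, x) == y)
    return accuracy(hits)

# Hypothetical (score, label) pairs for six proline sites.
toy_sites = [(0.1, 0), (0.2, 0), (0.3, 0), (0.8, 1), (0.9, 1), (1.0, 1)]

print(self_consistency_test(toy_sites))
print(k_fold_cross_validation(toy_sites, k=3))
print(jackknife_test(toy_sites))
```

The jackknife test is the strictest of the three on a fixed dataset because every sample is scored by a model that never saw it, which is presumably why the abstract uses it for the comparison with earlier methods.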

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1. Journal of Proteome Research: 215 papers in training set, Top 0.1%, 29.3%
2. PROTEOMICS: 35 papers in training set, Top 0.1%, 15.6%
3. PLOS ONE: 4510 papers in training set, Top 43%, 2.9%
4. Molecules: 37 papers in training set, Top 0.3%, 2.9%

50% of probability mass above

5. Analytical Chemistry: 205 papers in training set, Top 1.0%, 2.8%
6. Journal of the American Society for Mass Spectrometry: 33 papers in training set, Top 0.2%, 2.2%
7. Frontiers in Molecular Biosciences: 100 papers in training set, Top 1%, 2.0%
8. Bioinformatics: 1061 papers in training set, Top 7%, 1.9%
9. ACS Omega: 90 papers in training set, Top 1%, 1.9%
10. Molecular & Cellular Proteomics: 158 papers in training set, Top 1%, 1.6%
11. Journal of Proteomics: 27 papers in training set, Top 0.2%, 1.6%
12. Scientific Reports: 3102 papers in training set, Top 65%, 1.3%
13. SoftwareX: 15 papers in training set, Top 0.3%, 1.2%
14. International Journal of Biological Macromolecules: 65 papers in training set, Top 2%, 1.0%
15. Communications Chemistry: 39 papers in training set, Top 0.7%, 0.9%
16. Computational and Structural Biotechnology Journal: 216 papers in training set, Top 7%, 0.9%
17. Frontiers in Bioinformatics: 45 papers in training set, Top 0.6%, 0.9%
18. Analytica Chimica Acta: 17 papers in training set, Top 0.5%, 0.8%
19. BMC Genomics: 328 papers in training set, Top 5%, 0.8%
20. ImmunoInformatics: 11 papers in training set, Top 0.2%, 0.8%
21. Molecular Omics: 21 papers in training set, Top 0.3%, 0.8%
22. PLOS Computational Biology: 1633 papers in training set, Top 23%, 0.8%
23. Biology: 43 papers in training set, Top 2%, 0.8%
24. Briefings in Bioinformatics: 326 papers in training set, Top 6%, 0.8%
25. IEEE/ACM Transactions on Computational Biology and Bioinformatics: 32 papers in training set, Top 0.5%, 0.8%
26. BMC Medical Genomics: 36 papers in training set, Top 1%, 0.8%
27. PLOS Neglected Tropical Diseases: 378 papers in training set, Top 5%, 0.8%
28. Biomedical Signal Processing and Control: 18 papers in training set, Top 0.5%, 0.8%
29. Computers in Biology and Medicine: 120 papers in training set, Top 4%, 0.8%
30. GigaScience: 172 papers in training set, Top 3%, 0.8%
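
The statement that the top 4 journals account for 50% of the predicted probability mass can be checked from the listed percentages. A minimal Python sketch follows; the function name is illustrative, and the values are the ten highest probabilities copied from the listing.

```python
def journals_to_reach(probabilities, threshold):
    """Return (rank, cumulative mass) at which the running total first reaches threshold."""
    total = 0.0
    for rank, p in enumerate(probabilities, start=1):
        total += p
        if total >= threshold:
            return rank, total
    return None  # threshold never reached with the values given

# Predicted probabilities (in percent) for ranks 1 to 10, as listed.
top_probabilities = [29.3, 15.6, 2.9, 2.9, 2.8, 2.2, 2.0, 1.9, 1.9, 1.6]

rank, mass = journals_to_reach(top_probabilities, 50.0)
print(rank, round(mass, 1))  # 4 50.7
```

The first three journals sum to 47.8%, so the fourth is needed to cross the 50% mark.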