Back

Computational analysis of transition temperatures (Tts) of proteins fused to elastin-like polypeptide (ELP): deep fake evaluation of proteins, linkers, and trailers features

Ghafari, M. D.; Rasooli, I.; Khajeh, K.; Dabirmanesh, B.; Ghafari, M.; Owlia, P.

2021-12-30 bioinformatics
10.1101/2021.12.29.474407 bioRxiv
Show abstract

The phase transition temperature (Tt) prediction of the Elastin-like polypeptides (ELPs) is not trivial because it is related to complex sets of variables such as composition, sequence length, hydrophobic characterization, hydrophilic characterization, the sequence order in the fused proteins, linkers and trailer constructs. In this paper, two unique quantitative models are presented for the prediction of the Tt of a family of ELPs that could be fused to different proteins, linkers, and trailers. The lack of need to use multiple software, peptide information, such as PDB file, as well as knowing the second and third structures of proteins are the advantages of this model besides its high accuracy and speed. One of our models could predict the Tt values of the fused ELPs by entering the protein, linker, and trailer features with R2=99%. Also, another model is able to predict the Tt value by entering the fused protein feature with R2=96%. For more reliability, our method is enriched by Artificial Intelligence (AI) to generate similar proteins. In this regard, Generative Adversarial Network (GAN) is our AI method to create fake proteins and similar values. The experimental results show that our strategy for prediction of Tt is reliable in large data. Author SummaryThe application of Elastin-like polypeptides (ELPs) as a protein tag is now developed in a variety of biotechnology aspects especially in proteins purification and drug delivery. ELPs application as a protein tag is owed to retain the phase transition behavior when ELPs fused to other proteins, linkers and tags. ELPs undergo the phase transition behavior by changing from soluble phase to insoluble phase above its inverse transition temperature (Tt) within a short time span. This biophysical behavior is usually reversible at the temperature below the Tt. There are few reports for evaluation of the Tt of ELPs types by using the dissimilar equations and algorithms. Our current predictions are the most accurate calculations presented so far by using the protein, linker and trailer effects and the results were evaluated in accordance with the available experimental data. Furthermore, our results also show that our strategy for prediction of Tt is reliable in large data.

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
Computational Biology and Chemistry
23 papers in training set
Top 0.1%
10.8%
2
Computers in Biology and Medicine
120 papers in training set
Top 0.1%
9.4%
3
PLOS ONE
4510 papers in training set
Top 30%
5.0%
4
Scientific Reports
3102 papers in training set
Top 26%
4.4%
5
Computational and Structural Biotechnology Journal
216 papers in training set
Top 1%
4.1%
6
PLOS Computational Biology
1633 papers in training set
Top 8%
4.1%
7
Biology Methods and Protocols
53 papers in training set
Top 0.2%
3.7%
8
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.7%
9
Bioinformatics
1061 papers in training set
Top 5%
3.7%
10
Physical Biology
43 papers in training set
Top 0.7%
2.2%
50% of probability mass above
11
BioSystems
11 papers in training set
Top 0.1%
1.9%
12
BMC Bioinformatics
383 papers in training set
Top 4%
1.8%
13
International Journal of Biological Macromolecules
65 papers in training set
Top 2%
1.5%
14
Chaos, Solitons & Fractals
32 papers in training set
Top 1%
1.4%
15
Archives of Clinical and Biomedical Research
28 papers in training set
Top 1.0%
1.4%
16
ACS Omega
90 papers in training set
Top 2%
1.3%
17
Bioengineering
24 papers in training set
Top 0.7%
1.3%
18
Expert Systems with Applications
11 papers in training set
Top 0.2%
1.1%
19
Frontiers in Genetics
197 papers in training set
Top 7%
1.0%
20
PROTEOMICS
35 papers in training set
Top 0.5%
1.0%
21
Informatics in Medicine Unlocked
21 papers in training set
Top 0.9%
0.9%
22
Biology
43 papers in training set
Top 2%
0.9%
23
Methods
29 papers in training set
Top 0.4%
0.9%
24
Pharmaceuticals
33 papers in training set
Top 1%
0.9%
25
International Journal of Molecular Sciences
453 papers in training set
Top 14%
0.8%
26
PeerJ
261 papers in training set
Top 13%
0.8%
27
Physica A: Statistical Mechanics and its Applications
10 papers in training set
Top 0.2%
0.8%
28
IEEE/ACM Transactions on Computational Biology and Bioinformatics
32 papers in training set
Top 0.5%
0.8%
29
Biomacromolecules
25 papers in training set
Top 0.4%
0.8%
30
The Journal of Physical Chemistry B
158 papers in training set
Top 2%
0.7%