Back

Phylogenetic, sequence and structural analysis of Insulin superfamily proteins reveals an indelible link between evolution and structure-function relationship

S, S.; Kundapura, S.; Dey, D.; Ramagopal, U. A.; Kulal, A.

2019-09-16 evolutionary biology
10.1101/769497 bioRxiv
Show abstract

The insulin superfamily proteins (ISPs), in particular, insulin, IGFs and relaxins are key modulators of animal physiology. They are known to have evolved from the same ancestral gene and have diverged into proteins with varied sequences and distinct functions, but maintain a similar structural architecture stabilized by highly conserved disulphide bridges. A recent surge of sequence data and the structures of these proteins prompted a need for a comprehensive analysis which connects the evolution of these sequences in the light of available functional and structural information and their interaction with cognate receptors. This study reveals a) unusually high sequence conservation of IGFs (>90%), which has never been reported before. In fact, it was interesting to observe that the functional domains (excluding signal peptide) of human, horse, pig and Ords kangaroo rat are 100% identical. (b) an updated definition of the signature motif of the relaxin family (c) a non-canonical C-peptide cleavage site in a few killifish insulin sequences and so on. We also provide a structure-based rationale for such conservation by introducing a concept called binding partners imposed evolutionary constraints. Furthermore, the high conservation of IGFs appears to represent a classic case of resistance to sequence diversity exerted by physiologically important interactions with multiple partners. Furthermore, we propose a probable mechanism for C-peptide cleavage in killifish insulin sequences.

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 23%
8.3%
2
Biochimie
23 papers in training set
Top 0.1%
6.7%
3
Scientific Reports
3102 papers in training set
Top 19%
6.3%
4
Biochemical Journal
80 papers in training set
Top 0.1%
6.3%
5
Journal of Biomolecular Structure and Dynamics
43 papers in training set
Top 0.2%
6.3%
6
International Journal of Biological Macromolecules
65 papers in training set
Top 0.3%
4.8%
7
Protein Science
221 papers in training set
Top 0.4%
3.6%
8
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 2%
2.9%
9
Communications Biology
886 papers in training set
Top 3%
2.9%
10
Open Biology
95 papers in training set
Top 0.3%
2.7%
50% of probability mass above
11
International Journal of Molecular Sciences
453 papers in training set
Top 5%
2.1%
12
Genes
126 papers in training set
Top 0.7%
2.1%
13
BMC Ecology and Evolution
49 papers in training set
Top 1.0%
1.7%
14
BMC Molecular and Cell Biology
14 papers in training set
Top 0.1%
1.7%
15
Cells
232 papers in training set
Top 3%
1.7%
16
Medical Research Archives
11 papers in training set
Top 0.2%
1.7%
17
eLife
5422 papers in training set
Top 43%
1.6%
18
Frontiers in Genetics
197 papers in training set
Top 6%
1.5%
19
Gene
41 papers in training set
Top 1%
1.3%
20
Frontiers in Ecology and Evolution
60 papers in training set
Top 3%
0.9%
21
Molecular Biology and Evolution
488 papers in training set
Top 4%
0.9%
22
PLOS Computational Biology
1633 papers in training set
Top 21%
0.9%
23
The FEBS Journal
78 papers in training set
Top 0.6%
0.9%
24
Royal Society Open Science
193 papers in training set
Top 4%
0.9%
25
Journal of Molecular Biology
217 papers in training set
Top 3%
0.9%
26
Biology Open
130 papers in training set
Top 2%
0.9%
27
PeerJ
261 papers in training set
Top 14%
0.8%
28
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.9%
0.8%
29
BMC Genomic Data
12 papers in training set
Top 0.2%
0.7%
30
Biology
43 papers in training set
Top 3%
0.7%