Back

Compositional restrictions in the flanking regions give potential specificity and strength boost to binding in short linear motifs

Acs, V.; Hatos, A.; Tantos, A.; Kalmar, L.

2024-05-14 bioinformatics
10.1101/2024.05.13.593809 bioRxiv
Show abstract

Short linear motif (SLiM)-mediated protein-protein interactions play important roles in several biological processes where transient binding is needed. They usually reside in intrinsically disordered regions (IDRs), which makes them accessible for interaction. Although information about the possible necessity of the flanking regions surrounding the motifs is increasingly available, it is still unclear if there are any generic amino acid attributes that need to be functionally preserved in these segments. Here, we describe the currently known ligand-binding SLiMs and their flanking regions with biologically relevant residue features and analyse them based on their simplified characteristics. Our bioinformatics analysis reveals several important properties in the widely diverse motif environment that presumably need to be preserved for proper motif function, but remained hidden so far. Our results will facilitate the understanding of the evolution of SLiMs, while also hold potential for expanding and increasing the precision of current motif prediction methods. Author summaryProtein-protein interactions between short linear motifs and their binding domains play key roles in several molecular processes. Mutations in these binding sites have been linked to severe diseases, therefore, the interest in the motif research field has been dramatically increasing. Based on the accumulated knowledge, it became evident that not only the short motif sequences themselves, but their surrounding flanking regions also play crucial roles in motif structure and function. Since most of the motifs tend to be located within highly variable disordered protein regions, searching for functionally important physico-chemical properties in their proximity could facilitate novel discoveries in this field. Here we show that the investigation of the motif flanking regions based on different amino acid attributes can provide further information on motif function. Based on our bioinformatics approach we have found so far hidden features that are generally present within certain motif categories, thus could be used as additional information in motif searching methods as well.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.1%
23.0%
2
PLOS ONE
4510 papers in training set
Top 30%
5.0%
3
Scientific Reports
3102 papers in training set
Top 22%
5.0%
4
PLOS Computational Biology
1633 papers in training set
Top 7%
5.0%
5
Journal of Chemical Information and Modeling
207 papers in training set
Top 1%
4.4%
6
Protein Science
221 papers in training set
Top 0.4%
3.7%
7
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.2%
3.7%
8
Journal of Biomolecular Structure and Dynamics
43 papers in training set
Top 0.3%
3.7%
50% of probability mass above
9
Computational Biology and Chemistry
23 papers in training set
Top 0.1%
2.8%
10
The Journal of Physical Chemistry B
158 papers in training set
Top 0.7%
2.7%
11
Journal of Molecular Graphics and Modelling
16 papers in training set
Top 0.1%
2.1%
12
International Journal of Biological Macromolecules
65 papers in training set
Top 1%
2.1%
13
Journal of Molecular Biology
217 papers in training set
Top 1%
2.1%
14
Biomolecules
95 papers in training set
Top 0.3%
1.8%
15
International Journal of Molecular Sciences
453 papers in training set
Top 7%
1.7%
16
PeerJ
261 papers in training set
Top 7%
1.7%
17
Journal of Cellular Biochemistry
10 papers in training set
Top 0.1%
1.7%
18
Frontiers in Molecular Biosciences
100 papers in training set
Top 2%
1.5%
19
Journal of Proteome Research
215 papers in training set
Top 1%
1.3%
20
Biochemistry and Biophysics Reports
28 papers in training set
Top 1.0%
1.0%
21
ACS Omega
90 papers in training set
Top 3%
0.9%
22
Computers in Biology and Medicine
120 papers in training set
Top 4%
0.9%
23
Current Research in Structural Biology
11 papers in training set
Top 0.1%
0.8%
24
Physical Chemistry Chemical Physics
34 papers in training set
Top 0.6%
0.8%
25
Frontiers in Immunology
586 papers in training set
Top 7%
0.8%
26
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.8%
27
Molecules
37 papers in training set
Top 2%
0.8%
28
Frontiers in Genetics
197 papers in training set
Top 9%
0.8%
29
Chemistry – A European Journal
13 papers in training set
Top 0.6%
0.7%
30
Pharmaceuticals
33 papers in training set
Top 2%
0.7%