
Code for Collagen Folding Deciphered

Malcor, J.-D.; Ferruz, N.; Romero-Romero, S.; Dhingra, S.; Sagar, V.; Jalan, A. A.

2024-02-26 biochemistry
10.1101/2024.02.24.581883 bioRxiv

The collagen triple helix folds in two steps: nucleation of three polypeptides at the C-termini, followed by zipper-like propagation. Triple helices, found in all domains of life as well as in viruses, contain up to 6000 amino acids in each polypeptide and are frequently interrupted by non-helical sequences that disrupt folding and reduce stability. Given the length of the polypeptides and these disruptive interruptions, the compensating mechanisms that stabilize against local unfolding during propagation and offset the entropic cost of folding the long polypeptides are not fully understood. Here, we show that the information for correct folding of collagen triple helices is encoded in their sequence as interchain electrostatic interactions. In humans, disrupting these interactions causes severe to lethal diseases.

Key Result: Collagen triple helices found in all three domains of life as well as in viruses have converged on a similar mechanism to fold correctly.

Matching journals

The top 14 journals account for 50% of the predicted probability mass.

Rank  Journal                                              Papers in training set  Percentile  Probability
1     Biochemical Journal                                  80                      Top 0.1%    8.2%
2     eLife                                                5422                    Top 20%     4.3%
3     Nature Communications                                4913                    Top 36%     4.2%
4     Journal of Molecular Biology                         217                     Top 0.5%    4.0%
5     International Journal of Biological Macromolecules   65                      Top 0.6%    3.6%
6     Proceedings of the National Academy of Sciences      2130                    Top 20%     3.6%
7     Biochemical and Biophysical Research Communications  78                      Top 0.1%    3.6%
8     iScience                                             1063                    Top 5%      3.6%
9     Biophysical Journal                                  545                     Top 2%      3.3%
10    Biomolecules                                         95                      Top 0.1%    2.7%
11    The Journal of Physical Chemistry B                  158                     Top 0.7%    2.6%
12    PLOS ONE                                             4510                    Top 46%     2.4%
13    Protein Science                                      221                     Top 0.6%    2.1%
14    PLOS Computational Biology                           1633                    Top 14%     2.1%
---- 50% of probability mass above ----
15    Biochemistry                                         130                     Top 0.7%    1.9%
16    Journal of Biological Chemistry                      641                     Top 1%      1.9%
17    PLOS Pathogens                                       721                     Top 5%      1.8%
18    Journal of Structural Biology                        58                      Top 0.7%    1.8%
19    Scientific Reports                                   3102                    Top 58%     1.7%
20    Communications Biology                               886                     Top 11%     1.5%
21    Advanced Science                                     249                     Top 12%     1.5%
22    Nucleic Acids Research                               1128                    Top 13%     1.3%
23    PLOS Biology                                         408                     Top 14%     1.2%
24    Viruses                                              318                     Top 4%      1.2%
25    Structure                                            175                     Top 2%      1.2%
26    Cellular and Molecular Life Sciences                 84                      Top 0.3%    1.2%
27    Cell Reports                                         1338                    Top 28%     1.2%
28    ACS Chemical Neuroscience                            60                      Top 2%      1.2%
29    Computational and Structural Biotechnology Journal   216                     Top 6%      1.2%
30    Open Biology                                         95                      Top 1%      1.2%
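
The statement that the top 14 journals account for 50% of the predicted probability mass can be checked with a running total over the listed percentages. The sketch below is only an illustration: the function name `journals_needed` is hypothetical, and the values are the rounded percentages shown in the listing, so the totals are approximate.

```python
# Predicted probabilities (in percent) for ranks 1 to 30, copied from the listing.
# These are rounded display values, so sums are approximate.
probabilities = [
    8.2, 4.3, 4.2, 4.0, 3.6, 3.6, 3.6, 3.6, 3.3, 2.7,
    2.6, 2.4, 2.1, 2.1, 1.9, 1.9, 1.8, 1.8, 1.7, 1.5,
    1.5, 1.3, 1.2, 1.2, 1.2, 1.2, 1.2, 1.2, 1.2, 1.2,
]


def journals_needed(probs, threshold):
    """Return how many top-ranked entries are needed for the running
    total to reach `threshold`, or None if it is never reached."""
    total = 0.0
    for count, p in enumerate(probs, start=1):
        total += p
        if total >= threshold:
            return count
    return None


top_n = journals_needed(probabilities, 50.0)
mass_top_n = sum(probabilities[:top_n])
mass_listed = sum(probabilities)

print(f"Journals needed for 50%: {top_n}")
print(f"Mass covered by those journals: {mass_top_n:.1f}%")
print(f"Mass covered by all 30 listed journals: {mass_listed:.1f}%")
```

With the rounded values, the first 13 journals sum to about 48.2% and the 14th brings the total to about 50.3%, which matches the stated cutoff. The 30 listed journals together cover about 73.3%, so the remaining mass is spread over journals not shown.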