
Characterization of a novel glycocin from a thermophile

Martini, R. M.; van der Donk, W.

2025-05-20 biochemistry
10.1101/2025.05.19.655019 bioRxiv

Glycocins are a growing family of ribosomally synthesized and posttranslationally modified peptides that are O- and/or S-glycosylated. Using a sequence similarity network of putative glycosyltransferases, we identified the tht biosynthetic gene cluster in the genome of Thermoanaerobacterium thermosaccharolyticum. ThtA is the precursor peptide to a member of the glycocin F family of glycocins. As in other members of this family, the glycosyltransferase (ThtS) encoded in the biosynthetic gene cluster adds N-acetylglucosamine to both Ser and Cys residues of ThtA. Because S-linked glycosylation has been shown to resist chemical and enzymatic cleavage, ThtS may be a valuable starting point for engineering efforts. The glycocin derived from ThtA, which we name thermoglycocin, was structurally characterized. Thermoglycocin is unique in that, in addition to two nested disulfide bonds, it contains a third disulfide bond that creates a C-terminal loop. Unexpectedly, ThtA lacks the common double glycine motif that denotes a C39-peptidase leader peptide cleavage site. Based on AlphaFold3 modeling, we postulate that cleavage between the leader and core peptide occurs instead at a GK motif. This study adds to the small number of characterized glycocins, employs AlphaFold3 to aid in predicting the structure of the mature peptide product, and suggests a common naming convention similar to that established for lanthipeptides.

One sentence summary: Thermoglycocin is a novel glycocin derived from the thermophile Thermoanaerobacterium thermosaccharolyticum that contains three disulfide bonds and O- and S-GlcNAcylation and is postulated to have a unique C39 protease cut site.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

| Rank | Journal | Papers in training set | Percentile | Predicted probability |
|---|---|---|---|---|
| 1 | ACS Chemical Biology | 150 | Top 0.1% | 26.1% |
| 2 | Glycobiology | 30 | Top 0.1% | 6.4% |
| 3 | Proteins: Structure, Function, and Bioinformatics | 82 | Top 0.1% | 6.4% |
| 4 | Biochemistry | 130 | Top 0.2% | 4.9% |
| 5 | PLOS ONE | 4510 | Top 31% | 4.9% |
| 6 | Scientific Reports | 3102 | Top 30% | 4.0% |

50% of probability mass above

| Rank | Journal | Papers in training set | Percentile | Predicted probability |
|---|---|---|---|---|
| 7 | Journal of Biological Chemistry | 641 | Top 0.5% | 3.6% |
| 8 | mBio | 750 | Top 5% | 2.8% |
| 9 | Protein Science | 221 | Top 0.5% | 2.6% |
| 10 | Structure | 175 | Top 1% | 2.5% |
| 11 | ACS Omega | 90 | Top 0.9% | 2.4% |
| 12 | Microbiology | 57 | Top 0.4% | 2.1% |
| 13 | Microbial Cell Factories | 22 | Top 0.2% | 1.7% |
| 14 | ACS Infectious Diseases | 74 | Top 0.6% | 1.7% |
| 15 | Frontiers in Microbiology | 375 | Top 6% | 1.5% |
| 16 | eLife | 5422 | Top 50% | 1.1% |
| 17 | Proceedings of the National Academy of Sciences | 2130 | Top 39% | 1.1% |
| 18 | ACS Synthetic Biology | 256 | Top 2% | 1.0% |
| 19 | Journal of Natural Products | 11 | Top 0.2% | 0.9% |
| 20 | BMC Genomics | 328 | Top 5% | 0.8% |
| 21 | Applied and Environmental Microbiology | 301 | Top 3% | 0.8% |
| 22 | mSystems | 361 | Top 8% | 0.7% |
| 23 | RSC Advances | 18 | Top 2% | 0.7% |
| 24 | PNAS Nexus | 147 | Top 2% | 0.7% |
| 25 | Journal of the American Chemical Society | 199 | Top 6% | 0.5% |
| 26 | Acta Crystallographica Section D Structural Biology | 54 | Top 0.5% | 0.5% |
| 27 | Royal Society Open Science | 193 | Top 6% | 0.5% |
| 28 | Journal of Chemical Information and Modeling | 207 | Top 4% | 0.5% |
| 29 | Metabolites | 50 | Top 2% | 0.5% |
| 30 | Journal of the American Society for Mass Spectrometry | 33 | Top 0.7% | 0.5% |