Back

Comprehensive analysis of CXXX sequence space reveals that S. cerevisiae GGTase-I mainly relies on a2X substrate determinants

Schmidt, W. K.; Sarkar, A.; Hildebrandt, E. R.; Patel, K. V.; Mai, E. T.; Shah, S. S.; Kim, J. H.

2024-03-04 genetics
10.1101/2024.03.04.583369 bioRxiv
Show abstract

Many proteins undergo a post-translational lipid attachment, which increases their hydrophobicity, thus strengthening their membrane association properties or aiding in protein interactions. Geranylgeranyltransferase-I (GGTase-I) is an enzyme involved in a three-step post-translational modification (PTM) pathway that attaches a 20-carbon lipid group called geranylgeranyl at the carboxy-terminal cysteine of proteins ending in a canonical CaaL motif (C - cysteine, a - aliphatic, L - often leucine, but can be phenylalanine, isoleucine, methionine, or valine). Genetic approaches involving two distinct reporters were employed in this study to assess S. cerevisiae GGTase-I specificity, for which limited data exists, towards all 8000 CXXX combinations. Orthogonal biochemical analyses and structure-based alignments were also performed to better understand the features required for optimal target interaction. These approaches indicate that yeast GGTase-I best modifies the Cxa[L/F/I/M/V] sequence that resembles but is not an exact match for the canonical CaaL motif. We also observed that minor modification of non-canonical sequences is possible. A consistent feature associated with well-modified sequences was the presence of a non-polar a2 residue and a hydrophobic terminal residue, which are features recognized by mammalian GGTase-I. These results thus support that mammalian and yeast GGTase-I exhibit considerable shared specificity. Article SummaryThis work investigates yeast GGTase-I specificity through genetics, high throughput sequencing, and two distinct reporter systems. This approach allows for comprehensive evaluation of all CXXX sequence space, which has not been possible with earlier approaches. We identified CXXX sequences supporting geranylgeranylation that differ from the historically defined CaaL sequence often cited in the literature as the GGTase-I target motif, and our results indicate that the last two amino acids of the target motif largely dictate GGTase-I specificity.

Matching journals

The top 12 journals account for 50% of the predicted probability mass.

1
The FEBS Journal
78 papers in training set
Top 0.1%
9.3%
2
Scientific Reports
3102 papers in training set
Top 9%
8.6%
3
PLOS ONE
4510 papers in training set
Top 31%
4.9%
4
Journal of Biological Chemistry
641 papers in training set
Top 0.3%
4.4%
5
ACS Omega
90 papers in training set
Top 0.3%
4.0%
6
International Journal of Molecular Sciences
453 papers in training set
Top 2%
3.7%
7
International Journal of Biological Macromolecules
65 papers in training set
Top 0.6%
3.7%
8
mSphere
281 papers in training set
Top 2%
3.3%
9
Frontiers in Plant Science
240 papers in training set
Top 3%
3.3%
10
ACS Chemical Biology
150 papers in training set
Top 0.7%
2.1%
11
PLOS Genetics
756 papers in training set
Top 8%
1.8%
12
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 4%
1.7%
50% of probability mass above
13
Molecular Genetics and Genomics
11 papers in training set
Top 0.1%
1.7%
14
Biology
43 papers in training set
Top 0.7%
1.7%
15
Biochimica et Biophysica Acta (BBA) - Molecular Cell Research
28 papers in training set
Top 0.2%
1.5%
16
Applied and Environmental Microbiology
301 papers in training set
Top 2%
1.5%
17
PLOS Computational Biology
1633 papers in training set
Top 18%
1.4%
18
Journal of Proteome Research
215 papers in training set
Top 2%
1.2%
19
BMC Biology
248 papers in training set
Top 2%
1.2%
20
Open Biology
95 papers in training set
Top 1%
1.2%
21
Frontiers in Bioengineering and Biotechnology
88 papers in training set
Top 2%
1.0%
22
Frontiers in Genetics
197 papers in training set
Top 7%
1.0%
23
Genetics
225 papers in training set
Top 4%
0.8%
24
Analytical Chemistry
205 papers in training set
Top 2%
0.8%
25
Glycobiology
30 papers in training set
Top 0.2%
0.8%
26
G3: Genes, Genomes, Genetics
222 papers in training set
Top 0.9%
0.8%
27
FEBS Letters
42 papers in training set
Top 0.3%
0.8%
28
Biochemical and Biophysical Research Communications
78 papers in training set
Top 2%
0.8%
29
G3 Genes|Genomes|Genetics
351 papers in training set
Top 3%
0.7%
30
Molecular Biology of the Cell
272 papers in training set
Top 3%
0.7%