Back

Phylogenetic Investigation of the 100 kDa Hexokinase Enzyme Family with the Topiary Ancestral Sequence Reconstruction Pipeline

Freye, C.; Whittington, A. C.; Miller, B. G.

2026-01-30 biochemistry
10.64898/2026.01.30.702642 bioRxiv
Show abstract

The 100 kDa hexokinase (HK) enzyme family represents an attractive model to investigate the molecular origins of allosteric regulation in multidomain enzymes. Extant HK homologs are subject to various allosteric phenomena, including activation and inhibition by both homotropic and heterotropic ligands. Here, we report the results of a phylogenetic investigation of this enzyme family using the recently developed Topiary ancestral sequence reconstruction pipeline. The results agree with prior studies that used a smaller number of sequences from individual HK domains and suggest that modern HK3 isozymes diverged first from a 100 kDa ancestor, followed by gene duplication and divergence of the HK2 isozymes. A subsequent gene duplication event led to divergence of HK1 and the hexokinase domain containing protein 1 (HKDC1). To probe the ability of Topiary to yield functional, allosterically regulated ancestral enzymes, we resurrected and biochemically characterized two HKs from early vertebrate evolution, Anc1 and Anc2. Both enzymes were functionally similar to extant HK1, and possessed a low activity, regulatory N-terminal domain that governs allosteric regulation of the C-terminal active site by two heterotropic effectors, glucose 6-phosphate and inorganic phosphate. Neither ancestor was subject to homotropic regulation by substrate glucose, a characteristic observed in several extant HK3 family members. Our phylogenetic analysis provides a foundation for investigating the evolution of allostery in this enzyme family. It also demonstrates the need to sequence and biochemically characterize additional full-length HKs, especially those from jawless vertebrates, to enable more robust inferences of ancestral regulatory traits.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
eLife
5422 papers in training set
Top 0.1%
27.9%
2
Journal of Biological Chemistry
641 papers in training set
Top 0.1%
18.8%
3
Nature Communications
4913 papers in training set
Top 22%
8.5%
50% of probability mass above
4
Protein Science
221 papers in training set
Top 0.1%
8.3%
5
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 11%
6.4%
6
Structure
175 papers in training set
Top 1.0%
3.1%
7
Nucleic Acids Research
1128 papers in training set
Top 8%
2.1%
8
Scientific Reports
3102 papers in training set
Top 53%
1.9%
9
Cell Reports
1338 papers in training set
Top 23%
1.8%
10
PLOS Computational Biology
1633 papers in training set
Top 16%
1.7%
11
PLOS Biology
408 papers in training set
Top 13%
1.2%
12
Open Biology
95 papers in training set
Top 1%
1.2%
13
Biochemical Journal
80 papers in training set
Top 0.2%
0.8%
14
Journal of Molecular Biology
217 papers in training set
Top 3%
0.8%
15
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.9%
0.8%
16
Biochemistry
130 papers in training set
Top 2%
0.8%
17
Communications Biology
886 papers in training set
Top 23%
0.8%
18
Biophysical Journal
545 papers in training set
Top 6%
0.5%
19
Science Advances
1098 papers in training set
Top 34%
0.5%
20
PLOS ONE
4510 papers in training set
Top 73%
0.5%
21
Cell Chemical Biology
81 papers in training set
Top 5%
0.5%
22
mBio
750 papers in training set
Top 13%
0.5%