Back

Prediction and analysis of new HisKA-like domains

Silly, L.; Perriere, G.; Ortet, P.

2026-03-02 bioinformatics
10.64898/2026.02.27.708494 bioRxiv
Show abstract

Histidine kinases (HKs) are part of many signaling pathways, by being implicated in two components systems (TCS). Using autophosphorylation and phosphotransfer to a response regulators (RR), they enable organisms to adapt to their environment. Most HKs are transmembrane proteins with a sensing domain outside of the cell and two catalytic domains called HisKA and HATPase. HATPase is required for interaction with the ATP and HisKA contains the phosphorylated histidine residue. HKs are involved in various environmental adaptation mechanisms, like light sensing or biochemical changes. Studying their diversity is therefore important to better understand how cells interacts with their environment. There exist incomplete HKs (iHKs) lacking either the HisKA or HATPase domain. Some iHKs with an HATPase domain possess a section of their sequence where an HisKA domain could be expected. These iHKs may contain "true" HKs, with unknown HisKA domain, that could fill gaps in various signaling pathways. In this study we analyzed 869 964 sequences of iHKs having an HATPase domain but lacking an HisKA domain. We identified 18 HisKA-like profiles and did multiple meta-studies to assessed their HisKA-like characteristics. We found that their 3D structures matched the structure of known HisKA domains. We saw that the genomic context of the genes associated to these profiles contained genes implicated in signal transduction pathways. We cross-validated some of our profiles with curated annotations, as well as with a "negative dataset" made of non-HK proteins. We believe that our work could help improve the annotation of regulation pathways in prokaryotes.

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 2%
14.2%
2
PLOS ONE
4510 papers in training set
Top 19%
10.0%
3
Frontiers in Microbiology
375 papers in training set
Top 2%
4.3%
4
Scientific Reports
3102 papers in training set
Top 28%
4.3%
5
Journal of Bioinformatics and Systems Biology
14 papers in training set
Top 0.1%
3.5%
6
F1000Research
79 papers in training set
Top 0.6%
3.5%
7
PeerJ
261 papers in training set
Top 3%
3.5%
8
mSystems
361 papers in training set
Top 3%
3.0%
9
Genomics
60 papers in training set
Top 0.4%
3.0%
10
BMC Bioinformatics
383 papers in training set
Top 3%
2.8%
50% of probability mass above
11
Biosystems
18 papers in training set
Top 0.1%
2.6%
12
Frontiers in Genetics
197 papers in training set
Top 4%
2.1%
13
Frontiers in Molecular Biosciences
100 papers in training set
Top 1%
1.9%
14
Microorganisms
101 papers in training set
Top 0.7%
1.8%
15
Database
51 papers in training set
Top 0.4%
1.7%
16
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.6%
1.5%
17
Genes
126 papers in training set
Top 1%
1.5%
18
mSphere
281 papers in training set
Top 4%
1.2%
19
Genome Biology and Evolution
280 papers in training set
Top 1%
1.2%
20
Computational and Structural Biotechnology Journal
216 papers in training set
Top 6%
1.2%
21
BMC Genomics
328 papers in training set
Top 4%
1.2%
22
Biology
43 papers in training set
Top 1%
1.1%
23
BMC Ecology and Evolution
49 papers in training set
Top 1%
1.1%
24
Bioinformatics
1061 papers in training set
Top 8%
0.9%
25
Access Microbiology
22 papers in training set
Top 0.5%
0.9%
26
Journal of Proteome Research
215 papers in training set
Top 2%
0.8%
27
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
0.8%
28
BioSystems
11 papers in training set
Top 0.3%
0.8%
29
Microbiology Spectrum
435 papers in training set
Top 6%
0.7%
30
International Journal of Biological Macromolecules
65 papers in training set
Top 4%
0.7%