Back

An Explainable Machine Learning Approach to study the positional significance of histone post-translational modifications in gene regulation

Ramachandran, S.; Ramakrishnan, N.

2026-02-02 bioinformatics
10.64898/2026.01.30.702742 bioRxiv
Show abstract

Epigenetic mechanisms regulate gene-expression by altering the structure of the chromatin without modifying the underlying DNA sequence. Histone post-translational modifications (PTMs) are critical epigenetic signals that influence transcriptional activity, promoting or repressing gene-expression.Understanding the impact of individual PTMs and the combinatorial effects is essential to deciphering gene regulatory mechanisms.In this study,we analyzed the ChIP-seq data for 26 PTMs in yeast, examining the PTM intensities gene-wise from positions-3 to 8 in each gene.Using XGBoost classifiers, we predicted gene transcription rates and identified key histone modifications and nucleosomal positions that are critical in gene-expression using explainability measures (such as SHAP). Our study provides a comprehensive insight into the histone modifications, their positions and their combinations that are most critical in gene regulation in yeast.The proposed explainable Machine Learning models can be easily extended to other model organisms to provide meaningful insights into gene regulation by epigenetic mechanisms.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
14.4%
2
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.1%
14.1%
3
Epigenetics
43 papers in training set
Top 0.1%
6.3%
4
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 1%
6.2%
5
PLOS Computational Biology
1633 papers in training set
Top 7%
4.8%
6
BMC Bioinformatics
383 papers in training set
Top 2%
3.9%
7
Nucleic Acids Research
1128 papers in training set
Top 5%
3.9%
50% of probability mass above
8
Frontiers in Genetics
197 papers in training set
Top 2%
3.8%
9
Journal of Molecular Biology
217 papers in training set
Top 0.7%
3.5%
10
PLOS Genetics
756 papers in training set
Top 5%
3.5%
11
Scientific Reports
3102 papers in training set
Top 51%
2.0%
12
iScience
1063 papers in training set
Top 11%
2.0%
13
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.8%
14
Advanced Science
249 papers in training set
Top 10%
1.8%
15
Epigenetics & Chromatin
42 papers in training set
Top 0.1%
1.8%
16
PLOS ONE
4510 papers in training set
Top 55%
1.7%
17
Nature Communications
4913 papers in training set
Top 57%
1.2%
18
Communications Biology
886 papers in training set
Top 15%
1.2%
19
Genetics
225 papers in training set
Top 4%
0.9%
20
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
0.9%
21
eLife
5422 papers in training set
Top 54%
0.9%
22
Genome Biology
555 papers in training set
Top 7%
0.8%
23
Genomics
60 papers in training set
Top 3%
0.7%
24
Gene
41 papers in training set
Top 2%
0.7%
25
Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms
14 papers in training set
Top 0.2%
0.7%
26
Journal of Bioinformatics and Systems Biology
14 papers in training set
Top 0.8%
0.7%
27
Quantitative Biology
11 papers in training set
Top 0.9%
0.7%
28
International Journal of Molecular Sciences
453 papers in training set
Top 18%
0.6%
29
Genome Research
409 papers in training set
Top 5%
0.6%