Back

HMCVelo: A Deterministic Model for Hydroxymethylation Velocity in Single Cells

Mishra, P.

2026-04-22 bioinformatics
10.64898/2026.04.20.719607 bioRxiv
Show abstract

I present hydroxymethylation velocity (HMCVelo), the first velocity framework for DNA methylation dynamics. HMCVelo is a deterministic ordinary differential equation (ODE) model that computes the time derivative of hydroxymethylation state for individual cells and genes. The model exploits a recent advance in single-cell epigenomics--Joint single-nucleus hydroxymethylcytosine and methylcytosine sequencing (Joint-snhmC-seq)--which enables subtraction-free quantification of 5-hydroxymethylcytosine (5hmC) and 5-methylcytosine (5mC) at single-cell resolution, resolving temporal methylation dynamics from static molecular snapshots. HM-CVelo models the methylation-demethylation cycle as three coupled processes--methylation, hydroxymethylation, and demethylation--governed by gene-specific rate parameters estimated at steady state via constrained least-squares regression. Scale invariance reduces the parameter space from three to two free parameters per gene. Applied to murine cortical cells (n = 519 and n = 545), HMCVelo infers cellular trajectories with velocity confidence scores exceeding 0.89 across all cell types, compared to confidence scores below 0.45 when RNA velocity is repurposed on the same data. I further prove that in any closed biochemical system with a conservation law, the complement variable cannot resolve trajectory bifurcations--a result with implications for embedding basis selection in all future velocity frameworks applied to cyclic biochemical systems. This work provides a foundation for multi-omic trajectory inference integrating epigenetic and transcriptomic measurements.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Cell Systems
167 papers in training set
Top 0.7%
14.1%
2
Bioinformatics
1061 papers in training set
Top 2%
12.1%
3
Nature Methods
336 papers in training set
Top 1%
9.0%
4
Nature Biotechnology
147 papers in training set
Top 1%
7.0%
5
Nature Communications
4913 papers in training set
Top 30%
6.3%
6
Genome Biology
555 papers in training set
Top 2%
4.8%
50% of probability mass above
7
PLOS Computational Biology
1633 papers in training set
Top 9%
3.9%
8
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 21%
3.5%
9
The American Journal of Human Genetics
206 papers in training set
Top 2%
2.8%
10
BMC Bioinformatics
383 papers in training set
Top 4%
1.9%
11
Nature Computational Science
50 papers in training set
Top 0.6%
1.7%
12
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.7%
13
Advanced Science
249 papers in training set
Top 11%
1.7%
14
PLOS ONE
4510 papers in training set
Top 57%
1.5%
15
eLife
5422 papers in training set
Top 46%
1.5%
16
Nucleic Acids Research
1128 papers in training set
Top 13%
1.3%
17
Nature
575 papers in training set
Top 13%
1.2%
18
Biophysical Journal
545 papers in training set
Top 4%
1.2%
19
Scientific Reports
3102 papers in training set
Top 67%
1.2%
20
Bioinformatics Advances
184 papers in training set
Top 4%
1.1%
21
Genome Research
409 papers in training set
Top 4%
0.9%
22
Nature Machine Intelligence
61 papers in training set
Top 3%
0.9%
23
Communications Biology
886 papers in training set
Top 19%
0.9%
24
Genome Medicine
154 papers in training set
Top 8%
0.8%
25
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
0.8%
26
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.7%
27
npj Systems Biology and Applications
99 papers in training set
Top 3%
0.7%
28
iScience
1063 papers in training set
Top 33%
0.7%
29
Molecular Systems Biology
142 papers in training set
Top 2%
0.7%
30
Science
429 papers in training set
Top 20%
0.7%