Back

Clonal haematopoiesis without identified genetic drivers: insights from analyses of 407,512 individuals

Wen, S.; Campos, R.; Karpinski, M.; Sharma, R.; Manojlovic, V.; Deevi, S. V. V.; O'Dell, S.; Li, X.; Hu, F.; O'Connell, J.; Nag, A.; Megy, K.; MacArthur, S.; Wasilewski, S.; Zou, X. Z.; Vitsios, D.; Wang, Q.; Petrovski, S.; Harper, A. R.; Fabre, M. A.; Vassiliou, G. S.; Mitchell, J.

2026-05-20 genetic and genomic medicine
10.64898/2026.05.17.26353391 medRxiv
Show abstract

Clonal haematopoiesis (CH) becomes ubiquitous as humans age. The role of somatic driver mutations in its development has been studied widely, but little is known about CH without identified genetic drivers, also known as "CH with unknown drivers" (CH-UD). A fundamental unresolved question is whether CH-UD is driven by undiscovered somatic genetic drivers or by other cell-heritable traits. Here, to investigate this, we develop a new machine learning classifier to improve CH-UD detection from whole-genome sequencing data. After excluding 77,885 individuals with previously documented driver CH or mosaic chromosomal alterations (mCA), we applied our classifier to 407,512 UK Biobank participants and identified 26,963 (6.6%) with CH-UD. A genome-wide association study (GWAS) of common germline variants identified 31 polymorphic loci associated with predisposition to CH-UD. Of these, 25 were associated with other forms of CH at genome-wide significance. Linkage Disequilibrium Score Regression analyses revealed an unexpectedly high genetic correlation (rg=0.794) between CH-UD and non-DNMT3A driver CH, indicative of a remarkable overlap between the genetic aetiologies of the two phenomena. Analysis of 2,941 plasma protein measurements in 47,757 individuals revealed that TCL1A was the most significantly elevated plasma protein in CH-UD, mirroring the finding that the TCL1A locus was in the top two most significant associations of CH-UD GWAS and TET2-CH and ASXL1-CH GWAS, the two most common forms of non-DNMT3A-CH. Furthermore, TCL1A plasma levels rose steadily with age even in those without detectable CH, particularly among carriers of the common TCL1A risk variant (rs2887399-G), potentially via stochastic promoter demethylation as described in TET2-CH and ASXL1-CH. Phenome-wide association analysis of 13,225 binary and 1,682 quantitative traits revealed that, similarly to non-DNMT3A-CH, CH-UD was significantly associated with several malignant (haematological and solid organ) and non-malignant (including cardiovascular and renal) diseases. Our findings reveal striking genetic and phenotypic similarities between CH-UD and non-DNMT3A driver CH, including a strong dependence on TCL1A, a protein recently found to inhibit DNA methylation. Collectively, these observations propose that CH-UD develops through selection acting on ageing-associated epigenetic changes that mirror those of non-DNMT3A-CH, but without the need for somatic genetic drivers.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 1%
27.6%
2
Nature Genetics
240 papers in training set
Top 1%
6.8%
3
Leukemia
39 papers in training set
Top 0.3%
4.3%
4
Aging Cell
144 papers in training set
Top 1%
3.9%
5
Developmental Cell
168 papers in training set
Top 5%
3.6%
6
eLife
5422 papers in training set
Top 26%
3.6%
7
Journal of Clinical Investigation
164 papers in training set
Top 1%
3.6%
50% of probability mass above
8
Nature
575 papers in training set
Top 7%
3.6%
9
Nature Medicine
117 papers in training set
Top 0.8%
3.6%
10
Genome Medicine
154 papers in training set
Top 3%
3.1%
11
The American Journal of Human Genetics
206 papers in training set
Top 2%
2.7%
12
Cell Genomics
162 papers in training set
Top 2%
2.4%
13
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 28%
2.1%
14
EMBO Molecular Medicine
85 papers in training set
Top 1%
1.9%
15
Journal of Experimental Medicine
106 papers in training set
Top 2%
1.9%
16
Haematologica
24 papers in training set
Top 0.2%
1.8%
17
Blood
67 papers in training set
Top 0.9%
1.5%
18
Cancer Discovery
61 papers in training set
Top 1%
1.2%
19
PLOS Genetics
756 papers in training set
Top 11%
1.2%
20
Cell
370 papers in training set
Top 14%
1.1%
21
Science Advances
1098 papers in training set
Top 24%
1.1%
22
Blood Advances
54 papers in training set
Top 1%
0.8%
23
Cell Reports
1338 papers in training set
Top 32%
0.8%
24
Brain
154 papers in training set
Top 4%
0.8%
25
Cell Metabolism
49 papers in training set
Top 3%
0.7%
26
Nature Aging
51 papers in training set
Top 2%
0.7%
27
JCI Insight
241 papers in training set
Top 8%
0.7%
28
Nature Metabolism
56 papers in training set
Top 3%
0.6%