Back

Smoking drives an epigenetic memory of aberrant hematopoiesis

Breeze, C. E.; Goodney, G.; Wang, H.; Hubbard, A. K.; Lim, J.; Machiela, M. J.; Hoang, T. T.; Richards-Barber, M.; Tran, C.; Tolentino, M.; Hansen, M.; Porecha, R.; Renke, N.; Zhou, W.; Franceschini, N.; Berndt, S. I.; Hofmann, J.; Lee, M.; London, S. J.; Wong, J. Y.

2026-05-21 epidemiology
10.64898/2026.05.14.26353250 medRxiv
Show abstract

Tobacco smoking induces DNA methylation (DNAm) changes in blood and other tissues, which may influence chronic health outcomes. However, the breadth of smoking-related DNAm changes remains unmapped, offering a space for employing novel technologies. To expand our understanding of smoking impacts on DNAm, we conducted an epigenome-wide association study (EWAS) comparing ever smokers to never smokers, using blood from a multiethnic U.S. study population (n=887). We employed the newly developed Illumina Methylation Screening Array (MSA) covering 269,094 unique sites, including 123,776 CpGs not assayed in previous EWAS. Trans-ethnic meta-analysis identified 152 differentially methylated positions (DMPs) associated with ever-smoking status (n=764); European-specific analysis yielded 129 DMPs (n=674), including 106 overlapping with trans-ethnic analysis. A separate, large-scale replication EWAS (n=2,190) confirmed 91 trans-ethnic and 77 European-specific DMPs. Among our findings, we identified 61 DMPs at CpGs novel to the MSA platform, including near both new and known smoking-associated genes. Most notably, we uncovered a dense cluster of 12 DMPs within a 1117 bp region of ECEL1P1, forming the most long-lasting, persistent smoking-associated DMR ever detected, even among former smokers who quit decades prior. We also detected new signals at AHRR, a well-known locus for smoking-related DNAm changes. eFORGE analysis revealed that detected smoking-associated DNAm changes are predominantly located in hematopoietic stem and progenitor cell (HSPC) DNase I hotspots, aligning with gene set enrichment analyses that highlighted pathways related to hematopoietic stem cell differentiation. Our findings suggest that HSPCs serve as a reservoir for an epigenetic memory of smoking. Additionally, we observed short-term cell-specific smoking-associated DNAm changes in myeloid cells. Our results demonstrate the utility of the MSA in expanding our knowledge of both transient and persistent environmental exposure-associated DNAm changes.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 3%
22.4%
2
eLife
5422 papers in training set
Top 10%
7.1%
3
Epigenetics
43 papers in training set
Top 0.1%
6.3%
4
Cell Reports
1338 papers in training set
Top 10%
4.8%
5
Science Advances
1098 papers in training set
Top 2%
4.8%
6
Cell Genomics
162 papers in training set
Top 1%
3.9%
7
Genome Medicine
154 papers in training set
Top 2%
3.6%
50% of probability mass above
8
JNCI: Journal of the National Cancer Institute
16 papers in training set
Top 0.2%
3.1%
9
Circulation
66 papers in training set
Top 1%
2.3%
10
Scientific Reports
3102 papers in training set
Top 50%
2.1%
11
Science Translational Medicine
111 papers in training set
Top 2%
2.1%
12
Science
429 papers in training set
Top 12%
2.1%
13
Nature
575 papers in training set
Top 10%
1.9%
14
The Journal of Infectious Diseases
182 papers in training set
Top 2%
1.8%
15
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 33%
1.7%
16
Cancer Discovery
61 papers in training set
Top 1%
1.7%
17
Clinical Epigenetics
53 papers in training set
Top 0.6%
1.5%
18
Clinical Infectious Diseases
231 papers in training set
Top 3%
1.5%
19
Nature Genetics
240 papers in training set
Top 5%
1.3%
20
The American Journal of Human Genetics
206 papers in training set
Top 3%
1.2%
21
eBioMedicine
130 papers in training set
Top 3%
1.1%
22
Neuropsychopharmacology
134 papers in training set
Top 2%
0.9%
23
JCI Insight
241 papers in training set
Top 7%
0.8%
24
Nature Biotechnology
147 papers in training set
Top 7%
0.8%
25
PLOS Genetics
756 papers in training set
Top 16%
0.7%
26
Molecular Ecology
304 papers in training set
Top 4%
0.7%
27
Environmental Health Perspectives
17 papers in training set
Top 0.6%
0.7%
28
Cell
370 papers in training set
Top 18%
0.7%
29
JAMA Psychiatry
13 papers in training set
Top 0.6%
0.7%
30
Communications Biology
886 papers in training set
Top 29%
0.6%