Back

Mapping Protein Occupancy on DNA with an Unnatural Cytosine Modification in Bio-orthogonal Contexts

Zhu, R.; Loo, C. E.; Hurley, C. M.; Parker, J. B.; Kohli, R. M.

2025-11-26 biochemistry
10.1101/2025.11.24.690163 bioRxiv
Show abstract

The epigenome provides a dynamic layer of gene regulatory control above the static genetic sequence. Epigenetic DNA base modifications are a key regulator of gene expression, which in mammalian genomes predominantly occur in CpG contexts and are disproportionately distributed in regulatory regions like promoters. Chromatin-associated proteins and transcription factors work in tandem with these modifications to further control gene regulation. Given the interplay of these factors, mapping DNA base modifications concurrently with protein-DNA occupancy is therefore critical to interpreting the epigenome. Current multi-modal mapping methods employ DNA methyltransferases that mark accessible protein-unbound DNA in non-CpG contexts. However, the overlap of this exogenous DNA methylation with naturally occurring modifications can confound readouts and significantly limit compatibility with methods to simultaneously read epigenetic states. To circumvent these limitations, we explored the possibility of introducing an unnatural DNA base modification as an alternative label for protein occupancy. Here, we report our efforts to rationally engineer non-CpG-specific DNA carboxymethyltransferases, characterize their neomorphic activity, and assess DNA carboxymethylation as a reporter of protein occupancy on DNA. We find that DNA carboxymethylation of cytosines in GpC contexts shows broad compatibility with the most widely used epigenetic detection methods and reliably reports on protein occupancy state. Our results demonstrate that unnatural DNA modifications in bio-orthogonal sequence contexts, coupled with either chemical or enzymatic deamination, can potentiate new approaches to multimodal epigenetic profiling. Abstract Graphic O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=110 SRC="FIGDIR/small/690163v2_ufig1.gif" ALT="Figure 1"> View larger version (33K): org.highwire.dtl.DTLVardef@13f97cforg.highwire.dtl.DTLVardef@7b465org.highwire.dtl.DTLVardef@1d49387org.highwire.dtl.DTLVardef@1219a8c_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Journal of the American Chemical Society
199 papers in training set
Top 0.2%
19.6%
2
ACS Chemical Biology
150 papers in training set
Top 0.1%
10.2%
3
Cell Reports Methods
141 papers in training set
Top 0.2%
8.3%
4
Nucleic Acids Research
1128 papers in training set
Top 3%
6.9%
5
Angewandte Chemie International Edition
81 papers in training set
Top 0.4%
6.4%
50% of probability mass above
6
Nature Communications
4913 papers in training set
Top 37%
4.0%
7
JACS Au
35 papers in training set
Top 0.1%
3.1%
8
ACS Bio & Med Chem Au
11 papers in training set
Top 0.1%
1.9%
9
Analytical Chemistry
205 papers in training set
Top 1%
1.7%
10
ChemBioChem
50 papers in training set
Top 0.5%
1.7%
11
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.7%
12
iScience
1063 papers in training set
Top 17%
1.5%
13
Methods
29 papers in training set
Top 0.3%
1.3%
14
eLife
5422 papers in training set
Top 47%
1.3%
15
ACS Central Science
66 papers in training set
Top 1%
1.3%
16
PLOS ONE
4510 papers in training set
Top 60%
1.2%
17
Cell Systems
167 papers in training set
Top 9%
1.1%
18
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 41%
0.9%
19
Nature Biotechnology
147 papers in training set
Top 6%
0.9%
20
ACS Synthetic Biology
256 papers in training set
Top 3%
0.8%
21
RSC Chemical Biology
32 papers in training set
Top 0.5%
0.8%
22
Advanced Science
249 papers in training set
Top 17%
0.8%
23
Journal of Biological Chemistry
641 papers in training set
Top 4%
0.8%
24
Journal of Molecular Biology
217 papers in training set
Top 3%
0.8%
25
ACS Nano
99 papers in training set
Top 4%
0.7%
26
Scientific Reports
3102 papers in training set
Top 76%
0.7%
27
Nature Chemistry
34 papers in training set
Top 0.9%
0.7%
28
Epigenetics & Chromatin
42 papers in training set
Top 0.3%
0.7%
29
Small
70 papers in training set
Top 1%
0.6%
30
Chemical Communications
24 papers in training set
Top 1%
0.6%