Back

Massively parallel characterization and deep learning of enhancers in plant genomes

Jores, T.; Mueth, N. A.; Gorjifard, S.; Triesch, S.; Schirmer, D.; Tonnies, J.; Bubb, K. L.; Cuperus, J. T.; Fields, S.; Queitsch, C.

2026-04-29 plant biology
10.64898/2026.04.26.720828 bioRxiv
Show abstract

Enhancers coordinate gene expression in response to developmental and environmental cues. Because plant enhancers lack the readily detectable molecular hallmarks of animal enhancers, their systematic functional characterization has yet to be accomplished. Here, we characterize the species- and condition-specific enhancer activity of over 350,000 sequences derived from accessible chromatin regions of the Arabidopsis, tomato, maize, and sorghum genomes. Enabled by the massive scale of the data, we developed plantGREP, a deep learning model that predicts enhancer strength and identifies the underlying functional sequence motifs. We apply plantGREP to evolve strong constitutive as well as species- and condition-specific enhancers, and to locate regions with enhancer activity upstream of developmental genes in crop genomes. These results should facilitate the targeted editing of enhancers in crop genomes and the design of cell-type-specific plant enhancers.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 2%
22.9%
2
Genome Biology
555 papers in training set
Top 0.1%
15.0%
3
Science
429 papers in training set
Top 4%
7.3%
4
Nature Plants
84 papers in training set
Top 0.2%
6.9%
50% of probability mass above
5
Cell Systems
167 papers in training set
Top 2%
4.9%
6
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 17%
4.0%
7
Nature Biotechnology
147 papers in training set
Top 3%
3.7%
8
Nature Genetics
240 papers in training set
Top 2%
3.7%
9
Nucleic Acids Research
1128 papers in training set
Top 7%
3.3%
10
Science Advances
1098 papers in training set
Top 9%
2.8%
11
Nature
575 papers in training set
Top 9%
2.6%
12
Cell
370 papers in training set
Top 11%
1.7%
13
Molecular Plant
36 papers in training set
Top 0.7%
1.7%
14
Nature Methods
336 papers in training set
Top 4%
1.7%
15
Genome Research
409 papers in training set
Top 3%
1.4%
16
Developmental Cell
168 papers in training set
Top 10%
1.0%
17
Advanced Science
249 papers in training set
Top 16%
0.9%
18
Cell Reports
1338 papers in training set
Top 33%
0.8%
19
eLife
5422 papers in training set
Top 57%
0.8%
20
Nature Ecology & Evolution
113 papers in training set
Top 4%
0.7%
21
Molecular Cell
308 papers in training set
Top 11%
0.7%
22
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 7%
0.7%
23
Plant Communications
35 papers in training set
Top 2%
0.7%
24
PLOS ONE
4510 papers in training set
Top 72%
0.5%
25
Nature Cell Biology
99 papers in training set
Top 6%
0.5%
26
Communications Biology
886 papers in training set
Top 32%
0.5%
27
Development
440 papers in training set
Top 4%
0.5%