Back

PEPR-GNN: Perturbation-Enhancer-Promoter-RNA Graph Neural Networks for Multiome Perturb-Seq modeling of regulomes

Markham, Z. E.; Li, B.; Nguyen, L.; Wang, L.; Munshi, N. V.; Hon, G. C.

2026-05-06 genomics
10.64898/2026.05.05.722311 bioRxiv
Show abstract

Cellular reprogramming is a complex interplay between perturbations and regulatory elements, culminating in gene expression changes. Current computational approaches do not explicitly model these regulatory interactions. Here, we performed combinatorial reprogramming with cardiac transcription factors, followed by Multiome Perturb-Seq to measure perturbations, open chromatin, and gene expression in individual cells. We then developed PEPR-GNN (Perturbation-Enhancer-Promoter-RNA Graph Neural Network), a theoretical and computational framework to model regulome responses during complex genetic perturbations. By statistically associating gene regulatory relationships, PEPR-GNN organizes genes into regulomes with shared gene regulatory responses to reprogramming, including easy-to-reprogram cardiac genes, difficult-to-reprogram fibroblast genes, and context-specific genes where the impact of a reprogramming factor depends on the presence of others. Finally, we use PEPR-GNN for in silico modeling of how genetic modifications of enhancers can be used to tune gene responses to reprogramming. Overall, through the use of causal perturbation information and an enhancer-aware regulome model of gene regulation, PEPR-GNN can effectively model complex cellular responses to perturbation. HighlightsO_LIMultiome Perturb-Seq of GHMT reprogramming in MEFs with RNA/ATAC-Seq readout. C_LIO_LIPEPR-GNN: a computational framework to model perturbation-induced regulomes. C_LIO_LIPEPR-GNN aids the interpretation of regulomes by diverse reprogramming responses. C_LIO_LIPEPR-GNN enables in silico perturbation to tune gene responses to reprogramming. C_LI

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
14.5%
2
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.1%
14.2%
3
PLOS Computational Biology
1633 papers in training set
Top 3%
10.0%
4
Frontiers in Genetics
197 papers in training set
Top 0.9%
6.2%
5
Briefings in Bioinformatics
326 papers in training set
Top 1%
4.3%
6
Nature Communications
4913 papers in training set
Top 36%
4.3%
50% of probability mass above
7
Bioinformatics Advances
184 papers in training set
Top 1.0%
4.1%
8
Nucleic Acids Research
1128 papers in training set
Top 6%
3.5%
9
iScience
1063 papers in training set
Top 6%
3.0%
10
Genome Biology
555 papers in training set
Top 3%
2.7%
11
Genome Research
409 papers in training set
Top 2%
2.1%
12
Scientific Reports
3102 papers in training set
Top 51%
2.1%
13
npj Systems Biology and Applications
99 papers in training set
Top 0.9%
1.9%
14
Development
440 papers in training set
Top 1%
1.9%
15
BMC Bioinformatics
383 papers in training set
Top 4%
1.8%
16
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.6%
17
Cell Reports Methods
141 papers in training set
Top 3%
1.5%
18
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 35%
1.5%
19
Communications Biology
886 papers in training set
Top 15%
1.2%
20
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 6%
1.2%
21
PLOS Genetics
756 papers in training set
Top 12%
1.1%
22
Patterns
70 papers in training set
Top 2%
0.9%
23
Nature Machine Intelligence
61 papers in training set
Top 3%
0.8%
24
eLife
5422 papers in training set
Top 59%
0.7%
25
Cell Genomics
162 papers in training set
Top 7%
0.7%
26
PLOS ONE
4510 papers in training set
Top 69%
0.7%
27
Nature Methods
336 papers in training set
Top 7%
0.7%
28
Cell Reports
1338 papers in training set
Top 35%
0.7%
29
BMC Genomics
328 papers in training set
Top 7%
0.6%