Back

Multi-omics Differential Inference for Functional Interpretation (MoDIFI): A Statistical Framework to Prioritize Cell Lines for Neurodevelopmental Variants

VR, A.; Shaw, G. T.-W.; Manuel, J.; Mosbruger, T. L.; Heins, H.; Ng, J. K.; Kim, H.; Hayeck, T. J.; Turner, T. N.

2026-01-29 genomics
10.64898/2026.01.29.702065 bioRxiv
Show abstract

Noncoding variants contribute to neurodevelopmental disorders (NDDs), but their regulatory effects are often cell-type specific, making it difficult to choose an in vitro model for high-throughput assays such as massively parallel reporter assays. We asked: given a set of noncoding variants, which cell line and regulatory regions are most likely to reveal measurable allele-specific effects? We generated matched multiomics profiles across commonly used NDD in vitro models: human neuronal lines (i.e., IMR-32, SH-SY5Y, SK-N-SH), mouse neuronal lines (i.e., HT-22, Neuro-2a), and a non-neuronal line (i.e., HEK-293), using RNA-seq, ATAC-seq, and Hi-C under consistent conditions. To integrate these orthogonal data types, we developed MoDIFI (Multi-omics Differential Inference for Functional Interpretation), a Bayesian framework that quantifies cell-line-specific regulatory activity by computing posterior inclusion probabilities (PIPs) for differential gene-loop interactions. MoDIFI identifies regulatory regions supported by coordinated 3D contacts, accessibility, and transcriptional output, producing cell-line-resolved regulatory maps that highlight both shared synaptic programs and context-dependent mechanisms. These results provide a practical strategy for prioritizing the most informative cell lines and candidate regulatory elements for targeted functional testing of NDD-relevant noncoding variation.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Cell Genomics
162 papers in training set
Top 0.1%
14.2%
2
The American Journal of Human Genetics
206 papers in training set
Top 0.4%
10.3%
3
Genome Medicine
154 papers in training set
Top 0.5%
10.0%
4
Nature Genetics
240 papers in training set
Top 0.9%
8.3%
5
Genome Biology
555 papers in training set
Top 2%
4.8%
6
Genome Research
409 papers in training set
Top 0.6%
4.8%
50% of probability mass above
7
Nature Communications
4913 papers in training set
Top 35%
4.2%
8
Nucleic Acids Research
1128 papers in training set
Top 5%
3.9%
9
Nature Methods
336 papers in training set
Top 3%
3.0%
10
Bioinformatics
1061 papers in training set
Top 6%
3.0%
11
Cell Reports Methods
141 papers in training set
Top 1%
2.7%
12
Nature Biotechnology
147 papers in training set
Top 3%
2.6%
13
Science
429 papers in training set
Top 12%
2.0%
14
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
15
Science Translational Medicine
111 papers in training set
Top 3%
1.6%
16
Nature
575 papers in training set
Top 11%
1.6%
17
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.5%
18
Nature Machine Intelligence
61 papers in training set
Top 2%
1.3%
19
Nature Computational Science
50 papers in training set
Top 1%
1.2%
20
Nature Neuroscience
216 papers in training set
Top 5%
1.1%
21
Molecular Systems Biology
142 papers in training set
Top 1%
0.9%
22
PLOS Genetics
756 papers in training set
Top 13%
0.9%
23
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.8%
24
Frontiers in Genetics
197 papers in training set
Top 9%
0.8%
25
Cell Systems
167 papers in training set
Top 11%
0.8%
26
PLOS Computational Biology
1633 papers in training set
Top 25%
0.7%