Back

Corpus-wide causality: Algorithm design & application for aggregating gene-disease causal evidence

Bansal, N.; Parsodkar, A. P.; Pathak, A.; Narayanan, M.

2026-05-12 bioinformatics
10.64898/2026.05.08.723796 bioRxiv
Show abstract

Identifying causal relationships and distinguishing them from associations is a central scientific endeavor with many applications; knowing causal links between genes and diseases, for instance, can focus drug discovery on curing diseases beyond just symptom management. Despite several studies on automatically extracting relations between entities from large biomedical literature corpora like PubMed, only a few studies extract causal relations from abstracts and even fewer summarize corpus-level evidence for causal links. Recently, Large Language Models (LLMs) have been increasingly deployed to summarize biomedical information and extract relations; however, there is a distinct lack of explicit benchmarking comparing these generalized LLM-based methods against specialized, domain-aware frameworks for corpus-wide causal inference. In this work, we develop a method to infer Corpus-Wide Causal Score (CWCS) of a gene-disease (G-D) pair by integrating two pieces of evidence: (i) network-based causal signals in a prior gene regulatory network, quantified as a CWCS-Net score using an existing multilayer network centrality algorithm; and (ii) corpus-wide literature evidence, quantified as a CWCS-TD (TD for Truth Discovery) score using a newly-developed TD algorithm. Our CWCS-TD (scoring) algorithm jointly and iteratively estimates causal scores for multiple G-D pairs while modeling the reliability of PubMed abstracts co-mentioning them; and represents an advance in the field of TD algorithms due to its incorporation of bibliometric features of publications to address the challenge of sparsity of abstracts that assert a G-D causal relation. Using OMIM as an external expert-curated reference to evaluate classifications of G-D pairs as causal or not, our CWCS method achieved a causal class F1 score of 0.600 across ten diseases, outperforming both LLMs, GPT-4o and MMed-Llama 3 (this performance trend also persists when using area under the precision-recall curve as the evaluation metric). Both LLMs exhibit high recall accompanied by comparatively low precision, resulting in lower causal class F1 scores (0.505 for GPT-4o and 0.522 for MMed-Llama 3) due to large number of false positive predictions. Taken together, these evaluations and other ablation studies show the promise of our carefully designed algorithm in collating and integrating evidence of biomedical causal relations from both network- and literature-based sources, thereby supporting its broader applicability.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 0.5%
37.8%
2
BMC Bioinformatics
383 papers in training set
Top 2%
6.4%
3
Briefings in Bioinformatics
326 papers in training set
Top 1%
4.9%
4
Bioinformatics Advances
184 papers in training set
Top 0.7%
4.9%
50% of probability mass above
5
Frontiers in Genetics
197 papers in training set
Top 2%
4.0%
6
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
3.6%
7
Database
51 papers in training set
Top 0.2%
2.6%
8
Journal of Biomedical Informatics
45 papers in training set
Top 0.6%
2.4%
9
NAR Genomics and Bioinformatics
214 papers in training set
Top 1%
2.4%
10
Advanced Science
249 papers in training set
Top 9%
2.1%
11
Nucleic Acids Research
1128 papers in training set
Top 10%
1.8%
12
PLOS ONE
4510 papers in training set
Top 53%
1.7%
13
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 4%
1.5%
14
Genome Biology
555 papers in training set
Top 5%
1.5%
15
Scientific Reports
3102 papers in training set
Top 64%
1.3%
16
International Journal of Molecular Sciences
453 papers in training set
Top 10%
1.2%
17
Nature Communications
4913 papers in training set
Top 59%
0.9%
18
Genome Medicine
154 papers in training set
Top 7%
0.9%
19
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.9%
20
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.8%
21
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 45%
0.7%
22
BioData Mining
15 papers in training set
Top 0.9%
0.7%
23
Patterns
70 papers in training set
Top 2%
0.7%
24
Communications Biology
886 papers in training set
Top 26%
0.7%
25
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.7%
0.7%
26
npj Systems Biology and Applications
99 papers in training set
Top 3%
0.6%