Back

EVA: a Foundation Model Advancing Translational Drug Development in Immuno-Inflammation

Fouche, A.; Bruley, A.; Corney, M.; Marschall, P.; Bouget, V.; Duquesne, J.

2025-05-07 bioinformatics
10.1101/2025.05.02.651839 bioRxiv
Show abstract

Drug development is a lengthy and high-risk process, with most investigational drug candidates failing in phase II randomized clinical trials (RCT) due to insufficient efficacy. It makes early prediction of trial outcomes crucial for reducing attrition and guiding strategic decisions, especially in immunology and inflammation (I&I) diseases. Herein, we present EVA, the first pre-trained foundation model in complex inflammatory diseases tailored to support drug development. EVA learns generalizable patterns from large-scale data of cell biology and immunology, enabling superior predictive performance and generalization compared to traditional approaches. EVA is pre-trained on tens of millions of single-cell RNA-seq samples and tens of thousands of bulk RNA-seq samples from I&I diseases patients, enabling it to learn disease-relevant transcriptomic patterns in this therapeutic area. By fine-tuning EVA in few-shot settings on both preclinical (mouse) and clinical (human) data and harnessing its wide pre-training knowledge, EVA predicts drug responses in I&I with high precision at both cohort and patient levels, as illustrated by accurate forecasting of anti-TNF therapeutic activity in ulcerative colitis. By deciphering its decision process, we further highlight that EVAs ability to stratify patients based on predicted drug response can also be leveraged to discover drug response biomarkers as early as preclinical stages. EVAs applications in precision immunology encompass therapeutic target validation prior to clinical entry, identification of patient subpopulations most likely to benefit from treatment, and comparative efficacy analysis against competitor compounds. EVAs versatility makes it an invaluable tool for strategic decision-making throughout the drug development pipeline: by leveraging it to prioritize the most promising drug candidates and optimize RCT designs, it can contribute to reduce late-stage failures and accelerate the delivery of effective therapies. Overall, this work represents a significant advancement in utilizing a pre-trained foundation model for precision drug development in complex inflammatory diseases. Graphical abstractEVA is a pre-trained foundation model specific to immune-mediated inflammatory diseases. It enables the prediction of therapeutic efficacy in patients leveraging data from preclinical disease models. O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=73 SRC="FIGDIR/small/651839v1_ufig1.gif" ALT="Figure 1"> View larger version (29K): org.highwire.dtl.DTLVardef@1c783f1org.highwire.dtl.DTLVardef@1a760d9org.highwire.dtl.DTLVardef@1c7748aorg.highwire.dtl.DTLVardef@1b44f82_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Advanced Science
249 papers in training set
Top 0.6%
14.6%
2
Patterns
70 papers in training set
Top 0.1%
12.2%
3
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.2%
10.0%
4
Briefings in Bioinformatics
326 papers in training set
Top 0.7%
6.8%
5
Bioinformatics
1061 papers in training set
Top 5%
4.3%
6
npj Systems Biology and Applications
99 papers in training set
Top 0.5%
3.6%
50% of probability mass above
7
Nature Machine Intelligence
61 papers in training set
Top 1.0%
3.6%
8
Cell Genomics
162 papers in training set
Top 2%
2.3%
9
iScience
1063 papers in training set
Top 9%
2.3%
10
Cell Reports Medicine
140 papers in training set
Top 3%
2.1%
11
PLOS Computational Biology
1633 papers in training set
Top 15%
1.8%
12
Science Advances
1098 papers in training set
Top 18%
1.7%
13
Nature Communications
4913 papers in training set
Top 52%
1.7%
14
Frontiers in Immunology
586 papers in training set
Top 4%
1.7%
15
Cell Systems
167 papers in training set
Top 7%
1.7%
16
GigaScience
172 papers in training set
Top 2%
1.5%
17
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 36%
1.3%
18
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.3%
19
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 4%
1.2%
20
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 1%
1.2%
21
npj Digital Medicine
97 papers in training set
Top 3%
1.2%
22
Scientific Reports
3102 papers in training set
Top 68%
1.1%
23
Genome Medicine
154 papers in training set
Top 7%
0.9%
24
Nucleic Acids Research
1128 papers in training set
Top 16%
0.9%
25
Journal of Chemical Information and Modeling
207 papers in training set
Top 3%
0.9%
26
International Journal of Molecular Sciences
453 papers in training set
Top 13%
0.9%
27
Bioinformatics Advances
184 papers in training set
Top 4%
0.8%
28
eLife
5422 papers in training set
Top 58%
0.7%
29
Communications Biology
886 papers in training set
Top 24%
0.7%
30
Frontiers in Genetics
197 papers in training set
Top 10%
0.7%