A Benchmark of Evo2 Genomic AI Models for Efficient and Practical Deployment

Li, H.; Ji, H.; Zeng, Y.; Lv, W.; Wu, J.; Liu, S.; Lin, C.; Yang, H.; Li, Z.; Chen, Y.; Dong, W.

2025-09-12 genomics
10.1101/2025.09.10.675279 bioRxiv

The rapid advancement of DNA foundation language models has transformed genomics, making it possible to decipher intricate patterns and regulatory mechanisms embedded in DNA sequences. The genomic foundation model Evo2 demonstrates remarkable capabilities in decoding DNA functional patterns through cross-species pretraining. However, despite Evo2's great potential in basic genomics research, there is currently no clear, systematic guidance on its application scenarios, performance, and optimization directions in tumor genomics, and its performance dependency on specialized hardware (such as FP8 precision on H800 GPUs) has not been empirically benchmarked. Here, we present a focused validation of Evo2 using two independent cancer genomic datasets (Bladder Urothelial Carcinoma and Ovarian Cancer). We tested Evo2 on downstream tasks, including the prediction of tumor pathogenic variants and the prediction of mutational effects, and compared its performance on A100 and H800 GPUs. The results show the critical importance of FP8 precision, which enables the H800 to achieve 4x faster inference than the A100 with stable accuracy (AUC 0.88-0.95). The 7B-parameter model emerged as the top performer, whereas the 40B model experienced a severe performance drop (AUC falling to 0.48) on non-FP8 hardware such as the A100. These findings empirically validate Evo2's hardware specifications and provide practical insights for researchers implementing the model with similar computational resources. Furthermore, our findings provide a framework for applying and optimizing Evo2 on downstream tasks in cancer, and can guide researchers in applying it effectively in genomic studies.

Key Points

Hardware Precision Impact: FP8 precision on H800 GPUs is critical for Evo2's performance, enabling 4x faster inference than the A100 (which lacks FP8 support) while maintaining high accuracy (AUC 0.88-0.95).

Model Scale Optimization: The 7B-parameter model outperformed larger variants (e.g., 40B), which suffered severe accuracy drops (AUC as low as 0.48) on non-FP8 hardware, highlighting a balance between efficiency and performance.

Practical Guidelines: We provide a framework for deploying Evo2 in cancer genomics, including hardware recommendations, dataset curation, and downstream task optimization, which is valuable for researchers with varied computational resources.
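The AUC figures quoted above can be read as the probability that a randomly chosen pathogenic variant receives a higher score than a randomly chosen benign one, so 0.5 corresponds to chance and the reported 0.48 is at chance level. The following is a minimal sketch of that calculation in plain Python, not the authors' evaluation code. The function name `roc_auc` and the toy labels and scores are hypothetical and chosen only for illustration; how Evo2 produces per-variant scores is not described here and is not assumed.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """ROC AUC via pairwise comparison of positives and negatives.

    Counts the fraction of (positive, negative) pairs in which the
    positive has the higher score; tied pairs count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = pathogenic, 0 = benign; scores are made up.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.1]
auc = roc_auc(labels, scores)
print(f"AUC = {auc:.3f}")
```

The pairwise form is quadratic in the number of variants, which is acceptable for a small check; for large datasets a rank-based implementation such as the one in scikit-learn would be the usual choice.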

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

Rank | Journal | Papers in training set | Percentile | Probability
1 | Bioinformatics | 1061 | Top 2% | 13.9%
2 | GigaScience | 172 | Top 0.1% | 12.1%
3 | BMC Bioinformatics | 383 | Top 0.7% | 12.0%
4 | IEEE Transactions on Computational Biology and Bioinformatics | 17 | Top 0.1% | 6.6%
5 | Computational and Structural Biotechnology Journal | 216 | Top 0.7% | 6.2%
50% of probability mass above
6 | Frontiers in Genetics | 197 | Top 1% | 4.7%
7 | Briefings in Bioinformatics | 326 | Top 1% | 4.7%
8 | PLOS Computational Biology | 1633 | Top 9% | 3.9%
9 | Bioinformatics Advances | 184 | Top 2% | 3.5%
10 | Nature Communications | 4913 | Top 43% | 3.0%
11 | Genome Biology | 555 | Top 3% | 2.5%
12 | Genome Research | 409 | Top 2% | 2.3%
13 | BMC Genomics | 328 | Top 2% | 2.0%
14 | PLOS ONE | 4510 | Top 51% | 1.8%
15 | Cell Genomics | 162 | Top 4% | 1.6%
16 | NAR Genomics and Bioinformatics | 214 | Top 2% | 1.6%
17 | BMC Medical Genomics | 36 | Top 0.8% | 1.2%
18 | Scientific Reports | 3102 | Top 72% | 0.9%
19 | iScience | 1063 | Top 28% | 0.9%
20 | BioData Mining | 15 | Top 0.7% | 0.9%
21 | Nucleic Acids Research | 1128 | Top 18% | 0.7%
22 | Nature Machine Intelligence | 61 | Top 4% | 0.6%
23 | Genome Medicine | 154 | Top 9% | 0.6%
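
The statement that the top 5 journals account for 50% of the predicted probability mass can be checked by summing the listed probabilities in rank order. The sketch below does this for the first ten entries; the helper name `journals_to_reach` is hypothetical, and the values are copied from the list above.

```python
from itertools import accumulate
from typing import Optional, Sequence

# Predicted probabilities (in percent) for ranks 1-10, as listed above.
probs_percent = [13.9, 12.1, 12.0, 6.6, 6.2, 4.7, 4.7, 3.9, 3.5, 3.0]


def journals_to_reach(mass_percent: float, probs: Sequence[float]) -> Optional[int]:
    """Return how many top-ranked entries are needed for the running
    total to reach mass_percent, or None if the total never gets there."""
    for count, running in enumerate(accumulate(probs), start=1):
        if running >= mass_percent:
            return count
    return None


k = journals_to_reach(50.0, probs_percent)
print(k)
```

The running totals are 13.9, 26.0, 38.0, 44.6, and 50.8, so the threshold is first crossed at the fifth journal.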