Back

BGC-QUAST: a quality assessment tool for genome mining software

Kushnareva, A.; Tupikina, D.; Almessady, H.; McHardy, A.; Gurevich, A.

2026-05-07 bioinformatics
10.64898/2026.05.04.722653 bioRxiv
Show abstract

SummaryBiosynthetic gene clusters (BGCs) encode microbial natural products, many of which have important ecological and biomedical roles. Genome mining tools enable large-scale BGC prediction, but their outputs differ substantially, complicating comparison and interpretation. We present BGC-QUAST, a framework for evaluating and comparing BGC predictions across three analysis modes: comparison across samples, assessment of BGC recovery in draft assemblies relative to reference genomes, and comparison of predictions from different tools using overlap analysis. BGC-QUAST provides standardized metrics, interactive visualizations, and integrated outputs for joint inspection of predictions, enabling the comprehensive comparison of genome mining results and facilitating sample prioritisation based on biosynthetic potential. Availability and implementationBGC-QUAST is publicly available at https://github.com/gurevichlab/bgc-quast

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 0.6%
34.2%
2
BMC Bioinformatics
383 papers in training set
Top 1%
7.2%
3
Nucleic Acids Research
1128 papers in training set
Top 3%
6.8%
4
Genome Biology
555 papers in training set
Top 1%
6.3%
50% of probability mass above
5
Nature Biotechnology
147 papers in training set
Top 2%
4.8%
6
Microbial Genomics
204 papers in training set
Top 0.6%
3.7%
7
Bioinformatics Advances
184 papers in training set
Top 2%
3.1%
8
NAR Genomics and Bioinformatics
214 papers in training set
Top 0.9%
3.1%
9
PLOS Computational Biology
1633 papers in training set
Top 11%
2.9%
10
GigaScience
172 papers in training set
Top 0.8%
2.6%
11
Briefings in Bioinformatics
326 papers in training set
Top 3%
2.4%
12
mSystems
361 papers in training set
Top 4%
2.4%
13
PLOS ONE
4510 papers in training set
Top 56%
1.5%
14
Microbiome
139 papers in training set
Top 2%
1.5%
15
Nature Methods
336 papers in training set
Top 5%
1.2%
16
Nature Protocols
30 papers in training set
Top 0.1%
1.2%
17
Nature Communications
4913 papers in training set
Top 58%
1.1%
18
Genome Research
409 papers in training set
Top 3%
0.9%
19
mSphere
281 papers in training set
Top 5%
0.9%
20
Genome Medicine
154 papers in training set
Top 8%
0.8%
21
PeerJ
261 papers in training set
Top 13%
0.8%
22
Cell Reports Methods
141 papers in training set
Top 5%
0.8%
23
Frontiers in Microbiology
375 papers in training set
Top 9%
0.7%
24
Scientific Data
174 papers in training set
Top 3%
0.6%
25
BMC Genomics
328 papers in training set
Top 7%
0.6%