Back

Machine learning approach to assess the pathogenicity of BRCA1/2 genetic variants : brca-NOVUS

Vatsyayan, A.; Scaria, V.

2023-10-20 health informatics
10.1101/2023.10.20.23297295
Show abstract

Breast cancer is globally the leading type of cancer in terms of both incidence and mortality. BRCA1 and BRCA2 gene variants have long been linked to and studied in context of the disease. Rapid variant discovery has further been made freely accessible by advances in Next-generation sequencing, making it a demanding task to accurately interpret these variants for clinical and research applications. To establish the nature of these variants, the American College of Medical Genetics and Genomics and the Association of Molecular Pathologists (ACMG-AMP) have issued a set of guidelines for variant classification. However, given the huge number of variants associated with the two large and well-studied genes, functional studies or ACMG-AMP classification is a mountainous challenge. Here we describe brca-NOVUS, a machine learning approach trained on a gold-standard ACMG-qualified dataset for the accurate interpretation of variants at large scale. Using two independent test and validation datasets of ACMG-qualified variants, we show that brca-NOVUS can be used to for the classification of variants in clinical as well as research settings.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Scientific Reports
based on 701 papers
Top 13%
11.0%
2
Nature Communications
based on 483 papers
Top 8%
10.0%
3
Journal of Biomedical Informatics
based on 37 papers
Top 0.9%
7.5%
4
PLOS ONE
based on 1737 papers
Top 59%
7.5%
5
BMC Medical Genomics
based on 12 papers
Top 0.1%
4.4%
6
BMC Medical Informatics and Decision Making
based on 36 papers
Top 3%
4.4%
7
JCO Clinical Cancer Informatics
based on 14 papers
Top 0.6%
4.4%
8
Nature Medicine
based on 88 papers
Top 2%
3.8%
50% of probability mass above
9
iScience
based on 74 papers
Top 1%
2.8%
10
Bioinformatics
based on 24 papers
Top 0.6%
2.4%
11
JAMIA Open
based on 35 papers
Top 4%
2.4%
12
Genome Medicine
based on 56 papers
Top 4%
1.7%
13
Journal of the American Medical Informatics Association
based on 53 papers
Top 5%
1.6%
14
Computers in Biology and Medicine
based on 39 papers
Top 5%
1.3%
15
JMIR Medical Informatics
based on 16 papers
Top 4%
1.3%
16
Communications Medicine
based on 63 papers
Top 2%
1.3%
17
BMJ Health & Care Informatics
based on 13 papers
Top 3%
1.2%
18
Journal of Personalized Medicine
based on 17 papers
Top 1%
1.2%
19
Genetics in Medicine
based on 57 papers
Top 5%
1.2%
20
npj Digital Medicine
based on 85 papers
Top 11%
1.2%
21
Cell Genomics
based on 34 papers
Top 3%
1.2%
22
Cancers
based on 57 papers
Top 7%
0.8%
23
PLOS Computational Biology
based on 141 papers
Top 10%
0.8%
24
Communications Biology
based on 36 papers
Top 5%
0.8%
25
Patterns
based on 15 papers
Top 3%
0.8%
26
Frontiers in Artificial Intelligence
based on 11 papers
Top 2%
0.8%
27
International Journal of Medical Informatics
based on 25 papers
Top 6%
0.7%
28
Biology Methods and Protocols
based on 19 papers
Top 4%
0.7%
29
eLife
based on 262 papers
Top 35%
0.7%