An artificial intelligence-based model for prediction of Clonal Hematopoiesis mutants in cell-free DNA samples

Arango-Argoty, G.; Haghighi, M.; Sun, G. J.; Markovets, A.; Barrett, J. C.; Lai, Z.; Jacob, E.

2024-12-16 bioinformatics
10.1101/2024.12.11.627785 bioRxiv
Circulating tumor DNA is a critical biomarker in cancer diagnostics, but its accurate interpretation requires careful consideration of clonal hematopoiesis (CH), which can contribute variants to cell-free DNA (cfDNA) and potentially obscure true tumor-derived signals. Accurate detection of somatic variants of CH origin in plasma samples remains challenging in the absence of matched white blood cell sequencing. Here we present an open-source machine learning framework (MetaCHIP) that classifies variants in cfDNA from plasma-only samples as being of CH or tumor origin, surpassing state-of-the-art classification rates.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

| Rank | Journal | Papers in training set | Percentile | Probability |
|---|---|---|---|---|
| 1 | Genome Medicine | 154 | Top 0.1% | 40.6% |
| 2 | Nature Communications | 4913 | Top 21% | 8.7% |
| 3 | Scientific Reports | 3102 | Top 26% | 4.4% |
| | 50% of probability mass above | | | |
| 4 | Clinical Chemistry | 22 | Top 0.2% | 3.2% |
| 5 | Nucleic Acids Research | 1128 | Top 7% | 2.7% |
| 6 | Advanced Science | 249 | Top 7% | 2.7% |
| 7 | Briefings in Bioinformatics | 326 | Top 3% | 2.4% |
| 8 | Communications Biology | 886 | Top 4% | 2.4% |
| 9 | Nature Machine Intelligence | 61 | Top 1% | 2.1% |
| 10 | PLOS ONE | 4510 | Top 49% | 1.9% |
| 11 | Bioinformatics | 1061 | Top 7% | 1.9% |
| 12 | PLOS Computational Biology | 1633 | Top 17% | 1.5% |
| 13 | Genome Biology | 555 | Top 5% | 1.4% |
| 14 | iScience | 1063 | Top 21% | 1.3% |
| 15 | NAR Genomics and Bioinformatics | 214 | Top 3% | 1.0% |
| 16 | BMC Bioinformatics | 383 | Top 6% | 1.0% |
| 17 | Frontiers in Genetics | 197 | Top 8% | 0.8% |
| 18 | Cancer Research Communications | 46 | Top 1.0% | 0.8% |
| 19 | Computational and Structural Biotechnology Journal | 216 | Top 8% | 0.8% |
| 20 | International Journal of Molecular Sciences | 453 | Top 15% | 0.8% |
| 21 | npj Precision Oncology | 48 | Top 1% | 0.8% |
| 22 | Frontiers in Immunology | 586 | Top 8% | 0.7% |
| 23 | Cell Reports Methods | 141 | Top 5% | 0.7% |
| 24 | BMC Medical Genomics | 36 | Top 2% | 0.7% |
| 25 | EMBO Molecular Medicine | 85 | Top 6% | 0.5% |
| 26 | GigaScience | 172 | Top 4% | 0.5% |
| 27 | eLife | 5422 | Top 63% | 0.5% |
| 28 | Proceedings of the National Academy of Sciences | 2130 | Top 48% | 0.5% |
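
The 50% cutoff stated above can be checked with a running sum over the listed probabilities. The following is a minimal sketch, not the site's own method: the function name `journals_to_reach` is hypothetical, and the values are the first five probabilities copied from the ranking.

```python
from itertools import accumulate

# Predicted probabilities (in percent) for the five top-ranked journals,
# copied from the ranking above.
probs = [40.6, 8.7, 4.4, 3.2, 2.7]


def journals_to_reach(probs, threshold=50.0):
    """Return how many top-ranked entries are needed before the running
    total reaches `threshold`, or None if it is never reached."""
    for rank, total in enumerate(accumulate(probs), start=1):
        if total >= threshold:
            return rank
    return None


print(journals_to_reach(probs))
```

With these values the running total is about 49.3% after two journals and about 53.7% after three, so the third journal is the first to carry the sum past 50%.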