Back

Learning from Drops: AI-Guided Integration of Liquid Biopsy Features in Cancer Studies

Andueza, M.; Villoslada-Blanco, P.; De Dreuille, B.; Alonso, L.; Sabroso-Lasa, S.; Pantel, K.; Alix-Panabieres, C.; Lopez de Maturana, E.; Malats, N.

2026-05-17 bioinformatics
10.64898/2026.05.12.724535 bioRxiv
Show abstract

Cancer is a major global health issue with rising incidence and mortality. Early detection, tumor characterization, and disease surveillance are crucial for timely and effective treatment, ultimately reducing mortality rates. Liquid biopsy (LB) has emerged as a valuable detection tool offering a non-invasive method to determine tumor-derived biomarkers in body fluids with demonstrated translational potential. To increase biomarker sensitivity, high-throughput sequencing platforms deliver massive volumes of data. Artificial Intelligence (AI) is pivotal in enabling huge and complex data integration. This contribution aims to assess the current state of integrative AI-based research in the LB field and provide methodological guidance. First, we conducted a PubMed search and found that the literature is sparse in studies integrating LB features, particularly by applying AI. When adopting the latter approach, defining the study objectives is crucial to guide the subsequent methodological aspects, including study design, patient selection criteria, sample size, nature of the LB features, and metadata to collect. Specifically, we propose strategies and tools for data preprocessing, including normalization and batch correction, as well as handling outliers and missing data. Furthermore, we recommend various Machine/Deep Learning approaches for feature selection techniques to ensure model robustness, and we highlight the importance of undergoing rigorous internal and external validations of the selected models. Assessing clinical utility and interpretability is often overlooked but fundamental for real-world implementation. In conclusion, we provide the LB scientific community with an AI-based methodological guidance to bridge the two fields and enhance the integrative analysis of LB features. Graphical abstractWorkchart for multiomics integrative studies in the liquid biopsy field. Note: CTCs, circulating tumor cells; ctDNA, circulating tumor-DNA; TEPs, tumor-educated platelets; miRNA, microRNA; cfRNAs, cell-free RNAs. O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=159 SRC="FIGDIR/small/724535v1_ufig1.gif" ALT="Figure 1"> View larger version (45K): org.highwire.dtl.DTLVardef@1f250b2org.highwire.dtl.DTLVardef@18fe36corg.highwire.dtl.DTLVardef@19c02b9org.highwire.dtl.DTLVardef@176f6e0_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.1%
14.2%
2
PLOS ONE
4510 papers in training set
Top 28%
6.3%
3
Briefings in Bioinformatics
326 papers in training set
Top 0.9%
6.3%
4
International Journal of Molecular Sciences
453 papers in training set
Top 2%
3.9%
5
GigaScience
172 papers in training set
Top 0.4%
3.9%
6
Heliyon
146 papers in training set
Top 0.3%
3.9%
7
PeerJ
261 papers in training set
Top 3%
3.5%
8
Frontiers in Genetics
197 papers in training set
Top 2%
3.5%
9
Scientific Reports
3102 papers in training set
Top 38%
3.5%
10
BMC Bioinformatics
383 papers in training set
Top 3%
3.0%
50% of probability mass above
11
Biology Methods and Protocols
53 papers in training set
Top 0.4%
2.7%
12
Life
27 papers in training set
Top 0.1%
1.9%
13
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.7%
14
PROTEOMICS
35 papers in training set
Top 0.4%
1.7%
15
Frontiers in Bioinformatics
45 papers in training set
Top 0.4%
1.3%
16
PLOS Computational Biology
1633 papers in training set
Top 19%
1.3%
17
iScience
1063 papers in training set
Top 20%
1.3%
18
Clinical Chemistry
22 papers in training set
Top 0.5%
1.3%
19
npj Systems Biology and Applications
99 papers in training set
Top 1%
1.3%
20
Cancer Research Communications
46 papers in training set
Top 0.7%
1.2%
21
Journal of Translational Medicine
46 papers in training set
Top 2%
1.2%
22
Bioinformatics
1061 papers in training set
Top 8%
1.2%
23
Computational Biology and Chemistry
23 papers in training set
Top 0.2%
1.2%
24
Patterns
70 papers in training set
Top 2%
1.2%
25
Bioinformatics Advances
184 papers in training set
Top 4%
1.2%
26
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
1.2%
27
Frontiers in Molecular Biosciences
100 papers in training set
Top 4%
0.9%
28
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 5%
0.9%
29
Analytical Chemistry
205 papers in training set
Top 2%
0.9%
30
BMC Genomics
328 papers in training set
Top 5%
0.8%