Back

Tumour marker analysis using a machine learning assisted vibrational spectroscopy approach

Fatayer, R.; Sammut, S.-J.; Senthil Murugan, G.

2026-03-31 biochemistry
10.64898/2026.03.27.714840 bioRxiv
Show abstract

Tumour biomarkers such as CA125, CA15-3, CA19-9, AFP and CEA are routinely used in the oncology clinic to diagnose cancer, monitor response to therapy, and detect relapse. However, their quantification depends on immunoassay-based methods that are time-consuming, reagent-dependent, and poorly suited to resource-limited settings. Here, we present a machine learning-assisted ATR-FTIR spectroscopy approach for label-free tumour biomarker analysis to enable simple and rapid quantification at the bedside. Using principal component analysis (PCA), we first demonstrate that these five clinically relevant biomarkers are spectrally separable, with the protein-associated region (1200-1700 cm-1) providing the greatest discriminative information. We then develop partial least squares regression (PLSR) models to quantify CA125 in phosphate-buffered saline (R2 = 0.95) and in human serum across a clinically relevant concentration range, achieving reliable predictions at and above the clinical decision threshold of 35 U/mL. A semi-quantitative classification model further demonstrated robust identification of elevated CA125, with a macro-average sensitivity of 0.86 and specificity of 0.92. These results support ATR-FTIR spectroscopy as a rapid, reagent-free platform for cancer biomarker monitoring, with potential utility in resource-limited settings. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=109 SRC="FIGDIR/small/714840v1_ufig1.gif" ALT="Figure 1"> View larger version (27K): org.highwire.dtl.DTLVardef@1be9c03org.highwire.dtl.DTLVardef@f49e5eorg.highwire.dtl.DTLVardef@1c93e39org.highwire.dtl.DTLVardef@1141e6f_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 12 journals account for 50% of the predicted probability mass.

1
Analytical Chemistry
205 papers in training set
Top 0.4%
7.3%
2
Photoacoustics
11 papers in training set
Top 0.1%
6.4%
3
ACS Sensors
45 papers in training set
Top 0.2%
6.4%
4
Scientific Reports
3102 papers in training set
Top 18%
6.4%
5
Biosensors and Bioelectronics
52 papers in training set
Top 0.3%
4.2%
6
Advanced Science
249 papers in training set
Top 4%
4.0%
7
Analytica Chimica Acta
17 papers in training set
Top 0.2%
3.1%
8
Journal of the American Chemical Society
199 papers in training set
Top 2%
3.1%
9
ACS Omega
90 papers in training set
Top 0.8%
2.6%
10
PLOS ONE
4510 papers in training set
Top 45%
2.6%
11
Nature Communications
4913 papers in training set
Top 46%
2.1%
12
Chemical Communications
24 papers in training set
Top 0.3%
2.1%
50% of probability mass above
13
ACS Applied Materials & Interfaces
39 papers in training set
Top 0.4%
2.1%
14
Communications Chemistry
39 papers in training set
Top 0.2%
1.7%
15
iScience
1063 papers in training set
Top 14%
1.7%
16
Water Research
74 papers in training set
Top 1.0%
1.5%
17
The Analyst
15 papers in training set
Top 0.3%
1.5%
18
ChemBioChem
50 papers in training set
Top 0.6%
1.5%
19
eLife
5422 papers in training set
Top 47%
1.3%
20
Angewandte Chemie International Edition
81 papers in training set
Top 2%
1.3%
21
ACS Central Science
66 papers in training set
Top 1%
1.2%
22
Talanta
12 papers in training set
Top 0.5%
1.2%
23
Cancers
200 papers in training set
Top 4%
1.1%
24
Journal of Biomedical Optics
25 papers in training set
Top 0.6%
0.8%
25
JACS Au
35 papers in training set
Top 0.9%
0.8%
26
Nano Letters
63 papers in training set
Top 3%
0.8%
27
Frontiers in Molecular Biosciences
100 papers in training set
Top 5%
0.8%
28
Chemical Science
71 papers in training set
Top 2%
0.8%
29
The Journal of Physical Chemistry Letters
58 papers in training set
Top 2%
0.8%
30
Optica
25 papers in training set
Top 0.8%
0.7%