Back

Machine Learning-Augmented Analysis of Nano-electrochemical Sensor Data for Predictive and Quantitative Assays of Complex Biological Samples

Gupta, C.; Aryasomayajula, K.; Fischer, S. R.; Jia, Z.; Chin, N.; Nguyen, H.; Baig, Z.; Hui, J.; Howe, R. T.; Baldoni, J. M.; Quevy, E.

2025-12-01 bioengineering
10.1101/2025.11.27.691014 bioRxiv
Show abstract

Analytical chemistry provides the content of nearly every scientific, technical and business decision relating to what atoms, molecules and devices are and how they interact in their environment. Various workflows come together in describing those interactions--separation, identification, quantitation, classification--to inform decisions across society. Each workflow has associated equipment, infrastructure and subject matter experts that integrate results for input into those decisions. However, those existing workflows are constrained by their evolution and often require integrated applications of the multiple analytical techniques integrated to form conclusions, thereby increasing cost, complexity and quality. This paper describes an alternative approach to addressing the many limitations of contemporary analytical chemistry. Here machine learning is leveraged to analyze high dimensional data derived from a nano-electrochemical sensor platform, to perform classification, identification and quantitation analyses in an integrative manner from a single-shot measurement. The approach described here for biological samples does not require sample-specific preparation or marker-specific probes and labels to identify and/or quantitate the analyte of interest, thereby liberating the physical measurement of the sample from chemicals and workflows that are tied to the underlying biochemical hypothesis. The nano-electrochemical sensor, as described in a prior publication, transduces the vibrational frequencies of multiple species in the sample into a collection of discrete electronic signatures that represent the sample in a high dimensional space, while using only 2-4{micro}l sample volume. The sample analysis relies on training data to identify features in the high dimensional input dataset that are unique to the molecule or phenotype under consideration and distinct from the sample background matrix and/or a set of defined sample controls. The molecule- or phenotype-associated features can be tracked with clinically identified disease burden in the patient or with biomolecule concentration in the biological sample. The identification of patients with dementia from a plasma test is outlined here to demonstrate the applicability of this approach to classifying phenotypes associated with complex neurological disorders in a marker-agnostic manner. This approach is applied to a specific and quantitative assay of insulin Humalog and insulin Toujeo in a batch that comprises of a mixture of the two peptides, where Toujeo differs from Humalog in three amino acids residues only. We also describe the development of an assay for IL-6 in spiked human plasma samples with this method, thereby demonstrating the applicability of this analysis paradigm to more complex samples and protein-like species.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Analytical Chemistry
205 papers in training set
Top 0.2%
12.8%
2
Biosensors and Bioelectronics
52 papers in training set
Top 0.1%
10.3%
3
The Analyst
15 papers in training set
Top 0.1%
8.6%
4
PLOS ONE
4510 papers in training set
Top 24%
7.0%
5
Analytical and Bioanalytical Chemistry
17 papers in training set
Top 0.1%
7.0%
6
ACS Sensors
45 papers in training set
Top 0.2%
6.5%
50% of probability mass above
7
Scientific Reports
3102 papers in training set
Top 34%
3.7%
8
Analytica Chimica Acta
17 papers in training set
Top 0.1%
3.1%
9
Sensors
39 papers in training set
Top 0.8%
1.9%
10
Advanced Science
249 papers in training set
Top 10%
1.7%
11
Journal of the American Society for Mass Spectrometry
33 papers in training set
Top 0.2%
1.7%
12
ACS Omega
90 papers in training set
Top 2%
1.7%
13
ACS Applied Bio Materials
21 papers in training set
Top 0.6%
1.1%
14
npj Digital Medicine
97 papers in training set
Top 3%
0.9%
15
Talanta
12 papers in training set
Top 0.6%
0.9%
16
Journal of Virological Methods
36 papers in training set
Top 0.6%
0.8%
17
Advanced Materials
53 papers in training set
Top 2%
0.8%
18
Diagnostics
48 papers in training set
Top 2%
0.8%
19
ACS Nano
99 papers in training set
Top 3%
0.8%
20
IEEE/ACM Transactions on Computational Biology and Bioinformatics
32 papers in training set
Top 0.5%
0.8%
21
Analytical Biochemistry
26 papers in training set
Top 0.2%
0.8%
22
Journal of Biomedical Optics
25 papers in training set
Top 0.6%
0.8%
23
Chemosphere
15 papers in training set
Top 0.5%
0.8%
24
Lab on a Chip
88 papers in training set
Top 1%
0.8%
25
Journal of Open Source Software
22 papers in training set
Top 0.2%
0.7%
26
Frontiers in Digital Health
20 papers in training set
Top 2%
0.7%
27
Computational and Structural Biotechnology Journal
216 papers in training set
Top 10%
0.7%
28
Frontiers in Molecular Biosciences
100 papers in training set
Top 6%
0.7%
29
Bioinformatics
1061 papers in training set
Top 10%
0.7%
30
International Journal of Molecular Sciences
453 papers in training set
Top 19%
0.5%