Generalizable Cysteine Quantification in Pea Cultivars from SERS Spectra Using AI

Gorgannejad, E.; Liu, Q.; Findlay, C.; Nadimi, M.; Chun-Te Ko, A.; Bhowmik, P.; Paliwal, J.

2026-03-24 bioengineering
10.64898/2026.03.20.713189 bioRxiv
Rapid quantification of sulfur-containing amino acids, particularly cysteine, in legumes is critical for assessing nutritional quality, supporting breeding program screening, and ensuring consistency in quality control processes. However, conventional methods, such as high-performance liquid chromatography (HPLC), are time-consuming and resource-intensive for high-throughput applications. This study evaluated artificial intelligence models for predicting cysteine concentration from surface-enhanced Raman spectroscopy (SERS) spectra of pea extracts. SERS spectra were acquired from 20 cultivars grown at three geographically distinct locations, with HPLC-measured cysteine concentrations as the ground-truth reference. Linear regression, partial least squares regression, support vector regression, random forest regression, and a one-dimensional convolutional neural network (1D-CNN) were compared using within-cultivar splits and leave-one-cultivar-out (LOCO) evaluation. The 1D-CNN achieved an RMSE of 0.008 g/100 g within cultivars and maintained performance under LOCO, while other models showed limited generalization. Shapley Additive Explanations highlighted informative bands in the 630-760 cm⁻¹ range, and noise modeling optimized scan-count selection.
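
The leave-one-cultivar-out (LOCO) evaluation mentioned in the abstract can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the cultivar labels and cysteine values are made up, the helper names `leave_one_group_out` and `rmse` are hypothetical, and a mean-of-training-targets baseline stands in for the regression models. The point is only that every sample of one cultivar is held out together, so the score reflects performance on a cultivar the model has never seen.

```python
import math
from statistics import mean


def rmse(y_true, y_pred):
    """Root-mean-square error between two equal-length sequences."""
    return math.sqrt(mean((t - p) ** 2 for t, p in zip(y_true, y_pred)))


def leave_one_group_out(groups):
    """Yield (held_out_group, train_indices, test_indices) for each distinct group."""
    for held_out in sorted(set(groups)):
        train = [i for i, g in enumerate(groups) if g != held_out]
        test = [i for i, g in enumerate(groups) if g == held_out]
        yield held_out, train, test


# Toy data, invented for illustration only (not values from the study).
cultivars = ["A", "A", "B", "B", "C", "C"]
cysteine = [0.10, 0.12, 0.20, 0.22, 0.15, 0.17]  # g/100 g

loco_scores = {}
for held_out, train, test in leave_one_group_out(cultivars):
    # Placeholder model: predict the mean of the training targets.
    # A real run would fit a regressor on the training spectra here.
    baseline = mean(cysteine[i] for i in train)
    preds = [baseline] * len(test)
    loco_scores[held_out] = rmse([cysteine[i] for i in test], preds)

for name, score in loco_scores.items():
    print(f"held-out cultivar {name}: RMSE = {score:.3f} g/100 g")
```

With scikit-learn available, `sklearn.model_selection.LeaveOneGroupOut` produces the same kind of splits when the cultivar labels are passed as `groups`.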

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

| Rank | Journal | Papers in training set | Percentile | Probability |
|---|---|---|---|---|
| 1 | Frontiers in Plant Science | 240 | Top 0.7% | 12.3% |
| 2 | Scientific Reports | 3102 | Top 7% | 10.1% |
| 3 | Advanced Science | 249 | Top 1% | 10.1% |
| 4 | PLOS ONE | 4510 | Top 28% | 6.3% |
| 5 | Water Research | 74 | Top 0.6% | 3.6% |
| 6 | Biosensors and Bioelectronics | 52 | Top 0.4% | 3.6% |
| 7 | Computational and Structural Biotechnology Journal | 216 | Top 2% | 3.6% |
| 8 | Chemical Engineering Journal | 10 | Top 0.2% | 2.1% |
| | 50% of probability mass above | | | |
| 9 | ACS Sensors | 45 | Top 0.6% | 2.1% |
| 10 | Analytical Chemistry | 205 | Top 1% | 1.7% |
| 11 | Communications Biology | 886 | Top 9% | 1.7% |
| 12 | Analytica Chimica Acta | 17 | Top 0.3% | 1.7% |
| 13 | Nature Communications | 4913 | Top 53% | 1.5% |
| 14 | Nano Letters | 63 | Top 2% | 1.5% |
| 15 | The Analyst | 15 | Top 0.3% | 1.5% |
| 16 | Plant Phenomics | 17 | Top 0.2% | 1.2% |
| 17 | New Phytologist | 309 | Top 4% | 1.2% |
| 18 | ACS Omega | 90 | Top 3% | 1.2% |
| 19 | Journal of Experimental Botany | 195 | Top 2% | 1.2% |
| 20 | Sensors | 39 | Top 1% | 1.2% |
| 21 | International Journal of Biological Macromolecules | 65 | Top 3% | 0.9% |
| 22 | Journal of Agricultural and Food Chemistry | 14 | Top 1% | 0.9% |
| 23 | eLife | 5422 | Top 53% | 0.9% |
| 24 | Environmental Science & Technology | 64 | Top 2% | 0.8% |
| 25 | Plant Methods | 39 | Top 0.7% | 0.7% |
| 26 | Plant Physiology | 217 | Top 3% | 0.7% |
| 27 | Scientific Data | 174 | Top 2% | 0.7% |
| 28 | Journal of Natural Products | 11 | Top 0.3% | 0.7% |
| 29 | Molecules | 37 | Top 2% | 0.7% |
| 30 | Journal of Biomedical Optics | 25 | Top 0.7% | 0.7% |