Back

Uncovering Xenobiotics in the Dark Metabolome using Ion Mobility Spectrometry, Mass Defect Analysis and Machine Learning

Foster, M.; Rainey, M.; Watson, C.; Dodds, J. N.; Fernandez, F.; Baker, E.

2021-11-21 biochemistry
10.1101/2021.11.21.469449 bioRxiv
Show abstract

The identification of xenobiotics in nontargeted metabolomic analyses is a vital step in understanding human exposure. Xenobiotic metabolism, excretion, and co-existence with other endogenous molecules however greatly complicate nontargeted studies. While mass spectrometry (MS)-based platforms are commonly used in metabolomic measurements, deconvoluting endogenous metabolites and xenobiotics is often challenged by the lack of xenobiotic parent and metabolite standards as well as the numerous isomers possible for each small molecule m/z feature. Here, we evaluate the use of ion mobility spectrometry coupled with MS (IMS-MS) and mass defect filtering in a xenobiotic structural annotation workflow to reduce large metabolomic feature lists and uncover potential xenobiotic classes and species detected in the metabolomic studies. To evaluate the workflow, xenobiotics having known high toxicities including per- and polyfluoroalkyl substances (PFAS), polycyclic aromatic hydrocarbons (PAHs), polychlorinated biphenyls (PCBs) and polybrominated diphenyl ethers (PBDEs) were examined. Initially, to address the lack of available IMS collision cross section (CCS) values for per- and polyfluoroalkyl substances (PFAS), 88 PFAS standards were evaluated with IMS-MS to both develop a targeted PFAS CCS library and for use in machine learning predictions. The CCS values for biomolecules and xenobiotics were then plotted versus m/z, clearly distinguishing the biomolecules and halogenated xenobiotics. The xenobiotic structural annotation workflow was then used to annotate potential PFAS features in NIST human serum. The workflow reduced the 2,423 detected LC-IMS-MS features to 80 possible PFAS with 17 confidently identified through targeted analyses and 48 additional features correlating with possible CompTox entries.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Environmental Science & Technology
64 papers in training set
Top 0.1%
23.6%
2
Analytica Chimica Acta
17 papers in training set
Top 0.1%
10.6%
3
Journal of the American Society for Mass Spectrometry
33 papers in training set
Top 0.1%
8.8%
4
PLOS ONE
4510 papers in training set
Top 30%
5.1%
5
Water Research
74 papers in training set
Top 0.5%
4.5%
50% of probability mass above
6
Analytical Chemistry
205 papers in training set
Top 0.7%
3.8%
7
Metabolites
50 papers in training set
Top 0.2%
3.2%
8
Scientific Reports
3102 papers in training set
Top 44%
2.7%
9
The Analyst
15 papers in training set
Top 0.1%
2.5%
10
Journal of Proteome Research
215 papers in training set
Top 1.0%
2.2%
11
Analytical and Bioanalytical Chemistry
17 papers in training set
Top 0.1%
2.2%
12
Metabolomics
11 papers in training set
Top 0.1%
1.9%
13
Nature Communications
4913 papers in training set
Top 50%
1.8%
14
Environmental Pollution
35 papers in training set
Top 2%
1.0%
15
Journal of Visualized Experiments
30 papers in training set
Top 0.6%
0.8%
16
Frontiers in Molecular Biosciences
100 papers in training set
Top 4%
0.8%
17
mSystems
361 papers in training set
Top 7%
0.8%
18
Talanta
12 papers in training set
Top 0.6%
0.8%
19
Environment International
42 papers in training set
Top 1%
0.8%
20
Communications Chemistry
39 papers in training set
Top 1.0%
0.8%
21
Journal of Agricultural and Food Chemistry
14 papers in training set
Top 1%
0.8%
22
Biosensors and Bioelectronics
52 papers in training set
Top 1%
0.8%
23
Journal of Hazardous Materials
19 papers in training set
Top 1.0%
0.7%
24
Chemistry – A European Journal
13 papers in training set
Top 0.7%
0.5%
25
Science of The Total Environment
179 papers in training set
Top 6%
0.5%
26
Biochemistry
130 papers in training set
Top 2%
0.5%
27
Bioinformatics
1061 papers in training set
Top 10%
0.5%
28
Environmental Research
46 papers in training set
Top 2%
0.5%
29
PLOS Computational Biology
1633 papers in training set
Top 28%
0.5%
30
eBioMedicine
130 papers in training set
Top 6%
0.5%