Back

Breath volatile profiling reveals a diagnostic signature of MASLD in children

Berna, A. Z.; Panganiban, J.; Liu, Y.; Logan, J.; Russo, P.; Aryal, A.; Hafertepe, K.; Abu-Alreesh, S.; DeBosch, B.; Stoll, J.; John, A. R. O.

2026-05-27 gastroenterology
10.64898/2026.05.26.26353794 medRxiv
Show abstract

Background & Aims: Metabolic Dysfunction Associated Steatotic Liver Disease (MASLD) is the leading cause of chronic liver disease in children. However, accurate, noninvasive diagnostic tools remain limited. Current screening methods are invasive or lack sensitivity. Breath-based volatile organic compound (VOC) analysis offers a simple approach with potential for point of care screening. This study aimed to identify and validate breath VOC signatures of pediatric MASLD. Approach & Results: We conducted a prospective IRB approved cohort study at the Childrens Hospital of Philadelphia (CHOP). Children aged between 7 and 20 years with MASLD (n=22), as defined by hepatic steatosis either by liver biopsy or imaging and 1 cardiometabolic risk factor, and a control group without MASLD (n=20) were enrolled. Breath samples were collected using a standardized protocol and analyzed by untargeted comprehensive two-dimensional gas chromatography-mass spectrometry (GCGCMS). Machine learning and unsupervised clustering were applied to identify discriminatory VOCs and assess heterogeneity. Untargeted GCGCMS analysis identified a distinct breath VOC signature in children with MASLD compared with non MASLD controls. A Random Forest model achieved a sensitivity of 73% and specificity of 65%, with AUC of 0.84. The VOC 2,4-dimethyl-1-heptene demonstrated strong diagnostic performance in the discovery cohort with a sensitivity of 85%, specificity of 77% and an AUC of 0.81. Unsupervised clustering revealed four MASLD subgroups with distinct volatile phenotypes associated with differences in liver enzymes and metabolic parameters. External validation in a second pediatric cohort confirmed reproducible reductions in o/p-xylene in subjects with MASLD. Conclusions: Pediatric MASLD is associated with a reproducible breath VOC signature identified by untargeted GCGCMS. These findings support breath analysis as a scalable, noninvasive screening and stratification tool for pediatric MASLD and warrant validation in larger, longitudinal studies.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Metabolomics
11 papers in training set
Top 0.1%
15.5%
2
Metabolites
50 papers in training set
Top 0.1%
13.4%
3
Scientific Reports
3102 papers in training set
Top 4%
11.0%
4
eBioMedicine
130 papers in training set
Top 0.2%
4.2%
5
Microbiology Spectrum
435 papers in training set
Top 1%
3.0%
6
PLOS ONE
4510 papers in training set
Top 43%
2.9%
7
Clinical and Translational Science
21 papers in training set
Top 0.3%
2.2%
50% of probability mass above
8
Environment International
42 papers in training set
Top 0.6%
2.0%
9
Frontiers in Medicine
113 papers in training set
Top 3%
1.9%
10
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 3%
1.9%
11
American Journal of Gastroenterology
15 papers in training set
Top 0.2%
1.8%
12
Hepatology Communications
21 papers in training set
Top 0.2%
1.8%
13
Med
38 papers in training set
Top 0.3%
1.6%
14
BMC Medicine
163 papers in training set
Top 4%
1.6%
15
Cellular and Molecular Gastroenterology and Hepatology
41 papers in training set
Top 0.4%
1.3%
16
The Journal of Clinical Endocrinology & Metabolism
35 papers in training set
Top 0.9%
1.2%
17
Hepatology
18 papers in training set
Top 0.3%
1.0%
18
mSystems
361 papers in training set
Top 6%
1.0%
19
Clinical Pharmacology & Therapeutics
25 papers in training set
Top 0.5%
1.0%
20
Pediatric Research
18 papers in training set
Top 0.3%
0.9%
21
Immunology & Cell Biology
11 papers in training set
Top 0.2%
0.9%
22
Viruses
318 papers in training set
Top 4%
0.8%
23
Bioengineering & Translational Medicine
21 papers in training set
Top 0.7%
0.8%
24
Biomedicines
66 papers in training set
Top 3%
0.8%
25
Cell Reports Medicine
140 papers in training set
Top 7%
0.8%
26
Nature Communications
4913 papers in training set
Top 62%
0.8%
27
Gastroenterology
40 papers in training set
Top 2%
0.8%
28
Biomolecules
95 papers in training set
Top 2%
0.8%
29
ERJ Open Research
44 papers in training set
Top 0.8%
0.8%
30
Journal of Lipid Research
35 papers in training set
Top 0.5%
0.8%