Protective and Susceptibility Clusters of Environmental Factors, Gene Expression, Antibody Responses, and Cytokines in Pediatric Atopic Dermatitis: Insights from Multi-Modal Data Integration

Zhakparov, D.; Lunjani, N.; Schmid, M.; Moriarty, K.; Roquero, D.; Dreher, A.; Heldstab, J. I.; Nadeau, K. C.; Akdis, C.; Levin, M.; Hlela, C.; Sokolowska, M.; O'Mahony, L.; Baerenfaller, K.

2026-01-13 allergy and immunology
10.64898/2026.01.10.26343854 medRxiv

Background: Atopic dermatitis (AD) is a chronic skin disease that typically occurs in early childhood. In this cross-sectional case-control study, our objective was to employ machine learning approaches to identify novel clusters of protective or susceptibility features associated with AD.

Methods and Findings: We utilised an integrated dataset comprising previously established environmental, cytokine, antibody, and gene expression data from AmaXhosa children aged 12-36 months, both healthy and with AD, living in either rural or urban settings of South Africa. The applied machine learning methods included the GeneSelectR workflow to identify a subset of relevant genes, the calculation of SHAP values to explain the machine learning output, and the use of DIABLO to integrate the datasets for a comprehensive analysis. Key findings included the identification of a protective cluster of environmental features, primarily found in the rural setting, that correlated with plasma cytokine levels and with expression of autophagy-related genes. Additionally, we identified AD susceptibility clusters in which levels of allergen-specific and total IgE antibodies correlated with the cytokines MCP-4 and TARC. Lastly, we identified an RNA-Seq feature signature specific to the disease endotype.

Conclusions: The application of various machine learning methods enabled the identification of significant factors associated with AD in a complex, multi-modal dataset, making the output explainable and potentially informing targeted interventions and improved diagnostic criteria.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1. Allergy: 23 papers in training set, Top 0.1%, 17.6%
2. Journal of Allergy and Clinical Immunology: 25 papers in training set, Top 0.1%, 14.4%
3. Immunology: 29 papers in training set, Top 0.1%, 12.8%
4. PLOS ONE: 4510 papers in training set, Top 20%, 9.2%

50% of probability mass above

5. International Journal of Medical Informatics: 25 papers in training set, Top 0.2%, 4.9%
6. Thorax: 32 papers in training set, Top 0.2%, 3.6%
7. Scientific Reports: 3102 papers in training set, Top 41%, 3.1%
8. Journal of Translational Medicine: 46 papers in training set, Top 0.4%, 2.4%
9. BMJ Open Respiratory Research: 32 papers in training set, Top 0.2%, 2.4%
10. PLOS Neglected Tropical Diseases: 378 papers in training set, Top 3%, 2.4%
11. Journal of Investigative Dermatology: 42 papers in training set, Top 0.3%, 2.1%
12. Frontiers in Plant Science: 240 papers in training set, Top 3%, 1.9%
13. Frontiers in Immunology: 586 papers in training set, Top 4%, 1.8%
14. Experimental Dermatology: 10 papers in training set, Top 0.2%, 1.5%
15. eBioMedicine: 130 papers in training set, Top 2%, 1.3%
16. European Respiratory Journal: 54 papers in training set, Top 1%, 1.2%
17. Archives of Disease in Childhood: 15 papers in training set, Top 0.3%, 1.2%
18. Nature Communications: 4913 papers in training set, Top 58%, 1.1%
19. Eye: 11 papers in training set, Top 0.3%, 1.0%
20. eLife: 5422 papers in training set, Top 52%, 1.0%
21. Frontiers in Medicine: 113 papers in training set, Top 5%, 0.9%
22. Clinical Immunology: 21 papers in training set, Top 0.7%, 0.6%