Back

Disentangling Symptom Heterogeneity in Large-Scale Psychiatric Text: Domain-Adapted vs. Instruction-Tuned Transformers

Varone, G.; Kumar, P.; Brown, J.; Boulila, W.

2026-02-26 psychiatry and clinical psychology
10.64898/2026.02.24.26347006 medRxiv
Show abstract

Psychiatric disorders are fundamentally challenged by symptom heterogeneity, high comorbidity, and the absence of objective biomarkers, which together result in substantial variability in clinical assessment and treatment selection. Patient-generated language captures rich information about subjective experience and symptom severity, which can be systematically encoded and analyzed using computational models, making it a scalable signal for psychiatric assessment. We compare two approaches: (i) a domain-specialized transformer fine-tuned on clinical language, based on the Bio-ClinicalBERT encoder architecture, and (ii) a large-scale instruction-tuned generalist encoder (Instructor-XL) used as a frozen feature extractor with a shallow classification head. A corpus of N = 151,228 de-identified texts was compiled from five public sources, covering four psychiatric phenotypes: anxiety, depression, schizophrenia, and suicidal intention. Models were evaluated using stratified 10-fold cross-validation with cost-sensitive training, prioritizing imbalance-aware metrics, including Macro-F1 and Matthews Correlation Coefficient (MCC), over accuracy. Bio-ClinicalBERT achieved superior overall performance (Macro-F1 = 0.78, MCC = 0.6752), indicating more reliable separation of diagnostically overlapping affective categories. In contrast, Instructor-XL achieved its highest class-specific performance for schizophrenia (F1 = 0.798). Explainability analyses suggest that the domain-specialized model places greater weight on clinically relevant terms, whereas the generalist model relies on a broader set of lexical features.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Nature Medicine
117 papers in training set
Top 0.1%
25.9%
2
npj Digital Medicine
97 papers in training set
Top 0.5%
10.2%
3
Acta Psychiatrica Scandinavica
10 papers in training set
Top 0.1%
8.5%
4
Translational Psychiatry
219 papers in training set
Top 1%
4.9%
5
Scientific Reports
3102 papers in training set
Top 29%
4.2%
50% of probability mass above
6
Schizophrenia
19 papers in training set
Top 0.1%
3.9%
7
Schizophrenia Bulletin
29 papers in training set
Top 0.3%
2.6%
8
Nature Neuroscience
216 papers in training set
Top 3%
2.1%
9
Nature Communications
4913 papers in training set
Top 47%
2.1%
10
JAMA Psychiatry
13 papers in training set
Top 0.2%
1.9%
11
Nature
575 papers in training set
Top 10%
1.9%
12
Genome Medicine
154 papers in training set
Top 4%
1.9%
13
Biological Psychiatry
119 papers in training set
Top 2%
1.7%
14
Communications Medicine
85 papers in training set
Top 0.3%
1.7%
15
Frontiers in Psychiatry
83 papers in training set
Top 2%
1.7%
16
Communications Psychology
20 papers in training set
Top 0.2%
1.1%
17
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 40%
1.0%
18
PLOS ONE
4510 papers in training set
Top 64%
0.9%
19
Science
429 papers in training set
Top 19%
0.8%
20
eBioMedicine
130 papers in training set
Top 4%
0.8%
21
Molecular Psychiatry
242 papers in training set
Top 3%
0.8%
22
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.8%
23
eLife
5422 papers in training set
Top 57%
0.8%
24
BioData Mining
15 papers in training set
Top 0.9%
0.8%
25
NeuroImage: Clinical
132 papers in training set
Top 4%
0.7%
26
BMC Medical Informatics and Decision Making
39 papers in training set
Top 3%
0.7%
27
European Journal of Human Genetics
49 papers in training set
Top 2%
0.7%
28
Nature Human Behaviour
85 papers in training set
Top 6%
0.5%
29
Journal of Biomedical Informatics
45 papers in training set
Top 2%
0.5%
30
Journal of Medical Internet Research
85 papers in training set
Top 6%
0.5%