Auditable cross-instrument detection of unusual multivariate psychiatric response configurations using a semantically aligned covariance subspace

Periwal, V.

2026-05-27 psychiatry and clinical psychology
10.64898/2026.05.22.26353902 medRxiv
Background: Conventional psychiatric screening instruments summarize symptoms within individual scales and prioritize cases with high additive scores on a single instrument. This design treats items as independent within instruments and ignores cross-instrument covariance structure, making it insensitive to respondents whose responses are distributed across multiple domains in unusual combinations that remain below threshold on every individual scale.

Methods: We analyzed two cohorts spanning older and younger adults. Item prompts from depression, stress, anxiety, and sleep instruments were embedded into a shared semantic space using a pretrained sentence encoder. Principal component analysis of the item-prompt embeddings alone (with no use of respondent data at this stage) was used to construct a low-dimensional subspace retaining 80% of the variance in the item embedding matrix. Normalized participant responses were then projected into this subspace, with Jaccard-based stability analysis used as a check on dimensional robustness. Multivariate deviation from the cohort norm was quantified with Mahalanobis distance using Ledoit-Wolf covariance regularization. Candidate outliers were defined as respondents above the empirical 95th percentile of the cohort-specific distance distribution. To isolate response configurations not already captured by conventional single-instrument extreme-value logic, we excluded all outlier respondents who had endorsed any individual item at the maximum value of its Likert scale on any instrument. For the remaining outliers, anomalous components were backtracked to their original item loadings for interpretation.

Results: In the older-adult Health and Retirement Study (HRS) cohort, principal component analysis of 27 item-prompt embeddings showed that a 10-dimensional subspace provided a stable representation of cross-instrument semantic structure. In the younger-adult Xinxiang cohort, the corresponding stable solution was 16-dimensional. In each cohort, seven respondents remained as multivariate outliers despite falling below every single-instrument extreme-value threshold. These cases were characterized not by uniformly severe symptom scores but by unusual cross-domain response configurations that became visible only in the shared semantic covariance subspace. The response structure of the retained configurations differed across cohorts: older-adult cases more often involved weak endorsement of mood-labeled items alongside nonzero body- and sleep-related responses, whereas younger-adult cases more often involved incomplete response configurations spanning mood, sleep, stress, and self-harm-related items.

Conclusions: A semantically aligned, auditable covariance subspace provides a practical tool for flagging unusual multivariate response configurations that single-instrument additive screening may not flag. The method is interpretable at the level of original item contributions. It should be understood as a hypothesis-generating screen for unusual response configurations requiring further clinical assessment, not as a diagnostic instrument. Outcome validity remains to be established by prospective study.
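
The pipeline described in the Methods can be sketched in a few lines of NumPy. This is an illustrative reading of the abstract, not the author's code: the item embeddings and responses are synthetic placeholders, the per-item z-scoring, the 0-3 Likert ceiling, and the use of left singular vectors as the item-space basis are assumptions, and the names `semantic_basis`, `ledoit_wolf_cov`, `mahalanobis`, and `top_items` are hypothetical. The Jaccard-based stability check is omitted.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic placeholders (NOT study data): 27 items, 64-d "embeddings", 400 respondents.
n_items, embed_dim, n_resp = 27, 64, 400
item_emb = rng.normal(size=(n_items, embed_dim))  # stand-in for sentence-encoder output
item_max = np.full(n_items, 3)                    # assumed 0-3 Likert ceiling per item
responses = rng.choice(4, size=(n_resp, n_items), p=[0.60, 0.30, 0.09, 0.01])


def semantic_basis(emb, var_target=0.80):
    """Item-space basis from PCA of the item embeddings only (no respondent data)."""
    centered = emb - emb.mean(axis=0)
    U, s, _ = np.linalg.svd(centered, full_matrices=False)
    cum_var = np.cumsum(s**2) / np.sum(s**2)
    k = int(np.searchsorted(cum_var, var_target)) + 1  # smallest k reaching the target
    return U[:, :k]                                    # shape (n_items, k), orthonormal columns


def ledoit_wolf_cov(Z):
    """Ledoit-Wolf shrinkage of the sample covariance toward a scaled identity."""
    n, p = Z.shape
    Zc = Z - Z.mean(axis=0)
    S = Zc.T @ Zc / n
    m = np.trace(S) / p
    d2 = np.sum((S - m * np.eye(p)) ** 2) / p
    row_sq = np.sum(Zc**2, axis=1)                     # squared norm of each centered row
    b2 = min((np.sum(row_sq**2) / n - np.sum(S**2)) / (n * p), d2)
    shrink = 0.0 if d2 == 0 else b2 / d2
    return (1 - shrink) * S + shrink * m * np.eye(p), shrink


def mahalanobis(Z, cov):
    """Mahalanobis distance of each row of Z from the cohort mean."""
    diff = Z - Z.mean(axis=0)
    solved = np.linalg.solve(cov, diff.T).T
    return np.sqrt(np.einsum("ij,ij->i", diff, solved))


basis = semantic_basis(item_emb)

# Assumed normalization: z-score each item across respondents.
sd = responses.std(axis=0)
normed = (responses - responses.mean(axis=0)) / np.where(sd == 0, 1.0, sd)
scores = normed @ basis                                # respondents in the semantic subspace

cov, shrinkage = ledoit_wolf_cov(scores)
dist = mahalanobis(scores, cov)
flagged = dist > np.percentile(dist, 95)               # empirical 95th-percentile cutoff

# Drop flagged respondents who endorsed any item at its scale maximum.
hit_ceiling = (responses == item_max).any(axis=1)
retained = np.flatnonzero(flagged & ~hit_ceiling)


def top_items(resp_idx, n_top=3):
    """Backtrack: most deviant component, then the items contributing most to it."""
    comp_z = (scores[resp_idx] - scores.mean(axis=0)) / np.sqrt(np.diag(cov))
    comp = int(np.argmax(np.abs(comp_z)))
    contrib = normed[resp_idx] * basis[:, comp]        # per-item terms of that component score
    return comp, np.argsort(-np.abs(contrib))[:n_top]


for idx in retained:
    comp, items = top_items(idx)
    print(f"respondent {idx}: component {comp}, top items {items.tolist()}")
```

Because the basis is built from item text alone, it can be fixed and inspected before any respondent data are seen, which is what makes each flagged case traceable back to specific items. In practice `item_emb` would come from a sentence encoder applied to the actual item prompts.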

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

Rank. Journal; papers in training set; percentile; predicted probability

1. Translational Psychiatry; 219 papers in training set; Top 0.4%; 12.3%
2. npj Digital Medicine; 97 papers in training set; Top 0.7%; 8.2%
3. Journal of Affective Disorders; 81 papers in training set; Top 0.3%; 6.8%
4. Journal of Medical Internet Research; 85 papers in training set; Top 0.6%; 6.8%
5. Frontiers in Psychiatry; 83 papers in training set; Top 0.5%; 6.4%
6. Scientific Reports; 3102 papers in training set; Top 18%; 6.4%
7. Psychiatry Research; 35 papers in training set; Top 0.4%; 4.0%
(50% of probability mass above)
8. PLOS ONE; 4510 papers in training set; Top 39%; 3.6%
9. European Psychiatry; 10 papers in training set; Top 0.2%; 2.6%
10. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging; 62 papers in training set; Top 0.7%; 2.1%
11. Frontiers in Artificial Intelligence; 18 papers in training set; Top 0.2%; 1.9%
12. BJPsych Open; 25 papers in training set; Top 0.4%; 1.7%
13. Frontiers in Digital Health; 20 papers in training set; Top 0.8%; 1.5%
14. Journal of Translational Medicine; 46 papers in training set; Top 1%; 1.3%
15. BMC Medical Informatics and Decision Making; 39 papers in training set; Top 2%; 1.2%
16. Acta Psychiatrica Scandinavica; 10 papers in training set; Top 0.3%; 1.1%
17. JAMA Network Open; 127 papers in training set; Top 3%; 0.9%
18. Acta Neuropsychiatrica; 12 papers in training set; Top 0.7%; 0.9%
19. Communications Psychology; 20 papers in training set; Top 0.2%; 0.9%
20. Computational Psychiatry; 12 papers in training set; Top 0.1%; 0.9%
21. JMIR mHealth and uHealth; 10 papers in training set; Top 0.4%; 0.9%
22. Psychological Medicine; 74 papers in training set; Top 1%; 0.9%
23. Human Brain Mapping; 295 papers in training set; Top 4%; 0.8%
24. Schizophrenia Research; 29 papers in training set; Top 0.6%; 0.7%
25. International Journal of Environmental Research and Public Health; 124 papers in training set; Top 7%; 0.7%
26. Biological Psychiatry; 119 papers in training set; Top 2%; 0.7%
27. JMIR Research Protocols; 18 papers in training set; Top 2%; 0.7%
28. Psychiatry and Clinical Neurosciences; 11 papers in training set; Top 0.4%; 0.7%
29. Alzheimer's & Dementia: Diagnosis, Assessment & Disease Monitoring; 38 papers in training set; Top 1%; 0.7%
30. Journal of Psychiatric Research; 28 papers in training set; Top 0.8%; 0.7%
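
The position of the 50% cutoff in the list can be checked by accumulating the listed percentages in rank order. The short sketch below does this for the first ten entries; it assumes the final percentage on each entry is that journal's predicted probability, and the variable names are illustrative.

```python
from itertools import accumulate

# Predicted probabilities (%) for ranks 1-10, copied from the list above.
probs = [12.3, 8.2, 6.8, 6.8, 6.4, 6.4, 4.0, 3.6, 2.6, 2.1]

cumulative = list(accumulate(probs))

# First rank at which the running total reaches 50%.
cutoff_rank = next(rank for rank, total in enumerate(cumulative, start=1) if total >= 50.0)

print(cutoff_rank, round(cumulative[cutoff_rank - 1], 1))
```

The running total is about 46.9% after six journals and about 50.9% after seven, which matches the stated cutoff.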