Back

Smartphone Missingness as a Depression Biomarker: A Baseline-Controlled Re-analysis of StudentLife

Olcan, C.

2026-05-01 psychiatry and clinical psychology
10.64898/2026.04.30.26351977 medRxiv
Show abstract

BackgroundWhether gaps in smartphone passive-sensing data carry psychological signal -- beyond what a baseline self-report already provides -- is a recurring question in digital phenotyping that has rarely been tested with multiplicity control and cross-validation on the same cohort. ObjectiveTo test whether participant-level missingness in passive sensing and ecological momentary assessment (EMA) is an incremental predictor of depression beyond a single baseline self-report, in the canonical StudentLife cohort. MethodsExploratory re-analysis of the publicly available StudentLife dataset. The original 2014 publication reports n = 48 enrolled undergraduates; the public archive contains 59 distinct sensor-instrumented UIDs, but the additional 11 are sparse-data or PHQ-9-incomplete records and contribute nothing to the between-person analyses, which use only the 38 participants with paired pre-term and post-term Patient Health Questionnaire-9 (PHQ-9). We computed 89 participant-level missingness features from nine continuous sensor streams, five phone-activity logs and 27 EMA prompts, and evaluated them under leave-one-out cross-validation with nested-CV-tuned hyperparameters, cluster-bootstrap confidence intervals, an omnibus joint F-test, and Benjamini-Hochberg multiplicity control. ResultsA pre-term PHQ-9 baseline alone explained 59% of out-of-sample variance in post-term scores (n = 32; 95% cluster-bootstrap CI [0.22, 0.81]). Tuned regularized linear models trained on missingness alone reached only the cohort-mean baseline; adding missingness to pre PHQ-9 did not improve performance. The omnibus joint F-test of all nine continuous-stream missingness rates against post-term PHQ-9, adjusted for pre-term PHQ-9, was non-significant (F (9, 27) = 0.43, P = 0.91). No individual feature survived multiplicity correction. A separate within-person day-level analysis (2,186 person-days) yielded a small valence-specific prospective effect (r = +0.082, 95% CI [+0.011, +0.162]) opposite in direction to the withdrawal hypothesis. ConclusionIn this cohort, smartphone-data missingness did not add incremental predictive value beyond a single baseline PHQ-9. The analysis is exploratory and StudentLife-specific; it should not be read as evidence that missingness is never informative. Plain-language summaryMany studies use the gaps in someones smartphone data -- missing GPS readings, missed survey prompts, fewer phone interactions -- as a possible warning sign of depression. This re-analysis tested that idea on a widely used public dataset from a class of 48 college students. After accounting for each students depression score at the start of the term, the gaps in their phone data added no useful information about their depression score at the end of the term. The result is specific to this dataset and does not mean that smartphone gaps are never informative, but it shows that such claims need careful baseline comparisons. Key PointsO_ST_ABSQuestionC_ST_ABSIs smartphone-data missingness an incremental depression biomarker beyond a single baseline self-report? FindingsIn an exploratory re-analysis of the StudentLife cohort (38 participants with paired PHQ-9 scores), missingness features did not improve prediction of post-term PHQ-9 beyond pre-term PHQ-9 under leave-one-out cross-validation, nested-CV-tuned models, an omnibus joint F-test, or multiplicity-controlled univariate screens. MeaningSmartphone-data missingness should not be interpreted as a psychological signal in absence of baseline-controlled, cross-validated, multiplicity-aware evidence. The result is specific to a high-functioning undergraduate cohort over a ten-week term and requires replication.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Journal of Affective Disorders
81 papers in training set
Top 0.1%
14.3%
2
BJPsych Open
25 papers in training set
Top 0.1%
8.4%
3
JAMA Network Open
127 papers in training set
Top 0.2%
8.4%
4
Journal of Medical Internet Research
85 papers in training set
Top 0.6%
7.1%
5
Frontiers in Psychiatry
83 papers in training set
Top 0.7%
4.8%
6
npj Digital Medicine
97 papers in training set
Top 1%
3.9%
7
PLOS ONE
4510 papers in training set
Top 38%
3.7%
50% of probability mass above
8
Psychological Medicine
74 papers in training set
Top 0.6%
3.6%
9
Translational Psychiatry
219 papers in training set
Top 2%
2.7%
10
Journal of Psychiatric Research
28 papers in training set
Top 0.3%
2.6%
11
JMIR Formative Research
32 papers in training set
Top 0.5%
2.6%
12
Frontiers in Digital Health
20 papers in training set
Top 0.5%
2.1%
13
JMIR mHealth and uHealth
10 papers in training set
Top 0.2%
2.1%
14
Social Science & Medicine
15 papers in training set
Top 0.3%
2.1%
15
Brain, Behavior, and Immunity
105 papers in training set
Top 1%
1.9%
16
Social Psychiatry and Psychiatric Epidemiology
11 papers in training set
Top 0.3%
1.8%
17
BMJ Open
554 papers in training set
Top 9%
1.8%
18
European Psychiatry
10 papers in training set
Top 0.4%
1.7%
19
Psychoneuroendocrinology
33 papers in training set
Top 0.2%
1.7%
20
JMIR Research Protocols
18 papers in training set
Top 1.0%
1.2%
21
Molecular Psychiatry
242 papers in training set
Top 2%
1.2%
22
Biological Psychiatry: Cognitive Neuroscience and Neuroimaging
62 papers in training set
Top 1%
0.9%
23
JAMA Psychiatry
13 papers in training set
Top 0.5%
0.9%
24
Psychiatry Research
35 papers in training set
Top 1%
0.9%
25
Epidemiology and Psychiatric Sciences
10 papers in training set
Top 0.3%
0.8%
26
BMJ Mental Health
15 papers in training set
Top 0.4%
0.8%
27
Scientific Reports
3102 papers in training set
Top 72%
0.8%
28
Journal of Neurology, Neurosurgery & Psychiatry
29 papers in training set
Top 1%
0.7%
29
Acta Neuropsychiatrica
12 papers in training set
Top 1.0%
0.7%
30
JMIR Public Health and Surveillance
45 papers in training set
Top 4%
0.7%