Back

Heterogeneous but Segmentable: A Data-Driven Approach to Modelling Long-Term Care Trajectories in Multiple Sclerosis

Vesinurm, M.; Makitie, L.; Lillrank, P.; Saarinen, L.; Torkki, P.; Laakso, S. M.; Koskinen, M.

2026-01-30 health systems and quality improvement
10.64898/2026.01.28.26345045 medRxiv
Show abstract

Managing chronic diseases with unpredictable care demand creates significant operational challenges for healthcare systems. Mapping long-term care trajectories is crucial for improving resource allocation, anticipating service needs, and designing efficient care pathways. We used a data-driven approach to map six-year care trajectories for 962 newly diagnosed multiple sclerosis patients, identify utilization clusters, and determine predictors of high utilization. We analyzed Event logs of remote, outpatient, emergency, and inpatient contacts from one year pre- to five years post-diagnosis using K-means clustering to identify utilization clusters, logistic regression to identify predictors of high utilization, and process mining to model variation between care trajectories. We identified two distinctive utilization clusters: a high-utilization cluster (14.1 % of patients) with persistently elevated annual encounter volumes across all care settings and low-utilization cluster (85.9 % of patients) with lower and declining use. Median service costs were {euro}18,736 vs. {euro}6,052 in high- and low-utilization clusters, respectively. Two or more early relapses were the strongest predictor of high utilization (OR = 6.33, 95 % CI 3.49-11.50, p < 0.001), with number of planned early remote and outpatient care contacts being also associated with future service utilization (OR = 1.07, 95 % CI 1.04-1.10, p < 0.001). High-utilization trajectories were approximately three times longer (82.4 vs 25.9 events) and more variable (3.1 vs 2.4 unique events per patient). These utilization clusters and their distinct trajectories provide a pragmatic segmentation of multiple sclerosis patients to support early identification of high-utilization subgroups and more robust capacity planning in specialist care. HighlightsO_LIWe tracked the care trajectories of 962 people with relapsing-remitting multiple sclerosis using a Finnish population-based specialist-care datalake covering both inpatient and outpatient neurology services. C_LIO_LIPatients fell into two distinct utilization clusters: a high-utilization cluster with frequent contacts across all care settings and a low-utilization cluster with lower and declining use. C_LIO_LITwo or more early relapses, and the number of early outpatient and remote contacts were strong predictors of a patients long-term affiliation in the high-utilization cluster. C_LIO_LISegmented care trajectories showed that high-utilization patients followed longer, more varied, and acute-oriented care patterns and had much higher service encounter costs. C_LIO_LIThese findings can help clinicians and managers identify potential high-utilization patients early, target resources more effectively, and plan for future healthcare demand. C_LI

Matching journals

The top 14 journals account for 50% of the predicted probability mass.

1
Annals of Neurology
57 papers in training set
Top 0.1%
8.8%
2
Scientific Reports
3102 papers in training set
Top 12%
7.1%
3
PLOS ONE
4510 papers in training set
Top 30%
5.1%
4
BMJ Open
554 papers in training set
Top 4%
5.1%
5
Communications Medicine
85 papers in training set
Top 0.1%
4.2%
6
Journal of Infection
71 papers in training set
Top 0.3%
4.2%
7
Journal of Neurology, Neurosurgery & Psychiatry
29 papers in training set
Top 0.4%
2.7%
8
Heliyon
146 papers in training set
Top 0.9%
2.2%
9
Frontiers in Bioengineering and Biotechnology
88 papers in training set
Top 1%
2.0%
10
Brain Communications
147 papers in training set
Top 1%
2.0%
11
Journal of Personalized Medicine
28 papers in training set
Top 0.2%
2.0%
12
BMC Public Health
147 papers in training set
Top 3%
1.9%
13
PLOS Neglected Tropical Diseases
378 papers in training set
Top 3%
1.8%
14
Brain
154 papers in training set
Top 3%
1.7%
50% of probability mass above
15
NeuroImage: Clinical
132 papers in training set
Top 2%
1.7%
16
BMC Infectious Diseases
118 papers in training set
Top 3%
1.6%
17
Orphanet Journal of Rare Diseases
18 papers in training set
Top 0.3%
1.6%
18
Annals of Clinical and Translational Neurology
29 papers in training set
Top 0.7%
1.4%
19
Multiple Sclerosis and Related Disorders
15 papers in training set
Top 0.1%
1.4%
20
Nature Communications
4913 papers in training set
Top 54%
1.4%
21
Multiple Sclerosis Journal
18 papers in training set
Top 0.1%
1.4%
22
PLOS Digital Health
91 papers in training set
Top 2%
1.4%
23
Frontiers in Neurology
91 papers in training set
Top 3%
1.4%
24
npj Digital Medicine
97 papers in training set
Top 3%
1.3%
25
Frontiers in Public Health
140 papers in training set
Top 6%
1.2%
26
Communications Biology
886 papers in training set
Top 15%
1.2%
27
Journal of the Neurological Sciences
17 papers in training set
Top 0.5%
1.0%
28
Royal Society Open Science
193 papers in training set
Top 4%
0.9%
29
BMC Medicine
163 papers in training set
Top 6%
0.8%
30
PLOS Biology
408 papers in training set
Top 18%
0.8%