Back

Machine Learning Prediction of Disease Trajectories for Children with Juvenile Idiopathic Arthritis

Lee, S.; Davidian, M.; Natter, M. D.; Reeve, B. B.; Schanberg, L. E.; Belkin, E.; Chang, M.-L.; Kimura, Y.; Ong, M.-S.

2026-04-20 rheumatology
10.64898/2026.04.18.26351165 medRxiv
Show abstract

BackgroundDespite advances in therapy, optimal management of juvenile idiopathic arthritis (JIA) remains challenging. The ability to predict disease progression in JIA can improve personalized treatment decisions, but few reliable clinical predictors have been identified. We developed machine learning approaches to predict disease trajectories in children with JIA. MethodsUsing data from the Childhood Arthritis and Rheumatology Research Alliance (CARRA) Registry (years 2015-2024), we developed machine learning models to predict attainment of inactive disease in children with non-systemic JIA. We applied Dynamic Bayesian Networks (DBN) to model temporal dependencies and causal relationships, and Convolutional Neural Networks (CNN) to capture complex non-linear patterns. Model input included demographic factors, longitudinal clinical factors, and medication use in the preceding 12 months. FindingsA total of 8,093 participants were included. When tested on an independent test cohort, both DBN (AUC:0.76; precision:0.73; recall:0.83; F1-score:0.78; accuracy:0.71) and CNN (AUC:0.76; precision:0.71; recall:0.63; F1-score:0.67; accuracy:0.70) models achieved comparable performance in predicting inactive disease. Disease activity levels in the preceding 12 months, presence of enthesitis and uveitis were the strongest predictors. Causal relationships captured in the DBN model revealed suboptimal care patterns, likely shaped by insurance constraints and a predominantly reactive approach to JIA management. InterpretationOur study demonstrates that machine learning approaches can predict disease trajectories in JIA with good discriminative performance. Unlike prior studies that predict outcomes at single timepoints, our models are the first to predict inactive disease longitudinally. However, suboptimal care patterns in retrospective data limit models capacity to learn treatment-outcome relationships, underscoring critical opportunities to improve JIA care and the need for prospective comparative studies to better inform prediction models. FundingPatient-Centered Outcomes Research Institute (PCORI) Award (ME-2022C2-25573-IC). RESEARCH IN CONTEXT Evidence before this studyNumerous studies have sought to identify clinical predictors of JIA progression and outcomes. However, few reliable predictors have emerged and existing prediction models demonstrate limited performance. As a result, our ability to personalize treatment decisions based on individual risk of severe disease course remains limited. Added value of this studyWe developed novel machine learning models that predict individualized disease trajectories in children with polyarticular and oligoarticular JIA using data from their preceding 12-month clinical course. These models demonstrated strong discriminative performance and outperformed previously published machine learning approaches in JIA. Unlike prior studies limited to single time-point predictions, our models are the first to predict inactive disease longitudinally, enabling a patient-specific projection of disease progression over time. Importantly, our findings also bright to light patterns of suboptimal care, likely driven by insurance constraints and a reactive treatment paradigm, underscoring critical opportunities to improve JIA management. Implications of all the available evidenceOur models have the potential to support clinical decision-making by enabling early identification of children with JIA at risk for unfavorable disease trajectories. In addition, the suboptimal care patterns and systems-level barriers identified through our analyses highlight priority areas for quality improvement initiatives and policy interventions to reduce gaps in JIA care delivery.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Rheumatology
21 papers in training set
Top 0.1%
19.1%
2
Annals of the Rheumatic Diseases
32 papers in training set
Top 0.1%
19.0%
3
PLOS ONE
4510 papers in training set
Top 18%
10.3%
4
The Lancet Rheumatology
11 papers in training set
Top 0.1%
7.0%
50% of probability mass above
5
RMD Open
13 papers in training set
Top 0.1%
4.0%
6
BMJ Open
554 papers in training set
Top 5%
4.0%
7
JCI Insight
241 papers in training set
Top 1%
3.7%
8
Nature Communications
4913 papers in training set
Top 39%
3.7%
9
Arthritis & Rheumatology
33 papers in training set
Top 0.3%
1.9%
10
Scientific Reports
3102 papers in training set
Top 52%
1.9%
11
Genome Medicine
154 papers in training set
Top 4%
1.7%
12
Frontiers in Public Health
140 papers in training set
Top 5%
1.5%
13
The Journal of Infectious Diseases
182 papers in training set
Top 3%
1.5%
14
eBioMedicine
130 papers in training set
Top 2%
1.4%
15
Frontiers in Medicine
113 papers in training set
Top 5%
1.3%
16
Frontiers in Immunology
586 papers in training set
Top 6%
0.9%
17
Human Genetics and Genomics Advances
70 papers in training set
Top 0.6%
0.9%
18
Pain
70 papers in training set
Top 0.7%
0.8%
19
Vaccine
189 papers in training set
Top 2%
0.8%
20
Journal of Allergy and Clinical Immunology
25 papers in training set
Top 0.7%
0.8%
21
eClinicalMedicine
55 papers in training set
Top 2%
0.8%
22
Healthcare
16 papers in training set
Top 2%
0.7%
23
Genetics in Medicine
69 papers in training set
Top 1%
0.7%
24
Cell Reports Medicine
140 papers in training set
Top 9%
0.7%
25
PLOS Digital Health
91 papers in training set
Top 3%
0.7%
26
JAMA
17 papers in training set
Top 0.5%
0.5%
27
Patterns
70 papers in training set
Top 3%
0.5%
28
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.5%