
Deep Learning-Based Multimodal Clustering Model for Endotyping and Post-Arthroplasty Response Classification using Knee Osteoarthritis Subject-Matched Multi-Omic Data

Rockel, J. S.; Sharma, D.; Espin-Garcia, O.; Hueniken, K.; Sandhu, A.; Pastrello, C.; Sundararajan, K.; Potla, P.; Fine, N.; Lively, S. S.; Perry, K.; Mahomed, N. N.; Syed, K.; Jurisica, I.; Perruccio, A. V.; Rampersaud, Y. R.; Gandhi, R.; Kapoor, M.

2024-06-13 orthopedics
10.1101/2024.06.13.24308857 medRxiv

Background: Primary knee osteoarthritis (KOA) is a heterogeneous disease with clinical and molecular contributors. Biofluids contain microRNAs and metabolites that can be measured by omic technologies. Deep learning captures complex non-linear associations within multimodal data but, to date, has not been used for multi-omic-based endotyping of KOA patients. We developed a novel multimodal deep learning framework for clustering of multi-omic data from three subject-matched biofluids to identify distinct KOA endotypes and classify one-year post-total knee arthroplasty (TKA) pain/function responses.

Materials and Methods: In 414 KOA patients, subject-matched plasma, synovial fluid and urine were analyzed by microRNA sequencing or metabolomics. Integrating 4 high-dimensional datasets comprising metabolites from plasma (n=151 features), along with microRNAs from plasma (n=421), synovial fluid (n=930), or urine (n=1225), a multimodal deep learning variational autoencoder architecture with K-means clustering was employed. Features influencing cluster assignment were identified and pathway analyses conducted. An integrative machine learning framework combining 4 molecular domains and a clinical domain was then used to classify WOMAC pain/function responses post-TKA within each cluster.

Findings: Multimodal deep learning-based clustering of subjects across 4 domains yielded 3 distinct patient clusters. Feature signatures comprising microRNAs and metabolites across biofluids included 30, 16, and 24 features associated with Clusters 1-3, respectively. Pathway analyses revealed distinct pathways associated with each cluster. Integration of 4 multi-omic domains along with clinical data improved response classification performance, with Cluster 3 achieving AUC=0.879 for subject pain response classification and Cluster 2 reaching AUC=0.808 for subject function response, surpassing individual domain classifications by 12% and 15% respectively.

Interpretation: We have developed a deep learning-based multimodal clustering model capable of integrating complex multi-fluid, multi-omic data to assist in KOA patient endotyping and test outcome response to TKA surgery.

Funding: Canada Research Chairs Program, Tony and Shari Fell Chair, Campaign to Cure Arthritis, University Health Network Foundation.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

Rank | Journal | Papers in training set | Percentile | Probability
1 | Arthritis & Rheumatology | 33 | Top 0.1% | 29.3%
2 | Osteoarthritis and Cartilage | 30 | Top 0.1% | 8.9%
3 | BMC Medicine | 163 | Top 0.3% | 7.6%
4 | Computational and Structural Biotechnology Journal | 216 | Top 0.3% | 7.2%
(50% of probability mass above)
5 | Scientific Reports | 3102 | Top 15% | 6.7%
6 | Arthritis Research & Therapy | 15 | Top 0.1% | 3.9%
7 | npj Digital Medicine | 97 | Top 1% | 2.9%
8 | PLOS ONE | 4510 | Top 44% | 2.8%
9 | JAMIA Open | 37 | Top 0.6% | 2.2%
10 | Nature Communications | 4913 | Top 48% | 2.0%
11 | eBioMedicine | 130 | Top 0.8% | 2.0%
12 | JMIR Medical Informatics | 17 | Top 0.9% | 1.4%
13 | eLife | 5422 | Top 48% | 1.3%
14 | Annals of the Rheumatic Diseases | 32 | Top 0.5% | 1.3%
15 | BMJ Open | 554 | Top 11% | 1.2%
16 | Journal of Orthopaedic Research | 19 | Top 0.2% | 1.0%
17 | Science Translational Medicine | 111 | Top 4% | 1.0%
18 | JCI Insight | 241 | Top 6% | 0.8%
19 | Journal of Translational Medicine | 46 | Top 2% | 0.8%
20 | Science Advances | 1098 | Top 28% | 0.8%
21 | RMD Open | 13 | Top 0.3% | 0.8%
22 | Advanced Science | 249 | Top 23% | 0.5%
23 | Applied Sciences | 24 | Top 1% | 0.5%
24 | Trials | 25 | Top 2% | 0.5%
25 | PLOS Genetics | 756 | Top 18% | 0.5%
26 | PLOS Computational Biology | 1633 | Top 28% | 0.5%
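
The statement that the top 4 journals cover 50% of the predicted probability mass can be checked by accumulating the listed probabilities in rank order. The following is a minimal sketch of that arithmetic, not the site's own method: the function name `journals_to_reach` and the 50% threshold parameter are illustrative assumptions, and the values are copied from the ranking above.

```python
from itertools import accumulate

# Predicted probabilities (percent) copied from the ranking, in rank order.
probabilities = [
    29.3, 8.9, 7.6, 7.2, 6.7, 3.9, 2.9, 2.8, 2.2, 2.0, 2.0, 1.4, 1.3,
    1.3, 1.2, 1.0, 1.0, 0.8, 0.8, 0.8, 0.8, 0.5, 0.5, 0.5, 0.5, 0.5,
]


def journals_to_reach(probs, threshold=50.0):
    """Return (count, running_total) for the first rank at which the
    cumulative probability reaches the threshold (hypothetical helper)."""
    for count, total in enumerate(accumulate(probs), start=1):
        if total >= threshold:
            return count, total
    raise ValueError("threshold is never reached by the listed probabilities")


count, total = journals_to_reach(probabilities)
print(f"{count} journals reach the threshold (cumulative {total:.1f}%)")
```

With the listed values, the running total after three journals is 45.8% and after four is 53.0%, which matches the position of the "50% of probability mass above" marker.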