Back

AI and Hierarchical clustering techniques for accurate patient stratification

Diaz Ochoa, J. G.; Puskaric, M.; Layer, N.; Jensch, A.; Knott, M.; Krohn, A.

2026-03-15 health informatics
10.64898/2026.03.13.26348331 medRxiv
Show abstract

Graph-based methods for data representation and analysis are well suited for encoding both data points and their interrelationships. This approach integrates data and topology, enabling the representation of interrelated information. In this study, we represent patient cohorts as cohort graphs and discuss their application for real-world patient data. We particularly focus on developing methods to cluster patients with similar symptoms and examine how bias parameters (such as sex and age group) influence interlinking within CGs, thereby improving results for accurate patient stratification and personalized decision-making in a clinical context. In particular we illustrate how by considering sex and age groups we can improve the symptom-clustering of a patient population with lung and gastro-intestinal cancer. Finally, we discuss the essential role of high-performance computing (HPC) in upscaling analytical methods for CGs.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Scientific Reports
3102 papers in training set
Top 3%
14.1%
2
PLOS ONE
4510 papers in training set
Top 20%
9.9%
3
PLOS Computational Biology
1633 papers in training set
Top 4%
8.3%
4
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 0.1%
4.8%
5
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.6%
4.8%
6
Bioinformatics
1061 papers in training set
Top 5%
4.1%
7
Computers in Biology and Medicine
120 papers in training set
Top 0.7%
3.9%
8
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 0.5%
3.5%
50% of probability mass above
9
GigaScience
172 papers in training set
Top 0.6%
3.5%
10
BMC Bioinformatics
383 papers in training set
Top 3%
3.5%
11
SoftwareX
15 papers in training set
Top 0.1%
3.0%
12
Physical Biology
43 papers in training set
Top 0.8%
2.0%
13
iScience
1063 papers in training set
Top 12%
1.9%
14
Artificial Intelligence in Medicine
15 papers in training set
Top 0.3%
1.7%
15
Frontiers in Physiology
93 papers in training set
Top 3%
1.6%
16
Journal of Biomedical Informatics
45 papers in training set
Top 0.9%
1.5%
17
Journal of Medical Internet Research
85 papers in training set
Top 3%
1.5%
18
Frontiers in Microbiology
375 papers in training set
Top 7%
1.2%
19
Frontiers in Bioinformatics
45 papers in training set
Top 0.5%
1.2%
20
Communications Biology
886 papers in training set
Top 16%
1.1%
21
Expert Systems with Applications
11 papers in training set
Top 0.3%
0.9%
22
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.5%
0.9%
23
Patterns
70 papers in training set
Top 2%
0.9%
24
JAMIA Open
37 papers in training set
Top 2%
0.7%
25
Mathematics
11 papers in training set
Top 0.5%
0.7%
26
Nature Communications
4913 papers in training set
Top 64%
0.7%
27
Biology Methods and Protocols
53 papers in training set
Top 3%
0.6%
28
BMC Medical Research Methodology
43 papers in training set
Top 2%
0.6%
29
PLOS Digital Health
91 papers in training set
Top 3%
0.6%
30
JCO Clinical Cancer Informatics
18 papers in training set
Top 1%
0.6%