Back

A Rule-Based Machine Learning Model for Predicting Virological Failure Among Children Living With HIV in Malawi

Chiphe, C.

2026-03-10 hiv aids
10.64898/2026.03.09.26347945 medRxiv
Show abstract

Malawis HIV treatment monitoring system faces serious challenges because of a shortage of experts and reliance on viral load testing every 3 to 12 months. The process causes dangerous delays in identifying treatment failure. This leads to a higher risk of disease progression, transmission, and death. To tackle this issue, this study used a machine learning model based on association rules and combined it with clustering analysis to create a machine learning framework to identify key factors and risk profiles for virological failure among children living with HIV (CLHIV) in Malawi. The methodology combines a Random Forest classifier for feature importance, association rule mining to find predictive rules, and k-Prototype clustering for risk profiling among CLHIV. The random forest feature importance results show that Body Mass Index (BMI), CD4 count, TB status, ART regimen, gender, ART adherence, and treatment duration are major drivers of virological failure. In addition to these individual factors, the analysis produced highly reliable association rules with over 90% confidence. This establishes a framework for identifying complex risk profiles and informing focused clinical interventions. The high lift values of 4.9 across the most significant rules demonstrate the models effectiveness by revealing strong, non-random associations. Clustering analysis also identified two distinct risk profiles associated with virological failure. The k-prototype clustering model performed optimally with a cluster purity of 100% and a silhouette score of 79%.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 3%
33.0%
2
International Journal of Medical Informatics
25 papers in training set
Top 0.1%
8.4%
3
PLOS Global Public Health
293 papers in training set
Top 1%
6.8%
4
PLOS Computational Biology
1633 papers in training set
Top 5%
6.8%
50% of probability mass above
5
Heliyon
146 papers in training set
Top 0.2%
4.2%
6
Infectious Diseases of Poverty
10 papers in training set
Top 0.1%
2.1%
7
Epidemics
104 papers in training set
Top 0.7%
2.1%
8
Frontiers in Microbiology
375 papers in training set
Top 4%
2.1%
9
Tropical Medicine & International Health
15 papers in training set
Top 0.2%
1.9%
10
Journal of The Royal Society Interface
189 papers in training set
Top 2%
1.7%
11
BMC Infectious Diseases
118 papers in training set
Top 3%
1.7%
12
Journal of the International AIDS Society
20 papers in training set
Top 0.2%
1.7%
13
International Journal of Environmental Research and Public Health
124 papers in training set
Top 4%
1.7%
14
Journal of Medical Internet Research
85 papers in training set
Top 3%
1.5%
15
AIDS and Behavior
14 papers in training set
Top 0.3%
1.3%
16
Communications Biology
886 papers in training set
Top 14%
1.2%
17
Infection, Genetics and Evolution
43 papers in training set
Top 0.8%
0.9%
18
AIDS
31 papers in training set
Top 0.4%
0.8%
19
BMJ Public Health
18 papers in training set
Top 0.7%
0.7%
20
BMJ Global Health
98 papers in training set
Top 3%
0.7%
21
Journal of Medical Virology
137 papers in training set
Top 4%
0.7%
22
Infectious Disease Modelling
50 papers in training set
Top 1%
0.7%
23
Parasites & Vectors
57 papers in training set
Top 1%
0.7%
24
BMC Medicine
163 papers in training set
Top 7%
0.7%
25
International Journal of Infectious Diseases
126 papers in training set
Top 4%
0.7%
26
Clinical Chemistry
22 papers in training set
Top 0.9%
0.7%
27
BMC Public Health
147 papers in training set
Top 6%
0.6%
28
Viruses
318 papers in training set
Top 6%
0.6%
29
JAIDS Journal of Acquired Immune Deficiency Syndromes
19 papers in training set
Top 0.4%
0.6%