Back

Identification and Developmental Analysis of the Facial Characteristics Associated with Sickle Cell Disease using Machine Learning

Spencer, D.; Liu, X.; Mosema-Be-Amoti, K.; Kandosi, G.; Bramble, M. S.; Munajjed, F. A.; Likuba, E.; Okitundu-Luwa E-Andjafono, D.; Tshibambe, L.; Colwell, B.; Howell, K.; O'Brien, N.; Moxon, C.; Anwar, S. M.; Porras, A. R.; Ngoyi, D. M.; Vilain, E.; Tshala-Katumbay, D.; Linguraru, M. G.

2026-03-10 public and global health
10.64898/2026.03.03.26346563 medRxiv
Show abstract

BackgroundSickle cell disease (SCD) is a common inherited genetic disorder and contributor to global childhood mortality and morbidity. In the Democratic Republic of the Congo, nearly 40,000 newborns, approximately 2% of all newborns, are estimated to be affected each year. Despite progress in the treatment and care of the disorder, its detection and management in lower-resource settings remain challenging. MethodsWe collected 308 front facing photos of patients and their age-and sex-matched controls aged from 5 months to 19 years in the Democratic Republic of the Congo. Facial features were extracted and categorized into geometric and texture-based descriptors. A support vector machine ranked features according to their relevance for distinguishing SCD patients from controls. ResultsThe facial analysis algorithm identified eight geometric and six texture discriminative features that were significantly different between the cohorts. An explainable machine learning model identified sickle cell disease with 79.5% accuracy using a combination of six geometric features: distance between medial and lateral canthi, angle at nasal ala, distance from nasion to philtrum, distance from medial canthi to the columella, distance from columella to the lower lip, and distance between nasal alae. SCD related features were identified to become increasingly discriminative with age. ConclusionThese findings demonstrate the potential machine learning based methodologies to be leveraged to inform point-of-care tools in the screening and management of sickle cell disease. The discriminative facial features identified here may provide further opportunities into Artificial-Intelligence based diagnostics and personalized care strategies of sickle cell disease.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 7%
22.7%
2
Scientific Reports
3102 papers in training set
Top 6%
10.2%
3
Frontiers in Medicine
113 papers in training set
Top 0.6%
6.4%
4
Orphanet Journal of Rare Diseases
18 papers in training set
Top 0.1%
4.9%
5
BMC Medical Genomics
36 papers in training set
Top 0.1%
3.6%
6
Frontiers in Psychiatry
83 papers in training set
Top 1%
3.6%
50% of probability mass above
7
Frontiers in Public Health
140 papers in training set
Top 3%
2.7%
8
PLOS Neglected Tropical Diseases
378 papers in training set
Top 3%
2.4%
9
JMIR Formative Research
32 papers in training set
Top 0.6%
1.9%
10
npj Genomic Medicine
33 papers in training set
Top 0.4%
1.7%
11
Cureus
67 papers in training set
Top 3%
1.7%
12
The American Journal of Tropical Medicine and Hygiene
60 papers in training set
Top 2%
1.7%
13
BMJ Open
554 papers in training set
Top 9%
1.7%
14
Frontiers in Physiology
93 papers in training set
Top 4%
1.3%
15
Journal of Medical Internet Research
85 papers in training set
Top 3%
1.3%
16
PLOS Computational Biology
1633 papers in training set
Top 18%
1.3%
17
PeerJ
261 papers in training set
Top 10%
1.2%
18
Heliyon
146 papers in training set
Top 4%
1.1%
19
Frontiers in Neurology
91 papers in training set
Top 4%
1.0%
20
Journal of Infection
71 papers in training set
Top 2%
0.9%
21
American Journal of Medical Genetics Part A
17 papers in training set
Top 0.3%
0.9%
22
JMIRx Med
31 papers in training set
Top 2%
0.8%
23
Database
51 papers in training set
Top 0.8%
0.8%
24
Clinical Chemistry
22 papers in training set
Top 0.7%
0.8%
25
BioTechniques
24 papers in training set
Top 0.2%
0.8%
26
Journal of Stroke and Cerebrovascular Diseases
12 papers in training set
Top 0.5%
0.8%
27
IEEE Transactions on Biomedical Engineering
38 papers in training set
Top 0.8%
0.8%
28
Diagnostics
48 papers in training set
Top 2%
0.8%
29
Applied Sciences
24 papers in training set
Top 0.9%
0.8%
30
PLOS Digital Health
91 papers in training set
Top 3%
0.8%