Back

Predictive modeling for bacterial vaginosis in a Tanzanian cohort of women living with HIV

Ojo, D. P.; Gachunga, W.; Sokolik, C. C.; Parker, I. K.

2026-03-13 obstetrics and gynecology
10.64898/2026.03.11.26347251 medRxiv
Show abstract

IntroductionThe vaginal microbiome is an important factor affecting both HIV risk and reproductive health in women living with HIV, and women of African descent are disproportionately affected. Here we use 16S rRNA sequencing data to predict BV in HIV-positive women living in Africa and delineate microbial features that are important for accurate prediction in this cohort compared to two HIV negative cohorts. MethodsOur study population was comprised of a cohort of HIV-positive women living in Tanzania, and two cohorts of HIV-negative women living in the United States. Using OTU variables from data in the Tanzanian cohort, we used random forest, logistic regression, support vector machine, and multi-layer perceptron algorithms to predict BV outcome. To evaluate our models, we compared predicted BV outcome to BV outcome determined using Nugent scoring. ResultsIn evaluating model performance, all four models predicted BV outcome better for the HIV-negative cohorts than the HIV-positive cohorts. Overall, models were better at predicting negative BV outcomes for low Nugent scores than positive BV outcomes for high Nugent scores. Evaluation of significant predictor variables of BV in each cohort revealed that shared features existed between the Tanzanian and Symptomatic cohorts, but not among all three cohorts. Upon comparing performance of models in predicting BV outcome for Black women in HIV-positive and HIV-negative cohorts, we observed that all four models perform better at predicting BV in the HIV-negative cohorts. ConclusionsThe lower predictive performance observed for this Tanzanian HIV positive cohort, coupled with the difference in microbial communities important for accurate prediction suggest that distinct microbial communities in women who are living with HIV may create challenges that affect the accuracy of BV diagnosis and treatment. This study highlights the need for diagnostic tools that consider unique biological and epidemiological factors of populations to address health disparities.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 1%
38.7%
2
Scientific Reports
3102 papers in training set
Top 2%
14.7%
50% of probability mass above
3
Pathogens
53 papers in training set
Top 0.1%
6.5%
4
Clinical Infectious Diseases
231 papers in training set
Top 2%
2.4%
5
Cell Communication and Signaling
35 papers in training set
Top 0.3%
2.1%
6
BMJ Open
554 papers in training set
Top 8%
1.8%
7
Computational and Structural Biotechnology Journal
216 papers in training set
Top 4%
1.7%
8
Frontiers in Public Health
140 papers in training set
Top 5%
1.5%
9
Cureus
67 papers in training set
Top 3%
1.4%
10
Clinical Chemistry
22 papers in training set
Top 0.5%
1.3%
11
Heliyon
146 papers in training set
Top 3%
1.3%
12
PeerJ
261 papers in training set
Top 10%
1.3%
13
Frontiers in Medicine
113 papers in training set
Top 5%
1.0%
14
Nature Communications
4913 papers in training set
Top 59%
0.9%
15
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 5%
0.9%
16
BMC Biology
248 papers in training set
Top 3%
0.9%
17
BMC Infectious Diseases
118 papers in training set
Top 5%
0.8%
18
Journal of Clinical Pathology
12 papers in training set
Top 0.5%
0.8%
19
PLOS Neglected Tropical Diseases
378 papers in training set
Top 5%
0.8%
20
Viruses
318 papers in training set
Top 5%
0.8%
21
Wellcome Open Research
57 papers in training set
Top 2%
0.8%
22
BMC Public Health
147 papers in training set
Top 6%
0.8%
23
JAMIA Open
37 papers in training set
Top 2%
0.7%
24
Sexually Transmitted Infections
21 papers in training set
Top 0.4%
0.7%
25
PLOS Global Public Health
293 papers in training set
Top 6%
0.7%
26
Microbial Genomics
204 papers in training set
Top 2%
0.7%
27
AIDS
31 papers in training set
Top 0.5%
0.5%
28
mSystems
361 papers in training set
Top 8%
0.5%
29
Journal of Advanced Research
15 papers in training set
Top 1%
0.5%
30
BMJ Open Quality
15 papers in training set
Top 1.0%
0.5%