Back

A scientometric analysis of fairness in health AI literature

Alberto, I.; Alberto, N. R.; Altinel, Y.; Blacker, S.; Binotti, W.; Celi, L. A.; Chua, T.; Fiske, A.; Griffin, M.; Karaca, G.; Mokolo, N.; Naawu, D.; Patscheider, J.; Petushkov, A.; Quion, J. M.; Senteio, C.; Taisbak, S.; Tirnova, I.; Tokashiki, H.; Velasquez, A.; Yaghy, A.; Yap, K.

2023-03-20 public and global health
10.1101/2023.03.20.23287471 medRxiv
Show abstract

OBJECTIVEArtificial intelligence (AI) and machine learning are central components of todays medical environment. The fairness of AI, i.e. the ability of AI to be free from bias, has repeatedly come into question. This study investigates the diversity of the members of academia whose scholarship poses questions about the fairness of AI. METHODSThe articles that combine the topics of fairness, artificial intelligence, and medicine were selected from Pubmed, Google Scholar, and Embase using keywords. Eligibility and data extraction from the articles were done manually and cross-checked by another author for accuracy. 375 articles were selected for further analysis, cleaned, and organized in Microsoft Excel; spatial diagrams were generated using Public Tableau. Additional graphs were generated using Matplotlib and Seaborn. The linear and logistic regressions were analyzed using Python. RESULTSWe identified 375 eligible publications, including research and review articles concerning AI and fairness in healthcare. When looking at the demographics of all authors, out of 1984, 794 were female, and 1190 were male. Out of 375 first authors, 155 (41.33%) were female, and 220 (58.67%) were male. For last authors 110 (31.16%) were female, and 243 (68.84%) were male. In regards to ethnicity, 234 (62.40%) of the first authors were white, 103 (27.47%) were Asian, 24 (6.40%) were black, and 14 (3.73%) were Hispanic. For the last authors, 234 (66.29%) were white, 96 (27.20%) were Asian, 12 (3.40%) were black, and 11 (3.11%) were Hispanic. Most authors were from the USA, Canada, and the United Kingdom. The trend continued for the first and last authors of the articles. When looking at the general distribution, 1631 (82.2%) were based in high-income countries, 209 (10.5 %) were based in upper-middle-income countries, 135 (6.8%) were based in lower-middle-income countries, and 9 (0.5 %) were based in low-income countries. CONCLUSIONSAnalysis of the bibliographic data revealed that there is an overrepresentation of white authors and male authors, especially in the roles of first and last author. The more male authors a paper had the more likely they were to be cited. Additionally, analysis showed that papers whose authors are based in higher-income countries were more likely to be cited more often and published in higher impact journals. These findings highlight the lack of diversity among the authors in the AI fairness community whose work gains the largest readership, potentially compromising the very impartiality that the AI fairness community is working towards.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 3%
28.8%
2
JMIRx Med
31 papers in training set
Top 0.1%
8.8%
3
Scientific Reports
3102 papers in training set
Top 29%
4.1%
4
Journal of Public Health
23 papers in training set
Top 0.1%
3.7%
5
BMC Public Health
147 papers in training set
Top 1%
3.7%
6
Public Health
34 papers in training set
Top 0.3%
2.5%
50% of probability mass above
7
F1000Research
79 papers in training set
Top 0.9%
2.5%
8
BMJ Open
554 papers in training set
Top 8%
2.0%
9
Journal of Clinical Epidemiology
28 papers in training set
Top 0.2%
2.0%
10
International Journal of Environmental Research and Public Health
124 papers in training set
Top 3%
2.0%
11
Cureus
67 papers in training set
Top 2%
1.8%
12
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.5%
13
Journal of Medical Internet Research
85 papers in training set
Top 3%
1.4%
14
Social Science & Medicine
15 papers in training set
Top 0.6%
1.4%
15
Research Synthesis Methods
20 papers in training set
Top 0.1%
1.4%
16
BMC Cancer
52 papers in training set
Top 2%
1.3%
17
EClinicalMedicine
21 papers in training set
Top 0.4%
1.3%
18
Systematic Reviews
11 papers in training set
Top 0.3%
1.0%
19
BMC Medicine
163 papers in training set
Top 5%
1.0%
20
Royal Society Open Science
193 papers in training set
Top 4%
0.9%
21
Heart
10 papers in training set
Top 0.8%
0.8%
22
Frontiers in Medicine
113 papers in training set
Top 6%
0.8%
23
PLOS Digital Health
91 papers in training set
Top 3%
0.8%
24
PeerJ
261 papers in training set
Top 14%
0.8%
25
JMIR Public Health and Surveillance
45 papers in training set
Top 3%
0.8%
26
Frontiers in Public Health
140 papers in training set
Top 8%
0.8%
27
JAMA Network Open
127 papers in training set
Top 4%
0.7%
28
Journal of General Internal Medicine
20 papers in training set
Top 1%
0.7%
29
Heliyon
146 papers in training set
Top 9%
0.5%
30
BMC Medical Research Methodology
43 papers in training set
Top 2%
0.5%