Back

Comparing AI and Human Coding of NIH Grant Abstracts to Identify Innovations in Opioid Addiction Treatment

Alkhatib, S. A.; Jiwa, N.; Judd, D.; Luningham, J. M.; Sawyer-Morris, G.; Ulukaya, M.; Molfenter, T.; Taxman, F. S.; Walters, S. T.

2026-02-17 health informatics
10.64898/2026.02.13.26346235 medRxiv
Show abstract

Large language models (LLMs) are increasingly used for qualitative analysis in substance use research, yet their performance relative to human coders remains underexplored. This study compares ChatGPT-4.0 with human coders in identifying and describing the core innovation of NIH grants focused on reducing opioid overdose. A total of 118 NIH HEAL Initiative grant abstracts were independently coded by ChatGPT and humans to generate innovation descriptions, which were then evaluated by both human raters and ChatGPT for depth/detail and relevance/completeness using 5-point Likert scales. Identical instructions were used across all coding and evaluation stages. ChatGPT-generated descriptions were consistently rated higher than human-generated descriptions on both dimensions. Human evaluators rated ChatGPT outputs at an average of 4.47 for both depth/detail and relevance/completeness, compared to 3.33 and 3.24 for human outputs, respectively (F(1,176)=133.9, p<0.001). These findings suggest that LLMs, when carefully prompted, can enhance the efficiency and quality of qualitative research evaluation.

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
International Journal of Drug Policy
11 papers in training set
Top 0.1%
10.2%
2
PLOS ONE
4510 papers in training set
Top 18%
10.2%
3
International Journal of Medical Informatics
25 papers in training set
Top 0.2%
6.4%
4
JAMIA Open
37 papers in training set
Top 0.2%
4.9%
5
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.6%
4.2%
6
Frontiers in Digital Health
20 papers in training set
Top 0.2%
3.6%
7
JMIR Public Health and Surveillance
45 papers in training set
Top 0.6%
3.6%
8
Journal of Medical Internet Research
85 papers in training set
Top 1%
3.6%
9
BMC Medical Research Methodology
43 papers in training set
Top 0.3%
3.1%
10
Scientific Reports
3102 papers in training set
Top 47%
2.4%
50% of probability mass above
11
JMIR Research Protocols
18 papers in training set
Top 0.4%
2.1%
12
Healthcare
16 papers in training set
Top 0.4%
1.9%
13
Preventive Medicine Reports
14 papers in training set
Top 0.1%
1.9%
14
Journal of Biomedical Informatics
45 papers in training set
Top 0.8%
1.7%
15
BMJ Open
554 papers in training set
Top 9%
1.7%
16
BMC Health Services Research
42 papers in training set
Top 1%
1.5%
17
PLOS Digital Health
91 papers in training set
Top 2%
1.3%
18
Biology Methods and Protocols
53 papers in training set
Top 1%
1.3%
19
DIGITAL HEALTH
12 papers in training set
Top 0.4%
1.3%
20
Drug and Alcohol Dependence
37 papers in training set
Top 0.4%
1.3%
21
International Journal of Environmental Research and Public Health
124 papers in training set
Top 5%
1.3%
22
JMIR Formative Research
32 papers in training set
Top 1.0%
1.3%
23
JMIR Medical Informatics
17 papers in training set
Top 0.9%
1.3%
24
JMIRx Med
31 papers in training set
Top 1%
1.2%
25
Acta Neuropsychiatrica
12 papers in training set
Top 0.7%
1.1%
26
BMJ Health & Care Informatics
13 papers in training set
Top 0.6%
1.1%
27
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
1.1%
28
Frontiers in Public Health
140 papers in training set
Top 7%
0.9%
29
Psychiatry Research
35 papers in training set
Top 1%
0.9%
30
Frontiers in Psychiatry
83 papers in training set
Top 3%
0.9%