Back
Top 0.3%
17.9%
Top 0.3%
12.0%
Top 1%
9.1%
Top 0.5%
7.9%
Top 2%
7.9%
Top 1%
7.9%
Top 3%
5.1%
Top 63%
4.2%
#1
3.9%
Top 3%
2.0%
Top 36%
2.0%
Top 7%
2.0%
Top 92%
2.0%
Top 3%
2.0%
Top 2%
2.0%
Top 0.4%
2.0%
Top 2%
1.6%
Top 5%
1.0%
Top 3%
1.0%
Top 4%
0.7%
LLM-Based Web Data Collection for Research Dataset Creation
2025-05-25
health informatics
Title + abstract only
View on medRxiv
Show abstract
Researchers across many fields rely on web data to gain new insights and validate methods. However, assembling accurate and comprehensive datasets typically demands manual review of numerous web pages to identify and record only those data points relevant to specific research objectives. The vast and scattered nature of online information makes this process time-consuming and prone to human error. To address these challenges, we present a human-in-the-loop framework that automates web-scale data...
Predicted journal destinations
1
Journal of the American Medical Informatics Association
53 training papers
2
Journal of Biomedical Informatics
37 training papers
3
PLOS Digital Health
88 training papers
4
JAMIA Open
35 training papers
5
npj Digital Medicine
85 training papers
6
Journal of Medical Internet Research
81 training papers
7
BMC Medical Informatics and Decision Making
36 training papers
8
Scientific Reports
701 training papers
9
Patterns
15 training papers
10
International Journal of Medical Informatics
25 training papers
11
Nature Communications
483 training papers
12
PLOS Computational Biology
141 training papers
13
PLOS ONE
1737 training papers
14
Computers in Biology and Medicine
39 training papers
15
JMIR Medical Informatics
16 training papers
16
Bioinformatics
24 training papers
17
BMC Medical Research Methodology
41 training papers
18
JMIR Public Health and Surveillance
45 training papers
19
Scientific Data
30 training papers
20
Frontiers in Digital Health
18 training papers