Back

LLM-Based Web Data Collection for Research Dataset Creation

2025-05-25 health informatics Title + abstract only
View on medRxiv
Show abstract

Researchers across many fields rely on web data to gain new insights and validate methods. However, assembling accurate and comprehensive datasets typically demands manual review of numerous web pages to identify and record only those data points relevant to specific research objectives. The vast and scattered nature of online information makes this process time-consuming and prone to human error. To address these challenges, we present a human-in-the-loop framework that automates web-scale data...

Predicted journal destinations