
PCRAgent: A Multi-Agent Framework for Transforming Noisy Clinical Conversations into Structured Pre-Consultation Medical Records and Reusable Clinical Data Resources

Zhang, M.; Zhao, J.; Tang, W.; Xing, J.; Li, J.; Zhang, H.; Qiu, J.; Zhang, Y.

2026-06-11 health informatics
10.64898/2026.06.10.26355372 medRxiv

In primary care and outpatient settings, clinically important patient information is often embedded in fragmented, ambiguous, repetitive, and noisy communication between physicians and patients. This limits physicians' ability to obtain a clear pre-consultation overview of symptoms, history of present illness, and visit intent, and it prevents real-world clinical dialogues from being reused in hospital information systems and medical artificial intelligence applications. To address this challenge, we developed PCRAgent, a centrally coordinated multi-agent framework for pre-consultation clinical information organization. Guided by physician inquiry logic, PCRAgent identifies, extracts, corrects, and standardizes patient-reported information from noisy consultations. Its coordinated modules, including error detection, semantic editing, output control, contextual memory, and intent recognition, enable robust parallel handling of spelling errors, repetitions, grammatical inconsistencies, medical ambiguities, and non-medical interference. A traceable edit list records intermediate corrections and context, allowing iterative refinement without redundant modifications. PCRAgent generates two complementary outputs: a Pre-Consultation Clinical Report for rapid physician review, and a Structured Clinical Conversation Dataset for hospital data construction and downstream AI applications. In evaluations using 220,000 strongly perturbed consultations, PCRAgent maintained high robustness, achieving a clinical information accuracy of 4.99 out of 5 and key element completeness of 5 out of 5, outperforming GPT-4o. Expert review of Chinese and English dialogues confirmed high clinical accuracy (4.85 out of 5) and high safety (4.79 out of 5). Multicenter validation in real-world outpatient workflows further demonstrated practical utility.
These findings indicate that PCRAgent can efficiently transform noisy and unstructured consultations into physician-ready reports and AI-ready structured data, improving outpatient efficiency, reducing cognitive burden, ensuring information completeness, supporting precise decision-making, and enabling high-quality reuse of clinical data.
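
The abstract does not describe how the traceable edit list is implemented. As a purely illustrative sketch, assuming an edit is a simple record of a dialogue turn, an error type, and the text before and after correction, the idea of logging corrections once and skipping repeats could look like the following. The names `Edit`, `EditList`, `record`, and `apply` are hypothetical and are not taken from PCRAgent.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Edit:
    """One logged correction (hypothetical structure, not PCRAgent's)."""
    turn: int      # index of the dialogue turn that was corrected
    kind: str      # error type, e.g. "spelling" or "repetition"
    before: str    # text as it appeared in the noisy turn
    after: str     # corrected text


@dataclass
class EditList:
    """Ordered log of edits that refuses to store the same edit twice."""
    edits: list = field(default_factory=list)

    def record(self, edit: Edit) -> bool:
        # Frozen dataclasses compare field by field, so an identical
        # correction proposed in a later pass is detected and skipped.
        if edit in self.edits:
            return False
        self.edits.append(edit)
        return True

    def apply(self, turns: list) -> list:
        # Replay the log on a copy so the original dialogue stays intact
        # and every change can be traced back to a logged edit.
        out = list(turns)
        for e in self.edits:
            out[e.turn] = out[e.turn].replace(e.before, e.after)
        return out


log = EditList()
log.record(Edit(0, "spelling", "hedache", "headache"))
log.record(Edit(0, "spelling", "hedache", "headache"))  # duplicate, ignored
cleaned = log.apply(["I have a hedache since monday"])
```

Keeping the log separate from the dialogue is one possible design choice: the raw turns remain available for audit, and a second refinement pass can check the log before proposing a change.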

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1. npj Digital Medicine (97 papers in training set; Top 0.1%): 28.2%
2. Scientific Reports (3102 papers in training set; Top 16%): 6.5%
3. Journal of the American Medical Informatics Association (61 papers in training set; Top 0.5%): 6.4%
4. Nature Communications (4913 papers in training set; Top 36%): 4.2%
5. Journal of Biomedical Informatics (45 papers in training set; Top 0.4%): 4.0%
6. IEEE Journal of Biomedical and Health Informatics (34 papers in training set; Top 0.4%): 3.7%
50% of probability mass above
7. iScience (1063 papers in training set; Top 4%): 3.7%
8. Bioinformatics (1061 papers in training set; Top 6%): 2.6%
9. PLOS ONE (4510 papers in training set; Top 45%): 2.4%
10. BMC Medical Informatics and Decision Making (39 papers in training set; Top 1%): 2.4%
11. Journal of Medical Internet Research (85 papers in training set; Top 2%): 2.1%
12. JAMIA Open (37 papers in training set; Top 0.7%): 1.8%
13. Artificial Intelligence in Medicine (15 papers in training set; Top 0.3%): 1.7%
14. Frontiers in Digital Health (20 papers in training set; Top 0.6%): 1.7%
15. Nature Medicine (117 papers in training set; Top 2%): 1.7%
16. Med (38 papers in training set; Top 0.3%): 1.7%
17. International Journal of Medical Informatics (25 papers in training set; Top 0.8%): 1.7%
18. JCO Clinical Cancer Informatics (18 papers in training set; Top 0.5%): 1.5%
19. Advanced Science (249 papers in training set; Top 13%): 1.4%
20. JMIR Medical Informatics (17 papers in training set; Top 1%): 1.1%
21. GigaScience (172 papers in training set; Top 2%): 0.9%
22. Frontiers in Microbiology (375 papers in training set; Top 7%): 0.9%
23. Patterns (70 papers in training set; Top 2%): 0.8%
24. Nature Machine Intelligence (61 papers in training set; Top 3%): 0.8%
25. Philosophical Transactions of the Royal Society B (51 papers in training set; Top 6%): 0.7%
26. BMC Bioinformatics (383 papers in training set; Top 8%): 0.7%
27. NAR Genomics and Bioinformatics (214 papers in training set; Top 4%): 0.7%
28. eLife (5422 papers in training set; Top 61%): 0.7%
29. Journal of Personalized Medicine (28 papers in training set; Top 2%): 0.5%
30. Nature Computational Science (50 papers in training set; Top 2%): 0.5%
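
The "top 6 journals" cutoff can be checked by accumulating the predicted probabilities in rank order until the running total reaches 50%. The sketch below uses the 30 percentages listed above as displayed (they are rounded on the page, so totals are approximate). The helper name `journals_to_reach` is hypothetical and is not part of the prediction tool.

```python
from itertools import accumulate

# Predicted probabilities (%) for the 30 listed journals, in rank order.
probs = [
    28.2, 6.5, 6.4, 4.2, 4.0, 3.7, 3.7, 2.6, 2.4, 2.4,
    2.1, 1.8, 1.7, 1.7, 1.7, 1.7, 1.7, 1.5, 1.4, 1.1,
    0.9, 0.9, 0.8, 0.8, 0.7, 0.7, 0.7, 0.7, 0.5, 0.5,
]


def journals_to_reach(probabilities, threshold):
    """Return (rank, running_total) at the first rank whose cumulative
    probability meets the threshold, or None if it is never reached."""
    for rank, total in enumerate(accumulate(probabilities), start=1):
        # Small tolerance guards against floating-point rounding.
        if total >= threshold - 1e-9:
            return rank, total
    return None


rank, total = journals_to_reach(probs, 50.0)
print(f"{rank} journals cover {total:.1f}% of the predicted probability")
```

With the listed values, the running total is 49.3% after five journals and about 53.0% after six, which is why the 50% marker sits below rank 6.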