Back

Democratizing Scientific Publishing: A Local, Multi-Agent LLM Framework for Objective Manuscript Editing

Bhansali, R.; Gorenshtein, A.; Westover, B.; Goldenholz, D. M.

2026-04-17 health informatics
10.64898/2026.04.13.26350761 medRxiv
Show abstract

Manuscript preparation is a critical bottleneck in scientific publishing, yet existing AI writing tools require cloud transmission of sensitive content, creating data-confidentiality barriers for clinical researchers. We introduce the Paper Analysis Tool (PAT), a free, multi-agent framework that deploys 31 specialized agents powered by small language models (SLMs) to audit manuscripts across multiple quality dimensions without external data transmission. Applied to three published clinical neurological papers, PAT generated 540 evaluable suggestions. Validation by two expert reviewers (R.B., A.G.) confirmed 391 actionable, high-value revisions (90% agreement), achieving a 72.4% overall usefulness accuracy spanning methodological, statistical, and visual domains. Furthermore, deterministic re-evaluation of 126 agent-suggested rewrite pairs using Phase 0 metrics confirmed text improvement: total word count decreased by 25%, passive voice prevalence dropped sharply from 35% to 5%, average sentence length decreased by 24%, long-sentence fraction fell by 67%, and the Flesch-Kincaid grade improved by 17% . Our validation confirms that systematic, agent-driven pre-submission review drives measurable improvements, successfully converting manuscript optimization from an opaque, manual endeavor into a transparent and rigorous scientific process. Manuscript preparation is a critical bottleneck in scientific publishing, yet existing AI writing tools require cloud transmission of sensitive content, creating data-confidentiality barriers for clinical researchers. We introduce the Paper Analysis Tool (PAT), a free, multi-agent framework that deploys 31 specialized agents powered by small language models (SLMs) to audit manuscripts across multiple quality dimensions without external data transmission. Applied to three published clinical neurological papers, PAT generated 540 evaluable suggestions. Independent validation by two expert reviewers (R.B., A.G.) confirmed 391 actionable, high-value revisions (90% agreement), achieving a 72.4% overall usefulness accuracy spanning methodological, statistical, and visual domains. Furthermore, deterministic re-evaluation of 126 suggested Phase 0 rewrite pairs confirmed text improvement: total word count decreased by 25%, passive voice prevalence dropped sharply from 35% to 5%, average sentence length decreased by 24%, and long-sentence fraction fell by 67%, and the Flesch-Kincaid grade improved modestly. Our validation confirms that systematic, agent-driven pre-submission review drives measurable improvements, successfully converting manuscript optimization from an opaque, manual endeavor into a transparent and rigorous scientific process.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
npj Digital Medicine
97 papers in training set
Top 0.2%
23.1%
2
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.2%
15.1%
3
Scientific Reports
3102 papers in training set
Top 9%
8.6%
4
PLOS ONE
4510 papers in training set
Top 35%
4.1%
50% of probability mass above
5
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 2%
2.8%
6
Nature Communications
4913 papers in training set
Top 44%
2.7%
7
Annals of Internal Medicine
27 papers in training set
Top 0.2%
2.5%
8
Journal of Medical Internet Research
85 papers in training set
Top 2%
2.1%
9
PLOS Digital Health
91 papers in training set
Top 1%
1.7%
10
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.7%
11
iScience
1063 papers in training set
Top 17%
1.5%
12
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.5%
1.5%
13
Patterns
70 papers in training set
Top 1%
1.4%
14
Nature Medicine
117 papers in training set
Top 3%
1.0%
15
JAMIA Open
37 papers in training set
Top 1%
0.9%
16
Frontiers in Digital Health
20 papers in training set
Top 1%
0.9%
17
eLife
5422 papers in training set
Top 53%
0.9%
18
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 40%
0.9%
19
The Lancet Digital Health
25 papers in training set
Top 0.8%
0.9%
20
Bioinformatics
1061 papers in training set
Top 9%
0.9%
21
Nature Human Behaviour
85 papers in training set
Top 4%
0.8%
22
PLOS Biology
408 papers in training set
Top 18%
0.8%
23
Scientific Data
174 papers in training set
Top 2%
0.8%
24
BMJ Health & Care Informatics
13 papers in training set
Top 0.8%
0.8%
25
BMC Bioinformatics
383 papers in training set
Top 7%
0.8%
26
International Journal of Medical Informatics
25 papers in training set
Top 2%
0.8%
27
Frontiers in Neurology
91 papers in training set
Top 5%
0.7%
28
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.7%
29
Med
38 papers in training set
Top 0.9%
0.7%
30
Communications Biology
886 papers in training set
Top 31%
0.5%