Back

Free-text MAUDE narratives provide a source-robust representation layer for biomaterial-device surveillance

Chen, H.

2026-05-05 health informatics
10.64898/2026.05.03.26352339 medRxiv
Show abstract

Implantable biomaterial devices require effective post-market surveillance because clinically important failure patterns often emerge only after widespread use. However, surveillance workflows often rely on structured coded summaries that compress heterogeneous adverse-event narratives into coarse categories. This study compares coded and free-text narrative representations across 1,500 FDA MAUDE reports from three biomaterial device classes (coronary stents, bone cement, and surgical mesh) to test whether narratives preserve a more source-robust surveillance representation. Under manufacturer-held-out evaluation, narrative TF-IDF features outperformed structured code-only features (macro F1 0.925 versus 0.827), while delexicalized narratives retained strong grouped performance after masking device-class, manufacturer, brand, and legal-template tokens (F1 0.897). Narrative topics resolved reported events into procedural, anatomical, host-response, and reporting-context patterns, and an interpretable classifier recovered code-derived complication phenotypes from narrative text alone (mean F1 0.902, AUC 0.967). These findings support free-text adverse-event narratives as a complementary representation layer for post-market device surveillance, while remaining bounded by passive adverse-event reporting limitations and requiring validation across additional years, device classes, and independently adjudicated outcomes. Author SummaryWhen an implanted medical device fails inside a patient, the event is reported to the FDAs MAUDE database. Each report includes both a standardized code and a written narrative describing what happened. We asked whether these two representations carry the same information. Using 1,500 reports covering coronary stents, bone cement, and surgical mesh, we found that coded fields lose much of the clinical detail present in narratives. Importantly, narrative-based classifiers remained accurate even when tested on reports from manufacturers not seen during training, while code-based classifiers dropped substantially. This matters because real-world surveillance must generalize across different reporting sources. We also found that narrative text can recover clinically meaningful complication patterns that are defined by codes, and that most reports never name the specific biomaterial involved. These findings suggest that narrative text deserves a more central role in post-market device monitoring, complementing the coded fields that current surveillance pipelines rely on.

Matching journals

The top 1 journal accounts for 50% of the predicted probability mass.

1
npj Digital Medicine
97 papers in training set
Top 0.1%
53.5%
50% of probability mass above
2
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.5%
6.5%
3
Scientific Reports
3102 papers in training set
Top 26%
4.4%
4
JMIR Medical Informatics
17 papers in training set
Top 0.2%
4.4%
5
Journal of Biomedical Informatics
45 papers in training set
Top 0.6%
2.4%
6
JAMIA Open
37 papers in training set
Top 0.6%
2.1%
7
Journal of Medical Internet Research
85 papers in training set
Top 2%
1.7%
8
PLOS Digital Health
91 papers in training set
Top 2%
1.4%
9
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.1%
10
PLOS ONE
4510 papers in training set
Top 61%
1.1%
11
The Lancet Digital Health
25 papers in training set
Top 0.7%
1.0%
12
BMJ Health & Care Informatics
13 papers in training set
Top 0.7%
1.0%
13
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
0.9%
14
European Heart Journal - Digital Health
15 papers in training set
Top 0.5%
0.8%
15
Nature Communications
4913 papers in training set
Top 61%
0.8%
16
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.8%
0.8%
17
Nature Biomedical Engineering
42 papers in training set
Top 2%
0.8%
18
Frontiers in Digital Health
20 papers in training set
Top 2%
0.5%
19
International Journal of Medical Informatics
25 papers in training set
Top 2%
0.5%
20
PLOS Computational Biology
1633 papers in training set
Top 28%
0.5%
21
JAMA Network Open
127 papers in training set
Top 5%
0.5%
22
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 1%
0.5%
23
JMIR Public Health and Surveillance
45 papers in training set
Top 4%
0.5%