Back

faers: A High-Fidelity Framework and R/Bioconductor Package for Precision Adverse Event Surveillance

Wang, Z.; Peng, Y.; Zhou, J.-G.; Bu, X.; Zhao, Y.; Li, Z.; Yan, B.; Sun, Y.; Wang, C.; Shu, C.; Cui, Y.; Wang, S.

2026-03-28 genetic and genomic medicine
10.64898/2026.03.26.26349444 medRxiv
Show abstract

Background: The FDA Adverse Event Reporting System (FAERS) is a critical pillar of post-marketing pharmacovigilance; however, its utility is constrained by data heterogeneity, pervasive reporting redundancies, and inconsistent medical terminology. These structural barriers impede reproducible, large-scale analyses and the implementation of precision drug safety surveillance. Methods: We developed faers, an open-source R package that delivers a standardized framework and an end-to-end workflow for transforming raw FAERS data into analysis-ready formats. The package implements a regulatory-compliant multi-level deduplication strategy, automated MedDRA terminology mapping, and an R S4-based object-oriented system to ensure data integrity, traceability, and efficient management of complex relational structures. It further integrates a full suite of disproportionality signal detection methods, including the Reporting Odds Ratio (ROR), Proportional Reporting Ratio (PRR), Bayesian Confidence Propagation Neural Network (BCPNN), and Empirical Bayes Geometric Mean (EBGM). Performance was benchmarked on large-scale FAERS datasets, and validity was confirmed by replicating published findings on anti-PD-1/PD-L1-associated cardiotoxicity and CAR-T cell therapy outcomes, with additional application to immune-related adverse events (irAEs). Findings: The package demonstrated high computational efficiency and near-linear scalability when processing extensive quarterly FAERS data. Validation analyses of two case studies showed excellent concordance with prior literature. Application to an irAE cohort further identified a statistically significant age-by-sex interaction in risk patterns, demonstrating the tool's ability to uncover nuanced demographic signals that are often missed by conventional approaches. Interpretation: The faers package provides a transparent, scalable, and fully reproducible framework for FAERS-based pharmacovigilance. By automating data cleaning, standardization, and advanced signal detection, it lowers technical barriers for researchers and regulators while promoting high-quality, open pharmacoepidemiological research to strengthen drug safety monitoring.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.1%
22.9%
2
Genome Medicine
154 papers in training set
Top 0.3%
12.6%
3
npj Digital Medicine
97 papers in training set
Top 0.4%
10.6%
4
PLOS ONE
4510 papers in training set
Top 37%
3.7%
5
Scientific Data
174 papers in training set
Top 0.5%
3.7%
50% of probability mass above
6
Clinical and Translational Science
21 papers in training set
Top 0.2%
3.1%
7
Frontiers in Pharmacology
100 papers in training set
Top 1%
2.8%
8
Nature Communications
4913 papers in training set
Top 43%
2.8%
9
Briefings in Bioinformatics
326 papers in training set
Top 3%
1.9%
10
PLOS Computational Biology
1633 papers in training set
Top 17%
1.5%
11
JAMA Network Open
127 papers in training set
Top 2%
1.5%
12
Journal of Biomedical Informatics
45 papers in training set
Top 0.9%
1.4%
13
BMC Medical Research Methodology
43 papers in training set
Top 0.8%
1.2%
14
Bioinformatics
1061 papers in training set
Top 8%
1.2%
15
Med
38 papers in training set
Top 0.4%
1.2%
16
Nucleic Acids Research
1128 papers in training set
Top 13%
1.2%
17
Scientific Reports
3102 papers in training set
Top 66%
1.2%
18
Clinical Pharmacology & Therapeutics
25 papers in training set
Top 0.5%
1.0%
19
Communications Medicine
85 papers in training set
Top 0.6%
1.0%
20
Pharmacoepidemiology and Drug Safety
13 papers in training set
Top 0.3%
0.9%
21
BioData Mining
15 papers in training set
Top 0.6%
0.9%
22
Bioinformatics Advances
184 papers in training set
Top 4%
0.9%
23
European Respiratory Journal
54 papers in training set
Top 2%
0.9%
24
The Lancet Digital Health
25 papers in training set
Top 0.9%
0.8%
25
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.7%
26
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
27
JCO Clinical Cancer Informatics
18 papers in training set
Top 1.0%
0.7%
28
Journal of Translational Medicine
46 papers in training set
Top 3%
0.7%
29
Informatics in Medicine Unlocked
21 papers in training set
Top 1%
0.7%
30
Biometrics
22 papers in training set
Top 0.2%
0.7%