Back

Early Detection of Absurdity Signals in Pharmacovigilance: A Machine Learning Ensemble Approach to Identify Rare Adverse Drug Reactions

Dasgupta, R.

2026-02-09 health informatics
10.64898/2026.02.06.26345783 medRxiv
Show abstract

BackgroundTraditional pharmacovigilance methods based on biostatistical approaches systematically exclude outliers and rare events, potentially missing critical safety signals. These methods fail to detect micro-clusters of adverse events and comorbidity patterns that may indicate serious but low-frequency adverse drug reactions (ADRs). We introduce the concept of absurdity signal detection - the identification of statistically anomalous but clinically significant adverse event patterns that conventional methods dismiss as outliers. MethodsWe developed an ensemble machine learning framework combining five distinct algorithms (Random Forest, Gradient Boosting, XGBoost, Neural Networks, and Support Vector Machines) to analyze FDA Adverse Event Reporting System (FAERS) data. The system employs outlier-inclusive modeling, multi-dimensional cluster detection, and severity-weighted propensity scoring. We validated our approach on Losartan, analyzing 500 adverse event reports to detect absurdity signals that may have been missed by conventional biostatistical surveillance. ResultsOur ensemble approach achieved 75% accuracy in identifying high-risk adverse events, with the best-performing model successfully detecting 15 distinct absurdity signals. The top five identified events were: cough (propensity score 1.525), angioedema (1.298), insomnia (1.290), nausea (1.180), and hyperkalemia (1.114). Notably, our method identified several rare but severe ADRs that would have been excluded as statistical outliers in traditional disproportionality analyses. The ensemble approach demonstrated superior performance compared to individual models, with inter-model agreement providing an additional confidence metric for signal validation. ConclusionsMachine learning-based absurdity signal detection offers a paradigm shift in pharmacovigilance by preserving and analyzing rare adverse events rather than excluding them. This approach has significant implications for patient safety, potentially preventing serious adverse events in vulnerable populations with atypical response profiles. Our methodology is scalable, validated against FDA data sources, and provides a framework for real-time safety monitoring in the $138 billion pharmaceutical industry. Future work will extend this approach to drug-drug interaction detection and personalized risk stratification.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
JAMIA Open
37 papers in training set
Top 0.1%
18.5%
2
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.1%
18.5%
3
npj Digital Medicine
97 papers in training set
Top 0.5%
10.4%
4
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.3%
9.1%
50% of probability mass above
5
Journal of Medical Internet Research
85 papers in training set
Top 0.6%
6.8%
6
Journal of Biomedical Informatics
45 papers in training set
Top 0.4%
3.9%
7
International Journal of Medical Informatics
25 papers in training set
Top 0.5%
2.7%
8
Scientific Reports
3102 papers in training set
Top 46%
2.6%
9
BMC Medical Research Methodology
43 papers in training set
Top 0.4%
2.1%
10
JMIR Medical Informatics
17 papers in training set
Top 0.5%
2.1%
11
PLOS ONE
4510 papers in training set
Top 50%
1.9%
12
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.9%
13
JMIR Public Health and Surveillance
45 papers in training set
Top 2%
1.8%
14
Clinical and Translational Science
21 papers in training set
Top 0.5%
1.5%
15
Frontiers in Digital Health
20 papers in training set
Top 1%
0.9%
16
Clinical Pharmacology & Therapeutics
25 papers in training set
Top 0.6%
0.9%
17
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.8%
0.7%
18
Acta Psychiatrica Scandinavica
10 papers in training set
Top 0.5%
0.6%
19
Integrative Biology
13 papers in training set
Top 0.2%
0.6%
20
Heliyon
146 papers in training set
Top 8%
0.6%
21
Clinical Infectious Diseases
231 papers in training set
Top 5%
0.6%
22
eLife
5422 papers in training set
Top 61%
0.6%
23
European Respiratory Journal
54 papers in training set
Top 2%
0.6%