Back

CohortSymmetry: An R package to perform sequence symmetry analysis using the OMOP common data model

Chen, X.; Stanford, T.; Guo, Y.; Raventos, B.; Du, M.; Li, X.; Lam, A.; Corby, G.; Mercade-Besora, N.; Alcalde Herraiz, M.; Lopez-Guell, K.; Delmestri, A.; Man, W. Y.; PRIETO-ALHAMBRA, D.; Burn, E.; Catala, M.; Pratt, N.; Jodicke, A.; Newby, D.

2025-11-17 pharmacology and therapeutics
10.1101/2025.11.14.25340229
Show abstract

BackgroundReal-world data are valuable for detecting adverse drug events, and Sequence Symmetry Analysis (SSA) is a simple yet effective method frequently used for this purpose. However, heterogeneous implementations across studies limit reproducibility and scalability. To address this, we developed an open-source R package that standardises SSA analytics using data mapped to the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). MethodsWe developed CohortSymmetry, an R package that implements SSA for OMOP CDM data. The package was validated through unit testing and evaluated empirically by estimating adjusted sequence ratios (ASRs) with 95% confidence intervals (CIs) for 23 positive and 10 negative controls across six European databases, including CPRD GOLD (UK) and THIN(R) (Belgium, Italy, Romania, Spain, UK). Sensitivity and specificity were defined as the proportions of positive and negative controls correctly identified by SSA. Sensitivity analyses varied key parameters, including the washout period. ResultsCohortSymmetry passed high-coverage unit tests. Of 33 eligible controls, four showed results consistent with expectations across all databases; for example, the amiodarone-levothyroxine pair had a lower 95% CI bound >1 in each. Sensitivity was moderate, whereas specificity was high in the primary analyses. Parameter variation influenced outcomes; a 365-day prior observation requirement reduced specificity in CPRD GOLD from 75% to 38%. ConclusionsCohortSymmetry enables reproducible SSA using OMOP CDM data. Differences across databases likely reflect heterogeneity in data capture and prescribing patterns. Limitations include residual data variability and SSAs susceptibility to time-varying confounding, underscoring the need for tailored analytic design in pharmacovigilance studies. Key MessagesO_LIWe developed CohortSymmetry, an open-source R package that standardises SSA analytics using OMOP CDM-mapped data and verified the correctness of functions via unit testing and application to real-world datasets. C_LIO_LICohortSymmetry passed high-coverage tests, and among 33 selected controls, four showed results consistent with expectations across all databases; varying analytical parameters affected results. C_LIO_LIThe package provides a reproducible and scalable framework for multi-database SSA studies, supporting robust pharmacovigilance, but careful specification of parameters is required to account for the characteristics of the medical domain under investigation. C_LI

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Clinical Pharmacology & Therapeutics
based on 19 papers
Top 0.2%
12.9%
2
British Journal of Clinical Pharmacology
based on 21 papers
Top 0.1%
12.9%
3
Pharmacoepidemiology and Drug Safety
based on 12 papers
Top 0.1%
12.9%
4
Clinical and Translational Science
based on 14 papers
Top 0.1%
6.6%
5
Frontiers in Pharmacology
based on 27 papers
Top 0.5%
5.5%
50% of probability mass above
6
BMJ
based on 49 papers
Top 0.7%
5.2%
7
Trials
based on 24 papers
Top 0.8%
3.0%
8
PLOS Medicine
based on 95 papers
Top 4%
2.9%
9
Scientific Reports
based on 701 papers
Top 53%
2.9%
10
PLOS ONE
based on 1737 papers
Top 79%
2.9%
11
Journal of Antimicrobial Chemotherapy
based on 19 papers
Top 0.4%
2.5%
12
Journal of Biomedical Informatics
based on 37 papers
Top 3%
2.4%
13
BMJ Open
based on 553 papers
Top 39%
1.8%
14
BJGP Open
based on 12 papers
Top 1.0%
1.6%
15
Nature
based on 58 papers
Top 5%
1.6%
16
Journal of the American Medical Informatics Association
based on 53 papers
Top 5%
1.4%
17
Nature Communications
based on 483 papers
Top 34%
1.4%
18
Antimicrobial Agents and Chemotherapy
based on 17 papers
Top 1%
1.2%
19
Clinical Infectious Diseases
based on 219 papers
Top 19%
0.8%
20
Journal of Medical Internet Research
based on 81 papers
Top 16%
0.7%
21
JAMIA Open
based on 35 papers
Top 6%
0.7%
22
The Journal of Infectious Diseases
based on 137 papers
Top 11%
0.7%
23
The British Journal of Psychiatry
based on 21 papers
Top 4%
0.7%
24
JMIRx Med
based on 29 papers
Top 7%
0.7%