AI-driven selection of patients with non-valvular atrial fibrillation for oral anticoagulation therapy: a multi-cohort validation and impact evaluation study
Rao, S.; Walli-Attaei, M.; Ahmed, N.; Fan, Z.; Petrazzini, B.; Lian, J.; Ghamari, S.; Wamil, M.; Lip, G. Y. H.; Leal, J.; Rahimi, K.
Show abstract
Background: Current risk assessment tools for guiding direct oral anticoagulant (DOAC) therapy for patients with atrial fibrillation (AF) based on clinical risk factors demonstrate modest predictive performance limiting clinical impact. Additionally, while guidelines recommend periodic reassessment of risk over time, there remains an absence of modelling solutions for capturing evolving risk in AF patients. Methods: Using UK electronic health records, we developed and validated the Transformer-based Risk assessment survival model (TRisk), an artificial intelligence model that predicts 12-month thromboembolic and bleeding events in AF patients by leveraging temporal patient journeys up to baseline. A cohort of 411,850 prevalent non-valvular AF patients aged [≥]18 years between 2010 and 2020 was identified from 1,442 English general practices. Practices were randomly allocated to derivation (n=1,079) and external validation (n=363) cohorts. TRisk was compared with CHA2DS2-VASc and CHA2DS2-VA for thromboembolic event prediction, and HAS-BLED and ORBIT for bleeding prediction, with subgroup analyses by sex, age, and baseline characteristics. A second validation of TRisk was also performed on 16,218 US AF patients between 2010 and 2023. A decision model compared outcomes and healthcare costs for TRisk versus standard care. Findings: TRisk achieved higher discrimination for thromboembolic event prediction (C-index: 0.82; 95% confidence interval [CI]: [0.81, 0.83]) as compared to CHA2DS2-VASc (0.71 [0.70, 0.73]) in UK validation. Application of TRisk to US data yielded similar C-index: 0.82 (0.80, 0.84). For bleeding prediction, TRisk (C-index: 0.70 [0.69-0.71]) outperformed both HAS-BLED (0.63; [0.61, 0.64]) and ORBIT (0.64; [0.63, 0.65]), with comparable US results (0.71; [0.69, 0.74]). The model remained well-calibrated across both populations and performed equitably across subgroups, including by race and during the COVID-19 pandemic. Impact analyses showed TRisk could reduce DOAC prescriptions by 8% in the UK and 7% in the US relative to guideline-recommended approaches, while preventing at least as many thromboembolic events. This refined approach would generate annual healthcare savings of GBP 5.5 million and USD 456.2 million in the UK and US respectively among patients initiating DOACs, rising to GBP 48.6 million and USD 1.8 billion when extended to all AF patients on DOACs. Interpretation: TRisk enabled more precise prediction for both thromboembolic and bleeding events across AF populations in UK and US compared to established clinical scoring systems. Incorporating TRisk into routine AF care would result in substantial cost savings without compromising the identification of true high-risk patients. Funding: None
Matching journals
The top 6 journals account for 50% of the predicted probability mass.