Back

Large-scale evaluation of an AI system as an independent reader for double reading in breast cancer screening

Sharma, N.; Ng, A. Y.; James, J. J.; Khara, G.; Ambrozay, E.; Austin, C. C.; Forrai, G.; Glocker, B.; Heindl, A.; Karpati, E.; Rijken, T. M.; Venkataraman, V.; Yearsley, J. E.; Kecskemethy, P. D.

2021-03-01 radiology and imaging
10.1101/2021.02.26.21252537 medRxiv
Show abstract

ImportanceScreening mammography with two human readers increases cancer detection and lowers recall rates, but workforce shortages make double reading unsustainable in many countries. Artificial intelligence (AI) as an independent reader in double reading may support screening performance while improving cost-effectiveness. The clinical validation of AI requires large-scale, multi-vendor studies on unenriched cohorts. ObjectiveTo evaluate the performance of the Mia(R) AI system on data that the AI system would process in real-world deployments. DesignA retrospective study simulating the impact of AI on an unenriched screening sample. SettingSeven European breast screening sites representing four centers: three from the UK and one in Hungary (HU), between 2009 and 2019. ParticipantsThe sample included 275,900 cases (177,882 participants) from seven screening sites, involving two countries and four hardware vendors from 2009 to 2019. InterventionSimulation of double reading using AI as an independent reader in breast cancer screening on historical data. Main Outcomes and MeasuresPerformance was determined for standalone AI compared to the historical single reader and for simulated double reading with AI compared to historical double reading, assessing non-inferiority and superiority on relevant screening metrics using a non-inferiority margin of 10% relative difference and a one-sided alpha of 2.5% for both tests. ResultsStandalone AI detected 29.8% of missed interval cancers. When compared with historical double reading, double reading with AI showed non-inferiority for sensitivity and superiority for recall rate, specificity and positive predictive value. AI as an independent reader reduced the workload for the second human reader but increased the arbitration rate from 3.3% to 12.3%. Applying the AI system could have reduced the human reading time required by up to 44.8% and reduced the recall rate by a relative 7.7% (from 5.2% to 4.8%). Conclusions and RelevanceUsing the AI system as an independent reader maintains or improves the double reading standard of care, while substantially reducing the workload. Thus, it has the potential to provide operational and economic benefits. Trial RegistrationRegistered on ISRCTN, study ID: ISRCTN18056078

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
The Lancet Digital Health
25 papers in training set
Top 0.1%
15.0%
2
BMJ Open
554 papers in training set
Top 3%
7.3%
3
PLOS Digital Health
91 papers in training set
Top 0.3%
6.5%
4
BMC Medicine
163 papers in training set
Top 0.5%
6.5%
5
JAMA Network Open
127 papers in training set
Top 0.4%
6.5%
6
PLOS ONE
4510 papers in training set
Top 27%
6.4%
7
Nature Communications
4913 papers in training set
Top 32%
4.9%
50% of probability mass above
8
Diagnostics
48 papers in training set
Top 0.3%
4.4%
9
npj Digital Medicine
97 papers in training set
Top 1%
4.2%
10
Scientific Reports
3102 papers in training set
Top 28%
4.2%
11
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.2%
3.7%
12
European Radiology
14 papers in training set
Top 0.3%
2.1%
13
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.4%
1.4%
14
International Journal of Cancer
42 papers in training set
Top 0.8%
1.2%
15
eLife
5422 papers in training set
Top 51%
1.0%
16
International Journal of Epidemiology
74 papers in training set
Top 2%
1.0%
17
Breast Cancer Research
32 papers in training set
Top 0.4%
1.0%
18
Cancer Medicine
24 papers in training set
Top 1%
1.0%
19
Science Translational Medicine
111 papers in training set
Top 5%
0.9%
20
BMC Research Notes
29 papers in training set
Top 0.4%
0.9%
21
BMC Health Services Research
42 papers in training set
Top 2%
0.9%
22
Cancer Epidemiology, Biomarkers & Prevention
17 papers in training set
Top 0.5%
0.9%
23
PLOS Medicine
98 papers in training set
Top 4%
0.8%
24
Nature Medicine
117 papers in training set
Top 4%
0.8%
25
PLOS Computational Biology
1633 papers in training set
Top 24%
0.8%
26
npj Precision Oncology
48 papers in training set
Top 1%
0.7%
27
Frontiers in Oncology
95 papers in training set
Top 4%
0.7%
28
Communications Medicine
85 papers in training set
Top 1%
0.7%
29
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 1%
0.5%
30
GigaScience
172 papers in training set
Top 4%
0.5%