Back

Leveraging Deep Learning of Chest Radiograph Images to Identify Individuals at High Risk for Chronic Obstructive Pulmonary Disease

Doroodgar Jorshery, S.; Chandra, J.; Walia, A.; Sturniolo, A.; Corey, K.; Zekavat, S. M.; Zinzuwadia, A.; Patel, K.; Short, S.; Mega, J.; Plowman, S.; Pagidipati, N. J.; Sullivan, S.; Mahaffey, K.; Shah, S. H.; Hernandez, A. F.; Christiani, D.; Aerts, H.; Weiss, J.; Lu, M. T.; Raghu, V.

2024-11-15 radiology and imaging
10.1101/2024.11.14.24317055 medRxiv
Show abstract

BackgroundThis study assessed whether deep learning applied to routine outpatient chest X-rays (CXRs) can identify individuals at high risk for incident chronic obstructive pulmonary disease (COPD). MethodsUsing cancer screening trial data, we previously developed a convolutional neural network (CXR-Lung-Risk) to predict lung-related mortality from a CXR image. In this study, we externally validated CXR-Lung-Risk to predict incident COPD from routine CXRs. We identified outpatients without lung cancer, COPD, or emphysema who had a CXR taken from 2013-2014 at a Mass General Brigham site in Boston, Massachusetts. The primary outcome was 6-year incident COPD. Discrimination was assessed using AUC compared to the TargetCOPD clinical risk score. All analyses were stratified by smoking status. A secondary analysis was conducted in the Project Baseline Health Study (PBHS) to test associations between CXR-Lung-Risk with pulmonary function and protein abundance. FindingsThe primary analysis consisted of 12,550 ever-smokers (mean age 62{middle dot}4{+/-}6{middle dot}8 years, 48.9% male, 12.4% rate of 6-year COPD) and 15,298 never-smokers (mean age 63{middle dot}0{+/-}8{middle dot}1 years, 42.8% male, 3.8% rate of 6-year COPD). CXR-Lung-Risk had additive predictive value beyond the TargetCOPD score for 6-year incident COPD in both ever-smokers (CXR-Lung-Risk + TargetCOPD AUC: 0{middle dot}73 [95% CI: 0{middle dot}72-0{middle dot}74] vs. TargetCOPD alone AUC: 0{middle dot}66 [0{middle dot}65-0{middle dot}68], p<0{middle dot}01) and never-smokers (CXR-Lung-Risk + TargetCOPD AUC: 0{middle dot}70 [0{middle dot}67-0{middle dot}72] vs. TargetCOPD AUC: 0{middle dot}60 [0{middle dot}57-0{middle dot}62], p<0{middle dot}01). In secondary analyses of 2,097 individuals in the PBHS, CXR-Lung-Risk was associated with worse pulmonary function and with abundance of SCGB3A2 (secretoglobin family 3A member 2) and LYZ (lysozyme), proteins involved in pulmonary physiology. InterpretationIn external validation, a deep learning model applied to a routine CXR image identified individuals at high risk for incident COPD, beyond known risk factors. FundingThe Project Baseline Health Study and this analysis were funded by Verily Life Sciences, San Francisco, California. ClinicalTrials.gov IdentifierNCT03154346

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
The Lancet Digital Health
25 papers in training set
Top 0.1%
23.1%
2
Thorax
32 papers in training set
Top 0.1%
19.1%
3
Nature Communications
4913 papers in training set
Top 25%
7.0%
4
JAMA Network Open
127 papers in training set
Top 0.7%
4.1%
50% of probability mass above
5
European Respiratory Journal
54 papers in training set
Top 0.5%
3.7%
6
American Journal of Respiratory and Critical Care Medicine
39 papers in training set
Top 0.2%
3.7%
7
BMC Medicine
163 papers in training set
Top 2%
3.2%
8
eBioMedicine
130 papers in training set
Top 0.5%
2.7%
9
PLOS ONE
4510 papers in training set
Top 45%
2.5%
10
Nature Medicine
117 papers in training set
Top 1%
2.4%
11
Scientific Reports
3102 papers in training set
Top 49%
2.1%
12
Journal of Internal Medicine
12 papers in training set
Top 0.1%
1.9%
13
BMJ Open Respiratory Research
32 papers in training set
Top 0.3%
1.7%
14
Respiratory Research
19 papers in training set
Top 0.2%
1.7%
15
npj Digital Medicine
97 papers in training set
Top 2%
1.7%
16
ERJ Open Research
44 papers in training set
Top 0.7%
0.8%
17
International Journal of Epidemiology
74 papers in training set
Top 3%
0.8%
18
Nicotine & Tobacco Research
11 papers in training set
Top 0.2%
0.8%
19
Science Translational Medicine
111 papers in training set
Top 6%
0.8%
20
Nature Machine Intelligence
61 papers in training set
Top 4%
0.7%
21
American Journal of Respiratory Cell and Molecular Biology
38 papers in training set
Top 0.8%
0.7%
22
Clinical Infectious Diseases
231 papers in training set
Top 5%
0.7%
23
Patterns
70 papers in training set
Top 3%
0.5%
24
Nutrients
64 papers in training set
Top 2%
0.5%
25
The Journal of Infectious Diseases
182 papers in training set
Top 6%
0.5%
26
EBioMedicine
39 papers in training set
Top 2%
0.5%