Back

Diagnostics

MDPI AG

Preprints posted in the last 30 days, ranked by how well they match Diagnostics's content profile, based on 48 papers previously published here. The average preprint has a 0.08% match score for this journal, so anything above that is already an above-average fit.

1
Detection of Pancreatic Cancer Using a Methylation-Specific PCR-Based Multi-Cancer Early Detection Test

Pham, H. T.; Bussey, K. J.; Oshiro, M. M.; Rounseville, M.; Moses, M.; Zulbaran-Rojas, A.; Nguyen, V.; Bernert, R. A.; Routh, J.; Watts, G.; Block, G. D.; Fisher, W. E.; Nelson, M. A.

2026-05-31 molecular biology 10.64898/2026.05.27.728292 medRxiv
Top 0.1%
22.5%
Show abstract

ContextPancreatic ductal adenocarcinoma (PDAC) is an aggressive malignancy often diagnosed at advanced stages due to the lack of early clinical symptoms. DNA methylation alterations arise early in PDAC tumorigenesis and may serve as promising biomarkers for blood-based cancer detection. ObjectiveTo evaluate the performance of EPISEEK, a laboratory-developed blood-based multi-cancer early detection (MCED) assay, for detecting PDAC across disease stages. DesignA retrospective cohort study included 97 patients with stage I-IV PDAC and 201 asymptomatic healthy controls. Sensitivity, specificity, area under the curve (AUC), and stage-specific performance were assessed. EPISEEK-MCED performance was also compared with CA 19-9 alone and in combination with CA 19-9. ResultsEPISEEK-MCED classified 65 of 97 PDAC cases as positive, corresponding to an observed sensitivity of 70.1% (95% CI, 60.3% - 78.3%) at 99.5% specificity. The assay demonstrated strong discrimination between PDAC cases and healthy controls, with an AUC of 0.916 (95% CI, 0.88 - 0.952). Sensitivity increased with advancing stage while remaining substantial in early-stage disease, measuring 53.6% for stage I and 65.1% for stage II PDAC, 100% for stage III and 94.7% for stage IV. Across stages, EPISEEK-MCED outperformed CA 19-9 alone, particularly in early-stage disease. Combined analysis of EPISEEK-MCED and CA 19-9 further improved detection performance, achieving sensitivity of 57.1% and 81.4% for stage I and II, respectively. ConclusionsEPISEEK-MCED demonstrated high specificity and sensitivity for PDAC detection across disease stages, including early-stage disease. Combining EPISEEK-MCED with CA19-9 further improved performance, supporting its clinical utility for PDAC detection.

2
Imaging-detected benign breast findings in a forensic autopsy cohort unselected for breast symptoms: descriptive results from the Sisyphus study

Sidiropoulou, Z.; Santos, C.

2026-05-12 radiology and imaging 10.64898/2026.05.07.26352434 medRxiv
Top 0.1%
18.1%
Show abstract

Rationale and ObjectivesPublished estimates of benign breast disease (BBD) are derived mainly from clinical, surgical, screening-recall, or reduction-mammoplasty series. Forensic autopsy cohorts can reduce referral and symptom-selection bias, although they are not necessarily representative of the whole living population. We describe imaging-detected benign breast findings in the Sisyphus forensic autopsy cohort. Materials and MethodsConsecutive medico-legal autopsies of individuals aged 40 years or older were prospectively evaluated over a multi-year period at a medico-legal autopsy service in Portugal. Bilateral breast specimens obtained by subcutaneous modified radical mastectomy were examined with specimen digital mammography and ultrasonography. Findings were classified according to BI-RADS terminology. Lesions requiring tissue diagnosis in the post-mortem protocol underwent wire-guided or direct excisional biopsy. Female cadavers were analysed as the primary cohort; male cadavers were analysed separately as an exploratory subgroup. Proportions are reported with exact 95% confidence intervals (CIs). ResultsThe cohort included 291 cadavers: 217 women and 74 men. Among female breast specimens, 236/434 were BI-RADS 1 (54.4%; 95% CI, 49.6-59.1), 189/434 were BI-RADS 2 (43.5%; 95% CI, 38.8-48.4), and 8/434 were protocol-sampled suspicious findings (1.8%; 95% CI, 0.8-3.6). At the cadaver level, 99/217 women had at least one benign imaging finding (45.6%; 95% CI, 38.9-52.5). Mammographic benign findings were present in 91/217 women (41.9%; 95% CI, 35.3-48.8), dominated by calcifications; ultrasonographic benign findings were present in 51/217 (23.5%; 95% CI, 18.0-29.7), most often simple cysts and duct ectasia. Plasma cell mastitis-pattern calcifications were observed in 8/217 women (3.7%; 95% CI, 1.6-7.1). Male benign findings were less frequent (9/74, 12.2%; 95% CI, 5.7-21.8) and were dominated by benign lymph-node variants. All nine protocol-sampled lesions were benign at histology. Clinical breast examination identified 5/8 protocol-sampled female lesions (62.5%; 95% CI, 24.5-91.5). ConclusionIn this forensic autopsy cohort unselected for breast symptoms, benign imaging findings were common in women aged 40 years or older and less frequent in men. The results provide descriptive post-mortem imaging reference data, but lesion-specific estimates, especially rare entities, should be interpreted with caution because of small numerators, the older age profile, limited clinical history, and the original cancer-focused design of the Sisyphus study.

3
Development of a Deep Learning Model Integrating CT Images and Blood Data for the Diagnosis of Acute Cholecystitis

HORAGUCHI, T.; Nomura, R.; Sakai, S. A.; Saito, N.; Kurihara, K.; Ohira, M.; Takaha, R.; Mitsui, N.; Yokoi, R.; Hatanaka, Y.; Hayashi, H.; Kuno, M.; Fukada, M.; Sato, Y.; Yasufuku, I.; Asai, R.; Bando, H.; Yamashita, R.; Matsuhashi, N.

2026-05-12 radiology and imaging 10.64898/2026.05.08.26352724 medRxiv
Top 0.1%
17.8%
Show abstract

PurposeIn this study, we aimed to develop and evaluate an artificial intelligence-based diagnostic model for the diagnosis of acute cholecystitis (AC) using non-contrast CT images and clinical data. Materials and MethodsThis retrospective study included 199 patients (100 AC, 99 non-AC) treated between January 2016 and December 2025 at a single center. Patients were randomly divided into training (n=139) and test (n=60) datasets. Three models were constructed: an imaging-based deep learning model, a clinical data-based machine learning model, and a hybrid machine learning model integrating deep learning-derived imaging features with clinical data. CT images were preprocessed, and gallbladder regions were segmented. Clinical variables included white blood cell counts and levels of C-reactive protein and liver function markers. Model performance was evaluated using accuracy, precision, recall, specificity, F1 score, and area under the receiver operating characteristic curve (AUC). Statistical comparisons were performed using Welchs t-test and Chi-square test. ResultsThe imaging-based model achieved accuracy 0.883, precision 0.848, recall 0.933, specificity 0.833, and AUC 0.916. The blood-based model achieved accuracy 0.917, precision 0.931, recall 0.900, specificity 0.933, and AUC 0.949. The hybrid model showed the highest performance, with accuracy 0.950, precision 0.909, recall 1.000, specificity 0.900, F1 score 0.952, and AUC 0.986. ConclusionA hybrid model integrating CT imaging and clinical data improved diagnostic performance for AC compared with single-modality models.

4
Automated identification of bolus types in modified barium swallow studies using deep learning: a preliminary study

Mao, S.; Sahli, A. J.; Buoy, S. N.; Hutcheson, C.; Gelabert, G. A.; Barbon, C. E. A.; Naser, M. A.; Fuller, C. D.; Brock, K. K.; Hutcheson, K. A.

2026-05-20 radiology and imaging 10.64898/2026.05.16.26353385 medRxiv
Top 0.1%
10.8%
Show abstract

Purpose: Modified Barium Swallow (MBS) studies utilize videofluoroscopy, a dynamic X-ray technique for evaluating swallowing anatomy and physiology. Each MBS exam typically includes multiple bolus trials, often involving different bolus consistencies. Accurate classification of bolus types is essential, as swallowing dynamics, aspiration risks, and residue levels vary with bolus consistency. In this preliminary study, we propose a deep learning-based approach for automated bolus type classification in MBS, aiming to provide a standardized and efficient framework for automated processing of swallowing assessments. Methods: A total of 206 patients (Mean +/- SD age: 60.24 +/- 9.02 years; 89.32% men) underwent MBS examinations, comprising 277 individual MBS studies. The dataset included 2,752 bolus-level video segments, categorized by bolus type as follows: 1,711 liquid (IDDSI 0-3, 62.17%), 521 pudding (IDDSI 4, 18.93%), and 520 solid boluses (IDDSI 7, cookie or cracker, 18.89%). To standardize variable video lengths for the data pipeline, each MBS video was temporally segmented into a fixed-length frame sequence, with shorter videos padded using static frames and longer videos randomly cropped to the target length. We employed an Inflated 3D convolutional neural network to develop the deep learning model. Results: Each video segment contained an average of 273.03 +/- 195.81 frames. On the independent test set, the deep learning model achieved an overall accuracy of 96.13%, and the macro F1-score was 95.05% in classifying food bolus types within MBS videos. Conclusions: The developed AI-based system demonstrated effective automated classification of food bolus types in MBS videos, representing an important step toward fully automated MBS analysis for swallowing efficiency assessment. The AI model reduces the reliance on manual labels, thereby promising to streamline clinical and research workflows.

5
Prospective Comparison of FDG PET, and Contrast-Enhanced MRI for Predicting Pathologic Response after Neoadjuvant Chemotherapy in Breast Cancer

Luo, Y.; Zhang, X.; Li, R.; Zeng, Y.; Zhao, Y.; Li, L.; Qian, B.; Xiao, Y.; Li, M.; Zhao, Y.; Xu, S.; Yang, Q.; Zhang, H.; Chen, H.; Lu, C.; Lan, X.; Liu, C.

2026-05-13 radiology and imaging 10.64898/2026.05.05.26352015 medRxiv
Top 0.1%
10.6%
Show abstract

Assessment of pathologic complete response (pCR) following neoadjuvant chemotherapy (NAC) remains an unmet clinical need in breast cancer. Fibroblast activation protein inhibitor (FAPI) PET targets the tumor microenvironment and may therefore enhance response evaluation after NAC. This study aimed to compare the performance of [68Ga]Ga-FAPI-04 PET, [18F]FDG PET, and contrast-enhanced MRI for predicting pathologic response after NAC in breast cancer, with separate analyses for primary breast lesions and axillary lymph nodes. MethodsIn this prospective single-center diagnostic accuracy study, women with biopsy-confirmed stage II-III breast cancer underwent baseline and post-therapy [68Ga]Ga-FAPI-04 PET/MRI, [18F]FDG PET/CT, and contrast-enhanced MRI before surgery. Quantitative PET parameters were evaluated for primary tumors and axillary lymph nodes. pCR was defined as ypT0/isN0. Significant variables identified in univariable analyses were further explored using least absolute shrinkage and selection operator (LASSO) analysis, and receiver-operating-characteristic (ROC) analysis was performed to assess diagnostic performance. Fibroblast activation protein expression was also assessed by immunohistochemistry in paired pre- and post-therapy tumor specimens from a subset of patients. ResultsTwenty-four patients completed the study protocol, yielding 25 primary lesions and 44 metastatic lymph nodes across 27 axillary compartments. Overall patient-level pCR was achieved in 13 of 24 patients (54.17%). The lesion-level pCR rate was 60.00% (15/25) for primary breast lesions, and the node-level pCR rate was 72.73% (32/44) for axillary lymph nodes. For primary tumor response, post-therapy [68Ga]Ga-FAPI-04 SUVmax showed the highest diagnostic performance (AUC, 0.84; sensitivity, 80.00%; specificity, 80.00%; accuracy, 80.00%), whereas the optimal [18F]FDG parameter was {Delta} TBR% (AUC, 0.747). For nodal response, post-therapy [68Ga]Ga-FAPI-04 SULmean showed the highest diagnostic performance (AUC, 0.89; sensitivity, 91.67%; specificity, 81.25%; accuracy, 84.09%) and was significantly different from the best [18F]FDG parameter ({Delta} SULmax%, AUC, 0.669) on DeLong testing (P < 0.05). MRI achieved AUCs of 0.733 for primary lesions and 0.770 for lymph nodes. Stromal FAP expression positively correlated with [68Ga]Ga-FAPI-04 SUVmax and was markedly reduced in lesions achieving pCR. ConclusionPost-therapy [68Ga]Ga-FAPI-04 PET may serve as a promising adjunctive imaging biomarker for predicting pathologic response after NAC in breast cancer, particularly for axillary nodal assessment. These findings suggest that FAPI PET may provide clinically relevant information for preoperative evaluation of residual disease burden, potentially contributing to more individualized surgical planning and treatment decision-making.

6
A Radiologic Masquerade: Camrelizumab-Associated Breast Lesions That Mimic Progression

Hu, Y.; Shui, Y.; Li, W.; Liang, J.; Song, Y.; Wang, M.; Zhang, F.; Zhang, M.; Wang, H.; Ji, L.; Li, M.; Wang, C.; Shao, N.; Kuang, X.; He, S.; Zhang, X.

2026-06-03 radiology and imaging 10.64898/2026.05.30.26353749 medRxiv
Top 0.1%
10.5%
Show abstract

Abstract Background Immune-related adverse events (irAEs) involving the breast remain rarely reported. Purpose To characterize clinical and imaging features of camrelizumab-associated breast lesions (CABLs). Materials and Methods This retrospective dual cohort study (October 2019 to February 2026) included 196 female patients. Cohort A comprised 180 non-breast cancer patients; Cohort B comprised 16 breast cancer patients receiving neoadjuvant camrelizumab. Baseline characteristics, treatment response, and CT/MRI features were compared between CABL-positive and CABL-negative groups using Mann-Whitney U and chi-square tests. Results CABLs developed in 34.4% (62/180) of Cohort A and 93.8% (15/16) of Cohort B. CABL-positive patients were younger (median 50.5 vs 54.5 years; P = 0.006) and more often premenopausal (46.8% vs 26.3%; P = 0.009). The objective response rate was relatively high among patients with positive lesions; in Group A, the disease progression rate was lower in the CABL-positive group than in the CABL-negative group (3.2% vs 17.8%), whilst in Group B, the pathological complete response rate was as high as 53.3% (8/15). On CT/MRI, CABLs were predominantly multiple (62.5%), with well-defined margins and unrestricted diffusion. The predominant time-intensity curve (TIC) pattern was washout (46.7%). Median time to onset was 2-3 cycles (the second MRI scan); most lesions disappeared (40.3%) and shrank (46.8%) during follow-up. ADC values of lesions were significantly higher than those of primary tumors (1.847+/-0.284 vs 0.976+/-0.055 x10[-3] mm[2]/s; P < 0.001). Histopathology of four lesions revealed lymphocytic infiltration and fibrosis without malignancy. Conclusion CABLs are benign reactive changes driven by multiple factors. Their recognition prevents misinterpretation as disease progression, thereby avoiding unnecessary treatment discontinuation or biopsy.

7
Impact of AI-Assisted Mammography Reading on Quality Indicators in the Czech Breast Cancer Screening Programme: A Retrospective Study

Veverkova, L.; Dolezalova, Z.; Marackova, V.; Mathew, E.; Urbankova, M.; Ambrozova, M.; Piskovsky, T.; Ngo, O.; Majek, O.

2026-05-26 oncology 10.64898/2026.05.25.26353869 medRxiv
Top 0.1%
10.4%
Show abstract

Objectives: The aim of mammographic screening is the early detection of invasive cancers. In the era of artificial intelligence (AI), this tool may improve diagnosis of earlier stages. The purpose of this study was to assess the impact on selected quality indicators retrospectively. Method: The data source was the Breast Cancer Screening Registry using data from one Screening Unit that currently uses AI routinely. The indicators of the cancer detection rate (CDR), further assessment rate (FAR), and recall rate (RR) in the year 2023, when AI was used, and the year 2022, without AI, in women aged 45-69 were compared. The statistical evaluation used the chi-square test and logistic regression adjusting for the effects of age, a woman's risk level, and the screening round at a 5% significance level. Results: In 2022, without AI, 4,034 women aged 45-69 were included, compared with 4,049 women in 2023 when AI was used. This study showed a non-significant increase in CDR from 5.0 breast cancers detected per 1,000 women (non-AI assessment) to 5.2 (AI-assisted assessment), p = 0.919; OR (95% CI): 1.034 (0.542-1.974), a significant decrease in the FAR from 5.2% to 3.9%, p < 0.001; OR (95% CI): 0.665 (0.529-0.836), and a decrease in RR from 2.4% to 1.9%, p = 0.083; OR (95% CI): 0.754 (0.548-1.037). Conclusion: AI has the potential to be a useful tool in the early detection of breast cancer by improving quality through a decrease in FAR and RR, while probably maintaining CDR.

8
AI-Based Coronary Artery Calcification on Non-contrast CT: Performance Across Calcium Scoring, Lung Cancer Screening, and Liver Transplant Candidate Cohorts

Ludwig, K. D.; Hatt, C. R.; Keith, L.; Matyga, A. W.; Te, H. S.; Landeras, L.; Chelala, L.; Patel, A. R.; Chung, J. H.

2026-05-15 radiology and imaging 10.64898/2026.05.12.26352904 medRxiv
Top 0.1%
8.7%
Show abstract

Objective: Coronary artery calcification (CAC) assessment for cardiovascular risk stratification is traditionally achieved using ECG-gated computed tomography (CT). Automated deep-learning (DL) algorithms may streamline opportunistic CAC detection and scoring, particularly on non-gated CT scans. This study evaluated the performance of a fully automated DL-based CAC scoring algorithm ("DL-CAC") against expert human scoring. Methods: The algorithm was trained on 1,260 chest CT scans from multiple databases to automatically identify coronary calcium, calculate Agatston scores, and assign a cardiovascular disease (CVD) risk classification. Performance was assessed on a holdout dataset (n=500) comprising ECG-gated calcium scoring CT scans and lung cancer screening non-gated chest CTs as well as in an external, independent CT dataset (n=129) from liver transplant candidates. Agreement with expert scoring was assessed using intraclass correlation coefficient (ICC) for Agatston scores and Cohen's {kappa} for CVD risk classification. Results: The algorithm demonstrated high agreement with expert scoring in the pooled calcium scoring and lung cancer screening cohorts, with an ICC of 0.947 for Agatston scores and {kappa} of 0.936 for CVD risk classification. For liver transplant candidates, the algorithm exhibited substantial agreement with expert scoring of non-gated CT scans ({kappa}=0.79) and a sensitivity of 90.4% and specificity of 96.4% in high-risk cases. Conclusion: These findings suggest that DL-based CAC scoring on non-gated CT scans may be a feasible alternative to traditional methods and could support opportunistic cardiovascular risk assessment in routine imaging. Further validation is warranted to assess clinical integration in broader practice settings.

9
Compact serum miRNA qPCR model for pancreatic cancer discrimination with independent and clinical validation

Yotsutsuji, S.; Kataoka, H.; Ando, T.; Inada, M.; Sugano, M.; Takada, M.; Esaki, M.; Kato, K.; Yamamoto, Y.; Sano, Y.

2026-05-14 cancer biology 10.64898/2026.05.11.724428 medRxiv
Top 0.1%
7.4%
Show abstract

BackgroundFor pancreatic cancer, practical blood-based tests for early detection and postoperative surveillance remain elusive. We sought to develop a qPCR-measurable serum microRNA (miRNA) panel that robustly discriminates pancreatic cancer from non-cancer controls and other malignancies. MethodsWe profiled 255 serum miRNAs in batch 1 (n=72) and selected 27 candidates. Candidates were refined in batch 2 (n=552) and cross-batch evaluation was performed with batch 3 (n=391) to derive a miRNA model. Independent validation used batch 4 (n=616). Clinical relevance was assessed in an independent clinical cohort of resection patients with samples obtained preoperatively and at 1 and 12 months postoperatively. ResultsThe miRNA model trained on batches 2 and 3 achieved an area under the curve (AUC) of 0.91 and 0.83 for pancreatic cancer versus non-cancer controls and non-cancer plus other cancers, respectively, when independently validated in batch 4. Stage-wise AUCs in batch 4 were 0.91 (I), 0.94 (II), 0.86 (III) and 0.90 (IV). In the clinical batch, the score decreased postoperatively (preoperative vs month 1; p<0.01) and was higher in recurrence than non-recurrence (p<0.001). ConclusionsThe developed compact miRNA qPCR assay discriminated pancreatic cancer across independent assay batches and showed clinical relevance for postoperative surveillance. Clinical Trial RegistrationNot applicable.

10
Comparative Study on Image Quality of Deep Learning and Adaptive Statistical Iterative Reconstruction-V in Thin Layer CT of liver Lesions

Yang, J.; Li, L.; Cao, J.; Zhang, J.

2026-05-26 radiology and imaging 10.64898/2026.05.23.26353923 medRxiv
Top 0.2%
6.8%
Show abstract

Objective:This study aims to compare the advantages and disadvantages of DLIR and adaptive statistical iterative reconstruction-V (ASIR-V) in thin-slice (2.5 mm) CT images of hepatic lesions characterized by high and low contrast. Additionally, the study seeks to determine the optimal DLIR strength for the evaluation of liver lesions. Methods:A retrospective analysis was performed on 90 patients who underwent abdominal contrast-enhanced CT scans. Group A comprised 48 patients with low-contrast lesions, while Group B included 42 patients with high-contrast lesions. The acquired images were reconstructed using post-processing DLIR at low (DLIR-L), medium (DLIR-M), and high (DLIR-H) strengths, all with a slice thickness of 2.5 mm (subgroups A1-A3, B1-B3). Furthermore, images were reconstructed with ASIR-V at 50% strength at slice thicknesses of 2.5 mm and 5 mm (subgroups A4/B4 and A5/B5, respectively). CT values and standard deviations (SD) of the liver and lesions were measured, and the corresponding signal-to-noise ratio (SNR) and contrast-to-noise ratio (CNR) were calculated. The edge rise slope (ERS) was determined using ImageJ software by measuring CT values along a line from the liver parenchyma to the lesion. Objective metrics were compared using one-way ANOVA, with independent samples t-tests applied for inter-group differences. Subjective scoring, which encompassed noise level, diagnostic confidence, and lesion margin delineation, was conducted by two radiologists, with differences analyzed using the Kappa test. Results: Objective evaluation revealed a progressive decrease in lesion SD and a progressive increase in SNR and CNR from subgroups A1/B1 to A3/B3. The SD of Group A2 decreased by 57.4% compared to A4, while the SNR and CNR of A2 icreased by 19.3% and 24.6% compared to A4. Although subgroup B2 had a lower SNR than B5, the difference was not statistically significant. SNR and CNR in B2 increased by 24.1% and 11.9%, respectively, compared to B4. ERS gradually decreased from A1/B1 to A3/B3. ERS values in A2 and B2 increased by 27.0% and 39.4%, respectively, relative to A5 and B5. Although A3 had a lower ERS than A1 and A2, all DLIR subgroups exhibited higher ERS than A5; similar trends were observed in Group B. Subjective evaluation indicated good inter-reader agreement (Kappa > 0.61, p < 0.05). As DLIR strength increased, noise scores rose progressively in both groups. However, noise in A2 and B2 was lower than in A4/A5 and B4/B5. Diagnostic confidence and lesion margin delineation scores were highest in A2 and B2, while all subjective scores were lowest in A5 and B5. Discussion: Most prior studies evaluated the liver, vessels, or confirmed that image quality can be guaranteed at low doses. However, there are few studies on specific individual lesions. Therefore, this study aims to investigate specific individual lesions. The details and detection rate were analyzed separately to confirm the clinical acceptability of 2.5-mm DLIR image in different contrast lesions. Conclusion: For both high- and low-contrast hepatic lesions, DLIR provides superior image quality compared to ASIR-V, with the 2.5mm DLIR-M setting being optimal. DLIR-M reduces image noise, improves spatial resolution, and produces images more suitable for diagnostic purposes.

11
Scan length as a major driver of CT radiation dose: a diagnostic reference level audit from Kosovo

Rudi, G.; Vula, F.; Bicaku, A.; Dedushi, K.; Ahmetgjekaj, I.

2026-05-17 radiology and imaging 10.64898/2026.05.12.26353024 medRxiv
Top 0.2%
6.5%
Show abstract

Computed tomography is the largest contributor to population radiation dose from medical imaging, yet no diagnostic reference levels (DRLs) have been published from Kosovo or the Western Balkans. This retrospective audit analyzed all CT examinations performed on a 128- slice scanner at the University Clinical Centre of Kosovo between January and March 2026. After exclusions, 1,535 acquisitions from 1,092 patients across nine examination categories were analyzed. Local DRLs were defined as the 75th percentile and compared against German (BfS 2022) and Turkish (Kahraman et al., 2024) reference values. Head CT (n = 590) demonstrated CTDIvol 4.7% below the BfS DRL yet scan length 98.5% above the orientation value (median 25.8 vs 13 cm). Abdomen-pelvis CTDIvol matched the BfS reference while scan length exceeded it by 28%. Coronary CTA showed CTDIvol +377%, consistent with retrospective ECG gating. Excess scan length, not CTDIvol, is the major driver of elevated dose at this institution. The identified excesses are correctable through technologist landmarking training, protocol review, and enabling iterative reconstruction.

12
Multimodal neuroimaging approach for cognitive impairment in Alzheimer disease

Gonzales, M.; Kang, X.; Adamson, M. M.; Chao, S. Z.; Yoon, B. C.

2026-06-06 radiology and imaging 10.64898/2026.06.04.26354924 medRxiv
Top 0.2%
6.4%
Show abstract

PURPOSE: Alzheimer disease (AD) is associated with cognitive impairment, brain atrophy, and elevated amyloid-beta and tau. The study aimed to characterize regional atrophy associated with elevated amyloid-beta and tau, as measured by [18F]florbetapir (FBP) and [18F]flortaucipir (FTP) positron emission tomography (PET), respectively, and determine whether combining PET and atrophy data improves the prediction of cognitive impairment. METHODS: Alzheimer Disease Neuroimaging Initiative data (n = 381) were retrospectively analyzed. PET results were correlated with cortical thickness, gray matter (GM) volumes, Mini-Mental State Examination, and Montreal Cognitive Assessment. Linear/logistic regression and area under the curve (AUC) were used to evaluate for significant correlations and compare performances in distinguishing cognitive impairment, respectively. RESULTS: Incremental loss of cortical thickness and GM volume was observed from FBP-/FTP- (n = 205) to single PET-positive (FBP+/FTP-, n = 133; FBP-/FTP+, n = 5) and FBP+/FTP+ (n = 38) groups, particularly in the temporal and parietal lobes. FBP+/FTP+ showed the most severe cortical thickness loss in the entorhinal cortex, temporal lobe GM atrophy, and cognitive impairment. Adding brain atrophy as the third variable resulted in higher odds ratios and improved AUCs for cognitive impairment, with FBP+/FTP+/temporal GM or entorhinal cortical atrophy+ demonstrating the strongest associations with cognitive impairment. CONCLUSION: A multimodal approach combining PET and MRI may help improve the assessment of cognitive impairment in AD.

13
Efficacy Validation of a Novel MRI-Based Whole-Body Rapid Bone Scan (WB-RBS) Strategy for Diagnosing Bone Metastases: A Prospective Trial

Wu, X.; Zhang, J.; He, Y.; Zhang, Y.; Kang, X.; Hu, W.; Li, Y.; Ma, H.; Wang, Y.; Song, Y.; Chen, X.; Huo, F.; Zhang, Y.; Yin, H.; Xi, Y.

2026-05-24 radiology and imaging 10.64898/2026.05.17.26352855 medRxiv
Top 0.2%
6.2%
Show abstract

Background: Traditional bone scintigraphy for detecting malignant bone metastases is limited by suboptimal accuracy and radiation exposure. Whole-body magnetic resonance imaging (WB-MRI), while an alternative, requires lengthy scan times and high patient compliance. Purpose: To develop a novel, rapid whole body bone screening (WB-RBS) MRI protocol and evaluate its diagnostic performance for bone metastasis detection. Materials and Methods: Patients with pathologically confirmed malignancies and healthy controls were prospectively enrolled. All participants underwent WB-RBS (acquisition time: about 10 min); patients additionally underwent WB-MRI (about 70 min). Three radiologists, blinded to clinical data, independently evaluated the images for bone metastases. A consensus expert diagnosis served as the reference standard to calculate the diagnostic performance of WB-RBS. Specificity was further assessed in the healthy control group. Results: Seventy patients and 19 healthy controls were included. WB-RBS demonstrated excellent inter-reader agreement at the patient level. Compared with the reference standard, WB-RBS achieved an accuracy of 77.1%-91.4% at the patient level and a slightly lower accuracy (70.6%-82.5%) at the lesion level. At diagnostic confidence thresholds 1-3, the correlations between WB-RBS ratings and the reference standard were statistically significant for both patient- and lesion-level analyses. Conclusion: WB-RBS showed favorable inter-reader agreement and high accuracy for bone metastasis screening at the patient level, while substantially reducing scan time and cost. Its rapid, radiation-free nature and high accessibility offer distinct clinical advantages, supporting its potential as an alternative screening tool to conventional bone scintigraphy.

14
Ejaculatory Function and Clinical Outcomes Following Robotic Aquablation for Prostatic Bladder Outflow Obstruction: A Retrospective Real-World Cohort Study Protocol

Shroff, D. E.; Newman, T.; Malde, S.; Martyn-Hemphill, C.

2026-05-30 urology 10.64898/2026.05.28.26354125 medRxiv
Top 0.2%
5.4%
Show abstract

Introduction Aquablation for surgical treatment of benign prostatic enlargement (BPE) causing bladder outflow obstruction (BOO) has demonstrated good functional outcomes, even for large glands, with high rates of ejaculatory preservation reported. This is a protocol for a study that aims to review real-world outcomes of ejaculatory preservation or restoration post-Aquablation in an unselected cohort and compare to published clinical trial outcomes. Methods Retrospective data will be collected from a prospectively maintained consecutive case series of patients who underwent Aquablation, in a single UK centre. The primary outcome is ejaculatory function subjectively reported by men post-operatively, and classified as: antegrade ejaculation, retrograde/low volume ejaculation, anejaculation or not sexually active. Secondary outcomes are International Prostate Symptom Severity (IPSS), Quality of Life (QoL) Score, post-void residual (PVR), and incontinence. Descriptive and comparative statistical tests will be performed. Conclusions This study will review real-world ejaculatory function and clinical outcomes following robotic Aquablation for prostatic bladder outflow obstruction and compare this to published clinical trial outcomes.

15
Performance of Vision-Language Models for Zero-Shot Lung Nodule Detection on Chest Radiographs

Nishio, M.; Matsuo, H.; Matsunaga, T.; Fujimoto, K.; Deperrois, N.; Nooralahzadeh, F.; Frauenfelder, T.; Krauthammer, M.; Murakami, T.

2026-06-03 radiology and imaging 10.64898/2026.05.31.26354565 medRxiv
Top 0.2%
5.0%
Show abstract

Background and Objectives: The ability of vision-language models (VLMs) to detect lung nodules on chest radiographs remains uncertain. This retrospective study aimed to compare the zero-shot performances of six VLMs for lung nodule detection using data from the Japanese Society of Radiological Technology (JSRT) chest radiograph database. Methods: A total of 247 chest radiographs from the JSRT database (154 with nodules and 93 without) were preprocessed and evaluated using six VLMs: RadVLM, gpt-4o-mini, Qwen3-VL-8B-Instruct, MedGemma-4b-it, LLaVA-Rad, and CheXpert Plus Model. Each model was tested using a zero-shot setting. The text outputs were binarized into nodule-present or nodule-absent labels by consensus between the two radiologists. Sensitivity, specificity, accuracy, precision, and F1 scores were calculated. Pairwise differences in sensitivity, specificity, and accuracy were assessed using McNemar test with Holm correction. Results: The overall performance was limited across all models. RadVLM achieved the highest accuracy (44.5%, 110/247) with perfect specificity (100.0%, 93/93) and precision (100.0%); however, its sensitivity was low (11.0%, 17/154). LLaVA-Rad showed the highest sensitivity (27.3%, 42/154) and F1 score (37.7%), but lower specificity (71.0%, 66/93). MedGemma-4b-it achieved 100.0% specificity, with a sensitivity of only 5.2% (8/154). Grade-specific analysis showed that detection rates were highest for obvious nodules and remained limited for subtle nodules. Pairwise analyses revealed significant differences in sensitivity and specificity for the selected model pairs, particularly between RadVLM and LLaVA-Rad. Conclusion: Current VLMs show limited zero-shot generalizability for lung nodule detection in the JSRT database, with marked trade-offs between sensitivity and specificity. Their near-term value may lie more in radiologist-assisted workflows than in stand-alone detection. Clinical Impact: Current VLMs should not be used as stand-alone tools for lung nodule detection on chest radiographs because of their limited sensitivity and substantial model-dependent trade-offs. However, their high-specificity outputs in some models and higher-sensitivity behavior in others suggest potential roles in radiologist-assisted workflows, such as report drafting and second-reader support.

16
Design and methodology of a randomized clinical trial of prolonged daily antibiotic suppression with and without fulguration for uncomplicated recurrent urinary tract infections in women

Zimmern, P. E.; Souders, C.; Prokesch, B. C.; Lutz, K.; De Nisco, N. J.

2026-05-14 urology 10.64898/2026.05.11.26352945 medRxiv
Top 0.2%
5.0%
Show abstract

ObjectiveRecurrent urinary tract infections (rUTIs) significantly decrease quality of life and antibiotics are becoming increasingly less effective due to antimicrobial resistance. Alternative effective treatment strategies are urgently needed for rUTIs. Prior studies have indicated that women can experience resolved or improved rUTI following electrofulguration (EF). To further investigate these findings, we report on the design and methodology behind a randomized trial examining two treatment arms: standard prolonged antibiotic treatment with nitrofurantoin (NF) alone or in combination with EF. Patients and MethodsThe aim of this randomized trial is to determine, at two institutions, the efficacy of two interventions for rUTI associated with early stages of chronic cystitis (stages 1 and 2): conventional 6 months low-dose (100mg) NF daily antibiotic suppression alone (NF) or conventional NF with EF (EF + NF). The study is also designed to analyze changes in the urinary microbiomes in the two different treatment arms and to determine the durability of clinical outcomes in both treatment arms at 2 years after the end of each intervention. The primary outcomes will be obtained from 6 to 18 months, as well as 18 - 30 months following completion of the original 6-month intervention. Failure is defined based on UTI symptoms documented by a validated questionnaire with a documented urine culture confirming a bacterial strain at each UTI episode following the end of the 6-month intervention. ConclusionsThis randomized trial is designed to examine the efficacy and durability of treating women with rUTIs using the standard of care of NF alone, or an EF procedure with NF.

17
Economic costing of evaluating, deploying and monitoring an artificial intelligence-based reconstruction for acceleration of rectal MRI examinations

Harrison, C. A.; Wu, M.; White, O.; Hopkinson, G.; Hughes, J.; Robertson, S.; Scurr, E.; Shur, J.; Castagnoli, F.; Charles-Edwards, G.; Koh, D.-M.; Winfield, J.

2026-05-21 radiology and imaging 10.64898/2026.05.18.26353474 medRxiv
Top 0.2%
5.0%
Show abstract

Objectives: AI-based reconstructions can reduce MRI acquisition times and/or improve image quality. Guidelines recommend clinical evaluations and post-deployment monitoring of these novel methods, however, there has been little investigation of the clinical resources required for such assessments. The aim of this study was to evaluate the healthcare resource utilisation and potential savings associated with AI-based reconstructions in rectal MRI. Methods: A retrospective economic costing analysis was conducted from the NHS healthcare perspective. Resource utilisation data were extracted from the Electronic Patient Records for 9 healthy volunteer scans and 104 rectal MRI examinations evaluating an AI-based reconstruction. The resource profile included the MRI scan and the staff time required for data acquisition and analysis. Results: The clinical evaluation of the AI-based reconstruction cost {pound}15,023. Deployment of the AI-based reconstruction reduced the length of an MRI rectum scan by 22 minutes, theoretically saving approximately {pound}3,437 per month. Addition of post-deployment quality control scans reduced this monthly saving to {pound}2,636. If the quality control scans were evaluated using radiologists rather than image quality metrics, monthly savings would be approximately {pound}2,541. With ongoing quality control, the clinical evaluation cost would be recouped between 5.8 and 6 months, compared with 4.4 months without ongoing quality control. Conclusions: Deploying AI-based reconstructions can yield cost savings through reduced scanning times. Quality control tests using image quality metrics would save radiological burden and reduce costs compared with conducting repeated image scoring by radiologists.

18
Consensus-based technical recommendations for clinical translation of renal Dynamic Contrast-Enhanced (DCE) MRI

Gunwhy, E. R.; Kurugol, S.; Serai, S.; van der Molen, A. J.; Abou El-Ghar, M.; Buckley, D. L.; Hockings, P. D.; Jones, R. A.; Lim, R. P.; Mendichovszky, I. A.; Pedersen, M.; Reynolds, H. M.; Sanmiguel Serpa, L. C.; Wentland, A.; Zoellner, F. G.; Sourbron, S.; Dekkers, I. A.

2026-05-14 radiology and imaging 10.64898/2026.05.11.26352525 medRxiv
Top 0.3%
4.9%
Show abstract

BackgroundDynamic contrast-enhanced (DCE) MRI has the potential to be a useful tool for non-invasively assessing renal haemodynamics and function, however insufficient standardisation and difficulties in post-processing remain barriers to clinical translation. PurposeTo develop expert consensus-based technical recommendations for performing renal DCE-MRI in humans, relating to aspects of patient preparation, MRI hardware and acquisition parameters, and data analysis. Study TypeSystematic consensus process using an approximation to the two-step modified Delphi method. PopulationNot applicable. Field Strength / Sequence1.5 T and 3 T / Renal gradient echo-based 3D DCE-MRI. AssessmentAn international panel of experts were recruited and surveyed following a modified Delphi method to create consensus-based technical recommendations. Key areas for consensus were initially identified through a mixture of online and in-person discussions, and an initial survey round consisting of open- and close-ended questions. Consensus statements were formulated and iteratively refined to create the final recommendations. Statistical TestsConsensus was defined as [&ge;] 75% agreement in response (excluding abstentions), and clear preference was defined as [60-74]% agreement among the experts. Statements with [&ge;]40% abstentions were either excluded from subsequent survey rounds or recirculated as a modified statement. Results22 experts initially participated in the Delphi panel, of which 16 responded to the first survey. 15 panellists responded to all subsequent surveys. Out of 46 statements, 37 reached consensus and one showed clear preference. [&ge;]40% abstention was found in seven statements which were excluded from the final set of recommendations. Data conclusionThese recommendations provide a starting point for MRI centres worldwide wishing to perform renal DCE-MRI, contributing to the harmonisation of DCE-MRI scan protocols and facilitating clinical translation. These recommendations provide a practical minimum technical dataset for renal DCE-MRI acquisition and analysis to improve cross-site comparability and support responsible clinical translation.

19
Assessment of the accuracy of lung lesions diagnosis in adolescents with osteosarcoma using artificial intelligence

Uskova, N. G.; Gombolevskiy, V. A.; Chernina, V. Y.; Burenchev, D. V.; Akhaladze, D. G.; Panina, E. V.; Karachunskiy, A. I.; Tereschenko, G. V.; Goncharov, M. Y.; Soboleva, E. A.; Konopleva, E. I.; Bydanov, O. I.; Plekhov, S. Y.; Grachev, N. S.

2026-06-10 radiology and imaging 10.64898/2026.06.08.26354011 medRxiv
Top 0.3%
4.4%
Show abstract

Background. Lung metastases in osteosarcoma (OS) are the main cause of the death. The accuracy of the diagnosis of nodules by computed tomography (CT) of the lungs is critically important for determining the disseminated stage of the disease and planning surgical treatment. The use of artificial intelligence (AI) in the search for lung nodules increases the accuracy of diagnosis and reduces the chance of missing metastases. Objective: to evaluate the accuracy of lung nodules diagnosis in adolescents with OS using AI. Methods. A retrospective assessment of CT scans of adolescents with OS was performed. A pathological nodule with an average size of [&ge;]4 mm was considered a target finding. The diagnostic accuracy of an AI algorithm previously trained on an adult dataset was evaluated, and the number of false positives (FP) and false negatives (FN) was determined. Sensitivity, specificity, accuracy, area under the ROC curve (AUC), positive predictive value, negative predictive value, and F1-measure were calculated. Based on the obtained results, the effectiveness of the algorithm was assessed. Results. 248 CT scans of adolescents with OS were evaluated. The following results were obtained: in 5 cases, the AI algorithm showed a FP result (2.02%), in 34 cases, it showed a FN result (13.71%), and in 209 cases, a correct result (both true positive and true negative) (84.27%). The diagnostic accuracy of the algorithm was 0.843 (95% CI 0.794-0.887). The application of the AI algorithm in the practice of an X-ray doctor in a specific clinical task would allow to increase the sensitivity from 0.805 to 0.891, while ensuring an absolute decrease in the number of FN results by 8.59% and a relative decrease by 44%. Conclusion. The obtained results confirm the practical value of the application of the AI algorithm and justify the implementation of AI-assisted systems in the diagnostic protocols for lung metastases in adolescents with OS.

20
An Automated CT-derived Marker of Renal Tumor Complexity: The CLARITY Score

Jonnalagadda, R.; Patel, S. H.; Abusafieh, H. T.; Seshadri, R.; Jevnikar, D.; Younis, S.; Al-Bayati, A.; Saputro, N.; Knorr, J.; Wang, B.; Ozery-Flato, M.; Rosen-Zvi, M.; Abouassaly, R.; Remer, E.; Heller, N.; Weight, C.

2026-05-12 urology 10.64898/2026.05.08.26352647 medRxiv
Top 0.3%
4.3%
Show abstract

Background and ObjectiveSurgical complexity for renal tumors has traditionally been assessed using manual nephrometry scores, which require unreimbursed physician effort and are subject to interobserver variability. This study introduces an objective, fully automated alternative derived from decades of experience at a large academic center. MethodsWe trained a CT classification model to predict whether a patient would ultimately undergo Partial or Radical Nephrectomy (PN or RN). We hypothesized that the models confidence in RN (termed the CLARITY score) would serve as a surrogate for the difficulty of nephron-sparing approaches and thus for tumor complexity. This hypothesis was tested using multivariate logistic regression for failure to achieve trifecta, estimated blood loss (EBL) [&ge;] 500 mL, and length of stay [&ge;] 3 d. CLARITY was compared with tumor size and R.E.N.A.L. score. External validation in a geographically distinct cohort was performed. Key Findings and LimitationsFor predicting RN, CLARITY achieved an AUROC of 0.899 internally and 0.898 externally. In the external PN subgroup, it outperformed tumor size and R.E.N.A.L. score in predicting failure to achieve trifecta (AUROC 0.613), EBL [&ge;] 500 mL (0.727), and length of stay [&ge;] 3 d (0.673). In multivariable analysis, CLARITY remained associated with each outcome, whereas R.E.N.A.L. and size were not. This study is limited by its retrospective design. Conclusions and Clinical ImplicationsCLARITY is an automated CT-derived marker that quantifies renal tumor complexity more effectively than tumor size and R.E.N.A.L. score and may support scalable, objective preoperative complexity assessment. To support reproducibility and external validation, we have released a public inference pipeline and web-based DICOM upload portal for research use.