Back

Diagnostics

MDPI AG

Preprints posted in the last 7 days, ranked by how well they match Diagnostics's content profile, based on 48 papers previously published here. The average preprint has a 0.08% match score for this journal, so anything above that is already an above-average fit.

1
Multimodal neuroimaging approach for cognitive impairment in Alzheimer disease

Gonzales, M.; Kang, X.; Adamson, M. M.; Chao, S. Z.; Yoon, B. C.

2026-06-06 radiology and imaging 10.64898/2026.06.04.26354924 medRxiv
Top 0.2%
6.4%
Show abstract

PURPOSE: Alzheimer disease (AD) is associated with cognitive impairment, brain atrophy, and elevated amyloid-beta and tau. The study aimed to characterize regional atrophy associated with elevated amyloid-beta and tau, as measured by [18F]florbetapir (FBP) and [18F]flortaucipir (FTP) positron emission tomography (PET), respectively, and determine whether combining PET and atrophy data improves the prediction of cognitive impairment. METHODS: Alzheimer Disease Neuroimaging Initiative data (n = 381) were retrospectively analyzed. PET results were correlated with cortical thickness, gray matter (GM) volumes, Mini-Mental State Examination, and Montreal Cognitive Assessment. Linear/logistic regression and area under the curve (AUC) were used to evaluate for significant correlations and compare performances in distinguishing cognitive impairment, respectively. RESULTS: Incremental loss of cortical thickness and GM volume was observed from FBP-/FTP- (n = 205) to single PET-positive (FBP+/FTP-, n = 133; FBP-/FTP+, n = 5) and FBP+/FTP+ (n = 38) groups, particularly in the temporal and parietal lobes. FBP+/FTP+ showed the most severe cortical thickness loss in the entorhinal cortex, temporal lobe GM atrophy, and cognitive impairment. Adding brain atrophy as the third variable resulted in higher odds ratios and improved AUCs for cognitive impairment, with FBP+/FTP+/temporal GM or entorhinal cortical atrophy+ demonstrating the strongest associations with cognitive impairment. CONCLUSION: A multimodal approach combining PET and MRI may help improve the assessment of cognitive impairment in AD.

2
Assessment of the accuracy of lung lesions diagnosis in adolescents with osteosarcoma using artificial intelligence

Uskova, N. G.; Gombolevskiy, V. A.; Chernina, V. Y.; Burenchev, D. V.; Akhaladze, D. G.; Panina, E. V.; Karachunskiy, A. I.; Tereschenko, G. V.; Goncharov, M. Y.; Soboleva, E. A.; Konopleva, E. I.; Bydanov, O. I.; Plekhov, S. Y.; Grachev, N. S.

2026-06-10 radiology and imaging 10.64898/2026.06.08.26354011 medRxiv
Top 0.3%
4.4%
Show abstract

Background. Lung metastases in osteosarcoma (OS) are the main cause of the death. The accuracy of the diagnosis of nodules by computed tomography (CT) of the lungs is critically important for determining the disseminated stage of the disease and planning surgical treatment. The use of artificial intelligence (AI) in the search for lung nodules increases the accuracy of diagnosis and reduces the chance of missing metastases. Objective: to evaluate the accuracy of lung nodules diagnosis in adolescents with OS using AI. Methods. A retrospective assessment of CT scans of adolescents with OS was performed. A pathological nodule with an average size of [≥]4 mm was considered a target finding. The diagnostic accuracy of an AI algorithm previously trained on an adult dataset was evaluated, and the number of false positives (FP) and false negatives (FN) was determined. Sensitivity, specificity, accuracy, area under the ROC curve (AUC), positive predictive value, negative predictive value, and F1-measure were calculated. Based on the obtained results, the effectiveness of the algorithm was assessed. Results. 248 CT scans of adolescents with OS were evaluated. The following results were obtained: in 5 cases, the AI algorithm showed a FP result (2.02%), in 34 cases, it showed a FN result (13.71%), and in 209 cases, a correct result (both true positive and true negative) (84.27%). The diagnostic accuracy of the algorithm was 0.843 (95% CI 0.794-0.887). The application of the AI algorithm in the practice of an X-ray doctor in a specific clinical task would allow to increase the sensitivity from 0.805 to 0.891, while ensuring an absolute decrease in the number of FN results by 8.59% and a relative decrease by 44%. Conclusion. The obtained results confirm the practical value of the application of the AI algorithm and justify the implementation of AI-assisted systems in the diagnostic protocols for lung metastases in adolescents with OS.

3
Understanding Human AI Discrepancy in Breast Cancer TIL Assessment: A Multi-Rater and Perceptual Bias Study

Capar, A.; Aloglu, I.; Aker, F.; Ertano, M.; Mese, Y. E.; Ungor, A.; Yildiz, B. E.

2026-06-04 pathology 10.64898/2026.05.29.26354196 medRxiv
Top 0.4%
4.0%
Show abstract

Objective: Tumor-infiltrating lymphocytes (TILs) in breast cancer are one of the most important indicators of the immune response within the tumor microenvironment. They play a particularly significant prognostic and predictive role in triple-negative and HER2-positive subtypes. However, substantial inter-observer variability has been reported in TIL scoring among pathologists, which limits its reliability in clinical practice. The aim of this study was to evaluate the agreement between artificial intelligence (AI) models and pathologists in TIL scoring and to compare this agreement using different statistical approaches, thereby assessing the potential of AI integration into pathology practice. Materials and Methods: Digitized histopathological images of breast cancer cases were included in the study. Tumor regions annotated by pathologists were evaluated for both stromal TIL percentage and the proportion of stromal tumor area within each ROI, with assessments performed independently by three pathologists and two AI models. Agreement was assessed among pathologists, between pathologists and AI, and between AI models. Statistical analyses included intraclass correlation coefficient (ICC), Cohen and Fleiss kappa, correlation tests, and Bland-Altman analysis. In addition, categorical agreement was examined using different cut-off values. Results: Inter-pathologist agreement was high, with an ICC of 0.81. In contrast, the global agreement between pathologists and AI models was lower (ICC 0.41). Pairwise comparisons of pathologist-AI agreement yielded substantially lower ICC values (0.12-0.21), although this improved to 0.53 when three pathologists were assessed jointly with a single AI model. The strongest categorical agreement was observed with dichotomized TIL scores ([≤]10% vs. >10%), whereas multi-category classifications were associated with a marked reduction in kappa values. Spearman correlation coefficients between pathologists and AI models ranged from moderate to good ({rho} = 0.48-0.81). Agreement between the two AI models themselves was moderate, with an ICC of 0.64

4
Incremental Clinical Value of Single-Molecule Nanopore Sequencing in Thalassemia Testing: A Prospective Double-blind, Multicenter Study

Xiang, J.; Zhu, B.; Xu, H.; Chen, Y.; Sun, X.; xiang, r.; Zhao, Y.; Liu, W.; Zhang, L.; He, J.; liu, j.; Chen, Y.; Fan, Z.; Zhang, H.; Tan, J.; Pang, L.; Shi, L.; Kong, Y.; Cai, A.

2026-06-09 hematology 10.64898/2026.06.09.26354559 medRxiv
Top 0.4%
3.8%
Show abstract

Background Thalassemia is one of the most common monogenic disorders worldwide, current screening strategies combining hematological testing with molecular assays still carry a risk of missed diagnoses and undesirable efficiency, particularly for complex structural variants and rare mutations. Methods In this prospective double-blind, multicenter cohort study of 3,842 participants (3,362 pregnant women and 480 male partners), we conducted a head-to-head comparison to systematically evaluate the incremental clinical value and detection performance of single-molecule nanopore sequencing in thalassemia (SMITH) against conventional hematological testing and next-generation sequencing (NGS). Findings The overall concordance rate between NGS and SMITH was 98.6% (3789/3842). The discrepant cases (n=53) were directly attributed to the superior detection capabilities of SMITH, which successfully identified complex structural rearrangements-including 45 -globin gene triplications and four HK alleles-that were missed by NGS. Furthermore, SMITH accurately detected four rare variants (c.134_135insT/, c.-22(C>T)/, {beta}N/{beta}c.316-290delinsAGGGCAATAATTT and {beta}3.5 kb deletion/{beta}N ) and resolved ten trans and three cis configurations within the globin gene allele. Clinically, these technical advantages translated to a 9.3% (5/54) increase in the detection rate of high-risk prenatal couples, effectively preventing one birth affected by moderate-to-severe thalassemia. Additionally, SMITH corrected a diagnostic discrepancy in one case (HK vs. -3.7), sparing the couple from an unnecessary invasive procedure. Interpretation Our findings demonstrate that SMITH provides a powerful platform for resolving globin gene rearrangements, detecting rare variants, and enabling direct haplotype phasing. By effectively eliminating diagnostic blind spots, SMITH is expected to become an optimal method for thalassemia prevention programs. Funding This study was supported by Chinese National Natural Science Foundation Projects 81760037 and 82271894.

5
Minimally Invasive Aortic Root Surgery Without Sternotomy: Clinical and Quality-of-Life Benefits of a Totally Endoscopic Approach

Hamiko, M.; Salamate, S.; Bayram, A.; Piekarski, F.; Rogaczewski, J.; Eghbalzadeh, K.; Silaschi, M.; Kruse, J.; El-Sayed Ahmad, A.; Bakhtiary, F.

2026-06-08 cardiovascular medicine 10.64898/2026.06.06.26354391 medRxiv
Top 0.6%
2.5%
Show abstract

Background Totally endoscopic aortic root (AR) surgery via right anterior minithoracotomy (RAMT) may reduce surgical trauma and accelerate recovery compared with full sternotomy (FS). However, the approach is technically demanding due to limited access and anatomical complexity. This study compares early clinical outcomes and quality of life (QoL) after RAMT versus FS to evaluate the feasibility and safety of the totally endoscopic approach. Methods This single-center, retrospective study included 149 patients underwent AR surgery via RAMT (n=74) or FS (n=75) between January 2021 and March 2026. Patients with aortic dissection, infective endocarditis, redo surgery, concomitant procedures, or arch replacement were excluded. Operative outcomes, postoperative recovery, 30-day and 1-year mortality were analyzed. QoL was assessed using the Short Form-8 (SF-8) questionnaire. Results The median age was 60.0 years, and 79.9% of patients were male. Bentall procedure was performed in 84.6% of patients, 15.4% underwent a David procedure. Compared with FS-AR, RAMT-AR was associated with shorter median operative time (147.0 vs. 178.0 min; p<0.001), lower median chest drainage volume (650.0 vs. 850.0 mL; p<0.001), and shorter median ICU stay (24.0 vs. 25.0 h; p=0.008) and hospital stay (6.0 vs. 8.0 days; p=0.028). Overall, 30-day and 1-year mortality was 0.7%. SF-8 analysis demonstrated significantly higher physical and mental component scores in RAMT-AR patients. Conclusion In specialized centers, totally endoscopic AR surgery via RAMT is a safe and feasible minimally invasive approach associated with favorable early outcomes and a potential benefit in postoperative physical and mental QoL by reducing surgical trauma.

6
Epidemiology of Cervical Precancerous Lesions: Prevalence and Predictors from Pap Smear Screening in Hawassa City Hospitals, Sidama Region, Ethiopia. Institutional-Based Cross-sectional Study

Fisshatsion, A. B.; Zewude, Y. A.; Nisro, A. M.; Abebe, R. F.

2026-06-10 public and global health 10.64898/2026.06.09.26355254 medRxiv
Top 0.8%
1.9%
Show abstract

Background: Cervical cancer is the fourth most common cancer in women worldwide and remains a major public health challenge. In Ethiopia, it is the second leading cause of cancer deaths, with around 8,000 new cases and 6,000 deaths each year. Region?specific data on the prevalence and predictors of precancerous lesions remain scarce, yet such information is vital for guiding targeted reproductive health strategies. This study therefore examined the prevalence and predictors of cervical precancerous lesions among women aged 21-60 years undergoing Pap smear screening in public hospitals in Hawassa City, Sidama Region. Methods: An institution-based cross-sectional study was conducted among 241 women attending Pap smear screening at public hospitals in Hawassa City from March to August 2025. Sociodemographic and clinical data were collected via interviews and medical records. Lesions were classified based on the standardized international framework for reporting cervical cytology results from Pap smears per the Bethesda system. Multivariable logistic regression identified predictors p<0.05). Result: Of 241 women screened (mean age 35.3 years), cervical epithelial abnormalities were detected in 52 (prevalence 21.6%). Atypical squamous cells of undetermined significance was the most common abnormality (16.6%). Multivariable analysis showed HIV infection was significantly associated with precancerous lesions (AOR = 3.7, 95% CI: 1.69-8.12, p<0.05), while hormonal contraceptive use was protective (AOR = 0.27, 95% CI: 0.11-0.67, p<0.05). Conclusion: These results underscore the urgent need to strengthen cervical cancer prevention through targeted screening and early intervention. Integrating routine HIV testing with Pap smear programs would be especially valuable. Health authorities should expand accessible screening for women aged 21-60, with particular attention to those living with HIV, to help reduce the burden of precancerous lesions.

7
Conus Medullaris Position in 9,808 Pediatric Lumbosacral MRI Examinations: A Large-Cohort Reference Distribution and the Normally Positioned Conus in Surgically Treated Tethered Cord

Tang, W.; Dong, Y.; Chen, J.; Yang, Y.; Huang, H.; Yu, M.; Zhu, J.; Shen, G.

2026-06-08 radiology and imaging 10.64898/2026.06.06.26355031 medRxiv
Top 0.8%
1.9%
Show abstract

Background. Tethered cord syndrome (TCS) is classically associated with a low-lying conus medullaris, yet many surgically treated children have a normally positioned conus (occult TCS). Large-scale normative data on conus position in children, and the diagnostic value of quantitative conus assessment, are limited. Purpose. To establish a large-cohort reference distribution for conus medullaris termination level in children, to quantify conus position in children surgically treated for presumed (occult) TCS, and to test whether automated conus segmentation and radiomics can distinguish TCS from normal. Materials and Methods. In this retrospective single-center study, conus termination level was extracted from structured radiology reports of consecutive pediatric lumbosacral MRI examinations and encoded numerically (L1 = 1, L2 = 2, etc.). Children surgically treated for tethered cord were identified by linkage to an operative registry (name and date of birth) and restricted to preoperative examinations. A deep-learning model (nnU-Net) was trained for conus segmentation on axial T2-weighted images. IBSI-compliant radiomic features were extracted; reproducibility was assessed by intra- and inter-observer intraclass correlation (ICC). A case-control radiomics analysis used batch-only ComBat harmonization and cross-validated L1-penalized logistic regression; discrimination was compared with conus level by paired bootstrap. Results. Among 9,808 examinations with a parseable conus level (98.5% of reports; parser validated against dual blinded annotation, 99.4% agreement, weighted kappa 0.946), the conus terminated in the L1 region in 85.7% and the L2 region in 14.3% of the reference cohort (postoperative examinations excluded, n = 9,655); a low-lying conus (>=L3) occurred in only 0.05% (5/9,655), and remained rare (0.14%, 14/9,808) including operated examinations (median L1; mean 1.13 +/- 0.33). A slightly more cephalad position was seen with increasing age (negligible correlation). Among 475 preoperative children surgically treated for tethered cord, 99.6% had a normally positioned conus (<=L2) and only 0.4% were low-lying. Automated conus segmentation achieved a held-out Dice of 0.85. Conus radiomics likewise did not distinguish TCS from controls (equivalence-tested null; full segmentation/radiomics pipeline reported in the companion methodological paper). Conclusion. In children, the conus medullaris terminates at L1-L2 in more than 99% of cases and is normally positioned in virtually all children surgically treated for TCS. Within the conus, neither position nor texture (radiomics) identifies tethered cord; whether the filum terminale carries a diagnostic signal was not tested here.

8
Development of a Novel Blood-Based Assay for Brain-Derived Tau and Its Validation in Traumatic Brain Injury

Balogun, W. G.; Zeng, X.; Nafash, M. N.; Sehrawat, A.; Shi, R.; Svirsky, S. E.; Okonkwo, D. O.; Puccio, A. M.; Karikari, T. K.

2026-06-10 neurology 10.64898/2026.06.05.26354965 medRxiv
Top 0.9%
1.8%
Show abstract

Brain-derived tau (BD-tau) is an emerging blood-based biomarker for neurodegeneration, yet there are currently limited well validated BD-tau assays available for research and clinical use. To enhance access to this vital biomarker for neurological disorders including traumatic brain injury (TBI), we developed a novel blood-based immunoassay for BD-tau on the ultra-sensitive Quanterix HD-X platform using Single Molecule Array technology. Analytical validation assessed dilution linearity, specificity, precision, detection limits, and spike recovery, each recording robust metrics in agreement with international expert recommendations. The assay demonstrated robust validation metrics, achieving between-run stability of 95% when analyzing aliquots from six independent plasma and serum samples across five analytical runs. It also showed strong dilution linearity when diluted four-fold and achieved over 90% recovery when spiked with cerebrospinal fluid. Next, we evaluated the clinical utility of the assay in cohorts of individuals with traumatic brain injury (TBI), where strong performances were recorded whether using the 2-step or 3-step assay formats ({rho}= 0.94; p < 0.0001). Furthermore, plasma BD-tau distinguished samples from TBI patients based on time from injury and severity (AUC=0.93). Plasma BD-tau differentiated between favorable and unfavorable functional outcomes in the acute-severe group. Our findings underscore the significant potential of the BD-tau assay as a biomarker for TBI in the severe phase.

9
Title: Development of a Human Papillomavirus genotype-informed risk-stratification model to improve Cervical Cancer screening in resource-limited settings: a cross-sectional study

Kambou Kountchou, K. D. K. K.; Tommo Tchouaket, M. C.; Moko Fotso, L. G.; Fokou Bomgning, B. N.; Fippo Fitime, L.; Talom Teumadjou, A.; Routoube, M.; Efakika Gabisa, J.; Ngoufack Jagni Semengue, E.; Nka, A. D.; Kae, A. C.; Dobgima Pisoh, W.; Deutou, L.; Takou, D.; Fainguem, N.; Sosso, S. M.; Kamgaing Simo, R.; Yagai, B.; Tabola Fossa, L.; Perno, C.-F.; Colizzi, V.; Enow-Orock, G.; Fokam, J.; Terrinoni, A.; Kuiate, J.-R.

2026-06-10 pathology 10.64898/2026.06.06.26355059 medRxiv
Top 1.0%
1.7%
Show abstract

Background: In resource-limited settings, a critical bottleneck in cervical cancer prevention is the lack of practical strategies to triage high-risk human papillomavirus (HR-HPV)- positive women. Therefore, this study aimed to develop and internally validate a genotype-specific risk stratification model. Methods: A cross-sectional study enrolled 555 women in Cameroon. Data collection integrated cervical cytology and HPV genotyping using Abbott m2000rt and Sacace multiplex systems. An iterative modeling approach with bootstrap validation was used to develop the model and address model instability. HR-HPV genotypes were transformed into a hierarchical risk variable due to sparsity and integrated with significant predictors. The final model was translated into a scoring system, and the risk gradients and performances were evaluated at two thresholds. Data was analyzed using SPSS 27.0. Results: The mean age was 44.8 years, and the prevalence of HR-HPV was 26.5% (147/555). The final model, incorporating HPV categories, age, and tobacco, demonstrated moderate discriminative ability (AUC=0.702, 0.642-0.762) with a good calibration (Hosmer-Lemeshow {chi}{superscript 2}=4.05, p=0.399). The scoring system assigned women to risk groups based on their total scores which produced a clear monotonic risk gradient; the observed probability of high-grade lesions/cancer ranged from 15% (score 0) to >65% (score [&ge;]4). At a conservative threshold ([&ge;]4 points), 4.7% (26/555) of women were classified as high-risk, concentrating 46% (6/13) of cancers (positive predictive value[PPV]=58%) while a sensitive threshold ([&ge;]3 points) had 16.8% (93/555) high-risk, concentrating 77% (10/13) cancers (PPV=38%). Both thresholds maintained a high negative predictive value (>95%). Conclusion: This bootstrap-validated, risk-stratification tool is a proof-of-concept in resource limited settings that assigns HR-HPV-positive women to distinct management pathways using three variables. After refining through a longitudinal study and external validation, this scoring system can improve the efficiency of cervical cancer screening programs in low-resource settings.

10
Comparative Thermal Effects of Single Shot Pulsed Field Ablation Systems using a Thermochromic Hydrogel

Gill, J.; Saija, C.; Sagar, V.; Zuberi, Z.; Bajpai, A.; Rhode, K.; Leung, L. W.; Gallagher, M. M.

2026-06-04 cardiovascular medicine 10.64898/2026.06.02.26354772 medRxiv
Top 1%
1.7%
Show abstract

Background Pulse-field ablation (PFA) is regarded as a non-thermal ablation modality, but there is an increasing range of complications that could be due to thermal effects. Methods The hydrogel undergoes permanent colour change when a target temperature is reached allowing direct visualisation of the surface thermal footprint and depth. Comparative lesion sets using a variable loop circular catheter (VP), circular over-the-wire catheter (PS) and pentaspline catheter (FP) were performed. Protocols included single and stacked applications with variation of force, irrigation, and voltage. The hydrogel lesions were analysed en-face and by section using digital image analysis. Results All 3 PFA catheters tested had significant thermal footprints. The VP catheter had the largest mean surface footprint (156.1mm2) and thermal depth (1.31mm) compared to the other two catheters (PS 55.4mm2 & 1.1mm, FP 29.8mm2 & 1.05mm, p<0.005). Increasing irrigation showed a trend to reduce thermal footprint but did not achieve statistical significance. Increasing voltage increased thermal footprint, but increasing force had negligible effect. Stacked lesions incrementally increased thermal lesion footprint and depth in all catheters. Thermal depths of up to 2.4mm were observed. Areas of darkening and degradation of the hydrogel were observed with the VP and FP catheters, consisting of up to 47% of lesion area. No darkening was observed with the PS catheter. Conclusions There are significant thermal footprints in all the systems tested. Temperatures exceeding 60oC have been demonstrated, comparable to radiofrequency ablation, and this may explain the mechanism of injury in some reports of collateral damage during PFA.

11
Acceptability and Perceptions of Artificial Intelligence in Organized Breast Cancer Screening: A Study of French Women

Jean, A.; Merceron, A.; Le Saux, A.; Mercier, E.; Benillouche, P.

2026-06-09 radiology and imaging 10.64898/2026.06.07.26354883 medRxiv
Top 1%
1.7%
Show abstract

This study aims to assess women's perceptions of artificial intelligence (AI) used in breast cancer screening in France by examining their knowledge of AI and the barriers to their participation in organized screening. The results of a survey conducted in June 2025 among a national sample of 2000 women (aged 40-75) reveal limited participation and persistent concerns among women. Nevertheless, despite a low awareness of specific AI applications, a large majority of the women surveyed are very favorable to the use of AI in breast cancer diagnosis, even considering it a lever to increase screening participation.

12
Next-Generation Skin Cancer Detection Using Efficient Fuzzy Fusion of Genomic and Imaging Data

Molla, A. R.; Maity, A.; Saha, S.; Bhattacharya, R.; Chakraborty, A.; Biswas, S.; Nath, S.

2026-06-08 health informatics 10.64898/2026.06.05.26355024 medRxiv
Top 1%
1.5%
Show abstract

Skin cancer requires early detection for improved survival rates. Most existing methods rely on deep learning based image classification, which is affected by visual similarity among lesions. Fewer studies use Gene Expression (GE) analysis, which captures molecular characteristics but lacks structural and visual details. To overcome limitations of individual modalities, this paper proposes a multimodal framework integrating dermoscopic images and GE profiles for skin cancer classification. EfficientNet and logistic regression are used for image based analysis and genomic skin lesion profiling, respectively, followed by fuzzy rule based decision systems to reduce uncertainty within individual modalities. Finally, fuzzy fusion combines predictions from both modalities using uncertainty based weighting of classifier outputs. The experimental findings show that both the image based and GE based classification models individually achieved accuracies of nearly 92%. However, the integration of prediction results through the proposed fuzzy fusion strategy further enhanced the classification performance, achieving an overall accuracy of 94.25%. The results obtained outperform contemporary methods, highlighting the effectiveness of combining complementary multimodal information compared with single modality approaches.

13
Closing the Paediatric Gap: Adult-Trained AI Generalises Robustly to Paediatric Coeliac Disease Diagnosis

Jaeckle, F.; Gillett, P. M.; Kirkwood, K. J.; Natu, S.; Chan, J. Y. H.; Bateman, A. C.; Arends, M. J.; Soilleux, E. J.

2026-06-05 pathology 10.64898/2026.06.04.26354889 medRxiv
Top 1%
1.4%
Show abstract

Background Coeliac disease (CD) diagnosis on duodenal biopsies is limited by interobserver variability. We have previously demonstrated pathologist-level performance with our artificial intelligence (AI) model for the histopathological diagnosis of adult CD, but not in paediatric practice. As paediatric CD screening programmes expand internationally, accurate and scalable diagnostic tools are needed. We investigated whether an AI model trained exclusively on adult whole-slide images (WSIs) can generalise to paediatric CD diagnosis across independent centres. Methods A training and validation dataset of 9,958 WSIs from 8,421 adult patients (961 CD) from five centres was used to develop an ensemble of multiple-instance learning models using features from a foundation model. Testing was performed on 708 consecutive paediatric patients (86 CD) from two centres (Edinburgh and Southampton) not included in training. Model calibration was assessed, and probability outputs were grouped into clinically interpretable categories. Findings In adult cross-validation, the AI model achieved an area under the receiver operating characteristic curve (AUC) of 98.7%, sensitivity of 84.9%, specificity of 99.0%, and negative predictive value (NPV) of 98.1%. On testing (paediatric) datasets, performance remained high (AUC 98.8%, sensitivity 80.2%, specificity 98.4%, NPV 97.3%). Restricting analysis to predictions outside the intermediate-probability range (predicted CD probability <10% or [&ge;]65%; 85.3% of cases) improved sensitivity to 100% and specificity to 98.7%. No misclassifications were observed among high-confidence predictions (<2% or [&ge;]85%; 66.0% of cases). The expected calibration error was 0.03. Performance improved significantly when biopsies from both duodenal sites (bulb [D1] and descending [D2/3]) were considered. Interpretation Our AI model, trained on adult biopsies, generalises to paediatric CD diagnosis across centres and scanner platforms. Well-calibrated probability outputs provide clinically interpretable measures of diagnostic confidence and could support safe identification of CD-negative biopsies within defined thresholds. These findings demonstrate the feasibility of applying adult-derived AI models in paediatric populations and reinforce the importance of multi-site (D1 & D2) biopsy sampling.

14
A Comparison of Manual and Automated Approaches to Developing Computable Algorithms for Identifying Acute Pancreatitis

Bann, M. A.; Carrell, D. S.; Gruber, S.; Heagerty, P. J.; Williamson, B. D.; Nelson, J. C.; Hazlehurst, B.; Felcher, A.; Nyongesa, D. B.; Slaughter, M. T.; Sapp, D. S.; Cronkite, D. J.; Ball, R.; Floyd, J. S.

2026-06-08 health informatics 10.64898/2026.06.05.26354934 medRxiv
Top 1%
1.3%
Show abstract

Objective: Clinical phenotyping methods that rely on clinical and informatics expertise can be time-intensive and costly. We tested both manual and highly automated approaches using electronic health record (EHR) data to identify an FDA Sentinel Initiative health outcome of interest, acute pancreatitis. Materials and Methods: We trained and evaluated machine learning algorithms using EHR data with two approaches: a custom approach that included manually curated features and trained on outcomes data validated with medical record review, and a highly automated approach that greatly simplifies and automates feature engineering and relies on low-cost silver-standard outcomes for model training. Results: Custom algorithms using manually curated structured claims data discriminated cases from non-cases with a high degree of accuracy (cv-AUC 0.89 [95%CI 0.84-0.94]); the inclusion of natural language processing (NLP)-derived covariates from clinical notes increased performance slightly (cv-AUC 0.91[95%CI 0.86-0.97]). The automated algorithm trained on the outcome count of diagnosis codes performed less well (AUC 0.80 [95% CI 0.75-0.85]) but improved using maximum lipase value as an outcome (AUC 0.88 [95% CI 0.84-0.92]). At a positive predictive value of 90%, the custom algorithm had a sensitivity of 92%, the automated algorithm trained on diagnosis code count had a sensitivity of 45%, and the automated algorithm trained on maximum lipase value had a sensitivity of 84%. However, a prediction rule derived by clinicians during chart review was nearly as accurate (maximum lipase value [&ge;] 3 times upper limit of normal; AUC 0.86, PPV 85%, sensitivity 92%). Discussion: Machine learning algorithms with manually curated structured data and NLP features trained on validated outcomes data successfully identified validated events. Use of an outcome in the automated model based on specific phenotype knowledge (maximum lipase value) allowed for performance similar to the custom model and with considerably less resources.

15
Computational and Experimental Antibody Affinity and Diagnostic Accuracy Quantification of SARS-CoV-2 SD2 Major Disulfide Loop Analog

Pollo, B. A. L. V.; Perias, G. A.; Aguimatang, R. H.; Espiritu, A. P.; Ching, D.; Idolor, M. I.; King, R. A.; Climacosa, F. M.; Caoili, S. E.

2026-06-08 infectious diseases 10.64898/2026.06.05.26353587 medRxiv
Top 1%
1.2%
Show abstract

Introduction: Synthetic oligopeptides provide a rapid and cost-efficient approach to developing antibodies and diagnostics for emerging viral variants. Methods: This study computationally and experimentally characterized a synthetic peptide analog of the SARS-CoV-2 spike subdomain 2 major disulfide loop (SD2MDL), designated S621 (CPVAIHADQLTPTWRVYSTC). Binding affinity was computationally estimated using the Heuristic Affinity Prediction Tool for Immune Complexes (HAPTIC), while experimental validation was performed using enzyme-linked immunosorbent assay (ELISA) with rabbit-derived antipeptide antibodies. Clinical diagnostic accuracy testing was done using plasma samples from RT-PCR-confirmed COVID-19 patients and pre-COVID-19 controls. Results: S621 demonstrated nanomolar binding affinity (Kdapp = 1.14 nM) and high avidity (3.67 nM), closely matching HAPTIC predictions (3.54 nM). Diagnostic evaluation yielded a sensitivity of 89.92% and specificity of 27.79%, corresponding to an overall accuracy of 71.79%. Discussion: These findings demonstrate that a single synthetic peptide derived from a conserved spike subdomain can function as a high-affinity surrogate for full-length antigens, supporting its potential application in rapid peptide-based immunodiagnostics.

16
Immunohistochemical phenotype is associated with metastatic site in breast cancer: a retrospective pathomorphological study of women from the Lower Aral Sea region, Uzbekistan

Khodjaniyazov, A. A.; Rojobov, R. R.

2026-06-08 pathology 10.64898/2026.06.05.26354969 medRxiv
Top 1%
1.2%
Show abstract

Background: Breast cancer is the most frequently diagnosed cancer and the leading cause of cancer death in women worldwide, and the great majority of these deaths are caused by metastatic disease. Whether the immunohistochemical (IHC) phenotype of breast cancer is associated with the anatomical site of metastasis has been characterized mainly in high-income, registry-based populations, while data from ecologically stressed and medically under-served regions such as the Lower Aral Sea basin are lacking. Methods: We retrospectively reviewed 652 women diagnosed with breast cancer at the Khorezm Branch of the Republican Specialized Scientific-Practical Medical Center of Oncology and Radiology (Uzbekistan) between 2020 and 2024, of whom 213 had metastatic disease (306 metastatic foci). Histological type was assessed on hematoxylin-eosin and van Gieson-stained sections; quantitative morphometry was performed in Fiji/ImageJ; and HER2, estrogen receptor (ER), progesterone receptor (PR) and Ki-67 were assessed by IHC. The association between marker expression and metastatic site (liver, lung, lymph node) was tested in 187 foci with adequate tissue using the chi-square test, with significance at p < 0.05. Results: Invasive ductal carcinoma predominated. Metastatic site was significantly associated with the IHC phenotype. Liver metastases showed the highest frequency of HER2 3+ (45.7%), ER-negativity (65.2%), PR-negativity (69.6%) and high proliferation (Ki-67 [&ge;] 60%; 47.8%), whereas lymph-node metastases were more often hormone-receptor-positive (ER+ 58.7%; PR+ 52.4%) with lower HER2 3+ (22.2%); lung metastases were intermediate (all p < 0.05). The combination of HER2 3+ and Ki-67 [&ge;] 60% was associated with multi-organ spread. Morphometry corroborated these patterns: liver lesions had larger atypical cells (up to 132.8 m), a higher nuclear-to-cytoplasmic ratio (0.76 vs 0.51) and more extensive necrosis and microvascularity than lymph-node lesions. A pragmatic 5-criterion morphological score (histological type, Ki-67, HER2, ER/PR status, atypical-cell size) stratified metastatic risk into three tiers. Conclusions: In this regional cohort, the IHC phenotype of breast cancer tracked the anatomical site of metastasis, with an aggressive HER2-driven, hormone-receptor-negative profile concentrated in liver metastases and a hormone-receptor-positive profile in lymph-node metastases. These findings reproduce established organotropism patterns in a previously uncharacterized population and support phenotype-aware, site-specific surveillance together with a low-cost morphological risk score for resource-limited settings.

17
Dementia and Frailty Impact Postoperative Care Trajectories and Burden among Older Adults Undergoing Radical Cystectomy for Bladder Cancer

Ernandez, J.; Xiang, L.; Adler, R.; Hsu, J.; Shah, S. K.; Kim, D.; Gershman, B.; Mossanen, M.; Weissman, J. S.

2026-06-06 urology 10.64898/2026.06.04.26354768 medRxiv
Top 2%
1.0%
Show abstract

OBJECTIVE: Bladder cancer (BC) is predominantly a disease of older, comorbid adults, and radical cystectomy (RC), which is the gold standard treatment, carries considerable morbidity. We sought to determine the impact of baseline dementia and frailty on the care trajectory beyond the immediate postoperative period. We hypothesized that frail patients and those with dementia undergoing RC for BC will have poorer care trajectories. METHODS AND MATERIALS: We identified Medicare beneficiaries [&ge;] 66 years old who underwent RC for BC in 2017 with 12 months of pre- and post-RC enrollment. Frailty and dementia were characterized using validated, claims-based measures. Associations between baseline frailty and dementia with postoperative care trajectory outcomes were determined using Fine-Gray competing risk models. RESULTS: We identified 3,600 beneficiaries of whom 11.6% were frail and 3.4% met criteria for dementia. Patients with dementia were more likely to be frail, comorbid, and not receive standard-of-care neoadjuvant chemotherapy. Frailty was independently associated with [&ge;] 2 transitions in care level after index discharge from RC and skilled nursing facility (SNF) admissions within 1 year of RC, exposure to intensive post-RC interventions, including dialysis and feeding tube placement, and poorer survival. Dementia remained associated with SNF admissions regardless of frailty level. CONCLUSIONS: Among a contemporary cohort of older adults undergoing RC for BC, preoperative dementia and frailty were independently associated with poorer care trajectory beyond the immediate postoperative period after RC. Our work highlights a role for preoperative geriatric assessment in identifying and optimizing patients at greatest risk.

18
Hemorrhagic Transformation After Endovascular Thrombectomy in Young Adults: A Prediction Model

Lv, Q.; Yuan, K.; Liao, A.; Wang, Z.; Li, Y.; Xiao, G.; Liu, W.; Zhou, Z.; Yang, D.; Huang, K.; Chen, C.; Dong, W.; Pan, L.; Zhu, W.; Liu, X.

2026-06-05 neurology 10.64898/2026.06.03.26354874 medRxiv
Top 2%
0.9%
Show abstract

Background and Purpose: Hemorrhagic transformation (HT) is a serious complication of endovascular thrombectomy (EVT), yet dedicated prediction models for young adults are lacking. We aimed to develop and externally validate a simplified risk score for HT in young adults with acute ischemic stroke undergoing EVT. Methods: This multicenter retrospective study included patients aged 18 to 49 years with acute anterior circulation large vessel occlusion who underwent EVT. The primary outcome was any HT within 24 hours after EVT. Multivariable logistic regression was used to identify independent predictors of HT, from which the NO?PAIN Score was derived. External validation was performed in an independent cohort of 138 patients. Results: Among 598 patients in the derivation cohort, HT occurred in 176 (29.4%). Five independent predictors were identified: admission NIHSS, number of thrombectomy passes, atrial fibrillation, alcohol consumption, and mTICI grade. The mTICI grade demonstrated a non-linear, inverted U-shaped relationship with HT risk, peaking at partial recanalization. The NO-PAIN Score showed acceptable discrimination in both the derivation (C-index, 0.737; optimism-corrected C-index, 0.748) and external validation cohorts (C-index, 0.726), with satisfactory calibration. Conclusions: The NO-PAIN Score is a simple risk prediction tool for HT after EVT in young adults with acute anterior circulation large vessel occlusion. It may assist in individualized risk stratification in this population.

19
CarotidMamba: Foundation Model-Enabled CTA Phenotyping of Symptomatic Carotid Plaques in a Multi-Center Retrospective Study

Liu, Y.-S.; Dou, X.-W.; Zheng, P.-Y.; Feng, W.; Ma, L.-J.; You, Y.-N.; Shao, G.-W.; Shen, J.-G.; Yu, X.; Qiao, C.; Cheng, Z.-W.; Li, Z.-W.; Su, F.; Zhang, B.-W.; Qu, X.-H.; Jiang, g.

2026-06-05 cardiovascular medicine 10.64898/2026.06.02.26354776 medRxiv
Top 2%
0.9%
Show abstract

Background: Treatment decisions for carotid atherosclerotic disease rely primarily on luminal stenosis, although plaque vulnerability and symptomatic status better reflect short-term cerebrovascular risk. A scalable CTA tool for automated phenotyping of symptomatic carotid disease is lacking. Materials & Methods: In this multi-institutional retrospective study, 689 patients (mean age, 67.9 {+/-} 7.7 years; 366 men) from four hospitals were analyzed after screening 705 CTA examinations. 423 patients from one center were used for five-fold development and internal validation, and 266 patients from three centers for independent external validation. CarotidMamba, a deep learning framework combining dual foundation-model encoders with Mamba-based sequence modeling, was developed and benchmarked against clinical, radiomics, clinic-radiomics, CNN, and transformer comparators. Results: In the development cohort, CarotidMamba achieved an AUC of 0.839 (95% CI, 0.799-0.879) and accuracy of 0.825 (95% CI, 0.793-0.857), outperforming the strongest comparator by 0.066 and 0.050, respectively. External validation yielded AUCs of 0.897 (95% CI, 0.835-0.959) in YCH, 0.809 (95% CI, 0.720-0.898) in DCH, and 0.762 (95% CI, 0.649-0.875) in GH-NTC. CarotidMamba showed the lowest Brier score and expected calibration error across cohorts, with calibration slopes near 1.0. Conclusion: CarotidMamba provides an interpretable, clinically oriented, and externally validated CTA framework for phenotyping symptomatic carotid plaques, supporting vulnerability-aware imaging assessment beyond stenosis alone.

20
Registered Report: Artifact Index for Capacitive Electrocardiography Acquired with an Armchair

Warnecke, J. M.; Baumgärtel, D.; Bollmann, J.; Deserno, T. M.

2026-06-09 health informatics 10.64898/2026.06.03.26353526 medRxiv
Top 2%
0.9%
Show abstract

Background Continuous health monitoring enables early detection of diseases and improves therapeutic outcomes. Non-intrusive biosignal sensors, such as capacitive ECG (cECG), offer a practical solution for daily monitoring in private environments, such as smart homes and vehicles. However, artifacts reduce signal quality and compromise reliability. Methods Following a registered report protocol (Warnecke JM et al. Plos One. 2021; 16(7):e0254780), we record data of 44 subjects and develop an artifact index for cECG. We use three signal quality indices (SQIs): the correlation of QRS complexes (corSQI), the R-peak detection consistency (bSQI) and the absolute amplitude ratio (aSQI). Our index classifies overlapping 10s segments with a step-width of 2s into clean or artifact segments. We label a 2s interval as artifacts if all five overlapping segments indicate artifacts. We record cECGs using an armchair with integrated electrodes in a single-arm study involving 44 subjects performing two activities -- reading and watching television (TV); for 11 minutes each. We record a time-synchronized reference ECG with skin electrodes on the chest. To evaluate the artifact index, we compare it with manually generated ground truth. Moreover, we evaluate the clothing materials cotton, linen, jeans, and polyester in 5 subjects. Results Watching TV results in longer, continuously clean signal durations than reading. On average, 88.3% of the signal has a minimum continuous clean duration of 10s, versus 79.8% during reading. All clothing configurations achieve a clean signal duration exceeding 10s. Among the SQI metrics, bSQI performs best, achieving an accuracy of 90.7% and an F1 score of 79.9%. Combining the three SQI metrics in a voting approach improves accuracy to 92.0% and F1 score to 82.1%. Discussion Our artifact index automatically distinguishes clean from artifact cECG segments, promoting health monitoring in unsupervised real-world settings, earlier disease detection, and preventive health management. A limitation is the investigation of only two scenarios (reading and watching TV).