Life — Latest Matching Preprints

1

Multi-Agent Dynamic Refinement Outperforms Static RAG in Clinical Reasoning for Complex Nephrology Cases

Yano, Y.; Kakizaki, H.; Nagasu, H.; Kishi, S.; Koshida, T.; Nihei, Y.; Hirano, A.; Sugawara, Y.; Imaizumi, T.; Osakabe, Y.; Sakaguchi, Y.; Nangaku, M.; Mori, H.; Naito, T.; Ohashi, M.; Maruyama, S.; Matsui, I.; Isaka, Y.; Okada, H.; Suzuki, Y.; Kashihara, N.

2026-07-16 nephrology 10.64898/2026.07.15.26358121 medRxiv

Top 0.4%

1.1%

Background: Large language models (LLMs) struggle with dynamic, longitudinal clinical reasoning. We developed a Multi-Stage Iterative Clinical Reasoning Agent framework to address this gap and systematically decouple the clinical efficacy of static retrieval-augmented generation (RAG) from dynamic self-refinement. Methods: Ten complex longitudinal nephrology cases, rigorously selected via a modified Delphi consensus technique, were blindly evaluated by four board-certified nephrologists and a multi-model AI panel. We compared three architectures across nine cognitive steps: (Model A) a baseline frontier LLM, (Model B) an LLM augmented with static guideline-based RAG, and (Model C) our proposed multi-agent framework featuring RAG integrated with iterative self-critique and refinement. Results: In human evaluations (20-point scale), Model C (mean 17.2, SD 1.2) significantly outperformed both Model A (16.1, 1.3) and Model B (16.2, 1.2) (P < 0.001). Implementing static RAG (Model B) yielded no significant improvement over the baseline. Automated AI evaluations (15-point scale) corroborated these findings: Model C (14.7, 0.6) outscored Model A (14.2, 0.9, P < 0.001) and Model B (14.3, 0.9, P = 0.01). While monolithic models exhibited severe score degradations in planning-heavy tasks such as dynamic differential diagnoses, the multi-agent framework effectively intercepted error cascades, achieving significantly higher diagnostic accuracy (mean 17.6, P = 0.019) and therapeutic management scores (17.3, P = 0.002). Conclusions: Static knowledge retrieval alone fails to enhance frontier LLM performance in longitudinal medical reasoning. Distributing clinical workflows into a multi-agent dynamic refinement pipeline significantly improves reasoning completeness, intercepts error cascades, and safely resolves planning bottlenecks in complex patient care.

2

Microvascular Thrombosis and Acute Kidney Injury in COVID-19: A Systematic Review and Quantitative Analysis

Duarte, C. A.; Uscocovich, V. S. M.; Misael, I.; Duarte, P. D. A. C.; Sestito, E. B.; Da Silva, P. N.

2026-07-17 nephrology 10.64898/2026.07.14.26357748 medRxiv

Top 0.4%

1.1%

Objective: To synthesize the available evidence on the association between SARS-CoV-2-related microvascular thrombosis and acute kidney injury (AKI), with emphasis on renal outcomes, mortality, and renal replacement therapy requirements. Methods: This systematic review followed the PRISMA 2020 statement and was prospectively registered in PROSPERO (CRD420251132701). PubMed/MEDLINE, Scopus, and Embase were searched for systematic reviews, including meta-analyses, and umbrella reviews investigating the association between SARS-CoV-2-related microvascular thrombosis and acute kidney injury. Two reviewers independently performed study selection, data extraction, and methodological quality assessment using AMSTAR-2 and ROBIS. Evidence was synthesized through a structured narrative synthesis supported by quantitative data extracted from the included reviews. Results: Six evidence syntheses evaluating kidney involvement, thrombotic events, and microvascular mechanisms in COVID-19 were included. AKI incidence was 9.2% (95% CI 4.6-13.9) among hospitalized patients and 32.6% (95% CI 8.5-56.6) among critically ill patients. In children with multisystem inflammatory syndrome associated with SARS-CoV-2, AKI incidence was 20% (95% CI 14-28). Microvascular or thrombotic events were associated with adverse renal outcomes (OR 2.14; 95% CI 1.32-3.48). AKI was associated with increased mortality (OR 4.68; 95% CI 1.06-20.70) and greater likelihood of renal replacement therapy requirement (OR 2.87; 95% CI 1.45-5.68). The certainty of evidence ranged from moderate to high for the principal outcomes. Conclusion: Current evidence supports an important association between microvascular thrombotic injury and COVID-19-associated AKI.
These findings reinforce the relevance of endothelial dysfunction and thromboinflammatory pathways in kidney involvement during COVID-19 and highlight the need for early renal monitoring, risk stratification, and kidney-protective strategies in high-risk patients. Keywords: COVID-19; Acute Kidney Injury; Microvascular Thrombosis; SARS-CoV-2; Renal Replacement Therapy; Systematic Review
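
The odds ratios quoted in this abstract come with 95% confidence intervals, and the width of an interval on the log scale indicates how precise each estimate is. The sketch below assumes the intervals were built as Wald intervals on the log-odds scale (the abstract does not say so), and the function names are illustrative. It recovers the implied standard error of log(OR) and checks that each point estimate sits near the log-scale midpoint of its interval.

```python
import math

def log_or_standard_error(lower, upper, z=1.96):
    """Standard error of log(OR) implied by a 95% Wald interval (assumed form)."""
    return (math.log(upper) - math.log(lower)) / (2 * z)

def ci_is_log_symmetric(estimate, lower, upper, tolerance=0.02):
    """True if log(estimate) lies within `tolerance` of the log-scale midpoint."""
    midpoint = (math.log(lower) + math.log(upper)) / 2
    return abs(midpoint - math.log(estimate)) <= tolerance

# (OR, lower, upper) as reported in the abstract.
reported = {
    "adverse renal outcomes": (2.14, 1.32, 3.48),
    "mortality": (4.68, 1.06, 20.70),
    "renal replacement therapy": (2.87, 1.45, 5.68),
}

standard_errors = {
    name: log_or_standard_error(lower, upper)
    for name, (_, lower, upper) in reported.items()
}
```

Under that assumption, the mortality estimate carries by far the largest standard error of the three, which matches its wide reported interval.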

3

PARIS (Pneumonia: Acute Respiratory Infection +/- Sepsis): a prospective single-centre observational cohort study of hospitalised patients with pneumonia

Nasser, S. T.; Piercy, C. R.; Falinska, A.; O'Sullivan, D. M.; Devonshire, A.; Martinez-Estrada, F.; Huggett, J.; Creagh-Brown, B. C.

2026-07-17 respiratory medicine 10.64898/2026.07.15.26357955 medRxiv

Top 0.5%

1.1%

Introduction Hospitalised community-acquired pneumonia (CAP) is heterogeneous in aetiology, severity, and outcome. Phenotyping and endotyping approaches offer potential to stratify patients biologically and guide targeted therapy, but require well-characterised cohorts with linked biosamples. We describe the PARIS (Pneumonia: Acute Respiratory Infection +/- Sepsis) study: a prospective observational cohort of hospitalised patients with pneumonia, designed to characterise functional outcomes and to provide a biobank for translational immunological research. Methods Adults admitted with CAP to a single NHS district general hospital were enrolled within 24 hours of admission between December 2020 and March 2022. Clinical, functional, and physiological data were collected at enrolment, hospital discharge, and 6-8 week follow-up. Serial blood samples were collected for flow cytometry, transcriptomics, pathogen DNA detection, and plasma biobanking. Results Forty-seven patients were enrolled (15 without and 32 with sepsis [SOFA >=2] at enrolment); 87% met sepsis criteria by 24 hours post enrolment. Most patients (30/47, 64%) were managed as COVID-19, microbiologically confirmed in 27. Mean age was 57 years (SD 16), 70% were male, and baseline comorbidity burden was low. Severity was moderate (median NEWS2 4 at enrolment, rising to 6 by 24 hours post enrolment; p<0.001). Mortality was 4/47 (8.5%), with 44/47 (94%) alive at hospital discharge. Median length of stay was 8 days (IQR 5.5-11). Translational samples were collected from the majority: fresh flow cytometry (44/47, 94%), transcriptomics from the sepsis subgroup (31/32, 97%), pathogen DNA sampling (35 samples received across study timepoints; see Table 5), and stored plasma (29/47, 62%). The primary outcome of functional decline (Barthel score decrease >=1.85) occurred in only 1/29 patients with paired assessments (3.4%). 
Persistent CRP elevation (>3 mg/L) at 6-8 week follow-up was present in 16/31 (52%) survivors with available data. Conclusions The PARIS cohort provides a well-characterised clinical platform and linked biobank to support translational studies of pneumonia and sepsis. The low rate of functional decline reflects the younger, lower-comorbidity, COVID-predominant population recruited. Primary protocol endpoints were not achieved owing to pandemic-related disruption. Data and samples underpin a programme of linked translational studies.

4

In Silico Trial Simulation with Artificial Intelligence-Generated Synthetic Control Cohorts Reproduces Results of a Randomized Controlled Trial in Acute Myeloid Leukemia

Kumar Reddy, K.; Hahn, W.; Winter, S.; Roellig, C.; Mueller-Tidow, C.; Serve, H.; Baldus, C. D.; Fransecky, L.; Schliemann, C.; Burchert, A.; Schaefer-Eckart, K.; Kaufmann, M.; Schetelig, J.; Bornhaeuser, M.; Middeke, J. M.; Eckardt, J.-N.

2026-07-16 health informatics 10.64898/2026.07.15.26358123 medRxiv

Top 0.6%

1.0%

Rising costs, slow accrual and molecular substratification of cancers necessitate novel clinical trial designs. We demonstrate that artificial intelligence-generated synthetic patients can replace real controls to reproduce results of the SORAML trial. Using external multimodal data from 1,377 acute myeloid leukemia (AML) patients from previous trials and a real-world registry, we fine-tuned a tabular foundation model to generate synthetic patients, reproducing clinical and genetic features and outcome associations. Synthetic patients were then matched to the original SORAML intervention group using Cox risk scores, replacing the original control and reproducing the original trial result with near-identical median event-free survival (EFS) and treatment effect (original hazard ratio [HR] 0.64, 95%-confidence interval [CI] 0.47-0.87, p=0.004; with synthetic control HR 0.66, 95%-CI 0.48-0.90, p=0.009). Our findings demonstrate that AI-generated synthetic patients can serve as statistically rigorous controls supporting novel trial designs.

5

From Real-World Data to Virtual Intervention: A Probabilistic Neural Network for Simulating Kidney Function Preservation via Proteinuria Reduction

Takeda, A.; Igata, H.; Mizuno, K.; Yano, Y.; Nagasu, H.; Ohashi, M.; Kashihara, N.; Kobayashi, H.

2026-07-15 nephrology 10.64898/2026.07.12.26357786 medRxiv

Top 1%

0.6%

Predicting long-term kidney function decline is critical for timely intervention but remains challenging. While the urinary protein-to-creatinine ratio (uPCR) is a potential surrogate endpoint, the link between its short-term reduction and long-term nephroprotection requires investigation. This study aimed to develop a probabilistic neural network model to capture both the estimated glomerular filtration rate (eGFR) slope and its uncertainty based on baseline clinical characteristics. Using a retrospective dataset, we designed a neural network to output a predictive distribution (mean and standard deviation σ) for the eGFR slope. SHAP (SHapley Additive exPlanations) was used for model interpretation, and a simulation study quantified the impact of uPCR reduction. In the validation set, the model achieved a Pearson's correlation coefficient of 0.56 and an RMSE of 2.81 mL/min/1.73 m²/year between predicted and actual slopes. SHAP analysis identified uPCR as the most potent predictor, with higher baseline levels associated with a more rapid eGFR decline. Furthermore, a simulated 62% uPCR reduction demonstrated a significant improvement in the predicted eGFR slope, an effect most pronounced in patients with high baseline uPCR. This proof-of-concept study reinforces the critical role of uPCR in predicting eGFR slope and suggests its reduction may contribute to long-term kidney function preservation, warranting validation in larger, diverse real-world datasets.
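
A model that outputs a predictive mean and standard deviation is commonly trained with a Gaussian negative log-likelihood. The abstract does not state which loss the authors used, so the sketch below is an assumption rather than their method; the function name and the sample slopes are illustrative only.

```python
import math

def gaussian_nll(y, mu, sigma):
    """Negative log-likelihood of y under Normal(mu, sigma**2)."""
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    return 0.5 * math.log(2 * math.pi * sigma ** 2) + (y - mu) ** 2 / (2 * sigma ** 2)

# Illustrative values only (not from the preprint), in mL/min/1.73 m^2/year.
observed_slope = -3.0

# Same wrong mean, but one prediction claims much more certainty than the other.
confident_miss = gaussian_nll(observed_slope, mu=-1.0, sigma=0.5)
hedged_miss = gaussian_nll(observed_slope, mu=-1.0, sigma=2.0)
```

With the same error in the mean, the prediction that reports a larger σ is penalized less, which is what lets such a loss reward honest uncertainty estimates.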

6

MedZone Embedder: a framework for representation learning of Japanese secondary medical care areas from a national ICU registry, characterizing intensive care provision structure and regional vulnerability

Ohno, K.; Hashimoto, S.

2026-07-20 health informatics 10.64898/2026.07.17.26358373 medRxiv

Top 1%

0.6%

Background: In Japan, acute inpatient care is divided into approximately 335 secondary medical care areas, which serve as the basic units for planning healthcare delivery systems under the 8th National Health Care Plan. While comparisons between regions and facilities typically rely on a single risk-adjusted metric, this approach confuses differences in patient demographics with differences in the actual infrastructure of intensive care units (ICUs). This paper presents a framework - MedZone Embedder - for deriving data-driven indicators of regional structural vulnerability by mapping secondary medical care areas onto a learned similarity space, together with its working implementation. The paper sets out the concept, the method, a proof of concept, and an explicit staged validation program, rather than national empirical results. Methods: Each area is represented by a feature vector consisting of aggregated values of intensive care provision indicators derived directly from the Japan Intensive Care Patient Database (JIPAD) - specifically, risk-adjusted mortality rates (standardized mortality ratios and an in-hospital composite indicator), technical efficiency, length of stay, readmission rates, case severity, and case composition - with the within-area variance of these indicators also taken into account. No hierarchical processing by facility type is performed. A contrastive autoencoder (multilayer perceptron encoder 32 -> 16 -> 8, symmetric decoder) is trained by self-supervised learning, using an objective function that combines reconstruction and normalized temperature cross-entropy (NT-Xent) on noise-augmented views. The resulting 8-dimensional embedding supports area searches based on cosine similarity and anomaly scoring in the embedding space (using isolation forest, Mahalanobis distance, or k-nearest-neighbor density), which is normalized to a vulnerability score ranging from 0 to 1. 
If deep learning libraries are unavailable, or if the number of areas is small, an alternative method using deterministic principal component analysis is employed. Results: This method was implemented and deployed within an operational ICU decision support system on a managed cloud platform. The proof of concept (PoC) is structured around five secondary medical care areas within Kyoto Prefecture and runs entirely on synthetic facility-level aggregate data constructed to follow the JIPAD indicator schema; no registry data were accessed. It generated: an aggregate provision profile for each area; an area embedding space equipped with a similar-area search function; and a vulnerability ranking that identifies areas with low patient numbers and low diversity that exhibit overall poor outcomes. At this scale, the contrastive autoencoder falls back to principal component projection. The deep learning pathway has been implemented and unit testing has been completed; training and evaluation on actual registry data are pending data-use approval and the expansion of data integration. Validation is staged: Stage 2 will train the contrastive pathway over JIPAD-covered areas to assess construct validity against public structural indicators (ICU/HCU beds, population, accessibility), and Stage 3 will extend coverage to all areas via National Database (NDB) linkage. Conclusion: MedZone Embedder reframes regional comparison from single-indicator ranking to structural representation: which areas are alike, and which are structural outliers. The contribution of this paper is the framework - the proposal that the intensive care provision structure of Japanese secondary medical care areas can be learned from a national outcomes registry and read through the lens of what we call institutional debt - together with a deployed implementation and a pre-specified validation program. 
To our knowledge, this is a candidate first application of contrastive representation learning to Japanese secondary medical care areas.
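
The NT-Xent objective named in the Methods can be illustrated in a few lines of plain Python. This is a minimal sketch of the standard loss, not the authors' implementation: the temperature of 0.5, the function names, and the toy two-dimensional vectors are all assumptions. The `cosine` helper is the same similarity measure that a cosine-based area search would rank by.

```python
import math

def cosine(u, v):
    """Cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def nt_xent(views_a, views_b, temperature=0.5):
    """Mean NT-Xent loss; views_a[i] and views_b[i] are two views of item i."""
    z = list(views_a) + list(views_b)
    n = len(views_a)
    total = 0.0
    for i in range(2 * n):
        positive = (i + n) % (2 * n)  # index of the other view of the same item
        # Scaled similarities to every other embedding (self is excluded).
        logits = {
            k: cosine(z[i], z[k]) / temperature
            for k in range(2 * n)
            if k != i
        }
        log_denominator = math.log(sum(math.exp(v) for v in logits.values()))
        total += log_denominator - logits[positive]
    return total / (2 * n)

# Toy embeddings (hypothetical): two areas, two noisy views each.
anchors = [[1.0, 0.0], [0.0, 1.0]]
aligned_views = [[0.9, 0.1], [0.1, 0.9]]
swapped_views = [[0.1, 0.9], [0.9, 0.1]]

aligned_loss = nt_xent(anchors, aligned_views)
swapped_loss = nt_xent(anchors, swapped_views)
```

The loss is lower when each item's two views sit close together and far from the other items, which is the property the embedding is trained to have.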

7

A ReAct Agentic AI System for Natural Language Querying and Statistical Analysis of The Cancer Genome Atlas Clinical Data

Korutla, R.; Amal, S.

2026-07-17 health informatics 10.64898/2026.07.15.26358188 medRxiv

Top 2%

0.4%

The Cancer Genome Atlas (TCGA) holds clinical data for over 11,000 patients across 33 cancer types, but access is hard because of complex file structures, heterogeneous formats, and the need for programming. We present an agentic system for natural language querying and statistical analysis of TCGA clinical data. The system uses a large language model as an autonomous ReAct agent that selects from eight computational tools, including data extraction, descriptive statistics, Kaplan-Meier survival analysis with log-rank tests, hypothesis testing, and verification against the curated TCGA Pan-Cancer Clinical Data Resource (CDR). The agent reasons about intermediate results, adapts its approach, and returns clinically contextualized responses with source attribution and auditable traces. We introduce TCGA-Agent-Bench, 440 queries across five difficulty tiers with ground truth from the independently curated TCGA-CDR, evaluated with dual metrics of numerical accuracy and clinical completeness. The system achieves 93.4% overall accuracy (100% single-patient lookups, 99.1% cohort statistics, 92.8% comparative analyses), outperforming a fixed rule-based pipeline (87.1%), a single-pass LLM (81.8%), and retrieval-augmented generation (66.9% on a subset). Most of the benchmark is answerable from the CDR alone, so we locate the extraction layer's value in fields the CDR lacks (drug treatments, TNM components, biomarkers, biospecimen metadata): on 26 queries targeting these, the full system answers 100% versus 3.8% for CDR-only. Ablations show the reasoning loop is most impactful (+9.1% accuracy, +22.0 completeness points). A tool-based agentic architecture enables accurate, auditable analysis of clinical repositories, with value driven by tool design and recovered fields rather than model scale.

8

MeshScope-Region: Distribution, Road-Network Accessibility, and Nine-Year Evolution of ICU and HCU Capacity Across Japan's 330 Secondary Medical Areas

Ohno, K.; Hirai, M.; Hashimoto, S.

2026-07-20 health informatics 10.64898/2026.07.17.26358374 medRxiv

Top 3%

0.3%

Background: In Japan, health planning is organized around secondary medical areas (SMAs; niji-iryo-ken; 330 areas in the 2025 classification), yet nationwide analyses of intensive care unit (ICU) capacity have been conducted mainly at the prefecture level, and a recent SMA-level study addressed only the presence or absence of ICUs. The full supply structure of intensive and intermediate critical care - ICU and high care unit (HCU) beds - has not been characterized at the SMA level with respect to its composition, road-network accessibility, and evolution over time. Methods: We developed MeshScope-Region, an analytical platform built on the Hospital Bed Function Reports (byosho-kino-hokoku) for fiscal years 2016-2024, in which ICU and HCU beds were identified from notified reimbursement categories and aggregated to SMAs. Three analytical layers were integrated: (1) cross-sectional distribution of ICU/HCU beds; (2) nationwide road-network accessibility computed with the Open Source Routing Machine (OSRM) from 176,962 populated 1-km census grid cells to all facilities reporting ICU or HCU beds; and (3) a nine-year longitudinal analysis of supply-structure types, classified by k-means (k = 6) in an 8-dimensional PCA space anchored to fiscal year 2024, with earlier years projected into the same space. Results: In fiscal year 2024, 20,631 ICU/HCU beds were reported nationally (7,114 ICU-type; 13,517 HCU-type) at 1,044 facilities. Zone-level totals among SMAs with any beds ranged 229-fold (3-688 beds); the 90th/10th percentile ratio of per-capita density was 3.6. In total, 90.1% of the population resided within 30 minutes' drive of a facility with ICU beds and 97.8% within 60 minutes; only 0.8% resided beyond 90 minutes. Although 140 of the 330 SMAs had no ICU facility within their own boundaries, 84.7% of their residents could reach an ICU facility in an adjacent area within 60 minutes' drive. 
Longitudinally, supply structures were highly persistent: 63.0% of SMAs (208/330) retained the same structural type across all nine years, adjacent-year rank correlations of a supply-vulnerability index were 0.887-0.924 (2016 vs. 2024: rho = 0.711), and the number of SMAs with zero ICU beds remained frozen at 133-141. The Gini coefficient of bed distribution declined from 0.384 to 0.262 - although computed on ICU-type beds alone it remained 0.365 in fiscal year 2024 - and capacity growth (total +27.9%) was driven predominantly by HCU beds (+41.6%) while ICU beds grew only +8.0%. Conclusions: Japan's critical care supply structure is regionally rigid, with a stable set of approximately 140 SMAs lacking ICU beds for nearly a decade, yet road-network accessibility substantially mitigates the consequences of zone-level absence. Recent capacity growth - and much of the apparent equalization - has occurred predominantly in intermediate care. MeshScope-Region provides a standing, reproducible evidence base at the geographic unit of Japan's medical planning cycles.
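
The Gini coefficient quoted in this abstract summarizes how unevenly beds are spread across areas. The abstract does not say whether the authors weighted by population or which exact formula they used, so the sketch below shows only the plain unweighted version; the function name and the toy bed counts are illustrative.

```python
def gini(values):
    """Unweighted Gini coefficient of non-negative values (0 means perfectly equal)."""
    xs = sorted(values)
    n = len(xs)
    total = sum(xs)
    if n == 0 or total == 0:
        raise ValueError("need at least one positive value")
    # Rank-weighted sum over the ascending-sorted values.
    weighted = sum(rank * x for rank, x in enumerate(xs, start=1))
    return 2 * weighted / (n * total) - (n + 1) / n

# Toy bed counts for four hypothetical areas (not data from the preprint).
evenly_spread = gini([5, 5, 5, 5])
fully_concentrated = gini([0, 0, 0, 10])
```

For four areas, an even spread gives 0 and full concentration in one area gives 0.75, the maximum of (n - 1)/n for n = 4.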

9

Switching from febuxostat to dotinurad in patients with chronic kidney disease and hyperuricemia: a single-center, non-randomized study

Irifuku, T.; Kashiwado, S.; Masaki, T.

2026-07-18 nephrology 10.64898/2026.07.16.26358294 medRxiv

Top 3%

0.3%

Recently, an observational study demonstrated that a lower fractional excretion of uric acid (FEUA) is significantly associated with a higher risk of kidney failure. This study aimed to assess the efficacy of switching from febuxostat to dotinurad, which increases FEUA, in patients with chronic kidney disease (CKD) and hyperuricemia (HUA). This was a non-randomized, open-label, single-center, prospective, single-arm study involving 60 patients with CKD and HUA who received febuxostat. Participants first underwent a 3-month observation period, followed by a 3-month intervention period, during which treatment was switched from febuxostat to dotinurad. The primary outcomes were changes from baseline to 3 months after switching in the estimated glomerular filtration rate (eGFR) calculated from serum creatinine (eGFRcreat) and serum cystatin C (eGFRcys), as well as the serum uric acid levels. The secondary outcome was defined as the correlation between ΔFEUA and the changes in both eGFRcreat (ΔeGFRcreat) and eGFRcys (ΔeGFRcys), respectively. During the observation period, mean eGFRcreat decreased significantly. The baseline eGFRcreat (mL/min/1.73 m²) was 36.0 ± 15.2, and the serum urate level (mg/dL) was 5.5 ± 1.2. During the intervention period, eGFRcreat increased in contrast to the significant decline observed in eGFRcys. After 3 months of switching to dotinurad, the mean serum UA levels increased significantly from 5.5 ± 1.2 to 6.1 ± 1.4 mg/dL, despite a significant elevation in FEUA. Both ΔeGFRcreat and ΔeGFRcys after switching to dotinurad were positively correlated with ΔFEUA. Switching from febuxostat to dotinurad resulted in discrepant changes in eGFRcreat and eGFRcys, suggesting that renal function should be assessed carefully after the switch. Additionally, the risk of elevated serum UA levels should be considered when switching from febuxostat to dotinurad in patients with CKD.

10

Galangin and Caffeic acid inhibit Methylglyoxal-induced Advanced Glycation End Product formation in Bovine Serum Albumin

Kanojia, N.; Tiku, A.

2026-07-15 biophysics 10.64898/2026.07.09.737425 medRxiv

Top 3%

0.3%

Glycation, a non-enzymatic reaction occurring between sugars and biological macromolecules, plays a critical role in ageing and disease pathogenesis. Methylglyoxal (MG) is a highly reactive α-oxoaldehyde that leads to the formation of endogenous advanced glycation end products (AGEs). These AGEs are associated with diabetes and many other diseases, including neurodegeneration and cancer. This is often through interactions with the receptor for advanced glycation end products (RAGE). Inhibition of glycation/AGEs formation using natural products to target cancer is an area of recent interest. In vitro AGEs formation was observed by browning of samples, increased fluorescence, and carbonyl stress. MG-induced changes in the structure of BSA were analysed using electrophoresis, spectroscopy, TEM, AFM, DLS, and CD spectroscopy. Our results show that AGEs form random structures, oligomeric aggregates, and β-sheets. Thioflavin T and Congo red staining further validated these findings. Galangin and Caffeic acid demonstrated significant antiglycation activity, suppressing AGEs formation in vitro. Highlights: Methylglyoxal-induced Advanced Glycation End Products were prepared in vitro. Methylglyoxal-induced structural modifications in BSA. AGEs were characterised using various parameters. Both fluorescent and non-fluorescent AGEs were formed. Phytochemical treatment induced inhibition of AGEs formation.

11

Storing >1 byte of information in 16S ribosomal RNA using orthogonal trans-splicing ribozymes

Dysart, M. J.; Fang, L.; Karinje, L. K.; Chappell, J.; Stadler, L. B.; Silberg, J. J.

2026-07-15 synthetic biology 10.64898/2026.07.14.738544 medRxiv

Top 4%

0.2%

Catalytic-RNA (cat-RNA) expressed from mobile DNA can record cellular events, such as the uptake of plasmids via horizontal gene transfer, by splicing a barcode onto 16S ribosomal RNA (rRNA) - a system termed RNA addressable modification (RAM). However, scaling RAM to record multiple simultaneous biological events requires large numbers of orthogonal cat-RNA whose signals reflect the biological features under investigation rather than variability arising from the barcode sequence. Here, we explore how to design orthogonal cat-RNA to record information about multiple plasmid-encoded traits in parallel. We show that cat-RNA having tRNA-derived barcodes with sequence variation in the anticodon stem-loop present greater signal consistency within Escherichia coli than mRNA-derived barcodes. When orthogonal cat-RNA designs harboring tRNA-derived barcodes were evaluated in Vibrio natriegens and Pseudomonas putida, increased variance was observed compared with Escherichia coli. Nevertheless, the signal consistency was sufficient to use these orthogonal cat-RNAs to report on the relative activities of four promoters and two origins of replication by sequencing barcoded-rRNA derived from the three organisms. These results show how RAM can be multiplexed to report on mobile DNA features in microbial communities and illustrate the importance of accounting for variability in RNA outputs when designing and interpreting multiplexed RNA barcoding data.

12

The Impact Mechanism of Screen Time on Depression Among Chinese College Students: A Chain Mediation Model of Sleep Quality and Emotion Regulation

Liang, C.; Zhang, D.-y.; Li, K.-x.; Li, B.; Lou, H.; Zhu, S.; Yu, S.-h.; Han, S.-s.

2026-07-21 public and global health 10.64898/2026.07.20.26358281 medRxiv

Top 4%

0.2%

Purpose This study aimed to examine the association between screen time and depressive symptoms among Chinese college students, and to investigate the mediating roles of sleep quality and emotion regulation in this relationship. Furthermore, a serial mediation model was constructed to elucidate the underlying psychological mechanisms linking screen exposure to depression. Methods A stratified cluster sampling method was employed to recruit 10,999 college students for a cross-sectional questionnaire survey. Data were collected on screen time, sleep quality, emotion regulation ability, and depressive symptoms. Descriptive statistics, correlation analyses, and regression analyses were conducted using SPSS 26.0. A serial mediation model was tested using the PROCESS macro (Model 6), and bootstrapping procedures were applied to estimate the significance of indirect effects. Results Correlation analyses indicated that screen time was significantly positively associated with depressive symptoms (r = 0.16, p < 0.01) and sleep quality (r = 0.15, p < 0.01), and significantly negatively associated with emotion regulation (r = -0.13, p < 0.01). Sleep quality was positively correlated with depressive symptoms (r = 0.31, p < 0.01), whereas emotion regulation was negatively correlated with depressive symptoms (r = -0.42, p < 0.01). Regression analyses further showed that screen time significantly positively predicted depressive symptoms (β = 0.712, p < 0.001), positively predicted sleep quality (β = 0.217, p < 0.001), and negatively predicted emotion regulation (β = -0.085, p < 0.001). In addition, both sleep quality (β = 1.318, p < 0.001) and emotion regulation (β = -0.424, p < 0.001) were significant predictors of depressive symptoms. Mediation analyses demonstrated that sleep quality significantly mediated the association between screen time and depressive symptoms (95% CI [0.239, 0.332]), as did emotion regulation (95% CI [0.269, 0.416]).
Moreover, a significant serial mediation effect of sleep quality and emotion regulation was observed in the relationship between screen time and depressive symptoms (95% CI [0.082, 0.117]). Conclusion Screen time is significantly associated with depressive symptoms among college students, with sleep quality and emotion regulation serving as important mediating mechanisms. Extended screen exposure may be linked to higher levels of depressive symptoms by impairing sleep quality and weakening emotion regulation capacity.

13

Evaluation of polygenic risk scores and ambient air pollutants for lung cancer risk stratification in a lung cancer screening cohort

Trap, L.; Buyukcelik, R.; Antonissen, N.; Sidorenkov, G. A.; Ruiter, R.; Van Heemst, J.; Sedaghati-Khayat, B.; Stikker, B. S.; Dumoulin, D. W.; Gietema, H. A.; Heuvelmans, M. A.; Mohamed Hoesein, F. A. A.; De Jong, P. A.; Uitterlinden, A. G.; Brusselle, G.; Jacobs, C.; Aerts, J. G. J. V.; Vermeulen, R. C. H.; De Bock, G. H.; Groen, H. J. M.; Vliegenthart, R.; Downward, G. S.; Stadhouders, R.; Van Rooij, J.; NELSON-POP consortium

2026-07-16 respiratory medicine 10.64898/2026.07.14.26358054 medRxiv

Top 4%

0.2%

Background: Randomized controlled trials have shown that computed tomographic (CT) screening reduces lung cancer mortality. Improved identification of at-risk groups, by leveraging non-smoking risk factors, could help refine screening selection. Aim: To evaluate polygenic risk scores (PRSs) and ambient air pollution (AAP) exposure for risk stratification in the NELSON lung cancer screening cohort. Methods: Two PRSs (PRS-McKay/PRS-Byun) and several AAPs (including nitrogen dioxide, ozone, and particulate matter [PM]) were assessed in the NELSON lung cancer screening trial (N=7,364). PRSs were validated in the Rotterdam Study (N=11,493). Associations with lung cancer, mortality, screening results, and discriminative ability to distinguish lung cancer were evaluated. Results: PRS-McKay and PRS-Byun were associated with lung cancer (odds ratio [OR] per SD [95%CI]: 1.22 [1.08-1.37] and 1.28 [1.13-1.44], respectively) and lung cancer-specific mortality (OR [95%CI]: 1.24 [1.05-1.47], for both), but not with non-lung cancer mortality (OR [95%CI]: 1.01 [0.94-1.10] and 1.03 [0.95-1.12], respectively). Exposure to PM2.5 was associated with lung cancer (OR [95%CI]: 1.11 [1.01-1.22]). PM constituents were associated with adenocarcinoma, particularly PM10 (OR [95%CI]: 1.16 [1.01-1.32]) and ultra-fine particles (OR [95%CI]: 1.16 [1.04-1.30]). PRS and AAP added modestly to the discriminative ability for lung cancer on top of pack-years, age, and sex (area under the curve [95%CI]: 0.659 [0.624-0.695] vs. 0.643 [0.608-0.679]). Conclusions: PRSs and exposure to PM were associated with lung cancer in a high-risk screening population. The primary potential of PRSs may reside in refining lung cancer screening selection toward individuals at higher risk of dying from lung cancer specifically.

14

Benchmarking Speech Recognition Models for Medical Consultations in Latin American Spanish: A Comparative Evaluation with Fine-Tuning

Carrillo, R. M.; Carbajal Serrano, A.; Condori Pinedo, P. S.

2026-07-16 public and global health 10.64898/2026.07.14.26358062 medRxiv

Top 4%

0.2%

BACKGROUND: Artificial intelligence (AI) medical scribes rely on speech-to-text (STT) models for transcription. Evaluations of STT models in non-English settings remain scarce. We benchmarked ten STT models on medical consultations in Latin American (LatAm) Spanish and assessed whether fine-tuning improves transcription accuracy. METHODS: Ten YouTube videos depicting medical consultations were used. Human transcriptions were the ground truth. Five open-source models were evaluated: Whisper Large, Whisper Large v3, Whisper Large v3 Turbo, Voxtral Mini 3B, and Canary 1B v2; and so were five closed-source models: gpt-4o-transcribe, gpt-4o-mini-transcribe, gemini-2.5-pro, Eleven Labs, and Assembly AI. Whisper Large v3 was fine-tuned. One video was withheld from training. Performance was assessed using Word Error Rate (WER), Character Error Rate (CER), BLEU Score, ROUGE-L, BERT Score, and Semantic Similarity on the one withheld video. RESULTS: None of the fine-tuning iterations outperformed the vanilla Whisper Large v3. On the withheld video, Gemini-2.5-pro was the closed-source model with the best performance in four of six metrics. In comparison to the closed-source models, the fine-tuned model never outperformed the other models (withheld video); conversely, in comparison to the open-source models, the fine-tuned model showed better performance across metrics, for instance: BLEU score (63% vs 58% for the second-ranking model), BERT Score (89% vs 86%), semantic similarity (89% vs 83%), and CER (19% vs 20%). CONCLUSIONS: Whisper Large v3 and its fine-tuned variant are the best open-source STT models for transcribing medical conversations in LatAm Spanish. These findings provide an evidence base for developing AI medical scribes tailored to Spanish-speaking LatAm.
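
As a rough illustration of the two error-rate metrics named in the abstract above, WER and CER can be computed as an edit distance divided by the reference length. The sketch below is not the authors' evaluation code; the function names, the lowercase-and-whitespace-split normalization, and the example sentences are all assumptions made for illustration.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))  # distance from an empty reference prefix
    for i, r in enumerate(ref, 1):
        curr = [i]  # deleting the first i reference items
        for j, h in enumerate(hyp, 1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word Error Rate: word-level edit distance over the number of reference words."""
    ref = reference.lower().split()  # assumed normalization, hypothetical
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref, hyp) / len(ref)


def cer(reference, hypothesis):
    """Character Error Rate: character-level edit distance over the reference length."""
    ref = list(reference.lower())
    hyp = list(hypothesis.lower())
    if not ref:
        raise ValueError("reference must contain at least one character")
    return edit_distance(ref, hyp) / len(ref)


# Hypothetical example: one of five reference words is dropped by the transcriber.
print(wer("el paciente tiene fiebre alta", "el paciente tiene fiebre"))
```

Lower is better for both metrics, unlike BLEU, ROUGE-L, BERT Score, and semantic similarity, where higher values indicate closer agreement with the human transcription.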

15

Qualitative analysis of the Life Skills training strategy at a public university in Colombia, 2024

Ortiz Ruiz, N.; LOPEZ PAZ, Y.; Orobio Lerma, Y. P.; Burgos Davila, D.; Medina Zapata, H. J.; Manzano Valencia, K. P.; Almeida Espinosa, A.

2026-07-18 public and global health 10.64898/2026.07.16.26358291 medRxiv

Top 5%

0.2%

This qualitative study evaluated the Life Skills (HpV) training strategy at a public university in Colombia in 2024, analyzing its impact on students' positive mental health and psychosocial competencies. The mixed-methods research employed semi-structured interviews with faculty and three student focus groups, using thematic analysis to categorize strengths, weaknesses, and perceived changes. Results highlighted curricular coherence, academic freedom, and participatory methodology as key aspects, fostering self-awareness, emotional management, and the building of support networks. Students reported improvements in well-being, stress management, and academic performance, though challenges such as initial resistance to emotional content and student diversity were identified. The conclusions underscore the value of experiential courses in university education, promoting horizontal relationships and safe spaces for collective reflection. Future studies are recommended to expand participant diversity and incorporate quantitative data triangulation to further explore the intervention's effects.

16

Developing and Prospectively Validating a Reproducible Graph Representation Specification for Clinical Guideline Algorithms: The Measurement Foundation of the Clinical Guideline Complexity Index

Milani, R. V.; Bober, R. M.

2026-07-20 health informatics 10.64898/2026.07.17.26358358 medRxiv

Top 5%

0.2%

Background. Translating a clinical guideline decision algorithm into a computational graph requires judgment, and unconstrained coding yields divergent graphs; any complexity measure computed from such a graph inherits that variation, so its reproducibility must be demonstrated rather than assumed. Objective. To develop, and prospectively test, an empirical method for making graph extraction reproducible, using the Clinical Guideline Complexity Index (CGCI) and four guideline algorithms as a case study. Methods. We built a Graph Representation Specification (an ontology, a motif catalogue, disambiguation conventions, decomposition rules, a deterministic validator, and a scoring engine) and refined it by error-driven grammar induction: measure inter-coder disagreement, localize its dominant class, induce a single grammar rule, and prospectively test whether that rule improves agreement in the anticipated class. Reproducibility was quantified with a pre-specified, topology-based endpoint (Decision Topology Agreement) rather than edge agreement, which is oversensitive to representational choices that do not affect the score. Two trained coders independently coded the diabetes, dyslipidemia, heart-failure, and hypertension algorithms. Results. A rule induced from the diabetes comorbidity panel (assessment topology) generated a pre-specified prediction that heart-failure figures, sharing the same motif, would converge; on a fresh, independently coded pair they did, with an absolute CGCI difference of approximately one. Decision topology reproduced closely (decision-order agreement at or near 1.00 for three of four guidelines), while breadth counting was rule-sensitive: an explicit modifier-counting rule reduced the largest disagreement from 27 to 4 tokens. Residual disagreement was bounded and localizable to specific, nameable representational choices. Conclusions. Graph-extraction reproducibility can be systematically improved through iterative grammar refinement, and a prospectively derived rule can be confirmed to improve agreement. These results establish the measurement foundation (reliability, not construct validity) for a companion study interpreting CGCI as cognitive load, and the method may apply wherever graphs are extracted from structured source artifacts.

17

Safety Transparency in Animal Cell-Cultured Ingredients for Pet Food: A Case Study Establishing the Standard for Public Disclosure

Tewari, R.; Soukup, R.; Hadjistylianou, L.; Manicone, M.; Serra, M.; Felbermair, M.; Falconer, S.

2026-07-15 cell biology 10.64898/2026.07.14.738473 medRxiv

Top 6%

0.2%

Animal cell-cultured ingredients are entering the EU and UK pet food markets under frameworks that do not require pre-market, ingredient-level safety assessments, creating an ethical need for transparent safety disclosure. We present the first public safety dossier for this sector, describing the proprietary mouse embryonic stem cell line PE25 and its derived, non-viable cellular and conditioned media ingredient produced in food and feed-grade media. PE25 characterization confirmed Mus musculus identity, sterility, absence of mycoplasma and replication-competent retroviruses, and stable growth. Doxorubicin-induced p53 stress testing, CD44/BMI1 profiling, and soft agar assays showed no cancer-like traits and a non-tumorigenic profile; the final ingredient contains no viable cells. Independent OECD TG 471 and 487 assays confirmed non-genotoxicity. Heavy metals, biogenic amines, solvents, and chemical residues were below regulatory limits. Given process variability, we recommend case-by-case safety evaluation and propose this dossier as a model for responsible commercialization.

18

Trends and Future Burden of Major Gastrointestinal Cancers in Jiangsu Province, China, 2010-2030

Zou, Y.; Wang, W.; Tao, L.; Zhu, H.; Ju, H.; Pan, L.; Wang, W.

2026-07-17 public and global health 10.64898/2026.07.16.26358207 medRxiv

Top 6%

0.1%

Aim: To assess temporal trends in incidence and mortality and project the future burden of five major gastrointestinal cancers in Jiangsu Province, China. Methods: Population-based cancer registry data from Jiangsu Province between 2010 and 2021 were used to analyze the burden of esophageal, gastric, colon, rectal, and liver cancers. Age-standardized incidence and mortality rates were calculated and compared by cancer type, sex, and urban-rural residence. Joinpoint regression was used to estimate annual percentage changes (APC) and average annual percentage changes (AAPC). The APC from the most recent Joinpoint segment was used to project incidence and mortality rates to 2030. Results: In 2021, gastric cancer had the highest age-standardized incidence and mortality among the five cancers. Incidence and mortality were consistently higher in males than in females and increased markedly after 50 years of age. From 2010 to 2021, age-standardized incidence and mortality declined for esophageal, gastric, and liver cancer, but increased for colon and rectal cancer. Colon cancer showed the steepest increase in both incidence and mortality. Rural areas experienced faster increases in colon and rectal cancer burden than urban areas. Projections to 2030 suggest continued declines in esophageal, gastric, and liver cancer, while colon cancer incidence and mortality are expected to rise further. Conclusion: Jiangsu Province is experiencing a transition in gastrointestinal cancer burden, with continued declines in esophageal, gastric, and liver cancers but an emerging and growing burden of colorectal cancer, especially colon cancer. Prevention strategies should focus on expanding colorectal cancer screening and early diagnosis, particularly in rural areas, while sustaining control of esophageal, gastric, and liver cancers.
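
The projection step described in the abstract above can be illustrated with a simple compound-change calculation: if a rate changes by a constant APC each year, the rate after n years is the base rate multiplied by (1 + APC/100)^n. This is only a sketch under that assumption; the abstract does not give the authors' exact projection procedure, and the function name and all numbers below are hypothetical, not values from the study.

```python
def project_rate(base_rate, apc_percent, base_year, target_year):
    """Project an age-standardized rate forward assuming a constant annual percentage change.

    base_rate:    rate observed in base_year (e.g. per 100,000), hypothetical input
    apc_percent:  annual percentage change, e.g. 2.5 for +2.5% per year
    """
    years = target_year - base_year
    return base_rate * (1 + apc_percent / 100) ** years


# Hypothetical inputs for illustration only (not taken from the registry data).
rising = project_rate(base_rate=20.0, apc_percent=3.0, base_year=2021, target_year=2030)
falling = project_rate(base_rate=20.0, apc_percent=-3.0, base_year=2021, target_year=2030)
print(f"rising: {rising:.2f}, falling: {falling:.2f}")
```

Because the change compounds, a positive and a negative APC of the same size do not produce symmetric results over the nine-year horizon from 2021 to 2030.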

19

First detection of peroxynitrite in live coral cells during thermal stress

Fuller, I. D.; Fetkenhour, K. P.; Kumar, G. D.; Domaille, D. W.; Roger, L. M.

2026-07-15 biochemistry 10.64898/2026.07.14.738561 medRxiv

Top 6%

0.1%

Reactive nitrogen species (RNS), particularly peroxynitrite generated from the reaction of superoxide and nitric oxide, are implicated in thermally-induced oxidative stress but remain difficult to resolve in live coral cells. We optimized fluorescent dye strategies to directly quantify superoxide, nitric oxide, and peroxynitrite production in thermally stressed Pocillopora acuta cell suspensions. Thermal stress was associated with an increase in intracellular peroxynitrite concentration, but not in its precursors, nitric oxide and superoxide, highlighting challenges with the application of fluorescent probes and their controls to live coral cells. Compounds developed for mammalian systems often translate poorly to non-model systems such as corals: strong endogenous fluorescence and multiple membrane barriers within the coral symbiocyte, for instance, limited the function of the nitric oxide probe, DAF-2DA. Despite these limitations, the detection of peroxynitrite in live, thermally stressed P. acuta cells represents a step forward in understanding the mechanism of coral bleaching. We also outline strategies for improving the performance of commercial dyes in non-model systems, including media optimization with EDTA treatment to preserve both cell viability and probe performance.

20

Introducing PHJ Media: A Unique Machine Learning-Driven Basal Formulation to Overcome Recalcitrance for Multi-Genotype Micropropagation of Cannabis sativa L.

Pepe, M.; Hesami, M.; Jones, M.

2026-07-15 plant biology 10.64898/2026.07.14.738465 medRxiv

Top 6%

0.1%

Applications of tissue culture are critical for Cannabis sativa L. (cannabis), supporting clonal propagation, germplasm preservation, and pathogen elimination, among other biotechnological applications. However, the extensive genetic diversity of cannabis results in highly variable responses to in vitro conditioning, and no consensus basal media formulation exists to support reproducible micropropagation across genotypes. To address these limitations, a hybridized ensemble-NSGA-II approach was employed for concurrent optimization of individual media components to create a species-specific, cultivar-inclusive basal salt formulation for cannabis micropropagation. The resulting PHJ media represents a unique formulation that overcomes recalcitrance across a wide array of cannabis cultivars, facilitating improved growth and uniformity for the nine cultivars used in its development and validation. These results remain consistent from explant initiation through multiple rounds of subculture. The ability of PHJ to overcome genotypic recalcitrance suggests potential applicability to an array of plant species beyond cannabis. Additionally, robust performance both with and without plant growth regulators underscores the plausible use of PHJ for diverse applications beyond standard micropropagation. Ultimately, this cultivar-inclusive basal medium demonstrates utility for both scientific research and industrial-scale operations.