Back

Patterns

15 training papers 2019-06-25 – 2026-03-07

Top medRxiv preprints most likely to be published in this journal, ranked by match strength.

1
Leveraging Generative Artificial Intelligence for Enhanced Data Augmentation in Emotion Intensity Classification: A Comprehensive Framework for Cross-Dataset Transfer Learning
2026-03-03 health informatics 10.64898/2026.02.23.26346928
#1 (2.1%)
Show abstract

Data scarcity and stylistic heterogeneity pose major challenges for emotion intensity classification. This paper presents a cross-dataset augmentation framework that leverages prompt-conditioned generative models alongside deterministic and heuristic transformations to synthesize target-style examples for improved transfer learning. We introduce a unified taxonomy of augmentation strategies--Heuristic Lexical Perturbation (HLA), Prompt-Conditioned Generative Augmentation (CGA), Sequential Hybrid...

2
Detection of Malaria Infection from parasite-free blood smears
2026-01-05 health informatics 10.64898/2025.12.29.25343125
#1 (2.0%)
Show abstract

Malaria affects almost 263 million people worldwide, most of whom live in sub-Saharan countries. In a strategy to reduce malaria-related mortality and limit transmission, diagnosis in endemic areas needs to be immediately available on the field, easy to perform and cheap. Therefore, it currently heavily relies on microscopic examination of blood smears. However, several studies comparing the sensitivity of this approach with qPCR, considered as the most sensitive method albeit not available on t...

3
Enhancing Prediabetes Diagnosis from Continuous Glucose Monitoring Data via Iterative Label Cleaning and Deep Learning
2026-03-05 health informatics 10.64898/2026.03.04.26347604
#1 (2.0%)
Show abstract

As of early 2026, over 115 million US adults (more than 1 in 3) have prediabetes, a condition with an annual conversion rate of 5%-10% to type 2 diabetes. Total diabetes (diagnosed and undiagnosed) affects approximately 40.1 million Americans, or 12% of the population, with roughly 1.5 million new cases diagnosed annually. Continuous Glucose Monitoring (CGM) provides real-time, 24/7 insights into glycemic variability, detecting dangerous highs, lows, and trends that HbA1c (a 3-month average) mis...

4
Quantifying the severity of patient safety events via statistical natural language processing
2025-12-27 health informatics 10.64898/2025.12.22.25342876
Top 0.1% (1.9%)
Show abstract

Medical errors are one of the leading causes of death in the United States. Several public databases have been built to record patient safety events across healthcare systems to better understand and improve safety hazards. These reports typically include both structured fields (e.g., event type, device, manufacturer) and unstructured data elements (free text narrative of what happened). The structured fields are usually restricted to a limited number of categories, whereas the unstructured fiel...

5
Race, Ethnicity and Their Implication on Bias in Large Language Models
2026-01-05 health informatics 10.64898/2026.01.04.26343415
Top 0.1% (1.9%)
Show abstract

Large language models (LLMs) increasingly operate in high-stakes settings including healthcare and medicine, where demographic attributes such as race and ethnicity may be explicitly stated or implicitly inferred from text. However, existing studies primarily document outcome-level disparities, offering limited insight into internal mechanisms underlying these effects. We present a mechanistic study of how race and ethnicity are represented and operationalized within LLMs. Using two publicly ava...

6
Data-Driven Hybrid Model of SARIMA-CNNAR For Tuberculosis Incidence Time Series Analysis in Nepal
2026-02-24 health informatics 10.64898/2026.02.22.26346853
Top 0.2% (1.8%)
Show abstract

BackgroundTuberculosis (TB) remains a major public health challenge in Nepal, with incidence rates substantially higher than global estimates. Accurate forecasting of TB incidence is essential for early warning systems, resource allocation, and targeted interventions. This study aimed to develop and validate a hybrid Seasonal Autoregressive Integrated Moving Average (SARIMA) and Convolutional Neural Network Auto-Regressive (CNNAR) model for TB incidence forecasting in Nepal. MethodsMonthly TB i...

7
The Forgotten Shield: Safety Grafting in Parameter-Space for Medical MLLMs
2025-12-22 health systems and quality improvement 10.64898/2025.12.19.25342673
Top 0.2% (1.6%)
Show abstract

Medical Multimodal Large Language Models (Medical MLLMs) have achieved remarkable progress in specialized medical tasks; however, research into their safety has lagged, posing potential risks for real-world deployment. In this paper, we first establish a multidimensional evaluation framework to systematically benchmark the safety of current SOTA Medical MLLMs. Our empirical analysis reveals pervasive vulnerabilities across both general and medical-specific safety dimensions in existing models, p...

8
Efficient Citation Screening by Weak Classifier Ensemble
2026-01-08 health informatics 10.64898/2026.01.07.26343635
Top 0.2% (1.6%)
Show abstract

Citation screening in systematic review is time-consuming. Machine learning can help semi-automate it but faces obstacles. Each systematic review is a new dataset without initial annotations. Extreme class imbalance against irrelevant studies makes it difficult to select a good subset of samples to train a classifier. The rigid requirement of a (near) total recall of relevant studies demands a careful trade-off between accuracy and recall. This paper pilots a weak classifier ensemble approach to...

9
Diagnostic Value of Elastography in Differentiating Parathyroid Adenoma from Hyperplasia: A Systematic Review and Meta-Analysis
2025-12-11 radiology and imaging 10.64898/2025.12.11.25342045
Top 0.3% (1.6%)
Show abstract

BackgroundDifferentiating parathyroid adenoma from hyperplasia is critical for surgical planning, but conventional imaging often cannot reliably distinguish these lesions. Ultrasound elastography offers quantitative assessment of tissue stiffness and may improve preoperative characterization. PurposeTo evaluate the diagnostic accuracy of ultrasound elastography in differentiating parathyroid adenoma from hyperplasia. MethodsA systematic review and meta-analysis was conducted in accordance with...

10
Understanding Clinician Edits to Ambient AI Draft Notes: A Feasibility Analysis Using Large Language Models
2026-03-02 health informatics 10.64898/2026.02.27.26347290
Top 0.3% (1.6%)
Show abstract

Ambient AI documentation tools generate draft notes that clinicians can review and edit before signing off in electronic health records. Scalable computational approaches to characterize how clinicians modify drafts remain limited, yet are essential for evaluating and improving AI effectiveness. We examined the feasibility of a few-shot prompted large language model (LLM) for categorizing sentence-level edits between AI drafts and final documentation. We developed five label-specific binary mode...

11
Data Quality Assurance Tool for the Acute to Chronic Pain Signatures Study (A2CPS): An Interactive R Shiny Application
2026-01-08 health informatics 10.64898/2026.01.07.26343620
Top 0.3% (1.5%)
Show abstract

Background/AimsClinical trials and observational studies support the synthesis and development of clinical guidelines, highlighting the need for strong data quality assurance measures. The Acute to Chronic Pain Signatures (A2CPS) program is a large-scale, multi-site observational study investigating chronic post-surgical pain and opioid dependence. Its primary goal is to identify biomarkers predictive of progression from acute to chronic pain following knee arthroplasty or thoracic surgery. The ...

12
A deterministic safety pipeline for therapeutic AI in elderly assisted living
2026-02-18 health informatics 10.64898/2026.02.17.26346507
Top 0.3% (1.5%)
Show abstract

Over 54 million Americans are aged 65+, with depression affecting 25-49% and anxiety exceeding 30% of assisted living residents. AI systems employing agentic orchestration exhibit 0.5-2% failure rates--unacceptable where a single missed crisis can be fatal. We designed and bench-evaluated Lilo Engine, a 5-layer deterministic therapeutic pipeline replacing a prior multi-agent orchestrator. Safety is enforced through structural invariants: a Guardian layer with 4-gate OR crisis detection runs unco...

13
An LLM-assisted framework for accelerated and verifiable clinical hypothesis testing from electronic health records
2026-02-12 health informatics 10.64898/2026.02.10.26346008
Top 0.3% (1.5%)
Show abstract

Acquiring insights from electronic health records (EHRs) is slowed by manual analytical workflows that limit scalability and reproducibility. We present LATCH (LLM-Assisted Testing of Clinical Hypotheses), an agentic framework that converts natural language clinical hypotheses into fully auditable analyses on structured EHR data. LATCH integrates LLM-assisted semantic layers with deterministic execution pipelines to automate cohort construction, statistical analysis, and result reporting, while ...

14
Optimizing Temporal Windows for Wearable-Augmented Post-Discharge Risk Prediction: A Methods Study
2026-01-23 health informatics 10.64898/2026.01.21.26344487
Top 0.4% (1.5%)
Show abstract

ObjectiveTo identify optimal modeling parameters for dynamically predicting hospital readmission risk using post-discharge step-count data from remote monitoring devices. MethodsWe combined data from two clinical studies that collected wearable or smartphone-based activity data for up to 6 months after hospital discharge. Analyses were limited to older adults ([≥]55 years). We constructed a patient-day dataset incorporating static demographic and clinical variables and dynamic activity featu...

15
Sino-US-DrugQA: A Benchmark for Evaluating Large Language Models in Cross-Jurisdictional Pharmaceutical Regulation
2026-02-17 health informatics 10.64898/2026.02.13.26346236
Top 0.4% (1.5%)
Show abstract

Cross-jurisdictional pharmaceutical compliance requires comparative analysis of regulatory requirements across jurisdictions such as the US FDA and Chinas NMPA. Although large language models (LLMs) are increasingly explored for healthcare-related applications, their performance in cross-jurisdictional regulatory comparison has not been systematically characterized using dedicated benchmarks. This study introduces Sino-US-DrugQA, a bilingual benchmark dataset designed to evaluate LLM performance...

16
Patient-Centric Markov-Chain Framework for Predicting Medication Adherence Using De-Identified Data
2026-02-10 health informatics 10.64898/2026.02.08.26345856
Top 0.4% (1.5%)
Show abstract

Long-term adherence to prescribed therapies remains a persistent challenge in chronic and ultra-rare conditions where clinical outcomes depend on continuous medication use. Even brief gaps in therapy can compromise disease control, yet patients frequently encounter structural barriers including high out-of-pocket costs, prior-authorization (PA) delays, annual re-verification cycles, and refill logistics that disrupt persistence. This study evaluates a patient-centric Markov-chain framework for a...

17
A Clinical Theory-Driven Deep Learning Model for Interpretable Autism Severity Prediction
2026-01-26 health informatics 10.64898/2026.01.25.26344792
Top 0.4% (1.4%)
Show abstract

Autism spectrum disorder (ASD) affects a substantial proportion of children worldwide, yet clinical assessment of symptom severity remains resource-intensive and unevenly accessible. Artificial intelligence (AI) has transformative potential to support scalable and timely severity assessment from behavioral data, but existing approaches largely treat autism as a monolithic prediction target and rely on opaque models that are difficult for clinicians to interpret or trust. Moreover, prior multimod...

18
Handling onset age inconsistencies in longitudinal healthcare survey data
2026-02-23 health informatics 10.64898/2026.02.20.26346741
Top 0.5% (1.4%)
Show abstract

AO_SCPLOWBSTRACTC_SCPLOWLongitudinal healthcare surveys frequently contain inconsistencies in self-reported onset ages, where participants report different ages for the same condition between enrollment and follow-up surveys. We propose two methods to handle this challenge. First, we introduce a procedure that aggregates inconsistency patterns to construct participant-level reliability scores, enabling researchers to stratify participants and prioritize analysis on high-reliability cohorts. Seco...

19
Automated Burn Detection from Images Using Deep Learning Models: The Role of AI in the Triage of Burn Injuries
2025-12-31 health informatics 10.64898/2025.12.24.25337638
Top 0.5% (1.4%)
Show abstract

Burn injuries are a significant concern in developing countries due to limited infrastructure, and treating them remains a major challenge. The manual assessment of burn severity is subjective and depends, to a large extent, on individual expertise. Artificial intelligence can automate this task with greater accuracy and improved predictions, which can assist healthcare professionals in making more informed decisions while triaging burn injuries. This study established a model pipeline for detec...

20
Comparison of local large language models for extraction of signs and symptoms data from electronic health records
2025-12-16 health informatics 10.64898/2025.12.11.25341954
Top 0.5% (1.3%)
Show abstract

Electronic health records (EHRs) provide a large source of data that can be used for research purposes. Extraction of information from unstructured clinical notes in EHRs can be automated by large language models (LLMs). Although LLMs are promising for this task, challenges remain in reliable application of LLMs to EHR, including the lack of development and validation for languages other than English. Here, we identified Dutch LLMs and compared their performance in a case study. We selected the ...