Back

IUCrJ

International Union of Crystallography (IUCr)

Preprints posted in the last 30 days, ranked by how well they match IUCrJ's content profile, based on 29 papers previously published here. The average preprint has a 0.02% match score for this journal, so anything above that is already an above-average fit.

1
Multi-state Ensemble Refinement for Occupancy Statistics (MEROS) in Time-Resolved X-ray Crystallography

Prester, A.; Spiliopoulou, M.; Schulz, E. C.

2026-05-07 biochemistry 10.64898/2026.05.04.722701 medRxiv
Top 0.1%
22.5%
Show abstract

Accurate determination of state occupancies is essential for interpreting the structural heterogeneity inherent in time-resolved crystallography. However, in cases of high spatial overlap between states, as commonly observed in time-resolved crystallography data, the strong correlation between occupancy and atomic displacement parameters (ADPs) can render single point estimates from standard refinement protocols unreliable. We introduce MEROS (Multi-state Ensemble Refinement for Occupancy Statistics), a pipeline that implements an ensemble refinement approach to assess the post-refinement occupancy-ADP statistics of multiple overlapping states. MEROS utilizes a Monte Carlo sampling of the parameter space, performing independent refinements from randomized starting occupancies and ADP values to empirically characterize the convergence and uncertainty of the solution. The method is implemented as a modular Python pipeline that wraps established refinement programs, ensuring compatibility with existing workflows. We demonstrate its applicability in two case studies: a two-state ligand binding model in T4 lysozyme L99A and a four-state covalent catalysis mechanism in {beta}-lactamase CTX-M-14. MEROS provides occupancy and ADP mean values with standard deviations that directly quantify the informational content of the experimental diffraction data.

2
LowDoseWizard - rapid and standardised setup of low-dose cryo-TEM imaging in SerialEM

Fromm, S. A.; Mattei, S.

2026-05-08 molecular biology 10.64898/2026.05.05.722937 medRxiv
Top 0.1%
4.9%
Show abstract

Structure elucidation of biological macromolecules by single particle cryogenic electron microscopy (SPA cryo-EM) or cryogenic electron tomography (cryo-ET) relies on low-dose imaging on cryogenic transmission electron microscopes (cryo-TEMs). Routine microscope setup remains technically demanding and can be time-consuming, particularly for inexperienced or infrequent users. We present LowDoseWizard, a guided workflow implemented in SerialEM that enables rapid and standardised setup of cryo-TEM imaging conditions. From minimal user input, the workflow configures microscope optics, camera parameters and image shift settings for all low-dose imaging states, and guides the user through key daily alignment procedures including beam shift offset calibration, objective lens astigmatism correction and coma-free alignment. The workflow is organised into modular routines that can be executed sequentially or independently, while microscope-specific acquisition parameters are defined in editable configuration files, allowing flexible adaptation to different instruments without modification of the core scripts. Across user sessions on three microscopes at EMBL Heidelberg, the complete setup required on average less than 15 minutes. To assess whether predefined imaging conditions generated by the workflow are compatible with high-resolution data collection, we acquired apoferritin data on a 200 kV Glacios and a 300 kV Titan Krios. These datasets yielded reconstructions at 1.62 [A] and 1.09 [A] resolution, respectively, demonstrating that rapid, guided setup can support near-atomic and atomic-resolution single particle cryo-EM. LowDoseWizard lowers the barrier to robust cryo-TEM setup, reduces the time spent on routine parameter selection and alignment, and helps users focus on sample-specific aspects of data acquisition such as target selection. The workflow should be particularly valuable in shared instrumentation environments, where accessibility, reproducibility and efficient microscope use are critical.

3
Automated LN2 refill device for uninterrupted cryoFIB-SEM operations.

Gonda, I.; Junker, D.; Eggimann, F.; Kaech, A.; Szwedziak, P.

2026-05-08 biophysics 10.64898/2026.05.06.723155 medRxiv
Top 0.1%
4.0%
Show abstract

Due to recent technological advances, in situ structural cell biology is becoming a high throughput microscopy technique as all the steps of the workflow, from sample preparation to data analysis, are executed faster, more reliable and more reproducible. Sample thinning by cryoFIB-SEM is an essential tool in preparing electron transparent lamellae of biological specimens suitable for further characterization by cryoET. Modern cryoFIB-SEM instruments can be operated remotely and are capable of automated and unsupervised lamellae preparation. To take full advantage of these developments they need a constant supply of LN2 to maintain cryogenic conditions inside the microscope chamber. Here, we introduce a custom automated LN2 refill system that is compatible with gas cooled cryostages, supports long-term cryoFIB-SEM operations and liberates the user from highly repetitive and manual work. We believe this solution can be utilized with other cryoSEM or cryoFIB-SEM devices requiring N2 gas-flow cooling.

4
StructAgent: Orchestrating Cryo-EM Model Building and Refinement with a Multi-Agent LLM System

Guo, X.

2026-05-18 biochemistry 10.64898/2026.05.18.725842 medRxiv
Top 0.1%
3.5%
Show abstract

Building and refining cryo-EM atomic models often requires long, project-specific workflows that combine map inspection, prior structural knowledge, restraints, refinement, validation and expert review. Existing programs perform many individual operations, but coordinating them across iterative model-building sessions remains manual and difficult to audit. We present StructAgent, a user-guided multi-agent resource for cryo-EM model building and refinement. StructAgent couples a domain agent for literature-grounded structural reasoning with an execution agent that runs local software, tracks state, recovers from failures and records provenance. Expert approval gates control major model-changing actions. In three case studies, StructAgent refitted a 64-chain proteasome from an earlier template, audited 530 ribosomal metal-ion sites and guided a chemically ambiguous ligand fit in a folate-metabolism enzyme from ongoing work. These demonstrations show that agentic orchestration can convert modeling intent into auditable, reviewable software workflows while preserving expert control and final scientific judgment.

5
A workflow for the identification of oligomeric structures on tilted sample planes in Cryo-SMLM

Dong, Y.; Yang, Z.; Schneider, M.; Scherzer, O.; Schuetz, G.

2026-05-14 biophysics 10.64898/2026.05.12.724524 medRxiv
Top 0.1%
3.0%
Show abstract

We introduce a workflow to identify oligomeric structures that are recorded with single-molecule localization microscopy (SMLM) under cryogenic conditions. Typically, these oligomers are assumed to consist of protomers arranged as equilateral two-dimensional polygons and every protomer is labeled with a dye molecule for visualization. Unlike previous work, we consider scenarios in which the sample plane has an unknown orientation relative to the focal plane. Our contribution is a high-precision plane-fitting algorithm to determine the sample plane, combined with geometrical transformations and two circle-fitting algorithms to identify the oligomeric structures. Our simulations on synthetic data demonstrate that the proposed workflow achieves high accuracy in estimating both the unknown tilted plane and the oligomer size.

6
Heterogeneous reconstruction algorithms for cryoEM achieve limited particle classification accuracy on real benchmark datasets

Kinman, L. F.; Grassetti, A. V.; Carreira, M. V.; Davis, J. H.

2026-05-11 biochemistry 10.64898/2026.05.08.722747 medRxiv
Top 0.1%
2.7%
Show abstract

The emergence of single-particle cryoEM as a powerful method for structure determination has in large part been fueled by its ability to resolve both single static structures and complex conformational landscapes. Indeed, modern approaches to the heterogeneous reconstruction task can resolve 100s-1,000s of different maps from a single cryoEM dataset. How accurate these algorithms are, however, has proven difficult to rigorously assess, due to a lack of suitable benchmark datasets containing both realistic noise features and ground-truth labels. To address this obstacle, we recently developed a series of benchmark datasets that leverage the targeting power of Cas9 and the programmable heterogeneity of DNA to newly offer access to ground-truth per-particle structural labels in real data. Here, we challenged two popular heterogeneous reconstruction algorithms with mixed particle stacks resampled in silico from these datasets, finding that existing approaches resolve the encoded heterogeneity with limited accuracy. In particular, in realistic particle stacks with complex, multi-scale, and multi-axis heterogeneity, we observed that reconstruction of encoded heterogeneity depended strongly on the application of prior information about where heterogeneity was expected, and that individual particle assignments were made with significant error even when the correct structural states were reconstructed. Both molecular breathing motions and data collection features, such as defocus and projection angle, contributed to the observed particle assignment error. These results highlight important shortcomings of existing heterogeneous reconstruction methods and suggest new avenues for method development in both data collection strategies and in heterogeneous classification and reconstruction algorithms.

7
An efficient eukaryotic cell-free expression and correlative cryo-electron tomography platform

Tillu, V.; Hunter, D.; Chen, K.-E.; Smith, J.; Nassar, O.; Rae, J.; Sierecki, E.; Kobe, B.; Gambin, Y.; Collins, B.; Parton, R. G.; Ariotti, N.

2026-05-04 biochemistry 10.1101/2025.10.18.683253 medRxiv
Top 0.1%
2.6%
Show abstract

Cell-free expression using Leishmania tarentolae lysates allows rapid expression of eukaryotic proteins directly from DNA templates. We develop a pipeline that combines cell-free expression system with cryogenic fluorescence microscopy that we term CC-FLEXCET (Correlative Cell-Free Leishmania EXpression and Cryo-Electron Tomography), to target and visualize expressed protein complexes by cryo-electron tomography at high resolution. We demonstrate the utility of this method by structurally characterising the filaments of the full-length apoptosis-associated speck like protein containing CARD (ASC) protein. Cell-free expression of ASC results in a polymeric structure characteristic of its cellular speck assembly. Sub-tomogram averaging allows us to resolve both the pyrin domain (PYD) to medium resolution, and show, for the first time, the arrangement of the flexibly linked caspase recruitment domain (CARD). Finally, we observed an interaction between the ASC filament and the L. tarentolae ribosome. Using template matching and quantitative approaches, we characterise this interaction and determine that there is a random structural association between the filament and the ribosome, with 57% of ribosomes oriented with the LSU toward the ASC polymer. CC-FLEXCET facilitates structural analysis of macromolecules and protein-lipid assemblies without need of purification, providing a pipeline from DNA template to protein expression to cryo-tilt series acquisition, within a single day.

8
OPUS-ET: Resolving Compositional and Conformational Heterogeneities of Biomolecules in Cryo-Electron Tomography

Luo, Z.; Chen, X.; Wang, Q.; Ma, J.

2026-05-20 molecular biology 10.1101/2025.11.21.688990 medRxiv
Top 0.2%
1.8%
Show abstract

Structural heterogeneity in biomolecules, arising from both compositional and conformational variability, limits resolution and interpretability of cryo-electron tomography (cryo-ET). Here, we present OPUS-ET, a deep learning framework that resolves multiscale heterogeneity throughout the cryo-ET workflow. OPUS-ET combines a composition decoder that captures compositional differences with a conformation decoder that models large-scale motions, thereby providing a hierarchical representation of structural heterogeneity. Starting from noisy template-matching candidates with templates of varying similarity or quality, OPUS-ET efficiently enriches target particle populations and delivers sub-nanometer in situ reconstructions in a single round. It leads to improved resolutions by up to 4.5 [A] over expert annotations or existing deep-learning approaches in four benchmark systems, and reveals continuous conformational landscapes capturing F-F flexible coupling in mitochondrial ATP synthase and tRNA-translocation intermediates in eukaryotic and bacterial ribosomes. Together, these results establish OPUS-ET as a powerful computational tool for linking particle purification, high-resolution reconstruction, and analysis of structural heterogeneity in cryo-ET, with demonstrated robustness to template quality, initial pose noise, and clustering parameters.

9
Easymode: general pretrained networks for cellular cryo-ET enable flexible approaches to subtomogram averaging

So-Last, M. G. F.; Hale, T.; Burt, A.; Allegretti, M.

2026-05-21 molecular biology 10.64898/2026.05.19.726344 medRxiv
Top 0.2%
1.7%
Show abstract

Cellular cryo-electron tomography (cryo-ET) reveals high-resolution details of macromolecules within their native cellular environment. However, in situ cryo-ET datasets are large and highly heterogeneous, which makes comprehensive identification and extraction of the many different elements of cellular architecture for high-resolution analysis a challenging, time-consuming and often tedious task. Here we present easymode, a library of pretrained general segmentation networks for cryo-ET, trained on over 4,000 tilt series spanning a large and diverse variety of sources. Easymode enables in situ structural determination workflows by rendering tomogram content computationally accessible, without requiring any per-dataset training. Beyond directly facilitating high-resolution subtomogram averaging of a selection of widely prevalent complexes, we show how easymode can be used to leverage cellular context in subtomogram averaging workflows, helping identify, align, or filter particle sets, and enabling automated mapping of the cellular landscape surrounding target proteins. We use easymode to determine the in situ structure of rare inosine monophosphate dehydrogenase (IMPDH) filaments at 4.0 A resolution, and to map and visualize the surrounding cellular environment.

10
Phosphorylation Mimicking Mutations Cause TDP-43 to Adopt Different Fibril Conformations

Fonda, B. D.; Murray, D. T.

2026-05-17 biophysics 10.64898/2026.05.14.725298 medRxiv
Top 0.2%
1.7%
Show abstract

The Tar-DNA Binding Protein-43 C-terminal region, TDP43LC, has been previously shown to form amyloid-like fibrils with distinct folds in ALS and FTD. In both diseases, proteinaceous inclusions contain TDP43 C-terminal protein fragments as well as phosphorylated TDP43. Here, we use solution NMR to show that soluble phosphomimetic TDP43LC, P-TDP43LC, is structurally similar to wild-type TDP43LC. Disperse P-TDP43LC, like wild-type protein, contains a central helical region flanked by long disordered regions. Despite this similarity, our turbidity measurements, imaging, and kinetic assays show that P-TDP43LC has different aggregation behavior than wild-type protein. Using solid state NMR measurements we find that that phosphomimetic mutations alter the wild-type fibril conformation. Electrostatic repulsion from negatively charged sidechains, despite having little effect on the soluble proteins structure, perturbs amyloid-like fibril formation and selects for a different conformation in vitro. These results shed light on the structural role of TDP43LC phosphorylation in fibril formation in disease. TOC Graphic O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=104 SRC="FIGDIR/small/725298v1_ufig1.gif" ALT="Figure 1"> View larger version (16K): org.highwire.dtl.DTLVardef@1c63aforg.highwire.dtl.DTLVardef@1d48ed6org.highwire.dtl.DTLVardef@1ed8fd3org.highwire.dtl.DTLVardef@17d67a8_HPS_FORMAT_FIGEXP M_FIG C_FIG SynopsisPhosphomimetic mutations at ALS and FTD neurodegeneration-associated sites in an amyloid forming protein perturbs the aggregated structure compared to wild-type protein.

11
Extracting Parsimonious Quantitative Predictors of Biological Effectiveness from 'First-Principles' Radiobiology: Application to the Mixed-Quality Problem

Yusufaly, T.; Transtrum, M.; Huang, L.; Sabok-Sayr, S.; Sgouros, G.; Hobbs, R.; Jia, X.

2026-05-06 biophysics 10.64898/2026.05.02.722446 medRxiv
Top 0.2%
1.5%
Show abstract

Developing parsimonious, mechanism-aware quantitative models that predict how biological effectiveness changes with different modifiers remains, in general, an unsolved problem. Advances in radiobiological research have created a large knowledge base of first-principles mechanistic models of radiation response that, in principle, could accurately predict radiosensitivity across different experimental and clinical conditions. However, in practice these mechanistic models come with an overabundance of parameters, the majority of which are practically unidentifiable and, moreover, likely unnecessary if one simply wishes to predict how radiosensitivity changes for some specific modifier of interest. Nevertheless, determining which few details in the full mechanistic model are relevant for a given purpose, as well as how to remove any other extraneous details, remains a highly non-trivial task. In this study, we demonstrate the potential of model reduction, starting from a detailed mechanistic description, as a systematic strategy for deriving parsimonious, experimentally falsifiable radiobiological descriptors. As a proof-of-concept demonstration, we apply the Manifold Boundary Approximation Method (MBAM) to a Mechanistic Model of DNA Repair and Survival (MEDRAS), for the problem of cell survival prediction following an acute exposure. Our findings reveal that the complete MEDRAS model for an arbitrary mixed-quality exposure can be structurally simplified to a reduced three-parameter model for an effective uniform-quality, named MEDRAS-LPL. Additional MBAM analysis on MEDRAS-LPL identifies two boundaries in parameter space, corresponding to sparsely ionizing and densely ionizing radiation. Mapping of MEDRAS-LPL parameter space on to effective LQ space further demonstrates that parameters close to the sparsely ionizing boundary line up with expectations from the theory of dual radiation, while parameters close to the densely ionizing boundary line up with expectations from a purely linear model based on a target-theory description. Moreover, our formalism predicts enhanced synergistic interactions between sparsely ionizing and densely ionizing radiation beyond the Zaider Rossi model (ZRM) paradigm, in line with empirical observations. The results highlight the potential for using reduced-order models not only for predictive applications but also for generating novel hypotheses that can inform future experimental designs and optimization strategies in radiobiology.

12
Engineered Cas9 complexes establish an experimentally grounded benchmark for heterogeneous cryoEM reconstruction methods

Grassetti, A. V.; Kinman, L. F.; Davis, J. H.

2026-05-07 biochemistry 10.64898/2026.05.04.721978 medRxiv
Top 0.2%
1.5%
Show abstract

Single-particle cryoEM is increasingly used to resolve conformational and compositional ensembles, yet objective evaluation of heterogeneous reconstruction methods remains limited by the scarcity of experimental benchmarks with per-particle ground-truth labels. Indeed, many widely used experimental"benchmark" datasets necessarily validate observed states retrospectively while purely synthetic datasets provide ground-truth labels but typically fail to capture experimentally realistic complexities including confounding structural heterogeneity, imaging noise, contaminants, and orientation biases, which dominate real-world analyses. Here we develop an experimentally grounded benchmark dataset for heterogeneous reconstruction using catalytically inactive Streptococcus pyogenes Cas9 bound to a constant sgRNA and to target DNA duplexes engineered to carry extensions of defined length. We assembled, purified, vitrified, and imaged thirteen complexes independently, such that the dataset-of-origin provides an unambiguous label for each particles encoded state while preserving the full experimental complexity of cryoEM data. Independent refinements of the pure datasets recover the engineered DNA-extension signal and define a simple quantitative readout, DNA-extension occupancy, that increases monotonically with designed extension length. The same reconstructions also reveal substantial non-encoded conformational variability within the Cas9 core, showing that this benchmark embeds a known structural signal within broader structural heterogeneity that methods must confront in practice. To separate these axes of variation, we used systematic deep classification to generate curated particle subsets depleted of selected domain motions while retaining the encoded labels. We further provide pooled particle stacks with standardized per-particle poses in a common reference frame and a lightweight framework for in silico particle pooling to generate challenge datasets with user-defined ground-truth distributions of encoded and non-encoded structural heterogeneity. Together, this resource supports robust benchmarking of heterogeneous reconstruction algorithms and provides a biochemically tractable model system for evaluating entire cryoEM pipelines, including alternative data-collection and preprocessing approaches, under experimentally realistic conditions.

13
Beyond Redfield: Thermodynamic Bounds and Non-Perturbative Quantum Dynamics in Tubulin Networks

Firmenich, F.; Firmenich, P.; Firmenich, L.

2026-05-13 biophysics 10.64898/2026.05.10.724047 medRxiv
Top 0.2%
1.5%
Show abstract

Quantum effects in biology are unavoidable at the molecular scale; the unresolved question is whether they can remain functionally relevant across the timescale gap between femtosecond molecular dynamics and microsecond-to-millisecond biological function. Here we formalize this mismatch as an equilibrium-to-functionality gap and use tubulin as a stringent open-system test case. We combine secular Lindblad, Redfield, and hierarchical equations of motion (HEOM) treatments to quantify decoherence, non-perturbative relaxation, and the physical amplification required for functional relevance. Equilibrium dephasing yields a conservative [Formula] fs at 310 K, with a generic protein-bath baseline of {approx} 13 fs. A completed 30 ps HEOM trajectory for the full 1JFF tryptophan network shows distributed non-Markovian relaxation, with terminal purity Pur = 0.210 and stretched-exponential exponent {beta}KWW {approx} 0.44, confirming that Redfield is useful as a short-time perturbative comparator but not quantitatively interchangeable with HEOM in this intermediate-coupling regime. We introduce a coherence-utility criterion [U] = [K]{tau}coh/{tau}func, separating required amplification from empirically bounded gain. A thermodynamic uncertainty relation closure shows that neural-scale cascade amplification would require Pmin [~] 10-7 W, about five orders of magnitude above the local microtubule GTP budget. Frohlich pumping is found to be linewidth-gated rather than generically micron-scale; ordered-water cavity QED and geometric subradiance remain experimentally testable but severely constrained candidates. The result is not a model of consciousness, but a reproducible physical benchmark framework for evaluating biological quantum-coherence claims under explicit open-system, energetic, and experimental constraints. Six falsifiable experimental programmes are prioritized, and the full computational framework is released with a validation ledger, cryptographic audit trail, and living supplementary material. O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=107 SRC="FIGDIR/small/724047v1_ufig1.gif" ALT="Figure 1"> View larger version (20K): org.highwire.dtl.DTLVardef@19e4f42org.highwire.dtl.DTLVardef@65a719org.highwire.dtl.DTLVardef@1bd63beorg.highwire.dtl.DTLVardef@df77d8_HPS_FORMAT_FIGEXP M_FIG O_FLOATNOGraphical abstract.C_FLOATNO Equilibrium tubulin coherence lies in the femtosecond regime, while functional neural timescales lie in the millisecond regime. Frohlich pumping, QED-cavity protection, and geometric subradiance remain experimentally discriminable non-equilibrium candidates requiring independently bounded amplification. C_FIG FundingThis research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors. Versioned computational scope of this releaseThis manuscript reports the theoretical framework, calibrated equilibrium baseline, Redfield/HEOM validation ledger, stratified Bayesian evidence synthesis, classical comparators, and falsifiable experimental design. The release-specific reproduction audit, including the current validation-check total and the SHA-256 fingerprints of the binary production artefacts (.npz, .pkl), is documented in LIVING_SI.md and outputs_data/raw_json/structur al/validation_report.json. A completed 30 ps HEOM production trajectory has been validated on constrained hardware; the master dataset contains the full 8-site population trajectory. A summary of those results is provided in [§]2.2.5. All claims made below are restricted to the numerical and theoretical evidence reported in this manuscript and its associated repository artefacts. The public repository ships the calibrated phenomenological baseline for accessibility; the HEOM production artefacts serve as the non-perturbative validation benchmark. All source figure outputs associated with this release are maintained in the public repository under outputs_data/figures_final/.

14
Quantum kernel support vector machines for trabecular bone classification: comparing feature reduction strategies on synthetic micro-CT data

Florez, I.; Farhat, A.; Le Houx, J.; Altamura, E.; Tozzi, G.

2026-05-07 biophysics 10.64898/2026.05.04.722627 medRxiv
Top 0.2%
1.4%
Show abstract

Quantum kernel methods offer a potential advantage for classification tasks in high-dimensional feature spaces, yet their practical benefit critically depends on how input features are prepared. We compare five dimensionality reduction strategies--principal component analysis (PCA), Gaussian random projection (RP Gaussian), sparse random projection (RP Sparse), partial least squares (PLS), and uniform manifold approximation and projection (UMAP) -- as pre-processing steps for quantum kernel support vector machines (SVMs) applied to trabecular bone classification from synthetic micro-computed tomography (micro-CT) data. Using a custom procedural generator based on Gaussian random field zero-crossings, we produced 500 synthetic trabecular bone volumes with controlled morphometric properties such as bone volume fraction (BV/TV), trabecular thickness (Tb.Th), number (Tb.N) and spacing (Tb.Sp). Texture features extracted from grayscale slices are reduced to 8-dimensional quantum circuit inputs via each method, then classified using both classical radial basis function (RBF)-SVMs and quantum kernel SVMs with ZZ feature maps on a statevector simulator, both evaluated with 5 x 5 repeated stratified cross-validation (25 folds). Our results show that UMAP is the only reduction method where the quantum kernel remains competitive with the classical baseline. Under repeated cross-validation, UMAP showed a +0.032 accuracy gap favouring the quantum kernel (Dietterich 5 x 2 CV p = 0.177); however, validation on 10 fully independent datasets--each with independently generated samples, separate reduction fits, and separate kernel matrices -- reversed the sign to -0.030 (paired t-test p = 0.123; Wilcoxon p = 0.193; quantum wins 3/10 datasets), indicating that the apparent advantage was likely inflated by fold dependence. Nevertheless, UMAPs gap remains small and non-significant in both analyses, whereas all linear methods (PCA, RP Gaussian, PLS) show substantial quantum deficits of -0.090 to -0.116 across BV/TV classification, with PCA and PLS remaining significant under corrected tests (5 x 2 CV p = 0.004 and p = 0.007 respectively). We additionally evaluate quantum kernel ridge regression for continuous morphometric prediction, finding that ZZ quantum kernels fail uniformly at regression (negative R2 for all methods except PLS at 4 qubits), suggesting that the ZZ kernel captures decision boundaries but not smooth metric structure. These findings provide practical guidance for feature engineering in near-term quantum machine learning pipelines and demonstrate that the choice of dimensionality reduction can determine whether quantum kernels remain competitive with classical baselines.

15
Triplet tumbling microscopy enables in situ quantification of protein complex assembly and dynamics

Lazzari-Dean, J. R.; Millett-Sikking, A.; Rao, P.; Jensvold, Z. D.; Baddock, H.; Ingaramo, M.; Nile, A. H.; York, A. G.; Preciado Lopez, M.

2026-05-11 biophysics 10.64898/2026.05.07.723557 medRxiv
Top 0.2%
1.3%
Show abstract

Protein-protein interactions (PPIs) mediate diverse cellular processes, but PPIs are typically characterized using reconstituted in vitro biochemical and biophysical approaches. Current approaches for PPI detection in living cells are limited in the scope of interactions they can capture and often require prior knowledge of the interacting partners. To close this gap, we developed triplet tumbling microscopy (TTM), which reveals the interactions of a tagged protein of interest in cells in real time. TTM reports protein complex size from rotational diffusion ("tumbling") by leveraging infrared-triggerable emission from triplet states to track tumbling over nanoseconds to hundreds of microseconds. These long-lived triplets overcome the size limitations of existing rotational diffusion-based approaches, enabling TTM to measure species from small protein complexes to organelle-scale beads. In living cells, we apply TTM to detect PPIs, quantify fraction bound, and distinguish protein complexes by size. We measure diverse types of interactions, including rapamycin-induced dimerization, p53 homo-oligomerization, and binding of the E3-ligase E6AP to the human papilloma virus 16 E6 protein. The required hardware is compatible with most fluorescent microscopes, making TTM a versatile way to extract molecular insights from the complex context of living cells. O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=109 SRC="FIGDIR/small/723557v1_ufig1.gif" ALT="Figure 1"> View larger version (27K): org.highwire.dtl.DTLVardef@1e70768org.highwire.dtl.DTLVardef@974813org.highwire.dtl.DTLVardef@1fd122borg.highwire.dtl.DTLVardef@1b3da96_HPS_FORMAT_FIGEXP M_FIG C_FIG

16
Drug design using unique conformations to preferentially target a specific site on collagen-bound MMP1

SARKAR, S. K.; Nash, A.; Harms, C.

2026-05-17 biophysics 10.64898/2026.05.14.725194 medRxiv
Top 0.2%
1.3%
Show abstract

Precise site-specific drug design remains a challenge in structure-based drug discovery. Most existing approaches screen for ligands to target binding pockets on a protein surface based on static structures obtained from techniques such as X-ray, NMR, cryo-EM, and AlphaFold. However, the structure-function paradigm is, in reality, a structure-dynamics-function relationship that determines a proteins binding and activity. As such, drug screening or design without evaluating binding competition across the protein surface or considering the receptors dynamic, substrate-dependent conformational states is incomplete. Substrate-specific unique protein conformations are underexplored and offer novel opportunities for selective therapeutic targeting, though systematic workflows for identifying and exploiting such sites remain limited. Previously, we showed that collagen alters matrix metalloprotease-1 (MMP1) dynamics and that R405 is an allosteric residue on the MMP1 surface that exhibits strong dynamic correlations with its active site. Here, we present a substrate-specific allosteric drug-design framework that targets specific sites on a protein, using collagen-bound MMP1 as a model system. We determined the conformational dynamics of free and collagen-bound MMP1 using all-atom molecular dynamics (MD) simulations and categorized conformations into clusters of similar conformations. We then compared and identified unique conformations that occur only in collagen-bound MMP1 to design drugs against them using a machine-learning approach. The top three unique clusters were used to generate approximately 150,000 candidate compounds that were then screened against both the R405-centered region and all detectable binding pockets across the MMP1 surface. We have found several compounds that bind preferentially around R405 by at least 0.3 kcal/mol relative to competing sites across the surface. This strategy establishes a generalizable framework for designing ligands that preferentially target substrate-specific allosteric sites, providing new opportunities for precision therapeutics that modulate proteins in their biologically relevant functional states. Simple SummaryIn this paper, we establish a substrate-specific allosteric drug-design strategy that integrates all-atom molecular dynamics simulations, conformational clustering, machine-learning-based ligand design, and surface-wide binding-selectivity screening, using collagen-bound MMP1 as a model system. We show that collagen binding reshapes the conformational ensemble of MMP1, creating unique conformational states that are absent or inaccessible in the free enzyme. By identifying these substrate-specific conformations, generating ligands based on the corresponding dynamic fingerprints around the collagen-specific allosteric residue R405, and screening compounds across all binding pockets on the MMP1 surface, we demonstrate preferential targeting of the collagen-specific site relative to competing pockets. These results establish a generalizable framework for designing ligands that selectively recognize biologically relevant substrate-bound conformations rather than static protein structures alone. Substrate-specific allosteric targeting may enable selective modulation of individual protein functions while minimizing off-target interactions, providing new opportunities for precision therapeutics against dynamic protein systems.

17
A quantitative proteomics dataset for assessment and prediction of low dose X-ray radiation exposure in mice.

Zelter, A.; Riffle, M.; Merrihew, G. E.; Mutawe, B.; Shulman, N.; Sanders, J. A.; Noble, W. S.; Johnson Erickson, D. P.; Morimoto, A.; Shaver, B. A.; Steins, T. N.; Cao, N.; Ford, E. C.; Rudnick, P. A.; Chelsky, D.; Wan, K. H.; Inman, J. L.; Chang, H.; Snijders, A. M.; Mao, J.-H.; Celniker, S. E.; De Chant, J.; Obst-Huebl, L.; Nakamura, K.; Wu, C. C.; MacCoss, M. J.

2026-05-19 biochemistry 10.64898/2026.05.18.725951 medRxiv
Top 0.2%
1.3%
Show abstract

Ionizing radiation induces molecular responses that may be used to estimate exposure when physical dosimeters are unavailable. Here we present two large-scale proteomics datasets generated from mouse dorsal skin punch samples collected following controlled X-ray exposures spanning multiple doses, dose rates, and post-exposure time points. Experiment 1 comprised 96 samples (including 16 reference samples) collected 6 days after exposure to 0-75 cGy delivered at either 30 or 300 cGy/min. Experiment 2 comprised 936 samples (including 236 reference samples) exposed to 0-100 cGy at either 3 or 28 cGy/min dose rates and harvested between 7 and 150 days post-exposure. All samples were processed using a standardized workflow involving automated bead-based digestion and data-independent acquisition mass spectrometry. The datasets include multiple pooled reference sample types, process controls, and system suitability standards ensuring high quality data. All data presented are available via ProteomeXchange at several levels of processing, from raw files through normalized peptide- and protein-level abundance matrices suitable for biomarker discovery and machine learning applications. This dataset will facilitate generation of new insights into the biological changes and molecular signatures resulting from X-ray exposure in mice and may also help inform future studies in humans.

18
Altair-dvOPM: an open-access platform for large-field three-dimensional tissue imaging

Ngo, T.; Faiyazuddin, M.; Nguyen, T. D.; Haug, J.; Shen, Q.; Gałecki, S.; Borges, H. M.; Chen, B.; Wang, X.; Zhu, H.; Pappas, S. S.; Voigt, F. F.; FIolka, R.; Dean, K. M.

2026-05-12 biophysics 10.64898/2026.05.08.723912 medRxiv
Top 0.2%
1.2%
Show abstract

Altair-dvOPM is an open-access direct-view oblique plane microscope designed for large-field, three-dimensional imaging of cleared and expanded tissue sections. By combining photographic-lens-based detection, externally launched oblique illumination and precision-registered modular baseplates, the system achieves micrometer-scale lateral resolution over a ~5.4 mm field of view without custom objectives or highly specialized alignment procedures. We demonstrate imaging across scales, from subcellular structures in expanded cells to centimeter-scale expanded tissue sections, and provide documentation, CAD files, Zemax models and open-source control software to support replication and extension.

19
AE-PocketMiner Uses Attention to Simultaneously Predict Cryptic Pockets and Their Allosteric Coupling

Zhang, S.; Mishra, P.; Kelly, D.; Kumar, R.; Bowman, G. R.

2026-05-23 biophysics 10.64898/2026.05.21.726899 medRxiv
Top 0.3%
1.2%
Show abstract

Finding and targeting cryptic pockets could dramatically expand the druggable proteome. However, discovering these sites remains challenging since they are only open a fraction of the time. It is also difficult to predict the functional relevance of a cryptic site as this often requires insight into allostery. Here we introduce attention enabled (AE-)PocketMiner, an artificial intelligence (AI) method that uses a graph neural network with an attention mechanism to simultaneously predict the locations of cryptic pockets and their allosteric coupling to the rest of the protein from a single input structure. We show that AE-PocketMiner outperforms past methods for identifying cryptic pockets and recapitulates known allosteric interactions. Moreover, we experimentally confirm newly predicted cryptic pockets and mutations that allosterically control pocket opening. AE-PocketMiner thus provides a powerful framework for multiple steps of the drug discovery process--including pocket identification, prioritization, and assay design--that will help expand the druggable proteome.

20
Connecting Cryo-EM and Crystallographic Views of RNA Folding through Ionic Conditions and Structural Flexibility

Mainan, A.; Roy, S.; Kirmizialtin, S.

2026-05-04 biophysics 10.64898/2026.05.02.722415 medRxiv
Top 0.3%
1.2%
Show abstract

Discrepancies between biomolecular structures resolved by cryo-electron microscopy (cryo-EM) and X-ray crystallography (XRD) often arise from differences in ionic conditions and construct design, yet their mechanistic impact on RNA folding remains unclear. In the SARS-CoV-2 frameshifting stimulatory element, cryo-EM and XRD structures reveal distinct pseudoknot conformations--a bent and a coaxially stacked state--complicating its structure-function relationship. Here, combining all-atom explicit-solvent simulation results with a structure-based electrostatic model, we show that Mg{superscript 2} ions drive transitions between these states by stabilizing long-range tertiary interactions and modulating local dynamical coupling involving the slippery site and stem 3. Energy landscape analysis reveals distinct folding pathways, while deletion of the slippery segment in crystallographic constructs alters intermediates and produces pathways inconsistent with single-molecule optical tweezer experiments. This study demonstrates how condition-dependent experiments encode complementary interaction-level information and how physics-based computational approaches integrate these to yield a coherent, mechanistic picture of RNA folding. TOC GRAPHICS O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=108 SRC="FIGDIR/small/722415v1_ufig1.gif" ALT="Figure 1"> View larger version (41K): org.highwire.dtl.DTLVardef@1a7c324org.highwire.dtl.DTLVardef@fcabceorg.highwire.dtl.DTLVardef@736704org.highwire.dtl.DTLVardef@7061e6_HPS_FORMAT_FIGEXP M_FIG C_FIG