Back

MSstatsResponse: Semi-parametric statistical model enhances detection of drug-protein interactions in chemoproteomics experiments

Szvetecz, S.; Kohler, D.; Federspiel, J.; Field, D. S.; Jean-Beltran, P.; Seward, R. J.; Suh, H.; Xue, L.; Vitek, O.

2026-03-11 bioinformatics
10.64898/2026.03.09.710598 bioRxiv
Show abstract

Chemoproteomics is a popular approach for the identification of small molecule-protein interactions in biological systems. Several chemoproteomics workflows leverage functionalized chemical probes and mass spectrometry to measure protein engagement through direct protein enrichment or competition using a range of small molecule concentrations. Statistical methods for analysis of such dose-response chemoproteomics datasets are limited. For example, existing methods rely on fixed curve shapes and are sensitive to experimental variation, particularly when the number of doses or replicates is limited. Here, we present MSstatsResponse, a semi-parametric statistical framework for analyzing chemoproteomic dose-response experiments that uses isotonic regression that does not require a fixed curve shape. This approach improves the accuracy and robustness of curve fitting, target identification, and half-response estimation across diverse experimental designs. We evaluate MSstatsResponse by generating a benchmark chemoproteomic dataset that profiled the competition between the kinase-binding probe XO44 and the drug Dasatinib using three mass spectrometry acquisition strategies: data-independent acquisition, tandem mass tag-based data-dependent acquisition, and selected reaction monitoring. We further evaluate the method on simulated datasets that vary the number of doses, number of replicates, and levels of noise, and demonstrate that MSstatsResponse consistently improves sensitivity, specificity, and reproducibility compared to existing methods, particularly in low-replicate and low-dose settings. MSstatsResponse is implemented as an open-source R/Bioconductor package that integrates with the MSstats ecosystem for quantitative proteomics. It provides a unified workflow for preprocessing, curve fitting, target identification, and experimental design, enabling researchers to select the number of doses and replicates appropriate to their experimental goals. The software and documentation are freely available at https://bioconductor.org/packages/MSstatsResponse.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Molecular & Cellular Proteomics
158 papers in training set
Top 0.1%
19.0%
2
Bioinformatics
1061 papers in training set
Top 2%
15.0%
3
Journal of Proteome Research
215 papers in training set
Top 0.3%
12.7%
4
Analytical Chemistry
205 papers in training set
Top 0.3%
10.3%
50% of probability mass above
5
Nature Communications
4913 papers in training set
Top 21%
8.6%
6
Nature Methods
336 papers in training set
Top 2%
6.4%
7
Cell Systems
167 papers in training set
Top 5%
2.5%
8
PROTEOMICS
35 papers in training set
Top 0.3%
2.4%
9
PLOS ONE
4510 papers in training set
Top 56%
1.5%
10
Genome Biology
555 papers in training set
Top 5%
1.5%
11
PLOS Computational Biology
1633 papers in training set
Top 18%
1.4%
12
Cell Reports Methods
141 papers in training set
Top 3%
1.2%
13
Communications Chemistry
39 papers in training set
Top 0.6%
1.0%
14
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 5%
1.0%
15
Nature Machine Intelligence
61 papers in training set
Top 3%
0.8%
16
Molecular Systems Biology
142 papers in training set
Top 2%
0.7%
17
Nucleic Acids Research
1128 papers in training set
Top 18%
0.7%
18
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.7%
19
Journal of the American Society for Mass Spectrometry
33 papers in training set
Top 0.6%
0.7%
20
Scientific Reports
3102 papers in training set
Top 80%
0.5%
21
iScience
1063 papers in training set
Top 40%
0.5%
22
eLife
5422 papers in training set
Top 63%
0.5%