Back

Exploratory Statistical Analysis of Rheumatoid Factor Based Subgroups in Sjögren's Syndrome.

Zhuang, N. Z.; Howells, J. M.

2025-09-12 bioinformatics
10.1101/2025.09.11.670596 bioRxiv
Show abstract

This method describes a computational pipeline for stratifying autoimmune patient groups using exclusively binary autoantibody data. Our method addresses a methodological gap in computational immunology by providing a standardized framework for analyzing categorical serological data commonly found in electronic health records and resource-limited settings. The pipeline integrates three complementary analytical modules: O_LIModule 1: Exploratory screening using statistical association tests. C_LIO_LIModule 2: Quantification of overall immunological similarity and un-certainty. C_LIO_LIModule 3: Prediction modeling and validation against chance. C_LI We demonstrate the methods utility by applying it to two autoimmune disorders. We were successful in recapitulating established clinical relationships in these two closely linked diseases. The pipeline is implemented in Python and includes detailed configuration options for custom disease groups, autoanti-body panels and stratification variables. This method enables researchers to extract meaningful immunological patterns from underutilized binary clinical data, serving as a hypothesis-generation tool to help drive impactful exploration. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=74 SRC="FIGDIR/small/670596v4_ufig1.gif" ALT="Figure 1"> View larger version (19K): org.highwire.dtl.DTLVardef@7c6df3org.highwire.dtl.DTLVardef@116ac5eorg.highwire.dtl.DTLVardef@18e76dcorg.highwire.dtl.DTLVardef@1d6b71_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Frontiers in Immunology
586 papers in training set
Top 0.1%
18.9%
2
ImmunoInformatics
11 papers in training set
Top 0.1%
18.8%
3
PLOS Computational Biology
1633 papers in training set
Top 4%
8.3%
4
Bioinformatics
1061 papers in training set
Top 3%
7.2%
50% of probability mass above
5
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.3%
7.2%
6
Frontiers in Genetics
197 papers in training set
Top 1%
4.9%
7
Patterns
70 papers in training set
Top 0.2%
3.7%
8
BMC Medical Genomics
36 papers in training set
Top 0.1%
3.6%
9
Bioinformatics Advances
184 papers in training set
Top 2%
2.5%
10
BMC Bioinformatics
383 papers in training set
Top 4%
2.1%
11
Briefings in Bioinformatics
326 papers in training set
Top 3%
1.9%
12
Cell Reports Methods
141 papers in training set
Top 2%
1.7%
13
iScience
1063 papers in training set
Top 14%
1.7%
14
The Journal of Immunology
146 papers in training set
Top 1.0%
1.5%
15
Scientific Reports
3102 papers in training set
Top 64%
1.3%
16
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.8%
17
Computers in Biology and Medicine
120 papers in training set
Top 4%
0.8%
18
eLife
5422 papers in training set
Top 61%
0.6%
19
Journal of Immunological Methods
24 papers in training set
Top 0.3%
0.5%
20
Immunology
29 papers in training set
Top 1%
0.5%
21
npj Systems Biology and Applications
99 papers in training set
Top 3%
0.5%
22
PLOS ONE
4510 papers in training set
Top 73%
0.5%