Back

Penalized generalized estimating equations for relative risk regression with applications to brain lesion data

Kindalova, P.; Veldsman, M.; Nichols, T. E.; Kosmidis, I.

2021-11-03 neuroscience
10.1101/2021.11.01.466751 bioRxiv
Show abstract

Motivated by a brain lesion application, we introduce penalized generalized estimating equations for relative risk regression for modelling correlated binary data. Brain lesions can have varying incidence across the brain and result in both rare and high incidence outcomes. As a result, odds ratios estimated from generalized estimating equations with logistic regression structures are not necessarily directly interpretable as relative risks. On the other hand, use of log-link regression structures with the binomial variance function may lead to estimation instabilities when event probabilities are close to 1. To circumvent such issues, we use generalized estimating equations with log-link regression structures with identity variance function and unknown dispersion parameter. Even in this setting, parameter estimates can be infinite, which we address by penalizing the generalized estimating functions with the gradient of the Jeffreys prior. Our findings from extensive simulation studies show significant improvement over the standard log-link generalized estimating equations by providing finite estimates and achieving convergence when boundary estimates occur. The real data application on UK Biobank brain lesion maps further reveals the instabilities of the standard log-link generalized estimating equations for a large-scale data set and demonstrates the clear interpretation of relative risk in clinical applications.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
NeuroImage
813 papers in training set
Top 0.4%
22.9%
2
PLOS Computational Biology
1633 papers in training set
Top 2%
12.5%
3
Human Brain Mapping
295 papers in training set
Top 1.0%
6.5%
4
Biostatistics
21 papers in training set
Top 0.1%
6.4%
5
Scientific Reports
3102 papers in training set
Top 30%
4.0%
50% of probability mass above
6
PLOS ONE
4510 papers in training set
Top 37%
3.7%
7
Biometrics
22 papers in training set
Top 0.1%
2.6%
8
Statistics in Medicine
34 papers in training set
Top 0.1%
2.4%
9
Imaging Neuroscience
242 papers in training set
Top 2%
2.1%
10
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 29%
1.9%
11
Communications Biology
886 papers in training set
Top 6%
1.9%
12
Nature Communications
4913 papers in training set
Top 48%
1.9%
13
Frontiers in Neuroscience
223 papers in training set
Top 4%
1.7%
14
Bulletin of Mathematical Biology
84 papers in training set
Top 1%
1.7%
15
Medical Image Analysis
33 papers in training set
Top 0.7%
1.5%
16
International Journal of Epidemiology
74 papers in training set
Top 2%
1.4%
17
eLife
5422 papers in training set
Top 48%
1.2%
18
PLOS Genetics
756 papers in training set
Top 11%
1.2%
19
Network Neuroscience
116 papers in training set
Top 0.8%
1.2%
20
NeuroImage: Clinical
132 papers in training set
Top 3%
0.9%
21
Brain Communications
147 papers in training set
Top 3%
0.9%
22
The Annals of Applied Statistics
15 papers in training set
Top 0.1%
0.9%
23
Journal of The Royal Society Interface
189 papers in training set
Top 4%
0.8%
24
Research Synthesis Methods
20 papers in training set
Top 0.2%
0.8%
25
Bioinformatics
1061 papers in training set
Top 10%
0.7%
26
Neural Computation
36 papers in training set
Top 0.8%
0.7%
27
Genetic Epidemiology
46 papers in training set
Top 1.0%
0.7%
28
eneuro
389 papers in training set
Top 10%
0.7%
29
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.7%
30
BMC Medical Research Methodology
43 papers in training set
Top 2%
0.5%