Back

A scoping study of the whole-cell imaging literature: a foundational corpus, potential for data-mining and research synthesis, and a call for standardization of an emerging field

Mirvis, M.; Weingard, B.; Goodman, S.; Marshall, W. F.

2025-02-04 cell biology
10.1101/2025.02.03.636363 bioRxiv
Show abstract

The level of cellular organization bridging the mesoscale and whole-cell scale is coming into focus as a new frontier in cell biology. Great progress has been made in unraveling the complex physical and functional interconnectivity of organelles, but how the entire organelle network is spatially arranged within the cytoplasm is only beginning to be explored. Drawing on cross-disciplinary research synthesis methods, we systematically curated the whole-cell volumetric imaging literature, resulting in a corpus consisting of 89 studies and 118 image datasets. We describe the trajectory and current state of the field between 2004 and 2024. A broad characterization, or "scoping review", of bibliometrics, study design, and reporting practices shows accelerating technological development and research output. We find high variability in study design and reporting practices, including imaging modality, model organism, cellular contexts, organelles imaged, and analyses. Due to the laborious, low-throughput nature of most volumetric imaging methods, we find trends toward small sample sizes (<10 cells) and small cell types. We describe common quantitative analyses across studies, including volumetric ratios of organelles and inter-organelle contact analyses. This work establishes the initial iteration of a growing dataset of whole-cell imaging literature and data, and motivates a call for standardized whole-cell imaging study design, reporting, and data sharing practices in the context of an emerging sub-field of cell biology. Our curated dataset now provides the basis for a plethora of future aggregate and comparative analyses to reveal larger patterns and generalized hypotheses about the systems behavior and regulation of whole-cell organelle networks. More broadly, we showcase the potential of new rigorous secondary research methods to strengthen cell biologys literature review and reproducibility toolkit, create new avenues for discovery, and promote open research practices that support secondary data-reuse and integration.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Life Science Alliance
263 papers in training set
Top 0.1%
12.5%
2
eLife
5422 papers in training set
Top 7%
9.2%
3
Nature
575 papers in training set
Top 4%
7.2%
4
Journal of Cell Biology
333 papers in training set
Top 0.4%
7.2%
5
Developmental Cell
168 papers in training set
Top 4%
4.9%
6
Nature Communications
4913 papers in training set
Top 35%
4.3%
7
Cell Systems
167 papers in training set
Top 3%
4.0%
8
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 2%
3.1%
50% of probability mass above
9
Nature Methods
336 papers in training set
Top 3%
2.8%
10
Molecular Biology of the Cell
272 papers in training set
Top 1.0%
2.1%
11
Scientific Reports
3102 papers in training set
Top 53%
1.9%
12
Molecular Systems Biology
142 papers in training set
Top 0.5%
1.8%
13
Development
440 papers in training set
Top 2%
1.8%
14
Open Biology
95 papers in training set
Top 0.6%
1.7%
15
PLOS ONE
4510 papers in training set
Top 53%
1.7%
16
PLOS Biology
408 papers in training set
Top 9%
1.7%
17
BMC Biology
248 papers in training set
Top 2%
1.3%
18
Cells
232 papers in training set
Top 4%
1.2%
19
Developmental Biology
134 papers in training set
Top 2%
1.1%
20
Journal of Cell Science
353 papers in training set
Top 2%
1.1%
21
The EMBO Journal
267 papers in training set
Top 4%
0.9%
22
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 41%
0.9%
23
Science
429 papers in training set
Top 18%
0.9%
24
Nucleic Acids Research
1128 papers in training set
Top 16%
0.8%
25
Scientific Data
174 papers in training set
Top 2%
0.8%
26
Molecular Cell
308 papers in training set
Top 9%
0.8%
27
Nature Genetics
240 papers in training set
Top 7%
0.8%
28
Cell Reports
1338 papers in training set
Top 33%
0.8%
29
Cell
370 papers in training set
Top 17%
0.8%
30
PLOS Computational Biology
1633 papers in training set
Top 24%
0.8%