Back

barbieQ: An R software package for analysing barcode count data from clonal tracking experiments

Fei, L.; Maksimovic, J.; Oshlack, A.

2025-09-14 bioinformatics
10.1101/2025.09.11.675529 bioRxiv
Show abstract

MotivationA "clone" encompasses a progenitor cell and its progeny cells. Tracking clonal composition as cells differentiate or evolve is useful in many fields. Various single-cell lineage tracing (clonal tracking) technologies use unique DNA barcodes that are passed from progenitor cells to their offspring. The barcode count for each sample indicates cell number in clones. However, analysis of barcode count data is often bespoke and relies on visualisations and heuristics. A generalized workflow for preprocessing and robust statistical analysis of barcode count data across protocols is needed. ResultsWe introduce barbieQ, a Bioconductor R package for analysing barcode count data across groups of samples. It provides data-driven quality control and filtering, extensive visualisations, and two statistical tests: 1) Differential barcode proportion (differences in proportions between sample groups), and 2) Differential barcode occurrence (differences in presence/absence odds between groups). Both tests handle complex experimental designs using regression models and rigorously account for sample-to-sample variability. We validated both tests on semi-simulated, real data and a case study, demonstrating that they hold their size, are sufficiently powered to detect true differences, and outperform existing approaches.

Matching journals

The top 1 journal accounts for 50% of the predicted probability mass.

1
BMC Bioinformatics
383 papers in training set
Top 0.1%
50.9%
50% of probability mass above
2
Bioinformatics
1061 papers in training set
Top 3%
8.3%
3
Methods in Ecology and Evolution
160 papers in training set
Top 0.7%
4.8%
4
PLOS ONE
4510 papers in training set
Top 34%
4.2%
5
Bioinformatics Advances
184 papers in training set
Top 1%
3.6%
6
GigaScience
172 papers in training set
Top 0.6%
3.5%
7
NAR Genomics and Bioinformatics
214 papers in training set
Top 1%
2.3%
8
Nature Communications
4913 papers in training set
Top 49%
1.9%
9
PLOS Computational Biology
1633 papers in training set
Top 15%
1.9%
10
Genome Biology
555 papers in training set
Top 4%
1.9%
11
Nucleic Acids Research
1128 papers in training set
Top 11%
1.6%
12
Scientific Reports
3102 papers in training set
Top 67%
1.2%
13
BMC Genomics
328 papers in training set
Top 4%
1.2%
14
STAR Protocols
15 papers in training set
Top 0.2%
0.9%
15
Nature Protocols
30 papers in training set
Top 0.2%
0.9%
16
Genome Medicine
154 papers in training set
Top 8%
0.8%
17
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.8%
18
iScience
1063 papers in training set
Top 33%
0.7%
19
eLife
5422 papers in training set
Top 59%
0.7%
20
BMC Methods
11 papers in training set
Top 0.2%
0.7%
21
Cytometry Part A
30 papers in training set
Top 0.3%
0.7%
22
Life Science Alliance
263 papers in training set
Top 2%
0.7%
23
Forensic Science International: Genetics
24 papers in training set
Top 0.1%
0.7%
24
BMC Biology
248 papers in training set
Top 6%
0.6%