Back

Absolute Quantification of Microbiota in Shotgun Sequencing Using Host Cells or Spike-Ins

Wallace, A.; Ling, H.; Gatenby, S.; Pruden, S.; Neeley, C.; Harland, C.; Couldrey, C.

2023-08-24 microbiology
10.1101/2023.08.23.554046 bioRxiv
Show abstract

BackgroundAn ongoing challenge for DNA sequencing of samples containing microorganisms is the ability to meaningfully compare different samples and to connect the results back to clinically relevant disease states. The reads of DNA sequence from each sample do not, in and of themselves, give sufficient information to calculate the absolute abundances of each observed organism. Using relative abundances alone is insufficient to determine whether absolute abundances have increased or decreased in the organisms of interest from one sample to the next. This is a well-studied problem in 16S sequencing, but solutions in shotgun sequencing are lacking. Here we show how spike-ins can be used in shotgun sequencing to calculate absolute abundances of organisms present. We also propose the use of the host cells already in the sample as an alternative calculation method. Mammalian host cells are typically of sufficient size that they can be easily and cheaply counted prior to sequencing by a variety of methods and combining this with sequencing data provides sufficient information to calculate the absolute abundances of microbial organisms. ResultsMicrobial abundances in the samples calculated via this method were consistent with manufacturer-stated values of microbial communities, with qPCR, and with our method tested against itself with regard the spike-in and host-cell based options. R2 values on the log10 scale in these tests ranged from 0.85 to 0.98, and the log10-RMSE ranged from 0.1 to 0.7. ConclusionsThe proposed method can consistently calculate absolute microbial abundances to within an order of magnitude. Both versions of the method, where spike-ins are added to the samples, or where host cells in the sample are counted, are viable. Calculating absolute abundances allows for direct comparisons to be made between different samples. If disease-thresholds have been identified, absolute abundances can quantify disease states.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Microbiology Spectrum
435 papers in training set
Top 0.1%
14.8%
2
PLOS ONE
4510 papers in training set
Top 19%
10.1%
3
Microbial Genomics
204 papers in training set
Top 0.3%
6.3%
4
Journal of Microbiological Methods
11 papers in training set
Top 0.1%
4.9%
5
mSystems
361 papers in training set
Top 2%
4.9%
6
mSphere
281 papers in training set
Top 1%
3.6%
7
BMC Microbiology
35 papers in training set
Top 0.1%
3.6%
8
PeerJ
261 papers in training set
Top 3%
3.1%
50% of probability mass above
9
Scientific Reports
3102 papers in training set
Top 43%
2.9%
10
BioTechniques
24 papers in training set
Top 0.1%
2.7%
11
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 2%
2.4%
12
Frontiers in Microbiology
375 papers in training set
Top 5%
1.9%
13
Methods in Ecology and Evolution
160 papers in training set
Top 1%
1.9%
14
Journal of Visualized Experiments
30 papers in training set
Top 0.3%
1.7%
15
Journal of Clinical Microbiology
120 papers in training set
Top 1%
1.5%
16
Microbiome
139 papers in training set
Top 2%
1.5%
17
Journal of Virological Methods
36 papers in training set
Top 0.4%
1.2%
18
Microorganisms
101 papers in training set
Top 1%
1.1%
19
MicrobiologyOpen
18 papers in training set
Top 0.1%
0.9%
20
PLOS Computational Biology
1633 papers in training set
Top 23%
0.8%
21
Microbiology
57 papers in training set
Top 1%
0.8%
22
Access Microbiology
22 papers in training set
Top 0.6%
0.8%
23
BMC Genomics
328 papers in training set
Top 5%
0.8%
24
F1000Research
79 papers in training set
Top 5%
0.7%
25
GigaScience
172 papers in training set
Top 3%
0.7%
26
International Journal of Food Microbiology
11 papers in training set
Top 0.6%
0.7%
27
Journal of Clinical Virology
62 papers in training set
Top 0.9%
0.7%
28
Wellcome Open Research
57 papers in training set
Top 3%
0.6%
29
Journal of Infection
71 papers in training set
Top 3%
0.6%
30
Forensic Science International: Genetics
24 papers in training set
Top 0.2%
0.6%