Strategies for Integrating Single-Cell RNA Sequencing Results With Multiple Species
Hart, R. P.
Show abstract
Single-cell RNA sequencing (scRNAseq) is a robust technology for parsing gene expression in individual cells from a tissue or other complex source. One application involves experiments where cells from multiple species are recovered from a single sample, such as when human cells are transplanted into an animal model. We transplanted microglial precursor cells into newborn mouse brain and then recovered unenriched cortical tissue six months later. Dissociated cells were assessed by scRNAseq. The default method for analyzing these results begins by aligning sequencing reads with a mixture of both mouse and human reference genomes. While this clearly identifies the human cells as a distinct cluster, the clustering is artificially driven by expression from non-comparable gene identifiers from different species. We devised a method for translating expression counts from human to mouse and evaluated four algorithms for parsing mixed-species scRNAseq data. Our optimal approach split raw sequencing reads according to the best alignment score in each genome, and then re-aligned reads only with the appropriate genome. After gene symbol translation, pooled results indicate that cell types are more appropriately clustered and that differential expression analysis identifies species-specific patterns. This method should be applicable to any mixed-species scRNAseq experiment.\n\nSummary of optimal strategyO_LIMixed-species scRNAseq data are aligned with mixture of mouse and human reference genomes\nC_LIO_LIThe BAM file is scanned to find the best alignment score for each sequencing read identifier; these are used to split the paired FASTQ files into two sets of files\nC_LIO_LIEach set of species-specific, paired FASTQ files is re-aligned with only the appropriate reference genome\nC_LIO_LIRaw counts imported into Seurat\nC_LIO_LIThe human counts table is translated to mouse gene symbols using a custom HomoloGene translation table\nC_LIO_LIResults are merged and analyzed\nC_LI
Matching journals
The top 6 journals account for 50% of the predicted probability mass.