VDJdive and ECLIPSE enhance single-cell TCR sequencing analysis through the probabilistic resolution of ambiguous clonotypes
Burns, E. C.; Movassagh, M.; Lundell, J. F.; Ye, M.; Ye, Z.; Oliveira, G.; Rout, R.; Hugaboom, M. B.; Street, K.; Braun, D. A.
Show abstract
Single-cell T cell receptor sequencing (scTCR-seq) has transformed our ability to track individual T cell clones and has been instrumental in advancing our understanding of human T cell differentiation. However, current computational pipelines for analysis, which require precise matching of CDR3 sequences from exactly 2 heterodimeric TCR chains to define the clonotype for each cell, are inherently limited because of the substantial proportion of cells possessing "ambiguous" clonotypes driven by missing (undetected from the technical issue of chain "dropout") or extra chains (present from either true biological expression or due to technical artifacts such as cellular doublets and ambient TCR contamination). As a result, clone sizes are artificially reduced, impeding the tracking of clones across conditions and differentiation states. Here we introduce VDJdive and ECLIPSE (Enhanced CLonotypic Inference via Prediction of Single-cell Expression), two computational methods that, together, resolve this clonal ambiguity by utilizing the expectation-maximization algorithm for the clonal prediction of ambiguous cells. These methods consider chain pairings across the sample, allowing for high-fidelity prediction of chains lost due to dropout and the discernment of biological expression of extra chains from technical artifacts. Consequently, clone sizes are augmented and cells without clonotype assignments are minimized. Our approach facilitates enhanced clonal tracking through these elevated clone sizes and is easily implementable, compatible with standard single-cell transcriptomic workflows, and broadly applicable across biological contexts and T cell subsets.
Matching journals
The top 7 journals account for 50% of the predicted probability mass.