1. Prediction of HLA genotypes from single-cell transcriptome data
- Author
-
Benjamin D. Solomon, Hong Zheng, Laura W. Dillon, Jason D. Goldman, Christopher S. Hourigan, James R. Heath, and Purvesh Khatri
- Subjects
HLA typing algorithm ,next-generation sequencing data (NGS) ,HLA genotype ,single-cell sequencing (scRNA-seq) ,human leukocyte antigen (HLA) ,major histocompatibility (MHC) ,Immunologic diseases. Allergy ,RC581-607 - Abstract
The human leukocyte antigen (HLA) locus plays a central role in adaptive immune function and has significant clinical implications for tissue transplant compatibility and allelic disease associations. Studies using bulk-cell RNA sequencing have demonstrated that HLA transcription may be regulated in an allele-specific manner and single-cell RNA sequencing (scRNA-seq) has the potential to better characterize these expression patterns. However, quantification of allele-specific expression (ASE) for HLA loci requires sample-specific reference genotyping due to extensive polymorphism. While genotype prediction from bulk RNA sequencing is well described, the feasibility of predicting HLA genotypes directly from single-cell data is unknown. Here we evaluate and expand upon several computational HLA genotyping tools by comparing predictions from human single-cell data to gold-standard, molecular genotyping. The highest 2-field accuracy averaged across all loci was 76% by arcasHLA and increased to 86% using a composite model of multiple genotyping tools. We also developed a highly accurate model (AUC 0.93) for predicting HLA-DRB345 copy number in order to improve genotyping accuracy of the HLA-DRB locus. Genotyping accuracy improved with read depth and was reproducible at repeat sampling. Using a metanalytic approach, we also show that HLA genotypes from PHLAT and OptiType can generate ASE ratios that are highly correlated (R2 = 0.8 and 0.94, respectively) with those derived from gold-standard genotyping.
- Published
- 2023
- Full Text
- View/download PDF