1. scRAA: the development of a robust and automatic annotation procedure for single-cell RNA sequencing data
- Author
- Dongyan Yan, Zhe Sun, Jiyuan Fang, Shanshan Cao, Wenjie Wang, Xinyue Chang, Sarkhan Badirli, Haoda Fu, and Yushi Liu
- Subjects
- Pharmacology, Statistics and Probability, Pharmacology (medical)
- Abstract
A critical task in single-cell RNA sequencing (scRNA-Seq) data analysis is identifying cell types in heterogeneous tissues. Although most classification methods have demonstrated high performance on scRNA-Seq annotation problems, a robust and accurate solution is still needed to generate reliable results for downstream analyses such as marker gene identification, differential expression analysis, and pathway analysis. Because it is hard to establish a universally good metric, no single classification method performs well in every scenario. In addition, reference and query data in cell classification usually come from different experimental batches, and failing to account for batch effects may lead to misleading conclusions. To overcome these limitations, we propose a robust ensemble approach to classify cells, combined with batch correction between reference and query data. We simulated four scenarios that range from simple to complex batch effects and account for varying cell-type proportions. We further tested our approach on both lung and pancreas data. Across the simulation scenarios and the real data, we found improved prediction accuracy and robust performance. Both simulated and real scRNA-Seq data show that incorporating batch effect correction between reference and query, together with the ensemble approach, improves cell-type prediction accuracy while maintaining robustness.
- Published
- 2023