Back to Search
Start Over
Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis
- Source :
- Nature Communications, Vol 11, Iss 1, Pp 1-14 (2020), Nature Communications
- Publication Year :
- 2020
- Publisher :
- Springer Science and Business Media LLC, 2020.
-
Abstract
- Single-cell RNA sequencing (scRNA-seq) can characterize cell types and states through unsupervised clustering, but the ever increasing number of cells and batch effect impose computational challenges. We present DESC, an unsupervised deep embedding algorithm that clusters scRNA-seq data by iteratively optimizing a clustering objective function. Through iterative self-learning, DESC gradually removes batch effects, as long as technical differences across batches are smaller than true biological variations. As a soft clustering algorithm, cluster assignment probabilities from DESC are biologically interpretable and can reveal both discrete and pseudotemporal structure of cells. Comprehensive evaluations show that DESC offers a proper balance of clustering accuracy and stability, has a small footprint on memory, does not explicitly require batch information for batch effect removal, and can utilize GPU when available. As the scale of single-cell studies continues to grow, we believe DESC will offer a valuable tool for biomedical researchers to disentangle complex cellular heterogeneity.<br />Increasingly large scRNA-seq datasets demand better and more scalable analysis tools. Here, the authors introduce a scalable unsupervised deep embedding algorithm that clusters scRNA-seq data by iteratively optimizing a clustering objective function and enables removal of batch effects.
- Subjects :
- 0301 basic medicine
Statistical methods
Computer science
General Physics and Astronomy
computer.software_genre
Monocytes
Mice
0302 clinical medicine
Software
Bone Marrow
Cluster Analysis
RNA-Seq
lcsh:Science
Multidisciplinary
RNA sequencing
Scalability
Data mining
Single-Cell Analysis
Algorithms
Fuzzy clustering
Science
Stability (learning theory)
Article
Retina
General Biochemistry, Genetics and Molecular Biology
Islets of Langerhans
03 medical and health sciences
Deep Learning
Machine learning
Cluster (physics)
Animals
Humans
Cluster analysis
business.industry
Deep learning
Scale (chemistry)
General Chemistry
TheoryofComputation_MATHEMATICALLOGICANDFORMALLANGUAGES
ComputingMethodologies_PATTERNRECOGNITION
030104 developmental biology
Gene Expression Regulation
Leukocytes, Mononuclear
Macaca
lcsh:Q
Artificial intelligence
business
computer
030217 neurology & neurosurgery
Subjects
Details
- ISSN :
- 20411723
- Volume :
- 11
- Database :
- OpenAIRE
- Journal :
- Nature Communications
- Accession number :
- edsair.doi.dedup.....e691f033c98d92f52c6f8a00128fc14b