Back to Search
Start Over
Shrinkage estimation of dispersion in Negative Binomial models for RNA-seq experiments with small sample size
- Source :
- Bioinformatics
- Publication Year :
- 2013
- Publisher :
- Oxford University Press (OUP), 2013.
-
Abstract
- Motivation: RNA-seq experiments produce digital counts of reads that are affected by both biological and technical variation. To distinguish the systematic changes in expression between conditions from noise, the counts are frequently modeled by the Negative Binomial distribution. However, in experiments with small sample size, the per-gene estimates of the dispersion parameter are unreliable. Method: We propose a simple and effective approach for estimating the dispersions. First, we obtain the initial estimates for each gene using the method of moments. Second, the estimates are regularized, i.e. shrunk towards a common value that minimizes the average squared difference between the initial estimates and the shrinkage estimates. The approach does not require extra modeling assumptions, is easy to compute and is compatible with the exact test of differential expression. Results: We evaluated the proposed approach using 10 simulated and experimental datasets and compared its performance with that of currently popular packages edgeR, DESeq, baySeq, BBSeq and SAMseq. For these datasets, sSeq performed favorably for experiments with small sample size in sensitivity, specificity and computational time. Availability: http://www.stat.purdue.edu/∼ovitek/Software.html and Bioconductor. Contact: ovitek@purdue.edu Supplementary information: Supplementary data are available at Bioinformatics online.
- Subjects :
- Statistics and Probability
Negative binomial distribution
Gene Expression
Method of moments (statistics)
01 natural sciences
Biochemistry
Bioconductor
010104 statistics & probability
03 medical and health sciences
Statistics
Sensitivity (control systems)
0101 mathematics
Molecular Biology
Probability
030304 developmental biology
Mathematics
0303 health sciences
Models, Statistical
Sequence Analysis, RNA
Original Papers
Expression (mathematics)
Computer Science Applications
Binomial distribution
Binomial Distribution
Computational Mathematics
Noise
Computational Theory and Mathematics
Sample size determination
Sample Size
RNA
Transcriptome
Subjects
Details
- ISSN :
- 13674811 and 13674803
- Volume :
- 29
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....bd96c25442328a40ddca75567180859c
- Full Text :
- https://doi.org/10.1093/bioinformatics/btt143