Back to Search
Start Over
Impact of the spotted microarray preprocessing method on fold-change compression and variance stability
- Source :
- BMC Bioinformatics, BMC Bioinformatics, Vol 12, Iss 1, p 413 (2011), BMC Bioinformatics, Vol. 12, p. 413 (2011)
- Publisher :
- Springer Nature
-
Abstract
- Background The standard approach for preprocessing spotted microarray data is to subtract the local background intensity from the spot foreground intensity, to perform a log2 transformation and to normalize the data with a global median or a lowess normalization. Although well motivated, standard approaches for background correction and for transformation have been widely criticized because they produce high variance at low intensities. Whereas various alternatives to the standard background correction methods and to log2 transformation were proposed, impacts of both successive preprocessing steps were not compared in an objective way. Results In this study, we assessed the impact of eight preprocessing methods combining four background correction methods and two transformations (the log2 and the glog), by using data from the MAQC study. The current results indicate that most preprocessing methods produce fold-change compression at low intensities. Fold-change compression was minimized using the Standard and the Edwards background correction methods coupled with a log2 transformation. The drawback of both methods is a high variance at low intensities which consequently produced poor estimations of the p-values. On the other hand, effective stabilization of the variance as well as better estimations of the p-values were observed after the glog transformation. Conclusion As both fold-change magnitudes and p-values are important in the context of microarray class comparison studies, we therefore recommend to combine the Edwards correction with a hybrid transformation method that uses the log2 transformation to estimate fold-change magnitudes and the glog transformation to estimate p-values.
- Subjects :
- Normalization (statistics)
Quality Control
Computer science
Applied Mathematics
Gene Expression Profiling
lcsh:Computer applications to medicine. Medical informatics
Biochemistry
Fold change
Computer Science Applications
lcsh:Biology (General)
Structural Biology
lcsh:R858-859.7
Preprocessor
Humans
lcsh:QH301-705.5
Algorithm
Molecular Biology
Research Article
Oligonucleotide Array Sequence Analysis
Subjects
Details
- Language :
- English
- ISSN :
- 14712105
- Volume :
- 12
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- BMC Bioinformatics
- Accession number :
- edsair.doi.dedup.....78d929c3b4f5e93a1998075caedce2f8
- Full Text :
- https://doi.org/10.1186/1471-2105-12-413