51. The Reproducibility of Lists of Differentially Expressed Genes in Microarray Studies
- Author
Catalin Barbacioru, Jane Wang, Wendell D. Jones, Sheng Zhong, Qian Xie, Ernest S. Kawasaki, James J. Chen, Janet A. Warrington, Patrick J. Collins, Charles Wang, Huixiao Hong, Cecilie Boysen, Roger Perkins, Hong Fang, Federico Goodsaid, Lei Guo, Xiaoxi Megan Cao, Richard Shippy, James C. Fuscoe, Brett T. Thorn, Wenjun Bao, Weida Tong, Quan Zhen Li, Xiaohui Fan, Yuling Luo, Leming Shi, Vincent Bertholet, Ron L. Peterson, Raj K. Puri, Shashi Amur, Xu Guo, Russell D. Wolfinger, Stephen C. Harris, Lisa J. Croner, Tzu-Ming Chu, Jie Wu, Jing Han, Roderick V. Jensen, Feng Qian, Nan Mei, Damir Herman, Zhenqiang Su, Hongmei Sun, Liang Zhang, Yaron Turpaz, James C. Willey, Felix W. Frueh, and Yongming Andrew Sun
- Subjects
Reproducibility, Microarray, Bioinformatics, Gene chip analysis, Cutoff, General Materials Science, p-value, Computational biology, DNA microarray, Fold change, Biotechnology, Ranking (information retrieval), Mathematics
- Abstract
Reproducibility is a fundamental requirement in scientific experiments and clinical contexts. Recent publications raise concerns about the reliability of microarray technology because of the apparent lack of agreement between lists of differentially expressed genes (DEGs). In this study we demonstrate that (1) such discordance may stem from ranking and selecting DEGs solely by statistical significance (P) derived from widely used simple t-tests; (2) when fold change (FC) is used as the ranking criterion, the lists become much more reproducible, especially when fewer genes are selected; and (3) the instability of short DEG lists based on P cutoffs is an expected mathematical consequence of the high variability of the t-values. We recommend the use of FC ranking plus a non-stringent P cutoff as a baseline practice in order to generate more reproducible DEG lists. The FC criterion enhances reproducibility while the P criterion balances sensitivity and specificity.
- Published
- 2007