1. Identification of novel reference genes based on MeSH categories
- Author
-
Levent Carkacioglu, Tolga Can, Tulin Ersahin, Rengul Cetin-Atalay, Volkan Atalay, and Ozlen Konu
- Subjects
Microarrays ,Gene Expression ,lcsh:Medicine ,Expression ,Transcriptome ,transcriptomics ,Medical Subject Headings ,genetic database ,Reference genes ,genetic variability ,Molecular Cell Biology ,Gastrointestinal Cancers ,Breast Tumors ,Basic Cancer Research ,Databases, Genetic ,Medicine and Health Sciences ,genetics ,lcsh:Science ,housekeeping gene ,Genetics ,receiver operating characteristic ,Multidisciplinary ,Applied Mathematics ,Liver Diseases ,article ,standard ,Microarray Data ,High-Throughput Nucleotide Sequencing ,Reference Standards ,Housekeeping gene ,Rna-seq Data ,Normalization ,Tissues ,Bioassays and Physiological Analysis ,Oncology ,real time polymerase chain reaction ,Physical Sciences ,DNA microarray ,actin ,cancer cell line ,Algorithms ,Statistics (Mathematics) ,Research Article ,Suitable Reference Genes ,in vitro study ,regulatory mechanism ,Housekeeping Genes ,Context (language use) ,gene sequence ,tissue specificity ,Gastroenterology and Hepatology ,Biology ,Biostatistics ,Research and Analysis Methods ,Real-Time Polymerase Chain Reaction ,DNA sequencing ,high throughput sequencing ,glyceraldehyde 3 phosphate dehydrogenase ,GAPDH gene ,Cell Line, Tumor ,Gastrointestinal Tumors ,gene expression profiling ,Hepatocellular-carcinoma ,Humans ,human ,procedures ,Statistical Methods ,Gene ,mouse ,gene identification ,nonhuman ,human cell ,Gene Expression Profiling ,lcsh:R ,tumor cell line ,Biology and Life Sciences ,Computational Biology ,Cancers and Neoplasms ,Cell Biology ,Actin gene ,Gene expression profiling ,sensitivity and specificity ,lcsh:Q ,microarray analysis ,Carcinoma Cell-line ,eEF2 gene ,Mathematics ,Real-time Pcr - Abstract
Transcriptome experiments are performed to assess protein abundance through mRNA expression analysis. Expression levels of genes vary depending on the experimental conditions and the cell response. Transcriptome data must be diverse and yet comparable in reference to stably expressed genes, even if they are generated from different experiments on the same biological context from various laboratories. In this study, expression patterns of 9090 microarray samples grouped into 381 NCBI-GEO datasets were investigated to identify novel candidate reference genes using randomizations and Receiver Operating Characteristic (ROC) curves. The analysis demonstrated that cell type specific reference gene sets display less variability than a united set for all tissues. Therefore, constitutively and stably expressed, origin specific novel reference gene sets were identified based on their coefficient of variation and percentage of occurrence in all GEO datasets, which were classified using Medical Subject Headings (MeSH). A large number of MeSH grouped reference gene lists are presented as novel tissue specific reference gene lists. The most commonly observed 17 genes in these sets were compared for their expression in 8 hepatocellular, 5 breast and 3 colon carcinoma cells by RT-qPCR to verify tissue specificity. Indeed, commonly used housekeeping genes GAPDH, Actin and EEF2 had tissue specific variations, whereas several ribosomal genes were among the most stably expressed genes in vitro. Our results confirm that two or more reference genes should be used in combination for differential expression analysis of large-scale data obtained from microarray or next generation sequencing studies. Therefore context dependent reference gene sets, as presented in this study, are required for normalization of expression data from diverse technological backgrounds. © 2014 Ersahin et al.
- Published
- 2014