Back to Search Start Over

Automated identification of reference genes based on RNA-seq data

Authors :
Macarena Arroyo
Juan de Dios Alché
Rosario Carmona
Adoración Zafra
Pedro Seoane
M. Gonzalo Claros
María José Jiménez-Quesada
Rafael Larrosa
European Commission
Ministerio de Economía y Competitividad (España)
CSIC - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA)
Source :
Digital.CSIC. Repositorio Institucional del CSIC, instname, BioMedical Engineering OnLine, Vol 16, Iss S1, Pp 1-23 (2017), BioMedical Engineering
Publication Year :
2017
Publisher :
BioMed Central, 2017.

Abstract

[Background] Gene expression analyses demand appropriate reference genes (RGs) for normalization, in order to obtain reliable assessments. Ideally, RG expression levels should remain constant in all cells, tissues or experimental conditions under study. Housekeeping genes traditionally fulfilled this requirement, but they have been reported to be less invariant than expected; therefore, RGs should be tested and validated for every particular situation. Microarray data have been used to propose new RGs, but only a limited set of model species and conditions are available; on the contrary, RNA-seq experiments are more and more frequent and constitute a new source of candidate RGs.<br />[Results] An automated workflow based on mapped NGS reads has been constructed to obtain highly and invariantly expressed RGs based on a normalized expression in reads per mapped million and the coefficient of variation. This workflow has been tested with Roche/454 reads from reproductive tissues of olive tree (Olea europaea L.), as well as with Illumina paired-end reads from two different accessions of Arabidopsis thaliana and three different human cancers (prostate, small-cell cancer lung and lung adenocarcinoma). Candidate RGs have been proposed for each species and many of them have been previously reported as RGs in literature. Experimental validation of significant RGs in olive tree is provided to support the algorithm.<br />[Conclusion] Regardless sequencing technology, number of replicates, and library sizes, when RNA-seq experiments are designed and performed, the same datasets can be analyzed with our workflow to extract suitable RGs for subsequent PCR validation. Moreover, different subset of experimental conditions can provide different suitable RGs.<br />This research was supported by co-funding from the European Union through the ERDF 2014–2020 “Programa Operativo de Crecimiento Inteligente” to the projects RTA2013-00068-C03 and RTA2013-00023-C02 of the Spanish INIA; BFU2011-22779 and RECUPERA2020-3.1.4 from the Spanish MINECO, P11-CVI-7487 from the regional PAI and NEUMOSUR grant 12/2015 entitled “Expresión de retrotransposones en pacientes con adenocarcinoma intervenido. Comparación entre tejjido sano y tumoral”. Publication costs were funded by the mentioned grants

Details

Database :
OpenAIRE
Journal :
Digital.CSIC. Repositorio Institucional del CSIC, instname, BioMedical Engineering OnLine, Vol 16, Iss S1, Pp 1-23 (2017), BioMedical Engineering
Accession number :
edsair.doi.dedup.....0d69dc37cf73fadf1d940fb1bfba7a7c