Systematic evaluation of spliced alignment programs for RNA-seq data
- Source :
- Nature Methods, Nature Publishing Group, 2013, epub ahead of print. ⟨10.1038/nmeth.2722⟩, Recercat. Dipòsit de la Recerca de Catalunya
- Publication Year :
- 2013
Abstract
- High-throughput RNA sequencing is an increasingly accessible method for studying gene structure and activity on a genome-wide scale. A critical step in RNA-seq data analysis is the alignment of partial transcript reads to a reference genome sequence. To assess the performance of current mapping software, we invited developers of RNA-seq aligners to process four large human and mouse RNA-seq data sets. In total, we compared 26 mapping protocols based on 11 programs and pipelines and found major performance differences between methods on numerous benchmarks, including alignment yield, basewise accuracy, mismatch and gap placement, exon junction discovery and suitability of alignments for transcript reconstruction. We observed concordant results on real and simulated RNA-seq data, confirming the relevance of the metrics employed. Future developments in RNA-seq alignment methods would benefit from improved placement of multimapped reads, balanced utilization of existing gene annotation and a reduced false discovery rate for splice junctions.
- Subjects :
- False discovery rate
Sequence analysis
RNA Splicing
Animals
Chromosome Mapping
Computational Biology
Exons
False Positive Reactions
High-Throughput Nucleotide Sequencing
Humans
K562 Cells
Mice
RNA, Messenger
Reproducibility of Results
Sequence Alignment
Sequence Analysis, RNA
Software
RNA-Seq
Sequence alignment
Computational biology
Biology
Biochemistry
Article
03 medical and health sciences
Exon
0302 clinical medicine
Splicing (Genetics)
Gene mapping
Molecular Biology
030304 developmental biology
Genetics
0303 health sciences
business.industry
Cell Biology
Gene Annotation
[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM]
030220 oncology & carcinogenesis
RNA splicing
RNA
[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]
business
Sequence Analysis
Biotechnology
Details
- Language :
- English
- ISSN :
- 1548-7105 and 1548-7091
- Volume :
- 10
- Issue :
- 12
- Database :
- OpenAIRE
- Journal :
- Nature methods
- Accession number :
- edsair.doi.dedup.....6385fd1559cfdc01d937b50d88596646
- Full Text :
- https://doi.org/10.1038/nmeth.2722