Back to Search Start Over

Contrasting and combining transcriptome complexity captured by short and long RNA sequencing reads.

Authors :
Han SW
Jewell S
Thomas-Tikhonenko A
Barash Y
Source :
Genome research [Genome Res] 2024 Oct 29; Vol. 34 (10), pp. 1624-1635. Date of Electronic Publication: 2024 Oct 29.
Publication Year :
2024

Abstract

Mapping transcriptomic variations using either short- or long-read RNA sequencing is a staple of genomic research. Long reads are able to capture entire isoforms and overcome repetitive regions, whereas short reads still provide improved coverage and error rates. Yet, open questions remain, such as how to quantitatively compare the technologies, can we combine them, and what is the benefit of such a combined view? We tackle these questions by first creating a pipeline to assess matched long- and short-read data using a variety of transcriptome statistics. We find that across data sets, algorithms, and technologies, matched short-read data detects ∼30% more splice junctions, such that ∼10%-30% of the splice junctions included at ≥20% by short reads are missed by long reads. In contrast, long reads detect many more intron-retention events and can detect full isoforms, pointing to the benefit of combining the technologies. We introduce MAJIQ-L, an extension of the MAJIQ software, to enable a unified view of transcriptome variations from both technologies and demonstrate its benefits. Our software can be used to assess any future long-read technology or algorithm and can be combined with short-read data for improved transcriptome analysis.<br /> (© 2024 Han et al.; Published by Cold Spring Harbor Laboratory Press.)

Details

Language :
English
ISSN :
1549-5469
Volume :
34
Issue :
10
Database :
MEDLINE
Journal :
Genome research
Publication Type :
Academic Journal
Accession number :
39322279
Full Text :
https://doi.org/10.1101/gr.278659.123