Back to Search Start Over

TIF-Seq2 disentangles overlapping isoforms in complex human transcriptomes

Authors :
Wang, Jingwen
Li, Bingnan
Marques, Sueli
Steinmetz, Lars M
Wei, Wu
Pelechano, Vicent
Source :
Nucleic Acids Research
Publication Year :
2020
Publisher :
Oxford University Press, 2020.

Abstract

Eukaryotic transcriptomes are complex, involving thousands of overlapping transcripts. The interleaved nature of the transcriptomes limits our ability to identify regulatory regions, and in some cases can lead to misinterpretation of gene expression. To improve the understanding of the overlapping transcriptomes, we have developed an optimized method, TIF-Seq2, able to sequence simultaneously the 5′ and 3′ ends of individual RNA molecules at single-nucleotide resolution. We investigated the transcriptome of a well characterized human cell line (K562) and identified thousands of unannotated transcript isoforms. By focusing on transcripts which are challenging to be investigated with RNA-Seq, we accurately defined boundaries of lowly expressed unannotated and read-through transcripts putatively encoding fusion genes. We validated our results by targeted long-read sequencing and standard RNA-Seq for chronic myeloid leukaemia patient samples. Taking the advantage of TIF-Seq2, we explored transcription regulation among overlapping units and investigated their crosstalk. We show that most overlapping upstream transcripts use poly(A) sites within the first 2 kb of the downstream transcription units. Our work shows that, by paring the 5′ and 3′ end of each RNA, TIF-Seq2 can improve the annotation of complex genomes, facilitate accurate assignment of promoters to genes and easily identify transcriptionally fused genes.

Details

Language :
English
ISSN :
13624962 and 03051048
Volume :
48
Issue :
18
Database :
OpenAIRE
Journal :
Nucleic Acids Research
Accession number :
edsair.pmid..........b0de0544cf642474da42a51943725054