Back to Search Start Over

Fusion transcripts and their genomic breakpoints in polyadenylated and ribosomal RNA-minus RNA sequencing data

Authors :
Youri Hoogstrate
Malgorzata A Komor
René Böttcher
Job van Riet
Harmen J G van de Werken
Stef van Lieshout
Ralf Hoffmann
Evert van den Broek
Anne S Bolijn
Natasja Dits
Daoud Sie
David van der Meer
Floor Pepers
Chris H Bangma
Geert J L H van Leenders
Marcel Smid
Pim J French
John W M Martens
Wilbert van Workum
Peter J van der Spek
Bart Janssen
Eric Caldenhoven
Christian Rausch
Mark de Jong
Andrew P Stubbs
Gerrit A Meijer
Remond J A Fijneman
Guido W Jenster
Neurology
Urology
Medical Oncology
Cell biology
Pathology
Human genetics
CCA - Cancer biology and immunology
Source :
Hoogstrate, Y, Komor, M A, Böttcher, R, van Riet, J, van de Werken, H J G, van Lieshout, S, Hoffmann, R, van den Broek, E, Bolijn, A S, Dits, N, Sie, D, van der Meer, D, Pepers, F, Bangma, C H, van Leenders, G J L H, Smid, M, French, P J, Martens, J W M, van Workum, W, van der Spek, P J, Janssen, B, Caldenhoven, E, Rausch, C, de Jong, M, Stubbs, A P, Meijer, G A, Fijneman, R J A & Jenster, G W 2021, ' Fusion transcripts and their genomic breakpoints in polyadenylated and ribosomal RNA-minus RNA sequencing data ', GigaScience, vol. 10, no. 12 . https://doi.org/10.1093/gigascience/giab080, GigaScience, 10(12). Oxford University Press, Gigascience, 10(12):giab080. Oxford University Press, GigaScience
Publication Year :
2021

Abstract

Background Fusion genes are typically identified by RNA sequencing (RNA-seq) without elucidating the causal genomic breakpoints. However, non–poly(A)-enriched RNA-seq contains large proportions of intronic reads that also span genomic breakpoints. Results We have developed an algorithm, Dr. Disco, that searches for fusion transcripts by taking an entire reference genome into account as search space. This includes exons but also introns, intergenic regions, and sequences that do not meet splice junction motifs. Using 1,275 RNA-seq samples, we investigated to what extent genomic breakpoints can be extracted from RNA-seq data and their implications regarding poly(A)-enriched and ribosomal RNA–minus RNA-seq data. Comparison with whole-genome sequencing data revealed that most genomic breakpoints are not, or minimally, transcribed while, in contrast, the genomic breakpoints of all 32 TMPRSS2-ERG–positive tumours were present at RNA level. We also revealed tumours in which the ERG breakpoint was located before ERG, which co-existed with additional deletions and messenger RNA that incorporated intergenic cryptic exons. In breast cancer we identified rearrangement hot spots near CCND1 and in glioma near CDK4 and MDM2 and could directly associate this with increased expression. Furthermore, in all datasets we find fusions to intergenic regions, often spanning multiple cryptic exons that potentially encode neo-antigens. Thus, fusion transcripts other than classical gene-to-gene fusions are prominently present and can be identified using RNA-seq. Conclusion By using the full potential of non–poly(A)-enriched RNA-seq data, sophisticated analysis can reliably identify expressed genomic breakpoints and their transcriptional effects.

Details

Language :
English
ISSN :
2047217X
Database :
OpenAIRE
Journal :
Hoogstrate, Y, Komor, M A, Böttcher, R, van Riet, J, van de Werken, H J G, van Lieshout, S, Hoffmann, R, van den Broek, E, Bolijn, A S, Dits, N, Sie, D, van der Meer, D, Pepers, F, Bangma, C H, van Leenders, G J L H, Smid, M, French, P J, Martens, J W M, van Workum, W, van der Spek, P J, Janssen, B, Caldenhoven, E, Rausch, C, de Jong, M, Stubbs, A P, Meijer, G A, Fijneman, R J A & Jenster, G W 2021, ' Fusion transcripts and their genomic breakpoints in polyadenylated and ribosomal RNA-minus RNA sequencing data ', GigaScience, vol. 10, no. 12 . https://doi.org/10.1093/gigascience/giab080, GigaScience, 10(12). Oxford University Press, Gigascience, 10(12):giab080. Oxford University Press, GigaScience
Accession number :
edsair.doi.dedup.....52fd7484f285a8cf401d137f28430408
Full Text :
https://doi.org/10.1093/gigascience/giab080