1. CircParser: a novel streamlined pipeline for circular RNA structure and host gene prediction in non-model organisms
- Author
Fedor Sharko, Artem V. Nedoluzhko, Rbbani Mg, Ioannis Konstantinidis, Anton B. Teslyuk, and Fernandes Jmo
- Subjects
Bioinformatics; In silico; Annotation; Structural components; Medicine; Computational biology; Biology; Genome; General Biochemistry, Genetics and Molecular Biology; DNA sequencing; Medical and health sciences; Circular RNA; microRNA; Genetics; Model organism; Gene; Developmental biology; Health sciences; Host gene; General Neuroscience; Biochemistry & molecular biology; Genomics; General Medicine; Genome project; Mathematics and Natural Sciences: 400::Basic biosciences: 470::Genetics and genomics: 474 [VDP]; General Agricultural and Biological Sciences; Prediction
- Abstract
Circular RNAs (circRNAs) are long noncoding RNAs that play a significant role in various biological processes, including embryonic development and stress responses. These regulatory molecules can modulate microRNA activity and are involved in different molecular pathways as indirect regulators of gene expression. Thousands of circRNAs have been described in diverse taxa thanks to recent advances in high-throughput sequencing technologies, which have made a wide variety of total RNA sequencing datasets publicly available. A number of de novo circRNA and host gene prediction tools are available to date, but their ability to accurately predict circRNA host genes is limited in the case of low-quality genome assemblies or annotations. Here, we present CircParser, a simple and fast Unix/Linux pipeline that uses the outputs of the most common in silico circRNA prediction tools (CIRI, CIRI2, CircExplorer2, find_circ, and circFinder) to annotate circRNAs, assigning presumptive host genes from local or public databases such as the National Center for Biotechnology Information (NCBI). The pipeline can also discriminate circRNAs by their structural components (exonic, intronic, exon-intronic, or intergenic) using a genome annotation file.
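
The four structural classes named in the abstract can be illustrated with a minimal sketch. This is not CircParser's implementation: the function name `classify_circrna`, the endpoint-based rule, and the toy coordinates are all assumptions made for illustration. The sketch assumes 1-based inclusive coordinates, as in GFF/GTF files, and that all intervals lie on the same chromosome and strand.

```python
from typing import List, Tuple

# (start, end), 1-based and inclusive, as in GFF/GTF annotation files.
Interval = Tuple[int, int]


def _inside(pos: int, intervals: List[Interval]) -> bool:
    """Return True if pos falls inside any of the given intervals."""
    return any(start <= pos <= end for start, end in intervals)


def classify_circrna(start: int, end: int,
                     genes: List[Interval],
                     exons: List[Interval]) -> str:
    """Classify a circRNA by where its two back-splice coordinates fall.

    Hypothetical rule (an assumption, not the published method):
      both ends in exons            -> "exonic"
      both ends in introns          -> "intronic"
      one end in each               -> "exon-intronic"
      any end outside every gene    -> "intergenic"
    """
    def region(pos: int) -> str:
        if _inside(pos, exons):
            return "exon"
        if _inside(pos, genes):
            # Inside a gene but not inside an exon: treat as intron.
            return "intron"
        return "intergenic"

    left, right = region(start), region(end)
    if left == "exon" and right == "exon":
        return "exonic"
    if left == "intron" and right == "intron":
        return "intronic"
    if {left, right} == {"exon", "intron"}:
        return "exon-intronic"
    return "intergenic"


# Toy annotation: one gene with three exons (made-up coordinates).
genes = [(100, 1000)]
exons = [(100, 200), (400, 500), (800, 1000)]

print(classify_circrna(150, 450, genes, exons))    # exonic
print(classify_circrna(250, 350, genes, exons))    # intronic
print(classify_circrna(150, 300, genes, exons))    # exon-intronic
print(classify_circrna(2000, 3000, genes, exons))  # intergenic
```

A linear scan over intervals keeps the example short. For a real annotation, an interval tree or a sorted-interval overlap tool would be the more practical choice.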
- Published
- 2020