1. Direct identification of A-to-I editing sites with nanopore native RNA sequencing
- Author
-
Tram Anh Nguyen, Jia Wei Joel Heng, Pornchai Kaewsapsak, Eng Piew Louis Kok, Dominik Stanojević, Hao Liu, Angelysia Cardilla, Albert Praditya, Zirong Yi, Mingwan Lin, Jong Ghut Ashley Aw, Yin Ying Ho, Kai Lay Esther Peh, Yuanming Wang, Qixing Zhong, Jacki Heraud-Farlow, Shifeng Xue, Bruno Reversade, Carl Walkley, Ying Swan Ho, Mile Šikić, Yue Wan, Meng How Tan, School of Chemical and Biomedical Engineering, School of Biological Sciences, Genome Institute of Singapore, A*STAR, Yong Loo Lin School of Medicine, NUS, and HP-NTU Digital Manufacturing Corporate Lab
- Subjects
Bioengineering [Engineering] ,RNA metabolism ,Adenosine ,Inosine genetics ,Sequence Analysis, RNA ,Biological sciences [Science] ,Gene Expression ,Cell Biology ,Biochemistry ,Inosine ,Mice ,Nanopores ,RNA ,Animals ,RNA Editing ,RNA genetics ,Molecular Biology ,Sequence Analysis ,Biotechnology ,Adenosine genetics - Abstract
Inosine is a prevalent RNA modification in animals and is formed when an adenosine is deaminated by the ADAR family of enzymes. Traditionally, inosines are identified indirectly as variants from Illumina RNA-sequencing data because they are interpreted as guanosines by cellular machineries. However, this indirect method performs poorly in protein-coding regions where exons are typically short, in non-model organisms with sparsely annotated single-nucleotide polymorphisms, or in disease contexts where unknown DNA mutations are pervasive. Here, we show that Oxford Nanopore direct RNA sequencing can be used to identify inosine-containing sites in native transcriptomes with high accuracy. We trained convolutional neural network models to distinguish inosine from adenosine and guanosine, and to estimate the modification rate at each editing site. Furthermore, we demonstrated their utility on the transcriptomes of human, mouse and Xenopus. Our approach expands the toolkit for studying adenosine-to-inosine editing and can be further extended to investigate other RNA modifications. Nanyang Technological University National Medical Research Council (NMRC) National Research Foundation (NRF) M.H.T. is supported by a National Research Foundation Singapore grant (NRF2017-NRF-ISF002–2673), an Open Fund - Individual Research Grant from the National Medical Research Council (NMRC/OFIRG/0017/2016), an EMBO Global Investigatorship, an ASPIRE League seed grant from Nanyang Technological University, core funds from the Genome Institute of Singapore, and funds for Final Year Project (FYP) and the International Genetically Engineering Machine (iGEM) competition from the School of Chemical and Biomedical Engineering. J.W.J.H. is supported by a Ph.D. research scholarship from the School of Chemical and Biomedical Engineering. Y.S.H. is supported by core funds from the Bioprocessing Technology Institute. We also acknowledge the funding support for this project from Nanyang Technological University under the URECA Undergraduate Research Programme.