12 results on '"Rujira Achawanantakun"'
Search Results
2. Genome, functional gene annotation, and nuclear transformation of the heterokont oleaginous alga Nannochloropsis oceanica CCMP1779.
- Author
-
Astrid Vieler, Guangxi Wu, Chia-Hong Tsai, Blair Bullard, Adam J Cornish, Christopher Harvey, Ida-Barbara Reca, Chelsea Thornburg, Rujira Achawanantakun, Christopher J Buehl, Michael S Campbell, David Cavalier, Kevin L Childs, Teresa J Clark, Rahul Deshpande, Erika Erickson, Ann Armenia Ferguson, Witawas Handee, Que Kong, Xiaobo Li, Bensheng Liu, Steven Lundback, Cheng Peng, Rebecca L Roston, Sanjaya, Jeffrey P Simpson, Allan Terbush, Jaruswan Warakanont, Simone Zäuner, Eva M Farre, Eric L Hegg, Ning Jiang, Min-Hao Kuo, Yan Lu, Krishna K Niyogi, John Ohlrogge, Katherine W Osteryoung, Yair Shachar-Hill, Barbara B Sears, Yanni Sun, Hideki Takahashi, Mark Yandell, Shin-Han Shiu, and Christoph Benning
- Subjects
Genetics ,QH426-470 - Abstract
Unicellular marine algae have promise for providing sustainable and scalable biofuel feedstocks, although no single species has emerged as a preferred organism. Moreover, adequate molecular and genetic resources prerequisite for the rational engineering of marine algal feedstocks are lacking for most candidate species. Heterokonts of the genus Nannochloropsis naturally have high cellular oil content and are already in use for industrial production of high-value lipid products. First success in applying reverse genetics by targeted gene replacement makes Nannochloropsis oceanica an attractive model to investigate the cell and molecular biology and biochemistry of this fascinating organism group. Here we present the assembly of the 28.7 Mb genome of N. oceanica CCMP1779. RNA sequencing data from nitrogen-replete and nitrogen-depleted growth conditions support a total of 11,973 genes, of which in addition to automatic annotation some were manually inspected to predict the biochemical repertoire for this organism. Among others, more than 100 genes putatively related to lipid metabolism, 114 predicted transcription factors, and 109 transcriptional regulators were annotated. Comparison of the N. oceanica CCMP1779 gene repertoire with the recently published N. gaditana genome identified 2,649 genes likely specific to N. oceanica CCMP1779. Many of these N. oceanica-specific genes have putative orthologs in other species or are supported by transcriptional evidence. However, because similarity-based annotations are limited, functions of most of these species-specific genes remain unknown. Aside from the genome sequence and its analysis, protocols for the transformation of N. oceanica CCMP1779 are provided. The availability of genomic and transcriptomic data for Nannochloropsis oceanica CCMP1779, along with efficient transformation protocols, provides a blueprint for future detailed gene functional analysis and genetic engineering of Nannochloropsis species by a growing academic community focused on this genus.
- Published
- 2012
- Full Text
- View/download PDF
3. LncRNA-ID: Long non-coding RNA IDentification using balanced random forests.
- Author
-
Rujira Achawanantakun, Jiao Chen, Yanni Sun, and Yuan Zhang
- Published
- 2015
- Full Text
- View/download PDF
4. Ncrna Consensus Secondary Structure Derivation Using Grammar Strings.
- Author
-
Rujira Achawanantakun, Yanni Sun, and Seyedeh Shohreh Takyar
- Published
- 2011
- Full Text
- View/download PDF
5. ChainKnot: a comparative H-type pseudoknot prediction tool using multiple ab initio folding tools.
- Author
-
Jikai Lei, Prapaporn Techa-angkoon, Yanni Sun, and Rujira Achawanantakun
- Published
- 2013
- Full Text
- View/download PDF
6. Shape and secondary structure prediction for ncRNAs including pseudoknots based on linear SVM.
- Author
-
Rujira Achawanantakun and Yanni Sun
- Published
- 2013
- Full Text
- View/download PDF
7. MAKER-P: A Tool Kit for the Rapid Creation, Management, and Quality Control of Plant Genome Annotations
- Author
-
Mei Yee Law, Gaurav D. Moghe, Carolyn J. Lawrence, Yanni Sun, David E. Hufnagel, Mark Yandell, Kevin L. Childs, Rujira Achawanantakun, Joshua C. Stein, Shin-Han Shiu, Dian Jiao, Doreen Ware, Ning Jiang, Michael S. Campbell, Jikai Lei, and Carson Holt
- Subjects
Physiology ,Pseudogene ,Arabidopsis ,Plant Science ,Computational biology ,Genes, Plant ,Zea mays ,Genome ,Article ,Annotation ,Genetics ,Arabidopsis thaliana ,Repetitive Sequences, Nucleic Acid ,biology ,Computational Biology ,Reproducibility of Results ,The Arabidopsis Information Resource ,Molecular Sequence Annotation ,Exons ,Genome project ,biology.organism_classification ,Non-coding RNA ,Alternative Splicing ,Genome, Plant ,Pseudogenes ,Software - Abstract
We have optimized and extended the widely used annotation engine MAKER in order to better support plant genome annotation efforts. New features include better parallelization for large repeat-rich plant genomes, noncoding RNA annotation capabilities, and support for pseudogene identification. We have benchmarked the resulting software tool kit, MAKER-P, using the Arabidopsis (Arabidopsis thaliana) and maize (Zea mays) genomes. Here, we demonstrate the ability of the MAKER-P tool kit to automatically update, extend, and revise the Arabidopsis annotations in light of newly available data and to annotate pseudogenes and noncoding RNAs absent from The Arabidopsis Informatics Resource 10 build. Our results demonstrate that MAKER-P can be used to manage and improve the annotations of even Arabidopsis, perhaps the best-annotated plant genome. We have also installed and benchmarked MAKER-P on the Texas Advanced Computing Center. We show that this public resource can de novo annotate the entire Arabidopsis and maize genomes in less than 3 h and produce annotations of comparable quality to those of the current The Arabidopsis Information Resource 10 and maize V2 annotation builds.
- Published
- 2013
- Full Text
- View/download PDF
8. Analysis of the canine brain transcriptome with an emphasis on the hypothalamus and cerebral cortex
- Author
-
Robert K. Wayne, Rujira Achawanantakun, Meenakshi Roy, Yanni Sun, Kyung Hwan Kim, Namshin Kim, and Won Hyong Chung
- Subjects
Cerebral Cortex ,Male ,Genetics ,RNA, Untranslated ,Alternative splicing ,Hypothalamus ,Brain ,Exons ,Biology ,Non-coding RNA ,Genome ,Phenotype ,Human genetics ,Transcriptome ,Alternative Splicing ,Exon ,Dogs ,Animals ,Gene - Abstract
The diversity of dog breeds make the domestic dog a valuable model for identifying genes responsible for many phenotypic and behavioral traits. The brain, in particular, is a region of interest for the analysis of molecular changes that are involved in dog-specific behavioral phenotypes. However, such studies are handicapped due to incomplete annotation of the dog genome. We present a high-coverage transcriptome of the dog brain using RNA-Seq. Two areas of the brain, hypothalamus and cerebral cortex, were selected for their roles in cognition, emotion, and neuroendocrine functions. We detected many novel features of the dog transcriptome, including 13,799 novel exons, 51,357 exons with unique 5' or 3' modifications, and many novel alternative splicing events. We provide some examples of novel features in genes that are related to domestication, including ADCY8, SMOC2, and PRNP. We also found 247 novel protein-coding genes and 328 noncoding RNAs, including 57 long noncoding RNAs that represent the first empirical evidence for a large fraction of noncoding RNAs in the dog. In addition, we analyze both gene expression and alternative splicing differences between the hypothalamus and cerebral cortex and find that there is very little overlap between genes that are differentially alternatively spliced and genes that are differentially expressed. We thereby suggest that researchers who want to pinpoint the genetic causes for dog breed-specific traits and diseases should not confine their studies to gene expression alone, but should consider other factors such as alternative splicing and changes in untranslated regions.
- Published
- 2013
- Full Text
- View/download PDF
9. ncRNA CONSENSUS SECONDARY STRUCTURE DERIVATION USING GRAMMAR STRINGS
- Author
-
Yanni Sun, Rujira Achawanantakun, and Seyedeh Shohreh Takyar
- Subjects
Models, Molecular ,Sequence ,RNA, Untranslated ,Theoretical computer science ,Grammar ,Genome, Human ,Computer science ,media_common.quotation_subject ,String (computer science) ,Structure (category theory) ,Computational Biology ,Sequence alignment ,Context-free grammar ,Biochemistry ,Computer Science Applications ,Consensus Sequence ,Humans ,Nucleic Acid Conformation ,Thermodynamics ,Computer Simulation ,Pseudoknot ,Sequence Alignment ,Molecular Biology ,Protein secondary structure ,media_common - Abstract
Many noncoding RNAs (ncRNAs) function through both their sequences and secondary structures. Thus, secondary structure derivation is an important issue in today's RNA research. The state-of-the-art structure annotation tools are based on comparative analysis, which derives consensus structure of homologous ncRNAs. Despite promising results from existing ncRNA aligning and consensus structure derivation tools, there is a need for more efficient and accurate ncRNA secondary structure modeling and alignment methods. In this work, we introduce a consensus structure derivation approach based on grammar string, a novel ncRNA secondary structure representation that encodes an ncRNA's sequence and secondary structure in the parameter space of a context-free grammar (CFG) and a full RNA grammar including pseudoknots. Being a string defined on a special alphabet constructed from a grammar, grammar string converts ncRNA alignment into sequence alignment. We derive consensus secondary structures from hundreds of ncRNA families from BraliBase 2.1 and 25 families containing pseudoknots using grammar string alignment. Our experiments have shown that grammar string-based structure derivation competes favorably in consensus structure quality with Murlet and RNASampler. Source code and experimental data are available at http://www.cse.msu.edu/~yannisun/grammar-string.
- Published
- 2011
- Full Text
- View/download PDF
10. LncRNA-ID: Long non-coding RNA IDentification using balanced random forests
- Author
-
Jiao Chen, Yanni Sun, Rujira Achawanantakun, and Yuan Zhang
- Subjects
Statistics and Probability ,Protein family ,Computer science ,Computational biology ,ENCODE ,Biochemistry ,Ribosome ,Transcriptome ,Machine Learning ,Mice ,Open Reading Frames ,Animals ,Humans ,Nucleotide ,Molecular Biology ,Gene ,Regulation of gene expression ,chemistry.chemical_classification ,RNA ,Proteins ,Ribosomal RNA ,Long non-coding RNA ,Computer Science Applications ,Random forest ,Computational Mathematics ,Open reading frame ,Identification (information) ,Computational Theory and Mathematics ,chemistry ,RNA, Long Noncoding ,Ribosomes ,Software - Abstract
Motivation: Long non-coding RNAs (lncRNAs), which are non-coding RNAs of length above 200 nucleotides, play important biological functions such as gene expression regulation. To fully reveal the functions of lncRNAs, a fundamental step is to annotate them in various species. However, as lncRNAs tend to encode one or multiple open reading frames, it is not trivial to distinguish these long non-coding transcripts from protein-coding genes in transcriptomic data. Results: In this work, we design a new tool that calculates the coding potential of a transcript using a machine learning model (random forest) based on multiple features including sequence characteristics of putative open reading frames, translation scores based on ribosomal coverage, and conservation against characterized protein families. The experimental results show that our tool competes favorably with existing coding potential computation tools in lncRNA identification. Availability and implementation: The scripts and data can be downloaded at https://github.com/zhangy72/LncRNA-ID Contact: yannisun@msu.edu Supplementary information: Supplementary data are available at Bioinformatics online.
- Published
- 2014
11. Genome, Functional Gene Annotation, and Nuclear Transformation of the Heterokont Oleaginous Alga Nannochloropsis oceanica CCMP1779
- Author
-
Eva M. Farré, Yair Shachar-Hill, Erika Erickson, Chia-Hong Tsai, Christopher M. Harvey, Michael S. Campbell, Ida Barbara Reca, Teresa J. Clark, Witawas Handee, Christoph Benning, Yanni Sun, Xiaobo Li, Hideki Takahashi, Jaruswan Warakanont, Min Hao Kuo, Yan Lu, Chelsea K. Thornburg, Blair Bullard, Ann A. Ferguson, Katherine W. Osteryoung, Krishna K. Niyogi, Ning Jiang, Eric L. Hegg, Adam J. Cornish, Sanjaya, Rujira Achawanantakun, Shin-Han Shiu, Rebecca Roston, Allan D. TerBush, Que Kong, Simone Zäuner, Guangxi Wu, Mark Yandell, David Cavalier, Jeffrey P. Simpson, Bensheng Liu, John B. Ohlrogge, Cheng Peng, Rahul Deshpande, Kevin L. Childs, Barbara B. Sears, Steven S. Lundback, Astrid Vieler, and Christopher J. Buehl
- Subjects
0106 biological sciences ,Cancer Research ,lcsh:QH426-470 ,Nitrogen ,Genomics ,Plant Science ,Biology ,01 natural sciences ,Genome ,03 medical and health sciences ,Transformation, Genetic ,Model Organisms ,Species Specificity ,Genetics ,14. Life underwater ,Molecular Biology ,Gene ,Genetics (clinical) ,Ecology, Evolution, Behavior and Systematics ,Organism ,030304 developmental biology ,Whole genome sequencing ,0303 health sciences ,Base Sequence ,Sequence Analysis, RNA ,Systems Biology ,Correction ,Molecular Sequence Annotation ,Sequence Analysis, DNA ,Genome project ,biology.organism_classification ,lcsh:Genetics ,Functional genomics ,Stramenopiles ,Nannochloropsis ,010606 plant biology & botany ,Research Article ,Biotechnology - Abstract
Unicellular marine algae have promise for providing sustainable and scalable biofuel feedstocks, although no single species has emerged as a preferred organism. Moreover, adequate molecular and genetic resources prerequisite for the rational engineering of marine algal feedstocks are lacking for most candidate species. Heterokonts of the genus Nannochloropsis naturally have high cellular oil content and are already in use for industrial production of high-value lipid products. First success in applying reverse genetics by targeted gene replacement makes Nannochloropsis oceanica an attractive model to investigate the cell and molecular biology and biochemistry of this fascinating organism group. Here we present the assembly of the 28.7 Mb genome of N. oceanica CCMP1779. RNA sequencing data from nitrogen-replete and nitrogen-depleted growth conditions support a total of 11,973 genes, of which in addition to automatic annotation some were manually inspected to predict the biochemical repertoire for this organism. Among others, more than 100 genes putatively related to lipid metabolism, 114 predicted transcription factors, and 109 transcriptional regulators were annotated. Comparison of the N. oceanica CCMP1779 gene repertoire with the recently published N. gaditana genome identified 2,649 genes likely specific to N. oceanica CCMP1779. Many of these N. oceanica–specific genes have putative orthologs in other species or are supported by transcriptional evidence. However, because similarity-based annotations are limited, functions of most of these species-specific genes remain unknown. Aside from the genome sequence and its analysis, protocols for the transformation of N. oceanica CCMP1779 are provided. The availability of genomic and transcriptomic data for Nannochloropsis oceanica CCMP1779, along with efficient transformation protocols, provides a blueprint for future detailed gene functional analysis and genetic engineering of Nannochloropsis species by a growing academic community focused on this genus., Author Summary Algae are a highly diverse group of organisms that have become the focus of renewed interest due to their potential for producing biofuel feedstocks, nutraceuticals, and biomaterials. Their high photosynthetic yields and ability to grow in areas unsuitable for agriculture provide a potential sustainable alternative to using traditional agricultural crops for biofuels. Because none of the algae currently in use have a history of domestication, and bioengineering of algae is still in its infancy, there is a need to develop algal strains adapted to cultivation for industrial large-scale production of desired compounds. Model organisms ranging from mice to baker's yeast have been instrumental in providing insights into fundamental biological structures and functions. The algal field needs versatile models to develop a fundamental understanding of photosynthetic production of biomass and valuable compounds in unicellular, marine, oleaginous algal species. To contribute to the development of such an algal model system for basic discovery, we sequenced the genome and two sets of transcriptomes of N. oceanica CCMP1779, assembled the genomic sequence, identified putative genes, and began to interpret the function of selected genes. This species was chosen because it is readily transformable with foreign DNA and grows well in culture.
- Published
- 2012
12. Shape and secondary structure prediction for ncRNAs including pseudoknots based on linear SVM
- Author
-
Rujira Achawanantakun and Yanni Sun
- Subjects
RNA, Untranslated ,Support Vector Machine ,0206 medical engineering ,Feature selection ,02 engineering and technology ,Biology ,lcsh:Computer applications to medicine. Medical informatics ,computer.software_genre ,Biochemistry ,03 medical and health sciences ,Software ,Knot (unit) ,Structural Biology ,Sensitivity (control systems) ,lcsh:QH301-705.5 ,Protein secondary structure ,Molecular Biology ,030304 developmental biology ,0303 health sciences ,business.industry ,Applied Mathematics ,Computational Biology ,Pattern recognition ,Computer Science Applications ,Support vector machine ,Proceedings ,lcsh:Biology (General) ,Nucleic Acid Conformation ,lcsh:R858-859.7 ,Adjacency list ,Data mining ,Artificial intelligence ,business ,Pseudoknot ,computer ,020602 bioinformatics - Abstract
Background Accurate secondary structure prediction provides important information to undefirstafinding the tertiary structures and thus the functions of ncRNAs. However, the accuracy of the native structure derivation of ncRNAs is still not satisfactory, especially on sequences containing pseudoknots. It is recently shown that using the abstract shapes, which retain adjacency and nesting of structural features but disregard the length details of helix and loop regions, can improve the performance of structure prediction. In this work, we use SVM-based feature selection to derive the consensus abstract shape of homologous ncRNAs and apply the predicted shape to structure prediction including pseudoknots. Results Our approach was applied to predict shapes and secondary structures on hundreds of ncRNA data sets with and without psuedoknots. The experimental results show that we can achieve 18% higher accuracy in shape prediction than the state-of-the-art consensus shape prediction tools. Using predicted shapes in structure prediction allows us to achieve approximate 29% higher sensitivity and 10% higher positive predictive value than other pseudoknot prediction tools. Conclusions Extensive analysis of RNA properties based on SVM allows us to identify important properties of sequences and structures related to their shapes. The combination of mass data analysis and SVM-based feature selection makes our approach a promising method for shape and structure prediction. The implemented tools, Knot Shape and Knot Structure are open source software and can be downloaded at: http://www.cse.msu.edu/~achawana/KnotShape.
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.