6 results on '"Rujira Achawanantakun"'
Search Results
2. Genome, functional gene annotation, and nuclear transformation of the heterokont oleaginous alga Nannochloropsis oceanica CCMP1779.
- Author
-
Astrid Vieler, Guangxi Wu, Chia-Hong Tsai, Blair Bullard, Adam J Cornish, Christopher Harvey, Ida-Barbara Reca, Chelsea Thornburg, Rujira Achawanantakun, Christopher J Buehl, Michael S Campbell, David Cavalier, Kevin L Childs, Teresa J Clark, Rahul Deshpande, Erika Erickson, Ann Armenia Ferguson, Witawas Handee, Que Kong, Xiaobo Li, Bensheng Liu, Steven Lundback, Cheng Peng, Rebecca L Roston, Sanjaya, Jeffrey P Simpson, Allan Terbush, Jaruswan Warakanont, Simone Zäuner, Eva M Farre, Eric L Hegg, Ning Jiang, Min-Hao Kuo, Yan Lu, Krishna K Niyogi, John Ohlrogge, Katherine W Osteryoung, Yair Shachar-Hill, Barbara B Sears, Yanni Sun, Hideki Takahashi, Mark Yandell, Shin-Han Shiu, and Christoph Benning
- Subjects
Genetics ,QH426-470 - Abstract
Unicellular marine algae have promise for providing sustainable and scalable biofuel feedstocks, although no single species has emerged as a preferred organism. Moreover, adequate molecular and genetic resources prerequisite for the rational engineering of marine algal feedstocks are lacking for most candidate species. Heterokonts of the genus Nannochloropsis naturally have high cellular oil content and are already in use for industrial production of high-value lipid products. First success in applying reverse genetics by targeted gene replacement makes Nannochloropsis oceanica an attractive model to investigate the cell and molecular biology and biochemistry of this fascinating organism group. Here we present the assembly of the 28.7 Mb genome of N. oceanica CCMP1779. RNA sequencing data from nitrogen-replete and nitrogen-depleted growth conditions support a total of 11,973 genes, of which in addition to automatic annotation some were manually inspected to predict the biochemical repertoire for this organism. Among others, more than 100 genes putatively related to lipid metabolism, 114 predicted transcription factors, and 109 transcriptional regulators were annotated. Comparison of the N. oceanica CCMP1779 gene repertoire with the recently published N. gaditana genome identified 2,649 genes likely specific to N. oceanica CCMP1779. Many of these N. oceanica-specific genes have putative orthologs in other species or are supported by transcriptional evidence. However, because similarity-based annotations are limited, functions of most of these species-specific genes remain unknown. Aside from the genome sequence and its analysis, protocols for the transformation of N. oceanica CCMP1779 are provided. The availability of genomic and transcriptomic data for Nannochloropsis oceanica CCMP1779, along with efficient transformation protocols, provides a blueprint for future detailed gene functional analysis and genetic engineering of Nannochloropsis species by a growing academic community focused on this genus.
- Published
- 2012
- Full Text
- View/download PDF
3. MAKER-P: A Tool Kit for the Rapid Creation, Management, and Quality Control of Plant Genome Annotations
- Author
-
Mei Yee Law, Gaurav D. Moghe, Carolyn J. Lawrence, Yanni Sun, David E. Hufnagel, Mark Yandell, Kevin L. Childs, Rujira Achawanantakun, Joshua C. Stein, Shin-Han Shiu, Dian Jiao, Doreen Ware, Ning Jiang, Michael S. Campbell, Jikai Lei, and Carson Holt
- Subjects
Physiology ,Pseudogene ,Arabidopsis ,Plant Science ,Computational biology ,Genes, Plant ,Zea mays ,Genome ,Article ,Annotation ,Genetics ,Arabidopsis thaliana ,Repetitive Sequences, Nucleic Acid ,biology ,Computational Biology ,Reproducibility of Results ,The Arabidopsis Information Resource ,Molecular Sequence Annotation ,Exons ,Genome project ,biology.organism_classification ,Non-coding RNA ,Alternative Splicing ,Genome, Plant ,Pseudogenes ,Software - Abstract
We have optimized and extended the widely used annotation engine MAKER in order to better support plant genome annotation efforts. New features include better parallelization for large repeat-rich plant genomes, noncoding RNA annotation capabilities, and support for pseudogene identification. We have benchmarked the resulting software tool kit, MAKER-P, using the Arabidopsis (Arabidopsis thaliana) and maize (Zea mays) genomes. Here, we demonstrate the ability of the MAKER-P tool kit to automatically update, extend, and revise the Arabidopsis annotations in light of newly available data and to annotate pseudogenes and noncoding RNAs absent from The Arabidopsis Informatics Resource 10 build. Our results demonstrate that MAKER-P can be used to manage and improve the annotations of even Arabidopsis, perhaps the best-annotated plant genome. We have also installed and benchmarked MAKER-P on the Texas Advanced Computing Center. We show that this public resource can de novo annotate the entire Arabidopsis and maize genomes in less than 3 h and produce annotations of comparable quality to those of the current The Arabidopsis Information Resource 10 and maize V2 annotation builds.
- Published
- 2013
- Full Text
- View/download PDF
4. LncRNA-ID: Long non-coding RNA IDentification using balanced random forests
- Author
-
Jiao Chen, Yanni Sun, Rujira Achawanantakun, and Yuan Zhang
- Subjects
Statistics and Probability ,Protein family ,Computer science ,Computational biology ,ENCODE ,Biochemistry ,Ribosome ,Transcriptome ,Machine Learning ,Mice ,Open Reading Frames ,Animals ,Humans ,Nucleotide ,Molecular Biology ,Gene ,Regulation of gene expression ,chemistry.chemical_classification ,RNA ,Proteins ,Ribosomal RNA ,Long non-coding RNA ,Computer Science Applications ,Random forest ,Computational Mathematics ,Open reading frame ,Identification (information) ,Computational Theory and Mathematics ,chemistry ,RNA, Long Noncoding ,Ribosomes ,Software - Abstract
Motivation: Long non-coding RNAs (lncRNAs), which are non-coding RNAs of length above 200 nucleotides, play important biological functions such as gene expression regulation. To fully reveal the functions of lncRNAs, a fundamental step is to annotate them in various species. However, as lncRNAs tend to encode one or multiple open reading frames, it is not trivial to distinguish these long non-coding transcripts from protein-coding genes in transcriptomic data. Results: In this work, we design a new tool that calculates the coding potential of a transcript using a machine learning model (random forest) based on multiple features including sequence characteristics of putative open reading frames, translation scores based on ribosomal coverage, and conservation against characterized protein families. The experimental results show that our tool competes favorably with existing coding potential computation tools in lncRNA identification. Availability and implementation: The scripts and data can be downloaded at https://github.com/zhangy72/LncRNA-ID Contact: yannisun@msu.edu Supplementary information: Supplementary data are available at Bioinformatics online.
- Published
- 2014
5. Genome, Functional Gene Annotation, and Nuclear Transformation of the Heterokont Oleaginous Alga Nannochloropsis oceanica CCMP1779
- Author
-
Eva M. Farré, Yair Shachar-Hill, Erika Erickson, Chia-Hong Tsai, Christopher M. Harvey, Michael S. Campbell, Ida Barbara Reca, Teresa J. Clark, Witawas Handee, Christoph Benning, Yanni Sun, Xiaobo Li, Hideki Takahashi, Jaruswan Warakanont, Min Hao Kuo, Yan Lu, Chelsea K. Thornburg, Blair Bullard, Ann A. Ferguson, Katherine W. Osteryoung, Krishna K. Niyogi, Ning Jiang, Eric L. Hegg, Adam J. Cornish, Sanjaya, Rujira Achawanantakun, Shin-Han Shiu, Rebecca Roston, Allan D. TerBush, Que Kong, Simone Zäuner, Guangxi Wu, Mark Yandell, David Cavalier, Jeffrey P. Simpson, Bensheng Liu, John B. Ohlrogge, Cheng Peng, Rahul Deshpande, Kevin L. Childs, Barbara B. Sears, Steven S. Lundback, Astrid Vieler, and Christopher J. Buehl
- Subjects
0106 biological sciences ,Cancer Research ,lcsh:QH426-470 ,Nitrogen ,Genomics ,Plant Science ,Biology ,01 natural sciences ,Genome ,03 medical and health sciences ,Transformation, Genetic ,Model Organisms ,Species Specificity ,Genetics ,14. Life underwater ,Molecular Biology ,Gene ,Genetics (clinical) ,Ecology, Evolution, Behavior and Systematics ,Organism ,030304 developmental biology ,Whole genome sequencing ,0303 health sciences ,Base Sequence ,Sequence Analysis, RNA ,Systems Biology ,Correction ,Molecular Sequence Annotation ,Sequence Analysis, DNA ,Genome project ,biology.organism_classification ,lcsh:Genetics ,Functional genomics ,Stramenopiles ,Nannochloropsis ,010606 plant biology & botany ,Research Article ,Biotechnology - Abstract
Unicellular marine algae have promise for providing sustainable and scalable biofuel feedstocks, although no single species has emerged as a preferred organism. Moreover, adequate molecular and genetic resources prerequisite for the rational engineering of marine algal feedstocks are lacking for most candidate species. Heterokonts of the genus Nannochloropsis naturally have high cellular oil content and are already in use for industrial production of high-value lipid products. First success in applying reverse genetics by targeted gene replacement makes Nannochloropsis oceanica an attractive model to investigate the cell and molecular biology and biochemistry of this fascinating organism group. Here we present the assembly of the 28.7 Mb genome of N. oceanica CCMP1779. RNA sequencing data from nitrogen-replete and nitrogen-depleted growth conditions support a total of 11,973 genes, of which in addition to automatic annotation some were manually inspected to predict the biochemical repertoire for this organism. Among others, more than 100 genes putatively related to lipid metabolism, 114 predicted transcription factors, and 109 transcriptional regulators were annotated. Comparison of the N. oceanica CCMP1779 gene repertoire with the recently published N. gaditana genome identified 2,649 genes likely specific to N. oceanica CCMP1779. Many of these N. oceanica–specific genes have putative orthologs in other species or are supported by transcriptional evidence. However, because similarity-based annotations are limited, functions of most of these species-specific genes remain unknown. Aside from the genome sequence and its analysis, protocols for the transformation of N. oceanica CCMP1779 are provided. The availability of genomic and transcriptomic data for Nannochloropsis oceanica CCMP1779, along with efficient transformation protocols, provides a blueprint for future detailed gene functional analysis and genetic engineering of Nannochloropsis species by a growing academic community focused on this genus., Author Summary Algae are a highly diverse group of organisms that have become the focus of renewed interest due to their potential for producing biofuel feedstocks, nutraceuticals, and biomaterials. Their high photosynthetic yields and ability to grow in areas unsuitable for agriculture provide a potential sustainable alternative to using traditional agricultural crops for biofuels. Because none of the algae currently in use have a history of domestication, and bioengineering of algae is still in its infancy, there is a need to develop algal strains adapted to cultivation for industrial large-scale production of desired compounds. Model organisms ranging from mice to baker's yeast have been instrumental in providing insights into fundamental biological structures and functions. The algal field needs versatile models to develop a fundamental understanding of photosynthetic production of biomass and valuable compounds in unicellular, marine, oleaginous algal species. To contribute to the development of such an algal model system for basic discovery, we sequenced the genome and two sets of transcriptomes of N. oceanica CCMP1779, assembled the genomic sequence, identified putative genes, and began to interpret the function of selected genes. This species was chosen because it is readily transformable with foreign DNA and grows well in culture.
- Published
- 2012
6. Shape and secondary structure prediction for ncRNAs including pseudoknots based on linear SVM
- Author
-
Rujira Achawanantakun and Yanni Sun
- Subjects
RNA, Untranslated ,Support Vector Machine ,0206 medical engineering ,Feature selection ,02 engineering and technology ,Biology ,lcsh:Computer applications to medicine. Medical informatics ,computer.software_genre ,Biochemistry ,03 medical and health sciences ,Software ,Knot (unit) ,Structural Biology ,Sensitivity (control systems) ,lcsh:QH301-705.5 ,Protein secondary structure ,Molecular Biology ,030304 developmental biology ,0303 health sciences ,business.industry ,Applied Mathematics ,Computational Biology ,Pattern recognition ,Computer Science Applications ,Support vector machine ,Proceedings ,lcsh:Biology (General) ,Nucleic Acid Conformation ,lcsh:R858-859.7 ,Adjacency list ,Data mining ,Artificial intelligence ,business ,Pseudoknot ,computer ,020602 bioinformatics - Abstract
Background Accurate secondary structure prediction provides important information to undefirstafinding the tertiary structures and thus the functions of ncRNAs. However, the accuracy of the native structure derivation of ncRNAs is still not satisfactory, especially on sequences containing pseudoknots. It is recently shown that using the abstract shapes, which retain adjacency and nesting of structural features but disregard the length details of helix and loop regions, can improve the performance of structure prediction. In this work, we use SVM-based feature selection to derive the consensus abstract shape of homologous ncRNAs and apply the predicted shape to structure prediction including pseudoknots. Results Our approach was applied to predict shapes and secondary structures on hundreds of ncRNA data sets with and without psuedoknots. The experimental results show that we can achieve 18% higher accuracy in shape prediction than the state-of-the-art consensus shape prediction tools. Using predicted shapes in structure prediction allows us to achieve approximate 29% higher sensitivity and 10% higher positive predictive value than other pseudoknot prediction tools. Conclusions Extensive analysis of RNA properties based on SVM allows us to identify important properties of sequences and structures related to their shapes. The combination of mass data analysis and SVM-based feature selection makes our approach a promising method for shape and structure prediction. The implemented tools, Knot Shape and Knot Structure are open source software and can be downloaded at: http://www.cse.msu.edu/~achawana/KnotShape.
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.