27 results on '"Michael Paulini"'
Search Results
2. The genome sequence of the heart cockle, Fragum sueziense (Issel, 1869) [version 1; peer review: 2 approved]
- Author
-
Jose Victor Lopez, Eerik Aunin, Victoria McKenna, Sarah Lemer, Isabelle Ailish Clayton-Lucey, Ruiqi Li, Jingchun Li, Elizabeth Sinclair, Graeme Oatley, Camilla Santos, Noah Gettle, Haoyu Niu, Michael Paulini, and Rebecca O’Brien
- Subjects
Fragum sueziense ,heart cockle ,genome sequence ,chromosomal ,Cardiida ,Fraginae ,eng ,Medicine ,Science - Abstract
We present a genome assembly from an individual Fragum sueziense (the heart cockle; Mollusca; Bivalvia; Cardiida; Cardiidae). The genome sequence is 1,206.1 megabases in span. Most of the assembly is scaffolded into 19 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 92.77 kilobases in length. Gene annotation of this assembly on Ensembl identified 70,309 protein-coding genes.
- Published
- 2024
- Full Text
- View/download PDF
3. The genome sequences of the marine diatom Epithemia pelagica strain UHM3201 (Schvarcz, Stancheva & Steward, 2022) and its nitrogen-fixing, endosymbiotic cyanobacterium [version 1; peer review: 2 approved]
- Author
-
Christopher R. Schvarcz, Eerik Aunin, Kendra A. Turk-Kubo, Rosalina Stancheva, Jonathan P. Zehr, Victoria McKenna, Samuel T. Wilson, Grieg F. Steward, Kyle F. Edwards, John M. Archibald, Elizabeth Sinclair, Graeme Oatley, Camilla Santos, Noah Gettle, Haoyu Niu, Michael Paulini, and Rebecca O’Brien
- Subjects
Epithemia pelagica strain UHM3201 ,cyanobacterial endosymbiont ,pennate diatom ,genome sequence ,chromosomal ,Rhopalodiales ,eng ,Medicine ,Science - Abstract
We present the genome assembly of the pennate diatom Epithemia pelagica strain UHM3201 (Ochrophyta; Bacillariophyceae; Rhopalodiales; Rhopalodiaceae) and that of its cyanobacterial endosymbiont (Chroococcales: Aphanothecaceae). The genome sequence of the diatom is 60.3 megabases in span, and the cyanobacterial genome has a length of 2.48 megabases. Most of the diatom nuclear genome assembly is scaffolded into 15 chromosomal pseudomolecules. The organelle genomes have also been assembled, with the mitochondrial genome 40.08 kilobases and the plastid genome 130.75 kilobases in length. A number of other prokaryote MAGs were also assembled.
- Published
- 2024
- Full Text
- View/download PDF
4. The genome sequence of a heart cockle, Fragum fragum (Linnaeus, 1758) [version 1; peer review: 2 approved, 2 approved with reservations]
- Author
-
Eerik Aunin, Victoria McKenna, Graeme Oatley, Sarah Lemer, Isabelle Ailish Clayton-Lucey, Ruiqi Li, Jose Victor Lopez, Jingchun Li, Elizabeth Sinclair, Camilla Santos, Noah Gettle, Haoyu Niu, Michael Paulini, and Rebecca O’Brien
- Subjects
Fragum fragum ,heart cockle ,genome sequence ,chromosomal ,Veneroida ,eng ,Medicine ,Science - Abstract
We present a genome assembly from an individual specimen of Fragum fragum (a heart cockle; Mollusca; Bivalvia; Veneroida; Cardiidae). The genome sequence is 1,153.1 megabases in span. Most of the assembly is scaffolded into 19 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 22.36 kilobases in length. Gene annotation of this assembly on Ensembl identified 17,262 protein coding genes.
- Published
- 2024
- Full Text
- View/download PDF
5. The genome sequence of the giant clam, Tridacna gigas (Linnaeus, 1758) [version 1; peer review: 2 approved, 1 approved with reservations]
- Author
-
Eerik Aunin, Victoria McKenna, Isabelle Ailish Clayton-Lucey, Ruiqi Li, Jose Victor Lopez, Jingchun Li, Elizabeth Sinclair, Graeme Oatley, Camilla Santos, Noah Gettle, Haoyu Niu, Michael Paulini, and Rebecca O’Brien
- Subjects
Tridacna gigas ,giant clam ,genome sequence ,chromosomal ,Veneroida ,eng ,Medicine ,Science - Abstract
We present a chromosomal-level genome assembly from an individual Tridacna gigas (the giant clam; Mollusca; Bivalvia; Veneroida; Cardiidae). The genome sequence is 1,175.9 megabases in span. Most of the assembly is scaffolded into 17 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 25.34 kilobases in length. Gene annotation of this assembly on Ensembl identified 18,177 protein coding genes.
- Published
- 2024
- Full Text
- View/download PDF
6. Sex chromosome evolution in parasitic nematodes of humans
- Author
-
Jeremy M. Foster, Alexandra Grote, John Mattick, Alan Tracey, Yu-Chih Tsai, Matthew Chung, James A. Cotton, Tyson A. Clark, Adam Geber, Nancy Holroyd, Jonas Korlach, Yichao Li, Silvia Libro, Sara Lustigman, Michelle L. Michalski, Michael Paulini, Matthew B. Rogers, Laura Teigen, Alan Twaddle, Lonnie Welch, Matthew Berriman, Julie C. Dunning Hotopp, and Elodie Ghedin
- Subjects
Science - Abstract
Many nematode worms, including Caenorhabditis elegans have XX/XO sex determination, while other species have XY. The authors use a new genome assembly of the filarial parasite Brugia malayi and published data to show that nematode sex chromosome evolution is highly plastic.
- Published
- 2020
- Full Text
- View/download PDF
7. Combining RNA-seq data and homology-based gene prediction for plants, animals and fungi
- Author
-
Jens Keilwagen, Frank Hartung, Michael Paulini, Sven O. Twardziok, and Jan Grau
- Subjects
Homology-based gene prediction ,RNA-seq ,Genome annotation ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Genome annotation is of key importance in many research questions. The identification of protein-coding genes is often based on transcriptome sequencing data, ab-initio or homology-based prediction. Recently, it was demonstrated that intron position conservation improves homology-based gene prediction, and that experimental data improves ab-initio gene prediction. Results Here, we present an extension of the gene prediction program GeMoMa that utilizes amino acid sequence conservation, intron position conservation and optionally RNA-seq data for homology-based gene prediction. We show on published benchmark data for plants, animals and fungi that GeMoMa performs better than the gene prediction programs BRAKER1, MAKER2, and CodingQuarry, and purely RNA-seq-based pipelines for transcript identification. In addition, we demonstrate that using multiple reference organisms may help to further improve the performance of GeMoMa. Finally, we apply GeMoMa to four nematode species and to the recently published barley reference genome indicating that current annotations of protein-coding genes may be refined using GeMoMa predictions. Conclusions GeMoMa might be of great utility for annotating newly sequenced genomes but also for finding homologs of a specific gene or gene family. GeMoMa has been published under GNU GPL3 and is freely available at http://www.jstacs.de/index.php/GeMoMa.
- Published
- 2018
- Full Text
- View/download PDF
8. Ensembl Genomes 2020—enabling non-vertebrate genomic research
- Author
-
Bruno Contreras-Moreira, Laurent Gil, Uma Maheswari, Erin Haskell, Paul Flicek, Kim E. Hammond-Kosack, Stephen J. Trevanion, Matthieu Barba, Sushma Naithani, Matthew Russell, Mark D. McDowall, Nancy George, Andrew D. Yates, Emily Perry, Vasily Sitnik, Dan Bolser, Michael Paulini, Paul J. Kersey, Pankaj Jaiswal, Mikkel B. Christensen, Helder Pedro, Gareth Maslen, Lahcen Cambell, Sophie Helen Janacek, Daniel M. Staines, Gary Williams, Matthieu Muffato, Carla Cummins, Wasiu Akanni, Marc Rosello, James E. Allen, Irene Papatheodorou, Astrid Gall, Nick Langridge, Marc Chakiachvili, Joshua C. Stein, Mateus Patricio, Silvie Fexova, Nishadi De Silva, Sarah E. Hunt, Benjamin Moore, Martin Urban, Justin Preece, Marcela K. Tello-Ruiz, Guy Naamati, Andrew Olson, Parul Gupta, Thomas Maurel, Jorge Alvarez-Jarreta, Doreen Ware, Paul Davis, Manuel Carbajo, Alayne Cuzick, Kevin L. Howe, and Sharon Wei
- Subjects
0106 biological sciences ,Tree of life ,Genomics ,Context (language use) ,Computational biology ,Biology ,01 natural sciences ,Genome ,User-Computer Interface ,03 medical and health sciences ,Resource (project management) ,Reference Values ,Ensembl Genomes ,Databases, Genetic ,Genetics ,Database Issue ,Animals ,Ensembl ,Caenorhabditis elegans ,030304 developmental biology ,Internet ,0303 health sciences ,Computational Biology ,Genetic Variation ,Molecular Sequence Annotation ,Plants ,Phenotype ,Genome, Fungal ,Algorithms ,Genome, Bacterial ,Genome, Plant ,Software ,010606 plant biology & botany - Abstract
Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of interfaces to genomic data across the tree of life, including reference genome sequence, gene models, transcriptional data, genetic variation and comparative analysis. Data may be accessed via our website, online tools platform and programmatic interfaces, with updates made four times per year (in synchrony with Ensembl). Here, we provide an overview of Ensembl Genomes, with a focus on recent developments. These include the continued growth, more robust and reproducible sets of orthologues and paralogues, and enriched views of gene expression and gene function in plants. Finally, we report on our continued deeper integration with the Ensembl project, which forms a key part of our future strategy for dealing with the increasing quantity of available genome-scale data across the tree of life.
- Published
- 2019
9. Alliance of Genome Resources Portal: unified model organism research platform
- Author
-
Adam Wright, Paul W. Sternberg, Daniela Raciti, Monika Tutaj, Josh Goodman, Ken Frazer, Paul Thomas, Scott Cain, Raymond Lee, Judith A. Blake, Patrick Kalita, Ajay Shrivatsav, Julie Agapite, Marek S. Skrzypek, Hans-Michael Mueller, Wen J. Chen, Karen Yook, Gillian Millburn, Joanna Argasinska, David Fashena, Kevin Schaper, Joel E. Richardson, Douglas G. Howe, Barbara Dunn, Yvonne M. Bradford, Nathan Dunn, Jaehyoung Cho, Ranjana Kishore, Kalpana Karra, Sabrina Toro, Anne E. Eagle, Norbert Perrimon, Anushya Muruganujan, Beverley B. Matthews, Christian A. Grove, Edith D. Wong, Monte Westerfield, Olin Blodgett, Gary Williams, Jose-Maria Urbano, Marie-Claire Harrison, Steven J Marygold, Tremayne Mushayahama, Marek Tutaj, Susan Russo Gelbart, Jennifer R. Smith, Felix Gondwe, Dustin Ebert, Juancarlos Chan, J. Michael Cherry, Ceri E. Van Slyke, Christopher J. Tabone, L. Sian Gramates, Madeline A. Crosby, Robert S. Nash, Kevin A. MacPherson, Patrick Ng, Christian Pich, Suzi Aleksander, Monika Tomczuk, Brian R. Calvi, Todd W. Harris, Cynthia L. Smith, Stan Laulederkind, Jyothi Thota, Gilberto dos Santos, Matt Simison, Kimberly Van Auken, Mary E. Dolan, Karen R. Christie, Stacia R. Engel, Leyla Ruzicka, Carol J. Bult, Kevin L. Howe, Stuart R. Miyasato, Shur-Jen Wang, David R. Shaw, Mary Shimoyama, Valerio Arnaboldi, Matthew Russell, Michael Paulini, Sibyl Gao, Sagar Jha, Jeff De Pons, Christopher J. Mungall, Seth Carbon, James A. Kadin, Sierra A. T. Moxon, Susan M. Bello, Thomas C. Kaufman, Laurent-Philippe Albou, Shuai Weng, and Helen Attrill
- Subjects
NAR Breakthrough Article ,Saccharomyces cerevisiae ,Biology ,Genome ,Data modeling ,Mice ,User-Computer Interface ,03 medical and health sciences ,0302 clinical medicine ,Resource (project management) ,Databases, Genetic ,Genetics ,Animals ,Humans ,Caenorhabditis elegans ,Alleles ,Zebrafish ,Organism ,030304 developmental biology ,Internet ,0303 health sciences ,Genome, Human ,Computational Biology ,Genomics ,Data science ,Rats ,Variety (cybernetics) ,Drosophila melanogaster ,Gene Ontology ,Data access ,Alliance ,Workflow ,Software ,030217 neurology & neurosurgery - Abstract
The Alliance of Genome Resources (Alliance) is a consortium of the major model organism databases and the Gene Ontology that is guided by the vision of facilitating exploration of related genes in human and well-studied model organisms by providing a highly integrated and comprehensive platform that enables researchers to leverage the extensive body of genetic and genomic studies in these organisms. Initiated in 2016, the Alliance is building a central portal (www.alliancegenome.org) for access to data for the primary model organisms along with gene ontology data and human data. All data types represented in the Alliance portal (e.g. genomic data and phenotype descriptions) have common data models and workflows for curation. All data are open and freely available via a variety of mechanisms. Long-term plans for the Alliance project include a focus on coverage of additional model organisms including those without dedicated curation communities, and the inclusion of new data types with a particular focus on providing data and tools for the non-model-organism researcher that support enhanced discovery about human health and disease. Here we review current progress and present immediate plans for this new bioinformatics resource.
- Published
- 2019
10. Genomic and transcriptomic variation defines the chromosome-scale assembly of Haemonchus contortus, a model gastrointestinal worm
- Author
-
Nancy Holroyd, Roz Laing, Faye H. Rodgers, Stephen R. Doyle, Geetha Sankaranarayanan, Alan Tracey, Michael Paulini, James Cotton, John S. Gilleard, Janneke Wit, Matthew Berriman, Karen Brooks, Kevin L. Howe, Collette Britton, David J. Bartley, Guillaume Sallé, Kirsty Maitland, Jennifer D. Noonan, Wojtek Bazant, Elizabeth Redman, Eileen Devaney, Axel Martinelli, Robin N. Beech, Neil Sargison, Muhammad Zubair Shabbir, Helen Beasley, Michael A. Quail, Umer Chaudhry, The Wellcome Trust Sanger Institute [Cambridge], Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Moredun Research Institute [Penicuik, UK] (MRI), Institute of Parasitology [Ste Anne de Bellevue, Québec], McGill University = Université McGill [Montréal, Canada], Royal (Dick) School of Veterinary Studies, European Molecular Biology Laboratory [Hinxton], Faculty of Veterinary Medicine, University of Calgary, University of Calgary, Infectiologie et Santé Publique (UMR ISP), Université de Tours-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE), University of Veterinary and Animal Sciences [Lahore, Pakistan] (UVAS), Wellcome's core funding of the Wellcome Sanger Institute (grant WT206194), Wellcome Trust Project Grant to JSG (grant 067811), the Biotechnology and Biological Sciences Research Council (BBSRC) grants (BB/M003949/1, BB/P024610/1 and BB/K020048/1), and Université de Tours (UT)-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE)
- Subjects
Male ,Population genetics ,Medicine (miscellaneous) ,Sequence assembly ,Genome informatics ,Population genomics ,transcriptomics ,0302 clinical medicine ,Haemonchus contortus ,Intestinal Diseases, Parasitic ,lcsh:QH301-705.5 ,Caenorhabditis elegans ,[SDV.EE]Life Sciences [q-bio]/Ecology, environment ,0303 health sciences ,biology ,Genomics ,Non-model organisms ,3. Good health ,Caenorhabditis ,Female ,Haemonchus ,General Agricultural and Biological Sciences ,Parasitic infection ,population genomics ,Computational biology ,Models, Biological ,Chromosomes ,Article ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,parasitic diseases ,PacBio long-read sequencing ,Animals ,Humans ,Gene ,030304 developmental biology ,Genome, Helminth ,Life Cycle Stages ,[SDV.GEN]Life Sciences [q-bio]/Genetics ,Genetic diversity ,[SDV.BA.MVSA]Life Sciences [q-bio]/Animal biology/Veterinary medicine and animal Health ,Comparative genomics ,biology.organism_classification ,lcsh:Biology (General) ,chromosomal genome assembly ,Iso- Seq cDNA sequencing ,genetic variation ,Haemonchiasis ,Transcriptome ,030217 neurology & neurosurgery ,[SDV.EE.IEO]Life Sciences [q-bio]/Ecology, environment/Symbiosis - Abstract
Haemonchus contortus is a globally distributed and economically important gastrointestinal pathogen of small ruminants and has become a key nematode model for studying anthelmintic resistance and other parasite-specific traits among a wider group of parasites including major human pathogens. Here, we report using PacBio long-read and OpGen and 10X Genomics long-molecule methods to generate a highly contiguous 283.4 Mbp chromosome-scale genome assembly including a resolved sex chromosome for the MHco3(ISE).N1 isolate. We show a remarkable pattern of conservation of chromosome content with Caenorhabditis elegans, but almost no conservation of gene order. Short and long-read transcriptome sequencing allowed us to define coordinated transcriptional regulation throughout the parasite’s life cycle and refine our understanding of cis- and trans-splicing. Finally, we provide a comprehensive picture of chromosome-wide genetic diversity both within a single isolate and globally. These data provide a high-quality comparison for understanding the evolution and genomics of Caenorhabditis and other nematodes and extend the experimental tractability of this model parasitic nematode in understanding helminth biology, drug discovery and vaccine development, as well as important adaptive traits such as drug resistance., Stephen Doyle et al. report the chromosome-scale genome assembly and transcriptome sequence data of Haemonchus contortus, a key parasitic nematode model. These data show nearly complete conservation of chromosome content with C. elegans and brings insight into transcriptional regulation and chromosome-wide genetic diversity in this important pathogen.
- Published
- 2020
11. Nearly Complete Genome Sequence of Brugia malayi Strain FR3
- Author
-
Jonas Korlach, Michelle L. Michalski, Elodie Ghedin, Yu-Chih Tsai, Matthew B. Rogers, Sara Lustigman, Jeremy M. Foster, Adam Geber, Alan Tracey, Matthew Berriman, Alexandra Grote, Silvia Libro, John S. Mattick, Michael Paulini, Alan Twaddle, Nancy Holroyd, Matthew Chung, Tyson A. Clark, Julie C. Dunning Hotopp, and James Cotton
- Subjects
0301 basic medicine ,Whole genome sequencing ,Genetics ,Autosome ,biology ,030231 tropical medicine ,Genome Sequences ,Elephantiasis ,biology.organism_classification ,Y chromosome ,medicine.disease ,Brugia malayi ,03 medical and health sciences ,030104 developmental biology ,0302 clinical medicine ,Immunology and Microbiology (miscellaneous) ,parasitic diseases ,medicine ,Molecular Biology ,Lymphatic filariasis ,X chromosome ,Sequence (medicine) - Abstract
Lymphatic filariasis affects ∼120 million people and can result in elephantiasis and hydrocele. Here, we report the nearly complete genome sequence of the best-studied causative agent of lymphatic filariasis, Brugia malayi. The assembly contains four autosomes, an X chromosome, and only eight gaps but lacks a contiguous sequence for the known Y chromosome.
- Published
- 2020
12. Extensive genomic and transcriptomic variation defines the chromosome-scale assembly ofHaemonchus contortus, a model gastrointestinal worm
- Author
-
Stephen R. Doyle, Alan Tracey, Roz Laing, Nancy Holroyd, David Bartley, Wojtek Bazant, Helen Beasley, Robin Beech, Collette Britton, Karen Brooks, Umer Chaudhry, Kirsty Maitland, Axel Martinelli, Jennifer D. Noonan, Michael Paulini, Michael A. Quail, Elizabeth Redman, Faye H. Rodgers, Guillaume Sallé, Muhammad Zubair Shabbir, Geetha Sankaranarayanan, Janneke Wit, Kevin L. Howe, Neil Sargison, Eileen Devaney, Matthew Berriman, John S. Gilleard, and James A. Cotton
- Subjects
0303 health sciences ,biology ,Sequence assembly ,Genomics ,Computational biology ,biology.organism_classification ,Genome ,Caenorhabditis ,03 medical and health sciences ,0302 clinical medicine ,Gene ,030217 neurology & neurosurgery ,Caenorhabditis elegans ,030304 developmental biology ,Haemonchus contortus ,Synteny - Abstract
BackgroundHaemonchus contortusis a globally distributed and economically important gastrointestinal pathogen of small ruminants, and has become the key nematode model for studying anthelmintic resistance and other parasite-specific traits among a wider group of parasites including major human pathogens. Two draft genome assemblies forH. contortuswere reported in 2013, however, both were highly fragmented, incomplete, and differed from one another in important respects. While the introduction of long-read sequencing has significantly increased the rate of production and contiguity ofde novogenome assemblies broadly, achieving high quality genome assemblies for small, genetically diverse, outcrossing eukaryotic organisms such asH. contortusremains a significant challenge.ResultsHere, we report using PacBio long read and OpGen and 10X Genomics long-molecule methods to generate a highly contiguous 283.4 Mbp chromosome-scale genome assembly including a resolved sex chromosome. We show a remarkable pattern of almost complete conservation of chromosome content (synteny) withCaenorhabditis elegans, but almost no conservation of gene order. Long-read transcriptome sequence data has allowed us to define coordinated transcriptional regulation throughout the life cycle of the parasite, and refine our understanding ofcis- andtrans-splicing relative to that observed inC. elegans. Finally, we use this assembly to give a comprehensive picture of chromosome-wide genetic diversity both within a single isolate and globally.ConclusionsTheH. contortusMHco3(ISE).N1 genome assembly presented here represents the most contiguous and resolved nematode assembly outside of theCaenorhabditisgenus to date, together with one of the highest-quality set of predicted gene features. These data provide a high-quality comparison for understanding the evolution and genomics ofCaenorhabditisand other nematodes, and extends the experimental tractability of this model parasitic nematode in understanding pathogen biology, drug discovery and vaccine development, and important adaptive traits such as drug resistance.
- Published
- 2020
13. WormBase: a modern Model Organism Information Resource
- Author
-
Christian A. Grove, Faye H. Rodgers, Gary Williams, Wen J. Chen, Jaehyoung Cho, Paul W. Sternberg, Karen Yook, Valerio Arnaboldi, Matthew Russell, Sibyl Gao, Lincoln Stein, Raymond Lee, Qinghua Wang, Juancarlos Chan, Paul Davis, Paulo A. S. Nuin, Daniela Raciti, Michael Paulini, Adam Wright, Kevin L. Howe, Gary Schindelman, Hans-Michael Müller, Scott Cain, Tim Schedl, Cecilia Nakamura, Ranjana Kishore, Kimberly Van Auken, and Todd W. Harris
- Subjects
Interface (Java) ,Cloud computing ,Biology ,World Wide Web ,03 medical and health sciences ,User-Computer Interface ,0302 clinical medicine ,Databases, Genetic ,Genetics ,Animals ,Data Mining ,Database Issue ,Architecture ,Caenorhabditis elegans ,Genes, Helminth ,030304 developmental biology ,0303 health sciences ,Internet ,business.industry ,Genomics ,biology.organism_classification ,Workflow ,The Internet ,WormBase ,User interface ,business ,030217 neurology & neurosurgery - Abstract
WormBase (https://wormbase.org/) is a mature Model Organism Information Resource supporting researchers using the nematode Caenorhabditis elegans as a model system for studies across a broad range of basic biological processes. Toward this mission, WormBase efforts are arranged in three primary facets: curation, user interface and architecture. In this update, we describe progress in each of these three areas. In particular, we discuss the status of literature curation and recently added data, detail new features of the web interface and options for users wishing to conduct data mining workflows, and discuss our efforts to build a robust and scalable architecture by leveraging commercial cloud offerings. We conclude with a description of WormBase's role as a founding member of the nascent Alliance of Genome Resources.
- Published
- 2019
14. Sex chromosome evolution in parasitic nematodes of humans
- Author
-
John S. Mattick, Adam Geber, Laura Teigen, Matthew Chung, Alan Twaddle, Sara Lustigman, Silvia Libro, Jeremy M. Foster, Alan Tracey, Yichao Li, Michelle L. Michalski, Tyson A. Clark, Matthew Berriman, Alexandra Grote, Elodie Ghedin, Yu-Chih Tsai, Nancy Holroyd, Matthew B. Rogers, Jonas Korlach, Michael Paulini, Lonnie R. Welch, James Cotton, and Julie C. Dunning Hotopp
- Subjects
0301 basic medicine ,Male ,Evolution of sexual reproduction ,Nematoda ,Science ,General Physics and Astronomy ,Helminth genetics ,Genome ,General Biochemistry, Genetics and Molecular Biology ,Brugia malayi ,Article ,Evolutionary genetics ,Evolution, Molecular ,03 medical and health sciences ,0302 clinical medicine ,parasitic diseases ,Animals ,Humans ,lcsh:Science ,Caenorhabditis elegans ,Nematode Infections ,Repetitive Sequences, Nucleic Acid ,Genome, Helminth ,Multidisciplinary ,Autosome ,Sex Chromosomes ,biology ,Eukaryote ,Parasite genomics ,Chromosome ,Chromosome Mapping ,General Chemistry ,Sex Determination Processes ,biology.organism_classification ,Genome evolution ,030104 developmental biology ,Nematode ,Gene Expression Regulation ,Evolutionary biology ,lcsh:Q ,Female ,030217 neurology & neurosurgery - Abstract
Sex determination mechanisms often differ even between related species yet the evolution of sex chromosomes remains poorly understood in all but a few model organisms. Some nematodes such as Caenorhabditis elegans have an XO sex determination system while others, such as the filarial parasite Brugia malayi, have an XY mechanism. We present a complete B. malayi genome assembly and define Nigon elements shared with C. elegans, which we then map to the genomes of other filarial species and more distantly related nematodes. We find a remarkable plasticity in sex chromosome evolution with several distinct cases of neo-X and neo-Y formation, X-added regions, and conversion of autosomes to sex chromosomes from which we propose a model of chromosome evolution across different nematode clades. The phylum Nematoda offers a new and innovative system for gaining a deeper understanding of sex chromosome evolution., Many nematode worms, including Caenorhabditis elegans have XX/XO sex determination, while other species have XY. The authors use a new genome assembly of the filarial parasite Brugia malayi and published data to show that nematode sex chromosome evolution is highly plastic.
- Published
- 2019
15. Using WormBase: A Genome Biology Resource for Caenorhabditis elegans and Related Nematodes
- Author
-
Gary Williams, Kimberly Van Auken, Raymond Lee, Daniela Raciti, Scott Cain, Wen J. Chen, Paul Davis, Mary Ann Tuli, Kevin L. Howe, Michael Paulini, Ranjana Kishore, Todd W. Harris, Christian A. Grove, and Kollmar, Martin
- Subjects
0301 basic medicine ,Proteome ,Computer science ,Genomics ,Computational biology ,Web Browser ,Genome ,Article ,03 medical and health sciences ,User-Computer Interface ,Resource (project management) ,Research community ,Databases, Genetic ,Animals ,Data Mining ,Humans ,Caenorhabditis elegans ,Genes, Helminth ,Genome, Helminth ,biology ,Computational Biology ,Epistasis, Genetic ,biology.organism_classification ,Search Engine ,Centralized database ,030104 developmental biology ,Gene Ontology ,Phenotype ,Genome Biology ,WormBase ,Transcriptome ,Software - Abstract
WormBase ( www.wormbase.org ) provides the nematode research community with a centralized database for information pertaining to nematode genes and genomes. As more nematode genome sequences are becoming available and as richer data sets are published, WormBase strives to maintain updated information, displays, and services to facilitate efficient access to and understanding of the knowledge generated by the published nematode genetics literature. This chapter aims to provide an explanation of how to use basic features of WormBase, new features, and some commonly used tools and data queries. Explanations of the curated data and step-by-step instructions of how to access the data via the WormBase website and available data mining tools are provided.
- Published
- 2018
16. WormBase 2017: molting into a new stage
- Author
-
Mary Ann Tuli, Todd W. Harris, Ranjana Kishore, Paul Davis, Raymond Lee, Lincoln Stein, Tim Schedl, Kevin L. Howe, Paul W. Sternberg, Adam Wright, Gary Williams, Qinghua Wang, Faye H. Rodgers, Gary Schindelman, Matthew Berriman, Daniela Raciti, Hans-Michael Müller, Christian A. Grove, Sibyl Gao, Juancarlos Chan, Paulo A. S. Nuin, Valerio Arnaboldi, Matthew Russell, Michael Paulini, Cecilia Nakamura, Scott Cain, Kimberly Van Auken, Wen J. Chen, Karen Yook, and Paul J. Kersey
- Subjects
0301 basic medicine ,Nematoda ,Process (engineering) ,Datasets as Topic ,Information Storage and Retrieval ,Biology ,Ontology (information science) ,Web Browser ,Bioinformatics ,Genome ,World Wide Web ,03 medical and health sciences ,User-Computer Interface ,0302 clinical medicine ,Databases, Genetic ,Genetics ,Database Issue ,Animals ,Data Mining ,Humans ,Caenorhabditis elegans ,Data Curation ,Publishing ,Data curation ,biology.organism_classification ,Disease Models, Animal ,030104 developmental biology ,Gene Ontology ,Platyhelminths ,Scalability ,Caenorhabditis ,RNA Interference ,WormBase ,Sequence Alignment ,030217 neurology & neurosurgery ,Forecasting - Abstract
WormBase (http://www.wormbase.org) is an important knowledge resource for biomedical researchers worldwide. To accommodate the ever increasing amount and complexity of research data, WormBase continues to advance its practices on data acquisition, curation and retrieval to most effectively deliver comprehensive knowledge about Caenorhabditis elegans, and genomic information about other nematodes and parasitic flatworms. Recent notable enhancements include user-directed submission of data, such as micropublication; genomic data curation and presentation, including additional genomes and JBrowse, respectively; new query tools, such as SimpleMine, Gene Enrichment Analysis; new data displays, such as the Person Lineage browser and the Summary of Ontology-based Annotations. Anticipating more rapid data growth ahead, WormBase continues the process of migrating to a cutting-edge database technology to achieve better stability, scalability, reproducibility and a faster response time. To better serve the broader research community, WormBase, with five other Model Organism Databases and The Gene Ontology project, have begun to collaborate formally as the Alliance of Genome Resources.
- Published
- 2018
17. WormBase 2016: expanding to enable helminth genomic research
- Author
-
Gary Williams, Daniela Raciti, James Done, Christian A. Grove, Paul J. Kersey, Hans-Michael Müller, Adam Wright, Lincoln Stein, Daniel Wang, Scott Cain, Juancarlos Chan, Tim Schedl, Todd W. Harris, Bruce J. Bolt, Paul Davis, Wen J. Chen, Paulo A. S. Nuin, Xiaodong Wang, Karen Yook, Kevin L. Howe, Jane Lomax, Mary Ann Tuli, Eleanor J Stanley, Paul W. Sternberg, Ranjana Kishore, Raymond Lee, Kimberly Van Auken, Michael Paulini, Thomas A. Down, Cecilia Nakamura, Matthew Berriman, Yuling Li, Gary Schindelman, and Sibyl Gao
- Subjects
0301 basic medicine ,Nematoda ,WormBook ,Genomics ,Computational biology ,Ontology (information science) ,Bioinformatics ,Genome ,03 medical and health sciences ,Databases, Genetic ,Genetics ,Animals ,Database Issue ,Helminths ,Caenorhabditis elegans ,Genes, Helminth ,Genome, Helminth ,biology ,Molecular Sequence Annotation ,biology.organism_classification ,030104 developmental biology ,Platyhelminths ,WormBase ,Software - Abstract
WormBase (www.wormbase.org) is a central repository for research data on the biology, genetics and genomics of Caenorhabditis elegans and other nematodes. The project has evolved from its original remit to collect and integrate all data for a single species, and now extends to numerous nematodes, ranging from evolutionary comparators of C. elegans to parasitic species that threaten plant, animal and human health. Research activity using C. elegans as a model system is as vibrant as ever, and we have created new tools for community curation in response to the ever-increasing volume and complexity of data. To better allow users to navigate their way through these data, we have made a number of improvements to our main website, including new tools for browsing genomic features and ontology annotations. Finally, we have developed a new portal for parasitic worm genomes. WormBase ParaSite (parasite.wormbase.org) contains all publicly available nematode and platyhelminth annotated genome sequences, and is designed specifically to support helminth genomic research.
- Published
- 2015
18. Combining RNA-seq data and homology-based gene prediction for plants, animals and fungi
- Author
-
Sven Twardziok, Frank Hartung, Jan Grau, Jens Keilwagen, and Michael Paulini
- Subjects
Nematoda ,Gene prediction ,Genes, Fungal ,RNA-Seq ,Computational biology ,Biology ,Genes, Plant ,lcsh:Computer applications to medicine. Medical informatics ,Genome Annotation ,Homology-based Gene Prediction ,Rna-seq ,Homology (biology) ,03 medical and health sciences ,0302 clinical medicine ,Animals ,lcsh:QH301-705.5 ,Peptide sequence ,Gene ,030304 developmental biology ,0303 health sciences ,Sequence Homology, Amino Acid ,Sequence Analysis, RNA ,Methodology Article ,Gene Expression Profiling ,Intron ,Hordeum ,Molecular Sequence Annotation ,Genomics ,Genome project ,Introns ,lcsh:Biology (General) ,lcsh:R858-859.7 ,RNA-seq ,Software ,Homology-based gene prediction ,030217 neurology & neurosurgery ,Genome annotation ,Reference genome - Abstract
MotivationGenome annotation is of key importance in many research questions. The identification of protein-coding genes is often based on transcriptome sequencing data, ab-initio or homology-based prediction. Recently, it was demonstrated that intron position conservation improves homology-based gene prediction, and that experimental data improves ab-initio gene prediction.ResultsHere, we present an extension of the gene prediction tool GeMoMa that utilizes amino acid sequence conservation, intron position conservation and optionally RNA-seq data for homology-based gene prediction. We show on published benchmark data for plants, animals and fungi that GeMoMa performs better than the gene prediction programs BRAKER1, MAKER2, and CodingQuarry, and purely RNA-seq-based pipelines for transcript identification. In addition, we demonstrate that using multiple reference organisms may help to further improve the performance of GeMoMa. Finally, we apply GeMoMa to four nematode species and to the recently published barley reference genome indicating that current annotations of protein-coding genes may be refined using GeMoMa predictions.AvailabilityGeMoMa has been published under GNU GPL3 and is freely available at http://www.jstacs.de/index.php/GeMoMa.Contactjens.keilwagen@julius-kuehn.de
- Published
- 2017
- Full Text
- View/download PDF
19. Ensembl Genomes 2013: scaling up access to genome-wide data
- Author
-
Kevin L. Howe, Ken Youens-Clark, Lee J. Falin, Michael Nuhn, Paul J. Kersey, Dan Bolser, Gareth Maslen, Brandon Walts, Daniel M. Staines, Christoph Grabmueller, Doreen Ware, Julia Khobova, Arnaud Kerhornou, Mikkel B. Christensen, Nicholas Langridge, Eugene Kulesha, Xuehong Wei, Marcela K. Monaco, Daniel Lawson, Jay C. Humphrey, Gareth Williams, Helder Pedro, Michael Paulini, Chuang Kee Ong, Derek Wilson, Daniel S.T. Hughes, James E. Allen, Uma Maheswari, Iliana Toneva, Mark D. McDowall, Mary Ann Tuli, Joshua C. Stein, and Paul Davis
- Subjects
0106 biological sciences ,Genomics ,Context (language use) ,Computational biology ,Bacterial genome size ,Biology ,01 natural sciences ,Genome ,03 medical and health sciences ,Ensembl Genomes ,Databases, Genetic ,Genetics ,Animals ,Ensembl ,030304 developmental biology ,Internet ,0303 health sciences ,Molecular Sequence Annotation ,Genome project ,ComputingMethodologies_PATTERNRECOGNITION ,Genome, Fungal ,Edible Grain ,Genome, Bacterial ,Genome, Plant ,Software ,IV. Viruses, bacteria, protozoa and fungi ,010606 plant biology & botany ,Reference genome - Abstract
Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species. The project exploits and extends technologies for genome annotation, analysis and dissemination, developed in the context of the vertebrate-focused Ensembl project, and provides a complementary set of resources for non-vertebrate species through a consistent set of programmatic and interactive interfaces. These provide access to data including reference sequence, gene models, transcriptional data, polymorphisms and comparative analysis. This article provides an update to the previous publications about the resource, with a focus on recent developments. These include the addition of important new genomes (and related data sets) including crop plants, vectors of human disease and eukaryotic pathogens. In addition, the resource has scaled up its representation of bacterial genomes, and now includes the genomes of over 9000 bacteria. Specific extensions to the web and programmatic interfaces have been developed to support users in navigating these large data sets. Looking forward, analytic tools to allow targeted selection of data for visualization and download are likely to become increasingly important in future as the number of available genomes increases within all domains of life, and some of the challenges faced in representing bacterial data are likely to become commonplace for eukaryotes in future.
- Published
- 2013
20. Ensembl Genomes 2016: more genomes, more complexity
- Author
-
Paul J. Kersey, Daniel M. Staines, Jay C. Humphrey, Bert Overduin, Julia Khobova, Doreen Ware, Paul Davis, Gareth Maslen, Emily Perry, Kevin L. Howe, Electra Tapanari, Bruce J. Bolt, Sharon Wei, Michael Nuhn, Joshua C. Stein, Ernesto Lowy, Naveen K. Aranganathan, Irina M. Armean, Brandon Walts, Mikkel B. Christensen, Giulietta Spudich, James E. Allen, Lee J. Falin, Marcela K. Tello-Ruiz, Gareth Williams, Christoph Grabmueller, Denise Carvalho-Silva, Mark D. McDowall, Daniel Lawson, Nicholas Langridge, Chuang Kee Ong, Uma Maheswari, Helder Pedro, Dan Bolser, Arnaud Kerhornou, Michael Paulini, Sanjay Boddu, and Eugene Kulesha
- Subjects
0301 basic medicine ,Genomics ,Context (language use) ,Bacterial genome size ,Computational biology ,Biology ,Genome ,Polyploidy ,03 medical and health sciences ,Ensembl Genomes ,Databases, Genetic ,Genetics ,Ensembl ,Animals ,Database Issue ,Whole genome sequencing ,Eukaryota ,Genetic Variation ,Diploidy ,Invertebrates ,030104 developmental biology ,Genome, Fungal ,Sequence Alignment ,Genome, Bacterial ,Genome, Plant ,Reference genome - Abstract
Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces.
- Published
- 2016
21. WormBase 2014: new views of curated biology
- Author
-
Todd W. Harris, Tamberlyn Bieri, Kimberly Van Auken, J. D. Wong, Tim Schedl, Christian A. Grove, Mary Ann Tuli, Yuling Li, Raymond Lee, Gary Williams, Gary Schindelman, Juancarlos Chan, Kevin L. Howe, John Spieth, Lincoln Stein, Paul J. Kersey, Joachim Baran, Philip Ozersky, Jonathan Hodgkin, Daniel Wang, Daniela Raciti, Hans-Michael Müller, Ranjana Kishore, James Done, Abigail Cabunoc, Paul W. Sternberg, Paul H. Davis, Matthew Berriman, Xiaodong Wang, Wen J. Chen, Karen Yook, Michael Paulini, and Cecilia Nakamura
- Subjects
Genetics ,Genome, Helminth ,Internet ,Nematoda ,biology ,WormBook ,business.industry ,Molecular Sequence Annotation ,V. Human genome, model organisms, comparative genomics ,Coping behavior ,biology.organism_classification ,Manual curation ,World Wide Web ,Databases, Genetic ,Animals ,The Internet ,WormBase ,Caenorhabditis elegans ,business ,Web site - Abstract
WormBase (http://www.wormbase.org/) is a highly curated resource dedicated to supporting research using the model organism Caenorhabditis elegans. With an electronic history predating the World Wide Web, WormBase contains information ranging from the sequence and phenotype of individual alleles to genome-wide studies generated using next-generation sequencing technologies. In recent years, we have expanded the contents to include data on additional nematodes of agricultural and medical significance, bringing the knowledge of C. elegans to bear on these systems and providing support for underserved research communities. Manual curation of the primary literature remains a central focus of the WormBase project, providing users with reliable, up-to-date and highly cross-linked information. In this update, we describe efforts to organize the original atomized and highly contextualized curated data into integrated syntheses of discrete biological topics. Next, we discuss our experiences coping with the vast increase in available genome sequences made possible through next-generation sequencing platforms. Finally, we describe some of the features and tools of the new WormBase Web site that help users better find and explore data of interest.
- Published
- 2014
22. Automatische Modellgenerierung zur Simulation der Kommunikationsumgebung im Kraftfahrzeug
- Author
-
Paul Willutzki, Michael Paulini, Felix Bloos, and Bartono Adiprasito
- Subjects
General Computer Science - Published
- 1999
23. WormBase: Annotating many nematode genomes
- Author
-
Paul W. Sternberg, Paul J. Kersey, Mary Ann Tuli, Michael Paulini, Richard Durbin, Gary Williams, Karen Yook, Paul Davis, and Kevin L. Howe
- Subjects
Resource ,community resource ,WormBook ,nematode ,Computational biology ,Biology ,Genome ,03 medical and health sciences ,Annotation ,0302 clinical medicine ,Caenorhabditis elegans ,genome ,030304 developmental biology ,Sequence (medicine) ,parasitic nematode ,Whole genome sequencing ,0303 health sciences ,biology.organism_classification ,model organism database ,Data science ,sequence curation ,annotation ,WormBase ,Functional genomics ,030217 neurology & neurosurgery - Abstract
WormBase (www.wormbase.org) has been serving the scientific community for over 11 years as the central repository for genomic and genetic information for the soil nematode Caenorhabditis elegans. The resource has evolved from its beginnings as a database housing the genomic sequence and genetic and physical maps of a single species, and now represents the breadth and diversity of nematode research, currently serving genome sequence and annotation for around 20 nematodes. In this article, we focus on WormBase’s role of genome sequence annotation, describing how we annotate and integrate data from a growing collection of nematode species and strains. We also review our approaches to sequence curation, and discuss the impact on annotation quality of large functional genomics projects such as modENCODE.
- Published
- 2013
24. Genome Mapping and Genomics of Caenorhabditis elegans
- Author
-
Michael Paulini, Mary Ann Tuli, and Jonathan Hodgkin
- Subjects
Caenorhabditis ,Comparative genomics ,Genetics ,biology ,Pseudogene ,Genomics ,biology.organism_classification ,Gene ,Genome ,Caenorhabditis elegans ,Synteny - Abstract
The nematode Caenorhabditis elegans is one of the most extensively studied and utilized model organisms, owing to experimental advantages such as its ease of culture and rapid growth, facile genetics, cellular simplicity, and complete transparency throughout life. Its compact 100 Mb genome sequence was the first to be completely determined for any multicellular organism, in 1998. Early linkage mapping by recombinational methods and cytology defined a nuclear genome of five autosomes and one X (sex) chromosome, of roughly equal size. The two natural sexes are both autosomally diploid; hermaphrodites have two X chromosomes (XX) while males have one (XO). All chromosomes are holocentric, but each contains a central region where recombination is reduced and conserved house-keeping genes are more frequent. Centromeres and extended heterochromatic regions are absent. Telomeres are conventional. Annotation of the genome has defined over 20,000 protein-coding genes, with relatively few pseudogenes. About 15 % of these genes are transcribed as multicistronic operons, which are divided up into mRNAs by trans-splicing. The genome also contains many noncoding RNA genes, including a well-defined set of miRNAs. Families of transposons and repeated sequences are present but less abundant than in vertebrates. Postgenomic approaches include extensive resequencing, transcriptomics, microarray analyses and proteomics, together with determination of spatial and temporal gene expression patterns using GFP reporter transgenes, and functional testing by systematic gene deletion and global RNAi knockdown screens. Protein–protein interactions have been explored by large-scale yeast two-hybrid testing. Genome sequences for several other species within the genus Caenorhabditis have been determined; these provide a major resource for comparative genomics, and reveal a high degree of synteny between the different species.
- Published
- 2012
25. Ensembl Genomes: An integrative resource for genome-scale data from non-vertebrate species
- Author
-
Michael Nuhn, Ewan Birney, Paul J. Kersey, Paul S. Derwent, Stephen Keenan, Eugene Kulesha, Mark D. McDowall, Daniel M. Staines, Jay C. Humphrey, Gautier Koscielny, Daniel S.T. Hughes, Daniel Lawson, Nicholas Langridge, Derek Wilson, Iliana Toneva, Uma Maheswari, Helder Pedro, Karyn Megy, Arnaud Kerhornou, Andrew D. Yates, and Michael Paulini
- Subjects
Genomics ,Context (language use) ,Computational biology ,Biology ,Vertebrate and Genome Annotation Project ,Genome ,03 medical and health sciences ,0302 clinical medicine ,Ensembl Genomes ,Databases, Genetic ,Genetics ,Ensembl ,Animals ,030304 developmental biology ,0303 health sciences ,Molecular Sequence Annotation ,Genome project ,Articles ,Invertebrates ,Systems Integration ,Genome, Fungal ,030217 neurology & neurosurgery ,Genome, Bacterial ,Genome, Plant ,Reference genome - Abstract
Ensembl Genomes (http://www.ensemblgenomes.org) is an integrative resource for genome-scale data from non-vertebrate species. The project exploits and extends technology (for genome annotation, analysis and dissemination) developed in the context of the (vertebrate-focused) Ensembl project and provides a complementary set of resources for non-vertebrate species through a consistent set of programmatic and interactive interfaces. These provide access to data including reference sequence, gene models, transcriptional data, polymorphisms and comparative analysis. Since its launch in 2009, Ensembl Genomes has undergone rapid expansion, with the goal of providing coverage of all major experimental organisms, and additionally including taxonomic reference points to provide the evolutionary context in which genes can be understood. Against the backdrop of a continuing increase in genome sequencing activities in all parts of the tree of life, we seek to work, wherever possible, with the communities actively generating and using data, and are participants in a growing range of collaborations involved in the annotation and analysis of genomes.
- Published
- 2012
- Full Text
- View/download PDF
26. WormBase 2012: more genomes, more data, new website
- Author
-
Ranjana Kishore, Matthew Berriman, Snehalata Kadam, Xiaoqi Shi, Daniel Wang, Uma Ganesan, Paul J. Kersey, Bill Nash, Juancarlos Chan, Michael Paulini, Kimberly Van Auken, Cecilia Nakamura, Lincoln Stein, Paul W. Sternberg, Christian A. Grove, Arun Rangarajan, Todd W. Harris, John Spieth, Gary Williams, Erich M. Schwarz, Xiaodong Wang, Daniela Raciti, Richard Durbin, Mary Ann Tuli, Hans-Michael Müller, Philip Ozersky, Adrian Duong, Raymond Lee, Tamberlyn Bieri, Paul H. Davis, Kevin L. Howe, Abigail Cabunoc, Jonathan Hodgkin, Yuling Li, Ruihua Fang, Gary Schindelman, Wen J. Chen, Karen Yook, and Norie De La Cruz
- Subjects
Nematoda ,Scientific literature ,Biology ,World Wide Web ,03 medical and health sciences ,0302 clinical medicine ,Resource (project management) ,Databases, Genetic ,Genetics ,Computer Graphics ,Animals ,Caenorhabditis elegans ,030304 developmental biology ,0303 health sciences ,Genome, Helminth ,Internet ,Application programming interface ,business.industry ,Gene Expression Profiling ,Usability ,Molecular Sequence Annotation ,Genomics ,Articles ,Identification (information) ,Phenotype ,Data extraction ,Caenorhabditis ,The Internet ,WormBase ,business ,030217 neurology & neurosurgery - Abstract
Since its release in 2000, WormBase (http://www .wormbase.org) has grown from a small resource focusing on a single species and serving a dedicated research community, to one now spanning 15 species essential to the broader biomedical and agricultural research fields. To enhance the rate of curation, we have automated the identification of key data in the scientific literature and use similar methodology for data extraction. To ease access to the data, we are collaborating with journals to link entities in research publications to their report pages at WormBase. To facilitate discovery, we have added new views of the data, integrated large-scale datasets and expanded descriptions of models for human disease. Finally, we have introduced a dramatic overhaul of the WormBase website for public beta testing. Designed to balance complexity and usability, the new site is species-agnostic, highly customizable, and interactive. Casual users and developers alike will be able to leverage the public RESTful application programming interface (API) to generate custom data mining solutions and extensions to the site. We report on the growth of our database and on our work in keeping pace with the growing demand for data, efforts to anticipate the requirements of users and new collaborations with the larger science community. © The Author(s) 2011.
- Published
- 2011
27. Ensembl Genomes 2022: an expanding genome resource for non-vertebrates
- Author
-
Andrew D. Yates, Michael Paulini, Anne Parker, Kim E. Hammond-Kosack, Marcela K. Tello-Ruiz, Justin Elser, Jyothish Bhai, Manuel Luypaert, Faye H. Rodgers, Stavros Diamantakis, James Seager, Pankaj Jaiswal, Robert D. Finn, Thomas Maurel, Michal Szpak, Marc Rosello, James E. Allen, Parul Gupta, Matthieu Barba, Lahcen I. Campbell, Marc Chakiachvili, Vivek Kumar, Manuel Carbajo Martinez, Benjamin Moore, Nishadi De Silva, John Tate, Ridwan M Amode, Andrew Olson, Astrid Gall, Magali Ruffier, Paul Davis, Mikkel B. Christensen, Mark Quinton-Tulloch, Alayne Cuzick, Kevin L. Howe, Martin Urban, Justin Preece, Nick Langridge, Sunita Kumari, Kapeel Chougule, Emily Perry, Tuan Le, Gareth Maslen, Sharon Wei, Dionysios Grigoriadis, Carla Valeria Filippi, Andrea Winterbottom, Doreen Ware, Aleena Mushtaq, Vinay Kaikala, Andrey G Azov, Helder Pedro, Luca Da Rin Fioretto, Gary Williams, Magdalena Zarowiecki, Cristina Guijarro-Clarke, Guy Naamati, Sushma Naithani, Bruno Contreras-Moreira, Sarah Dyer, Andrés Becerra, Paul Flicek, Matthieu Muffato, Vasily Sitnik, and Stephen J. Trevanion
- Subjects
Whole genome sequencing ,Internet ,AcademicSubjects/SCI00010 ,Computational Biology ,Genomics ,Genome project ,Gene Annotation ,Computational biology ,Plants ,Biology ,Genome ,Annotation ,Resource (project management) ,ComputingMethodologies_PATTERNRECOGNITION ,Ensembl Genomes ,Databases, Genetic ,Vertebrates ,Genetics ,Database Issue ,Animals ,Ensembl ,Genome, Fungal ,Genome, Bacterial ,Genome, Plant ,Software - Abstract
Ensembl Genomes (https://www.ensemblgenomes.org) provides access to non-vertebrate genomes and analysis complementing vertebrate resources developed by the Ensembl project (https://www.ensembl.org). The two resources collectively present genome annotation through a consistent set of interfaces spanning the tree of life presenting genome sequence, annotation, variation, transcriptomic data and comparative analysis. Here, we present our largest increase in plant, metazoan and fungal genomes since the project's inception creating one of the world's most comprehensive genomic resources and describe our efforts to reduce genome redundancy in our Bacteria portal. We detail our new efforts in gene annotation, our emerging support for pangenome analysis, our efforts to accelerate data dissemination through the Ensembl Rapid Release resource and our new AlphaFold visualization. Finally, we present details of our future plans including updates on our integration with Ensembl, and how we plan to improve our support for the microbial research community. Software and data are made available without restriction via our website, online tools platform and programmatic interfaces (available under an Apache 2.0 license). Data updates are synchronised with Ensembl's release cycle.
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.