Recompleting the Caenorhabditis elegans genome
- Source :
- Lab Anim (NY)
- Publication Year :
- 2019
- Publisher :
- Cold Spring Harbor Laboratory Press, 2019.
Abstract
- Caenorhabditis elegans was the first multicellular eukaryotic genome sequenced to apparent completion. Although this assembly employed a standard C. elegans strain (N2), it used sequence data from several laboratories, with DNA propagated in bacteria and yeast. Thus, the N2 assembly has many differences from any C. elegans available today. To provide a more accurate C. elegans genome, we performed long-read assembly of VC2010, a modern strain derived from N2. Our VC2010 assembly has 99.98% identity to N2 but with an additional 1.8 Mb including tandem repeat expansions and genome duplications. For 116 structural discrepancies between N2 and VC2010, 97 structures matching VC2010 (84%) were also found in two outgroup strains, implying deficiencies in N2. Over 98% of N2 genes encoded unchanged products in VC2010; moreover, we predicted ≥53 new genes in VC2010. The recompleted genome of C. elegans should be a valuable resource for genetics, genomics, and systems biology.
- Subjects :
- Resource
Systems biology
Genomics
Computational biology
Genome
Article
03 medical and health sciences
0302 clinical medicine
Tandem repeat
Genetics
Animals
Caenorhabditis elegans
Caenorhabditis elegans Proteins
Gene
Genetics (clinical)
030304 developmental biology
0303 health sciences
Genome, Helminth
biology
High-Throughput Nucleotide Sequencing
Reproducibility of Results
Molecular Sequence Annotation
Multicellular organism
chemistry
030217 neurology & neurosurgery
DNA
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- Lab Anim (NY)
- Accession number :
- edsair.doi.dedup.....820168284f28ef33d53d4e7bd041b8ad