Dog10K_Boxer_Tasha_1.0: A Long-Read Assembly of the Dog Reference Genome

Authors :
Xiangquan Zhang
Christophe Hitte
Elaine A. Ostrander
Patrick Masterson
Terence Murphy
Yan-Hu Liu
Jeffrey M. Kidd
S. Emery
Brian W. Davis
Tosso Leeb
Ya-Ping Zhang
Reuben M. Buckley
Guo-Dong Wang
Vidhya Jagannathan
University of Bern
Institut de Génétique et Développement de Rennes (IGDR)
Structure Fédérative de Recherche en Biologie et Santé de Rennes ( Biosit : Biologie - Santé - Innovation Technologique )-Centre National de la Recherche Scientifique (CNRS)-Université de Rennes 1 (UR1)
Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)
University of Michigan [Ann Arbor]
University of Michigan System
National Center for Biotechnology Information (NCBI)
Texas A&M University [College Station]
National Human Genome Research Institute (NHGRI)
Kunming Institute of Zoology
Chinese Academy of Sciences [Beijing] (CAS)
2019YFA0707101, The National Key R&D Program of China
R01GM140135, National Institutes of Health
Université de Rennes (UR)-Centre National de la Recherche Scientifique (CNRS)-Structure Fédérative de Recherche en Biologie et Santé de Rennes ( Biosit : Biologie - Santé - Innovation Technologique )
Kunming Institute of Zoology (KIZ)
Source :
Jagannathan, Vidya; Hitte, Christophe; Kidd, Jeffrey M.; Masterson, Patrick; Murphy, Terence D.; Emery, Sarah; Davis, Brian; Buckley, Reuben M.; Liu, Yan-Hu; Zhang, Xiang-Quan; Leeb, Tosso; Zhang, Ya-Ping; Ostrander, Elaine A.; Wang, Guo-Dong (2021). Dog10K_Boxer_Tasha_1.0: A Long-Read Assembly of the Dog Reference Genome. Genes, 12(6), 847. MDPI, Molecular Diversity Preservation International. doi:10.3390/genes12060847
Publication Year :
2021
Publisher :
MDPI, Molecular Diversity Preservation International, 2021.

Abstract

The domestic dog has evolved to be an important biomedical model for studies regarding the genetic basis of disease, morphology and behavior. Genetic studies in the dog have relied on a draft reference genome of a purebred female boxer dog named “Tasha” initially published in 2005. Derived from a Sanger whole genome shotgun sequencing approach coupled with limited clone-based sequencing, the initial assembly and subsequent updates have served as the predominant resource for canine genetics for 15 years. While the initial assembly produced a good-quality draft, as with all assemblies produced at the time, it contained gaps, assembly errors and missing sequences, particularly in GC-rich regions, which are found at many promoters and in the first exons of protein-coding genes. Here, we present Dog10K_Boxer_Tasha_1.0, an improved chromosome-level highly contiguous genome assembly of Tasha created with long-read technologies that increases sequence contiguity >100-fold, closes >23,000 gaps of the CanFam3.1 reference assembly and improves gene annotation by identifying >1200 new protein-coding transcripts. The assembly and annotation are available at NCBI under the accession GCF_000002285.5.

Details

ISSN :
20734425
Database :
OpenAIRE
Journal :
Genes, Volume 12, Issue 6, Article 847 (2021). MDPI, Molecular Diversity Preservation International. http://dx.doi.org/10.3390/genes12060847
Accession number :
edsair.doi.dedup.....9738f007d0f0f64621c0fcd22b4c822e
Full Text :
https://doi.org/10.48350/156573