586 results on '"Korlach, Jonas"'
Search Results
2. Semi-automated assembly of high-quality diploid human reference genomes
- Author
-
Jarvis, Erich D, Formenti, Giulio, Rhie, Arang, Guarracino, Andrea, Yang, Chentao, Wood, Jonathan, Tracey, Alan, Thibaud-Nissen, Francoise, Vollger, Mitchell R, Porubsky, David, Cheng, Haoyu, Asri, Mobin, Logsdon, Glennis A, Carnevali, Paolo, Chaisson, Mark JP, Chin, Chen-Shan, Cody, Sarah, Collins, Joanna, Ebert, Peter, Escalona, Merly, Fedrigo, Olivier, Fulton, Robert S, Fulton, Lucinda L, Garg, Shilpa, Gerton, Jennifer L, Ghurye, Jay, Granat, Anastasiya, Green, Richard E, Harvey, William, Hasenfeld, Patrick, Hastie, Alex, Haukness, Marina, Jaeger, Erich B, Jain, Miten, Kirsche, Melanie, Kolmogorov, Mikhail, Korbel, Jan O, Koren, Sergey, Korlach, Jonas, Lee, Joyce, Li, Daofeng, Lindsay, Tina, Lucas, Julian, Luo, Feng, Marschall, Tobias, Mitchell, Matthew W, McDaniel, Jennifer, Nie, Fan, Olsen, Hugh E, Olson, Nathan D, Pesout, Trevor, Potapova, Tamara, Puiu, Daniela, Regier, Allison, Ruan, Jue, Salzberg, Steven L, Sanders, Ashley D, Schatz, Michael C, Schmitt, Anthony, Schneider, Valerie A, Selvaraj, Siddarth, Shafin, Kishwar, Shumate, Alaina, Stitziel, Nathan O, Stober, Catherine, Torrance, James, Wagner, Justin, Wang, Jianxin, Wenger, Aaron, Xiao, Chuanle, Zimin, Aleksey V, Zhang, Guojie, Wang, Ting, Li, Heng, Garrison, Erik, Haussler, David, Hall, Ira, Zook, Justin M, Eichler, Evan E, Phillippy, Adam M, Paten, Benedict, Howe, Kerstin, and Miga, Karen H
- Subjects
Genetics ,Human Genome ,Biotechnology ,Generic health relevance ,Humans ,Chromosome Mapping ,Diploidy ,Genome ,Human ,Haplotypes ,High-Throughput Nucleotide Sequencing ,Sequence Analysis ,DNA ,Reference Standards ,Genomics ,Chromosomes ,Human ,Genetic Variation ,Human Pangenome Reference Consortium ,General Science & Technology - Abstract
The current human reference genome, GRCh38, represents over 20 years of effort to generate a high-quality assembly, which has benefitted society1,2. However, it still has many gaps and errors, and does not represent a biological genome as it is a blend of multiple individuals3,4. Recently, a high-quality telomere-to-telomere reference, CHM13, was generated with the latest long-read technologies, but it was derived from a hydatidiform mole cell line with a nearly homozygous genome5. To address these limitations, the Human Pangenome Reference Consortium formed with the goal of creating high-quality, cost-effective, diploid genome assemblies for a pangenome reference that represents human genetic diversity6. Here, in our first scientific report, we determined which combination of current genome sequencing and assembly approaches yield the most complete and accurate diploid genome assembly with minimal manual curation. Approaches that used highly accurate long reads and parent-child data with graph-based haplotype phasing during assembly outperformed those that did not. Developing a combination of the top-performing methods, we generated our first high-quality diploid reference assembly, containing only approximately four gaps per chromosome on average, with most chromosomes within ±1% of the length of CHM13. Nearly 48% of protein-coding genes have non-synonymous amino acid changes between haplotypes, and centromeric regions showed the highest diversity. Our findings serve as a foundation for assembling near-complete diploid human genomes at scale for a pangenome reference to capture global genetic variation from single nucleotides to structural rearrangements.
- Published
- 2022
3. Evolution of the ancestral mammalian karyotype and syntenic regions
- Author
-
Damas, Joana, Corbo, Marco, Kim, Jaebum, Turner-Maier, Jason, Farré, Marta, Larkin, Denis M, Ryder, Oliver A, Steiner, Cynthia, Houck, Marlys L, Hall, Shaune, Shiue, Lily, Thomas, Stephen, Swale, Thomas, Daly, Mark, Korlach, Jonas, Uliano-Silva, Marcela, Mazzoni, Camila J, Birren, Bruce W, Genereux, Diane P, Johnson, Jeremy, Lindblad-Toh, Kerstin, Karlsson, Elinor K, Nweeia, Martin T, Johnson, Rebecca N, Lewin, Harris A, Andrews, Gregory, Armstrong, Joel C, Bianchi, Matteo, Bredemeyer, Kevin R, Breit, Ana M, Christmas, Matthew J, Clawson, Hiram, Di Palma, Federica, Diekhans, Mark, Dong, Michael X, Eizirik, Eduardo, Fan, Kaili, Fanter, Cornelia, Foley, Nicole M, Forsberg-Nilsson, Karin, Garcia, Carlos J, Gatesy, John, Gazal, Steven, Goodman, Linda, Grimshaw, Jenna, Halsey, Michaela K, Harris, Andrew J, Hickey, Glenn, Hiller, Michael, Hindle, Allyson G, Hubley, Robert M, Hughes, Graham M, Juan, David, Kaplow, Irene M, Keough, Kathleen C, Kirilenko, Bogdan, Koepfli, Klaus-Peter, Korstian, Jennifer M, Kowalczyk, Amanda, Kozyrev, Sergey V, Lawler, Alyssa J, Lawless, Colleen, Lehmann, Thomas, Levesque, Danielle L, Li, Xue, Lind, Abigail, Mackay-Smith, Ava, Marinescu, Voichita D, Marques-Bonet, Tomas, Mason, Victor C, Meadows, Jennifer RS, Meyer, Wynn K, Moore, Jill E, Moreira, Lucas R, Moreno-Santillan, Diana D, Morrill, Kathleen M, Muntané, Gerard, Murphy, William J, Navarro, Arcadi, Nweeia, Martin, Ortmann, Sylvia, Osmanski, Austin, Paten, Benedict, Paulat, Nicole S, Pfenning, Andreas R, Phan, BaDoi N, Pollard, Katherine S, Pratt, Henry E, Ray, David A, Reilly, Steven K, Rosen, Jeb R, Ruf, Irina, and Ryan, Louise
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Evolutionary Biology ,Genetics ,Human Genome ,Generic health relevance ,Animals ,Cattle ,Chromosomes ,Mammalian ,Eutheria ,Evolution ,Molecular ,Humans ,Karyotype ,Mammals ,Phylogeny ,Sloths ,Synteny ,chromosome evolution ,mammals ,synteny conservation ,ancestral genome reconstruction ,topologically associating domains ,Zoonomia Consortium - Abstract
Decrypting the rearrangements that drive mammalian chromosome evolution is critical to understanding the molecular bases of speciation, adaptation, and disease susceptibility. Using 8 scaffolded and 26 chromosome-scale genome assemblies representing 23/26 mammal orders, we computationally reconstructed ancestral karyotypes and syntenic relationships at 16 nodes along the mammalian phylogeny. Three different reference genomes (human, sloth, and cattle) representing phylogenetically distinct mammalian superorders were used to assess reference bias in the reconstructed ancestral karyotypes and to expand the number of clades with reconstructed genomes. The mammalian ancestor likely had 19 pairs of autosomes, with nine of the smallest chromosomes shared with the common ancestor of all amniotes (three still conserved in extant mammals), demonstrating a striking conservation of synteny for ∼320 My of vertebrate evolution. The numbers and types of chromosome rearrangements were classified for transitions between the ancestral mammalian karyotype, descendent ancestors, and extant species. For example, 94 inversions, 16 fissions, and 14 fusions that occurred over 53 My differentiated the therian from the descendent eutherian ancestor. The highest breakpoint rate was observed between the mammalian and therian ancestors (3.9 breakpoints/My). Reconstructed mammalian ancestor chromosomes were found to have distinct evolutionary histories reflected in their rates and types of rearrangements. The distributions of genes, repetitive elements, topologically associating domains, and actively transcribed regions in multispecies homologous synteny blocks and evolutionary breakpoint regions indicate that purifying selection acted over millions of years of vertebrate evolution to maintain syntenic relationships of developmentally important genes and regulatory landscapes of gene-dense chromosomes.
- Published
- 2022
4. The complete sequence of a human genome
- Author
-
Nurk, Sergey, Koren, Sergey, Rhie, Arang, Rautiainen, Mikko, Bzikadze, Andrey V, Mikheenko, Alla, Vollger, Mitchell R, Altemose, Nicolas, Uralsky, Lev, Gershman, Ariel, Aganezov, Sergey, Hoyt, Savannah J, Diekhans, Mark, Logsdon, Glennis A, Alonge, Michael, Antonarakis, Stylianos E, Borchers, Matthew, Bouffard, Gerard G, Brooks, Shelise Y, Caldas, Gina V, Chen, Nae-Chyun, Cheng, Haoyu, Chin, Chen-Shan, Chow, William, de Lima, Leonardo G, Dishuck, Philip C, Durbin, Richard, Dvorkina, Tatiana, Fiddes, Ian T, Formenti, Giulio, Fulton, Robert S, Fungtammasan, Arkarachai, Garrison, Erik, Grady, Patrick GS, Graves-Lindsay, Tina A, Hall, Ira M, Hansen, Nancy F, Hartley, Gabrielle A, Haukness, Marina, Howe, Kerstin, Hunkapiller, Michael W, Jain, Chirag, Jain, Miten, Jarvis, Erich D, Kerpedjiev, Peter, Kirsche, Melanie, Kolmogorov, Mikhail, Korlach, Jonas, Kremitzki, Milinn, Li, Heng, Maduro, Valerie V, Marschall, Tobias, McCartney, Ann M, McDaniel, Jennifer, Miller, Danny E, Mullikin, James C, Myers, Eugene W, Olson, Nathan D, Paten, Benedict, Peluso, Paul, Pevzner, Pavel A, Porubsky, David, Potapova, Tamara, Rogaev, Evgeny I, Rosenfeld, Jeffrey A, Salzberg, Steven L, Schneider, Valerie A, Sedlazeck, Fritz J, Shafin, Kishwar, Shew, Colin J, Shumate, Alaina, Sims, Ying, Smit, Arian FA, Soto, Daniela C, Sović, Ivan, Storer, Jessica M, Streets, Aaron, Sullivan, Beth A, Thibaud-Nissen, Françoise, Torrance, James, Wagner, Justin, Walenz, Brian P, Wenger, Aaron, Wood, Jonathan MD, Xiao, Chunlin, Yan, Stephanie M, Young, Alice C, Zarate, Samantha, Surti, Urvashi, McCoy, Rajiv C, Dennis, Megan Y, Alexandrov, Ivan A, Gerton, Jennifer L, O’Neill, Rachel J, Timp, Winston, Zook, Justin M, Schatz, Michael C, Eichler, Evan E, Miga, Karen H, and Phillippy, Adam M
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Human Genome ,1.1 Normal biological development and functioning ,Underpinning research ,Generic health relevance ,Cell Line ,Chromosomes ,Artificial ,Bacterial ,Chromosomes ,Human ,Genome ,Human ,Human Genome Project ,Humans ,Reference Values ,Sequence Analysis ,DNA ,General Science & Technology - Abstract
Since its initial release in 2000, the human reference genome has covered only the euchromatic fraction of the genome, leaving important heterochromatic regions unfinished. Addressing the remaining 8% of the genome, the Telomere-to-Telomere (T2T) Consortium presents a complete 3.055 billion-base pair sequence of a human genome, T2T-CHM13, that includes gapless assemblies for all chromosomes except Y, corrects errors in the prior references, and introduces nearly 200 million base pairs of sequence containing 1956 gene predictions, 99 of which are predicted to be protein coding. The completed regions include all centromeric satellite arrays, recent segmental duplications, and the short arms of all five acrocentric chromosomes, unlocking these complex regions of the genome to variational and functional studies.
- Published
- 2022
5. Population genomics of the critically endangered kākāpō
- Author
-
Dussex, Nicolas, van der Valk, Tom, Morales, Hernán E, Wheat, Christopher W, Díez-del-Molino, David, von Seth, Johanna, Foster, Yasmin, Kutschera, Verena E, Guschanski, Katerina, Rhie, Arang, Phillippy, Adam M, Korlach, Jonas, Howe, Kerstin, Chow, William, Pelan, Sarah, Damas, Joanna D Mendes, Lewin, Harris A, Hastie, Alex R, Formenti, Giulio, Fedrigo, Olivier, Guhlin, Joseph, Harrop, Thomas WR, Le Lec, Marissa F, Dearden, Peter K, Haggerty, Leanne, Martin, Fergal J, Kodali, Vamsi, Thibaud-Nissen, Françoise, Iorns, David, Knapp, Michael, Gemmell, Neil J, Robertson, Fiona, Moorhouse, Ron, Digby, Andrew, Eason, Daryl, Vercoe, Deidre, Howard, Jason, Jarvis, Erich D, Robertson, Bruce C, and Dalén, Love
- Subjects
Human Genome ,Genetics ,Life on Land ,bottleneck ,conservation ,inbreeding ,kākāpō ,mutational load ,purging - Abstract
The kākāpō is a flightless parrot endemic to New Zealand. Once common in the archipelago, only 201 individuals remain today, most of them descending from an isolated island population. We report the first genome-wide analyses of the species, including a high-quality genome assembly for kākāpō, one of the first chromosome-level reference genomes sequenced by the Vertebrate Genomes Project (VGP). We also sequenced and analyzed 35 modern genomes from the sole surviving island population and 14 genomes from the extinct mainland population. While theory suggests that such a small population is likely to have accumulated deleterious mutations through genetic drift, our analyses on the impact of the long-term small population size in kākāpō indicate that present-day island kākāpō have a reduced number of harmful mutations compared to mainland individuals. We hypothesize that this reduced mutational load is due to the island population having been subjected to a combination of genetic drift and purging of deleterious mutations, through increased inbreeding and purifying selection, since its isolation from the mainland ∼10,000 years ago. Our results provide evidence that small populations can survive even when isolated for hundreds of generations. This work provides key insights into kākāpō breeding and recovery and more generally into the application of genetic tools in conservation efforts for endangered species.
- Published
- 2021
6. Platypus and echidna genomes reveal mammalian biology and evolution
- Author
-
Zhou, Yang, Shearwin-Whyatt, Linda, Li, Jing, Song, Zhenzhen, Hayakawa, Takashi, Stevens, David, Fenelon, Jane C, Peel, Emma, Cheng, Yuanyuan, Pajpach, Filip, Bradley, Natasha, Suzuki, Hikoyu, Nikaido, Masato, Damas, Joana, Daish, Tasman, Perry, Tahlia, Zhu, Zexian, Geng, Yuncong, Rhie, Arang, Sims, Ying, Wood, Jonathan, Haase, Bettina, Mountcastle, Jacquelyn, Fedrigo, Olivier, Li, Qiye, Yang, Huanming, Wang, Jian, Johnston, Stephen D, Phillippy, Adam M, Howe, Kerstin, Jarvis, Erich D, Ryder, Oliver A, Kaessmann, Henrik, Donnelly, Peter, Korlach, Jonas, Lewin, Harris A, Graves, Jennifer, Belov, Katherine, Renfree, Marilyn B, Grutzner, Frank, Zhou, Qi, and Zhang, Guojie
- Subjects
Genetics ,Human Genome ,Biotechnology ,1.1 Normal biological development and functioning ,Underpinning research ,Generic health relevance ,Animals ,Biological Evolution ,Female ,Genome ,Male ,Mammals ,Phylogeny ,Platypus ,Sex Chromosomes ,Tachyglossidae ,Base Pairing ,Base Sequence ,Cattle ,Chromosome Mapping ,Chromosomes ,Mammalian ,DNA ,Evolution ,Molecular ,Molecular Sequence Data ,Mutation ,Recombination ,Genetic ,X Chromosome ,Y Chromosome ,General Science & Technology - Abstract
Egg-laying mammals (monotremes) are the only extant mammalian outgroup to therians (marsupial and eutherian animals) and provide key insights into mammalian evolution1,2. Here we generate and analyse reference genomes of the platypus (Ornithorhynchus anatinus) and echidna (Tachyglossus aculeatus), which represent the only two extant monotreme lineages. The nearly complete platypus genome assembly has anchored almost the entire genome onto chromosomes, markedly improving the genome continuity and gene annotation. Together with our echidna sequence, the genomes of the two species allow us to detect the ancestral and lineage-specific genomic changes that shape both monotreme and mammalian evolution. We provide evidence that the monotreme sex chromosome complex originated from an ancestral chromosome ring configuration. The formation of such a unique chromosome complex may have been facilitated by the unusually extensive interactions between the multi-X and multi-Y chromosomes that are shared by the autosomal homologues in humans. Further comparative genomic analyses unravel marked differences between monotremes and therians in haptoglobin genes, lactation genes and chemosensory receptor genes for smell and taste that underlie the ecological adaptation of monotremes.
- Published
- 2021
7. Multi-platform discovery of haplotype-resolved structural variation in human genomes.
- Author
-
Chaisson, Mark JP, Sanders, Ashley D, Zhao, Xuefang, Malhotra, Ankit, Porubsky, David, Rausch, Tobias, Gardner, Eugene J, Rodriguez, Oscar L, Guo, Li, Collins, Ryan L, Fan, Xian, Wen, Jia, Handsaker, Robert E, Fairley, Susan, Kronenberg, Zev N, Kong, Xiangmeng, Hormozdiari, Fereydoun, Lee, Dillon, Wenger, Aaron M, Hastie, Alex R, Antaki, Danny, Anantharaman, Thomas, Audano, Peter A, Brand, Harrison, Cantsilieris, Stuart, Cao, Han, Cerveira, Eliza, Chen, Chong, Chen, Xintong, Chin, Chen-Shan, Chong, Zechen, Chuang, Nelson T, Lambert, Christine C, Church, Deanna M, Clarke, Laura, Farrell, Andrew, Flores, Joey, Galeev, Timur, Gorkin, David U, Gujral, Madhusudan, Guryev, Victor, Heaton, William Haynes, Korlach, Jonas, Kumar, Sushant, Kwon, Jee Young, Lam, Ernest T, Lee, Jong Eun, Lee, Joyce, Lee, Wan-Ping, Lee, Sau Peng, Li, Shantao, Marks, Patrick, Viaud-Martinez, Karine, Meiers, Sascha, Munson, Katherine M, Navarro, Fabio CP, Nelson, Bradley J, Nodzak, Conor, Noor, Amina, Kyriazopoulou-Panagiotopoulou, Sofia, Pang, Andy WC, Qiu, Yunjiang, Rosanio, Gabriel, Ryan, Mallory, Stütz, Adrian, Spierings, Diana CJ, Ward, Alistair, Welch, AnneMarie E, Xiao, Ming, Xu, Wei, Zhang, Chengsheng, Zhu, Qihui, Zheng-Bradley, Xiangqun, Lowy, Ernesto, Yakneen, Sergei, McCarroll, Steven, Jun, Goo, Ding, Li, Koh, Chong Lek, Ren, Bing, Flicek, Paul, Chen, Ken, Gerstein, Mark B, Kwok, Pui-Yan, Lansdorp, Peter M, Marth, Gabor T, Sebat, Jonathan, Shi, Xinghua, Bashir, Ali, Ye, Kai, Devine, Scott E, Talkowski, Michael E, Mills, Ryan E, Marschall, Tobias, Korbel, Jan O, Eichler, Evan E, and Lee, Charles
- Subjects
Humans ,Chromosome Mapping ,Genomics ,Haplotypes ,Genome ,Human ,Algorithms ,Databases ,Genetic ,INDEL Mutation ,Genomic Structural Variation ,High-Throughput Nucleotide Sequencing ,Whole Genome Sequencing ,Genome ,Human ,Databases ,Genetic - Abstract
The incomplete identification of structural variants (SVs) from whole-genome sequencing data limits studies of human genetic diversity and disease association. Here, we apply a suite of long-read, short-read, strand-specific sequencing technologies, optical mapping, and variant discovery algorithms to comprehensively analyze three trios to define the full spectrum of human genetic variation in a haplotype-resolved manner. We identify 818,054 indel variants (
- Published
- 2019
8. Africa: sequence 100,000 species to safeguard biodiversity
- Author
-
Ebenezer, ThankGod Echezona, Muigai, Anne W. T., Nouala, Simplice, Badaoui, Bouabid, Blaxter, Mark, Buddie, Alan G., Jarvis, Erich D., Korlach, Jonas, Kuja, Josiah O., Lewin, Harris A., Majewska, Roksana, Mapholi, Ntanganedzeni, Maslamoney, Suresh, Mbo’o-Tchouawou, Michèle, Osuji, Julian O., Seehausen, Ole, Shorinola, Oluwaseyi, Tiambo, Christian Keambou, Mulder, Nicola, Ziyomo, Cathrine, and Djikeng, Appolinaire
- Published
- 2022
- Full Text
- View/download PDF
9. Improved reference genome of Aedes aegypti informs arbovirus vector control.
- Author
-
Matthews, Benjamin J, Dudchenko, Olga, Kingan, Sarah B, Koren, Sergey, Antoshechkin, Igor, Crawford, Jacob E, Glassford, William J, Herre, Margaret, Redmond, Seth N, Rose, Noah H, Weedall, Gareth D, Wu, Yang, Batra, Sanjit S, Brito-Sierra, Carlos A, Buckingham, Steven D, Campbell, Corey L, Chan, Saki, Cox, Eric, Evans, Benjamin R, Fansiri, Thanyalak, Filipović, Igor, Fontaine, Albin, Gloria-Soria, Andrea, Hall, Richard, Joardar, Vinita S, Jones, Andrew K, Kay, Raissa GG, Kodali, Vamsi K, Lee, Joyce, Lycett, Gareth J, Mitchell, Sara N, Muehling, Jill, Murphy, Michael R, Omer, Arina D, Partridge, Frederick A, Peluso, Paul, Aiden, Aviva Presser, Ramasamy, Vidya, Rašić, Gordana, Roy, Sourav, Saavedra-Rodriguez, Karla, Sharan, Shruti, Sharma, Atashi, Smith, Melissa Laird, Turner, Joe, Weakley, Allison M, Zhao, Zhilei, Akbari, Omar S, Black, William C, Cao, Han, Darby, Alistair C, Hill, Catherine A, Johnston, J Spencer, Murphy, Terence D, Raikhel, Alexander S, Sattelle, David B, Sharakhov, Igor V, White, Bradley J, Zhao, Li, Aiden, Erez Lieberman, Mann, Richard S, Lambrechts, Louis, Powell, Jeffrey R, Sharakhova, Maria V, Tu, Zhijian, Robertson, Hugh M, McBride, Carolyn S, Hastie, Alex R, Korlach, Jonas, Neafsey, Daniel E, Phillippy, Adam M, and Vosshall, Leslie B
- Subjects
Animals ,Aedes ,Arboviruses ,Dengue Virus ,Arbovirus Infections ,Pyrethrins ,Glutathione Transferase ,Genetics ,Population ,Genomics ,Insect Control ,Insecticide Resistance ,Multigene Family ,Reference Standards ,Female ,Male ,Genome ,Insect ,Genetic Variation ,DNA Copy Number Variations ,Sex Determination Processes ,Molecular Sequence Annotation ,Mosquito Vectors ,Genetics ,Population ,Genome ,Insect ,General Science & Technology - Abstract
Female Aedes aegypti mosquitoes infect more than 400 million people each year with dangerous viral pathogens including dengue, yellow fever, Zika and chikungunya. Progress in understanding the biology of mosquitoes and developing the tools to fight them has been slowed by the lack of a high-quality genome assembly. Here we combine diverse technologies to produce the markedly improved, fully re-annotated AaegL5 genome assembly, and demonstrate how it accelerates mosquito science. We anchored physical and cytogenetic maps, doubled the number of known chemosensory ionotropic receptors that guide mosquitoes to human hosts and egg-laying sites, provided further insight into the size and composition of the sex-determining M locus, and revealed copy-number variation among glutathione S-transferase genes that are important for insecticide resistance. Using high-resolution quantitative trait locus and population genomic analyses, we mapped new candidates for dengue vector competence and insecticide resistance. AaegL5 will catalyse new biological insights and intervention strategies to fight this deadly disease vector.
- Published
- 2018
10. Automation of PacBio SMRTbell NGS library preparation for bacterial genome sequencing
- Author
-
Kong, Nguyet, Ng, Whitney, Thao, Kao, Agulto, Regina, Weis, Allison, Kim, Kristi Spittle, Korlach, Jonas, Hickey, Luke, Kelly, Lenore, Lappin, Stephen, and Weimer, Bart C
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Human Genome ,Biotechnology ,Generic health relevance ,PacBio SMRTbell NGS library preparation ,Bacterial genomic DNA ,Automation ,NGS workstation ,TapeStation System ,Bioanalyzer ,Biochemistry and Cell Biology - Abstract
BackgroundThe PacBio RS II provides for single molecule, real-time DNA technology to sequence genomes and detect DNA modifications. The starting point for high-quality sequence production is high molecular weight genomic DNA. To automate the library preparation process, there must be high-throughput methods in place to assess the genomic DNA, to ensure the size and amounts of the sheared DNA fragments and final library.FindingsThe library construction automation was accomplished using the Agilent NGS workstation with Bravo accessories for heating, shaking, cooling, and magnetic bead manipulations for template purification. The quality control methods from gDNA input to final library using the Agilent Bioanalyzer System and Agilent TapeStation System were evaluated.ConclusionsAutomated protocols of PacBio 10 kb library preparation produced libraries with similar technical performance to those generated manually. The TapeStation System proved to be a reliable method that could be used in a 96-well plate format to QC the DNA equivalent to the standard Bioanalyzer System results. The DNA Integrity Number that is calculated in the TapeStation System software upon analysis of genomic DNA is quite helpful to assure that the starting genomic DNA is not degraded. In this respect, the gDNA assay on the TapeStation System is preferable to the DNA 12000 assay on the Bioanalyzer System, which cannot run genomic DNA, nor can the Bioanalyzer work directly from the 96-well plates.
- Published
- 2017
11. Comparative Genomics Reveals the Diversity of Restriction-Modification Systems and DNA Methylation Sites in Listeria monocytogenes
- Author
-
Chen, Poyin, Bakker, Henk C den, Korlach, Jonas, Kong, Nguyet, Storey, Dylan B, Paxinos, Ellen E, Ashby, Meredith, Clark, Tyson, Luong, Khai, Wiedmann, Martin, and Weimer, Bart C
- Subjects
Biological Sciences ,Biomedical and Clinical Sciences ,Genetics ,Microbiology ,Medical Microbiology ,Digestive Diseases ,Prevention ,Foodborne Illness ,Vaccine Related ,Biodefense ,Human Genome ,Infectious Diseases ,Emerging Infectious Diseases ,2.1 Biological and endogenous factors ,Aetiology ,2.2 Factors relating to the physical environment ,Infection ,DNA Methylation ,DNA Restriction-Modification Enzymes ,Genome ,Bacterial ,Genomics ,Listeria monocytogenes ,Sequence Alignment ,Synteny ,L. monocytogenes ,100K Pathogen Genome Project ,SMRT sequencing ,methylation ,inversion ,infection ,bacterial epigenetics ,genetic epidemiology ,DNA methylation ,Listeria ,genome analysis ,virulence regulation ,Medical microbiology - Abstract
Listeria monocytogenes is a bacterial pathogen that is found in a wide variety of anthropogenic and natural environments. Genome sequencing technologies are rapidly becoming a powerful tool in facilitating our understanding of how genotype, classification phenotypes, and virulence phenotypes interact to predict the health risks of individual bacterial isolates. Currently, 57 closed L. monocytogenes genomes are publicly available, representing three of the four phylogenetic lineages, and they suggest that L. monocytogenes has high genomic synteny. This study contributes an additional 15 closed L. monocytogenes genomes that were used to determine the associations between the genome and methylome with host invasion magnitude. In contrast to previous findings, large chromosomal inversions and rearrangements were detected in five isolates at the chromosome terminus and within rRNA genes, including a previously undescribed inversion within rRNA-encoding regions. Each isolate's epigenome contained highly diverse methyltransferase recognition sites, even within the same serotype and methylation pattern. Eleven strains contained a single chromosomally encoded methyltransferase, one strain contained two methylation systems (one system on a plasmid), and three strains exhibited no methylation, despite the occurrence of methyltransferase genes. In three isolates a new, unknown DNA modification was observed in addition to diverse methylation patterns, accompanied by a novel methylation system. Neither chromosome rearrangement nor strain-specific patterns of epigenome modification observed within virulence genes were correlated with serotype designation, clonal complex, or in vitro infectivity. These data suggest that genome diversity is larger than previously considered in L. monocytogenes and that as more genomes are sequenced, additional structure and methylation novelty will be observed in this organism.ImportanceListeria monocytogenes is the causative agent of listeriosis, a disease which manifests as gastroenteritis, meningoencephalitis, and abortion. Among Salmonella, Escherichia coli, Campylobacter, and Listeria-causing the most prevalent foodborne illnesses-infection by L. monocytogenes carries the highest mortality rate. The ability of L. monocytogenes to regulate its response to various harsh environments enables its persistence and transmission. Small-scale comparisons of L. monocytogenes focusing solely on genome contents reveal a highly syntenic genome yet fail to address the observed diversity in phenotypic regulation. This study provides a large-scale comparison of 302 L. monocytogenes isolates, revealing the importance of the epigenome and restriction-modification systems as major determinants of L. monocytogenes phylogenetic grouping and subsequent phenotypic expression. Further examination of virulence genes of select outbreak strains reveals an unprecedented diversity in methylation statuses despite high degrees of genome conservation.
- Published
- 2017
12. Towards complete and error-free genome assemblies of all vertebrate species
- Author
-
Rhie, Arang, McCarthy, Shane A., Fedrigo, Olivier, Damas, Joana, Formenti, Giulio, Koren, Sergey, Uliano-Silva, Marcela, Chow, William, Fungtammasan, Arkarachai, Kim, Juwan, Lee, Chul, Ko, Byung June, Chaisson, Mark, Gedman, Gregory L., Cantin, Lindsey J., Thibaud-Nissen, Francoise, Haggerty, Leanne, Bista, Iliana, Smith, Michelle, Haase, Bettina, Mountcastle, Jacquelyn, Winkler, Sylke, Paez, Sadye, Howard, Jason, Vernes, Sonja C., Lama, Tanya M., Grutzner, Frank, Warren, Wesley C., Balakrishnan, Christopher N., Burt, Dave, George, Julia M., Biegler, Matthew T., Iorns, David, Digby, Andrew, Eason, Daryl, Robertson, Bruce, Edwards, Taylor, Wilkinson, Mark, Turner, George, Meyer, Axel, Kautt, Andreas F., Franchini, Paolo, Detrich, III, H. William, Svardal, Hannes, Wagner, Maximilian, Naylor, Gavin J. P., Pippel, Martin, Malinsky, Milan, Mooney, Mark, Simbirsky, Maria, Hannigan, Brett T., Pesout, Trevor, Houck, Marlys, Misuraca, Ann, Kingan, Sarah B., Hall, Richard, Kronenberg, Zev, Sović, Ivan, Dunn, Christopher, Ning, Zemin, Hastie, Alex, Lee, Joyce, Selvaraj, Siddarth, Green, Richard E., Putnam, Nicholas H., Gut, Ivo, Ghurye, Jay, Garrison, Erik, Sims, Ying, Collins, Joanna, Pelan, Sarah, Torrance, James, Tracey, Alan, Wood, Jonathan, Dagnew, Robel E., Guan, Dengfeng, London, Sarah E., Clayton, David F., Mello, Claudio V., Friedrich, Samantha R., Lovell, Peter V., Osipova, Ekaterina, Al-Ajli, Farooq O., Secomandi, Simona, Kim, Heebal, Theofanopoulou, Constantina, Hiller, Michael, Zhou, Yang, Harris, Robert S., Makova, Kateryna D., Medvedev, Paul, Hoffman, Jinna, Masterson, Patrick, Clark, Karen, Martin, Fergal, Howe, Kevin, Flicek, Paul, Walenz, Brian P., Kwak, Woori, Clawson, Hiram, Diekhans, Mark, Nassar, Luis, Paten, Benedict, Kraus, Robert H. S., Crawford, Andrew J., Gilbert, M. Thomas P., Zhang, Guojie, Venkatesh, Byrappa, Murphy, Robert W., Koepfli, Klaus-Peter, Shapiro, Beth, Johnson, Warren E., Di Palma, Federica, Marques-Bonet, Tomas, Teeling, Emma C., Warnow, Tandy, Graves, Jennifer Marshall, Ryder, Oliver A., Haussler, David, O’Brien, Stephen J., Korlach, Jonas, Lewin, Harris A., Howe, Kerstin, Myers, Eugene W., Durbin, Richard, Phillippy, Adam M., and Jarvis, Erich D.
- Published
- 2021
- Full Text
- View/download PDF
13. Monoallelically-expressed Noncoding RNAs form nucleolar territories on NOR-containing chromosomes and regulate rRNA expression
- Author
-
Hao, Qinyu, primary, Liu, Minxue, additional, Daulatabad, Swapna Vidhur, additional, Gaffari, Saba, additional, Song, You Jin, additional, Srivastava, Rajneesh, additional, Bhaskar, Shivang, additional, Moitra, Anurupa, additional, Mangan, Hazel, additional, Tseng, Elizabeth, additional, Gilmore, Rachel B, additional, Frier, Susan M, additional, Chen, Xin, additional, Wang, Chengliang, additional, Huang, Sui, additional, Chamberlain, Stormy, additional, Jin, Hong, additional, Korlach, Jonas, additional, McStay, Brian, additional, Sinha, Saurabh, additional, Janga, Sarath C, additional, Prasanth, Supriya, additional, and Prasanth, Kannanganattu V, additional
- Published
- 2024
- Full Text
- View/download PDF
14. The Epigenomic Landscape of Prokaryotes.
- Author
-
Blow, Matthew J, Clark, Tyson A, Daum, Chris G, Deutschbauer, Adam M, Fomenkov, Alexey, Fries, Roxanne, Froula, Jeff, Kang, Dongwan D, Malmstrom, Rex R, Morgan, Richard D, Posfai, Janos, Singh, Kanwar, Visel, Axel, Wetmore, Kelly, Zhao, Zhiying, Rubin, Edward M, Korlach, Jonas, Pennacchio, Len A, and Roberts, Richard J
- Subjects
Prokaryotic Cells ,DNA Restriction-Modification Enzymes ,Methyltransferases ,Evolution ,Molecular ,Phylogeny ,DNA Methylation ,DNA Replication ,Gene Expression Regulation ,Conserved Sequence ,Substrate Specificity ,Multigene Family ,Genome ,Epigenomics ,Molecular Sequence Annotation ,Nucleotide Motifs ,Evolution ,Molecular ,Human Genome ,Genetics ,Generic Health Relevance ,Developmental Biology - Abstract
DNA methylation acts in concert with restriction enzymes to protect the integrity of prokaryotic genomes. Studies in a limited number of organisms suggest that methylation also contributes to prokaryotic genome regulation, but the prevalence and properties of such non-restriction-associated methylation systems remain poorly understood. Here, we used single molecule, real-time sequencing to map DNA modifications including m6A, m4C, and m5C across the genomes of 230 diverse bacterial and archaeal species. We observed DNA methylation in nearly all (93%) organisms examined, and identified a total of 834 distinct reproducibly methylated motifs. This data enabled annotation of the DNA binding specificities of 620 DNA Methyltransferases (MTases), doubling known specificities for previously hard to study Type I, IIG and III MTases, and revealing their extraordinary diversity. Strikingly, 48% of organisms harbor active Type II MTases with no apparent cognate restriction enzyme. These active 'orphan' MTases are present in diverse bacterial and archaeal phyla and show motif specificities and methylation patterns consistent with functions in gene regulation and DNA replication. Our results reveal the pervasive presence of DNA methylation throughout the prokaryotic kingdoms, as well as the diversity of sequence specificities and potential functions of DNA methylation systems.
- Published
- 2016
15. Genomic Studies Need Diversity
- Author
-
Korlach, Jonas
- Published
- 2019
16. Complete Genome Sequence of Streptomyces sp. Strain CCM_MD2014, Isolated from Topsoil in Woods Hole, Massachusetts.
- Author
-
Mariita, Richard M, Bhatnagar, Srijak, Hanselmann, Kurt, Hossain, Mohammad J, Korlach, Jonas, Boitano, Matthew, Roberts, Richard J, Liles, Mark R, Moss, Anthony G, Leadbetter, Jared R, Newman, Dianne K, and Dawson, Scott C
- Subjects
Biochemistry and Cell Biology ,Genetics ,Microbiology - Abstract
Here, we present the complete genome sequence of Streptomyces sp. strain CCM_MD2014 (phylum Actinobacteria), isolated from surface soil in Woods Hole, MA. Its single linear chromosome of 8,274,043 bp in length has a 72.13% G+C content and contains 6,948 coding sequences.
- Published
- 2015
17. Complete Genome Sequence of Curtobacterium sp. Strain MR_MD2014, Isolated from Topsoil in Woods Hole, Massachusetts.
- Author
-
Mariita, Richard M, Bhatnagar, Srijak, Hanselmann, Kurt, Hossain, Mohammad J, Korlach, Jonas, Boitano, Matthew, Roberts, Richard J, Liles, Mark R, Moss, Anthony G, Leadbetter, Jared R, Newman, Dianne K, and Dawson, Scott C
- Subjects
Biochemistry and Cell Biology ,Genetics ,Microbiology - Abstract
Here, we present the 3,443,800-bp complete genome sequence of Curtobacterium sp. strain MR_MD2014 (phylum Actinobacteria). This strain was isolated from soil in Woods Hole, MA, as part of the 2014 Microbial Diversity Summer Program at the Marine Biological Laboratory in Woods Hole, MA.
- Published
- 2015
18. Resolving the complexity of the human genome using single-molecule sequencing
- Author
-
Chaisson, Mark JP, Huddleston, John, Dennis, Megan Y, Sudmant, Peter H, Malig, Maika, Hormozdiari, Fereydoun, Antonacci, Francesca, Surti, Urvashi, Sandstrom, Richard, Boitano, Matthew, Landolin, Jane M, Stamatoyannopoulos, John A, Hunkapiller, Michael W, Korlach, Jonas, and Eichler, Evan E
- Subjects
Genetics ,Human Genome ,Generic health relevance ,Chromosome Inversion ,Chromosomes ,Human ,Pair 10 ,Cloning ,Molecular ,GC Rich Sequence ,Genetic Variation ,Genome ,Human ,Genomics ,Haploidy ,Humans ,Mutagenesis ,Insertional ,Reference Standards ,Sequence Analysis ,DNA ,Tandem Repeat Sequences ,General Science & Technology - Abstract
The human genome is arguably the most complete mammalian reference assembly, yet more than 160 euchromatic gaps remain and aspects of its structural variation remain poorly understood ten years after its completion. To identify missing sequence and genetic variation, here we sequence and analyse a haploid human genome (CHM1) using single-molecule, real-time DNA sequencing. We close or extend 55% of the remaining interstitial gaps in the human GRCh37 reference genome--78% of which carried long runs of degenerate short tandem repeats, often several kilobases in length, embedded within (G+C)-rich genomic regions. We resolve the complete sequence of 26,079 euchromatic structural variants at the base-pair level, including inversions, complex insertions and long tracts of tandem repeats. Most have not been previously reported, with the greatest increases in sensitivity occurring for events less than 5 kilobases in size. Compared to the human reference, we find a significant insertional bias (3:1) in regions corresponding to complex insertions and long short tandem repeats. Our results suggest a greater complexity of the human genome in the form of variation of longer and more complex repetitive DNA that can now be largely resolved with the application of this longer-read sequencing technology.
- Published
- 2015
19. Reconstructing complex regions of genomes using long-read sequencing technology
- Author
-
Huddleston, John, Ranade, Swati, Malig, Maika, Antonacci, Francesca, Chaisson, Mark, Hon, Lawrence, Sudmant, Peter H, Graves, Tina A, Alkan, Can, Dennis, Megan Y, Wilson, Richard K, Turner, Stephen W, Korlach, Jonas, and Eichler, Evan E
- Subjects
Biotechnology ,Human Genome ,Genetics ,Generic health relevance ,Animals ,Chromosomes ,Artificial ,Bacterial ,Chromosomes ,Human ,Pair 17 ,Genome ,Bacterial ,High-Throughput Nucleotide Sequencing ,Humans ,Mice ,Molecular Sequence Data ,Pan troglodytes ,Biological Sciences ,Medical and Health Sciences ,Bioinformatics - Abstract
Obtaining high-quality sequence continuity of complex regions of recent segmental duplication remains one of the major challenges of finishing genome assemblies. In the human and mouse genomes, this was achieved by targeting large-insert clones using costly and laborious capillary-based sequencing approaches. Sanger shotgun sequencing of clone inserts, however, has now been largely abandoned, leaving most of these regions unresolved in newer genome assemblies generated primarily by next-generation sequencing hybrid approaches. Here we show that it is possible to resolve regions that are complex in a genome-wide context but simple in isolation for a fraction of the time and cost of traditional methods using long-read single molecule, real-time (SMRT) sequencing and assembly technology from Pacific Biosciences (PacBio). We sequenced and assembled BAC clones corresponding to a 1.3-Mbp complex region of chromosome 17q21.31, demonstrating 99.994% identity to Sanger assemblies of the same clones. We targeted 44 differences using Illumina sequencing and find that PacBio and Sanger assemblies share a comparable number of validated variants, albeit with different sequence context biases. Finally, we targeted a poorly assembled 766-kbp duplicated region of the chimpanzee genome and resolved the structure and organization for a fraction of the cost and time of traditional finishing approaches. Our data suggest a straightforward path for upgrading genomes to a higher quality finished state.
- Published
- 2014
20. Novel giant siphovirus from Bacillus anthracis features unusual genome characteristics.
- Author
-
Ganz, Holly H, Law, Christina, Schmuki, Martina, Eichenseher, Fritz, Calendar, Richard, Loessner, Martin J, Getz, Wayne M, Korlach, Jonas, Beyer, Wolfgang, and Klumpp, Jochen
- Subjects
Animals ,Equidae ,Bacillus anthracis ,Siphoviridae ,Endopeptidases ,Bayes Theorem ,Sequence Analysis ,DNA ,Soil Microbiology ,Demography ,Phylogeny ,Species Specificity ,Base Sequence ,Genome ,Viral ,Models ,Genetic ,Molecular Sequence Data ,Namibia ,Sequence Analysis ,DNA ,Genome ,Viral ,Models ,Genetic ,General Science & Technology - Abstract
Here we present vB_BanS-Tsamsa, a novel temperate phage isolated from Bacillus anthracis, the agent responsible for anthrax infections in wildlife, livestock and humans. Tsamsa phage is a giant siphovirus (order Caudovirales), featuring a long, flexible and non-contractile tail of 440 nm (not including baseplate structure) and an isometric head of 82 nm in diameter. We induced Tsamsa phage in samples from two different carcass sites in Etosha National Park, Namibia. The Tsamsa phage genome is the largest sequenced Bacillus siphovirus, containing 168,876 bp and 272 ORFs. The genome features an integrase/recombinase enzyme, indicative of a temperate lifestyle. Among bacterial strains tested, the phage infected only certain members of the Bacillus cereus sensu lato group (B. anthracis, B. cereus and B. thuringiensis) and exhibited moderate specificity for B. anthracis. Tsamsa lysed seven out of 25 B. cereus strains, two out of five B. thuringiensis strains and six out of seven B. anthracis strains tested. It did not lyse B. anthracis PAK-1, an atypical strain that is also resistant to both gamma phage and cherry phage. The Tsamsa endolysin features a broader lytic spectrum than the phage host range, indicating possible use of the enzyme in Bacillus biocontrol.
- Published
- 2014
21. Synchronized long-read genome, methylome, epigenome, and transcriptome for resolving a Mendelian condition
- Author
-
Vollger, Mitchell R., primary, Korlach, Jonas, additional, Eldred, Kiara C., additional, Swanson, Elliott, additional, Underwood, Jason G., additional, Munson, Katherine M., additional, Cheng, Yong-Han H., additional, Ranchalis, Jane, additional, Mao, Yizi, additional, Blue, Elizabeth E., additional, Schwarze, Ulrike, additional, Saunders, Christopher T., additional, Wenger, Aaron M., additional, Allworth, Aimee, additional, Chanprasert, Sirisak, additional, Duerden, Brittney L., additional, Glass, Ian, additional, Horike-Pyne, Martha, additional, Kim, Michelle, additional, Leppig, Kathleen A., additional, McLaughlin, Ian J., additional, Ogawa, Jessica, additional, Rosenthal, Elisabeth A., additional, Sheppeard, Sam, additional, Sherman, Stephanie M., additional, Strohbehn, Samuel, additional, Yuen, Amy L., additional, Reh, Thomas A., additional, Byers, Peter H., additional, Bamshad, Michael J., additional, Hisama, Fuki M., additional, Jarvik, Gail P., additional, Sancak, Yasemin, additional, Dipple, Katrina M., additional, and Stergachis, Andrew B., additional
- Published
- 2023
- Full Text
- View/download PDF
22. Exploring the roles of DNA methylation in the metal-reducing bacterium Shewanella oneidensis MR-1
- Author
-
Bendall, Matthew L., Luong, Khai, Wetmore, Kelly M., Blow, Matthew, Korlach, Jonas, Deutschbauer, Adam, and Malmstrom, Rex
- Subjects
DNA ,methylation ,Shewanella oneidensis MR-1 - Abstract
We performed whole genome analyses of DNA methylation in Shewanella17 oneidensis MR-1 to examine its possible role in regulating gene expression and18 other cellular processes. Single-Molecule Real Time (SMRT) sequencing19 revealed extensive methylation of adenine (N6mA) throughout the20 genome. These methylated bases were located in five sequence motifs,21 including three novel targets for Type I restriction/modification enzymes. The22 sequence motifs targeted by putative methyltranferases were determined via23 SMRT sequencing of gene knockout mutants. In addition, we found S.24 oneidensis MR-1 cultures grown under various culture conditions displayed25 different DNA methylation patterns. However, the small number of differentially26 methylated sites could not be directly linked to the much larger number of27 differentially expressed genes in these conditions, suggesting DNA methylation is28 not a major regulator of gene expression in S. oneidensis MR-1. The enrichment29 of methylated GATC motifs in the origin of replication indicate DNA methylation30 may regulate genome replication in a manner similar to that seen in Escherichia31 coli. Furthermore, comparative analyses suggest that many32 Gammaproteobacteria, including all members of the Shewanellaceae family, may 33 also utilize DNA methylation to regulate genome replication.
- Published
- 2013
23. Genome and Methylome Variation in Helicobacter pylori With a cag Pathogenicity Island During Early Stages of Human Infection
- Author
-
Nell, Sandra, Estibariz, Iratxe, Krebes, Juliane, Bunk, Boyke, Graham, David Y., Overmann, Jörg, Song, Yi, Spröer, Cathrin, Yang, Ines, Wex, Thomas, Korlach, Jonas, Malfertheiner, Peter, and Suerbaum, Sebastian
- Published
- 2018
- Full Text
- View/download PDF
24. PacBio Only Assembly with Low Genomic DNA Input
- Author
-
Zhao, Zhiying, Tsai, Yu-Chih, Clum, Alicia, Munson, Katherine, Daum, Chris, Turner, Stephen W., Korlach, Jonas, Pennacchio, Len A., and Chen, Feng
- Subjects
PacBio ,Low Genomic DNA ,de novo assembly - Abstract
The assembly and analysis of microbial species on earth remains a largely unexplored area of life. This is partially due to their inability to be cultured but also based on the large historic cost of drafting and finishing individual microbial species genomes. The single-molecule real-time (SMRTTM) sequencing platform developed by Pacific Biosciences (PacBio) offers several benefits including Single Molecule real-time analysis, longer read length at fast speed, low sequencing redundancy and bias. Thus, it was used at JGI as a quick-turnaround and cost-effective solution for finishing microbial genomes. Construction of PacBio library by traditional protocol still requires micrograms of genomic DNA. In many cases, getting high quantity of genomic DNA remains as a major challenge. Recently, PacBio developed a more efficient library construction method using terminal deoxynucleotidyl transferase (TdT), which makes it possible to obtain sufficient sequencing data for assembly from significantly smaller amount of genomic DNA. We have tested and validated this newly developed method. Preliminary analysis results suggested that this technology can be used for microbial genome assembly with PacBio only data
- Published
- 2013
25. The vaginal microbiome and preterm birth
- Author
-
Fettweis, Jennifer M., Serrano, Myrna G., Brooks, J. Paul, Edwards, David J., Girerd, Philippe H., Parikh, Hardik I., Huang, Bernice, Arodz, Tom J., Edupuganti, Laahirie, Glascock, Abigail L., Xu, Jie, Jimenez, Nicole R., Vivadelli, Stephany C., Fong, Stephen S., Sheth, Nihar U., Jean, Sophonie, Lee, Vladimir, Bokhari, Yahya A., Lara, Ana M., Mistry, Shreni D., Duckworth, III, Robert A., Bradley, Steven P., Koparde, Vishal N., Orenda, X. Valentine, Milton, Sarah H., Rozycki, Sarah K., Matveyev, Andrey V., Wright, Michelle L., Huzurbazar, Snehalata V., Jackson, Eugenie M., Smirnova, Ekaterina, Korlach, Jonas, Tsai, Yu-Chih, Dickinson, Molly R., Brooks, Jamie L., Drake, Jennifer I., Chaffin, Donald O., Sexton, Amber L., Gravett, Michael G., Rubens, Craig E., Wijesooriya, N. Romesh, Hendricks-Muñoz, Karen D., Jefferson, Kimberly K., Strauss, III, Jerome F., and Buck, Gregory A.
- Published
- 2019
- Full Text
- View/download PDF
26. Spontaneous Nucleotide Exchange in Low Molecular Weight GTPases by Fluorescently Labeled γ-Phosphate-Linked GTP Analogs
- Author
-
Korlach, Jonas, Baird, Daniel W., Heikal, Ahmed A., Gee, Kyle R., Hoffman, Gregory R., and Webb, Watt W.
- Published
- 2004
27. Monoallelically expressed noncoding RNAs form nucleolar territories on NOR-containing chromosomes and regulate rRNA expression.
- Author
-
Qinyu Hao, Minxue Liu, Daulatabad, Swapna Vidhur, Gaffari, Saba, You Jin Song, Srivastava, Rajneesh, Bhaskar, Shivang, Moitra, Anurupa, Mangan, Hazel, Tseng, Elizabeth, Gilmore, Rachel B., Frier, Susan M., Xin Chen, Chengliang Wang, Sui Huang, Chamberlain, Stormy, Hong Jin, Korlach, Jonas, McStay, Brian, and Sinha, Saurabh
- Published
- 2024
- Full Text
- View/download PDF
28. Characterization of Lipid Bilayer Phases by Confocal Microscopy and Fluorescence Correlation Spectroscopy
- Author
-
Korlach, Jonas, Schwille, Petra, Webb, Watt W., and Feigenson, Gerald W.
- Published
- 1999
29. NCTC3000: a century of bacterial strain collecting leads to a rich genomic data resource
- Author
-
Dicks, Jo, primary, Fazal, Mohammed-Abbas, additional, Oliver, Karen, additional, Grayson, Nicholas E., additional, Turnbull, Jake D., additional, Bane, Evangeline, additional, Burnett, Edward, additional, Deheer-Graham, Ana, additional, Holroyd, Nancy, additional, Kaushal, Dorota, additional, Keane, Jacqueline, additional, Langridge, Gemma, additional, Lomax, Jane, additional, McGregor, Hannah, additional, Picton, Steve, additional, Quail, Michael, additional, Singh, Deepak, additional, Tracey, Alan, additional, Korlach, Jonas, additional, Russell, Julie E., additional, Alexander, Sarah, additional, and Parkhill, Julian, additional
- Published
- 2023
- Full Text
- View/download PDF
30. Abstract LB078: pbfusion: Detecting gene-fusion and other transcriptional abnormalities using PacBio HiFi data
- Author
-
Volden, Roger, primary, Kronenberg, Zev, additional, Gillmor, Aaron, additional, Verhey, Ted, additional, Monument, Michael, additional, Senger, Donna, additional, Dhillon, Harsharan, additional, Underwood, Jason, additional, Tseng, Elizabeth, additional, Baker, Daniel, additional, Baybayan, Primo, additional, Eberle, Michael A., additional, Korlach, Jonas, additional, and Morrissy, Sorana, additional
- Published
- 2023
- Full Text
- View/download PDF
31. Single-Molecule Sequencing
- Author
-
Korlach, Jonas, Turner, Stephen W, and Roberts, Gordon C. K., editor
- Published
- 2013
- Full Text
- View/download PDF
32. Simultaneous sequencing of oxidized methylcytosines produced by TET/JBP dioxygenases in Coprinopsis cinerea
- Author
-
Chavez, Lukas, Huang, Yun, Luong, Khai, Agarwal, Suneet, Iyer, Lakshminaraya M., Pastor, William A., Hench, Virginia K., Frazier-Bowers, Sylvia A., Korol, Evgenia, Liu, Shuo, Tahiliani, Mamta, Wang, Yinsheng, Clark, Tyson A., Korlach, Jonas, Pukkila, Patricia J., Aravind, L., and Rao, Anjana
- Published
- 2014
33. Long‐read HiFi sequencing of NUDT15 : Phased full‐gene haplotyping and pharmacogenomic allele discovery
- Author
-
Scott, Erick R., primary, Yang, Yao, additional, Botton, Mariana R., additional, Seki, Yoshinori, additional, Hoshitsuki, Keito, additional, Harting, John, additional, Baybayan, Primo, additional, Cody, Neal, additional, Nicoletti, Paola, additional, Moriyama, Takaya, additional, Chakraborty, Shreyasee, additional, Yang, Jun J., additional, Edelmann, Lisa, additional, Schadt, Eric E., additional, Korlach, Jonas, additional, and Scott, Stuart A., additional
- Published
- 2022
- Full Text
- View/download PDF
34. Author Correction: Comprehensive analysis of single molecule sequencing-derived complete genome and whole transcriptome of Hyposidra talaca nuclear polyhedrosis virus
- Author
-
Nguyen, Thong T., Suryamohan, Kushal, Kuriakose, Boney, Janakiraman, Vasantharajan, Reichelt, Mike, Chaudhuri, Subhra, Guillory, Joseph, Divakaran, Neethu, Rabins, P. E., Goel, Ridhi, Deka, Bhabesh, Sarkar, Suman, Ekka, Preety, Tsai, Yu-Chih, Vargas, Derek, Santhosh, Sam, Mohan, Sangeetha, Chin, Chen-Shan, Korlach, Jonas, Thomas, George, Babu, Azariah, and Seshagiri, Somasekar
- Published
- 2018
- Full Text
- View/download PDF
35. Comprehensive analysis of single molecule sequencing-derived complete genome and whole transcriptome of Hyposidra talaca nuclear polyhedrosis virus
- Author
-
Nguyen, Thong T., Suryamohan, Kushal, Kuriakose, Boney, Janakiraman, Vasantharajan, Reichelt, Mike, Chaudhuri, Subhra, Guillory, Joseph, Divakaran, Neethu, Rabins, P. E., Goel, Ridhi, Deka, Bhabesh, Sarkar, Suman, Ekka, Preety, Tsai, Yu-Chih, Vargas, Derek, Santhosh, Sam, Mohan, Sangeetha, Chin, Chen-Shan, Korlach, Jonas, Thomas, George, Babu, Azariah, and Seshagiri, Somasekar
- Published
- 2018
- Full Text
- View/download PDF
36. Zero-Mode Waveguides
- Author
-
Korlach, Jonas, Turner, Stephen W., and Roberts, Gordon C. K., editor
- Published
- 2013
- Full Text
- View/download PDF
37. Monoallelically-expressed Noncoding RNAs form nucleolar territories on NOR-containing chromosomes and regulate rRNA expression
- Author
-
Hao, Qinyu, primary, Liu, Minxue, additional, Daulatabad, Swapna Vidhur, additional, Gaffari, Saba, additional, Srivastava, Rajneesh, additional, Song, You Jin, additional, Bhaskar, Shivang, additional, Moitra, Anurupa, additional, Mangan, Hazel, additional, Tseng, Elizabeth, additional, Gilmore, Rachel B., additional, Frier, Susan M, additional, Chen, Xin, additional, Wang, Chengliang, additional, Huang, Sui, additional, Chamberlain, Stormy, additional, Jin, Hong, additional, Korlach, Jonas, additional, McStay, Brian, additional, Sinha, Saurabh, additional, Janga, Sarath C., additional, Prasanth, Supriya, additional, and Prasanth, Kannanganattu V, additional
- Published
- 2022
- Full Text
- View/download PDF
38. Genomic answers for children: Dynamic analyses of >1000 pediatric rare disease genomes
- Author
-
Cohen, Ana S.A., primary, Farrow, Emily G., additional, Abdelmoity, Ahmed T., additional, Alaimo, Joseph T., additional, Amudhavalli, Shivarajan M., additional, Anderson, John T., additional, Bansal, Lalit, additional, Bartik, Lauren, additional, Baybayan, Primo, additional, Belden, Bradley, additional, Berrios, Courtney D., additional, Biswell, Rebecca L., additional, Buczkowicz, Pawel, additional, Buske, Orion, additional, Chakraborty, Shreyasee, additional, Cheung, Warren A., additional, Coffman, Keith A., additional, Cooper, Ashley M., additional, Cross, Laura A., additional, Curran, Tom, additional, Dang, Thuy Tien T., additional, Elfrink, Mary M., additional, Engleman, Kendra L., additional, Fecske, Erin D., additional, Fieser, Cynthia, additional, Fitzgerald, Keely, additional, Fleming, Emily A., additional, Gadea, Randi N., additional, Gannon, Jennifer L., additional, Gelineau-Morel, Rose N., additional, Gibson, Margaret, additional, Goldstein, Jeffrey, additional, Grundberg, Elin, additional, Halpin, Kelsee, additional, Harvey, Brian S., additional, Heese, Bryce A., additional, Hein, Wendy, additional, Herd, Suzanne M., additional, Hughes, Susan S., additional, Ilyas, Mohammed, additional, Jacobson, Jill, additional, Jenkins, Janda L., additional, Jiang, Shao, additional, Johnston, Jeffrey J., additional, Keeler, Kathryn, additional, Korlach, Jonas, additional, Kussmann, Jennifer, additional, Lambert, Christine, additional, Lawson, Caitlin, additional, Le Pichon, Jean-Baptiste, additional, Leeder, James Steven, additional, Little, Vicki C., additional, Louiselle, Daniel A., additional, Lypka, Michael, additional, McDonald, Brittany D., additional, Miller, Neil, additional, Modrcin, Ann, additional, Nair, Annapoorna, additional, Neal, Shelby H., additional, Oermann, Christopher M., additional, Pacicca, Donna M., additional, Pawar, Kailash, additional, Posey, Nyshele L., additional, Price, Nigel, additional, Puckett, Laura M.B., additional, Quezada, Julio F., additional, Raje, Nikita, additional, Rowell, William J., additional, Rush, Eric T., additional, Sampath, Venkatesh, additional, Saunders, Carol J., additional, Schwager, Caitlin, additional, Schwend, Richard M., additional, Shaffer, Elizabeth, additional, Smail, Craig, additional, Soden, Sarah, additional, Strenk, Meghan E., additional, Sullivan, Bonnie R., additional, Sweeney, Brooke R., additional, Tam-Williams, Jade B., additional, Walter, Adam M., additional, Welsh, Holly, additional, Wenger, Aaron M., additional, Willig, Laurel K., additional, Yan, Yun, additional, Younger, Scott T., additional, Zhou, Dihong, additional, Zion, Tricia N., additional, Thiffault, Isabelle, additional, and Pastinen, Tomi, additional
- Published
- 2022
- Full Text
- View/download PDF
39. Global methylation state at base-pair resolution of the Caulobacter genome throughout the cell cycle
- Author
-
Kozdon, Jennifer B., Melfi, Michael D., Luong, Khai, Clark, Tyson A., Boitano, Matthew, Wang, Susana, Bo Zhou, Gonzalez, Diego, Collier, Justine, Turner, Stephen W., Korlach, Jonas, Shapiro, Lucy, and McAdams, Harley H.
- Published
- 2013
40. Semi-automated assembly of high-quality diploid human reference genomes
- Author
-
Jarvis, Erich D., Formenti, Giulio, Rhie, Arang, Guarracino, Andrea, Yang, Chentao, Wood, Jonathan, Tracey, Alan, Thibaud-Nissen, Francoise, Vollger, Mitchell R., Porubsky, David, Cheng, Haoyu, Asri, Mobin, Logsdon, Glennis A., Carnevali, Paolo, Chaisson, Mark J.P., Chin, Chen Shan, Cody, Sarah, Collins, Joanna, Ebert, Peter, Escalona, Merly, Fedrigo, Olivier, Fulton, Robert S., Fulton, Lucinda L., Garg, Shilpa, Gerton, Jennifer L., Ghurye, Jay, Granat, Anastasiya, Green, Richard E., Harvey, William, Hasenfeld, Patrick, Hastie, Alex, Haukness, Marina, Jaeger, Erich B., Jain, Miten, Kirsche, Melanie, Kolmogorov, Mikhail, Korbel, Jan O., Koren, Sergey, Korlach, Jonas, Lee, Joyce, Li, Daofeng, Lindsay, Tina, Lucas, Julian, Luo, Feng, Marschall, Tobias, Mitchell, Matthew W., McDaniel, Jennifer, Nie, Fan, Zhang, Guojie, Li, Heng, Jarvis, Erich D., Formenti, Giulio, Rhie, Arang, Guarracino, Andrea, Yang, Chentao, Wood, Jonathan, Tracey, Alan, Thibaud-Nissen, Francoise, Vollger, Mitchell R., Porubsky, David, Cheng, Haoyu, Asri, Mobin, Logsdon, Glennis A., Carnevali, Paolo, Chaisson, Mark J.P., Chin, Chen Shan, Cody, Sarah, Collins, Joanna, Ebert, Peter, Escalona, Merly, Fedrigo, Olivier, Fulton, Robert S., Fulton, Lucinda L., Garg, Shilpa, Gerton, Jennifer L., Ghurye, Jay, Granat, Anastasiya, Green, Richard E., Harvey, William, Hasenfeld, Patrick, Hastie, Alex, Haukness, Marina, Jaeger, Erich B., Jain, Miten, Kirsche, Melanie, Kolmogorov, Mikhail, Korbel, Jan O., Koren, Sergey, Korlach, Jonas, Lee, Joyce, Li, Daofeng, Lindsay, Tina, Lucas, Julian, Luo, Feng, Marschall, Tobias, Mitchell, Matthew W., McDaniel, Jennifer, Nie, Fan, Zhang, Guojie, and Li, Heng
- Abstract
The current human reference genome, GRCh38, represents over 20 years of effort to generate a high-quality assembly, which has benefitted society1,2. However, it still has many gaps and errors, and does not represent a biological genome as it is a blend of multiple individuals3,4. Recently, a high-quality telomere-to-telomere reference, CHM13, was generated with the latest long-read technologies, but it was derived from a hydatidiform mole cell line with a nearly homozygous genome5. To address these limitations, the Human Pangenome Reference Consortium formed with the goal of creating high-quality, cost-effective, diploid genome assemblies for a pangenome reference that represents human genetic diversity6. Here, in our first scientific report, we determined which combination of current genome sequencing and assembly approaches yield the most complete and accurate diploid genome assembly with minimal manual curation. Approaches that used highly accurate long reads and parent–child data with graph-based haplotype phasing during assembly outperformed those that did not. Developing a combination of the top-performing methods, we generated our first high-quality diploid reference assembly, containing only approximately four gaps per chromosome on average, with most chromosomes within ±1% of the length of CHM13. Nearly 48% of protein-coding genes have non-synonymous amino acid changes between haplotypes, and centromeric regions showed the highest diversity. Our findings serve as a foundation for assembling near-complete diploid human genomes at scale for a pangenome reference to capture global genetic variation from single nucleotides to structural rearrangements.
- Published
- 2022
41. Germline mosaicism of a missense variant inKCNC2in a multiplex family with autism and epilepsy characterized by long‐read sequencing
- Author
-
Mehinovic, Elvisa, primary, Gray, Teddi, additional, Campbell, Meghan, additional, Ekholm, Jenny, additional, Wenger, Aaron, additional, Rowell, William, additional, Grudo, Ari, additional, Grimwood, Jane, additional, Korlach, Jonas, additional, Gurnett, Christina, additional, Constantino, John N., additional, and Turner, Tychele N., additional
- Published
- 2022
- Full Text
- View/download PDF
42. Real-Time DNA Sequencing from Single Polymerase Molecules
- Author
-
Eid, John, Fehr, Adrian, Gray, Jeremy, Luong, Khai, Lyle, John, Otto, Geoff, Peluso, Paul, Rank, David, Baybayan, Primo, Bettman, Brad, Bibillo, Arkadiusz, Bjornson, Keith, Chaudhuri, Bidhan, Christians, Frederick, Cicero, Ronald, Clark, Sonya, Dalal, Ravindra, deWinter, Alex, Dixon, John, Foquet, Mathieu, Gaertner, Alfred, Hardenbol, Paul, Heiner, Cheryl, Hester, Kevin, Holden, David, Kearns, Gregory, Kong, Xiangxu, Kuse, Ronald, Lacroix, Yves, Lin, Steven, Lundquist, Paul, Ma, Congcong, Marks, Patrick, Maxham, Mark, Murphy, Devon, Park, Insil, Pham, Thang, Phillips, Michael, Roy, Joy, Sebra, Robert, Shen, Gene, Sorenson, Jon, Tomaney, Austin, Travers, Kevin, Trulson, Mark, Vieceli, John, Wegener, Jeffrey, Wu, Dawn, Yang, Alicia, Zaccarin, Denis, Zhao, Peter, Zhong, Frank, Korlach, Jonas, and Turner, Stephen
- Published
- 2009
- Full Text
- View/download PDF
43. Selective Aluminum Passivation for Targeted Immobilization of Single DNA Polymerase Molecules in Zero-Mode Waveguide Nanostructures
- Author
-
Korlach, Jonas, Marks, Patrick J., Cicero, Ronald L., Gray, Jeremy J., Murphy, Devon L., Roitman, Daniel B., Pham, Thang T., Otto, Geoff A., Foquet, Mathieu, and Turner, Stephen W.
- Published
- 2008
- Full Text
- View/download PDF
44. Conformational Changes of Calmodulin upon Ca²⁺ Binding Studied with a Microfluidic Mixer
- Author
-
Park, Hye Yoon, Kim, Sally A., Korlach, Jonas, Rhoades, Elizabeth, Kwok, Lisa W., Zipfel, Warren R., Waxham, M. Neal, Webb, Watt W., and Pollack, Lois
- Published
- 2008
- Full Text
- View/download PDF
45. Germline mosaicism of a missense variant in KCNC2 in a multiplex family with autism and epilepsy
- Author
-
Mehinovic, Elvisa, primary, Gray, Teddi, additional, Campbell, Meghan, additional, Ekholm, Jenny, additional, Wenger, Aaron, additional, Rowell, William, additional, Grudo, Ari, additional, Grimwood, Jane, additional, Korlach, Jonas, additional, Gurnett, Christina, additional, Constantino, John N., additional, and Turner, Tychele N., additional
- Published
- 2021
- Full Text
- View/download PDF
46. List of Contributors
- Author
-
Ajami, Nadim J., primary, Almeida, Mathieu, additional, Bowman, Brett, additional, del Castillo, Erika, additional, Cho, Yong-Joon, additional, Gerber, Georg K., additional, He, Shaomei, additional, Izard, Jacques, additional, Kim, Mincheol, additional, Korlach, Jonas, additional, La Rosa, Patricio S., additional, Petrosino, Joseph F., additional, Pop, Mihai, additional, Rivera, Maria C., additional, Scholz, Matthias, additional, Segata, Nicola, additional, Shannon, William D., additional, Sodergren, Erica, additional, Tett, Adrian, additional, Weinstock, George, additional, and Zhou, Yanjiao, additional
- Published
- 2015
- Full Text
- View/download PDF
47. Long-Read, Single Molecule, Real-Time (SMRT) DNA Sequencing for Metagenomic Applications
- Author
-
Bowman, Brett, primary, Kim, Mincheol, additional, Cho, Yong-Joon, additional, and Korlach, Jonas, additional
- Published
- 2015
- Full Text
- View/download PDF
48. IGenomic answers for children: Dynamic analyses of >1000 pediatric rare disease genomes
- Author
-
Cohen, Ana SA, primary, Farrow, Emily G, additional, Abdelmoity, Ahmed T, additional, Alaimo, Joseph T, additional, Amudhavalli, Shivarajan M, additional, Anderson, John T, additional, Bansal, Lalit, additional, Bartik, Lauren, additional, Baybayan, Primo, additional, Belden, Bradley, additional, Berrios, Courtney D, additional, Biswell, Rebecca L, additional, Buczkowicz, Pawel, additional, Buske, Orion, additional, Chakraborty, Shreyasee, additional, Cheung, Warren A, additional, Coffman, Keith A, additional, Cooper, Ashley M, additional, Cross, Laura A, additional, Curran, Thomas, additional, Dang, Thuy Tien T, additional, Elfrink, Mary M, additional, Engleman, Kendra L, additional, Fecske, Erin D, additional, Fieser, Cynthia, additional, Fitzgerald, Keely, additional, Fleming, Emily A, additional, Gadea, Randi N, additional, Gannon, Jennifer L, additional, Gelineau-Morel, Rose N, additional, Gibson, Margaret, additional, Goldstein, Jeffrey, additional, Grundberg, Elin, additional, Halpin, Kelsee, additional, Harvey, Brian S, additional, Heese, Bryce A, additional, Hein, Wendy, additional, Herd, Suzanne M, additional, Hughes, Susan S, additional, Ilyas, Mohammed, additional, Jacobson, Jill, additional, Jenkins, Janda L, additional, Jiang, Shao, additional, Johnston, Jeffrey J, additional, Keeler, Kathryn, additional, Korlach, Jonas, additional, Kussmann, Jennifer, additional, Lambert, Christine, additional, Lawson, Caitlin, additional, Le Pichon, Jean-Baptiste, additional, Leeder, Steve, additional, Little, Vicki C, additional, Louiselle, Daniel A, additional, Lypka, Michael, additional, McDonald, Brittany D, additional, Miller, Neil, additional, Modrcin, Ann, additional, Nair, Annapoorna, additional, Neal, Shelby H, additional, Oermann, Christopher M, additional, Pacicca, Donna M, additional, Pawar, Kailash, additional, Posey, Nyshele L, additional, Price, Nigel, additional, Puckett, Laura MB, additional, Quezada, Julio F, additional, Raje, Nikita, additional, Rowell, William J, additional, Rush, Eric T, additional, Sampath, Venkatesh, additional, Saunders, Carol J, additional, Schwager, Caitlin, additional, Schwend, Richard M, additional, Shaffer, Elizabeth, additional, Smail, Craig, additional, Soden, Sarah, additional, Strenk, Meghan E, additional, Sullivan, Bonnie R, additional, Sweeney, Brooke R, additional, Tam-Williams, Jade B, additional, Walter, Adam M, additional, Welsh, Holly, additional, Wenger, Aaron M, additional, Willig, Laurel K, additional, Yan, Yun, additional, Younger, Scott T, additional, Zhou, Dihong, additional, Zion, Tricia N, additional, Thiffault, Isabelle, additional, and Pastinen, Tomi, additional
- Published
- 2021
- Full Text
- View/download PDF
49. Defining the sequence requirements for the positioning of base J in DNA using SMRT sequencing
- Author
-
Genest, Paul-Andre, Baugh, Loren, Taipale, Alex, Zhao, Wanqi, Jan, Sabrina, van Luenen, Henri G.A.M., Korlach, Jonas, Clark, Tyson, Luong, Khai, Boitano, Matthew, Turner, Steve, Myler, Peter J., and Borst, Piet
- Published
- 2015
- Full Text
- View/download PDF
50. Complete vertebrate mitogenomes reveal widespread repeats and gene duplications
- Author
-
Formenti, Giulio, Rhie, Arang, Balacco, Jennifer, Haase, Bettina, Mountcastle, Jacquelyn, Fedrigo, Olivier, Brown, Samara, Capodiferro, Marco Rosario, Al-Ajli, Farooq O., Ambrosini, Roberto, Houde, Peter, Koren, Sergey, Oliver, Karen, Smith, Michelle, Skelton, Jason, Betteridge, Emma, Dolucan, Jale, Corton, Craig, Bista, Iliana, Torrance, James, Tracey, Alan, Wood, Jonathan, Uliano-Silva, Marcela, Howe, Kerstin, McCarthy, Shane, Winkler, Sylke, Kwak, Woori, Korlach, Jonas, Fungtammasan, Arkarachai, Fordham, Daniel, Costa, Vania, Mayes, Simon, Chiara, Matteo, Horner, David S., Myers, Eugene, Durbin, Richard, Achilli, Alessandro, Braun, Edward L., Phillippy, Adam M., Jarvis, Erich D., Kirschel, Alexander N. G., Digby, Andrew, Veale, Andrew, Bronikowski, Anne, Murphy, Bob, Robertson, Bruce, Baker, Clare, Mazzoni, Camila, Balakrishnan, Christopher, Lee, Chul, Mead, Daniel, Teeling, Emma, Aiden, Erez Lieberman, Todd, Erica, Eichler, Evan, Naylor, Gavin J. P., Zhang, Guojie, Smith, Jeramiah, Wolf, Jochen, Touchon, Justin, Delmore, Kira, Jakobsen, Kjetill, Komoroske, Lisa, Wilkinson, Mark, Genner, Martin, Pšenička, Martin, Fuxjager, Matthew, Stratton, Mike, Liedvogel, Miriam, Gemmell, Neil, Minias, Piotr, Dunn, Peter O., Sudmant, Peter, Morin, Phil, Ayub, Qasim, Kraus, Robert, Vernes, Sonja, Smith, Steve, Lama, Tanya, Edwards, Taylor, Smith, Tim, Gilbert, Tom, Marques-Bonet, Tomas, Einfeldt, Tony, Venkatesh, Byrappa, Johnson, Warren, Warren, Wes, Bukhman, Yury, Formenti, Giulio [0000-0002-7554-5991], and Apollo - University of Cambridge Repository
- Subjects
Vertebrate ,Research ,Assembly ,Sequencing ,Duplications ,Long reads ,Repeats ,Mitochondrial DNA - Abstract
Background: Modern sequencing technologies should make the assembly of the relatively small mitochondrial genomes an easy undertaking. However, few tools exist that address mitochondrial assembly directly. Results: As part of the Vertebrate Genomes Project (VGP) we develop mitoVGP, a fully automated pipeline for similarity-based identification of mitochondrial reads and de novo assembly of mitochondrial genomes that incorporates both long (> 10 kbp, PacBio or Nanopore) and short (100–300 bp, Illumina) reads. Our pipeline leads to successful complete mitogenome assemblies of 100 vertebrate species of the VGP. We observe that tissue type and library size selection have considerable impact on mitogenome sequencing and assembly. Comparing our assemblies to purportedly complete reference mitogenomes based on short-read sequencing, we identify errors, missing sequences, and incomplete genes in those references, particularly in repetitive regions. Our assemblies also identify novel gene region duplications. The presence of repeats and duplications in over half of the species herein assembled indicates that their occurrence is a principle of mitochondrial structure rather than an exception, shedding new light on mitochondrial genome evolution and organization. Conclusions: Our results indicate that even in the “simple” case of vertebrate mitogenomes the completeness of many currently available reference sequences can be further improved, and caution should be exercised before claiming the complete assembly of a mitogenome, particularly from short reads alone.
- Published
- 2021
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.