26 results on '"David DeCaprio"'
Search Results
2. TB database: an integrated platform for tuberculosis research.
- Author
-
T. B. K. Reddy, Robert Riley 0002, Farrell Wymore, Phillip Montgomery, David DeCaprio, Reinhard Engels, Marcel Gellesch, Jeremy Hubble, Dennis Jen, Heng Jin, Michael Koehrsen, Lisa Larson, Maria Mao, Michael Nitzberg, Peter Sisk, Christian Stolte, Brian Weiner, Jared White, Zachariah K. Zachariah, Gavin Sherlock, James E. Galagan, Catherine A. Ball, and Gary K. Schoolnik
- Published
- 2009
- Full Text
- View/download PDF
3. List of contributors
- Author
-
Sofiane Abbar, Thamina Acter, Esteve Almirall, Fadi Al-Turjman, Arpita Jadhav Bhatt, David DeCaprio, Ajantha Devi, Paola Di Maio, Moawia E. Eldow, Feichin Ted Tschang, Ruby Jain, Nikolaos Mavridis, Mohamed Mokbel, Subash Nadar, Shivika Prasanna, Engelbert Quack, Praveen Rao, Neetu Sardana, Nizam Uddin, and Kai Wussow
- Published
- 2021
- Full Text
- View/download PDF
4. Preparing with predictions: forecasting epidemics with artificial intelligence
- Author
-
David DeCaprio
- Subjects
business.industry ,Computer science ,Preparedness ,Community organization ,Pandemic ,Resource allocation ,Relevance (information retrieval) ,Artificial intelligence ,Applications of artificial intelligence ,Population Health Management ,business ,Healthcare providers - Abstract
In the early phases of an epidemic response, our success will largely be dictated by our preparedness. Our ability to forecast likely scenarios is crucial in cases where responses must be made quickly and with limited information. The first section of the chapter examines several factors to consider when considering using a dataset for artificial intelligence (AI) predictions. These are: relevance, quantity, accuracy and completeness, availability, and bias. Several case studies demonstrate how these data considerations influence the development of AI applications for pandemic preparedness. The second half covers the applications of predictions in the areas of disease spread, population health management, and resource allocation. We examine various applications of AI predictions and how different organizations can put them into practice during a pandemic. In this assessment, we will specifically consider how predictions may be applied to support healthcare providers, first responders, community organizations, governments, and physicians.
- Published
- 2021
- Full Text
- View/download PDF
5. Building a COVID-19 Vulnerability Index
- Author
-
David DeCaprio, Carol J. McCall, Shaayaan Sayed, Thadeus Burgess, Joseph Gartner, and Sarthak Kothari
- Subjects
Outreach ,Coronavirus disease 2019 (COVID-19) ,Vulnerability index ,business.industry ,Environmental health ,Pandemic ,Medicine ,Acute respiratory disease ,Disease ,business ,World health ,Proxy (climate) - Abstract
COVID-19 is an acute respiratory disease that has been classified as a pandemic by the World Health Organization. Characterization of this disease is still in its early stages; however, it is known to have high mortality rates, particularly among individuals with preexisting medical conditions. Creating models to identify individuals who are at the greatest risk for severe complications due to COVID-19 will be useful for outreach campaigns to help mitigate the disease’s worst effects. While information specific to COVID-19 is limited, a model using complications due to other upper respiratory infections can be used as a proxy to help identify those individuals who are at the greatest risk. We present the results for three models predicting such complications, with each model increasing predictive effectiveness at the expense of ease of implementation.
- Published
- 2020
- Full Text
- View/download PDF
6. Combo: a whole genome comparative browser.
- Author
-
Reinhard Engels, Tamara Yu, Christopher B. Burge, Jill P. Mesirov, David DeCaprio, and James E. Galagan
- Published
- 2006
- Full Text
- View/download PDF
7. Genome sequence of Aedes aegypti, a major arbovirus vector
- Author
-
Sergio Verjovski-Almeida, James E. Galagan, Ryan C. Kennedy, Zhiyong Xi, Jason R. Miller, Eric Eisenstadt, Kyanne R. Reidenbach, Robert V. Bruggner, Yu-Hui Rogers, Hadi Quesneville, Doreen Werner, Owen White, Alexander S. Raikhel, Mario Stanke, J. Spencer Johnston, Diane D. Lovin, Evgenia V. Kriventseva, Ian T. Paulsen, Kathryn S. Campbell, Norman H. Lee, Ewan Birney, Karyn Megy, Hean Koo, David Kulp, Shelby L. Bidwell, Jingsong Zhu, Philip Montgomery, Paolo Amedeo, Yongmei Zhao, Chinnappa D. Kodira, Javier Costas, Michael C. Schatz, Steven P. Sinkins, Claire M. Fraser-Liggett, Martin Shumway, Kurt LaButti, Akio Mori, Brendan J. Loftus, Manfred Grabherr, Eric O. Stinson, Frank H. Collins, Zhijian Jake Tu, Monique R. Coy, Matt Crawford, Janice P. Vanzee, William M. Gelbart, Joshua Orvis, Peter Arensburger, Chunhong Mao, Evgeny M. Zdobnov, Saul A. Kravitz, Suely Lopes Gomes, David DeCaprio, David G. Hogenkamp, Daniel Lawson, Dennis L. Knudson, David W. Severson, George Dimopoulos, Marcelo B. Soares, Sinéad B. O'Leary, Peter W. Atkinson, David M. Jaffe, Becky deBruyn, Martin Hammond, Ana L. T. O. Nascimento, Jim Biedler, Stefan Wyder, Jose M. C. Tubio, Bruce W. Birren, Catherine A. Hill, Chad Nusbaum, Eduardo Lee, Song Li, Susan E. Brown, Jennifer R. Wortman, James R. Hogan, Hamza El-Dorry, Qi Zhao, Linda Hannick, Carlos Frederico Martins Menck, Vishvanath Nene, Jonathan Crabtree, Steven L. Salzberg, Michael H. Holmes, Maria de Fatima Bonaldo, Quinghu Ren, Mihaela Pertea, Charles Roth, Evan Mauceli, Karin Eiglmeier, Horacio Naveira, Brian J. Haas, Qiandong Zeng, Neil F. Lobo, Jennifer R. Schneider, The Institute for Genomic Research, The Institute for Genomic Research, Rockville, European Bioinformatics Institute [Hinxton] ( EMBL-EBI ), European Molecular Biology Laboratory [Hinxton], Broad Institute of MIT and Harvard ( BROAD INSTITUTE ), Broad Institute of MIT and Harvard, Virginia Polytechnic Institute and State University [Blacksburg], University College Dublin [Dublin] ( UCD ), Bloomberg School of Public Health, Johns Hopkins University ( JHU ) -Bloomberg School of Public Health, University of Geneva Medical School, Swiss Institute of Bioinformatics-University of Geneva Medical School, University of Notre Dame ( UND ), Harvard University [Cambridge], College of Agricultural Sciences Colorado State University, Colorado State University [Fort Collins] ( CSU ) -College of Agricultural Sciences, Northwestern University [Evanston], University of California [Riverside] ( UCR ), Department of Atmospheric, Oceanic and Planetary Physics [Oxford] ( AOPP ), University of Oxford [Oxford], Purdue University [West Lafayette], Centro Nacional de Genotipado Fundación Pública Galega de Medicina Xenómica Hospital Clínico Universitario de Santiago, Centro Nacional de Genotipado-Fundación Pública Galega de Medicina Xenómica-Hospital Clínico Universitario de Santiago, Institut Pasteur [Paris], Universidade de Sao Paulo Instituto de Quimica, Universidade de São Paulo ( USP ) -Instituto de Quimica, Texas A&M University [College Station], Joint Technology Center, University of Massachusetts [Amherst] ( UMass Amherst ), Universidade de Sao Paulo, Institute of Biomedical Sciences, Universidade de São Paulo ( USP ) -Institute of Biomedical Sciences, Instituto Butantan [São Paulo], Universidade da Coruña, University of Maryland [College Park], Institut Jacques Monod ( IJM ), Université Paris Diderot - Paris 7 ( UPD7 ) -Centre National de la Recherche Scientifique ( CNRS ), University of California [Santa Cruz] ( UCSC ), Complexo Hospitalario Universitario de Santiago, Universität Göttingen, Georg-August-Universität Göttingen, George Washington University Medical Center, George Washington University ( GW ), The Institute for Genomic Research (TIGR), European Bioinformatics Institute [Hinxton] (EMBL-EBI), EMBL Heidelberg, Broad Institute of MIT and Harvard (BROAD INSTITUTE), Harvard Medical School [Boston] (HMS)-Massachusetts Institute of Technology (MIT)-Massachusetts General Hospital [Boston], University College Dublin [Dublin] (UCD), Johns Hopkins Bloomberg School of Public Health [Baltimore], Johns Hopkins University (JHU), Swiss Institute of Bioinformatics [Lausanne] (SIB), Université de Lausanne (UNIL)-Université de Lausanne (UNIL)-University of Geneva Medical School, University of Notre Dame [Indiana] (UND), Colorado State University [Fort Collins] (CSU)-College of Agricultural Sciences, University of California [Riverside] (UCR), University of California, Department of Atmospheric, Oceanic and Planetary Physics [Oxford] (AOPP), Universidade de São Paulo (USP)-Instituto de Quimica, University of Massachusetts [Amherst] (UMass Amherst), University of Massachusetts System (UMASS), Universidade de São Paulo (USP)-Institute of Biomedical Sciences (ICB/USP), Universidade de São Paulo (USP), University of Maryland System, Institut Jacques Monod (IJM (UMR_7592)), Université Paris Diderot - Paris 7 (UPD7)-Centre National de la Recherche Scientifique (CNRS), University of California [Santa Cruz] (UCSC), Georg-August-University [Göttingen], The George Washington University (GW), George Washington University (GW), Université de Lausanne = University of Lausanne (UNIL)-Université de Lausanne = University of Lausanne (UNIL)-University of Geneva Medical School, Harvard University, University of California [Riverside] (UC Riverside), University of California (UC), University of Oxford, Institut Pasteur [Paris] (IP), Universidade de São Paulo = University of São Paulo (USP)-Instituto de Quimica, Universidade de São Paulo = University of São Paulo (USP)-Institute of Biomedical Sciences (ICB/USP), Universidade de São Paulo = University of São Paulo (USP), University of California [Santa Cruz] (UC Santa Cruz), Georg-August-University = Georg-August-Universität Göttingen, Zdobnov, Evgeny, and Wyder, Stefan
- Subjects
0106 biological sciences ,Male ,Transcription, Genetic ,Genome, Insect ,transposons ,receptors ,Genes, Insect ,Aedes/ genetics/metabolism ,MESH: Genes, Insect ,Yellow Fever/prevention & control/transmission ,MESH: Base Sequence ,01 natural sciences ,Dengue/prevention & control/transmission ,MESH: Protein Structure, Tertiary ,MESH: Arboviruses ,MESH : Insect Vectors ,MESH: Insect Proteins ,MESH : Anopheles gambiae ,MESH: Animals ,MESH : Arboviruses ,insects ,MESH: Yellow Fever ,superfamily ,ddc:616 ,0303 health sciences ,Anopheles ,MESH : Genes, Insect ,3. Good health ,yellow-fever mosquito ,anopheles-gambiae ,drosophila-melanogaster ,expression ,evolution ,organization ,Drosophila melanogaster ,MESH: DNA Transposable Elements ,[ SDV.BBM.GTP ] Life Sciences [q-bio]/Biochemistry, Molecular Biology/Genomics [q-bio.GN] ,Multigene Family ,Public Health ,MESH : Protein Structure, Tertiary ,MESH: Sex Characteristics ,Molecular Sequence Data ,MESH : Multigene Family ,MESH: Sex Determination (Genetics) ,MESH : Sex Determination (Genetics) ,Arbovirus ,Article ,03 medical and health sciences ,Species Specificity ,[SDV.BBM.GTP]Life Sciences [q-bio]/Biochemistry, Molecular Biology/Genomics [q-bio.GN] ,MESH: Anopheles gambiae ,Yellow Fever ,MESH: Species Specificity ,Humans ,MESH: Humans ,MESH: Molecular Sequence Data ,fungi ,MESH : Humans ,MESH : Sex Characteristics ,Membrane Transport Proteins ,Sex Determination Processes ,medicine.disease ,Insect Vectors ,Protein Structure, Tertiary ,Sex Determination (Genetics) ,MESH: Multigene Family ,MESH: Female ,MESH : Sequence Analysis, DNA ,MESH: Sequence Analysis, DNA ,Drosophila melanogaster/genetics ,MESH : Molecular Sequence Data ,Odorant binding ,Insect Proteins/genetics ,Anopheles gambiae ,MESH: Dengue ,Genome ,MESH: Membrane Transport Proteins ,Dengue ,MESH : Membrane Transport Proteins ,Aedes ,Insect Vectors/ genetics/metabolism ,MESH : Drosophila melanogaster ,MESH : Female ,Membrane Transport Proteins/genetics ,Genetics ,MESH : Insect Proteins ,Sex Characteristics ,Multidisciplinary ,MESH: Synteny ,MESH: Aedes ,MESH : Genome, Insect ,MESH : DNA Transposable Elements ,Insect Proteins ,Female ,Orthologous Gene ,MESH : Male ,MESH : Dengue ,Aedes aegypti ,MESH: Insect Vectors ,Biology ,010603 evolutionary biology ,Synteny ,MESH: Drosophila melanogaster ,Protein Structure, Tertiary/genetics ,MESH : Yellow Fever ,parasitic diseases ,medicine ,MESH : Species Specificity ,Animals ,030304 developmental biology ,MESH : Aedes ,Base Sequence ,MESH: Genome, Insect ,MESH: Transcription, Genetic ,MESH : Synteny ,MESH : Transcription, Genetic ,Sequence Analysis, DNA ,biology.organism_classification ,MESH: Male ,DNA Transposable Elements ,MESH : Base Sequence ,MESH : Animals ,Arboviruses ,Anopheles gambiae/genetics/metabolism - Abstract
We present a draft sequence of the genome of Aedes aegypti , the primary vector for yellow fever and dengue fever, which at ∼1376 million base pairs is about 5 times the size of the genome of the malaria vector Anopheles gambiae . Nearly 50% of the Ae. aegypti genome consists of transposable elements. These contribute to a factor of ∼4 to 6 increase in average gene length and in sizes of intergenic regions relative to An. gambiae and Drosophila melanogaster . Nonetheless, chromosomal synteny is generally maintained among all three insects, although conservation of orthologous gene order is higher (by a factor of ∼2) between the mosquito species than between either of them and the fruit fly. An increase in genes encoding odorant binding, cytochrome P450, and cuticle domains relative to An. gambiae suggests that members of these protein families underpin some of the biological differences between the two mosquito species.
- Published
- 2016
- Full Text
- View/download PDF
8. Cheminformatics approaches to analyze diversity in compound screening libraries
- Author
-
Lakshmi B. Akella and David DeCaprio
- Subjects
Creative visualization ,Computer science ,Drug discovery ,media_common.quotation_subject ,Principal (computer security) ,Computational Biology ,Bioinformatics ,Biochemistry ,Data science ,Chemical space ,High-Throughput Screening Assays ,Analytical Chemistry ,Small Molecule Libraries ,Identification (information) ,Cheminformatics ,Drug Discovery ,Relevance (information retrieval) ,Diversity (politics) ,media_common - Abstract
As high-throughput screening matures as a discipline, cheminformatics is playing an increasingly important role in selecting new compounds for diverse screening libraries. New visualization techniques such as multi-fusion similarity maps, scaffold trees, and principal moments of inertia plots provide complementary information on compound libraries and enable identification of unexplored regions of chemical space with potential biological relevance. Quantitative metrics have been developed to analyze libraries for properties such as natural product-likeness and shape complexity. Analysis of high-throughput screening results and drug discovery programs identify compounds problematic for screening. Taken together these approaches allow us to increase the diversity of biological outcomes available in compound screening libraries and improve the success rates of high-throughput screening against new targets without making significant increases in the size of compound libraries.
- Published
- 2010
- Full Text
- View/download PDF
9. A genome-wide map of diversity in Plasmodium falciparum
- Author
-
Danny A. Milner, David DeCaprio, Daniel E. Neafsey, Manoj T. Duraisingh, James E. Galagan, S. Mboup, Eric S. Lander, Nicole Stange-Thomann, Ousmane Sarr, Evan Mauceli, Amanda K. Lukens, Sante Gnerre, Stephen F. Schaffner, Alan Derr, David B. Jaffe, Roger C. Wiegand, Dyann F. Wirth, Daouda Ndiaye, Joanne Zainoun, Pardis C. Sabeti, Bruce W. Birren, Liuda Ziaugra, Daniel L. Hartl, Robert C. Onofrio, Sarah K. Volkman, Skye G. Waggoner, Omar Ndir, and Johanna P. Daily
- Subjects
Genetics ,Linkage disequilibrium ,Genetic diversity ,Genetic variation ,Population genetics ,Single-nucleotide polymorphism ,Plasmodium falciparum ,Biology ,biology.organism_classification ,Genotyping ,Genome - Abstract
Genetic variation allows the malaria parasite Plasmodium falciparum to overcome chemotherapeutic agents, vaccines and vector control strategies and remain a leading cause of global morbidity and mortality. Here we describe an initial survey of genetic variation across the P. falciparum genome. We performed extensive sequencing of 16 geographically diverse parasites and identified 46,937 SNPs, demonstrating rich diversity among P. falciparum parasites (pi = 1.16 x 10(-3)) and strong correlation with gene function. We identified multiple regions with signatures of selective sweeps in drug-resistant parasites, including a previously unidentified 160-kb region with extremely low polymorphism in pyrimethamine-resistant parasites. We further characterized 54 worldwide isolates by genotyping SNPs across 20 genomic regions. These data begin to define population structure among African, Asian and American groups and illustrate the degree of linkage disequilibrium, which extends over relatively short distances in African parasites but over longer distances in Asian parasites. We provide an initial map of genetic diversity in P. falciparum and demonstrate its potential utility in identifying genes subject to recent natural selection and in understanding the population genetics of this parasite.
- Published
- 2006
- Full Text
- View/download PDF
10. DNA sequence and analysis of human chromosome 18
- Author
-
Chad, Nusbaum, Michael C, Zody, Mark L, Borowsky, Michael, Kamal, Chinnappa D, Kodira, Todd D, Taylor, Charles A, Whittaker, Jean L, Chang, Christina A, Cuomo, Ken, Dewar, Michael G, FitzGerald, Xiaoping, Yang, Amr, Abouelleil, Nicole R, Allen, Scott, Anderson, Toby, Bloom, Boris, Bugalter, Jonathan, Butler, April, Cook, David, DeCaprio, Reinhard, Engels, Manuel, Garber, Andreas, Gnirke, Nabil, Hafez, Jennifer L, Hall, Catherine Hosage, Norman, Takehiko, Itoh, David B, Jaffe, Yoko, Kuroki, Jessica, Lehoczky, Annie, Lui, Pendexter, Macdonald, Evan, Mauceli, Tarjei S, Mikkelsen, Jerome W, Naylor, Robert, Nicol, Cindy, Nguyen, Hideki, Noguchi, Sinéad B, O'Leary, Keith, O'Neill, Bruno, Piqani, Cherylyn L, Smith, Jessica A, Talamas, Kerri, Topham, Yasushi, Totoki, Atsushi, Toyoda, Hester M, Wain, Sarah K, Young, Qiandong, Zeng, Andrew R, Zimmer, Asao, Fujiyama, Masahira, Hattori, Bruce W, Birren, Yoshiyuki, Sakaki, and Eric S, Lander
- Subjects
Expressed Sequence Tags ,Multidisciplinary ,Genome, Human ,Molecular Sequence Data ,DNA ,Exons ,Sequence Analysis, DNA ,Aneuploidy ,Synteny ,Introns ,Genes ,Animals ,Humans ,CpG Islands ,Chromosomes, Human, Pair 18 ,Conserved Sequence - Abstract
Chromosome 18 appears to have the lowest gene density of any human chromosome and is one of only three chromosomes for which trisomic individuals survive to term. There are also a number of genetic disorders stemming from chromosome 18 trisomy and aneuploidy. Here we report the finished sequence and gene annotation of human chromosome 18, which will allow a better understanding of the normal and disease biology of this chromosome. Despite the low density of protein-coding genes on chromosome 18, we find that the proportion of non-protein-coding sequences evolutionarily conserved among mammals is close to the genome-wide average. Extending this analysis to the entire human genome, we find that the density of conserved non-protein-coding sequences is largely uncorrelated with gene density. This has important implications for the nature and roles of non-protein-coding sequence elements.
- Published
- 2005
11. The Complete Genome and Proteome of Mycoplasma mobile
- Author
-
Howard C. Berg, Nicole Stange-Thomann, Chinnappa D. Kodira, Robert Nicol, Jonathan Butler, John E. Major, George M. Church, Jacob D. Jaffe, Cherylyn Smith, Jane E. Wilkinson, Nabil Hafez, Sheila Fisher, Shunguang Wang, Sarah E. Calvo, Tim Elkins, Bruce W. Birren, Chad Nusbaum, David DeCaprio, and Michael Fitzgerald
- Subjects
Genetics ,Genome evolution ,Proteome ,Gliding motility ,Molecular Sequence Data ,Computational Biology ,Articles ,Mycoplasma ,Genome project ,Biology ,Physical Chromosome Mapping ,medicine.disease_cause ,Genome ,Phylogenetics ,Horizontal gene transfer ,medicine ,Amino Acid Sequence ,Genome, Bacterial ,Phylogeny ,Genetics (clinical) - Abstract
Although often considered “minimal” organisms, mycoplasmas show a wide range of diversity with respect to host environment, phenotypic traits, and pathogenicity. Here we report the complete genomic sequence and proteogenomic map for the piscine mycoplasma Mycoplasma mobile, noted for its robust gliding motility. For the first time, proteomic data are used in the primary annotation of a new genome, providing validation of expression for many of the predicted proteins. Several novel features were discovered including a long repeating unit of DNA of ∼2435 bp present in five complete copies that are shown to code for nearly identical yet uniquely expressed proteins. M. mobile has among the lowest DNA GC contents (24.9%) and most reduced set of tRNAs of any organism yet reported (28). Numerous instances of tandem duplication as well as lateral gene transfer are evident in the genome. The multiple available complete genome sequences for other motile and immotile mycoplasmas enabled us to use comparative genomic and phylogenetic methods to suggest several candidate genes that might be involved in motility. The results of these analyses leave open the possibility that gliding motility might have arisen independently more than once in the mycoplasma lineage.
- Published
- 2004
- Full Text
- View/download PDF
12. The fusarium graminearum genome reveals a link between localized polymorphism and pathogen specialization
- Author
-
David R. Nelson, Hans-Werner Mewes, H. Corby Kistler, Weihong Qi, Kerry O'Donnell, John C. Kennell, Scott E. Baker, Liane R. Gale, Igor V. Tetko, Jon K. Magnuson, Gary J. Muehlbauer, Martin Münsterkötter, Martijn Rep, Karen Hilburn, Jiqiang Yao, B. Gillian Turgeon, Thérèse Ouellet, Hadi Quesneville, Li-Jun Ma, Todd J. Ward, Linda J. Harris, Gerhard Adam, Kim E. Hammond-Kosack, Sarah E. Calvo, Scott Kroken, M. Isabel G. Roncero, Kye Yong Seong, Jin-Rong Xu, Gertrud Mannhaupt, Yueh-Long Chang, Antonio Di Pietro, Ulrich Güldener, Christina A. Cuomo, Evan Mauceli, Rudolf Mitterbauer, John F. Antoniw, Rubella S. Goswami, David DeCaprio, Martin Urban, Cees Waalwijk, Jonathan D. Walton, Sante Gnerre, Frances Trail, Thomas K. Baldwin, Bruce W. Birren, Broad Institute of MIT and Harvard (BROAD INSTITUTE), Harvard Medical School [Boston] (HMS)-Massachusetts Institute of Technology (MIT)-Massachusetts General Hospital [Boston], Technische Universität Munchen - Université Technique de Munich [Munich, Allemagne] (TUM), Institute for Bioinformatics (MIPS), GSF National Research Center for Environment and Health, Purdue University [West Lafayette], Michigan State University [East Lansing], Michigan State University System, Cornell University [New York], Universidad de Córdoba [Cordoba], Pacific Northwest National Laboratory (PNNL), University of Amsterdam [Amsterdam] (UvA), Universität für Bodenkultur Wien [Vienne, Autriche] (BOKU), Rothamsted Research, University of Minnesota [Twin Cities] (UMN), University of Minnesota System, Agriculture and Agri-Food [Ottawa] (AAFC), U.S. Department of Agriculture - Agricultural Research Service - Cereal Disease Laboratory (USDA), USDA-ARS : Agricultural Research Service, Saint Louis University (SLU), University of Arizona, University of Tennessee Memphis, The University of Tennessee Health Science Center [Memphis] (UTHSC), USDA ARS, National Center for Agricultural Utilization Research (USDA ARS,), Institut Jacques Monod (IJM (UMR_7592)), Université Paris Diderot - Paris 7 (UPD7)-Centre National de la Recherche Scientifique (CNRS), Institute of Bioorganic Chemistry and Photochemistry, National Ukrainian Academy of Sciences, Institute Bioorganic Chemistry an Photochemistry, Plant Research International (PRI), Wageningen University and Research [Wageningen] (WUR), Molecular Plant Pathology (SILS, FNWI), Technische Universität München [München] (TUM), Cornell University, University of Minnesota [Twin Cities], and Wageningen University and Research Centre [Wageningen] (WUR)
- Subjects
localized polymorphism ,MESH: Sequence Analysis, DNA ,neurospora ,Sequence analysis ,Molecular Sequence Data ,Single-nucleotide polymorphism ,Genomics ,Gene mutation ,Biology ,dna ,medicine.disease_cause ,Genome ,Polymorphism, Single Nucleotide ,Evolution, Molecular ,03 medical and health sciences ,MESH: Plant Diseases ,Fusarium ,MESH: Polymorphism, Genetic ,medicine ,Point Mutation ,DNA, Fungal ,Gene ,MESH: Evolution, Molecular ,030304 developmental biology ,Plant Diseases ,MESH: Point Mutation ,Genetics ,0303 health sciences ,Mutation ,MESH: Fusarium ,Multidisciplinary ,Polymorphism, Genetic ,MESH: Molecular Sequence Data ,Biointeracties and Plant Health ,030306 microbiology ,Point mutation ,MESH: Polymorphism, Single Nucleotide ,food and beverages ,Hordeum ,Sequence Analysis, DNA ,[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM] ,MESH: DNA, Fungal ,MESH: Hordeum ,MESH: Genome, Fungal ,Fusarium graminearum genome ,PRI Biointeractions en Plantgezondheid ,Genome, Fungal ,[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM] - Abstract
We sequenced and annotated the genome of the filamentous fungus Fusarium graminearum , a major pathogen of cultivated cereals. Very few repetitive sequences were detected, and the process of repeat-induced point mutation, in which duplicated sequences are subject to extensive mutation, may partially account for the reduced repeat content and apparent low number of paralogous (ancestrally duplicated) genes. A second strain of F. graminearum contained more than 10,000 single-nucleotide polymorphisms, which were frequently located near telomeres and within other discrete chromosomal segments. Many highly polymorphic regions contained sets of genes implicated in plant-fungus interactions and were unusually divergent, with higher rates of recombination. These regions of genome innovation may result from selection due to interactions of F. graminearum with its plant hosts.
- Published
- 2007
- Full Text
- View/download PDF
13. Causal modeling using network ensemble simulations of genetic and gene expression data predicts genes involved in rheumatoid arthritis
- Author
-
John P. Carulli, Heming Xing, Karl Runge, Tanya Cashorali, Jadwiga Bienkowska, Bruce Church, David DeCaprio, Robert E. Miller, Ronenn Roubenoff, Iya Khalil, and Paul D. McDonagh
- Subjects
musculoskeletal diseases ,Immunoconjugates ,In silico ,Immunology ,Regulator ,Arthritis ,Gene Expression ,Biology ,Bioinformatics ,Abatacept ,Arthritis, Rheumatoid ,Cellular and Molecular Neuroscience ,Gene expression ,Sphingosine N-Acyltransferase ,Genetics ,medicine ,Humans ,Computer Simulation ,Molecular Biology ,lcsh:QH301-705.5 ,Rheumatology/Rheumatoid Arthritis ,Ecology, Evolution, Behavior and Systematics ,Genetics and Genomics/Medical Genetics ,Computational Biology/Systems Biology ,Ecology ,Tumor Necrosis Factor-alpha ,Gene Expression Profiling ,Interleukins ,Genetics and Genomics/Gene Therapy ,Computational Biology ,Genetics and Genomics ,Genetics and Genomics/Gene Expression ,Genetics and Genomics/Bioinformatics ,medicine.disease ,Gene expression profiling ,Genetics and Genomics/Gene Function ,Computational Theory and Mathematics ,lcsh:Biology (General) ,Modeling and Simulation ,Rheumatoid arthritis ,Antirheumatic Agents ,Tumor necrosis factor alpha ,Genetics and Genomics/Gene Discovery ,medicine.drug ,Research Article - Abstract
Tumor necrosis factor α (TNF-α) is a key regulator of inflammation and rheumatoid arthritis (RA). TNF-α blocker therapies can be very effective for a substantial number of patients, but fail to work in one third of patients who show no or minimal response. It is therefore necessary to discover new molecular intervention points involved in TNF-α blocker treatment of rheumatoid arthritis patients. We describe a data analysis strategy for predicting gene expression measures that are critical for rheumatoid arthritis using a combination of comprehensive genotyping, whole blood gene expression profiles and the component clinical measures of the arthritis Disease Activity Score 28 (DAS28) score. Two separate network ensembles, each comprised of 1024 networks, were built from molecular measures from subjects before and 14 weeks after treatment with TNF-α blocker. The network ensemble built from pre-treated data captures TNF-α dependent mechanistic information, while the ensemble built from data collected under TNF-α blocker treatment captures TNF-α independent mechanisms. In silico simulations of targeted, personalized perturbations of gene expression measures from both network ensembles identify transcripts in three broad categories. Firstly, 22 transcripts are identified to have new roles in modulating the DAS28 score; secondly, there are 6 transcripts that could be alternative targets to TNF-α blocker therapies, including CD86 - a component of the signaling axis targeted by Abatacept (CTLA4-Ig), and finally, 59 transcripts that are predicted to modulate the count of tender or swollen joints but not sufficiently enough to have a significant impact on DAS28., Author Summary The collection and analysis of clinical data has played a key role in providing insights into the diagnosis, prognosis and treatment of disease. However, it is imperative that molecular and genetic data also be collected and integrated into the creation of network models, which capture underlying mechanisms of disease and can be interrogated to elucidate previously unknown biology. Bringing data from the clinic to the bench completes the cycle of translational research, which we demonstrate with this work. We built disease models from genetics, whole blood gene expression profiles and the component clinical measures of rheumatoid arthritis using a data-driven approach that leverages supercomputing. Genetic factors can be utilized as a source of perturbation to the system such that causal connections between genetics, molecular entities and clinical outcomes can be inferred. The existing TNF-α blocker treatments for rheumatoid arthritis are only effective for approximately 2/3 of the affected population. We identified novel therapeutic intervention points that may lead to the development of alternatives to TNF-α blocker treatments. We believe this approach will provide improved drug discovery programs, new insights into disease progression, increased drug efficacy and novel biomarkers for chronic and complex diseases.
- Published
- 2010
14. A High-Density Single Nucleotide Polymorphism Map for Neurospora crassa
- Author
-
Jennifer J. Loros, David DeCaprio, Daniel J. Park, James E. Galagan, Matthew S. Sachs, Matthew R. Henn, Jay C. Dunlap, Mi Shi, William J. Belden, Randy Lambreghts, Bruce W. Birren, and Meray Baştürkmen
- Subjects
Genetics ,Expressed Sequence Tags ,Genetic Markers ,Recombination, Genetic ,Expressed sequence tag ,biology ,Neurospora crassa ,Genes, Fungal ,Bulked segregant analysis ,Chromosome Mapping ,Single-nucleotide polymorphism ,Investigations ,biology.organism_classification ,Genome ,Polymerase Chain Reaction ,Polymorphism, Single Nucleotide ,Single Nucleotide Polymorphism Map ,Species Specificity ,Genetic marker ,Cleaved amplified polymorphic sequence ,Mutation ,DNA, Fungal ,Databases, Nucleic Acid - Abstract
We report the discovery and validation of a set of single nucleotide polymorphisms (SNPs) between the reference Neurospora crassa strain Oak Ridge and the Mauriceville strain (FGSC 2555), of sufficient density to allow fine mapping of most loci. Sequencing of Mauriceville cDNAs and alignment to the completed genomic sequence of the Oak Ridge strain identified 19,087 putative SNPs. Of these, a subset was validated by cleaved amplified polymorphic sequence (CAPS), a simple and robust PCR-based assay that reliably distinguishes between SNP alleles. Experimental confirmation resulted in the development of 250 CAPS markers distributed evenly over the genome. To demonstrate the applicability of this map, we used bulked segregant analysis followed by interval mapping to locate the csp-1 mutation to a narrow region on LGI. Subsequently, we refined mapping resolution to 74 kbp by developing additional markers, resequenced the candidate gene, NCU02713.3, in the mutant background, and phenocopied the mutation by gene replacement in the WT strain. Together, these techniques demonstrate a generally applicable and straightforward approach for the isolation of novel genes from existing mutants. Data on both putative and validated SNPs are deposited in a customized public database at the Broad Institute, which encourages augmentation by community users.
- Published
- 2009
15. TB database: an integrated platform for tuberculosis research
- Author
-
Maria Mao, Michael Nitzberg, Brian Weiner, Robert Riley, James E. Galagan, Marcel Gellesch, David DeCaprio, Reinhard Engels, Phillip Montgomery, Dennis Jen, Lisa Larson, Jeremy Hubble, Peter Sisk, Michael Koehrsen, Heng Jin, Jared White, T. B. K. Reddy, Catherine A. Ball, Farrell Wymore, Gary K. Schoolnik, Gavin Sherlock, Christian Stolte, and Zachariah K. Zachariah
- Subjects
Tuberculosis ,Biomedical Research ,Gene Expression ,Genomics ,Biology ,computer.software_genre ,Genome ,Mycobacterium tuberculosis ,Tuberculosis diagnosis ,Databases, Genetic ,Genetics ,medicine ,Computer Graphics ,Humans ,Whole genome sequencing ,Comparative genomics ,Database ,Genome project ,Articles ,medicine.disease ,biology.organism_classification ,Systems Integration ,computer ,Genome, Bacterial - Abstract
The effective control of tuberculosis (TB) has been thwarted by the need for prolonged, complex and potentially toxic drug regimens, by reliance on an inefficient vaccine and by the absence of biomarkers of clinical status. The promise of the genomics era for TB control is substantial, but has been hindered by the lack of a central repository that collects and integrates genomic and experimental data about this organism in a way that can be readily accessed and analyzed. The Tuberculosis Database (TBDB) is an integrated database providing access to TB genomic data and resources, relevant to the discovery and development of TB drugs, vaccines and biomarkers. The current release of TBDB houses genome sequence data and annotations for 28 different Mycobacterium tuberculosis strains and related bacteria. TBDB stores pre- and post-publication gene-expression data from M. tuberculosis and its close relatives. TBDB currently hosts data for nearly 1500 public tuberculosis microarrays and 260 arrays for Streptomyces. In addition, TBDB provides access to a suite of comparative genomics and microarray analysis software. By bringing together M. tuberculosis genome annotation and gene-expression data with a suite of analysis tools, TBDB (http://www.tbdb.org/) provides a unique discovery platform for TB research.
- Published
- 2008
16. Using ChemBank to probe chemical biology
- Author
-
Heidi Kuehn, Kathleen Petri Seiler, David DeCaprio, Paul A. Clemons, and Mary Pat Happ
- Subjects
Internet ,Databases, Factual ,Computer science ,business.industry ,Chemical biology ,Drug Evaluation, Preclinical ,Information Storage and Retrieval ,MOLECULAR BIOLOGY METHODS ,Bioinformatics ,Biochemistry ,Data science ,Pharmaceutical Preparations ,Structural Biology ,Cheminformatics ,Database Management Systems ,The Internet ,Analysis tools ,business ,Molecular Biology - Abstract
ChemBank (http://chembank.broad.harvard.edu/) is a public, Web-based informatics environment. ChemBank stores and makes freely available data derived from small molecules and small-molecule screens and has resources for relating and studying these data. Currently, ChemBank stores information on hundreds of thousands of small molecules and hundreds of biomedically relevant assays performed at the Broad Institute screening center. Web-based analysis tools are available within ChemBank to study the relationships between small molecules, cell measurements, and cell states. This unit demonstrates the use of ChemBank data to ask and answer questions relating to chemical biology and screening experiments contained within ChemBank.
- Published
- 2008
17. Conrad: Gene prediction using conditional random fields
- Author
-
David DeCaprio, James E. Galagan, Philip Montgomery, Matthew Doherty, Matthew D. Pearson, and Jade P. Vinson
- Subjects
Conditional random field ,Gene prediction ,Genes, Fungal ,Inference ,Biology ,Machine learning ,computer.software_genre ,Aspergillus nidulans ,Discriminative model ,Artificial Intelligence ,Encoding (memory) ,Genetics ,Feature (machine learning) ,Methods ,Hidden Markov model ,Genetics (clinical) ,Likelihood Functions ,Markov chain ,business.industry ,Discriminant Analysis ,Reference Standards ,Markov Chains ,Cryptococcus neoformans ,Artificial intelligence ,Chromosomes, Fungal ,business ,computer ,Algorithms ,Software - Abstract
We present Conrad, the first comparative gene predictor based on semi-Markov conditional random fields (SMCRFs). Unlike the best standalone gene predictors, which are based on generalized hidden Markov models (GHMMs) and trained by maximum likelihood, Conrad is discriminatively trained to maximize annotation accuracy. In addition, unlike the best annotation pipelines, which rely on heuristic and ad hoc decision rules to combine standalone gene predictors with additional information such as ESTs and protein homology, Conrad encodes all sources of information as features and treats all features equally in the training and inference algorithms. Conrad outperforms the best standalone gene predictors in cross-validation and whole chromosome testing on two fungi with vastly different gene structures. The performance improvement arises from the SMCRF’s discriminative training methods and their ability to easily incorporate diverse types of information by encoding them as feature functions. On Cryptococcus neoformans, configuring Conrad to reproduce the predictions of a two-species phylo-GHMM closely matches the performance of Twinscan. Enabling discriminative training increases performance, and adding new feature functions further increases performance, achieving a level of accuracy that is unprecedented for this organism. Similar results are obtained on Aspergillus nidulans comparing Conrad versus Fgenesh. SMCRFs are a promising framework for gene prediction because of their highly modular nature, simplifying the process of designing and testing potential indicators of gene structure. Conrad’s implementation of SMCRFs advances the state of the art in gene prediction in fungi and provides a robust platform for both current application and future research.
- Published
- 2007
18. Enabling a Community to Dissect an Organism: Overview of the Neurospora Functional Genomics Project
- Author
-
Mi Shi, Michael Plamann, Matthew D. Pearson, Michael Koerhsen, Jay C. Dunlap, Christopher M. Crew, James E. Galagan, Hildur V. Colot, Takao Kasuga, Jeffrey P. Townsend, Kevin McCluskey, Lisa Larson, Jennifer J. Loros, Richard L. Weiss, Randy Lambreghts, Susan Curilla, Junhuan Xu, Matthew Crawford, Liubov Litvinkova, Lorena Altamirano, David DeCaprio, Bruce W. Birren, Patrick D. Collopy, Gyungsoon Park, Chaoguang Tian, Meray Baştürkmen, Mary Anne Nelson, Carol S. Ringelberg, Phil Montgomery, Matthew S. Sachs, Matthew R. Henn, Heather M. Hood, Katherine A. Borkovich, Gloria E. Turner, and N. Louise Glass
- Subjects
Positional cloning ,Genomics ,Computational biology ,Biology ,Genome ,Neurospora ,Polymorphism, Single Nucleotide ,Article ,Neurospora crassa ,DNA, Fungal ,Gene Library ,Oligonucleotide Array Sequence Analysis ,Genetics ,Base Sequence ,Gene Expression Profiling ,fungi ,Fungal genetics ,Chromosome Mapping ,biology.organism_classification ,Phenotype ,Genetic Techniques ,Mutation ,DNA microarray ,Genome, Fungal ,Functional genomics ,Gene Deletion - Abstract
A consortium of investigators is engaged in a functional genomics project centered on the filamentous fungus Neurospora, with an eye to opening up the functional genomic analysis of all the filamentous fungi. The overall goal of the four interdependent projects in this effort is to acccomplish functional genomics, annotation, and expression analyses of Neurospora crassa, a filamentous fungus that is an established model for the assemblage of over 250,000 species of nonyeast fungi. Building from the completely sequenced 43-Mb Neurospora genome, Project 1 is pursuing the systematic disruption of genes through targeted gene replacements, phenotypic analysis of mutant strains, and their distribution to the scientific community at large. Project 2, through a primary focus in Annotation and Bioinformatics, has developed a platform for electronically capturing community feedback and data about the existing annotation, while building and maintaining a database to capture and display information about phenotypes. Oligonucleotide-based microarrays created in Project 3 are being used to collect baseline expression data for the nearly 11,000 distinguishable transcripts in Neurospora under various conditions of growth and development, and eventually to begin to analyze the global effects of loss of novel genes in strains created by Project 1. cDNA libraries generated in Project 4 document the overall complexity of expressed sequences in Neurospora, including alternative splicing alternative promoters and antisense transcripts. In addition, these studies have driven the assembly of an SNP map presently populated by nearly 300 markers that will greatly accelerate the positional cloning of genes.
- Published
- 2007
19. Insights from the genome of the biotrophic fungal plant pathogen Ustilago maydis
- Author
-
Claire M. Wade, Matt J. Cahill, Eric C.H. Ho, Plinio Guzmán, Bruce W. Birren, David DeCaprio, Li-Jun Ma, Gabi Friedrich, Jan Schirawski, Sarah Young, Gero Steinberg, Gertrud Mannhaupt, Heine J. Deelstra, Steven J. Klosterman, Patricia Sánchez-Alonso, Thomas Brefort, Karen M. Snetselaar, Candace Swimmer, Han A. B. Wösten, Christoph W. Basse, Jose Ruiz-Herrera, Dirk Haase, Jonathan Butler, Regine Kahmann, Michael P. McCann, Hans-Werner Mewes, Juan Manuel González-Prieto, Feng Chen, Martin Münsterkötter, Barry J. Saville, Chad Nusbaum, Evan Mauceli, Kylie J. Boyce, Jason E. Stajich, Miroslav Vranes, Scott E. Gold, Olaf Müller, Michael H. Perlin, John C. Kennell, Björn Sandrock, Volker Vincon, James W. Kronstad, Valentina Vysotskaia, Oliver Ladendorf, Michael Bölker, Andreas Gnirke, Peter Schreier, José Pérez-Martín, Matthias Oesterheld, Lazaro Molina, Isolde Häuser-Hahn, David B. Jaffe, Michael Feldbrügge, Shaowu Meng, Martin Vaupel, José I. Ibeas, Jörg Kämper, Darren Mark Platt, Cristina G. Reynaga-Peña, Jonathan Margolis, Jana Klose, Rafael Sentandreu, Karin Münch, Edda Koopmann, Ulrich Güldener, Mark L. Farman, Doris Greilinger, Uta Fuchs, Hartmut Voss, Flora Banuett, Lucila Ortiz-Castellanos, Thomas Schlüter, Artemio Mendoza-Mendoza, Mario Scherer, William K. Holloman, Weixi Li, Ronald P. de Vries, Sarah E. Calvo, Nicole Rössel, and James E. Galagan
- Subjects
Corn smut ,Genetics ,Multidisciplinary ,biology ,Virulence ,Ustilago ,Gene Expression Profiling ,Genes, Fungal ,Fungal genetics ,Genomics ,biology.organism_classification ,Genome ,Zea mays ,Fungal Proteins ,Gene Expression Regulation, Fungal ,Multigene Family ,Gene family ,Genome, Fungal ,Gene ,Pathogen - Abstract
Ustilago maydis is a ubiquitous pathogen of maize and a well-established model organism for the study of plant-microbe interactions. This basidiomycete fungus does not use aggressive virulence strategies to kill its host. U. maydis belongs to the group of biotrophic parasites (the smuts) that depend on living tissue for proliferation and development. Here we report the genome sequence for a member of this economically important group of biotrophic fungi. The 20.5-million-base U. maydis genome assembly contains 6,902 predicted protein-encoding genes and lacks pathogenicity signatures found in the genomes of aggressive pathogenic fungi, for example a battery of cell-wall-degrading enzymes. However, we detected unexpected genomic features responsible for the pathogenicity of this organism. Specifically, we found 12 clusters of genes encoding small secreted proteins with unknown function. A significant fraction of these genes exists in small gene families. Expression analysis showed that most of the genes contained in these clusters are regulated together and induced in infected tissue. Deletion of individual clusters altered the virulence of U. maydis in five cases, ranging from a complete lack of symptoms to hypervirulence. Despite years of research into the mechanism of pathogenicity in U. maydis, no 'true' virulence factors had been previously identified. Thus, the discovery of the secreted protein gene clusters and the functional demonstration of their decisive role in the infection process illuminate previously unknown mechanisms of pathogenicity operating in biotrophic fungi. Genomic analysis is, similarly, likely to open up new avenues for the discovery of virulence determinants in other pathogens.
- Published
- 2006
20. DNA sequence of human chromosome 17 and analysis of rearrangement in the human lineage
- Author
-
James R. Lupski, Xiaohong Liu, Sante Gnerre, Bruce W. Birren, Nicole R. Allen, James G. R. Gilbert, Pawel Stankiewicz, Tashi Lokyitsang, Jane E. Loveland, Amr Abouelleil, John E. Major, Manuel Garber, Steven A. McCarroll, April Cook, Mark L. Borowsky, Annie Lui, David C. Schwartz, Jonathan Butler, Christine Nicholson, Chad Nusbaum, Atanas Mihalev, Laurens G. Wilming, Catherine Hosage Norman, Daniel S. Hagopian, Jessica A. Lehoczky, Robert Nicol, David Swarbreck, Cindy Nguyen, Varsha K. Khodiyar, Ted Sharpe, Gavin K. Laird, Evan Mauceli, Russell J. Grocock, Jane Rogers, S. Searle, Charles Shaw-Smith, Michael C. Zody, Toby Bloom, Boris Bugalter, David B. Jaffe, Pieter J. de Jong, Jean L. Chang, Kazutoyo Osoegawa, Michael Kamal, Sinéad B. O'Leary, Steven A. Goldstein, David J. Adams, Richard Gibson, Jennifer Harrow, Sean Humphray, Chao-Kung Chen, Michael Fitzgerald, Allan Bradley, Ken Dewar, Charles A. Steward, Eric S. Lander, E. Hart, David DeCaprio, Nabil Hafez, Matthew C. Jones, Darren Grafham, Tim Hubbard, Vijay Venkataraman, Jonathan M. Mudge, Charles A. Whittaker, Kurt LaButti, Andrew Zimmer, Christina A. Cuomo, Xiaoping Yang, Benjamin Corum, Sarah Young, Pendexter Macdonald, Lucy Matthews, and Weimin Bi
- Subjects
Genetics ,Base Composition ,Multidisciplinary ,Sequence Analysis, DNA ,Biology ,Synteny ,Article ,Chromosome 17 (human) ,Evolution, Molecular ,Mice ,Chromosome 4 ,Chromosome 16 ,Long Interspersed Nucleotide Elements ,Chromosome 19 ,Gene Duplication ,Animals ,Humans ,Chromosome 21 ,Chromosome 22 ,Chromosomal inversion ,Segmental duplication ,Chromosomes, Human, Pair 17 ,Short Interspersed Nucleotide Elements - Abstract
Chromosome 17 is unusual among the human chromosomes in many respects. It is the largest human autosome with orthology to only a single mouse chromosome1, mapping entirely to the distal half of mouse chromosome 11. Chromosome 17 is rich in protein-coding genes, having the second highest gene density in the genome2,3. It is also enriched in segmental duplications, ranking third in density among the autosomes4. Here we report a finished sequence for human chromosome 17, as well as a structural comparison with the finished sequence for mouse chromosome 11, the first finished mouse chromosome. Comparison of the orthologous regions reveals striking differences. In contrast to the typical pattern seen in mammalian evolution5,6, the human sequence has undergone extensive intrachromosomal rearrangement, whereas the mouse sequence has been remarkably stable. Moreover, although the human sequence has a high density of segmental duplication, the mouse sequence has a very low density. Notably, these segmental duplications correspond closely to the sites of structural rearrangement, demonstrating a link between duplication and rearrangement. Examination of the main classes of duplicated segments provides insight into the dynamics underlying expansion of chromosome-specific, low-copy repeats in the human genome.
- Published
- 2006
21. Analysis of the DNA sequence and duplication history of human chromosome 15
- Author
-
Sandra Stewart, Amardeep Kaur, Evan Mauceli, Kerri Topham, Harindra Arachchi, Brian Birditt, Jerome Naylor, Toby Bloom, Sarah Young, Anup Madan, Reinhard Engels, Manuel Garber, Sabrina M. Stone, Anuradha Madan, Amber L Ratcliffe, Ryan Nesbitt, Amr Abouelleil, Keith O'Neill, Scott Bloom, Katherine M. B. Sneddon, Dascena Vincent, Lester Dorris, Steven Rounsley, Jennifer L. Hall, Michael Fitzgerald, David B. Jaffe, Grace Hensley, Gary Gearin, Devin P. Locke, Asha Kamat, Ericka M. Johnson, Jonathan Butler, Sinéad B. O'Leary, Jeremy Burke, Lida Baradarani, Jean L. Chang, Kurt DeArellano, Michael Kamal, Andrew Zimmer, Annie Lui, Eric S. Lander, Charles A. Whittaker, Monica Dors, Chad Nusbaum, David DeCaprio, Chinnappa D. Kodira, Leroy Hood, Robert Nicol, Ted Sharpe, Evan E. Eichler, Nissa Abbasi, Christina A. Cuomo, Glen Munson, Mark L. Borowsky, Shunguang Wang, Michael C. Zody, Shizhen Qin, Charlien Jones, Peter Fleetwood, Xinwei She, Pendexter Macdonald, Ken Dewar, April Cook, Xiaoping Yang, Bruce W. Birren, Jessica Fahey, Cynthia Friedman, Carrie Sougnez, and Lee Rowen
- Subjects
Molecular Sequence Data ,Biology ,Synteny ,Evolution, Molecular ,Chromosome 16 ,Chromosome 19 ,Gene Duplication ,Animals ,Humans ,Conserved Sequence ,Phylogeny ,Segmental duplication ,Genetics ,Chromosome 7 (human) ,Chromosomes, Human, Pair 15 ,Multidisciplinary ,Polymorphism, Genetic ,Genome, Human ,Sequence Analysis, DNA ,Macaca mulatta ,Chromosome 17 (human) ,Chromosome 4 ,Genes ,Haplotypes ,Multigene Family ,Chromosome 21 ,Chromosome 22 - Abstract
Here we present a finished sequence of human chromosome 15, together with a high-quality gene catalogue. As chromosome 15 is one of seven human chromosomes with a high rate of segmental duplication, we have carried out a detailed analysis of the duplication structure of the chromosome. Segmental duplications in chromosome 15 are largely clustered in two regions, on proximal and distal 15q; the proximal region is notable because recombination among the segmental duplications can result in deletions causing Prader-Willi and Angelman syndromes. Sequence analysis shows that the proximal and distal regions of 15q share extensive ancient similarity. Using a simple approach, we have been able to reconstruct many of the events by which the current duplication structure arose. We find that most of the intrachromosomal duplications seem to share a common ancestry. Finally, we demonstrate that some remaining gaps in the genome sequence are probably due to structural polymorphisms between haplotypes; this may explain a significant fraction of the gaps remaining in the human genome.
- Published
- 2005
22. Genome sequence, comparative analysis and haplotype structure of the domestic dog
- Author
-
Tsering Wangchuk, Mayank Kumar, Sharon Stavropoulos, James Cuff, Mostafa Benamara, David DeCaprio, Birhane Hagos, Nathaniel Novod, Tashi Lokyitsang, Nyima Norbu, Jennifer Baldwin, Sabrina M. Stone, Catherine Stone, Geneva Young, Osebhajajeme Egbiremolen, Dawa Thoulutsang, Tanya Mihova, Lisa Kim, Julie Sahalie, Jan Macdonald, Amr Abouelleil, Toby Bloom, Yama Cheshatsang, Carolyne Bardeleben, Qing Yu, Berta Blitshteyn, Tuyen T. Nguyen, Tarjei S. Mikkelsen, Edward Grandbois, Claire M. Wade, John E. Major, Filip Rege, Cindy Nguyen, Andrew Barry, Tracey Honan, Pablo Alvarez, Andy Vo, Manuel Garber, Cristyn Kells, Rachel Mittelman, Lucien Oyono, Norbu Dhargay, Sean M. Sykes, Diallo Ferguson, Tyler Aldredge, Tenchoe Nyima, Todd Sparrow, Daniel S. Hagopian, Christophe Hitte, Andreas Heger, Jane E. Wilkinson, Verneda Ray, Peter Rogov, Ewen F. Kirkness, Jill Falk, Robert Nicol, Christopher Patti, Danielle Perrin, Ted Sharpe, Douglas Smith, Peter Olandt, Matthew Breen, Ali Aslam, Cherylyn Smith, Tara Biagi, Diane Gage, Jean L. Chang, Karen Hughes Miller, Valentine Mlenga, Andrea Horn, Jessie Sloan, Claire M. Healy, Adam Wilson, Ngawang Sherpa, Riza M. Daza, David B. Jaffe, Leonid Boguslavskiy, Jody Camarata, Peter Kisner, William H. Lee, Kunsang Dorjee, M. Husby, Sante Gnerre, Kunsang Gyaltsen, Asha Kamat, Jonathan Butler, Terrance Shea, Alicia Franke, Patrick Cooke, Rayale Rameau, Andrew Zimmer, Gary Gearin, Nabil Hafez, Kerri Topham, Kebede Maru, Chris P. Ponting, Jerome Naylor, Yama Thoulutsang, Keith O'Neill, Jinlei Liu, Manolis Kellis, Claude Bonnet, Claudel Antoine, Passang Dorje, Adal Abebe, Tsamla Tsamla, Michael Kleber, Michael Weiand, Audra Goyette, Rachael Thomas, Lisa Zembek, Atanas Mihalev, Daniel Bessette, Helen Vassiliev, Pasang Bachantsang, Adam Navidi, Kathleen Dooley, Caleb Webber, Pierre Tchuinga, Tashi Bayul, Michael Kamal, Heidi G. Parker, Ben Kanga, Kimberly Dooley, Nadia Calixte, Mostafa Ait-zahra, Niall J. Lennon, Ira Topping, Eric S. Lander, Pieter J. deJong, Nicole R. Allen, Peter An, Boris Boukhgalter, Richard Elong, Thomas E. Landers, Anthony Rachupka, Michael Fitzgerald, Lisa Leuper, William Brockman, Marcia Lara, Susan Faro, Elaine A. Ostrander, Joanne Zainoun, Leigh Anne Hunnicutt, Mark J. Daly, Leanne Hughes, April Cook, Patrick Cahill, Sujaa Raghuraman, Manfred Grabherr, Robert K. Wayne, Adam Brown, Xiaohong Liu, Charles Matthews, Scott Anderson, Margaret Priest, Shailendra Yadav, Evan Mauceli, Kerstin Lindblad-Toh, Patricia Ferreira, Yeshi Lokyitsang, Harindra Arachchi, Alexandre Melnikov, Christina Raymond, James Meldrim, Dmitry Khazanovich, Mieke Citroen, Aaron M. Berlin, Alix Chinh Kieu, John Stalker, Francis Galibert, Noah Duffey, Krista Lance, Louis Meneus, Jennifer Ruth Sadler Hall, Choe Norbu, Pema Tenzing, Richard Marabella, Chee-Wye Chin, Karen Foley, Xiaoping Yang, Nga Nguyen, Tenzin Dawoe, Ryan Hegarty, Julie Rogers, Joseph Graham, Chelsea D. Foley, Leonidas Mulrain, Tsering Wangdi, Karin Decktor, Sarah LeVine, Shuli Yang, Dennis C. Friedrich, Tina Goode, Cecil Rise, Teena Mehta, Laura Ayotte, Michele Clamp, Nicole Stange-Thomann, Annie Lui, Edward J. Kulbokas, Pema Phunkhang, Alan Dupes, Elinor K. Karlsson, Lynne Aftuck, Sahal Osman, Abderrahim Farina, Barry O'Neill, Diana M. Shih, Xiaohui Xie, Lester Dorris, Vijay Venkataraman, Benjamin Jester, Sampath Settipalli, Thu Nguyen, Alville Collymore, Klaus-Peter Koepfli, Senait Tesfaye, Nathan Houde, Susan McDonough, Leo Goodstadt, Glen Munson, Georgia Giannoukos, Jeffrey Chu, Nathan B. Sutter, Sheila A. Fisher, Charlien Jones, Michael C. Zody, Jianying Shi, John P. Pollinger, Mechele Sheehan, Stephen M. J. Searle, Fritz Pierre, Jason Blye, Jean-Pierre Leger, and Stuart DeGray
- Subjects
Male ,Genomics ,Single-nucleotide polymorphism ,Biology ,Genome ,Polymorphism, Single Nucleotide ,Synteny ,Conserved sequence ,Evolution, Molecular ,Mice ,Dogs ,Molecular evolution ,Animals ,Humans ,Dog Diseases ,Conserved Sequence ,Short Interspersed Nucleotide Elements ,Genetics ,Whole genome sequencing ,Multidisciplinary ,Dog leukocyte antigen ,Haplotype ,Rats ,Haplotypes ,Mutagenesis ,biology.protein ,Hybridization, Genetic ,Female - Abstract
Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.
- Published
- 2005
23. DNA sequence and analysis of human chromosome 8
- Author
-
Chad Nusbaum, Tarjei S. Mikkelsen, Michael C. Zody, Shuichi Asakawa, Stefan Taudien, Manuel Garber, Chinnappa D. Kodira, Mary G. Schueler, Atsushi Shimizu, Charles A. Whittaker, Jean L. Chang, Christina A. Cuomo, Ken Dewar, Michael G. FitzGerald, Xiaoping Yang, Nicole R. Allen, Scott Anderson, Teruyo Asakawa, Karin Blechschmidt, Toby Bloom, Mark L. Borowsky, Jonathan Butler, April Cook, Benjamin Corum, Kurt DeArellano, David DeCaprio, Kathleen T. Dooley, Lester Dorris, Reinhard Engels, Gernot Glöckner, Nabil Hafez, Daniel S. Hagopian, Jennifer L. Hall, Sabine K. Ishikawa, David B. Jaffe, Asha Kamat, Jun Kudoh, Rüdiger Lehmann, Tashi Lokitsang, Pendexter Macdonald, John E. Major, Charles D. Matthews, Evan Mauceli, Uwe Menzel, Atanas H. Mihalev, Shinsei Minoshima, Yuji Murayama, Jerome W. Naylor, Robert Nicol, Cindy Nguyen, Sinéad B. O'Leary, Keith O'Neill, Stephen C. J. Parker, Andreas Polley, Christina K. Raymond, Kathrin Reichwald, Joseph Rodriguez, Takashi Sasaki, Markus Schilhabel, Roman Siddiqui, Cherylyn L Smith, Tam P. Sneddon, Jessica A. Talamas, Pema Tenzin, Kerri Topham, Vijay Venkataraman, Gaiping Wen, Satoru Yamazaki, Sarah K. Young, Qiandong Zeng, Andrew R. Zimmer, Andre Rosenthal, Bruce W. Birren, Matthias Platzer, Nobuyoshi Shimizu, and Eric S. Lander
- Subjects
Male ,Multidisciplinary ,Molecular Sequence Data ,Sequence Analysis, DNA ,DNA, Satellite ,Immunity, Innate ,Defensins ,Euchromatin ,Evolution, Molecular ,Contig Mapping ,Multigene Family ,Animals ,Humans ,Female ,Chromosomes, Human, Pair 8 - Abstract
The International Human Genome Sequencing Consortium (IHGSC) recently completed a sequence of the human genome. As part of this project, we have focused on chromosome 8. Although some chromosomes exhibit extreme characteristics in terms of length, gene content, repeat content and fraction segmentally duplicated, chromosome 8 is distinctly typical in character, being very close to the genome median in each of these aspects. This work describes a finished sequence and gene catalogue for the chromosome, which represents just over 5% of the euchromatic human genome. A unique feature of the chromosome is a vast region of approximately 15 megabases on distal 8p that appears to have a strikingly high mutation rate, which has accelerated in the hominids relative to other sequenced mammals. This fast-evolving region contains a number of genes related to innate immunity and the nervous system, including loci that appear to be under positive selection--these include the major defensin (DEF) gene cluster and MCPH1, a gene that may have contributed to the evolution of expanded brain size in the great apes. The data from chromosome 8 should allow a better understanding of both normal and disease biology and genome evolution.
- Published
- 2005
24. Gene prediction with conditional random fields
- Author
-
James Galagan and David DeCaprio., Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science., Doherty, Matthew K, James Galagan and David DeCaprio., Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science., and Doherty, Matthew K
- Abstract
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2007., Includes bibliographical references (p. 75-77)., The accurate annotation of an organism's protein-coding genes is crucial for subsequent genomic analysis. The rapid advance of sequencing technology has created a gap between genomic sequences and their annotations. Automated annotation methods are needed to bridge this gap, but existing solutions based on hidden Markov models cannot easily incorporate diverse evidence to make more accurate predictions. In this thesis, I built upon the semi-Markov conditional random field framework created by DeCaprio et al. to predict protein-coding genes in DNA sequences. Several novel extensions were designed and implemented, including a 29-state model with both semi-Markov and Markov states, an N-best Viterbi inference algorithm, several classes of discriminative feature functions that incorporate diverse evidence, and parallelization of the training and inference algorithms. The extensions were tested on the genomes of Phytophthora infestans, Culex pipiens, and Homo sapiens. The gene predictions were analyzed and the benefits of discriminative methods were explored., by Matthew K. Doherty., M.Eng.
- Published
- 2008
25. Abstract 5663: Confirmation of peroxiredoxin II as a driver gene for doxorubicin sensitivity identified from drug-induced expression profiling of the NCI-60 cell lines using Reverse Engineering (REFS) network models
- Author
-
James H. Doroshow, Boris Hayete, Curtis Hose, Karl Runge, David DeCaprio, Anne Monks, Paul McDonaugh, Iya Khalil, and Beverley A. Teicher
- Subjects
Gene expression profiling ,Cancer Research ,Gene knockdown ,Oncology ,Chemistry ,Cell culture ,RNA interference ,In silico ,Gene expression ,Cancer research ,Peroxiredoxin 2 ,Molecular biology ,Gene - Abstract
Analysis of expression profiling data from the NCI-60 cell line panel, designed to detect transcripts whose expression levels change in response to doxorubicin (Dox) treatment (100nM and 1000nM) for 2, 6 and 24 h, identified peroxiredoxin II (PRDX2) as a modulator of Dox sensitivity. The antitumor activity of Dox is pleiotropic but has been attributed in part to generation of reactive oxygen species which may also be an important factor in myocardial toxicity. Variation in the gene expression data produced by the increasing Dox concentrations together with the variation in response to Dox across the NCI-60 panel was used to construct causal gene expression network models of the mechanism of efficacy of Dox using the REFS™ platform from GNS Healthcare. REFS™ is a scalable, super computer-enabled framework for discovering causal network models directly from experimental data that requires highly optimized machine-learning algorithms run on massively parallel cloud-based supercomputers. REFS™ extracts information in two steps; Reverse Engineering to identify causal relationships followed by Forward Simulation to simulate interventions in silico. In this case, network models were used to simulate the predicted effect of a gene knockdown for each transcript in the NCI-60 cell lines on Dox sensitivity. PRDX2, which encodes for peroxiredoxin 2, an antioxidant enzyme that reduces hydrogen peroxide and has been demonstrated to have a role in protecting against reactive oxygen damage, was identified as a causal driver gene responsible for Dox sensitivity in the renal cell line, ACHN. Validation of the in silico predictions required RNAi to knock down PRDX2 gene expression (75 - 90% inhibition of expression) with 2 siRNA's in the ACHN and HCT-116 cell lines. Replicate experiments (n=3) demonstrated a significant (p Citation Format: {Authors}. {Abstract title} [abstract]. In: Proceedings of the 103rd Annual Meeting of the American Association for Cancer Research; 2012 Mar 31-Apr 4; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2012;72(8 Suppl):Abstract nr 5663. doi:1538-7445.AM2012-5663
- Published
- 2012
- Full Text
- View/download PDF
26. Erratum: Corrigendum: DNA sequence and analysis of human chromosome 18
- Author
-
Michael Fitzgerald, Amr Abouelleil, Bruce W. Birren, Andreas Gnirke, Takehiko Itoh, Cindy Nguyen, Yasushi Totoki, Annie Lui, Chinnappa D. Kodira, Chad Nusbaum, Michael C. Zody, Cherylyn Smith, Christina A. Cuomo, Yoshiyuki Sakaki, David DeCaprio, H Wain, Jessica A. Talamas, Bruno Piqani, Mark L. Borowsky, Manuel Garber, Qiandong Zeng, Jonathan Butler, Masahira Hattori, Charles Whittaker, Scott F. Anderson, David B. Jaffe, Atsushi Toyoda, Boris Bugalter, Ken Dewar, April Cook, Asao Fujiyama, Robert Nicol, Kerri Topham, Jerome Naylor, Catherine Hosage Norman, Tarjei S. Mikkelsen, Hideki Noguchi, Sinéad B. O'Leary, Reinhard Engels, Andrew Zimmer, Keith O'Neill, Nabil Hafez, Jean L. Chang, Toby Bloom, Sarah Young, Jennifer L. Hall, Evan Mauceli, Michael Kamal, Pendexter Macdonald, Yoko Kuroki, Xiaoping Yang, Eric S. Lander, Jessica A. Lehoczky, Nicole R. Allen, and Todd D. Taylor
- Subjects
Genetics ,Multidisciplinary ,Chromosome 18 ,Biology ,DNA sequencing - Abstract
Nature 437, 551–555 (2005) doi:10.1038/nature03983 The name of Keith O'Neill was accidentally omitted from the published author list. He is at the first affiliation in the address list.
- Published
- 2005
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.