45 results on '"Schraiber JG"'
Search Results
2. Ancient human genomes suggest three ancestral populations for present-day Europeans
- Author
-
Joanna L. Mountain, Michael F. Hammer, Ruslan Ruizbakiev, Cesare de Filippo, Kumarasamy Thangaraj, David E. C. Cole, Haim Ben-Ami, Leila Laredj, Mark Lipson, Jüri Parik, Valentino Romano, Andres Ruiz-Linares, Fouad Berrada, Dominique Delsate, Ugur Hodoglugil, Antti Sajantila, Olga Utevska, Shahlo Turdikulova, Tor Hervig, Ludmila P. Osipova, Hovhannes Sahakyan, Robert W. Mahley, Ramiro Barrantes, Kirsten I. Bos, Stanislav Dryomov, Peter H. Sudmant, Nadin Rohland, Heng Li, Gabriel Renaud, Mikhail Voevoda, Claudio M. Bravi, Jean-Michel Guinet, Rem I. Sukernik, Joachim Wahl, Matthias Meyer, Christos Economou, Kay Prüfer, Graciela Bailliet, Mait Metspalu, Mikhail Churnosov, Iosif Lazaridis, Johannes Krause, Bonnie Berger, Levon Yepiskoposyan, Francesca Brisighelli, Francesco Calì, Irene Gallego Romero, Oleg Balanovsky, George Ayodo, Alan Cooper, Alissa Mittnik, Julio Molina, George van Driem, Jean-Michel Dugoujon, Larissa Damba, Fedor Platonov, Nick Patterson, David Reich, Thomas B. Nyambo, David Comas, Olga L. Posukh, Béla Melegh, Draga Toncheva, Alena Kushniarevich, Brenna M. Henn, Montgomery Slatkin, René Vasquez, Elena B. Starikovskaya, Joachim Burger, Ayele Tarekegn, Tatijana Zemunik, Ene Metspalu, Sena Karachanak-Yankova, Lalji Singh, Wolfgang Haak, Susanna Sawyer, Rick A. Kittles, Cheryl A. Winkler, Svante Pääbo, Francisco Rothhammer, Marina Gubina, Pierre Zalloua, Aashish R. Jha, Swapan Mallick, Sergi Castellano, Qiaomei Fu, Desislava Nesheva, Sergey Litvinov, Ingrida Uktveryte, Michael Francken, Cosimo Posth, Theologos Loukidis, Cristian Capelli, Janet Kelso, Sarah A. Tishkoff, Toomas Kivisild, Mark G. Thomas, Elin Fornander, Mercedes Villena, Fredrik Hallgren, Vaidutis Kučinskas, Daniel Corach, George B.J. Busby, Judit Bene, William Klitz, Hamza A. Babiker, Karola Kirsanow, Ruth Bollongino, Rita Khusainova, Evan E. Eichler, Sardana A. Fedorova, Klemetti Näkkäläjärvi, Igor Rudan, Susanne Nordenfelt, Joshua G. Schraiber, Elena Balanovska, Antonio Salas, Richard Villems, Gabriel Bedoya, Elza Khusnutdinova, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Department of Mathematics, Lipson, Mark, Berger Leighton, Bonnie, Lazaridis,I, Patterson,P, Mittnik,A, Renaud,G, Mallick,S, Kirsanow,K, Sudmant,PH, Schraiber,JG, Castellano,S, Lipson,M, Berger,B, Economou,C, Bollongino,R, Fu,Q, Bos,KI, Nordenfelt,S, Li,H, De Filippo,C, Pruefer,K, Sawyer, Posth,C, Haak1,H, Hallgren,F, Fornander,E, Rohland,N, Delsate,D, Francken,M, Guinet,JM, Wah,J, Ayodo,G, Babiker,HA, Bailliet,G, Balanovska,E, Balanovsky,O, Barrantes,R, Bedoya,G, Ben-Ami,H, Bene,J, Berrada,F, Bravi,CM, Brisighelli,F, Busby,GBJ, Cali,F, Churnosov,M, Cole,DEC, Corach,D, Damba,L, van Driem,G, Dryomov,S, Dugoujon,JM, Fedorova,SA, Gallego Romero,I, Gubina,M, Hammer,M, Henn,BM, Hervig,T, Hodoglugi,U, Jha,AR, Karachanak-Yankova,S, Khusainova,R, Khusnutdinova,E, Kittles,R:Kivisild,T, Klitz,W, Kucˇinskas,V, Kushniarevich,A, Laredj,L, Litvinov,S, Loukidis,T, Mahley,RW, Melegh,B, Metspalu,E, Molina,J, Mountain,J, Na¨kka¨la¨ja¨rvi,K, Nesheva,D, Nyambo,T, Osipova,L, Parik,J, Platonov,F, Posukh,O, Romano,V, Rothhammer,F, Rudan,I, Ruizbakiev,R, Sahakyan,H, Sajantila,A, Salas,A, Starikovskaya,EB, Tarekegn,A, Toncheva,D, Turdikulova,S, Uktveryte,I, Utevska,O, Vasquez,R, Villena,M, Voevoda,M, Winkler,CA, Yepiskoposyan,L, Zalloua,P, Zemunik,T, Cooper, Capelli,C, Thomas,MG, Ruiz-inares,A, Tishkoff,SA, Singh,L, Thangaraj,K, Villems,R, Comas,D, Sukernik,R, Metspalu,M, Meyer,M, Eichler,EE, Burger,J, Slatkin,M, Pa¨a¨bo,S, Kelso,J, Reich,D, and Krause,J
- Subjects
History ,Neanderthal ,Biología ,Population Dynamics ,Present day ,Genoma humà ,Genome ,purl.org/becyt/ford/1 [https] ,Basal (phylogenetics) ,Settore BIO/13 - Biologia Applicata ,History, Ancient ,Genetics ,Principal Component Analysis ,education.field_of_study ,0303 health sciences ,Multidisciplinary ,Ancient DNA ,030305 genetics & heredity ,food and beverages ,Agriculture ,Genomics ,3. Good health ,Europe ,Workforce ,CIENCIAS NATURALES Y EXACTAS ,Human ,Archaeogenetics ,Asia ,Lineage (genetic) ,EUROPE ,Otras Ciencias Biológicas ,European Continental Ancestry Group ,Population ,Settore BIO/08 - ANTROPOLOGIA ,evolution ,Europeans ,Biology ,Article ,White People ,Ancient ,Genètica de poblacions humanes ,Human origins ,Ciencias Biológicas ,03 medical and health sciences ,HUMAN ORIGINS ,biology.animal ,Humans ,ANCIENT DNA ,purl.org/becyt/ford/1.6 [https] ,education ,Quantitative Biology - Populations and Evolution ,Denisovan ,030304 developmental biology ,Genetic diversity ,ancient DNA, modern DNA, Europeans, prehistory ,Genome, Human ,Populations and Evolution (q-bio.PE) ,biology.organism_classification ,Evolutionary biology ,FOS: Biological sciences ,Upper Paleolithic ,Human genome ,GENOMICS - Abstract
We sequenced the genomes of a ∼7,000-year-old farmer from Germany and eight ∼8,000-year-old hunter-gatherers from Luxembourg and Sweden. We analysed these and other ancient genomes1,2,3,4 with 2,345 contemporary humans to show that most present-day Europeans derive from at least three highly differentiated populations: west European hunter-gatherers, who contributed ancestry to all Europeans but not to Near Easterners; ancient north Eurasians related to Upper Palaeolithic Siberians3, who contributed to both Europeans and Near Easterners; and early European farmers, who were mainly of Near Eastern origin but also harboured west European hunter-gatherer related ancestry. We model these populations’ deep relationships and show that early European farmers had ∼44% ancestry from a ‘basal Eurasian’ population that split before the diversification of other non-African lineages., Instituto Multidisciplinario de Biología Celular
- Published
- 2014
3. A path integral approach for allele frequency dynamics under polygenic selection.
- Author
-
Anderson NW, Kirk L, Schraiber JG, and Ragsdale AP
- Subjects
- Genetic Drift, Phenotype, Alleles, Selection, Genetic, Multifactorial Inheritance, Gene Frequency, Models, Genetic
- Abstract
Many phenotypic traits have a polygenic genetic basis, making it challenging to learn their genetic architectures and predict individual phenotypes. One promising avenue to resolve the genetic basis of complex traits is through evolve-and-resequence (E&R) experiments, in which laboratory populations are exposed to some selective pressure and trait-contributing loci are identified by extreme frequency changes over the course of the experiment. However, small laboratory populations will experience substantial random genetic drift, and it is difficult to determine whether selection played a role in a given allele frequency change (AFC). Predicting AFCs under drift and selection, even for alleles contributing to simple, monogenic traits, has remained a challenging problem. Recently, there have been efforts to apply the path integral, a method borrowed from physics, to solve this problem. So far, this approach has been limited to genic selection, and is therefore inadequate to capture the complexity of quantitative, highly polygenic traits that are commonly studied. Here, we extend one of these path integral methods, the perturbation approximation, to selection scenarios that are of interest to quantitative genetics. We derive analytic expressions for the transition probability (i.e. the probability that an allele will change in frequency from x to y in time t) of an allele contributing to a trait subject to stabilizing selection, as well as that of an allele contributing to a trait rapidly adapting to a new phenotypic optimum. We use these expressions to characterize the use of AFC to test for selection, as well as explore optimal design choices for E&R experiments to uncover the genetic architecture of polygenic traits under selection., Competing Interests: Conflicts of interest: The author(s) declare no conflicts of interest., (© The Author(s) 2024. Published by Oxford University Press on behalf of The Genetics Society of America. All rights reserved. For commercial re-use, please contact reprints@oup.com for reprints and translation rights for reprints. All other permissions can be obtained through our RightsLink service via the Permissions link on the article page on our site—for further information please contact journals.permissions@oup.com.)
- Published
- 2025
- Full Text
- View/download PDF
4. Demographic history and genetic variation of the Armenian population.
- Author
-
Hovhannisyan A, Delser PM, Hakobyan A, Jones ER, Schraiber JG, Antonosyan M, Margaryan A, Xue Z, Jeon S, Bhak J, Hrechdakian P, Sahakyan H, Saag L, Khachatryan Z, Yepiskoposyan L, and Manica A
- Subjects
- Humans, Armenia, Genome, Human, Demography, Genetic Variation, Genetics, Population
- Abstract
We introduce a sizable (n = 34) whole-genome dataset on Armenians, a population inhabiting the region in West Asia known as the Armenian highlands. Equipped with this genetic data, we conducted a whole-genome study of Armenians and deciphered their fine-scale population structure and complex demographic history. We demonstrated that the Armenian populations from western, central, and eastern parts of the highlands are relatively homogeneous. The Sasun, a population in the south that had been argued to have received a major genetic contribution from Assyrians, was instead shown to have derived its slightly divergent genetic profile from a bottleneck that occurred in the recent past. We also investigated the debated question on the genetic origin of Armenians and failed to find any significant support for historical suggestions by Herodotus of their Balkan-related ancestry. We checked the degree of continuity of modern Armenians with ancient inhabitants of the eastern Armenian highlands and detected a genetic input into the region from a source linked to Neolithic Levantine Farmers at some point after the Early Bronze Age. Additionally, we cataloged an abundance of new mutations unique to the population, including a missense mutation predicted to cause familial Mediterranean fever, an autoinflammatory disorder highly prevalent in Armenians. Thus, we highlight the importance of further genetic and medical studies of this population., Competing Interests: Declaration of interests The authors declare no competing interests., (Copyright © 2024 The Authors. Published by Elsevier Inc. All rights reserved.)
- Published
- 2025
- Full Text
- View/download PDF
5. Unifying approaches from statistical genetics and phylogenetics for mapping phenotypes in structured populations.
- Author
-
Schraiber JG, Edge MD, and Pennell M
- Subjects
- Genetics, Population methods, Quantitative Trait Loci, Humans, Computer Simulation, Models, Statistical, Phylogeny, Phenotype, Models, Genetic, Genome-Wide Association Study methods
- Abstract
In both statistical genetics and phylogenetics, a major goal is to identify correlations between genetic loci or other aspects of the phenotype or environment and a focal trait. In these 2 fields, there are sophisticated but disparate statistical traditions aimed at these tasks. The disconnect between their respective approaches is becoming untenable as questions in medicine, conservation biology, and evolutionary biology increasingly rely on integrating data from within and among species, and once-clear conceptual divisions are becoming increasingly blurred. To help bridge this divide, we lay out a general model describing the covariance between the genetic contributions to the quantitative phenotypes of different individuals. Taking this approach shows that standard models in both statistical genetics (e.g., genome-wide association studies; GWAS) and phylogenetic comparative biology (e.g., phylogenetic regression) can be interpreted as special cases of this more general quantitative-genetic model. The fact that these models share the same core architecture means that we can build a unified understanding of the strengths and limitations of different methods for controlling for genetic structure when testing for associations. We develop intuition for why and when spurious correlations may occur analytically and conduct population-genetic and phylogenetic simulations of quantitative traits. The structural similarity of problems in statistical genetics and phylogenetics enables us to take methodological advances from one field and apply them in the other. We demonstrate by showing how a standard GWAS technique-including both the genetic relatedness matrix (GRM) as well as its leading eigenvectors, corresponding to the principal components of the genotype matrix, in a regression model-can mitigate spurious correlations in phylogenetic analyses. As a case study, we re-examine an analysis testing for coevolution of expression levels between genes across a fungal phylogeny and show that including eigenvectors of the covariance matrix as covariates decreases the false positive rate while simultaneously increasing the true positive rate. More generally, this work provides a foundation for more integrative approaches for understanding the genetic architecture of phenotypes and how evolutionary processes shape it., Competing Interests: The authors have declared that no competing interests exist., (Copyright: © 2024 Schraiber et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2024
- Full Text
- View/download PDF
6. Estimation of demography and mutation rates from one million haploid genomes.
- Author
-
Schraiber JG, Spence JP, and Edge MD
- Abstract
As genetic sequencing costs have plummeted, datasets with sizes previously un-thinkable have begun to appear. Such datasets present new opportunities to learn about evolutionary history, particularly via rare alleles that record the very recent past. However, beyond the computational challenges inherent in the analysis of many large-scale datasets, large population-genetic datasets present theoretical problems. In particular, the majority of population-genetic tools require the assumption that each mutant allele in the sample is the result of a single mutation (the "infinite sites" assumption), which is violated in large samples. Here, we present DR EVIL, a method for estimating mutation rates and recent demographic history from very large samples. DR EVIL avoids the infinite-sites assumption by using a diffusion approximation to a branching-process model with recurrent mutation. The branching-process approach limits the method to rare alleles, but, along with recent results, renders tractable likelihoods with recurrent mutation. We show that DR EVIL performs well in simulations and apply it to rare-variant data from a million haploid samples, identifying a signal of mutation-rate heterogeneity within commonly analyzed classes and predicting that in modern sample sizes, most rare variants at sites with high mutation rates represent the descendants of multiple mutation events.
- Published
- 2024
- Full Text
- View/download PDF
7. Convergent expansions of keystone gene families drive metabolic innovation in a major eukaryotic clade.
- Author
-
David KT, Schraiber JG, Crandall JG, Labella AL, Opulente DA, Harrison MC, Wolters JF, Zhou X, Shen XX, Groenewald M, Hittinger CT, Pennell M, and Rokas A
- Abstract
Many remarkable innovations have repeatedly occurred across vast evolutionary distances. When convergent traits emerge on the tree of life, they are sometimes driven by the same underlying gene families, while other times many different gene families are involved. Conversely, a gene family may be repeatedly recruited for a single trait or many different traits. To understand the general rules governing convergence at both genomic and phenotypic levels, we systematically tested associations between 56 binary metabolic traits and gene count in 14,710 gene families from 993 species of Saccharomycotina yeasts. Using a recently developed phylogenetic approach that reduces spurious correlations, we discovered that gene family expansion and contraction was significantly linked to trait gain and loss in 45/56 (80%) of traits. While 601/746 (81%) of significant gene families were associated with only one trait, we also identified several 'keystone' gene families that were significantly associated with up to 13/56 (23%) of all traits. These results indicate that metabolic innovations in yeasts are governed by a narrow set of major genetic elements and mechanisms., Competing Interests: Competing Interest Statement A.R. is a scientific consultant for LifeMine Therapeutics, Inc. The other authors declare no other competing interests.
- Published
- 2024
- Full Text
- View/download PDF
8. A Path Integral Approach for Allele Frequency Dynamics Under Polygenic Selection.
- Author
-
Anderson NW, Kirk L, Schraiber JG, and Ragsdale AP
- Abstract
Many phenotypic traits have a polygenic genetic basis, making it challenging to learn their genetic architectures and predict individual phenotypes. One promising avenue to resolve the genetic basis of complex traits is through evolve-and-resequence experiments, in which laboratory populations are exposed to some selective pressure and trait-contributing loci are identified by extreme frequency changes over the course of the experiment. However, small laboratory populations will experience substantial random genetic drift, and it is difficult to determine whether selection played a roll in a given allele frequency change. Predicting how much allele frequencies change under drift and selection had remained an open problem well into the 21
st century, even those contributing to simple, monogenic traits. Recently, there have been efforts to apply the path integral, a method borrowed from physics, to solve this problem. So far, this approach has been limited to genic selection, and is therefore inadequate to capture the complexity of quantitative, highly polygenic traits that are commonly studied. Here we extend one of these path integral methods, the perturbation approximation, to selection scenarios that are of interest to quantitative genetics. In particular, we derive analytic expressions for the transition probability (i.e., the probability that an allele will change in frequency from x , to y in time t ) of an allele contributing to a trait subject to stabilizing selection, as well as that of an allele contributing to a trait rapidly adapting to a new phenotypic optimum. We use these expressions to characterize the use of allele frequency change to test for selection, as well as explore optimal design choices for evolve-and-resequence experiments to uncover the genetic architecture of polygenic traits under selection.- Published
- 2024
- Full Text
- View/download PDF
9. Heritability within groups is uninformative about differences among groups: Cases from behavioral, evolutionary, and statistical genetics.
- Author
-
Schraiber JG and Edge MD
- Subjects
- Humans, Phenotype, Genotype, Computer Simulation, Genetic Variation, Biological Evolution, Genetics, Population
- Abstract
Without the ability to control or randomize environments (or genotypes), it is difficult to determine the degree to which observed phenotypic differences between two groups of individuals are due to genetic vs. environmental differences. However, some have suggested that these concerns may be limited to pathological cases, and methods have appeared that seem to give-directly or indirectly-some support to claims that aggregate heritable variation within groups can be related to heritable variation among groups. We consider three families of approaches: the "between-group heritability" sometimes invoked in behavior genetics, the statistic [Formula: see text] used in empirical work in evolutionary quantitative genetics, and methods based on variation in ancestry in an admixed population, used in anthropological and statistical genetics. We take up these examples to show mathematically that information on within-group genetic and phenotypic information in the aggregate cannot separate among-group differences into genetic and environmental components, and we provide simulation results that support our claims. We discuss these results in terms of the long-running debate on this topic., Competing Interests: Competing interests statement:The authors declare no competing interest.
- Published
- 2024
- Full Text
- View/download PDF
10. Double trouble: Predicting new variant counts across two heterogeneous populations.
- Author
-
Shen Y, Masoero L, Schraiber JG, and Broderick T
- Abstract
Collecting genomics data across multiple heterogeneous populations (e.g., across different cancer types) has the potential to improve our understanding of disease. Despite sequencing advances, though, resources often remain a constraint when gathering data. So it would be useful for experimental design if experimenters with access to a pilot study could predict the number of new variants they might expect to find in a follow-up study: both the number of new variants shared between the populations and the total across the populations. While many authors have developed prediction methods for the single-population case, we show that these predictions can fare poorly across multiple populations that are heterogeneous. We prove that, surprisingly, a natural extension of a state-of-the-art single-population predictor to multiple populations fails for fundamental reasons. We provide the first predictor for the number of new shared variants and new total variants that can handle heterogeneity in multiple populations. We show that our proposed method works well empirically using real cancer and population genetics data.
- Published
- 2024
11. Identification of constrained sequence elements across 239 primate genomes.
- Author
-
Kuderna LFK, Ulirsch JC, Rashid S, Ameen M, Sundaram L, Hickey G, Cox AJ, Gao H, Kumar A, Aguet F, Christmas MJ, Clawson H, Haeussler M, Janiak MC, Kuhlwilm M, Orkin JD, Bataillon T, Manu S, Valenzuela A, Bergman J, Rouselle M, Silva FE, Agueda L, Blanc J, Gut M, de Vries D, Goodhead I, Harris RA, Raveendran M, Jensen A, Chuma IS, Horvath JE, Hvilsom C, Juan D, Frandsen P, Schraiber JG, de Melo FR, Bertuol F, Byrne H, Sampaio I, Farias I, Valsecchi J, Messias M, da Silva MNF, Trivedi M, Rossi R, Hrbek T, Andriaholinirina N, Rabarivola CJ, Zaramody A, Jolly CJ, Phillips-Conroy J, Wilkerson G, Abee C, Simmons JH, Fernandez-Duque E, Kanthaswamy S, Shiferaw F, Wu D, Zhou L, Shao Y, Zhang G, Keyyu JD, Knauf S, Le MD, Lizano E, Merker S, Navarro A, Nadler T, Khor CC, Lee J, Tan P, Lim WK, Kitchener AC, Zinner D, Gut I, Melin AD, Guschanski K, Schierup MH, Beck RMD, Karakikes I, Wang KC, Umapathy G, Roos C, Boubli JP, Siepel A, Kundaje A, Paten B, Lindblad-Toh K, Rogers J, Marques Bonet T, and Farh KK
- Subjects
- Animals, Female, Humans, Pregnancy, Deoxyribonuclease I metabolism, DNA genetics, DNA metabolism, Mammals classification, Mammals genetics, Placenta, Regulatory Sequences, Nucleic Acid genetics, Reproducibility of Results, Transcription Factors metabolism, Proteins genetics, Gene Expression Regulation genetics, Conserved Sequence genetics, Evolution, Molecular, Genome genetics, Primates classification, Primates genetics
- Abstract
Noncoding DNA is central to our understanding of human gene regulation and complex diseases
1,2 , and measuring the evolutionary sequence constraint can establish the functional relevance of putative regulatory elements in the human genome3-9 . Identifying the genomic elements that have become constrained specifically in primates has been hampered by the faster evolution of noncoding DNA compared to protein-coding DNA10 , the relatively short timescales separating primate species11 , and the previously limited availability of whole-genome sequences12 . Here we construct a whole-genome alignment of 239 species, representing nearly half of all extant species in the primate order. Using this resource, we identified human regulatory elements that are under selective constraint across primates and other mammals at a 5% false discovery rate. We detected 111,318 DNase I hypersensitivity sites and 267,410 transcription factor binding sites that are constrained specifically in primates but not across other placental mammals and validate their cis-regulatory effects on gene expression. These regulatory elements are enriched for human genetic variants that affect gene expression and complex traits and diseases. Our results highlight the important role of recent evolution in regulatory sequence elements differentiating primates, including humans, from other placental mammals., (© 2023. The Author(s).)- Published
- 2024
- Full Text
- View/download PDF
12. Tree-based QTL mapping with expected local genetic relatedness matrices.
- Author
-
Link V, Schraiber JG, Fan C, Dinh B, Mancuso N, Chiang CWK, and Edge MD
- Subjects
- Humans, Chromosome Mapping methods, Models, Genetic, Phenotype, Native Hawaiian or Pacific Islander genetics, Genome-Wide Association Study, Quantitative Trait Loci genetics, Genetics, Population
- Abstract
Understanding the genetic basis of complex phenotypes is a central pursuit of genetics. Genome-wide association studies (GWASs) are a powerful way to find genetic loci associated with phenotypes. GWASs are widely and successfully used, but they face challenges related to the fact that variants are tested for association with a phenotype independently, whereas in reality variants at different sites are correlated because of their shared evolutionary history. One way to model this shared history is through the ancestral recombination graph (ARG), which encodes a series of local coalescent trees. Recent computational and methodological breakthroughs have made it feasible to estimate approximate ARGs from large-scale samples. Here, we explore the potential of an ARG-based approach to quantitative-trait locus (QTL) mapping, echoing existing variance-components approaches. We propose a framework that relies on the conditional expectation of a local genetic relatedness matrix (local eGRM) given the ARG. Simulations show that our method is especially beneficial for finding QTLs in the presence of allelic heterogeneity. By framing QTL mapping in terms of the estimated ARG, we can also facilitate the detection of QTLs in understudied populations. We use local eGRM to analyze two chromosomes containing known body size loci in a sample of Native Hawaiians. Our investigations can provide intuition about the benefits of using estimated ARGs in population- and statistical-genetic methods in general., Competing Interests: Declaration of interests The authors declare no competing interests., (Copyright © 2023 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.)
- Published
- 2023
- Full Text
- View/download PDF
13. A global catalog of whole-genome diversity from 233 primate species.
- Author
-
Kuderna LFK, Gao H, Janiak MC, Kuhlwilm M, Orkin JD, Bataillon T, Manu S, Valenzuela A, Bergman J, Rousselle M, Silva FE, Agueda L, Blanc J, Gut M, de Vries D, Goodhead I, Harris RA, Raveendran M, Jensen A, Chuma IS, Horvath JE, Hvilsom C, Juan D, Frandsen P, Schraiber JG, de Melo FR, Bertuol F, Byrne H, Sampaio I, Farias I, Valsecchi J, Messias M, da Silva MNF, Trivedi M, Rossi R, Hrbek T, Andriaholinirina N, Rabarivola CJ, Zaramody A, Jolly CJ, Phillips-Conroy J, Wilkerson G, Abee C, Simmons JH, Fernandez-Duque E, Kanthaswamy S, Shiferaw F, Wu D, Zhou L, Shao Y, Zhang G, Keyyu JD, Knauf S, Le MD, Lizano E, Merker S, Navarro A, Nadler T, Khor CC, Lee J, Tan P, Lim WK, Kitchener AC, Zinner D, Gut I, Melin AD, Guschanski K, Schierup MH, Beck RMD, Umapathy G, Roos C, Boubli JP, Rogers J, Farh KK, and Marques Bonet T
- Subjects
- Animals, Humans, Genome, Mutation Rate, Phylogeny, Population Density, Biological Evolution, Primates genetics, Genetic Variation
- Abstract
The rich diversity of morphology and behavior displayed across primate species provides an informative context in which to study the impact of genomic diversity on fundamental biological processes. Analysis of that diversity provides insight into long-standing questions in evolutionary and conservation biology and is urgent given severe threats these species are facing. Here, we present high-coverage whole-genome data from 233 primate species representing 86% of genera and all 16 families. This dataset was used, together with fossil calibration, to create a nuclear DNA phylogeny and to reassess evolutionary divergence times among primate clades. We found within-species genetic diversity across families and geographic regions to be associated with climate and sociality, but not with extinction risk. Furthermore, mutation rates differ across species, potentially influenced by effective population sizes. Lastly, we identified extensive recurrence of missense mutations previously thought to be human specific. This study will open a wide range of research avenues for future primate genomic research.
- Published
- 2023
- Full Text
- View/download PDF
14. The landscape of tolerated genetic variation in humans and primates.
- Author
-
Gao H, Hamp T, Ede J, Schraiber JG, McRae J, Singer-Berk M, Yang Y, Dietrich ASD, Fiziev PP, Kuderna LFK, Sundaram L, Wu Y, Adhikari A, Field Y, Chen C, Batzoglou S, Aguet F, Lemire G, Reimers R, Balick D, Janiak MC, Kuhlwilm M, Orkin JD, Manu S, Valenzuela A, Bergman J, Rousselle M, Silva FE, Agueda L, Blanc J, Gut M, de Vries D, Goodhead I, Harris RA, Raveendran M, Jensen A, Chuma IS, Horvath JE, Hvilsom C, Juan D, Frandsen P, de Melo FR, Bertuol F, Byrne H, Sampaio I, Farias I, do Amaral JV, Messias M, da Silva MNF, Trivedi M, Rossi R, Hrbek T, Andriaholinirina N, Rabarivola CJ, Zaramody A, Jolly CJ, Phillips-Conroy J, Wilkerson G, Abee C, Simmons JH, Fernandez-Duque E, Kanthaswamy S, Shiferaw F, Wu D, Zhou L, Shao Y, Zhang G, Keyyu JD, Knauf S, Le MD, Lizano E, Merker S, Navarro A, Bataillon T, Nadler T, Khor CC, Lee J, Tan P, Lim WK, Kitchener AC, Zinner D, Gut I, Melin A, Guschanski K, Schierup MH, Beck RMD, Umapathy G, Roos C, Boubli JP, Lek M, Sunyaev S, O'Donnell-Luria A, Rehm HL, Xu J, Rogers J, Marques-Bonet T, and Farh KK
- Subjects
- Animals, Humans, Base Sequence, Gene Frequency, Whole Genome Sequencing, Genetic Variation, Primates genetics
- Abstract
Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole-genome sequencing data for 809 individuals from 233 primate species and identified 4.3 million common protein-altering variants with orthologs in humans. We show that these variants can be inferred to have nondeleterious effects in humans based on their presence at high allele frequencies in other primate populations. We use this resource to classify 6% of all possible human protein-altering variants as likely benign and impute the pathogenicity of the remaining 94% of variants with deep learning, achieving state-of-the-art accuracy for diagnosing pathogenic variants in patients with genetic diseases.
- Published
- 2023
- Full Text
- View/download PDF
15. Rare penetrant mutations confer severe risk of common diseases.
- Author
-
Fiziev PP, McRae J, Ulirsch JC, Dron JS, Hamp T, Yang Y, Wainschtein P, Ni Z, Schraiber JG, Gao H, Cable D, Field Y, Aguet F, Fasnacht M, Metwally A, Rogers J, Marques-Bonet T, Rehm HL, O'Donnell-Luria A, Khera AV, and Farh KK
- Subjects
- Humans, Genome-Wide Association Study, Mutation, Phenotype, Risk Factors, Genetic Predisposition to Disease, Multifactorial Inheritance, Penetrance
- Abstract
We examined 454,712 exomes for genes associated with a wide spectrum of complex traits and common diseases and observed that rare, penetrant mutations in genes implicated by genome-wide association studies confer ~10-fold larger effects than common variants in the same genes. Consequently, an individual at the phenotypic extreme and at the greatest risk for severe, early-onset disease is better identified by a few rare penetrant variants than by the collective action of many common variants with weak effects. By combining rare variants across phenotype-associated genes into a unified genetic risk model, we demonstrate superior portability across diverse global populations compared with common-variant polygenic risk scores, greatly improving the clinical utility of genetic-based risk prediction.
- Published
- 2023
- Full Text
- View/download PDF
16. Rare penetrant mutations confer severe risk of common diseases.
- Author
-
Fiziev P, McRae J, Ulirsch JC, Dron JS, Hamp T, Yang Y, Wainschtein P, Ni Z, Schraiber JG, Gao H, Cable D, Field Y, Aguet F, Fasnacht M, Metwally A, Rogers J, Marques-Bonet T, Rehm HL, O'Donnell-Luria A, Khera AV, and Kai-How Farh K
- Abstract
We examined 454,712 exomes for genes associated with a wide spectrum of complex traits and common diseases and observed that rare, penetrant mutations in genes implicated by genome-wide association studies confer ∼10-fold larger effects than common variants in the same genes. Consequently, an individual at the phenotypic extreme and at the greatest risk for severe, early-onset disease is better identified by a few rare penetrant variants than by the collective action of many common variants with weak effects. By combining rare variants across phenotype-associated genes into a unified genetic risk model, we demonstrate superior portability across diverse global populations compared to common variant polygenic risk scores, greatly improving the clinical utility of genetic-based risk prediction., One Sentence Summary: Rare variant polygenic risk scores identify individuals with outlier phenotypes in common human diseases and complex traits.
- Published
- 2023
- Full Text
- View/download PDF
17. The population genomic legacy of the second plague pandemic.
- Author
-
Gopalakrishnan S, Ebenesersdóttir SS, Lundstrøm IKC, Turner-Walker G, Moore KHS, Luisi P, Margaryan A, Martin MD, Ellegaard MR, Magnússon ÓÞ, Sigurðsson Á, Snorradóttir S, Magnúsdóttir DN, Laffoon JE, van Dorp L, Liu X, Moltke I, Ávila-Arcos MC, Schraiber JG, Rasmussen S, Juan D, Gelabert P, de-Dios T, Fotakis AK, Iraeta-Orbegozo M, Vågene ÅJ, Denham SD, Christophersen A, Stenøien HK, Vieira FG, Liu S, Günther T, Kivisild T, Moseng OG, Skar B, Cheung C, Sandoval-Velasco M, Wales N, Schroeder H, Campos PF, Guðmundsdóttir VB, Sicheritz-Ponten T, Petersen B, Halgunset J, Gilbert E, Cavalleri GL, Hovig E, Kockum I, Olsson T, Alfredsson L, Hansen TF, Werge T, Willerslev E, Balloux F, Marques-Bonet T, Lalueza-Fox C, Nielsen R, Stefánsson K, Helgason A, and Gilbert MTP
- Subjects
- Humans, Pandemics history, Metagenomics, Genome, Bacterial, Phylogeny, Plague epidemiology, Plague genetics
- Abstract
Human populations have been shaped by catastrophes that may have left long-lasting signatures in their genomes. One notable example is the second plague pandemic that entered Europe in ca. 1,347 CE and repeatedly returned for over 300 years, with typical village and town mortality estimated at 10%-40%.
1 It is assumed that this high mortality affected the gene pools of these populations. First, local population crashes reduced genetic diversity. Second, a change in frequency is expected for sequence variants that may have affected survival or susceptibility to the etiologic agent (Yersinia pestis).2 Third, mass mortality might alter the local gene pools through its impact on subsequent migration patterns. We explored these factors using the Norwegian city of Trondheim as a model, by sequencing 54 genomes spanning three time periods: (1) prior to the plague striking Trondheim in 1,349 CE, (2) the 17th -19th century, and (3) the present. We find that the pandemic period shaped the gene pool by reducing long distance immigration, in particular from the British Isles, and inducing a bottleneck that reduced genetic diversity. Although we also observe an excess of large FST values at multiple loci in the genome, these are shaped by reference biases introduced by mapping our relatively low genome coverage degraded DNA to the reference genome. This implies that attempts to detect selection using ancient DNA (aDNA) datasets that vary by read length and depth of sequencing coverage may be particularly challenging until methods have been developed to account for the impact of differential reference bias on test statistics., Competing Interests: Declaration of interests The authors declare no competing interests., (Copyright © 2022 The Authors. Published by Elsevier Inc. All rights reserved.)- Published
- 2022
- Full Text
- View/download PDF
18. The history and geographic distribution of a KCNQ1 atrial fibrillation risk allele.
- Author
-
Hateley S, Lopez-Izquierdo A, Jou CJ, Cho S, Schraiber JG, Song S, Maguire CT, Torres N, Riedel M, Bowles NE, Arrington CB, Kennedy BJ, Etheridge SP, Lai S, Pribble C, Meyers L, Lundahl D, Byrnes J, Granka JM, Kauffman CA, Lemmon G, Boyden S, Scott Watkins W, Karren MA, Knight S, Brent Muhlestein J, Carlquist JF, Anderson JL, Chahine KG, Shah KU, Ball CA, Benjamin IJ, Yandell M, and Tristani-Firouzi M
- Subjects
- Action Potentials, Alleles, Denmark, Emigrants and Immigrants, Female, Genotype, Geography, Humans, Induced Pluripotent Stem Cells cytology, Induced Pluripotent Stem Cells metabolism, Male, Middle Aged, Myocytes, Cardiac cytology, Myocytes, Cardiac metabolism, Myocytes, Cardiac physiology, Pedigree, Risk Factors, Utah, Atrial Fibrillation genetics, Genetic Predisposition to Disease genetics, KCNQ1 Potassium Channel genetics, Mutation, Missense, Polymorphism, Single Nucleotide
- Abstract
The genetic architecture of atrial fibrillation (AF) encompasses low impact, common genetic variants and high impact, rare variants. Here, we characterize a high impact AF-susceptibility allele, KCNQ1 R231H, and describe its transcontinental geographic distribution and history. Induced pluripotent stem cell-derived cardiomyocytes procured from risk allele carriers exhibit abbreviated action potential duration, consistent with a gain-of-function effect. Using identity-by-descent (IBD) networks, we estimate the broad- and fine-scale population ancestry of risk allele carriers and their relatives. Analysis of ancestral migration routes reveals ancestors who inhabited Denmark in the 1700s, migrated to the Northeastern United States in the early 1800s, and traveled across the Midwest to arrive in Utah in the late 1800s. IBD/coalescent-based allele dating analysis reveals a relatively recent origin of the AF risk allele (~5000 years). Thus, our approach broadens the scope of study for disease susceptibility alleles to the context of human migration and ancestral origins., (© 2021. The Author(s).)
- Published
- 2021
- Full Text
- View/download PDF
19. Ancestry inference using reference labeled clusters of haplotypes.
- Author
-
Wang Y, Song S, Schraiber JG, Sedghifar A, Byrnes JK, Turissini DA, Hong EL, Ball CA, and Noto K
- Subjects
- Haplotypes, Humans, Polymorphism, Single Nucleotide, Genetics, Population, Genome, Human
- Abstract
Background: We present ARCHes, a fast and accurate haplotype-based approach for inferring an individual's ancestry composition. Our approach works by modeling haplotype diversity from a large, admixed cohort of hundreds of thousands, then annotating those models with population information from reference panels of known ancestry., Results: The running time of ARCHes does not depend on the size of a reference panel because training and testing are separate processes, and the inferred population-annotated haplotype models can be written to disk and reused to label large test sets in parallel (in our experiments, it averages less than one minute to assign ancestry from 32 populations using 10 CPU). We test ARCHes on public data from the 1000 Genomes Project and the Human Genome Diversity Project (HGDP) as well as simulated examples of known admixture., Conclusions: Our results demonstrate that ARCHes outperforms RFMix at correctly assigning both global and local ancestry at finer population scales regardless of the amount of population admixture., (© 2021. The Author(s).)
- Published
- 2021
- Full Text
- View/download PDF
20. Multiple episodes of interbreeding between Neanderthal and modern humans.
- Author
-
Villanea FA and Schraiber JG
- Subjects
- Animals, Gene Flow, Genome, Human, Humans, Models, Theoretical, Hybridization, Genetic, Neanderthals genetics
- Abstract
Neanderthals and anatomically modern humans overlapped geographically for a period of over 30,000 years following human migration out of Africa. During this period, Neanderthals and humans interbred, as evidenced by Neanderthal portions of the genome carried by non-African individuals today. A key observation is that the proportion of Neanderthal ancestry is ~12-20% higher in East Asian individuals relative to European individuals. Here, we explore various demographic models that could explain this observation. These include distinguishing between a single admixture event and multiple Neanderthal contributions to either population, and the hypothesis that reduced Neanderthal ancestry in modern Europeans resulted from more recent admixture with a ghost population that lacked a Neanderthal ancestry component (the 'dilution' hypothesis). To summarize the asymmetric pattern of Neanderthal allele frequencies, we compiled the joint fragment frequency spectrum of European and East Asian Neanderthal fragments and compared it with both analytical theory and data simulated under various models of admixture. Using maximum-likelihood and machine learning, we found that a simple model of a single admixture did not fit the empirical data, and instead favour a model of multiple episodes of gene flow into both European and East Asian populations. These findings indicate a longer-term, more complex interaction between humans and Neanderthals than was previously appreciated.
- Published
- 2019
- Full Text
- View/download PDF
21. A multispecies coalescent model for quantitative traits.
- Author
-
Mendes FK, Fuentes-González JA, Schraiber JG, and Hahn MW
- Subjects
- Animals, Biological Evolution, Humans, Phenotype, Phylogeny, Species Specificity, Models, Genetic, Quantitative Trait Loci, Quantitative Trait, Heritable
- Abstract
We present a multispecies coalescent model for quantitative traits that allows for evolutionary inferences at micro- and macroevolutionary scales. A major advantage of this model is its ability to incorporate genealogical discordance underlying a quantitative trait. We show that discordance causes a decrease in the expected trait covariance between more closely related species relative to more distantly related species. If unaccounted for, this outcome can lead to an overestimation of a trait's evolutionary rate, to a decrease in its phylogenetic signal, and to errors when examining shifts in mean trait values. The number of loci controlling a quantitative trait appears to be irrelevant to all trends reported, and discordance also affected discrete, threshold traits. Our model and analyses point to the conditions under which different methods should fare better or worse, in addition to indicating current and future approaches that can mitigate the effects of discordance., Competing Interests: FM, JF, JS, MH No competing interests declared, (© 2018, Mendes et al.)
- Published
- 2018
- Full Text
- View/download PDF
22. Mito-nuclear discordance across a recent contact zone for California voles.
- Author
-
Lin D, Bi K, Conroy CJ, Lacey EA, Schraiber JG, and Bowie RCK
- Abstract
To examine the processes that maintain genetic diversity among closely related taxa, we investigated the dynamics of introgression across a contact zone between two lineages of California voles ( Microtus californicus ). We tested the prediction that introgression of nuclear loci would be greater than that for mitochondrial loci, assuming ongoing gene flow across the contact zone. We also predicted that genomic markers would show a mosaic pattern of differentiation across this zone, consistent with genomes that are semi-permeable. Using mitochondrial cytochrome b sequences and genome-wide loci developed via ddRAD-seq, we analyzed genetic variation for 10 vole populations distributed along the central California coast; this transect included populations from within the distributions of both parental lineages as well as the putative contact zone. Our analyses revealed that (1) the two lineages examined are relatively young, having diverged ca. 8.5-54 kya, (2) voles from the contact zone in Santa Barbara County did not include F1 or early generation backcrossed individuals, and (3) there appeared to be little to no recurrent gene flow across the contact zone. Introgression patterns for mitochondrial and nuclear markers were not concordant; only mitochondrial markers revealed evidence of introgression, putatively due to historical hybridization. These differences in genetic signatures are intriguing given that the contact zone occurs in a region of continuous vole habitat, with no evidence of past or present physical barriers. Future studies that examine specific isolating mechanisms, such as microhabitat use and mate choice, will facilitate our understanding of how genetic boundaries are maintained in this system.
- Published
- 2018
- Full Text
- View/download PDF
23. Origins and genetic legacies of the Caribbean Taino.
- Author
-
Schroeder H, Sikora M, Gopalakrishnan S, Cassidy LM, Maisano Delser P, Sandoval Velasco M, Schraiber JG, Rasmussen S, Homburger JR, Ávila-Arcos MC, Allentoft ME, Moreno-Mayar JV, Renaud G, Gómez-Carballa A, Laffoon JE, Hopkins RJA, Higham TFG, Carr RS, Schaffer WC, Day JS, Hoogland M, Salas A, Bustamante CD, Nielsen R, Bradley DG, Hofman CL, and Willerslev E
- Subjects
- Adult, Archaeology, Bahamas, DNA, Ancient, DNA, Mitochondrial genetics, Female, Genetics, Population, Genomics, Hispanic or Latino genetics, History, Ancient, Human Migration history, Humans, Male, Paleontology, Phylogeny, Young Adult, American Indian or Alaska Native genetics, Genome, Human genetics, Human Migration statistics & numerical data
- Abstract
The Caribbean was one of the last parts of the Americas to be settled by humans, but how and when the islands were first occupied remains a matter of debate. Ancient DNA can help answering these questions, but the work has been hampered by poor DNA preservation. We report the genome sequence of a 1,000-year-old Lucayan Taino individual recovered from the site of Preacher's Cave in the Bahamas. We sequenced her genome to 12.4-fold coverage and show that she is genetically most closely related to present-day Arawakan speakers from northern South America, suggesting that the ancestors of the Lucayans originated there. Further, we find no evidence for recent inbreeding or isolation in the ancient genome, suggesting that the Lucayans had a relatively large effective population size. Finally, we show that the native American components in some present-day Caribbean genomes are closely related to the ancient Taino, demonstrating an element of continuity between precontact populations and present-day Latino populations in the Caribbean., Competing Interests: Conflict of interest statement: H.S. is on the scientific advisory board of Living DNA Ltd, J.R.H. is co-founder of Encompass Bioscience Inc, and C.D.B. is founder of IdentifyGenomics, LLC, and scientific advisor for Personalis Inc, Ancestry.com Inc, and Invitae Inc. This did not affect the design, execution, or interpretation of the experiments and results presented here., (Copyright © 2018 the Author(s). Published by PNAS.)
- Published
- 2018
- Full Text
- View/download PDF
24. Assessing the Relationship of Ancient and Modern Populations.
- Author
-
Schraiber JG
- Subjects
- Algorithms, Alleles, Humans, Polymorphism, Single Nucleotide, DNA, Ancient, Genetics, Population, Models, Genetic
- Abstract
Genetic material sequenced from ancient samples is revolutionizing our understanding of the recent evolutionary past. However, ancient DNA is often degraded, resulting in low coverage, error-prone sequencing. Several solutions exist to this problem, ranging from simple approach, such as selecting a read at random for each site, to more complicated approaches involving genotype likelihoods. In this work, we present a novel method for assessing the relationship of an ancient sample with a modern population, while accounting for sequencing error and postmortem damage by analyzing raw reads from multiple ancient individuals simultaneously. We show that, when analyzing SNP data, it is better to sequence more ancient samples to low coverage: two samples sequenced to 0.5× coverage provide better resolution than a single sample sequenced to 2× coverage. We also examined the power to detect whether an ancient sample is directly ancestral to a modern population, finding that, with even a few high coverage individuals, even ancient samples that are very slightly diverged from the modern population can be detected with ease. When we applied our approach to European samples, we found that no ancient samples represent direct ancestors of modern Europeans. We also found that, as shown previously, the most ancient Europeans appear to have had the smallest effective population sizes, indicating a role for agriculture in modern population growth., (Copyright © 2018 Schraiber.)
- Published
- 2018
- Full Text
- View/download PDF
25. Pulsed evolution shaped modern vertebrate body sizes.
- Author
-
Landis MJ and Schraiber JG
- Subjects
- Animals, Female, Male, Vertebrates anatomy & histology, Body Size genetics, Evolution, Molecular, Models, Genetic, Vertebrates genetics
- Abstract
The relative importance of different modes of evolution in shaping phenotypic diversity remains a hotly debated question. Fossil data suggest that stasis may be a common mode of evolution, while modern data suggest some lineages experience very fast rates of evolution. One way to reconcile these observations is to imagine that evolution proceeds in pulses, rather than in increments, on geological timescales. To test this hypothesis, we developed a maximum-likelihood framework for fitting Lévy processes to comparative morphological data. This class of stochastic processes includes both an incremental and a pulsed component. We found that a plurality of modern vertebrate clades examined are best fitted by pulsed processes over models of incremental change, stationarity, and adaptive radiation. When we compare our results to theoretical expectations of the rate and speed of regime shifts for models that detail fitness landscape dynamics, we find that our quantitative results are broadly compatible with both microevolutionary models and observations from the fossil record., Competing Interests: The authors declare no conflict of interest., (Copyright © 2017 the Author(s). Published by PNAS.)
- Published
- 2017
- Full Text
- View/download PDF
26. Archaic Hominin Admixture Facilitated Adaptation to Out-of-Africa Environments.
- Author
-
Gittelman RM, Schraiber JG, Vernot B, Mikacenic C, Wurfel MM, and Akey JM
- Subjects
- Animals, Demography, Gene Expression Regulation, Genetic Variation, Haplotypes, Humans, Neanderthals, Pigmentation, Adaptation, Physiological genetics, Biological Evolution, DNA genetics, Genome, Human genetics
- Abstract
As modern humans dispersed from Africa throughout the world, they encountered and interbred with archaic hominins, including Neanderthals and Denisovans [1, 2]. Although genome-scale maps of introgressed sequences have been constructed [3-6], considerable gaps in knowledge remain about the functional, phenotypic, and evolutionary significance of archaic hominin DNA that persists in present-day individuals. Here, we describe a comprehensive set of analyses that identified 126 high-frequency archaic haplotypes as putative targets of adaptive introgression in geographically diverse populations. These loci are enriched for immune-related genes (such as OAS1/2/3, TLR1/6/10, and TNFAIP3) and also encompass genes (including OCA2 and BNC2) that influence skin pigmentation phenotypes. Furthermore, we leveraged existing and novel large-scale gene expression datasets to show many positively selected archaic haplotypes act as expression quantitative trait loci (eQTLs), suggesting that modulation of transcript abundance was a common mechanism facilitating adaptive introgression. Our results demonstrate that hybridization between modern and archaic hominins provided an important reservoir of advantageous alleles that enabled adaptation to out-of-Africa environments., (Copyright © 2016 Elsevier Ltd. All rights reserved.)
- Published
- 2016
- Full Text
- View/download PDF
27. Emergence of a Homo sapiens-specific gene family and chromosome 16p11.2 CNV susceptibility.
- Author
-
Nuttle X, Giannuzzi G, Duyzend MH, Schraiber JG, Narvaiza I, Sudmant PH, Penn O, Chiatante G, Malig M, Huddleston J, Benner C, Camponeschi F, Ciofi-Baffoni S, Stessman HA, Marchetto MC, Denman L, Harshman L, Baker C, Raja A, Penewit K, Janke N, Tang WJ, Ventura M, Banci L, Antonacci F, Akey JM, Amemiya CT, Gage FH, Reymond A, and Eichler EE
- Subjects
- Animals, Autistic Disorder genetics, Chromosome Breakage, Gene Duplication, Homeostasis genetics, Humans, Iron metabolism, Pan troglodytes genetics, Pongo genetics, Proteins analysis, Recombination, Genetic, Species Specificity, Time Factors, Chromosomes, Human, Pair 16 genetics, DNA Copy Number Variations genetics, Evolution, Molecular, Genetic Predisposition to Disease, Proteins genetics
- Abstract
Genetic differences that specify unique aspects of human evolution have typically been identified by comparative analyses between the genomes of humans and closely related primates, including more recently the genomes of archaic hominins. Not all regions of the genome, however, are equally amenable to such study. Recurrent copy number variation (CNV) at chromosome 16p11.2 accounts for approximately 1% of cases of autism and is mediated by a complex set of segmental duplications, many of which arose recently during human evolution. Here we reconstruct the evolutionary history of the locus and identify bolA family member 2 (BOLA2) as a gene duplicated exclusively in Homo sapiens. We estimate that a 95-kilobase-pair segment containing BOLA2 duplicated across the critical region approximately 282 thousand years ago (ka), one of the latest among a series of genomic changes that dramatically restructured the locus during hominid evolution. All humans examined carried one or more copies of the duplication, which nearly fixed early in the human lineage--a pattern unlikely to have arisen so rapidly in the absence of selection (P < 0.0097). We show that the duplication of BOLA2 led to a novel, human-specific in-frame fusion transcript and that BOLA2 copy number correlates with both RNA expression (r = 0.36) and protein level (r = 0.65), with the greatest expression difference between human and chimpanzee in experimentally derived stem cells. Analyses of 152 patients carrying a chromosome 16p11. rearrangement show that more than 96% of breakpoints occur within the H. sapiens-specific duplication. In summary, the duplicative transposition of BOLA2 at the root of the H. sapiens lineage about 282 ka simultaneously increased copy number of a gene associated with iron homeostasis and predisposed our species to recurrent rearrangements associated with disease.
- Published
- 2016
- Full Text
- View/download PDF
28. Bayesian Inference of Natural Selection from Allele Frequency Time Series.
- Author
-
Schraiber JG, Evans SN, and Slatkin M
- Subjects
- Animals, Bayes Theorem, Diploidy, Horses genetics, Skin Pigmentation genetics, Gene Frequency, Models, Genetic, Selection, Genetic, Software
- Abstract
The advent of accessible ancient DNA technology now allows the direct ascertainment of allele frequencies in ancestral populations, thereby enabling the use of allele frequency time series to detect and estimate natural selection. Such direct observations of allele frequency dynamics are expected to be more powerful than inferences made using patterns of linked neutral variation obtained from modern individuals. We developed a Bayesian method to make use of allele frequency time series data and infer the parameters of general diploid selection, along with allele age, in nonequilibrium populations. We introduce a novel path augmentation approach, in which we use Markov chain Monte Carlo to integrate over the space of allele frequency trajectories consistent with the observed data. Using simulations, we show that this approach has good power to estimate selection coefficients and allele age. Moreover, when applying our approach to data on horse coat color, we find that ignoring a relevant demographic history can significantly bias the results of inference. Our approach is made available in a C++ software package., (Copyright © 2016 by the Genetics Society of America.)
- Published
- 2016
- Full Text
- View/download PDF
29. Excavating Neandertal and Denisovan DNA from the genomes of Melanesian individuals.
- Author
-
Vernot B, Tucci S, Kelso J, Schraiber JG, Wolf AB, Gittelman RM, Dannemann M, Grote S, McCoy RC, Norton H, Scheinfeldt LB, Merriwether DA, Koki G, Friedlaender JS, Wakefield J, Pääbo S, and Akey JM
- Subjects
- Animals, Humans, Genetic Variation, Melanesia, Sequence Analysis, DNA, DNA genetics, Genome, Human genetics, Neanderthals genetics, Pacific Island People genetics
- Abstract
Although Neandertal sequences that persist in the genomes of modern humans have been identified in Eurasians, comparable studies in people whose ancestors hybridized with both Neandertals and Denisovans are lacking. We developed an approach to identify DNA inherited from multiple archaic hominin ancestors and applied it to whole-genome sequences from 1523 geographically diverse individuals, including 35 previously unknown Island Melanesian genomes. In aggregate, we recovered 1.34 gigabases and 303 megabases of the Neandertal and Denisovan genome, respectively. We use these maps of archaic sequences to show that Neandertal admixture occurred multiple times in different non-African populations, characterize genomic regions that are significantly depleted of archaic sequences, and identify signatures of adaptive introgression., (Copyright © 2016, American Association for the Advancement of Science.)
- Published
- 2016
- Full Text
- View/download PDF
30. Methods and models for unravelling human evolutionary history.
- Author
-
Schraiber JG and Akey JM
- Subjects
- Humans, Models, Genetic, Databases, Nucleic Acid, Evolution, Molecular, Genetic Variation, Genetics, Medical methods, Genome, Human, Genomics methods
- Abstract
The genomes of contemporary humans contain considerable information about the history of our species. Although the general contours of human evolutionary history have been defined with increasing resolution throughout the past several decades, the continuing deluge of massively large sequencing data sets presents new opportunities and challenges for understanding human evolutionary history. Here, we review the signatures that demographic history imparts on patterns of DNA sequence variation, statistical methods that have been developed to leverage information contained in genome-scale data sets and insights gleaned from these studies. We also discuss the importance of using exploratory analyses to assess data quality, the strengths and limitations of commonly used population genomics methods, and factors that confound population genomics inferences.
- Published
- 2015
- Full Text
- View/download PDF
31. Evidence of long-term gene flow and selection during domestication from analyses of Eurasian wild and domestic pig genomes.
- Author
-
Frantz LA, Schraiber JG, Madsen O, Megens HJ, Cagan A, Bosse M, Paudel Y, Crooijmans RP, Larson G, and Groenen MA
- Subjects
- Animals, Asia, Europe, Animals, Domestic genetics, Animals, Wild genetics, Gene Flow, Genome, Swine genetics
- Abstract
Traditionally, the process of domestication is assumed to be initiated by humans, involve few individuals and rely on reproductive isolation between wild and domestic forms. We analyzed pig domestication using over 100 genome sequences and tested whether pig domestication followed a traditional linear model or a more complex, reticulate model. We found that the assumptions of traditional models, such as reproductive isolation and strong domestication bottlenecks, are incompatible with the genetic data. In addition, our results show that, despite gene flow, the genomes of domestic pigs have strong signatures of selection at loci that affect behavior and morphology. We argue that recurrent selection for domestic traits likely counteracted the homogenizing effect of gene flow from wild boars and created 'islands of domestication' in the genome. Our results have major ramifications for the understanding of animal domestication and suggest that future studies should employ models that do not assume reproductive isolation.
- Published
- 2015
- Full Text
- View/download PDF
32. Sensitivity of quantitative traits to mutational effects and number of loci.
- Author
-
Schraiber JG and Landis MJ
- Subjects
- Humans, Models, Genetic, Phenotype, Genetic Variation genetics, Genetics, Population, Mutation genetics, Quantitative Trait Loci genetics
- Abstract
When models of quantitative genetic variation are built from population genetic first principles, several assumptions are often made. One of the most important assumptions is that traits are controlled by many genes of small effect. This leads to a prediction of a Gaussian trait distribution in the population, via the Central Limit Theorem. Since these biological assumptions are often unknown or untrue, we characterized how finite numbers of loci or large mutational effects can impact the sampling distribution of a quantitative trait. To do so, we developed a neutral coalescent-based framework, allowing us to gain a detailed understanding of how number of loci and the underlying mutational model impacts the distribution of a quantitative trait. Through both analytical theory and simulation we found the normality assumption was highly sensitive to the details of the mutational process, with the greatest discrepancies arising when the number of loci was small or the mutational kernel was heavy-tailed. In particular, skewed mutational effects will produce skewed trait distributions and fat-tailed mutational kernels result in multimodal sampling distributions, even for traits controlled by a large number of loci. Since selection models and robust neutral models may produce qualitatively similar sampling distributions, we advise extra caution should be taken when interpreting model-based results for poorly understood systems of quantitative traits., (Copyright © 2015 Elsevier Inc. All rights reserved.)
- Published
- 2015
- Full Text
- View/download PDF
33. Evolution of Tibetan wild boars.
- Author
-
Frantz LA, Madsen O, Megens HJ, Schraiber JG, Paudel Y, Bosse M, Crooijmans RP, Larson G, and Groenen MA
- Subjects
- Animals, Female, Selection, Genetic, Sus scrofa genetics
- Published
- 2015
- Full Text
- View/download PDF
34. Approximation to the distribution of fitness effects across functional categories in human segregating polymorphisms.
- Author
-
Racimo F and Schraiber JG
- Subjects
- Genetic Drift, Genetics, Population, Genome, Human, Humans, Mutation Rate, Evolution, Molecular, Genetic Fitness, Mutation genetics, Polymorphism, Single Nucleotide genetics
- Abstract
Quantifying the proportion of polymorphic mutations that are deleterious or neutral is of fundamental importance to our understanding of evolution, disease genetics and the maintenance of variation genome-wide. Here, we develop an approximation to the distribution of fitness effects (DFE) of segregating single-nucleotide mutations in humans. Unlike previous methods, we do not assume that synonymous mutations are neutral or not strongly selected, and we do not rely on fitting the DFE of all new nonsynonymous mutations to a single probability distribution, which is poorly motivated on a biological level. We rely on a previously developed method that utilizes a variety of published annotations (including conservation scores, protein deleteriousness estimates and regulatory data) to score all mutations in the human genome based on how likely they are to be affected by negative selection, controlling for mutation rate. We map this and other conservation scores to a scale of fitness coefficients via maximum likelihood using diffusion theory and a Poisson random field model on SNP data. Our method serves to approximate the deleterious DFE of mutations that are segregating, regardless of their genomic consequence. We can then compare the proportion of mutations that are negatively selected or neutral across various categories, including different types of regulatory sites. We observe that the distribution of intergenic polymorphisms is highly peaked at neutrality, while the distribution of nonsynonymous polymorphisms has a second peak at [Formula: see text]. Other types of polymorphisms have shapes that fall roughly in between these two. We find that transcriptional start sites, strong CTCF-enriched elements and enhancers are the regulatory categories with the largest proportion of deleterious polymorphisms.
- Published
- 2014
- Full Text
- View/download PDF
35. Ancient human genomes suggest three ancestral populations for present-day Europeans.
- Author
-
Lazaridis I, Patterson N, Mittnik A, Renaud G, Mallick S, Kirsanow K, Sudmant PH, Schraiber JG, Castellano S, Lipson M, Berger B, Economou C, Bollongino R, Fu Q, Bos KI, Nordenfelt S, Li H, de Filippo C, Prüfer K, Sawyer S, Posth C, Haak W, Hallgren F, Fornander E, Rohland N, Delsate D, Francken M, Guinet JM, Wahl J, Ayodo G, Babiker HA, Bailliet G, Balanovska E, Balanovsky O, Barrantes R, Bedoya G, Ben-Ami H, Bene J, Berrada F, Bravi CM, Brisighelli F, Busby GB, Cali F, Churnosov M, Cole DE, Corach D, Damba L, van Driem G, Dryomov S, Dugoujon JM, Fedorova SA, Gallego Romero I, Gubina M, Hammer M, Henn BM, Hervig T, Hodoglugil U, Jha AR, Karachanak-Yankova S, Khusainova R, Khusnutdinova E, Kittles R, Kivisild T, Klitz W, Kučinskas V, Kushniarevich A, Laredj L, Litvinov S, Loukidis T, Mahley RW, Melegh B, Metspalu E, Molina J, Mountain J, Näkkäläjärvi K, Nesheva D, Nyambo T, Osipova L, Parik J, Platonov F, Posukh O, Romano V, Rothhammer F, Rudan I, Ruizbakiev R, Sahakyan H, Sajantila A, Salas A, Starikovskaya EB, Tarekegn A, Toncheva D, Turdikulova S, Uktveryte I, Utevska O, Vasquez R, Villena M, Voevoda M, Winkler CA, Yepiskoposyan L, Zalloua P, Zemunik T, Cooper A, Capelli C, Thomas MG, Ruiz-Linares A, Tishkoff SA, Singh L, Thangaraj K, Villems R, Comas D, Sukernik R, Metspalu M, Meyer M, Eichler EE, Burger J, Slatkin M, Pääbo S, Kelso J, Reich D, and Krause J
- Subjects
- Agriculture history, Asia ethnology, Europe, History, Ancient, Humans, Population Dynamics, Principal Component Analysis, Workforce, Genome, Human genetics, White People classification, White People genetics
- Abstract
We sequenced the genomes of a ∼7,000-year-old farmer from Germany and eight ∼8,000-year-old hunter-gatherers from Luxembourg and Sweden. We analysed these and other ancient genomes with 2,345 contemporary humans to show that most present-day Europeans derive from at least three highly differentiated populations: west European hunter-gatherers, who contributed ancestry to all Europeans but not to Near Easterners; ancient north Eurasians related to Upper Palaeolithic Siberians, who contributed to both Europeans and Near Easterners; and early European farmers, who were mainly of Near Eastern origin but also harboured west European hunter-gatherer related ancestry. We model these populations' deep relationships and show that early European farmers had ∼44% ancestry from a 'basal Eurasian' population that split before the diversification of other non-African lineages.
- Published
- 2014
- Full Text
- View/download PDF
36. A path integral formulation of the Wright-Fisher process with genic selection.
- Author
-
Schraiber JG
- Subjects
- Probability, Models, Genetic, Selection, Genetic
- Abstract
The Wright-Fisher process with selection is an important tool in population genetics theory. Traditional analysis of this process relies on the diffusion approximation. The diffusion approximation is usually studied in a partial differential equations framework. In this paper, I introduce a path integral formalism to study the Wright-Fisher process with selection and use that formalism to obtain a simple perturbation series to approximate the transition density. The perturbation series can be understood in terms of Feynman diagrams, which have a simple probabilistic interpretation in terms of selective events. The perturbation series proves to be an accurate approximation of the transition density for weak selection and is shown to be arbitrarily accurate for any selection coefficient., (Copyright © 2013 Elsevier Inc. All rights reserved.)
- Published
- 2014
- Full Text
- View/download PDF
37. Analysis and rejection sampling of Wright-Fisher diffusion bridges.
- Author
-
Schraiber JG, Griffiths RC, and Evans SN
- Subjects
- Genetics, Population, Models, Theoretical
- Abstract
We investigate the properties of a Wright-Fisher diffusion process starting at frequency x at time 0 and conditioned to be at frequency y at time T. Such a process is called a bridge. Bridges arise naturally in the analysis of selection acting on standing variation and in the inference of selection from allele frequency time series. We establish a number of results about the distribution of neutral Wright-Fisher bridges and develop a novel rejection-sampling scheme for bridges under selection that we use to study their behavior., (Copyright © 2013 Elsevier Inc. All rights reserved.)
- Published
- 2013
- Full Text
- View/download PDF
38. Phylogenetic analysis using Lévy processes: finding jumps in the evolution of continuous traits.
- Author
-
Landis MJ, Schraiber JG, and Liang M
- Subjects
- Animals, Body Mass Index, Computer Simulation, Primates classification, Primates physiology, Models, Biological, Phenotype, Phylogeny
- Abstract
Gaussian processes, a class of stochastic processes including Brownian motion and the Ornstein-Uhlenbeck process, are widely used to model continuous trait evolution in statistical phylogenetics. Under such processes, observations at the tips of a phylogenetic tree have a multivariate Gaussian distribution, which may lead to suboptimal model specification under certain evolutionary conditions, as supposed in models of punctuated equilibrium or adaptive radiation. To consider non-normally distributed continuous trait evolution, we introduce a method to compute posterior probabilities when modeling continuous trait evolution as a Lévy process. Through data simulation and model testing, we establish that single-rate Brownian motion (BM) and Lévy processes with jumps generate distinct patterns in comparative data. We then analyzed body mass and endocranial volume measurements for 126 primates. We rejected single-rate BM in favor of a Lévy process with jumps for each trait, with the lineage leading to most recent common ancestor of great apes showing particularly strong evidence against single-rate BM.
- Published
- 2013
- Full Text
- View/download PDF
39. Genome sequencing reveals fine scale diversification and reticulation history during speciation in Sus.
- Author
-
Frantz LA, Schraiber JG, Madsen O, Megens HJ, Bosse M, Paudel Y, Semiadi G, Meijaard E, Li N, Crooijmans RP, Archibald AL, Slatkin M, Schook LB, Larson G, and Groenen MA
- Subjects
- Africa, Animal Distribution, Animals, Asia, Southeastern, Chromosome Mapping, Climate, Gene Flow, Genetics, Population, Phylogeography, Sequence Analysis, DNA, Genetic Speciation, Genetic Variation, Genome, Phylogeny, Swine classification, Swine genetics
- Abstract
Background: Elucidating the process of speciation requires an in-depth understanding of the evolutionary history of the species in question. Studies that rely upon a limited number of genetic loci do not always reveal actual evolutionary history, and often confuse inferences related to phylogeny and speciation. Whole-genome data, however, can overcome this issue by providing a nearly unbiased window into the patterns and processes of speciation. In order to reveal the complexity of the speciation process, we sequenced and analyzed the genomes of 10 wild pigs, representing morphologically or geographically well-defined species and subspecies of the genus Sus from insular and mainland Southeast Asia, and one African common warthog., Results: Our data highlight the importance of past cyclical climatic fluctuations in facilitating the dispersal and isolation of populations, thus leading to the diversification of suids in one of the most species-rich regions of the world. Moreover, admixture analyses revealed extensive, intra- and inter-specific gene-flow that explains previous conflicting results obtained from a limited number of loci. We show that these multiple episodes of gene-flow resulted from both natural and human-mediated dispersal., Conclusions: Our results demonstrate the importance of past climatic fluctuations and human mediated translocations in driving and complicating the process of speciation in island Southeast Asia. This case study demonstrates that genomics is a powerful tool to decipher the evolutionary history of a genus, and reveals the complexity of the process of speciation.
- Published
- 2013
- Full Text
- View/download PDF
40. Inferring evolutionary histories of pathway regulation from transcriptional profiling data.
- Author
-
Schraiber JG, Mostovoy Y, Hsu TY, and Brem RB
- Subjects
- Phylogeny, Saccharomyces genetics, Evolution, Molecular, Gene Expression Profiling methods, Genomics methods, Models, Genetic
- Abstract
One of the outstanding challenges in comparative genomics is to interpret the evolutionary importance of regulatory variation between species. Rigorous molecular evolution-based methods to infer evidence for natural selection from expression data are at a premium in the field, and to date, phylogenetic approaches have not been well-suited to address the question in the small sets of taxa profiled in standard surveys of gene expression. We have developed a strategy to infer evolutionary histories from expression profiles by analyzing suites of genes of common function. In a manner conceptually similar to molecular evolution models in which the evolutionary rates of DNA sequence at multiple loci follow a gamma distribution, we modeled expression of the genes of an a priori-defined pathway with rates drawn from an inverse gamma distribution. We then developed a fitting strategy to infer the parameters of this distribution from expression measurements, and to identify gene groups whose expression patterns were consistent with evolutionary constraint or rapid evolution in particular species. Simulations confirmed the power and accuracy of our inference method. As an experimental testbed for our approach, we generated and analyzed transcriptional profiles of four Saccharomyces yeasts. The results revealed pathways with signatures of constrained and accelerated regulatory evolution in individual yeasts and across the phylogeny, highlighting the prevalence of pathway-level expression change during the divergence of yeast species. We anticipate that our pathway-based phylogenetic approach will be of broad utility in the search to understand the evolutionary relevance of regulatory change.
- Published
- 2013
- Full Text
- View/download PDF
41. Genomic tests of variation in inbreeding among individuals and among chromosomes.
- Author
-
Schraiber JG, Shih S, and Slatkin M
- Subjects
- Genetic Variation, Genome, Human, Heterozygote, Humans, Likelihood Functions, Chromosomes, Human, Consanguinity, Models, Genetic
- Abstract
We examine the distribution of heterozygous sites in nine European and nine Yoruban individuals whose genomic sequences were made publicly available by Complete Genomics. We show that it is possible to obtain detailed information about inbreeding when a relatively small set of whole-genome sequences is available. Rather than focus on testing for deviations from Hardy-Weinberg genotype frequencies at each site, we analyze the entire distribution of heterozygotes conditioned on the number of copies of the derived (non-chimpanzee) allele. Using Levene's exact test, we reject Hardy-Weinberg in both populations. We generalized Levene's distribution to obtain the exact distribution of the number of heterozygous individuals given that every individual has the same inbreeding coefficient, F. We estimated F to be 0.0026 in Europeans and 0.0005 in Yorubans, but we could also reject the hypothesis that F was the same in each individual. We used a composite-likelihood method to estimate F in each individual and within each chromosome. Variation in F across chromosomes within individuals was too large to be consistent with sampling effects alone. Furthermore, estimates of F for each chromosome in different populations were not correlated. Our results show how detailed comparisons of population genomic data can be made to theoretical predictions. The application of methods to the Complete Genomics data set shows that the extent of apparent inbreeding varies across chromosomes and across individuals, and estimates of inbreeding coefficients are subject to unexpected levels of variation, which might be partly accounted for by selection.
- Published
- 2012
- Full Text
- View/download PDF
42. Analyses of pig genomes provide insight into porcine demography and evolution.
- Author
-
Groenen MA, Archibald AL, Uenishi H, Tuggle CK, Takeuchi Y, Rothschild MF, Rogel-Gaillard C, Park C, Milan D, Megens HJ, Li S, Larkin DM, Kim H, Frantz LA, Caccamo M, Ahn H, Aken BL, Anselmo A, Anthon C, Auvil L, Badaoui B, Beattie CW, Bendixen C, Berman D, Blecha F, Blomberg J, Bolund L, Bosse M, Botti S, Bujie Z, Bystrom M, Capitanu B, Carvalho-Silva D, Chardon P, Chen C, Cheng R, Choi SH, Chow W, Clark RC, Clee C, Crooijmans RP, Dawson HD, Dehais P, De Sapio F, Dibbits B, Drou N, Du ZQ, Eversole K, Fadista J, Fairley S, Faraut T, Faulkner GJ, Fowler KE, Fredholm M, Fritz E, Gilbert JG, Giuffra E, Gorodkin J, Griffin DK, Harrow JL, Hayward A, Howe K, Hu ZL, Humphray SJ, Hunt T, Hornshøj H, Jeon JT, Jern P, Jones M, Jurka J, Kanamori H, Kapetanovic R, Kim J, Kim JH, Kim KW, Kim TH, Larson G, Lee K, Lee KT, Leggett R, Lewin HA, Li Y, Liu W, Loveland JE, Lu Y, Lunney JK, Ma J, Madsen O, Mann K, Matthews L, McLaren S, Morozumi T, Murtaugh MP, Narayan J, Nguyen DT, Ni P, Oh SJ, Onteru S, Panitz F, Park EW, Park HS, Pascal G, Paudel Y, Perez-Enciso M, Ramirez-Gonzalez R, Reecy JM, Rodriguez-Zas S, Rohrer GA, Rund L, Sang Y, Schachtschneider K, Schraiber JG, Schwartz J, Scobie L, Scott C, Searle S, Servin B, Southey BR, Sperber G, Stadler P, Sweedler JV, Tafer H, Thomsen B, Wali R, Wang J, Wang J, White S, Xu X, Yerle M, Zhang G, Zhang J, Zhang J, Zhao S, Rogers J, Churcher C, and Schook LB
- Subjects
- Animals, Demography, Models, Animal, Molecular Sequence Data, Population Dynamics, Genome genetics, Phylogeny, Sus scrofa classification, Sus scrofa genetics
- Abstract
For 10,000 years pigs and humans have shared a close and complex relationship. From domestication to modern breeding practices, humans have shaped the genomes of domestic pigs. Here we present the assembly and analysis of the genome sequence of a female domestic Duroc pig (Sus scrofa) and a comparison with the genomes of wild and domestic pigs from Europe and Asia. Wild pigs emerged in South East Asia and subsequently spread across Eurasia. Our results reveal a deep phylogenetic split between European and Asian wild boars ∼1 million years ago, and a selective sweep analysis indicates selection on genes involved in RNA processing and regulation. Genes associated with immune response and olfaction exhibit fast evolution. Pigs have the largest repertoire of functional olfactory receptor genes, reflecting the importance of smell in this scavenging animal. The pig genome sequence provides an important resource for further improvements of this important livestock species, and our identification of many putative disease-causing variants extends the potential of the pig as a biomedical model.
- Published
- 2012
- Full Text
- View/download PDF
43. A high-coverage genome sequence from an archaic Denisovan individual.
- Author
-
Meyer M, Kircher M, Gansauge MT, Li H, Racimo F, Mallick S, Schraiber JG, Jay F, Prüfer K, de Filippo C, Sudmant PH, Alkan C, Fu Q, Do R, Rohland N, Tandon A, Siebauer M, Green RE, Bryc K, Briggs AW, Stenzel U, Dabney J, Shendure J, Kitzman J, Hammer MF, Shunkov MV, Derevianko AP, Patterson N, Andrés AM, Eichler EE, Slatkin M, Reich D, Kelso J, and Pääbo S
- Subjects
- Alleles, Animals, Base Sequence, Fossils, Gene Flow, Gene Library, Humans, Molecular Sequence Data, Sequence Analysis, DNA, Genetic Variation, Genome, Human genetics, Heterozygote, Neanderthals genetics
- Abstract
We present a DNA library preparation method that has allowed us to reconstruct a high-coverage (30×) genome sequence of a Denisovan, an extinct relative of Neandertals. The quality of this genome allows a direct estimation of Denisovan heterozygosity indicating that genetic diversity in these archaic hominins was extremely low. It also allows tentative dating of the specimen on the basis of "missing evolution" in its genome, detailed measurements of Denisovan and Neandertal admixture into present-day human populations, and the generation of a near-complete catalog of genetic changes that swept to high frequency in modern humans since their divergence from Denisovans.
- Published
- 2012
- Full Text
- View/download PDF
44. Evolution of a membrane protein regulon in Saccharomyces.
- Author
-
Martin HC, Roop JI, Schraiber JG, Hsu TY, and Brem RB
- Subjects
- Gene Expression Profiling, Promoter Regions, Genetic, Saccharomyces cerevisiae genetics, Saccharomyces cerevisiae Proteins genetics, Evolution, Molecular, Fungal Proteins genetics, Membrane Proteins genetics, Saccharomyces genetics
- Abstract
Expression variation is widespread between species. The ability to distinguish regulatory change driven by natural selection from the consequences of neutral drift remains a major challenge in comparative genomics. In this work, we used observations of mRNA expression and promoter sequence to analyze signatures of selection on groups of functionally related genes in Saccharomycete yeasts. In a survey of gene regulons with expression divergence between Saccharomyces cerevisiae and S. paradoxus, we found that most were subject to variation in trans-regulatory factors that provided no evidence against a neutral model. However, we identified one regulon of membrane protein genes controlled by unlinked cis- and trans-acting determinants with coherent effects on gene expression, consistent with a history of directional, nonneutral evolution. For this membrane protein group, S. paradoxus alleles at regulatory loci were associated with elevated expression and altered stress responsiveness relative to other yeasts. In a phylogenetic comparison of promoter sequences of the membrane protein genes between species, the S. paradoxus lineage was distinguished by a short branch length, indicative of strong selective constraint. Likewise, sequence variants within the S. paradoxus population, but not across strains of other yeasts, were skewed toward low frequencies in promoters of genes in the membrane protein regulon, again reflecting strong purifying selection. Our results support a model in which a distinct expression program for the membrane protein genes in S. paradoxus has been preferentially maintained by negative selection as the result of an increased importance to organismal fitness. These findings illustrate the power of integrating expression- and sequence-based tests of natural selection in the study of evolutionary forces that underlie regulatory change.
- Published
- 2012
- Full Text
- View/download PDF
45. Constraints on the use of lifespan-shortening Wolbachia to control dengue fever.
- Author
-
Schraiber JG, Kaczmarczyk AN, Kwok R, Park M, Silverstein R, Rutaganira FU, Aggarwal T, Schwemmer MA, Hom CL, Grosberg RK, and Schreiber SJ
- Subjects
- Aedes physiology, Aedes virology, Animals, Dengue transmission, Fertility physiology, Host-Pathogen Interactions, Humans, Insect Vectors virology, Longevity, Models, Biological, Population Dynamics, Aedes microbiology, Dengue prevention & control, Insect Vectors microbiology, Pest Control, Biological methods, Wolbachia physiology
- Abstract
Dengue fever, a viral disease spread by the mosquito Aedes aegypti, affects 50-100 million people a year in many tropical countries. Because the virus must incubate within mosquito hosts for two weeks before being able to transmit the infection, shortening the lifespan of mosquitoes may curtail dengue transmission. We developed a continuous time reaction-diffusion model of the spatial spread of Wolbachia through a population of A. aegypti. This model incorporates the lifespan-shortening effects of Wolbachia on infected A. aegypti and the fitness advantage to infected females due to cytoplasmic incompatibility (CI). We found that local establishment of the Wolbachia infection can occur if the fitness advantage due to CI exceeds the fitness reduction due to lifespan-shortening effects, in accordance with earlier results concerning fecundity reduction. However, spatial spread is possible only if the fitness advantage due to CI is twice as great as the fitness reduction due to lifespan shortening effects. Moreover, lifespan-shortening and fecundity-reduction can have different effects on the speed of wave-retreat. Using data from the literature, we estimated all demographic parameters for infected and uninfected mosquitoes and computed the velocities of spread of infection. Our most optimistic estimates suggest that the spatial spread of lifespan-shortening Wolbachia may be so slow that efficient spatial spread would require a prohibitively large number of point releases. However, as these estimates of demographic parameters may not accurately reflect natural conditions, further research is necessary to corroborate these predictions., (Copyright © 2011 Elsevier Ltd. All rights reserved.)
- Published
- 2012
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.