93 results on '"Alon Keinan"'
Search Results
2. Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits
- Author
-
Yingjie Guo, Chenxi Wu, Maozu Guo, Quan Zou, Xiaoyan Liu, and Alon Keinan
- Subjects
genome-wide association studies ,single nucleotide polymorphisms ,quantitative traits ,linear mixed model ,sparse group lasso ,Genetics ,QH426-470 - Abstract
Genome-Wide association studies (GWAS), based on testing one single nucleotide polymorphism (SNP) at a time, have revolutionized our understanding of the genetics of complex traits. In GWAS, there is a need to consider confounding effects such as due to population structure, and take groups of SNPs into account simultaneously due to the “polygenic” attribute of complex quantitative traits. In this paper, we propose a new approach SGL-LMM that puts together sparse group lasso (SGL) and linear mixed model (LMM) for multivariate associations of quantitative traits. LMM, as has been often used in GWAS, controls for confounders, while SGL maintains sparsity of the underlying multivariate regression model. SGL-LMM first sets a fixed zero effect to learn the parameters of random effects using LMM, and then estimates fixed effects using SGL regularization. We present efficient algorithms for hyperparameter tuning and feature selection using stability selection. While controlling for confounders and constraining for sparse solutions, SGL-LMM also provides a natural framework for incorporating prior biological information into the group structure underlying the model. Results based on both simulated and real data show SGL-LMM outperforms previous approaches in terms of power to detect associations and accuracy of quantitative trait prediction.
- Published
- 2019
- Full Text
- View/download PDF
3. Interaction between SNPs in the RXRA and near ANGPTL3 gene region inhibits apoB reduction after statin-fenofibric acid therapy in individuals with mixed dyslipidemia
- Author
-
Li Ma, Christie M. Ballantyne, John W. Belmont, Alon Keinan, and Ariel Brautbar
- Subjects
pharmacogenetics ,fibrates ,genetic variants ,single-nucleotide polymorphism ,apolipoprotein ,Biochemistry ,QD415-436 - Abstract
The mixed dyslipidemia phenotype is characterized by elevated triglycerides (TG), low HDL cholesterol (HDL-C), increased ApoB levels, and premature coronary atherosclerosis. Fibrate-statin combination therapy reduces ApoB levels and coronary events in the mixed dyslipidemia population. We sought to identify gene-gene interactions that affect ApoB response to statin-fenofibric acid therapy in the mixed dyslipidemia population. Using a predefined subset of single-nucleotide polymorphisms (SNPs) that were previously associated with TG, VLDL, or HDL-C, we applied gene-gene interaction testing in a randomized, double-blind, clinical trial examining the response to fenofibric acid (FNA) and its combination with statin in 1,865 individuals with mixed dyslipidemia. Of 11,783 possible SNP pairs examined, we detected a single significant interaction between rs12130333, located within the ANGPTL3 gene region, and rs4240705, within the RXRA gene, on ApoB reduction after statin-FNA therapy (P = 4.0 × 10−6). ApoB response to therapy gradually reduced with the increasing number of T alleles in the rs12130333 but only in the presence of the GG genotype of rs4240705. Individuals doubly homozygous for the minor alleles at rs12130333 and rs4240705 showed a paradoxical increase of 1.8% in ApoB levels after FNA-statin combination therapy. No gene-gene interaction was identified other than an interaction between SNPs in the ANGPTL3 and RXRA regions, which results in the inhibition of ApoB reduction in response to statin-FNA therapy. Further study is required to examine the clinical applicability of this genetic interaction and its effect on coronary events.
- Published
- 2012
- Full Text
- View/download PDF
4. The Genetics of Bene Israel from India Reveals Both Substantial Jewish and Indian Ancestry.
- Author
-
Yedael Y Waldman, Arjun Biddanda, Natalie R Davidson, Paul Billing-Ross, Maya Dubrovsky, Christopher L Campbell, Carole Oddoux, Eitan Friedman, Gil Atzmon, Eran Halperin, Harry Ostrer, and Alon Keinan
- Subjects
Medicine ,Science - Abstract
The Bene Israel Jewish community from West India is a unique population whose history before the 18th century remains largely unknown. Bene Israel members consider themselves as descendants of Jews, yet the identity of Jewish ancestors and their arrival time to India are unknown, with speculations on arrival time varying between the 8th century BCE and the 6th century CE. Here, we characterize the genetic history of Bene Israel by collecting and genotyping 18 Bene Israel individuals. Combining with 486 individuals from 41 other Jewish, Indian and Pakistani populations, and additional individuals from worldwide populations, we conducted comprehensive genome-wide analyses based on FST, principal component analysis, ADMIXTURE, identity-by-descent sharing, admixture linkage disequilibrium decay, haplotype sharing and allele sharing autocorrelation decay, as well as contrasted patterns between the X chromosome and the autosomes. The genetics of Bene Israel individuals resemble local Indian populations, while at the same time constituting a clearly separated and unique population in India. They are unique among Indian and Pakistani populations we analyzed in sharing considerable genetic ancestry with other Jewish populations. Putting together the results from all analyses point to Bene Israel being an admixed population with both Jewish and Indian ancestry, with the genetic contribution of each of these ancestral populations being substantial. The admixture took place in the last millennium, about 19-33 generations ago. It involved Middle-Eastern Jews and was sex-biased, with more male Jewish and local female contribution. It was followed by a population bottleneck and high endogamy, which can lead to increased prevalence of recessive diseases in this population. This study provides an example of how genetic analysis advances our knowledge of human history in cases where other disciplines lack the relevant data to do so.
- Published
- 2016
- Full Text
- View/download PDF
5. Population genomic analysis of 962 whole genome sequences of humans reveals natural selection in non-coding regions.
- Author
-
Fuli Yu, Jian Lu, Xiaoming Liu, Elodie Gazave, Diana Chang, Srilakshmi Raj, Haley Hunter-Zinck, Ran Blekhman, Leonardo Arbiza, Cris Van Hout, Alanna Morrison, Andrew D Johnson, Joshua Bis, L Adrienne Cupples, Bruce M Psaty, Donna Muzny, Jin Yu, Richard A Gibbs, Alon Keinan, Andrew G Clark, and Eric Boerwinkle
- Subjects
Medicine ,Science - Abstract
Whole genome analysis in large samples from a single population is needed to provide adequate power to assess relative strengths of natural selection across different functional components of the genome. In this study, we analyzed next-generation sequencing data from 962 European Americans, and found that as expected approximately 60% of the top 1% of positive selection signals lie in intergenic regions, 33% in intronic regions, and slightly over 1% in coding regions. Several detailed functional annotation categories in intergenic regions showed statistically significant enrichment in positively selected loci when compared to the null distribution of the genomic span of ENCODE categories. There was a significant enrichment of purifying selection signals detected in enhancers, transcription factor binding sites, microRNAs and target sites, but not on lincRNA or piRNAs, suggesting different evolutionary constraints for these domains. Loci in "repressed or low activity regions" and loci near or overlapping the transcription start site were the most significantly over-represented annotations among the top 1% of signals for positive selection.
- Published
- 2015
- Full Text
- View/download PDF
6. Principal component analysis characterizes shared pathogenetics from genome-wide association studies.
- Author
-
Diana Chang and Alon Keinan
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Genome-wide association studies (GWASs) have recently revealed many genetic associations that are shared between different diseases. We propose a method, disPCA, for genome-wide characterization of shared and distinct risk factors between and within disease classes. It flips the conventional GWAS paradigm by analyzing the diseases themselves, across GWAS datasets, to explore their "shared pathogenetics". The method applies principal component analysis (PCA) to gene-level significance scores across all genes and across GWASs, thereby revealing shared pathogenetics between diseases in an unsupervised fashion. Importantly, it adjusts for potential sources of heterogeneity present between GWAS which can confound investigation of shared disease etiology. We applied disPCA to 31 GWASs, including autoimmune diseases, cancers, psychiatric disorders, and neurological disorders. The leading principal components separate these disease classes, as well as inflammatory bowel diseases from other autoimmune diseases. Generally, distinct diseases from the same class tend to be less separated, which is in line with their increased shared etiology. Enrichment analysis of genes contributing to leading principal components revealed pathways that are implicated in the immune system, while also pointing to pathways that have yet to be explored before in this context. Our results point to the potential of disPCA in going beyond epidemiological findings of the co-occurrence of distinct diseases, to highlighting novel genes and pathways that unsupervised learning suggest to be key players in the variability across diseases.
- Published
- 2014
- Full Text
- View/download PDF
7. Analysis of multiple association studies provides evidence of an expression QTL hub in gene-gene interaction network affecting HDL cholesterol levels.
- Author
-
Li Ma, Christie Ballantyne, Ariel Brautbar, and Alon Keinan
- Subjects
Medicine ,Science - Abstract
Epistasis has been suggested to underlie part of the missing heritability in genome-wide association studies. In this study, we first report an analysis of gene-gene interactions affecting HDL cholesterol (HDL-C) levels in a candidate gene study of 2,091 individuals with mixed dyslipidemia from a clinical trial. Two additional studies, the Atherosclerosis Risk in Communities study (ARIC; n = 9,713) and the Multi-Ethnic Study of Atherosclerosis (MESA; n = 2,685), were considered for replication. We identified a gene-gene interaction between rs1532085 and rs12980554 (P = 7.1 × 10(-7)) in their effect on HDL-C levels, which is significant after Bonferroni correction (P(c) = 0.017) for the number of SNP pairs tested. The interaction successfully replicated in the ARIC study (P = 7.0 × 10(-4); P(c) = 0.02). Rs1532085, an expression QTL (eQTL) of LIPC, is one of the two SNPs involved in another, well-replicated gene-gene interaction underlying HDL-C levels. To further investigate the role of this eQTL SNP in gene-gene interactions affecting HDL-C, we tested in the ARIC study for interaction between this SNP and any other SNP genome-wide. We found the eQTL to be involved in a few suggestive interactions, one of which significantly replicated in MESA. Importantly, these gene-gene interactions, involving only rs1532085, explain an additional 1.4% variation of HDL-C, on top of the 0.65% explained by rs1532085 alone. LIPC plays a key role in the lipid metabolism pathway and it, and rs1532085 in particular, has been associated with HDL-C and other lipid levels. Collectively, we discovered several novel gene-gene interactions, all involving an eQTL of LIPC, thus suggesting a hub role of LIPC in the gene-gene interaction network that regulates HDL-C levels, which in turn raises the hypothesis that LIPC's contribution is largely via interactions with other lipid metabolism related genes.
- Published
- 2014
- Full Text
- View/download PDF
8. Accounting for eXentricities: analysis of the X chromosome in GWAS reveals X-linked genes implicated in autoimmune diseases.
- Author
-
Diana Chang, Feng Gao, Andrea Slavney, Li Ma, Yedael Y Waldman, Aaron J Sams, Paul Billing-Ross, Aviv Madar, Richard Spritz, and Alon Keinan
- Subjects
Medicine ,Science - Abstract
Many complex human diseases are highly sexually dimorphic, suggesting a potential contribution of the X chromosome to disease risk. However, the X chromosome has been neglected or incorrectly analyzed in most genome-wide association studies (GWAS). We present tailored analytical methods and software that facilitate X-wide association studies (XWAS), which we further applied to reanalyze data from 16 GWAS of different autoimmune and related diseases (AID). We associated several X-linked genes with disease risk, among which (1) ARHGEF6 is associated with Crohn's disease and replicated in a study of ulcerative colitis, another inflammatory bowel disease (IBD). Indeed, ARHGEF6 interacts with a gastric bacterium that has been implicated in IBD. (2) CENPI is associated with three different AID, which is compelling in light of known associations with AID of autosomal genes encoding centromere proteins, as well as established autosomal evidence of pleiotropy between autoimmune diseases. (3) We replicated a previous association of FOXP3, a transcription factor that regulates T-cell development and function, with vitiligo; and (4) we discovered that C1GALT1C1 exhibits sex-specific effect on disease risk in both IBDs. These and other X-linked genes that we associated with AID tend to be highly expressed in tissues related to immune response, participate in major immune pathways, and display differential gene expression between males and females. Combined, the results demonstrate the importance of the X chromosome in autoimmunity, reveal the potential of extensive XWAS, even based on existing data, and provide the tools and incentive to properly include the X chromosome in future studies.
- Published
- 2014
- Full Text
- View/download PDF
9. Gene-based testing of interactions in association studies of quantitative traits.
- Author
-
Li Ma, Andrew G Clark, and Alon Keinan
- Subjects
Genetics ,QH426-470 - Abstract
Various methods have been developed for identifying gene-gene interactions in genome-wide association studies (GWAS). However, most methods focus on individual markers as the testing unit, and the large number of such tests drastically erodes statistical power. In this study, we propose novel interaction tests of quantitative traits that are gene-based and that confer advantage in both statistical power and biological interpretation. The framework of gene-based gene-gene interaction (GGG) tests combine marker-based interaction tests between all pairs of markers in two genes to produce a gene-level test for interaction between the two. The tests are based on an analytical formula we derive for the correlation between marker-based interaction tests due to linkage disequilibrium. We propose four GGG tests that extend the following P value combining methods: minimum P value, extended Simes procedure, truncated tail strength, and truncated P value product. Extensive simulations point to correct type I error rates of all tests and show that the two truncated tests are more powerful than the other tests in cases of markers involved in the underlying interaction not being directly genotyped and in cases of multiple underlying interactions. We applied our tests to pairs of genes that exhibit a protein-protein interaction to test for gene-level interactions underlying lipid levels using genotype data from the Atherosclerosis Risk in Communities study. We identified five novel interactions that are not evident from marker-based interaction testing and successfully replicated one of these interactions, between SMAD3 and NEDD9, in an independent sample from the Multi-Ethnic Study of Atherosclerosis. We conclude that our GGG tests show improved power to identify gene-level interactions in existing, as well as emerging, association studies.
- Published
- 2013
- Full Text
- View/download PDF
10. Knowledge-driven analysis identifies a gene-gene interaction affecting high-density lipoprotein cholesterol levels in multi-ethnic populations.
- Author
-
Li Ma, Ariel Brautbar, Eric Boerwinkle, Charles F Sing, Andrew G Clark, and Alon Keinan
- Subjects
Genetics ,QH426-470 - Abstract
Total cholesterol, low-density lipoprotein cholesterol, triglyceride, and high-density lipoprotein cholesterol (HDL-C) levels are among the most important risk factors for coronary artery disease. We tested for gene-gene interactions affecting the level of these four lipids based on prior knowledge of established genome-wide association study (GWAS) hits, protein-protein interactions, and pathway information. Using genotype data from 9,713 European Americans from the Atherosclerosis Risk in Communities (ARIC) study, we identified an interaction between HMGCR and a locus near LIPC in their effect on HDL-C levels (Bonferroni corrected P(c) = 0.002). Using an adaptive locus-based validation procedure, we successfully validated this gene-gene interaction in the European American cohorts from the Framingham Heart Study (P(c) = 0.002) and the Multi-Ethnic Study of Atherosclerosis (MESA; P(c) = 0.006). The interaction between these two loci is also significant in the African American sample from ARIC (P(c) = 0.004) and in the Hispanic American sample from MESA (P(c) = 0.04). Both HMGCR and LIPC are involved in the metabolism of lipids, and genome-wide association studies have previously identified LIPC as associated with levels of HDL-C. However, the effect on HDL-C of the novel gene-gene interaction reported here is twice as pronounced as that predicted by the sum of the marginal effects of the two loci. In conclusion, based on a knowledge-driven analysis of epistasis, together with a new locus-based validation method, we successfully identified and validated an interaction affecting a complex trait in multi-ethnic populations.
- Published
- 2012
- Full Text
- View/download PDF
11. Predicting signatures of 'synthetic associations' and 'natural associations' from empirical patterns of human genetic variation.
- Author
-
Diana Chang and Alon Keinan
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Genome-wide association studies (GWAS) have in recent years discovered thousands of associated markers for hundreds of phenotypes. However, associated loci often only explain a relatively small fraction of heritability and the link between association and causality has yet to be uncovered for most loci. Rare causal variants have been suggested as one scenario that may partially explain these shortcomings. Specifically, Dickson et al. recently reported simulations of rare causal variants that lead to association signals of common, tag single nucleotide polymorphisms, dubbed "synthetic associations". However, an open question is what practical implications synthetic associations have for GWAS. Here, we explore the signatures exhibited by such "synthetic associations" and their implications based on patterns of genetic variation observed in human populations, thus accounting for human evolutionary history -a force disregarded in previous simulation studies. This is made possible by human population genetic data from HapMap 3 consisting of both resequencing and array-based genotyping data for the same set of individuals from multiple populations. We report that synthetic associations tend to be further away from the underlying risk alleles compared to "natural associations" (i.e. associations due to underlying common causal variants), but to a much lesser extent than previously predicted, with both the age and the effect size of the risk allele playing a part in this phenomenon. We find that while a synthetic association has a lower probability of capturing causal variants within its linkage disequilibrium block, sequencing around the associated variant need not extend substantially to have a high probability of capturing at least one causal variant. We also show that the minor allele frequency of synthetic associations is lower than of natural associations for most, but not all, loci that we explored. Finally, we find the variance in associated allele frequency to be a potential indicator of synthetic associations.
- Published
- 2012
- Full Text
- View/download PDF
12. The history of African gene flow into Southern Europeans, Levantines, and Jews.
- Author
-
Priya Moorjani, Nick Patterson, Joel N Hirschhorn, Alon Keinan, Li Hao, Gil Atzmon, Edward Burns, Harry Ostrer, Alkes L Price, and David Reich
- Subjects
Genetics ,QH426-470 - Abstract
Previous genetic studies have suggested a history of sub-Saharan African gene flow into some West Eurasian populations after the initial dispersal out of Africa that occurred at least 45,000 years ago. However, there has been no accurate characterization of the proportion of mixture, or of its date. We analyze genome-wide polymorphism data from about 40 West Eurasian groups to show that almost all Southern Europeans have inherited 1%-3% African ancestry with an average mixture date of around 55 generations ago, consistent with North African gene flow at the end of the Roman Empire and subsequent Arab migrations. Levantine groups harbor 4%-15% African ancestry with an average mixture date of about 32 generations ago, consistent with close political, economic, and cultural links with Egypt in the late middle ages. We also detect 3%-5% sub-Saharan African ancestry in all eight of the diverse Jewish populations that we analyzed. For the Jewish admixture, we obtain an average estimated date of about 72 generations. This may reflect descent of these groups from a common ancestral population that already had some African ancestry prior to the Jewish Diasporas.
- Published
- 2011
- Full Text
- View/download PDF
13. Human population differentiation is strongly correlated with local recombination rate.
- Author
-
Alon Keinan and David Reich
- Subjects
Genetics ,QH426-470 - Abstract
Allele frequency differences across populations can provide valuable information both for studying population structure and for identifying loci that have been targets of natural selection. Here, we examine the relationship between recombination rate and population differentiation in humans by analyzing two uniformly-ascertained, whole-genome data sets. We find that population differentiation as assessed by inter-continental F(ST) shows negative correlation with recombination rate, with F(ST) reduced by 10% in the tenth of the genome with the highest recombination rate compared with the tenth of the genome with the lowest recombination rate (P<
- Published
- 2010
- Full Text
- View/download PDF
14. Quantitative analysis of genetic and neuronal multi-perturbation experiments.
- Author
-
Alon Kaufman, Alon Keinan, Isaac Meilijson, Martin Kupiec, and Eytan Ruppin
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Perturbation studies, in which functional performance is measured after deletion, mutation, or lesion of elements of a biological system, have been traditionally employed in many fields in biology. The vast majority of these studies have been qualitative and have employed single perturbations, often resulting in little phenotypic effect. Recently, newly emerging experimental techniques have allowed researchers to carry out concomitant multi-perturbations and to uncover the causal functional contributions of system elements. This study presents a rigorous and quantitative multi-perturbation analysis of gene knockout and neuronal ablation experiments. In both cases, a quantification of the elements' contributions, and new insights and predictions, are provided. Multi-perturbation analysis has a potentially wide range of applications and is gradually becoming an essential tool in biology.
- Published
- 2005
- Full Text
- View/download PDF
15. Evolutionary Network Minimization: Adaptive Implicit Pruning of Successful Agents.
- Author
-
Zohar Ganon, Alon Keinan, and Eytan Ruppin
- Published
- 2003
- Full Text
- View/download PDF
16. Solving a Delayed Response Task with Spiking.
- Author
-
Keren Saggie, Alon Keinan, and Eytan Ruppin
- Published
- 2003
- Full Text
- View/download PDF
17. Neurocontroller Analysis via Evolutionary Network Minimization.
- Author
-
Zohar Ganon, Alon Keinan, and Eytan Ruppin
- Published
- 2006
- Full Text
- View/download PDF
18. Neural Processing of Counting in Evolved Spiking and McCulloch-Pitts Agents.
- Author
-
Keren Saggie-Wexler, Alon Keinan, and Eytan Ruppin
- Published
- 2006
- Full Text
- View/download PDF
19. Axiomatic Scalable Neurocontroller Analysis via the Shapley Value.
- Author
-
Alon Keinan, Ben Sandbank, Claus C. Hilgetag, Isaac Meilijson, and Eytan Ruppin
- Published
- 2006
- Full Text
- View/download PDF
20. Fair Attribution of Functional Contribution in Artificial and Biological Networks.
- Author
-
Alon Keinan, Ben Sandbank, Claus C. Hilgetag, Isaac Meilijson, and Eytan Ruppin
- Published
- 2004
- Full Text
- View/download PDF
21. Causal localization of neural function: the Shapley value method.
- Author
-
Alon Keinan, Claus C. Hilgetag, Isaac Meilijson, and Eytan Ruppin
- Published
- 2004
- Full Text
- View/download PDF
22. Spikes that count: rethinking spikiness in neurally embedded systems.
- Author
-
Keren Saggie, Alon Keinan, and Eytan Ruppin
- Published
- 2004
- Full Text
- View/download PDF
23. Fair localization of function via multi-lesion analysis.
- Author
-
Alon Keinan, Alon Kaufman, Nadia Sachs, Claus C. Hilgetag, and Eytan Ruppin
- Published
- 2004
- Full Text
- View/download PDF
24. Extensive disruption of protein interactions by genetic variants across the allele frequency spectrum in human populations
- Author
-
Siqi Liang, Li Yao, Christen A. Rivera-Erick, David Neil Cooper, Matthew Mort, Andrew G. Clark, Alon Keinan, Xiaomu Wei, Haiyuan Yu, Ting-Yi Wang, Peter D. Stenson, Kaixiong Ye, Jin Liang, Robert Fragoza, John C. Schimenti, Shayne D. Wierbowski, Juan Felipe Beltrán, Jishnu Das, and Tina N. Tran
- Subjects
0301 basic medicine ,Science ,Population ,Mutation, Missense ,General Physics and Astronomy ,Biology ,Genome ,Polymorphism, Single Nucleotide ,General Biochemistry, Genetics and Molecular Biology ,Article ,03 medical and health sciences ,Mice ,0302 clinical medicine ,Gene Frequency ,Genetic variation ,Missense mutation ,Animals ,Humans ,Disease ,Genetic Predisposition to Disease ,Allele ,education ,lcsh:Science ,Allele frequency ,Exome sequencing ,Alleles ,Genetics ,education.field_of_study ,Multidisciplinary ,Base Sequence ,Genome, Human ,Genetic Variation ,Rare variants ,General Chemistry ,Protein-protein interaction networks ,030104 developmental biology ,Genetics, Population ,HEK293 Cells ,Phenotype ,Mutation ,Human genome ,lcsh:Q ,Systems biology ,030217 neurology & neurosurgery ,Protein Binding - Abstract
Each human genome carries tens of thousands of coding variants. The extent to which this variation is functional and the mechanisms by which they exert their influence remains largely unexplored. To address this gap, we leverage the ExAC database of 60,706 human exomes to investigate experimentally the impact of 2009 missense single nucleotide variants (SNVs) across 2185 protein-protein interactions, generating interaction profiles for 4797 SNV-interaction pairs, of which 421 SNVs segregate at > 1% allele frequency in human populations. We find that interaction-disruptive SNVs are prevalent at both rare and common allele frequencies. Furthermore, these results suggest that 10.5% of missense variants carried per individual are disruptive, a higher proportion than previously reported; this indicates that each individual’s genetic makeup may be significantly more complex than expected. Finally, we demonstrate that candidate disease-associated mutations can be identified through shared interaction perturbations between variants of interest and known disease mutations., Low frequency coding single-nucleotide variants (SNVs) are predicted to disproportionately affect protein function. Here, the authors evaluate 2,009 missense SNVs across 2,185 protein-protein interactions using yeast two-hybrid and protein complementation assays and find that disruptive SNVs often occur in disease-associated genes.
- Published
- 2019
25. NRE: A tool for exploring neutral loci in the human genome.
- Author
-
Leonardo Arbiza, Elaine Zhong, and Alon Keinan
- Published
- 2012
- Full Text
- View/download PDF
26. Clustered mutations in hominid genome evolution are consistent with APOBEC3G enzymatic activity
- Author
-
Erez Y. Levanon, Leonardo Arbiza, Alon Keinan, Aaron J. Sams, Orshay Gabay, and Yishay Pinto
- Subjects
0301 basic medicine ,Genetics ,Genome evolution ,Mutation ,Research ,Point mutation ,Mutagenesis (molecular biology technique) ,Hominidae ,APOBEC-3G Deaminase ,Exaptation ,Biology ,medicine.disease_cause ,Genome ,Evolution, Molecular ,03 medical and health sciences ,030104 developmental biology ,Germline mutation ,medicine ,Animals ,Humans ,Gene family ,Genetics (clinical) - Abstract
The gradual accumulation of mutations by any of a number of mutational processes is a major driving force of divergence and evolution. Here, we investigate a potentially novel mutational process that is based on the activity of members of the AID/APOBEC family of deaminases. This gene family has been recently shown to introduce—in multiple types of cancer—enzyme-induced clusters of co-occurring somatic mutations caused by cytosine deamination. Going beyond somatic mutations, we hypothesized that APOBEC3—following its rapid expansion in primates—can introduce unique germline mutation clusters that can play a role in primate evolution. In this study, we tested this hypothesis by performing a comprehensive comparative genomic screen for APOBEC3-induced mutagenesis patterns across different hominids. We detected thousands of mutation clusters introduced along primate evolution which exhibit features that strongly fit the known patterns of APOBEC3G mutagenesis. These results suggest that APOBEC3G-induced mutations have contributed to the evolution of all genomes we studied. This is the first indication of site-directed, enzyme-induced genome evolution, which played a role in the evolution of both modern and archaic humans. This novel mutational mechanism exhibits several unique features, such as its higher tendency to mutate transcribed regions and regulatory elements and its ability to generate clusters of concurrent point mutations that all occur in a single generation. Our discovery demonstrates the exaptation of an anti-viral mechanism as a new source of genomic variation in hominids with a strong potential for functional consequences.
- Published
- 2016
- Full Text
- View/download PDF
27. Positive Selection on a Regulatory Insertion–Deletion Polymorphism inFADS2Influences Apparent Endogenous Synthesis of Arachidonic Acid
- Author
-
Kaixiong Ye, Kumar S.D. Kothapalli, Hui Gyu Park, Kinsley Ojukwu, Alon Keinan, Kalpana Joshi, Carlson Susan E, Ji Yao Zhang, Stephanie S. Hyon, Zhenglong Gu, Maithili S. Gadgil, J. Thomas Brenna, Kimberly O. O'Brien, and James Zou
- Subjects
Adult ,Fatty Acid Desaturases ,Male ,0301 basic medicine ,FADS1 ,FADS2 ,Linoleic acid ,Population ,Biology ,Polymorphism, Single Nucleotide ,Young Adult ,03 medical and health sciences ,chemistry.chemical_compound ,Delta-5 Fatty Acid Desaturase ,Gene Frequency ,INDEL Mutation ,positive selection ,population divergence ,Genotype ,Genetics ,Humans ,indel (insertion–deletion) ,Selection, Genetic ,education ,Molecular Biology ,Allele frequency ,Discoveries ,Alleles ,Phospholipids ,Ecology, Evolution, Behavior and Systematics ,education.field_of_study ,Arachidonic Acid ,Haplotype ,Genetic Variation ,Genotype frequency ,030104 developmental biology ,Haplotypes ,chemistry ,Fatty Acids, Unsaturated ,Female ,Databases, Nucleic Acid ,long chain polyunsaturated fatty acids - Abstract
Long chain polyunsaturated fatty acids (LCPUFA) are bioactive components of membrane phospholipids and serve as substrates for signaling molecules. LCPUFA can be obtained directly from animal foods or synthesized endogenously from 18 carbon precursors via the FADS2 coded enzyme. Vegans rely almost exclusively on endogenous synthesis to generate LCPUFA and we hypothesized that an adaptive genetic polymorphism would confer advantage. The rs66698963 polymorphism, a 22-bp insertion–deletion within FADS2, is associated with basal FADS1 expression, and coordinated induction of FADS1 and FADS2 in vitro. Here, we determined rs66698963 genotype frequencies from 234 individuals of a primarily vegetarian Indian population and 311 individuals from the US. A much higher I/I genotype frequency was found in Indians (68%) than in the US (18%). Analysis using 1000 Genomes Project data confirmed our observation, revealing a global I/I genotype of 70% in South Asians, 53% in Africans, 29% in East Asians, and 17% in Europeans. Tests based on population divergence, site frequency spectrum, and long-range haplotype consistently point to positive selection encompassing rs66698963 in South Asian, African, and some East Asian populations. Basal plasma phospholipid arachidonic acid (ARA) status was 8% greater in I/I compared with D/D individuals. The biochemical pathway product–precursor difference, ARA minus linoleic acid, was 31% and 13% greater for I/I and I/D compared with D/D, respectively. This study is consistent with previous in vitro data suggesting that the insertion allele enhances n-6 LCPUFA synthesis and may confer an adaptive advantage in South Asians because of the traditional plant-based diet practice.
- Published
- 2016
- Full Text
- View/download PDF
28. Strong Constraint on Human Genes Escaping X-Inactivation Is Modulated by their Expression Level and Breadth in Both Sexes
- Author
-
Andrea Slavney, Andrew G. Clark, Leonardo Arbiza, and Alon Keinan
- Subjects
Male ,0301 basic medicine ,Gene Expression ,Biology ,Genome ,X-inactivation ,03 medical and health sciences ,Negative selection ,Genes, X-Linked ,X Chromosome Inactivation ,Transcription (biology) ,Gene expression ,Genetics ,Humans ,Gene silencing ,Selection, Genetic ,Molecular Biology ,Gene ,Discoveries ,Ecology, Evolution, Behavior and Systematics ,Chromosomes, Human, X ,Models, Genetic ,Genome, Human ,030104 developmental biology ,Female ,Human genome - Abstract
In eutherian mammals, X-linked gene expression is normalized between XX females and XY males through the process of X chromosome inactivation (XCI). XCI results in silencing of transcription from one ChrX homolog per female cell. However, approximately 25% of human ChrX genes escape XCI to some extent and exhibit biallelic expression in females. The evolutionary basis of this phenomenon is not entirely clear, but high sequence conservation of XCI escapers suggests that purifying selection may directly or indirectly drive XCI escape at these loci. One hypothesis is that this signal results from contributions to developmental and physiological sex differences, but presently there is limited evidence supporting this model in humans. Another potential driver of this signal is selection for high and/or broad gene expression in both sexes, which are strong predictors of reduced nucleotide substitution rates in mammalian genes. Here, we compared purifying selection and gene expression patterns of human XCI escapers with those of X-inactivated genes in both sexes. When we accounted for the functional status of each ChrX gene's Y-linked homolog (or "gametolog"), we observed that XCI escapers exhibit greater degrees of purifying selection in the human lineage than X-inactivated genes, as well as higher and broader gene expression than X-inactivated genes across tissues in both sexes. These results highlight a significant role for gene expression in both sexes in driving purifying selection on XCI escapers, and emphasize these genes' potential importance in human disease.
- Published
- 2015
- Full Text
- View/download PDF
29. The utility of ancient human DNA for improving allele age estimates, with implications for demographic models and tests of natural selection
- Author
-
Alon Keinan, Aaron J. Sams, and John Hawks
- Subjects
Male ,Genetics ,education.field_of_study ,Linkage disequilibrium ,Natural selection ,Population ,Context (language use) ,DNA ,Biology ,Article ,Anthropology, Physical ,Europe ,Evolution, Molecular ,Genetics, Population ,Ancient DNA ,Anthropology ,Animals ,Humans ,Selection, Genetic ,Allele ,education ,Allele frequency ,Ecology, Evolution, Behavior and Systematics ,Exome sequencing ,Neanderthals - Abstract
The age of polymorphic alleles in humans is often estimated from population genetic patterns in extant human populations, such as allele frequencies, linkage disequilibrium, and rate of mutations. Ancient DNA can improve the accuracy of such estimates, as well as facilitate testing the validity of demographic models underlying many population genetic methods. Specifically, the presence of an allele in a genome derived from an ancient sample testifies that the allele is at least as old as that sample. In this study, we consider a common method for estimating allele age based on allele frequency as applied to variants from the US National Institutes of Health (NIH) Heart, Lung, and Blood Institute (NHLBI) Exome Sequencing Project. We view these estimates in the context of the presence or absence of each allele in the genomes of the 5300 year old Tyrolean Iceman, Ötzi, and of the 50,000 year old Altai Neandertal. Our results illuminate the accuracy of these estimates and their sensitivity to demographic events that were not included in the model underlying age estimation. Specifically, allele presence in the Iceman genome provides a good fit of allele age estimates to the expectation based on the age of that specimen. The equivalent based on the Neandertal genome leads to a poorer fit. This is likely due in part to the older age of the Neandertal and the older time of the split between modern humans and Neandertals, but also due to gene flow from Neandertals to modern humans not being considered in the underlying demographic model. Thus, the incorporation of ancient DNA can improve allele age estimation, demographic modeling, and tests of natural selection. Our results also point to the importance of considering a more diverse set of ancient samples for understanding the geographic and temporal range of individual alleles.
- Published
- 2015
- Full Text
- View/download PDF
30. XWAS: A Software Toolset for Genetic Data Analysis and Association Studies of the X Chromosome
- Author
-
Li Ma, Arjun Biddanda, Alon Keinan, Yingjie Guo, Zilu Zhou, Feng Gao, and Diana Chang
- Subjects
Male ,Computer Note ,Single-nucleotide polymorphism ,Genome-wide association study ,Computational biology ,Disease ,Biology ,Polymorphism, Single Nucleotide ,X-inactivation ,03 medical and health sciences ,0302 clinical medicine ,X Chromosome Inactivation ,Genetics ,Humans ,Bioinformatics and computational genetics ,GWAS ,SNP ,complex diseases ,Molecular Biology ,Genetics (clinical) ,X chromosome ,030304 developmental biology ,Genetic association ,Chromosomes, Human, X ,0303 health sciences ,software ,Inheritance (genetic algorithm) ,3. Good health ,Phenotype ,genetic association study ,sexual dimorphism ,030220 oncology & carcinogenesis ,chromosome X ,Trait ,Female ,Quantitative genetics and Mendelian inheritance ,Genome-Wide Association Study ,Biotechnology - Abstract
XWAS is a new software suite for the analysis of the X chromosome in association studies and similar genetic studies. The X chromosome plays an important role in human disease and traits of many species, especially those with sexually dimorphic characteristics. Special attention needs to be given to its analysis due to the unique inheritance pattern, which leads to analytical complications that have resulted in the majority of genome-wide association studies (GWAS) either not considering X or mishandling it with toolsets that had been designed for non-sex chromosomes. We hence developed XWAS to fill the need for tools that are specially designed for analysis of X. Following extensive, stringent, and X-specific quality control, XWAS offers an array of statistical tests of association, including: 1) the standard test between a SNP (single nucleotide polymorphism) and disease risk, including after first stratifying individuals by sex, 2) a test for a differential effect of a SNP on disease between males and females, 3) motivated by X-inactivation, a test for higher variance of a trait in heterozygous females as compared with homozygous females, and 4) for all tests, a version that allows for combining evidence from all SNPs across a gene. We applied the toolset analysis pipeline to 16 GWAS datasets of immune-related disorders and 7 risk factors of coronary artery disease, and discovered several new X-linked genetic associations. XWAS will provide the tools and incentive for others to incorporate the X chromosome into GWAS and similar studies in any species with an XX/XY system, hence enabling discoveries of novel loci implicated in many diseases and in their sexual dimorphism.
- Published
- 2015
- Full Text
- View/download PDF
31. Extensive pathogenicity of mitochondrial heteroplasmy in healthy human individuals
- Author
-
Kaixiong Ye, Zhenglong Gu, Alon Keinan, Jian Lu, and Fei Ma
- Subjects
Genetics ,Nonsynonymous substitution ,education.field_of_study ,Mitochondrial DNA ,Multidisciplinary ,Population ,Biological Sciences ,Biology ,Human mitochondrial genetics ,Heteroplasmy ,Negative selection ,1000 Genomes Project ,education ,Allele frequency - Abstract
A majority of mitochondrial DNA (mtDNA) mutations reported to be implicated in diseases are heteroplasmic, a status with coexisting mtDNA variants in a single cell. Quantifying the prevalence of mitochondrial heteroplasmy and its pathogenic effect in healthy individuals could further our understanding of its possible roles in various diseases. A total of 1,085 human individuals from 14 global populations have been sequenced by the 1000 Genomes Project to a mean coverage of ∼2,000× on mtDNA. Using a combination of stringent thresholds and a maximum-likelihood method to define heteroplasmy, we demonstrated that ∼90% of the individuals carry at least one heteroplasmy. At least 20% of individuals harbor heteroplasmies reported to be implicated in disease. Mitochondrial heteroplasmy tend to show high pathogenicity, and is significantly overrepresented in disease-associated loci. Consistent with their deleterious effect, heteroplasmies with derived allele frequency larger than 60% within an individual show a significant reduction in pathogenicity, indicating the action of purifying selection. Purifying selection on heteroplasmies can also be inferred from nonsynonymous and synonymous heteroplasmy comparison and the unfolded site frequency spectra for different functional sites in mtDNA. Nevertheless, in comparison with population polymorphic mtDNA mutations, the purifying selection is much less efficient in removing heteroplasmic mutations. The prevalence of mitochondrial heteroplasmy with high pathogenic potential in healthy individuals, along with the possibility of these mutations drifting to high frequency inside a subpopulation of cells across lifespan, emphasizes the importance of managing mitochondrial heteroplasmy to prevent disease progression.
- Published
- 2014
- Full Text
- View/download PDF
32. Rare LPL gene variants attenuate triglyceride reduction and HDL cholesterol increase in response to fenofibric acid therapy in individuals with mixed dyslipidemia
- Author
-
Alon Keinan, Ariel Brautbar, Feng Gao, Li Ma, Salim S. Virani, and Christie M. Ballantyne
- Subjects
Male ,medicine.medical_specialty ,Statin ,Apolipoprotein B ,medicine.drug_class ,Fibrate ,Article ,chemistry.chemical_compound ,Double-Blind Method ,Fenofibrate ,Internal medicine ,medicine ,Humans ,Genetic Predisposition to Disease ,Prospective Studies ,Triglycerides ,Dyslipidemias ,Hypolipidemic Agents ,Apolipoprotein C-III ,biology ,Triglyceride ,Cholesterol ,Cholesterol, HDL ,Genetic Variation ,nutritional and metabolic diseases ,medicine.disease ,Lipoprotein Lipase ,Phenotype ,Treatment Outcome ,Endocrinology ,chemistry ,Pharmacogenetics ,Apolipoprotein B-100 ,biology.protein ,Drug Therapy, Combination ,Female ,lipids (amino acids, peptides, and proteins) ,Hydroxymethylglutaryl-CoA Reductase Inhibitors ,Cardiology and Cardiovascular Medicine ,Biomarkers ,Dyslipidemia ,medicine.drug - Abstract
Objective Individuals with mixed dyslipidemia have elevated triglycerides (TG), low high-density lipoprotein cholesterol (HDL-C), and increased risk for coronary disease. Fibrate therapy is commonly used to lower TG and increase HDL-C. Common genetic variants are known to affect the response to fibrate therapy. We sought to identify rare genetic variants (frequency ≤ 1%) in genes involved in TG and HDL-C metabolism that affect the response to fenofibric acid (FA) therapy. Methods Four genes with a major role in HDL-C and TG metabolism APOA1 , APOC2 , APOC-III and LPL were sequenced in 2385 participants with mixed dyslipidemia in a randomized, double-blind, active-controlled study comparing therapy with FA alone, in combination with statins, or statin alone. Rare variants collapsing or SKAT methods were used for the analysis. Results Synonymous rare variants in the LPL gene were significantly associated with absolute HDL-C change ( P = 9 × 10 −4 ) and TG percent change ( P = 6.76 × 10 −4 ) in those treated with FA only. Participants with these rare variants had a 2 mg/dL increase in HDL-C and 39 mg/dL decrease in TG as compared to 6.2 mg/dL increase in HDL-C and 100 mg/dL decrease in TG in those without these variants. Rare variants in the APOC-III gene were associated with a modest 3 mg/dL less reduction in APOB ( P = 8.72 × 10 −4 ) in those receiving FA and statin. Conclusion In individuals with mixed dyslipidemia rare synonymous variants within LPL gene were associated with attenuated response to FA therapy while APOCIII rare variants were associated with a modest effect on APOB response to FA-statin therapy. These results should be replicated in a similar clinical trial for further confirmation.
- Published
- 2014
- Full Text
- View/download PDF
33. Crowdsourced genealogies and genomes
- Author
-
Alexandre A. Lussier and Alon Keinan
- Subjects
0301 basic medicine ,Multidisciplinary ,Civilization ,business.industry ,media_common.quotation_subject ,Big data ,Research opportunities ,Genome ,Data science ,Article ,03 medical and health sciences ,030104 developmental biology ,Geography ,Crowdsourcing ,Humans ,business ,Genealogy and Heraldry ,media_common - Abstract
Family trees have vast applications in multiple fields from genetics to anthropology and economics. However, the collection of extended family trees is tedious and usually relies on resources with limited geographical scope and complex data usage restrictions. Here, we collected 86 million profiles from publicly-available online data shared by genealogy enthusiasts. After extensive cleaning and validation, we obtained population-scale family trees, including a single pedigree of 13 million individuals. We leveraged the data to partition the genetic architecture of longevity by inspecting millions of relative pairs and to provide insights into the geographical dispersion of families. We also report a simple digital procedure to overlay other datasets with our resource in order to empower studies with population-scale genealogical data.
- Published
- 2018
- Full Text
- View/download PDF
34. Association between rs2294020 in X-linked CCDC22 and susceptibility to autoimmune diseases with focus on systemic lupus erythematosus
- Author
-
Lauren J. Lo, Mariagrazia Granata, Giulio A. Rossi, Chiara Trovato, Alon Keinan, Maria Clorinda Mazzarino, Evangelia Skarmoutsou, Chiara Bellocchi, Maurizio Marchini, Fabio D’Amico, and Raffaella Scorza
- Subjects
0301 basic medicine ,Male ,Settore MED/09 - Medicina Interna ,Vitiligo ,Autoimmune disease ,CCDC22 gene ,FOXP3 gene ,Single nucleotide polymorphism ,Systemic lupus erythematosus ,Adult ,Autoimmune Diseases ,Case-Control Studies ,Exons ,Female ,Genotype ,Humans ,Lupus Erythematosus, Systemic ,Odds Ratio ,Phenotype ,Polymorphism, Single Nucleotide ,Proteins ,Alleles ,Genes, X-Linked ,Genetic Association Studies ,Genetic Predisposition to Disease ,medicine.disease_cause ,Autoimmunity ,0302 clinical medicine ,Immunology and Allergy ,skin and connective tissue diseases ,FOXP3 ,Single Nucleotide ,Rheumatoid arthritis ,Immunology ,Article ,03 medical and health sciences ,Blisibimod ,Psoriasis ,medicine ,Polymorphism ,Lupus Erythematosus ,business.industry ,Systemic ,X-Linked ,medicine.disease ,030104 developmental biology ,Genes ,business ,030217 neurology & neurosurgery ,Anti-SSA/Ro autoantibodies - Abstract
Autoimmune diseases often share common susceptibility genes. Most genetic variants associated with susceptibility to systemic lupus erythematosus are also associated with other autoimmune diseases. The X-linked variant rs2294020 is positioned in exon 7 of the CCDC22 gene. The encoded protein functions in the regulation of NF-kB, a master regulator in immune response. The aim of this study is to investigate whether the rs2294020 polymorphism may be a general susceptibility factor for autoimmunity. We evaluated case-control association between the occurrence of rs2294020 and different autoimmune diseases, including new data for systemic lupus erythematosus and previous genome-wide association studies (GWAS) (though most did not analyse the X chromosome) of psoriasis, celiac disease, Crohn’s disease, ulcerative colitis, multiple sclerosis, vitiligo, type-1 diabetes, rheumatoid arthritis, and ankylosing spondylitis. Cases from patients affected by amyotrophic lateral sclerosis and type-2 diabetes were also included in the study. We detected nominal significant associations of rs2294020 with systemic lupus erythematosus (additive model test: p=0.01), vitiligo (p =0.016), psoriasis (p =0.038), and in only one of two studies of multiple sclerosis (p =0.03). Our results suggest that rs2294020 is associated with the risk of several autoimmune diseases in European populations, specifically with diseases that present themselves, among else, in the skin.
- Published
- 2017
35. Population Growth Inflates the Per-Individual Number of Deleterious Mutations and Reduces Their Mean Effect
- Author
-
Alon Keinan, Elodie Gazave, Diana Chang, and Andrew G. Clark
- Subjects
Population fragmentation ,Population ,Population genetics ,deleterious mutations ,Biology ,Investigations ,03 medical and health sciences ,Negative selection ,0302 clinical medicine ,Genetics ,Chromosomes, Human ,Humans ,Computer Simulation ,human ,Selection, Genetic ,education ,Population Growth ,Population and Evolutionary Genetics ,Alleles ,030304 developmental biology ,Population Density ,0303 health sciences ,education.field_of_study ,Natural selection ,Models, Genetic ,Population size ,Selection coefficient ,exponential growth ,demographic history ,Genetics, Population ,Evolutionary biology ,Mutation ,Mutation–selection balance ,Genetic Fitness ,purifying selection ,030217 neurology & neurosurgery - Abstract
This study addresses the question of how purifying selection operates during recent rapid population growth such as has been experienced by human populations. This is not a straightforward problem because the human population is not at equilibrium: population genetics predicts that, on the one hand, the efficacy of natural selection increases as population size increases, eliminating ever more weakly deleterious variants; on the other hand, a larger number of deleterious mutations will be introduced into the population and will be more likely to increase in their number of copies as the population grows. To understand how patterns of human genetic variation have been shaped by the interaction of natural selection and population growth, we examined the trajectories of mutations with varying selection coefficients, using computer simulations. We observed that while population growth dramatically increases the number of deleterious segregating sites in the population, it only mildly increases the number carried by each individual. Our simulations also show an increased efficacy of natural selection, reflected in a higher fraction of deleterious mutations eliminated at each generation and a more efficient elimination of the most deleterious ones. As a consequence, while each individual carries a larger number of deleterious alleles than expected in the absence of growth, the average selection coefficient of each segregating allele is less deleterious. Combined, our results suggest that the genetic risk of complex diseases in growing populations might be distributed across a larger number of more weakly deleterious rare variants.
- Published
- 2013
36. Association of mitochondrial DNA variants with myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) symptoms
- Author
-
Zhenglong Gu, Alon Keinan, Paul Billing-Ross, Kaixiong Ye, Arnaud Germain, and Maureen R. Hanson
- Subjects
0301 basic medicine ,medicine.medical_specialty ,Mitochondrial DNA ,Myalgic encephalomyelitis (ME) ,Encephalomyelitis ,Single-nucleotide polymorphism ,Biology ,Heteroplasmy ,DNA, Mitochondrial ,Haplogroup ,General Biochemistry, Genetics and Molecular Biology ,Association ,03 medical and health sciences ,Internal medicine ,medicine ,Chronic fatigue syndrome ,Humans ,Association (psychology) ,Medicine(all) ,Genetics ,Fatigue Syndrome, Chronic ,mtDNA ,Biochemistry, Genetics and Molecular Biology(all) ,Variants ,Chronic fatigue syndrome (CFS) ,General Medicine ,medicine.disease ,030104 developmental biology ,Mutation ,Commentary ,Next-generation sequencing ,SNPs ,Human mitochondrial DNA haplogroup - Abstract
Earlier this year, we described an analysis of mitochondrial DNA (mtDNA) variants in myalgic encephalomyelitis (ME)/chronic fatigue syndrome (CFS) patients and healthy controls. We reported that there was no significant association of haplogroups or singe nucleotide polymorphisms (SNPs) with disease status. Nevertheless, a commentary about our paper appeared (Finsterer and Zarrouk-Mahjoub. J Transl Med14:182, 2016) that criticized the association of mtDNA haplogroups with ME/CFS, a conclusion that was absent from our paper. The aforementioned commentary also demanded experiments that were outside of the scope of our study, ones that we had suggested as follow-up studies. Because they failed to consult a published and cited report describing the cohorts we studied, the authors also cast aspersions on the method of selection of cases for inclusion. We reiterate that we observed statistically significant association of mtDNA variants with particular symptoms and their severity, though we observed no association with disease status.
- Published
- 2016
- Full Text
- View/download PDF
37. Explosive genetic evidence for explosive human population growth
- Author
-
Feng Gao and Alon Keinan
- Subjects
0301 basic medicine ,Population ,Biology ,Polymorphism, Single Nucleotide ,Article ,White People ,03 medical and health sciences ,0302 clinical medicine ,Genetic variation ,Genetics ,Population growth ,Humans ,Genetic Predisposition to Disease ,Selection, Genetic ,education ,Population Growth ,Selection (genetic algorithm) ,education.field_of_study ,Genome, Human ,Population size ,High-Throughput Nucleotide Sequencing ,Human genetics ,Genetic architecture ,030104 developmental biology ,Genetics, Population ,Evolutionary biology ,030217 neurology & neurosurgery ,Developmental Biology ,Personal genomics - Abstract
The advent of next-generation sequencing technology has allowed the collection of vast amounts of genetic variation data. A recurring discovery from studying larger and larger samples of individuals had been the extreme, previously unexpected, excess of very rare genetic variants, which has been shown to be mostly due to the recent explosive growth of human populations. Here, we review recent literature that inferred recent changes in population size in different human populations and with different methodologies, with many pointing to recent explosive growth, especially in European populations for which more data has been available. We also review the state-of-the-art methods and software for the inference of historical population size changes that lead to these discoveries. Finally, we discuss the implications of recent population growth on personalized genomics, on purifying selection in the non-equilibrium state it entails and, as a consequence, on the genetic architecture underlying complex disease and the performance of mapping methods in discovering rare variants that contribute to complex disease risk.
- Published
- 2016
38. High-resolution DNA accessibility profiles increase the discovery and interpretability of genetic associations
- Author
-
Feng Gao, Diana Chang, Andrew G. Clark, Timothy J. Vyse, Aviv Madar, Alon Keinan, Aaron J. Sams, Cunninghame Graham Ds, and Yedael Y. Waldman
- Subjects
Genetics ,0303 health sciences ,Genome-wide association study ,Locus (genetics) ,Computational biology ,Biology ,Genome ,Manhattan plot ,03 medical and health sciences ,chemistry.chemical_compound ,Annotation ,0302 clinical medicine ,chemistry ,Regulatory sequence ,030217 neurology & neurosurgery ,DNA ,030304 developmental biology ,Interpretability - Abstract
Genetic risk for common autoimmune diseases is influenced by hundreds of small effect, mostly non-coding variants, enriched in regulatory regions active in adaptive-immune cell types. DNaseI hypersensitivity sites (DHSs) are a genomic mark for regulatory DNA. Here, we generated a single DHSs annotation from fifteen deeply sequenced DNase-seq experiments in adaptive-immune as well as non-immune cell types. Using this annotation we quantified accessibility across cell types in a matrix format amenable to statistical analysis, deduced the subset of DHSs unique to adaptive-immune cell types, and grouped DHSs by cell-type accessibility profiles. Measuring enrichment with cell-type-specific TF binding sites as well as proximal gene expression and function, we show that accessibility profiles grouped DHSs into coherent regulatory functions. Using the adaptive-immune-specific DHSs as input (0.37% of genome), we associated DHSs to six autoimmune diseases with GWAS data. Associated loci showed higher replication rates when compared to loci identified by GWAS or by considering all DHSs, allowing the additional discovery of 327 loci (FDR
- Published
- 2016
- Full Text
- View/download PDF
39. The genetic history of Cochin Jews from India
- Author
-
Yedael Y. Waldman, Eitan Friedman, Christopher L. Campbell, Alon Keinan, Maya Dubrovsky, Harry Ostrer, Gil Atzmon, Arjun Biddanda, Carole Oddoux, and Eran Halperin
- Subjects
0301 basic medicine ,Linkage disequilibrium ,Genotype ,Genetic genealogy ,Judaism ,Population ,India ,Biology ,Linkage Disequilibrium ,03 medical and health sciences ,0302 clinical medicine ,Asian People ,Genetics ,Humans ,Genetics(clinical) ,Israel ,education ,Genetics (clinical) ,Alleles ,Original Investigation ,education.field_of_study ,Autosome ,Genome, Human ,Haplotype ,Genealogy ,030104 developmental biology ,Population bottleneck ,Genetics, Population ,Haplotypes ,Endogamy ,Jews ,030217 neurology & neurosurgery - Abstract
Cochin Jews form a small and unique community on the Malabar coast in southwest India. While the arrival time of any putative Jewish ancestors of the community has been speculated to have taken place as far back as biblical times (King Solomon’s era), a Jewish community in the Malabar coast has been documented only since the 9th century CE. Here, we explore the genetic history of Cochin Jews by collecting and genotyping 21 community members and combining the data with that of 707 individuals from 72 other Indian, Jewish, and Pakistani populations, together with additional individuals from worldwide populations. We applied comprehensive genome-wide analyses based on principal component analysis, FST, ADMIXTURE, identity-by-descent sharing, admixture linkage disequilibrium decay, haplotype sharing, allele sharing autocorrelation decay and contrasting the X chromosome with the autosomes. We find that, as reported by several previous studies, the genetics of Cochin Jews resembles that of local Indian populations. However, we also identify considerable Jewish genetic ancestry that is not present in any other Indian or Pakistani populations (with the exception of the Jewish Bene Israel, which we characterized previously). Combined, Cochin Jews have both Jewish and Indian ancestry. Specifically, we detect a significant recent Jewish gene flow into this community 13–22 generations (~470–730 years) ago, with contributions from Yemenite, Sephardi, and Middle-Eastern Jews, in accordance with historical records. Genetic analyses also point to high endogamy and a recent population bottleneck in this population, which might explain the increased prevalence of some recessive diseases in Cochin Jews. Electronic supplementary material The online version of this article (doi:10.1007/s00439-016-1698-y) contains supplementary material, which is available to authorized users.
- Published
- 2016
40. Positive selection on a regulatory insertion-deletion polymorphism in FADS2 influences apparent endogenous synthesis of arachidonic acid
- Author
-
Kinsley Ojukwu, J. Thomas Brenna, James Zou, Carlson Susan E, Stephanie S. Hyon, Alon Keinan, Ji Yao Zhang, Kumar S.D. Kothapalli, Maithili S. Gadgil, Hui Gyu Park, Kimberly O. O'Brien, Kaixiong Ye, and Kalpana Joshi
- Subjects
2. Zero hunger ,Genetics ,0303 health sciences ,education.field_of_study ,030309 nutrition & dietetics ,FADS1 ,FADS2 ,Linoleic acid ,Haplotype ,Population ,Biology ,Genotype frequency ,03 medical and health sciences ,chemistry.chemical_compound ,chemistry ,Genotype ,Arachidonic acid ,education ,030304 developmental biology - Abstract
Long chain polyunsaturated fatty acids (LCPUFA) are bioactive components of membrane phospholipids and serve as substrates for signaling molecules. LCPUFA can be obtained directly from animal foods or synthesized endogenously from 18 carbon precursors via the FADS2 coded enzyme. Vegans rely almost exclusively on endogenous synthesis to generate LCPUFA and we hypothesized that an adaptive genetic polymorphism would confer advantage. The rs66698963 polymorphism, a 22 bp insertion-deletion within FADS2, is associated with basal FADS1 expression, and coordinated induction of FADS1 and FADS2 in vitro. Here we determined rs66698963 genotype frequencies from 234 individuals of a primarily vegetarian Indian population and 311 individuals from the U.S. A much higher I/I genotype frequency was found in Indians (68%) than in the U.S. (18%). Analysis using 1000 Genomes Project data confirmed our observation, revealing a global I/I genotype of 70% in South Asians, 53% in Africans, 29% in East Asians, and 17% in Europeans. Tests based on population divergence, site frequency spectrum and long-range haplotype consistently point to positive selection encompassing rs66698963 in South Asian, African and some East Asian populations. Basal plasma phospholipid arachidonic acid status was 8% greater in I/I compared to D/D individuals. The biochemical pathway product-precursor difference, arachidonic acid minus linoleic acid, was 31% and 13% greater for I/I and I/D compared to D/D, respectively. Our study is consistent with previous in vitro data suggesting that the insertion allele enhances n-6 LCPUFA synthesis and may confer an adaptive advantage in South Asians because of the traditional plant-based diet practice.
- Published
- 2016
- Full Text
- View/download PDF
41. Interaction between SNPs in the RXRA and near ANGPTL3 gene region inhibits apoB reduction after statin-fenofibric acid therapy in individuals with mixed dyslipidemia
- Author
-
John W. Belmont, Ariel Brautbar, Alon Keinan, Li Ma, and Christie M. Ballantyne
- Subjects
Male ,Simvastatin ,Very low-density lipoprotein ,Apolipoprotein B ,Biochemistry ,Endocrinology ,Fenofibrate ,Atorvastatin ,Rosuvastatin Calcium ,pharmacogenetics ,Genetics ,Sulfonamides ,education.field_of_study ,biology ,single-nucleotide polymorphism ,Female ,lipids (amino acids, peptides, and proteins) ,medicine.drug ,medicine.medical_specialty ,Combination therapy ,Population ,apolipoprotein ,Single-nucleotide polymorphism ,QD415-436 ,Polymorphism, Single Nucleotide ,Internal medicine ,medicine ,Humans ,Pyrroles ,education ,Angiopoietin-Like Protein 3 ,Apolipoproteins B ,Dyslipidemias ,genetic variants ,nutritional and metabolic diseases ,Cell Biology ,medicine.disease ,Fluorobenzenes ,Angiopoietin-like Proteins ,Apolipoproteins ,Pyrimidines ,Heptanoic Acids ,biology.protein ,fibrates ,Hydroxymethylglutaryl-CoA Reductase Inhibitors ,Patient-Oriented and Epidemiological Research ,Angiopoietins ,Dyslipidemia ,Pharmacogenetics - Abstract
The mixed dyslipidemia phenotype is characterized by elevated triglycerides (TG), low HDL cholesterol (HDL-C), increased ApoB levels, and premature coronary atherosclerosis. Fibrate-statin combination therapy reduces ApoB levels and coronary events in the mixed dyslipidemia population. We sought to identify gene-gene interactions that affect ApoB response to statin-fenofibric acid therapy in the mixed dyslipidemia population. Using a predefined subset of single-nucleotide polymorphisms (SNPs) that were previously associated with TG, VLDL, or HDL-C, we applied gene-gene interaction testing in a randomized, double-blind, clinical trial examining the response to fenofibric acid (FNA) and its combination with statin in 1,865 individuals with mixed dyslipidemia. Of 11,783 possible SNP pairs examined, we detected a single significant interaction between rs12130333, located within the ANGPTL3 gene region, and rs4240705, within the RXRA gene, on ApoB reduction after statin-FNA therapy (P = 4.0 × 10(-6)). ApoB response to therapy gradually reduced with the increasing number of T alleles in the rs12130333 but only in the presence of the GG genotype of rs4240705. Individuals doubly homozygous for the minor alleles at rs12130333 and rs4240705 showed a paradoxical increase of 1.8% in ApoB levels after FNA-statin combination therapy. No gene-gene interaction was identified other than an interaction between SNPs in the ANGPTL3 and RXRA regions, which results in the inhibition of ApoB reduction in response to statin-FNA therapy. Further study is required to examine the clinical applicability of this genetic interaction and its effect on coronary events.
- Published
- 2012
- Full Text
- View/download PDF
42. Recent Explosive Human Population Growth Has Resulted in an Excess of Rare Genetic Variants
- Author
-
Alon Keinan and Andrew G. Clark
- Subjects
Heterozygote ,Population genetics ,Biology ,Polymorphism, Single Nucleotide ,Population density ,White People ,Article ,Asian People ,Gene Frequency ,Abundance (ecology) ,Genetic variation ,Humans ,Population growth ,Genetic Predisposition to Disease ,Population Growth ,Allele frequency ,Genetic Association Studies ,Population Density ,Multidisciplinary ,Models, Genetic ,Genome, Human ,Haplotype ,Genetic Variation ,Sequence Analysis, DNA ,Genetics, Population ,Haplotypes ,Evolutionary biology ,Sample Size ,Mutation ,Mutation (genetic algorithm) - Abstract
Exponential Growth Effects Humans are an extraordinarily successful species, as measured by our large population size—approximately 7 billion—much of which can be put down to recent explosive growth. Leveraging human genomic data, Keinan and Clark (p. 740 ) examined the effects of population growth on our ability to detect rare genetic variants, those hypothesized to be most likely associated with disease. It appears that rapid recent growth increases the load of rare variants and is likely to play an important role in the individual genetic burden of complex disease risk.
- Published
- 2012
- Full Text
- View/download PDF
43. Selection for Translation Efficiency on Synonymous Polymorphisms in Recent Human Evolution
- Author
-
Tamir Tuller, Alon Keinan, Eytan Ruppin, and Yedael Y. Waldman
- Subjects
Genetics, Medical ,causal variants ,SNP ,Single-nucleotide polymorphism ,Biology ,Polymorphism, Single Nucleotide ,Evolution, Molecular ,03 medical and health sciences ,Negative selection ,0302 clinical medicine ,Genetics ,Humans ,Selection, Genetic ,Allele ,Codon ,Gene ,Allele frequency ,Research Articles ,Ecology, Evolution, Behavior and Systematics ,Selection (genetic algorithm) ,030304 developmental biology ,0303 health sciences ,Natural selection ,Models, Genetic ,Genome, Human ,allele frequency differentiation ,synonymous mutations ,Racial Groups ,population genetics ,Proteins ,translation efficiency ,Protein Biosynthesis ,Synonymous substitution ,030217 neurology & neurosurgery - Abstract
Synonymous mutations are considered to be ‘‘silent’’ as they do not affect protein sequence. However, different silent codons have different translation efficiency (TE), which raises the question to what extent such mutations are really neutral. We perform the first genome-wide study of natural selection operating on TE in recent human evolution, surveying 13,798 synonymous single nucleotide polymorphisms (SNPs) in 1,198 unrelated individuals from 11 populations. We find evidence for both negative and positive selection on TE, as measured based on differentiation in allele frequencies between populations. Notably, the likelihood of an SNP to be targeted by positive or negative selection is correlated with the magnitude of its effect on the TE of the corresponding protein. Furthermore, negative selection acting against changes in TE is more marked in highly expressed genes, highly interacting proteins, complex members, and regulatory genes. It is also more common in functional regions and in the initial segments of highly expressed genes. Positive selection targeting sites with a large effect on TE is stronger in lowly interacting proteins and in regulatory genes. Similarly, essential genes are enriched for negative TE selection while underrepresented for positive TE selection. Taken together, these results point to the significant role of TE as a selective force operating in humans and hence underscore the importance of considering silent SNPs in interpreting associations with complex human diseases. Testifying to this potential, we describe two synonymous SNPs that may have clinical implications in phenylketonuria and in Best’s macular dystrophy due to TE differences between alleles.
- Published
- 2011
- Full Text
- View/download PDF
44. Gene-Based Nonparametric Testing of Interactions Using Distance Correlation Coefficient in Case-Control Association Studies
- Author
-
Xiaoyan Liu, Yingjie Guo, Alon Keinan, Maozu Guo, and Chenxi Wu
- Subjects
0301 basic medicine ,lcsh:QH426-470 ,distance correlation coefficient ,Nonparametric statistics ,qualitative trait ,Article ,Statistical power ,Distance correlation ,lcsh:Genetics ,03 medical and health sciences ,Permutation ,030104 developmental biology ,gene–gene interaction ,Joint probability distribution ,genome-wide association studies ,Statistics ,Genetics ,Genetics (clinical) ,Statistic ,Statistical hypothesis testing ,Interpretability ,Mathematics - Abstract
Among the various statistical methods for identifying gene⁻gene interactions in qualitative genome-wide association studies (GWAS), gene-based methods have recently grown in popularity because they confer advantages in both statistical power and biological interpretability. However, most of these methods make strong assumptions about the form of the relationship between traits and single-nucleotide polymorphisms, which result in limited statistical power. In this paper, we propose a gene-based method based on the distance correlation coefficient called gene-based gene-gene interaction via distance correlation coefficient (GBDcor). The distance correlation (dCor) is a measurement of the dependency between two random vectors with arbitrary, and not necessarily equal, dimensions. We used the difference in dCor in case and control datasets as an indicator of gene⁻gene interaction, which was based on the assumption that the joint distribution of two genes in case subjects and in control subjects should not be significantly different if the two genes do not interact. We designed a permutation-based statistical test to evaluate the difference between dCor in cases and controls for a pair of genes, and we provided the p-value for the statistic to represent the significance of the interaction between the two genes. In experiments with both simulated and real-world data, our method outperformed previous approaches in detecting interactions accurately.
- Published
- 2018
- Full Text
- View/download PDF
45. Detecting natural selection by empirical comparison to random regions of the genome
- Author
-
Christopher A. Walsh, Hua Chen, Robert Sean Hill, David Reich, Andre A. Mignault, Alon Keinan, Russell J. Ferland, and Fuli Yu
- Subjects
Neurogenesis ,Population ,Locus (genetics) ,Single-nucleotide polymorphism ,Nerve Tissue Proteins ,Biology ,Genome ,Polymorphism, Single Nucleotide ,Receptors, G-Protein-Coupled ,03 medical and health sciences ,0302 clinical medicine ,Gene Frequency ,Genetic variation ,Genetics ,Humans ,Computer Simulation ,Allele ,Selection, Genetic ,education ,Molecular Biology ,Allele frequency ,Genetics (clinical) ,030304 developmental biology ,Adaptor Proteins, Signal Transducing ,0303 health sciences ,education.field_of_study ,Natural selection ,Models, Genetic ,Genome, Human ,Forkhead Transcription Factors ,General Medicine ,Articles ,Sequence Analysis, DNA ,Adaptor Proteins, Vesicular Transport ,Haplotypes ,Evolutionary biology ,030217 neurology & neurosurgery - Abstract
Historical episodes of natural selection can skew the frequencies of genetic variants, leaving a signature that can persist for many tens or even hundreds of thousands of years. However, formal tests for selection based on allele frequency skew require strong assumptions about demographic history and mutation, which are rarely well understood. Here, we develop an empirical approach to test for signals of selection that compares patterns of genetic variation at a candidate locus with matched random regions of the genome collected in the same way. We apply this approach to four genes that have been implicated in syndromes of impaired neurological development, comparing the pattern of variation in our re-sequencing data with a large-scale, genomic data set that provides an empirical null distribution. We confirm a previously reported signal at FOXP2, and find a novel signal of selection centered at AHI1, a gene that is involved in motor and behavior abnormalities. The locus is marked by many high frequency derived alleles in non-Africans that are of low frequency in Africans, suggesting that selection at this or a closely neighboring gene occurred in the ancestral population of non-Africans. Our study also provides a prototype for how empirical scans for ancient selection can be carried out once many genomes are sequenced.
- Published
- 2009
46. Mitochondrial DNA variants correlate with symptoms in myalgic encephalomyelitis/chronic fatigue syndrome
- Author
-
Arnaud Germain, Alon Keinan, Maureen R. Hanson, Kaixiong Ye, Paul Billing-Ross, and Zhenglong Gu
- Subjects
0301 basic medicine ,Male ,Myalgic encephalomyelitis (ME) ,Disease ,Haplogroup ,Cohort Studies ,0302 clinical medicine ,Genetics ,Medicine(all) ,Fatigue Syndrome, Chronic ,mtDNA ,Variants ,General Medicine ,Chronic fatigue syndrome (CFS) ,Middle Aged ,Heteroplasmy ,Mitochondrial DNA ,3. Good health ,Female ,medicine.symptom ,SNPs ,musculoskeletal diseases ,Adult ,Genetic Markers ,Single-nucleotide polymorphism ,Exercise intolerance ,Biology ,DNA, Mitochondrial ,Polymorphism, Single Nucleotide ,General Biochemistry, Genetics and Molecular Biology ,Association ,03 medical and health sciences ,Young Adult ,Chronic fatigue syndrome ,medicine ,Humans ,Genetic Predisposition to Disease ,Alleles ,Genetic Association Studies ,Aged ,Biochemistry, Genetics and Molecular Biology(all) ,Research ,Haplotype ,medicine.disease ,030104 developmental biology ,Haplotypes ,Mutation ,Next-generation sequencing ,030217 neurology & neurosurgery - Abstract
Background Mitochondrial dysfunction has been hypothesized to occur in Myalgic Encephalomyelitis/Chronic Fatigue Syndrome (ME/CFS), a disease characterized by fatigue, cognitive difficulties, pain, malaise, and exercise intolerance. We investigated whether haplogroup, single nucleotide polymorphisms (SNPs), or heteroplasmy of mitochondrial DNA (mtDNA) were associated with health status and/or symptoms. Methods Illumina sequencing of PCR-amplified mtDNA was performed to analyze sequence and extent of heteroplasmy of mtDNAs of 193 cases and 196 age- and gender-matched controls from DNA samples collected by the Chronic Fatigue Initiative. Association testing was carried out to examine possible correlations of mitochondrial sequences with case/control status and symptom constellation and severity as reported by subjects on Short Form-36 and DePaul Symptom Questionnaires. Results No ME/CFS subject exhibited known disease-causing mtDNA mutations. Extent of heteroplasmy was low in all subjects. Although no association between mtDNA SNPs and ME/CFS vs. healthy status was observed, haplogroups J, U and H as well as eight SNPs in ME/CFS cases were significantly associated with individual symptoms, symptom clusters, or symptom severity. Conclusions Analysis of mitochondrial genomes in ME/CFS cases indicates that individuals of a certain haplogroup or carrying specific SNPs are more likely to exhibit certain neurological, inflammatory, and/or gastrointestinal symptoms. No increase in susceptibility to ME/CFS of individuals carrying particular mitochondrial genomes or SNPs was observed. Electronic supplementary material The online version of this article (doi:10.1186/s12967-016-0771-6) contains supplementary material, which is available to authorized users.
- Published
- 2016
47. The genetics of Bene Israel from India reveals both substantial Jewish and Indian ancestry
- Author
-
Eitan Friedman, Alon Keinan, Natalie R. Davidson, Harry Ostrer, Carole Oddoux, Gil Atzmon, Arjun Biddanda, Paul Billing-Ross, Christopher L. Campbell, Maya Dubrovsky, Eran Halperin, and Yedael Y. Waldman
- Subjects
Male ,0301 basic medicine ,Linkage disequilibrium ,Judaism ,lcsh:Medicine ,Genome-wide association study ,030105 genetics & heredity ,Biochemistry ,Linkage Disequilibrium ,Geographical Locations ,Mathematical and Statistical Techniques ,0302 clinical medicine ,Ethnicities ,Pakistan ,Israel ,lcsh:Science ,Genetics ,Principal Component Analysis ,0303 health sciences ,education.field_of_study ,Sex Chromosomes ,Multidisciplinary ,Chromosome Biology ,Autosomes ,X Chromosomes ,Mitochondrial DNA ,Nucleic acids ,Geography ,Endogamy ,Autocorrelation ,Physical Sciences ,Engineering and Technology ,Female ,Statistics (Mathematics) ,Research Article ,Asia ,Genotype ,Forms of DNA ,Genetic genealogy ,Population ,India ,Research and Analysis Methods ,Arrival time ,Chromosomes ,03 medical and health sciences ,Asian People ,Genetic drift ,Humans ,Statistical Methods ,education ,030304 developmental biology ,Autosome ,lcsh:R ,Biology and Life Sciences ,Cell Biology ,DNA ,Genetics, Population ,030104 developmental biology ,Population bottleneck ,Haplotypes ,Jews ,People and Places ,Multivariate Analysis ,Signal Processing ,Population Groupings ,lcsh:Q ,Allele sharing ,Mathematics ,030217 neurology & neurosurgery ,Genome-Wide Association Study - Abstract
The Bene Israel Jewish community from West India is a unique population whose history before the 18th century remains largely unknown. Bene Israel members consider themselves as descendants of Jews, yet the identity of Jewish ancestors and their arrival time to India are unknown, with speculations on arrival time varying between the 8th century BCE and the 6th century CE. Here, we characterize the genetic history of Bene Israel by collecting and genotyping 18 Bene Israel individuals. Combining with 486 individuals from 41 other Jewish, Indian and Pakistani populations, and additional individuals from worldwide populations, we conducted comprehensive genome-wide analyses based on FST, principal component analysis, ADMIXTURE, identity-by-descent sharing, admixture linkage disequilibrium decay, haplotype sharing and allele sharing autocorrelation decay, as well as contrasted patterns between the X chromosome and the autosomes. The genetics of Bene Israel individuals resemble local Indian populations, while at the same time constituting a clearly separated and unique population in India. They are unique among Indian and Pakistani populations we analyzed in sharing considerable genetic ancestry with other Jewish populations. Putting together the results from all analyses point to Bene Israel being an admixed population with both Jewish and Indian ancestry, with the genetic contribution of each of these ancestral populations being substantial. The admixture took place in the last millennium, about 19-33 generations ago. It involved Middle-Eastern Jews and was sex-biased, with more male Jewish and local female contribution. It was followed by a population bottleneck and high endogamy, which can lead to increased prevalence of recessive diseases in this population. This study provides an example of how genetic analysis advances our knowledge of human history in cases where other disciplines lack the relevant data to do so.
- Published
- 2015
- Full Text
- View/download PDF
48. Inference of super-exponential human population growth via efficient computation of the site frequency spectrum for generalized models
- Author
-
Feng Gao and Alon Keinan
- Subjects
0301 basic medicine ,Computation ,Population ,Inference ,Sample (statistics) ,Biology ,Investigations ,Models, Biological ,Coalescent theory ,03 medical and health sciences ,0302 clinical medicine ,Effective population size ,Exponential growth ,Statistics ,Genetics ,Applied mathematics ,Humans ,Computer Simulation ,education ,Population and Evolutionary Genetics ,Demography ,030304 developmental biology ,Mathematics ,education.field_of_study ,0303 health sciences ,Models, Statistical ,software ,coalescent ,human demographic history ,Exponential function ,Europe ,030104 developmental biology ,Sample size determination ,population growth ,Generalized Growth ,generalized models ,030217 neurology & neurosurgery - Abstract
The site frequency spectrum (SFS) and other genetic summary statistics are at the heart of many population genetics studies. Previous studies have shown that human populations had undergone a recent epoch of fast growth in effective population size. These studies assumed that growth is exponential, and the ensuing models leave unexplained excess amount of extremely rare variants. This suggests that human populations might have experienced a recent growth with speed faster than exponential. Recent studies have introduced a generalized growth model where the growth speed can be faster or slower than exponential. However, only simulation approaches were available for obtaining summary statistics under such models. In this study, we provide expressions to accurately and efficiently evaluate the SFS and other summary statistics under generalized models, which we further implement in a publicly available software. Investigating the power to infer deviation of growth from being exponential, we observed that decent sample sizes facilitate accurate inference, e.g. a sample of 3000 individuals with the amount of data expected from exome sequencing allows observing and accurately estimating growth with speed deviating by 10% or more from that of exponential. Applying our inference framework to data from the NHLBI Exome Sequencing Project, we found that a model with a generalized growth epoch fits the observed SFS significantly better than the equivalent model with exponential growth (p-value = 3.85 × 10-6). The estimated growth speed significantly deviates from exponential (p-value << 10-12), with the best-fit estimate being of growth speed 12% faster than exponential.
- Published
- 2015
- Full Text
- View/download PDF
49. Neurocontroller Analysis via Evolutionary Network Minimization
- Author
-
Alon Keinan, Zohar Ganon, and Eytan Ruppin
- Subjects
Behavior ,Hardware implementations ,business.industry ,Computer science ,Evolutionary algorithm ,Models, Theoretical ,Biological Evolution ,General Biochemistry, Genetics and Molecular Biology ,Task (project management) ,Memory ,Artificial Intelligence ,Genetic algorithm ,Neural Networks, Computer ,Artificial intelligence ,Minification ,business ,Algorithms ,Evolutionary programming ,Supervised training - Abstract
This study presents a new evolutionary network minimization (ENM) algorithm. Neurocontroller minimization is beneficial for finding small parsimonious networks that permit a better understanding of their workings. The ENM algorithm is specifically geared to an evolutionary agents setup, as it does not require any explicit supervised training error, and is very easily incorporated in current evolutionary algorithms. ENM is based on a standard genetic algorithm with an additional step during reproduction in which synaptic connections are irreversibly eliminated. It receives as input a successfully evolved neurocontroller and aims to output a pruned neurocontroller, while maintaining the original fitness level. The small neurocontrollers produced by ENM provide upper bounds on the neurocontroller size needed to perform a given task successfully, and can provide for more effcient hardware implementations.
- Published
- 2006
- Full Text
- View/download PDF
50. Neural Processing of Counting in Evolved Spiking and McCulloch-Pitts Agents
- Author
-
Eytan Ruppin, Alon Keinan, and Keren Saggie-Wexler
- Subjects
Computer science ,business.industry ,Models, Neurological ,Autonomous agent ,Feeding Behavior ,Biological evolution ,Machine learning ,computer.software_genre ,Biological Evolution ,Incremental evolution ,General Biochemistry, Genetics and Molecular Biology ,Evolutionary computation ,Feeding behavior ,Artificial Intelligence ,Neural processing ,Animals ,Neural Networks, Computer ,Artificial intelligence ,business ,Subnetwork ,computer - Abstract
This article investigates the evolution of autonomous agents that perform a memory-dependent counting task. Two types of neurocontrollers are evolved: networks of McCulloch-Pitts neurons, and spiking integrate-and-fire networks. The results demonstrate the superiority of the spiky model in evolutionary success and network simplicity. The combination of spiking dynamics with incremental evolution leads to the successful evolution of agents counting over very long periods. Analysis of the evolved networks unravels the counting mechanism and demonstrates how the spiking dynamics are utilized. Using new measures of spikiness we find that even in agents with spiking dynamics, these are usually truly utilized only when they are really needed, that is, in the evolved subnetwork responsible for counting.
- Published
- 2006
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.