23 results on '"Genome Aggregation Database Production Team"'
Search Results
2. Characterising the loss-of-function impact of 5’ untranslated region variants in 15,708 individuals
- Author
-
Nicola Whiffin, Konrad J. Karczewski, Xiaolei Zhang, Sonia Chothani, Miriam J. Smith, D. Gareth Evans, Angharad M. Roberts, Nicholas M. Quaife, Sebastian Schafer, Owen Rackham, Jessica Alföldi, Anne H. O’Donnell-Luria, Laurent C. Francioli, Genome Aggregation Database Production Team, Genome Aggregation Database Consortium, Stuart A. Cook, Paul J. R. Barton, Daniel G. MacArthur, and James S. Ware
- Subjects
Science - Abstract
Upstream open reading frames (uORFs), located in 5’ untranslated regions, are regulators of downstream protein translation. Here, Whiffin et al. use the genomes of 15,708 individuals in the Genome Aggregation Database (gnomAD) to systematically assess the deleteriousness of variants creating or disrupting uORFs.
- Published
- 2020
- Full Text
- View/download PDF
3. Author Correction: Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes
- Author
-
Qingbo Wang, Emma Pierce-Hoffman, Beryl B. Cummings, Jessica Alföldi, Laurent C. Francioli, Laura D. Gauthier, Andrew J. Hill, Anne H. O’Donnell-Luria, Genome Aggregation Database Production Team, Genome Aggregation Database Consortium, Konrad J. Karczewski, and Daniel G. MacArthur
- Subjects
Science - Abstract
A Correction to this paper has been published: https://doi.org/10.1038/s41467-021-21077-8.
- Published
- 2021
- Full Text
- View/download PDF
4. Author Correction: Characterising the loss-of-function impact of 5’ untranslated region variants in 15,708 individuals
- Author
-
Nicola Whiffin, Konrad J. Karczewski, Xiaolei Zhang, Sonia Chothani, Miriam J. Smith, D. Gareth Evans, Angharad M. Roberts, Nicholas M. Quaife, Sebastian Schafer, Owen Rackham, Jessica Alföldi, Anne H. O’Donnell-Luria, Laurent C. Francioli, Genome Aggregation Database Production Team, Genome Aggregation Database Consortium, Stuart A. Cook, Paul J. R. Barton, Daniel G. MacArthur, and James S. Ware
- Subjects
Science - Abstract
A Correction to this paper has been published: https://doi.org/10.1038/s41467-021-21052-3
- Published
- 2021
- Full Text
- View/download PDF
5. A structural variation reference for medical and population genetics.
- Author
-
Collins, Ryan L, Brand, Harrison, Karczewski, Konrad J, Zhao, Xuefang, Alföldi, Jessica, Francioli, Laurent C, Khera, Amit V, Lowther, Chelsea, Gauthier, Laura D, Wang, Harold, Watts, Nicholas A, Solomonson, Matthew, O'Donnell-Luria, Anne, Baumann, Alexander, Munshi, Ruchi, Walker, Mark, Whelan, Christopher W, Huang, Yongqing, Brookings, Ted, Sharpe, Ted, Stone, Matthew R, Valkanas, Elise, Fu, Jack, Tiao, Grace, Laricchia, Kristen M, Ruano-Rubio, Valentin, Stevens, Christine, Gupta, Namrata, Cusick, Caroline, Margolin, Lauren, Genome Aggregation Database Production Team, Genome Aggregation Database Consortium, Taylor, Kent D, Lin, Henry J, Rich, Stephen S, Post, Wendy S, Chen, Yii-Der Ida, Rotter, Jerome I, Nusbaum, Chad, Philippakis, Anthony, Lander, Eric, Gabriel, Stacey, Neale, Benjamin M, Kathiresan, Sekar, Daly, Mark J, Banks, Eric, MacArthur, Daniel G, and Talkowski, Michael E
- Subjects
Genome Aggregation Database Production Team ,Genome Aggregation Database Consortium ,Humans ,Disease ,Genetics ,Medical ,Genetics ,Population ,Mutation ,Polymorphism ,Single Nucleotide ,Genome ,Human ,Reference Standards ,Middle Aged ,Continental Population Groups ,Female ,Male ,Genetic Variation ,Genetic Testing ,Selection ,Genetic ,Genotyping Techniques ,Whole Genome Sequencing ,Genetics ,Medical ,Population ,Genome ,Human ,Polymorphism ,Single Nucleotide ,Selection ,Genetic ,General Science & Technology - Abstract
Structural variants (SVs) rearrange large segments of DNA1 and can have profound consequences in evolution and human disease2,3. As national biobanks, disease-association studies, and clinical genetic testing have grown increasingly reliant on genome sequencing, population references such as the Genome Aggregation Database (gnomAD)4 have become integral in the interpretation of single-nucleotide variants (SNVs)5. However, there are no reference maps of SVs from high-coverage genome sequencing comparable to those for SNVs. Here we present a reference of sequence-resolved SVs constructed from 14,891 genomes across diverse global populations (54% non-European) in gnomAD. We discovered a rich and complex landscape of 433,371 SVs, from which we estimate that SVs are responsible for 25-29% of all rare protein-truncating events per genome. We found strong correlations between natural selection against damaging SNVs and rare SVs that disrupt or duplicate protein-coding sequence, which suggests that genes that are highly intolerant to loss-of-function are also sensitive to increased dosage6. We also uncovered modest selection against noncoding SVs in cis-regulatory elements, although selection against protein-truncating SVs was stronger than all noncoding effects. Finally, we identified very large (over one megabase), rare SVs in 3.9% of samples, and estimate that 0.13% of individuals may carry an SV that meets the existing criteria for clinically important incidental findings7. This SV resource is freely distributed via the gnomAD browser8 and will have broad utility in population genetics, disease-association studies, and diagnostic screening.
- Published
- 2020
6. Author Correction:Characterising the loss-of-function impact of 5' untranslated region variants in 15,708 individuals
- Author
-
Jessica Alföldi, Xiaolei Zhang, Miriam J. Smith, Owen J. L. Rackham, Nicholas M Quaife, Konrad J. Karczewski, Daniel G. MacArthur, Paul J.R. Barton, Nicola Whiffin, Anne H. O’Donnell-Luria, Stuart A. Cook, Laurent C. Francioli, James S. Ware, Sebastian Schafer, Sonia Chothani, D. Gareth Evans, Angharad M. Roberts, and Genome Aggregation Database Production Team
- Subjects
Multidisciplinary ,Text mining ,Five prime untranslated region ,business.industry ,Computer science ,Science ,General Physics and Astronomy ,General Chemistry ,Computational biology ,business ,General Biochemistry, Genetics and Molecular Biology ,Loss function - Abstract
Correction to: Nature Communications https://doi.org/10.1038/s41467-019-10717-9, published online 27 May 2020.The original version of this Article omitted from the Genome Aggregation Database consortium the member Marquis P. Vawter, from the Department of Psychiatry & Human Behavior, University of California Irvine, Irvine, CA, USA. Additionally, the following was added to the Author Contributions: ‘All authors listed under The Genome Aggregation Database Consortium contributed to the generation of the primary data incorporated into the gnomAD resource’. This has been corrected in both the PDF and HTML versions of the Article.
- Published
- 2021
- Full Text
- View/download PDF
7. The effect of LRRK2 loss-of-function variants in humans
- Author
-
Paul Cannon, Kalpana M. Merchant, Jamie L. Marshall, Jung-Jin Lee, Nicola Whiffin, Danish Saleheen, Anna Guan, Tõnu Esko, Qingbo Wang, Aki S. Havulinna, Cole Whiteman, Daniel G. MacArthur, Christina M. Hultman, Carlos N. Pato, Ruth J. F. Loos, Marco A. S. Baptista, Babak Alipanahi, Mark J. Daly, Kristen M. Laricchia, Irina M. Armean, Aaron Kleinman, James S. Ware, Beryl B. Cummings, Nicholas M Quaife, Laurent C. Francioli, Lili Milani, Konrad J. Karczewski, Jessica Alföldi, Julia K. Goodrich, Patrick F. Sullivan, Genome Aggregation Database Production Team, Eric Vallabh Minikel, Peter Morrison, Aarno Palotie, Bozenna Iliadou, Joanne B. Cole, Michele T. Pato, Girish N. Nadkarni, Medicum, Institute for Molecular Medicine Finland, Complex Disease Genetics, University of Helsinki, Centre of Excellence in Complex Disease Genetics, Research Programs Unit, Aarno Palotie / Principal Investigator, Genomics of Neurological and Neuropsychiatric Disorders, Helsinki Institute of Life Science HiLIFE, Department of Public Health, Samuli Olli Ripatti / Principal Investigator, University Management, Biostatistics Helsinki, HUS Helsinki and Uusimaa Hospital District, Tampere University, Clinical Medicine, Department of Clinical Chemistry, Imper, Rosetrees Trust, and Wellcome Trust
- Subjects
0301 basic medicine ,Male ,ved/biology.organism_classification_rank.species ,Research & Experimental Medicine ,Genome ,0302 clinical medicine ,PARKINSONS-DISEASE ,Loss of Function Mutation ,EPIDEMIOLOGY ,Myocytes, Cardiac ,Lymphocytes ,11 Medical and Health Sciences ,Biological Specimen Banks ,Genetics ,Aged, 80 and over ,Drug discovery ,Genome Aggregation Database Production Team ,Parkinson Disease ,General Medicine ,Middle Aged ,LRRK2 ,3. Good health ,Phenotype ,Medicine, Research & Experimental ,Gain of Function Mutation ,Female ,Life Sciences & Biomedicine ,Adult ,Biochemistry & Molecular Biology ,Heterozygote ,Longevity ,Immunology ,Genomics ,Biology ,Leucine-Rich Repeat Serine-Threonine Protein Kinase-2 ,23andMe Research Team ,General Biochemistry, Genetics and Molecular Biology ,Cell Line ,03 medical and health sciences ,Humans ,COHORT ,Kinase activity ,Model organism ,Gene ,Loss function ,Embryonic Stem Cells ,Aged ,Science & Technology ,ved/biology ,Cell Biology ,the 23andMe Research Team ,GENE ,nervous system diseases ,030104 developmental biology ,Genome Aggregation Database Consortium ,1182 Biochemistry, cell and molecular biology ,3111 Biomedicine ,DISEASE-ASSOCIATED MUTATIONS ,030217 neurology & neurosurgery - Abstract
Analysis of large genomic datasets, including gnomAD, reveals that partial LRRK2 loss of function is not strongly associated with diseases, serving as an example of how human genetics can be leveraged for target validation in drug discovery. Human genetic variants predicted to cause loss-of-function of protein-coding genes (pLoF variants) provide natural in vivo models of human gene inactivation and can be valuable indicators of gene function and the potential toxicity of therapeutic inhibitors targeting these genes(1,2). Gain-of-kinase-function variants in LRRK2 are known to significantly increase the risk of Parkinson's disease(3,4), suggesting that inhibition of LRRK2 kinase activity is a promising therapeutic strategy. While preclinical studies in model organisms have raised some on-target toxicity concerns(5-8), the biological consequences of LRRK2 inhibition have not been well characterized in humans. Here, we systematically analyze pLoF variants in LRRK2 observed across 141,456 individuals sequenced in the Genome Aggregation Database (gnomAD)(9), 49,960 exome-sequenced individuals from the UK Biobank and over 4 million participants in the 23andMe genotyped dataset. After stringent variant curation, we identify 1,455 individuals with high-confidence pLoF variants in LRRK2. Experimental validation of three variants, combined with previous work(10), confirmed reduced protein levels in 82.5% of our cohort. We show that heterozygous pLoF variants in LRRK2 reduce LRRK2 protein levels but that these are not strongly associated with any specific phenotype or disease state. Our results demonstrate the value of large-scale genomic databases and phenotyping of human loss-of-function carriers for target validation in drug discovery.
- Published
- 2020
8. Transcript expression-aware annotation improves rare variant interpretation
- Author
-
Genome Aggregation Database Production Team, Genome Aggregation Database Consortium, Cummings, Beryl B., Karczewski, Konrad J., Kosmicki, Jack A., Seaby, Eleanor G., Watts, Nicholas A., Singer-Berk, Moriel, Mudge, Jonathan M., Karjalainen, Juha, Satterstrom, F. Kyle, O’Donnell-Luria, Anne H., Poterba, Timothy, Seed, Cotton, Solomonson, Matthew, Alföldi, Jessica, Armean, Irina M., Banks, Eric, Bergelson, Louis, Cibulskis, Kristian, Collins, Ryan L., Connolly, Kristen M., Covarrubias, Miguel, Daly, Mark J., Donnelly, Stacey, Farjoun, Yossi, Ferriera, Steven, Francioli, Laurent, Gabriel, Stacey, Gauthier, Laura D., Gentry, Jeff, Gupta, Namrata, Jeandet, Thibault, Kaplan, Diane, Laricchia, Kristen M., Llanwarne, Christopher, Minikel, Eric V., Munshi, Ruchi, Neale, Benjamin M., Novod, Sam, Petrillo, Nikelle, Roazen, David, Ruano-Rubio, Valentin, Saltzman, Andrea, Lehtimäki, Terho, Mattila, Kari M., Suvisaari, Jaana, Tampere University, Clinical Medicine, Department of Clinical Chemistry, Centre of Excellence in Complex Disease Genetics, HUS Abdominal Center, Department of Medicine, Clinicum, Gastroenterologian yksikkö, HUS Psychiatry, Department of Psychiatry, HUS Neurocenter, Department of Neurosciences, Neurologian yksikkö, Institute for Molecular Medicine Finland, Department of Public Health, Aarno Palotie / Principal Investigator, Genomics of Neurological and Neuropsychiatric Disorders, Biosciences, Samuli Olli Ripatti / Principal Investigator, Complex Disease Genetics, and Biostatistics Helsinki
- Subjects
Male ,Transcription, Genetic ,Autism Spectrum Disorder ,Developmental Disabilities ,Datasets as Topic ,Haploinsufficiency ,Human genetic variation ,Genome ,Exon ,0302 clinical medicine ,Loss of Function Mutation ,Disease ,Poisson Distribution ,Exome sequencing ,0303 health sciences ,education.field_of_study ,Multidisciplinary ,Disease genetics ,ARRHYTHMIA ,1184 Genetics, developmental biology, physiology ,Genome Aggregation Database Production Team ,Exons ,Female ,Medical genomics ,GENES ,Genotype ,General Science & Technology ,Population ,Computational biology ,Biology ,3121 Internal medicine ,Article ,DNA sequencing ,03 medical and health sciences ,Rare Diseases ,Intellectual Disability ,Exome Sequencing ,Humans ,RNA, Messenger ,Author Correction ,Transcriptomics ,education ,Gene ,030304 developmental biology ,MUTATIONS ,Alternative splicing ,Reproducibility of Results ,Molecular Sequence Annotation ,Genome Aggregation Database Consortium ,3111 Biomedicine ,Transcriptome ,030217 neurology & neurosurgery - Abstract
The acceleration of DNA sequencing in samples from patients and population studies has resulted in extensive catalogues of human genetic variation, but the interpretation of rare genetic variants remains problematic. A notable example of this challenge is the existence of disruptive variants in dosage-sensitive disease genes, even in apparently healthy individuals. Here, by manual curation of putative loss-of-function (pLoF) variants in haploinsufficient disease genes in the Genome Aggregation Database (gnomAD)1, we show that one explanation for this paradox involves alternative splicing of mRNA, which allows exons of a gene to be expressed at varying levels across different cell types. Currently, no existing annotation tool systematically incorporates information about exon expression into the interpretation of variants. We develop a transcript-level annotation metric known as the ‘proportion expressed across transcripts’, which quantifies isoform expression for variants. We calculate this metric using 11,706 tissue samples from the Genotype Tissue Expression (GTEx) project2 and show that it can differentiate between weakly and highly evolutionarily conserved exons, a proxy for functional importance. We demonstrate that expression-based annotation selectively filters 22.8% of falsely annotated pLoF variants found in haploinsufficient disease genes in gnomAD, while removing less than 4% of high-confidence pathogenic variants in the same genes. Finally, we apply our expression filter to the analysis of de novo variants in patients with autism spectrum disorder and intellectual disability or developmental disorders to show that pLoF variants in weakly expressed regions have similar effect sizes to those of synonymous variants, whereas pLoF variants in highly expressed exons are most strongly enriched among cases. Our annotation is fast, flexible and generalizable, making it possible for any variant file to be annotated with any isoform expression dataset, and will be valuable for the genetic diagnosis of rare diseases, the analysis of rare variant burden in complex disorders, and the curation and prioritization of variants in recall-by-genotype studies., A novel variant annotation metric that quantifies the level of expression of genetic variants across tissues is validated in the Genome Aggregation Database (gnomAD) and is shown to improve rare variant interpretation.
- Published
- 2020
9. A structural variation reference for medical and population genetics
- Author
-
Harold Z. Wang, Yii-Der Ida Chen, Elise Valkanas, Michael E. Talkowski, Kent D. Taylor, Xuefang Zhao, Henry J. Lin, Konrad J. Karczewski, Ryan L. Collins, Eric Banks, Benjamin M. Neale, Lauren Margolin, Christopher W. Whelan, Valentin Ruano-Rubio, Laura D. Gauthier, Stacey Gabriel, Harrison Brand, Namrata Gupta, Jessica Alföldi, Ruchi Munshi, Yongqing Huang, Daniel G. MacArthur, Laurent C. Francioli, Chad Nusbaum, Eric S. Lander, Mark J. Daly, Nicholas A. Watts, Anthony A. Philippakis, Matthew Solomonson, Sekar Kathiresan, Genome Aggregation Database Production Team, Wendy S. Post, Jack Fu, Alexander Baumann, Kristen M. Laricchia, Amit Khera, Ted Brookings, Anne H. O’Donnell-Luria, Jerome I. Rotter, Matthew R. Stone, Chelsea Lowther, Christine Stevens, Caroline N. Cusick, Ted Sharpe, Grace Tiao, Stephen S. Rich, Mark Walker, Tampere University, Clinical Medicine, Department of Clinical Chemistry, Centre of Excellence in Complex Disease Genetics, HUS Abdominal Center, Department of Medicine, Clinicum, Gastroenterologian yksikkö, Institute for Molecular Medicine Finland, HUS Psychiatry, Department of Psychiatry, Department of Public Health, Helsinki Institute of Life Science HiLIFE, Aarno Palotie / Principal Investigator, Genomics of Neurological and Neuropsychiatric Disorders, Samuli Olli Ripatti / Principal Investigator, Complex Disease Genetics, Biostatistics Helsinki, Biosciences, HUS Neurocenter, Department of Neurosciences, and Neurologian yksikkö
- Subjects
0301 basic medicine ,Male ,Genotyping Techniques ,IMPACT ,Population genetics ,VARIANTS ,Genome informatics ,Genome ,0302 clinical medicine ,Disease ,Copy-number variation ,education.field_of_study ,Multidisciplinary ,Continental Population Groups ,REARRANGEMENTS ,1184 Genetics, developmental biology, physiology ,Genome Aggregation Database Production Team ,Genomics ,Single Nucleotide ,Reference Standards ,Middle Aged ,3. Good health ,GENOME ,Female ,Biotechnology ,Human ,General Science & Technology ,Population ,Computational biology ,Biology ,Article ,Structural variation ,03 medical and health sciences ,Genetic ,Medical ,Genetics ,Humans ,Genetic Testing ,Polymorphism ,education ,Selection ,COPY NUMBER VARIATION ,Whole genome sequencing ,Whole Genome Sequencing ,DELETION ,Racial Groups ,Human Genome ,Genetic Variation ,Chromosome abnormality ,EVOLUTION ,Human genetics ,030104 developmental biology ,Genome Aggregation Database Consortium ,Mutation ,PATTERNS ,Generic health relevance ,3111 Biomedicine ,030217 neurology & neurosurgery - Abstract
Structural variants (SVs) rearrange large segments of DNA1 and can have profound consequences in evolution and human disease2,3. As national biobanks, disease-association studies, and clinical genetic testing have grown increasingly reliant on genome sequencing, population references such as the Genome Aggregation Database (gnomAD)4 have become integral in the interpretation of single-nucleotide variants (SNVs)5. However, there are no reference maps of SVs from high-coverage genome sequencing comparable to those for SNVs. Here we present a reference of sequence-resolved SVs constructed from 14,891 genomes across diverse global populations (54% non-European) in gnomAD. We discovered a rich and complex landscape of 433,371 SVs, from which we estimate that SVs are responsible for 25–29% of all rare protein-truncating events per genome. We found strong correlations between natural selection against damaging SNVs and rare SVs that disrupt or duplicate protein-coding sequence, which suggests that genes that are highly intolerant to loss-of-function are also sensitive to increased dosage6. We also uncovered modest selection against noncoding SVs in cis-regulatory elements, although selection against protein-truncating SVs was stronger than all noncoding effects. Finally, we identified very large (over one megabase), rare SVs in 3.9% of samples, and estimate that 0.13% of individuals may carry an SV that meets the existing criteria for clinically important incidental findings7. This SV resource is freely distributed via the gnomAD browser8 and will have broad utility in population genetics, disease-association studies, and diagnostic screening., A large empirical assessment of sequence-resolved structural variants from 14,891 genomes across diverse global populations in the Genome Aggregation Database (gnomAD) provides a reference map for disease-association studies, population genetics, and diagnostic screening.
- Published
- 2020
10. Human loss-of-function variants suggest that partial LRRK2 reduction is not associated with severe disease
- Author
-
Whiffin, N, Armean, IM, Kleinman, A, Marshall, JL, Minikel, EV, Goodrich, JK, Quaife, N, Cole, JB, Wang, Q, Karczewski, KJ, Cummings, BB, Francioli, L, Laricchia, K, Guan, A, Alipanahi, B, Morrison, P, Baptista, MAS, Merchant, KM, Genome Aggregation Database Production Team^, Genome Aggregation Database Consortium, Ware, J, Havulinna, AS, Iliadou, B, Lee, J-J, Nadkarni, GN, Whiteman, C, Daly, M, Esko, T, Hultman, C, Loos, RJF, Milani, L, Palotie, A, Pato, C, Pato, M, Saleheen, D, Sullivan, PF, Alföldi, J, Cannon, P, MacArthur, DG, Wellcome Trust, Imper, and Rosetrees Trust
- Subjects
Immunology ,11 Medical and Health Sciences ,nervous system diseases - Abstract
Human genetic variants predicted to cause loss-of-function of protein-coding genes (pLoF variants) provide natural in vivo models of human gene inactivation, and can be valuable indicators of gene function and the potential toxicity of therapeutic inhibitors targeting these genes1,2. Gain-of-kinase-function variants in LRRK2 are known to significantly increase the risk of Parkinson’s disease3,4, suggesting that inhibition of LRRK2 kinase activity is a promising therapeutic strategy. While preclinical studies in model organisms have raised some on-target toxicity concerns5–8, the biological consequences of LRRK2 inhibition have not been well-characterized in humans. Here we systematically analyse pLoF variants in LRRK2 observed across 141,456 individuals sequenced in the Genome Aggregation Database (gnomAD)9, 49,960 exome sequenced individuals from the UK Biobank, and over 4 million participants in the 23andMe genotyped dataset. After stringent variant curation, we identify 1,455 individuals with high-confidence pLoF variants in LRRK2. Experimental validation of three variants, combined with prior work10, confirmed reduced protein levels in 82.5% of our cohort. We show that heterozygous pLoF variants in LRRK2 reduce LRRK2 protein levels but are not strongly associated with any specific phenotype or disease state. Our results demonstrate the value of large-scale genomic databases and phenotyping of human LoF carriers for target validation in drug discovery.
- Published
- 2020
11. Evaluating potential drug targets through human loss-of-function genetic variation
- Author
-
Minikel, EV, Karczewski, KJ, Martin, HC, Cummings, BB, Whiffin, N, Rhodes, D, Alföldi, J, Trembath, RC, Van Heel, DA, Daly, MJ, Genome Aggregation Database Production Team, Genome Aggregation Database Consortium, Schreiber, SL, MacArthur, DG, and Rosetrees Trust
- Subjects
General Science & Technology ,Genome Aggregation Database Consortium ,Genome Aggregation Database Production Team - Abstract
Naturally occurring human genetic variants predicted to inactivate protein-coding genes provide an in vivo model of human gene inactivation that complements cell and model organism knockout studies. Here we report three key findings regarding assessment of candidate drug targets using human loss-of-function variants. First, even essential genes, where loss-of-function variants are not tolerated, can be highly successful as targets of inhibitory drugs. Second, in most genes, loss-of-function variants are sufficiently rare that genotype-based ascertainment of homozygous or compound heterozygous “knockout” humans will await sample sizes ~1,000 times those presently available, unless recruitment focuses on consanguineous individuals. Third, automated variant annotation and filtering are powerful, but manual curation remains critical for removing artifacts, and is a prerequisite for recall-by-genotype efforts. Our results provide a roadmap for human “knockout” studies and should guide interpretation of loss-of-function variants in drug development.
- Published
- 2020
12. Characterising the loss-of-function impact of 5' untranslated region variants in 15,708 individuals
- Author
-
Genome Aggregation Database Production Team, Genome Aggregation Database Consortium, Whiffin, Nicola, Karczewski, Konrad J., Zhang, Xiaolei, Chothani, Sonia, Smith, Miriam J., Evans, D. Gareth, Roberts, Angharad M., Quaife, Nicholas M., Schafer, Sebastian, Rackham, Owen, Alföldi, Jessica, O’Donnell-Luria, Anne H., Francioli, Laurent C., Armean, Irina M., Banks, Eric, Bergelson, Louis, Cibulskis, Kristian, Collins, Ryan L., Connolly, Kristen M., Covarrubias, Miguel, Cummings, Beryl, Daly, Mark J., Donnelly, Stacey, Farjoun, Yossi, Ferriera, Steven, Gabriel, Stacey, Gauthier, Laura D., Gentry, Jeff, Gupta, Namrata, Jeandet, Thibault, Kaplan, Diane, Laricchia, Kristen M., Llanwarne, Christopher, Minikel, Eric V., Munshi, Ruchi, Neale, Benjamin M., Novod, Sam, Petrillo, Nikelle, Poterba, Timothy, Roazen, David, Ruano-Rubio, Valentin, Saltzman, Andrea, Samocha, Kaitlin E., Schleicher, Molly, Seed, Cotton, Solomonson, Matthew, Soto, Jose, Lehtimäki, Terho, Mattila, Kari M., Suvisaari, Jaana, Imper, Rosetrees Trust, Wellcome Trust, Centre of Excellence in Complex Disease Genetics, HUS Abdominal Center, Department of Medicine, Clinicum, Gastroenterologian yksikkö, Institute for Molecular Medicine Finland, Biosciences, Genomics of Neurological and Neuropsychiatric Disorders, HUS Psychiatry, Department of Psychiatry, HUS Neurocenter, Department of Neurosciences, Neurologian yksikkö, Department of Public Health, Research Programme of Molecular Medicine, Aarno Palotie / Principal Investigator, Doctoral Programme in Social Sciences, Samuli Olli Ripatti / Principal Investigator, Complex Disease Genetics, University Management, Biostatistics Helsinki, Faculty of Medicine, University of Helsinki, Tampere University, Clinical Medicine, Department of Clinical Chemistry, BioMediTech, Fondation Leducq, Department of Health, Leducq Foundation for Cardiovascular Research, British Heart Foundation, and Royal Brompton & Harefield NHS Foundation Trust
- Subjects
0301 basic medicine ,Untranslated region ,Five prime untranslated region ,General Physics and Astronomy ,Genome informatics ,Negative selection ,0302 clinical medicine ,Loss of Function Mutation ,Genetics research ,Coding region ,lcsh:Science ,education.field_of_study ,Multidisciplinary ,1184 Genetics, developmental biology, physiology ,Genome Aggregation Database Production Team ,Genomics ,Multidisciplinary Sciences ,Genome Aggregation Database (gnomAD) Consortium ,Science & Technology - Other Topics ,5'-UNTRANSLATED REGIONS ,Medical genomics ,Genome Aggregation Database (gnomAD) Production Team ,OPEN READING FRAMES ,Science ,Population ,WOUDE ,Computational biology ,Biology ,Article ,General Biochemistry, Genetics and Molecular Biology ,ORFS ,Frameshift mutation ,03 medical and health sciences ,Humans ,Author Correction ,education ,Gene ,Science & Technology ,Base Sequence ,Genome, Human ,MUTATIONS ,Genetic Variation ,Proteins ,General Chemistry ,030104 developmental biology ,INITIATION CODON ,Genome Aggregation Database Consortium ,Human genome ,lcsh:Q ,3111 Biomedicine ,TRANSLATION ,5' Untranslated Regions ,030217 neurology & neurosurgery ,VAN - Abstract
Upstream open reading frames (uORFs) are tissue-specific cis-regulators of protein translation. Isolated reports have shown that variants that create or disrupt uORFs can cause disease. Here, in a systematic genome-wide study using 15,708 whole genome sequences, we show that variants that create new upstream start codons, and variants disrupting stop sites of existing uORFs, are under strong negative selection. This selection signal is significantly stronger for variants arising upstream of genes intolerant to loss-of-function variants. Furthermore, variants creating uORFs that overlap the coding sequence show signals of selection equivalent to coding missense variants. Finally, we identify specific genes where modification of uORFs likely represents an important disease mechanism, and report a novel uORF frameshift variant upstream of NF2 in neurofibromatosis. Our results highlight uORF-perturbing variants as an under-recognised functional class that contribute to penetrant human disease, and demonstrate the power of large-scale population sequencing data in studying non-coding variant classes., Upstream open reading frames (uORFs), located in 5’ untranslated regions, are regulators of downstream protein translation. Here, Whiffin et al. use the genomes of 15,708 individuals in the Genome Aggregation Database (gnomAD) to systematically assess the deleteriousness of variants creating or disrupting uORFs.
- Published
- 2019
13. Author Correction: Evaluating drug targets through human loss-of-function genetic variation
- Author
-
Beryl B. Cummings, Richard C. Trembath, Jessica Alföldi, David A. van Heel, Genome Aggregation Database Production Team, Hilary C. Martin, Nicola Whiffin, Daniel G. MacArthur, Stuart L. Schreiber, Eric Vallabh Minikel, Konrad J. Karczewski, Mark J. Daly, and Daniel R. Rhodes
- Subjects
Drug ,Heterozygote ,media_common.quotation_subject ,MEDLINE ,tau Proteins ,Computational biology ,Biology ,Leucine-Rich Repeat Serine-Threonine Protein Kinase-2 ,Prion Proteins ,Target validation ,Automation ,Consanguinity ,Text mining ,Gene Frequency ,Loss of Function Mutation ,Genetic variation ,Humans ,Molecular Targeted Therapy ,Author Correction ,Loss function ,media_common ,Huntingtin Protein ,Multidisciplinary ,Genes, Essential ,business.industry ,Published Erratum ,Homozygote ,Reproducibility of Results ,Neurodegenerative Diseases ,Genomics ,Exons ,Gain of Function Mutation ,Gene Knockdown Techniques ,Sample Size ,business ,Artifacts - Abstract
Naturally occurring human genetic variants that are predicted to inactivate protein-coding genes provide an in vivo model of human gene inactivation that complements knockout studies in cells and model organisms. Here we report three key findings regarding the assessment of candidate drug targets using human loss-of-function variants. First, even essential genes, in which loss-of-function variants are not tolerated, can be highly successful as targets of inhibitory drugs. Second, in most genes, loss-of-function variants are sufficiently rare that genotype-based ascertainment of homozygous or compound heterozygous 'knockout' humans will await sample sizes that are approximately 1,000 times those presently available, unless recruitment focuses on consanguineous individuals. Third, automated variant annotation and filtering are powerful, but manual curation remains crucial for removing artefacts, and is a prerequisite for recall-by-genotype efforts. Our results provide a roadmap for human knockout studies and should guide the interpretation of loss-of-function variants in drug development.
- Published
- 2021
14. Author Correction: A structural variation reference for medical and population genetics
- Author
-
Anne H. O’Donnell-Luria, Chelsea Lowther, Christine Stevens, Elise Valkanas, Matthew R. Stone, Anthony A. Philippakis, Matthew Solomonson, Mark Walker, Yongqing Huang, Jack Fu, Jerome I. Rotter, Laurent C. Francioli, Wendy S. Post, Yii-Der Ida Chen, Kristen M. Laricchia, Amit Khera, Eric S. Lander, Kent D. Taylor, Mark J. Daly, Xuefang Zhao, Lauren Margolin, Ryan L. Collins, Henry J. Lin, Konrad J. Karczewski, Laura D. Gauthier, Ted Brookings, Jessica Alföldi, Benjamin M. Neale, Harrison Brand, Caroline N. Cusick, Eric Banks, Nicholas A. Watts, Stacey Gabriel, Harold Z. Wang, Valentin Ruano-Rubio, Michael E. Talkowski, Ruchi Munshi, Stephen S. Rich, Genome Aggregation Database Production Team, Sekar Kathiresan, Christopher W. Whelan, Daniel G. MacArthur, Namrata Gupta, Chad Nusbaum, Ted Sharpe, Grace Tiao, and Alexander Baumann
- Subjects
Male ,Genotyping Techniques ,Genetics, Medical ,MEDLINE ,Population genetics ,Computational biology ,Biology ,Genome informatics ,Polymorphism, Single Nucleotide ,Structural variation ,Humans ,Disease ,Genetic Testing ,Selection, Genetic ,Author Correction ,Multidisciplinary ,Whole Genome Sequencing ,Genome, Human ,Published Erratum ,Racial Groups ,Genetic Variation ,Chromosome abnormality ,Genomics ,Middle Aged ,Reference Standards ,Genetics, Population ,Mutation ,Female - Abstract
A Correction to this paper has been published: https://doi.org/10.1038/s41586-020-03176-6.
- Published
- 2021
15. Author Correction: The effect of LRRK2 loss-of-function variants in humans
- Author
-
Irina M. Armean, Jung-Jin Lee, Ruth J. F. Loos, Danish Saleheen, Beryl B. Cummings, Genome Aggregation Database Production Team, Cole Whiteman, Lili Milani, Kalpana M. Merchant, Nicola Whiffin, Kristen M. Laricchia, Jessica Alföldi, Paul Cannon, Aarno Palotie, Girish N. Nadkarni, Michele T. Pato, Tõnu Esko, Patrick F. Sullivan, Jamie L. Marshall, Mark J. Daly, Konrad J. Karczewski, Daniel G. MacArthur, Aki S. Havulinna, Marco A. S. Baptista, Babak Alipanahi, Qingbo Wang, James S. Ware, Laurent C. Francioli, Christina M. Hultman, Nicholas M Quaife, Carlos N. Pato, Aaron Kleinman, Julia K. Goodrich, Anna Guan, Eric Vallabh Minikel, Peter Morrison, Bozenna Iliadou, and Joanne B. Cole
- Subjects
Text mining ,business.industry ,Published Erratum ,MEDLINE ,Genomics ,General Medicine ,Computational biology ,Biology ,business ,LRRK2 ,General Biochemistry, Genetics and Molecular Biology ,Loss function - Published
- 2021
- Full Text
- View/download PDF
16. Transcript expression-aware annotation improves rare variant interpretation.
- Author
-
Cummings, Beryl B., Karczewski, Konrad J., Kosmicki, Jack A., Seaby, Eleanor G., Watts, Nicholas A., Singer-Berk, Moriel, Mudge, Jonathan M., Karjalainen, Juha, Satterstrom, F. Kyle, O'Donnell-Luria, Anne H., Poterba, Timothy, Seed, Cotton, Solomonson, Matthew, Alföldi, Jessica, Genome Aggregation Database Production Team, Armean, Irina M., Banks, Eric, Bergelson, Louis, Cibulskis, Kristian, and Collins, Ryan L.
- Abstract
The acceleration of DNA sequencing in samples from patients and population studies has resulted in extensive catalogues of human genetic variation, but the interpretation of rare genetic variants remains problematic. A notable example of this challenge is the existence of disruptive variants in dosage-sensitive disease genes, even in apparently healthy individuals. Here, by manual curation of putative loss-of-function (pLoF) variants in haploinsufficient disease genes in the Genome Aggregation Database (gnomAD)1, we show that one explanation for this paradox involves alternative splicing of mRNA, which allows exons of a gene to be expressed at varying levels across different cell types. Currently, no existing annotation tool systematically incorporates information about exon expression into the interpretation of variants. We develop a transcript-level annotation metric known as the 'proportion expressed across transcripts', which quantifies isoform expression for variants. We calculate this metric using 11,706 tissue samples from the Genotype Tissue Expression (GTEx) project2 and show that it can differentiate between weakly and highly evolutionarily conserved exons, a proxy for functional importance. We demonstrate that expression-based annotation selectively filters 22.8% of falsely annotated pLoF variants found in haploinsufficient disease genes in gnomAD, while removing less than 4% of high-confidence pathogenic variants in the same genes. Finally, we apply our expression filter to the analysis of de novo variants in patients with autism spectrum disorder and intellectual disability or developmental disorders to show that pLoF variants in weakly expressed regions have similar effect sizes to those of synonymous variants, whereas pLoF variants in highly expressed exons are most strongly enriched among cases. Our annotation is fast, flexible and generalizable, making it possible for any variant file to be annotated with any isoform expression dataset, and will be valuable for the genetic diagnosis of rare diseases, the analysis of rare variant burden in complex disorders, and the curation and prioritization of variants in recall-by-genotype studies. A novel variant annotation metric that quantifies the level of expression of genetic variants across tissues is validated in the Genome Aggregation Database (gnomAD) and is shown to improve rare variant interpretation. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
17. Evaluating drug targets through human loss-of-function genetic variation.
- Author
-
Minikel, Eric Vallabh, Karczewski, Konrad J., Martin, Hilary C., Cummings, Beryl B., Whiffin, Nicola, Rhodes, Daniel, Alföldi, Jessica, Trembath, Richard C., van Heel, David A., Daly, Mark J., Genome Aggregation Database Production Team, Armean, Irina M., Banks, Eric, Bergelson, Louis, Cibulskis, Kristian, Collins, Ryan L., Connolly, Kristen M., Covarrubias, Miguel, Donnelly, Stacey, and Farjoun, Yossi
- Abstract
Naturally occurring human genetic variants that are predicted to inactivate protein-coding genes provide an in vivo model of human gene inactivation that complements knockout studies in cells and model organisms. Here we report three key findings regarding the assessment of candidate drug targets using human loss-of-function variants. First, even essential genes, in which loss-of-function variants are not tolerated, can be highly successful as targets of inhibitory drugs. Second, in most genes, loss-of-function variants are sufficiently rare that genotype-based ascertainment of homozygous or compound heterozygous 'knockout' humans will await sample sizes that are approximately 1,000 times those presently available, unless recruitment focuses on consanguineous individuals. Third, automated variant annotation and filtering are powerful, but manual curation remains crucial for removing artefacts, and is a prerequisite for recall-by-genotype efforts. Our results provide a roadmap for human knockout studies and should guide the interpretation of loss-of-function variants in drug development. Analysis of predicted loss-of-function variants from 125,748 human exomes and 15,708 whole genomes in the Genome Aggregation Database (gnomAD) provides a roadmap for human 'knockout' studies and a guide for future research into disease biology and drug-target selection. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
18. Characterising the loss-of-function impact of 5' untranslated region variants in 15,708 individuals.
- Author
-
Whiffin, Nicola, Karczewski, Konrad J., Zhang, Xiaolei, Chothani, Sonia, Smith, Miriam J., Evans, D. Gareth, Roberts, Angharad M., Quaife, Nicholas M., Schafer, Sebastian, Rackham, Owen, Alföldi, Jessica, O'Donnell-Luria, Anne H., Francioli, Laurent C., Genome Aggregation Database Production Team, Armean, Irina M., Banks, Eric, Bergelson, Louis, Cibulskis, Kristian, Collins, Ryan L., and Connolly, Kristen M.
- Subjects
GENETIC translation ,NUCLEOTIDE sequencing ,GENETIC code ,GENOMES ,EXOMES - Abstract
Upstream open reading frames (uORFs) are tissue-specific cis-regulators of protein translation. Isolated reports have shown that variants that create or disrupt uORFs can cause disease. Here, in a systematic genome-wide study using 15,708 whole genome sequences, we show that variants that create new upstream start codons, and variants disrupting stop sites of existing uORFs, are under strong negative selection. This selection signal is significantly stronger for variants arising upstream of genes intolerant to loss-of-function variants. Furthermore, variants creating uORFs that overlap the coding sequence show signals of selection equivalent to coding missense variants. Finally, we identify specific genes where modification of uORFs likely represents an important disease mechanism, and report a novel uORF frameshift variant upstream of NF2 in neurofibromatosis. Our results highlight uORF-perturbing variants as an under-recognised functional class that contribute to penetrant human disease, and demonstrate the power of large-scale population sequencing data in studying non-coding variant classes. Upstream open reading frames (uORFs), located in 5' untranslated regions, are regulators of downstream protein translation. Here, Whiffin et al. use the genomes of 15,708 individuals in the Genome Aggregation Database (gnomAD) to systematically assess the deleteriousness of variants creating or disrupting uORFs. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
19. Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes.
- Author
-
Wang, Qingbo, Pierce-Hoffman, Emma, Cummings, Beryl B., Alföldi, Jessica, Francioli, Laurent C., Gauthier, Laura D., Hill, Andrew J., O'Donnell-Luria, Anne H., Genome Aggregation Database Production Team, Armean, Irina M., Banks, Eric, Bergelson, Louis, Cibulskis, Kristian, Collins, Ryan L., Connolly, Kristen M., Covarrubias, Miguel, Daly, Mark J., Donnelly, Stacey, Farjoun, Yossi, and Ferriera, Steven
- Subjects
AMINO acid sequence ,GENOMES ,DEAMINATION ,EXOMES - Abstract
Multi-nucleotide variants (MNVs), defined as two or more nearby variants existing on the same haplotype in an individual, are a clinically and biologically important class of genetic variation. However, existing tools typically do not accurately classify MNVs, and understanding of their mutational origins remains limited. Here, we systematically survey MNVs in 125,748 whole exomes and 15,708 whole genomes from the Genome Aggregation Database (gnomAD). We identify 1,792,248 MNVs across the genome with constituent variants falling within 2 bp distance of one another, including 18,756 variants with a novel combined effect on protein sequence. Finally, we estimate the relative impact of known mutational mechanisms - CpG deamination, replication error by polymerase zeta, and polymerase slippage at repeat junctions - on the generation of MNVs. Our results demonstrate the value of haplotype-aware variant annotation, and refine our understanding of genome-wide mutational mechanisms of MNVs. Multi-nucleotide variants (MNV) are genetic variants in close proximity of each other on the same haplotype whose functional impact is difficult to predict if they reside in the same codon. Here, Wang et al. use the gnomAD dataset to assemble a catalogue of MNVs and estimate their global mutation rate. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
20. Author Correction: Transcript expression-aware annotation improves rare variant interpretation.
- Author
-
Cummings, Beryl B., Karczewski, Konrad J., Kosmicki, Jack A., Seaby, Eleanor G., Watts, Nicholas A., Singer-Berk, Moriel, Mudge, Jonathan M., Karjalainen, Juha, Satterstrom, F. Kyle, O'Donnell-Luria, Anne H., Poterba, Timothy, Seed, Cotton, Solomonson, Matthew, Alföldi, Jessica, Genome Aggregation Database Production Team, Armean, Irina M., Banks, Eric, Bergelson, Louis, Cibulskis, Kristian, and Collins, Ryan L.
- Abstract
A Correction to this paper has been published: https://doi.org/10.1038/s41586-020-03175-7 [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
21. Author Correction: Evaluating drug targets through human loss-of-function genetic variation.
- Author
-
Minikel, Eric Vallabh, Karczewski, Konrad J., Martin, Hilary C., Cummings, Beryl B., Whiffin, Nicola, Rhodes, Daniel, Alföldi, Jessica, Trembath, Richard C., van Heel, David A., Daly, Mark J., Genome Aggregation Database Production Team, Armean, Irina M., Banks, Eric, Bergelson, Louis, Cibulskis, Kristian, Collins, Ryan L., Connolly, Kristen M., Covarrubias, Miguel, Donnelly, Stacey, and Farjoun, Yossi
- Abstract
A Correction to this paper has been published: https://doi.org/10.1038/s41586-020-03177-5 [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
22. Author Correction: Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes.
- Author
-
Wang, Qingbo, Pierce-Hoffman, Emma, Cummings, Beryl B., Alföldi, Jessica, Francioli, Laurent C., Gauthier, Laura D., Hill, Andrew J., O'Donnell-Luria, Anne H., Genome Aggregation Database Production Team, Armean, Irina M., Banks, Eric, Bergelson, Louis, Cibulskis, Kristian, Collins, Ryan L., Connolly, Kristen M., Covarrubias, Miguel, Daly, Mark J., Donnelly, Stacey, Farjoun, Yossi, and Ferriera, Steven
- Subjects
GENOMES ,LANDSCAPES ,HUMAN beings ,EXOMES - Abstract
A Correction to this paper has been published: https://doi.org/10.1038/s41467-021-21077-8. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
23. Author Correction: Characterising the loss-of-function impact of 5' untranslated region variants in 15,708 individuals.
- Author
-
Whiffin, Nicola, Karczewski, Konrad J., Zhang, Xiaolei, Chothani, Sonia, Smith, Miriam J., Evans, D. Gareth, Roberts, Angharad M., Quaife, Nicholas M., Schafer, Sebastian, Rackham, Owen, Alföldi, Jessica, O'Donnell-Luria, Anne H., Francioli, Laurent C., Genome Aggregation Database Production Team, Armean, Irina M., Banks, Eric, Bergelson, Louis, Cibulskis, Kristian, Collins, Ryan L., and Connolly, Kristen M.
- Subjects
PUBLISHING - Abstract
A Correction to this paper has been published: https://doi.org/10.1038/s41467-021-21052-3 [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.