Back to Search
Start Over
Pathogenic missense protein variants affect different functional pathways and proteomic features than healthy population variants
- Source :
- PLoS Biology, PLoS Biology, Vol 19, Iss 4, p e3001207 (2021)
- Publication Year :
- 2021
- Publisher :
- Public Library of Science, 2021.
-
Abstract
- Missense variants are present amongst the healthy population, but some of them are causative of human diseases. A classification of variants associated with “healthy” or “diseased” states is therefore not always straightforward. A deeper understanding of the nature of missense variants in health and disease, the cellular processes they may affect, and the general molecular principles which underlie these differences is essential to offer mechanistic explanations of the true impact of pathogenic variants. Here, we have formalised a statistical framework which enables robust probabilistic quantification of variant enrichment across full-length proteins, their domains, and 3D structure-defined regions. Using this framework, we validate and extend previously reported trends of variant enrichment in different protein structural regions (surface/core/interface). By examining the association of variant enrichment with available functional pathways and transcriptomic and proteomic (protein half-life, thermal stability, abundance) data, we have mined a rich set of molecular features which distinguish between pathogenic and population variants: Pathogenic variants mainly affect proteins involved in cell proliferation and nucleotide processing and are enriched in more abundant proteins. Additionally, rare population variants display features closer to common than pathogenic variants. We validate the association between these molecular features and variant pathogenicity by comparing against existing in silico variant impact annotations. This study provides molecular details into how different proteins exhibit resilience and/or sensitivity towards missense variants and provides the rationale to prioritise variant-enriched proteins and protein domains for therapeutic targeting and development. The ZoomVar database, which we created for this study, is available at fraternalilab.kcl.ac.uk/ZoomVar. It allows users to programmatically annotate missense variants with protein structural information and to calculate variant enrichment in different protein structural regions.<br />How do can one improve the classification of genetic variants as harmful or harmless? This study uses a robust statistical analysis to exploit the interplay between protein structure, proteomic measurements and functional pathways to enable better discrimination between missense variants in health and disease.
- Subjects :
- 0301 basic medicine
Proteomics
Protein Folding
Proteome
Protein Conformation
Thermal Stability
Disease
Biochemistry
Transcriptome
Database and Informatics Methods
0302 clinical medicine
Medicine and Health Sciences
Macromolecular Structure Analysis
Missense mutation
Biology (General)
Databases, Protein
education.field_of_study
Proteomic Databases
General Neuroscience
Healthy population
Physics
Computer-Aided Drug Design
Methods and Resources
Oncology
Health
Physical Sciences
Thermodynamics
Structural Proteins
General Agricultural and Biological Sciences
Signal Transduction
Protein Structure
Drug Research and Development
QH301-705.5
In silico
Protein domain
Population
Mutation, Missense
Computational biology
Biology
Research and Analysis Methods
General Biochemistry, Genetics and Molecular Biology
03 medical and health sciences
Protein Domains
Humans
Genetic Predisposition to Disease
Protein Interaction Domains and Motifs
Amino Acid Sequence
education
Molecular Biology
Germ-Line Mutation
Pharmacology
General Immunology and Microbiology
Genetic Diseases, Inborn
Biology and Life Sciences
Proteins
Cancers and Neoplasms
Computational Biology
030104 developmental biology
Biological Databases
Drug Design
Mutant Proteins
Protein Processing, Post-Translational
030217 neurology & neurosurgery
Protein Abundance
Software
Subjects
Details
- Language :
- English
- ISSN :
- 15457885 and 15449173
- Volume :
- 19
- Issue :
- 4
- Database :
- OpenAIRE
- Journal :
- PLoS Biology
- Accession number :
- edsair.doi.dedup.....dc3f030eff2bc8536b736f0d8fca2632