Back to Search Start Over

Delineation of functionally essential protein regions for 242 neurodevelopmental genes

Authors :
Sumaiya Iqbal
Tobias Brünger
Eduardo Pérez-Palma
Marie Macnee
Andreas Brunklaus
Mark J Daly
Arthur J Campbell
David Hoksza
Patrick May
Dennis Lal
Institute for Molecular Medicine Finland
Centre of Excellence in Complex Disease Genetics
Fonds National de la Recherche - FnR [sponsor]
DFG [sponsor]
BMBF [sponsor]
Luxembourg Centre for Systems Biomedicine (LCSB): Bioinformatics Core (R. Schneider Group) [research center]
Source :
Brain. 146:519-533
Publication Year :
2022
Publisher :
Oxford University Press (OUP), 2022.

Abstract

Neurodevelopmental disorders (NDDs), including severe paediatric epilepsy, autism and intellectual disabilities are heterogeneous conditions in which clinical genetic testing can often identify a pathogenic variant. For many of them, genetic therapies will be tested in this or the coming years in clinical trials. In contrast to first-generation symptomatic treatments, the new disease-modifying precision medicines require a genetic test-informed diagnosis before a patient can be enrolled in a clinical trial. However, even in 2022, most identified genetic variants in NDD genes are ‘variants of uncertain significance’. To safely enrol patients in precision medicine clinical trials, it is important to increase our knowledge about which regions in NDD-associated proteins can ‘tolerate’ missense variants and which ones are ‘essential’ and will cause a NDD when mutated. In addition, knowledge about functionally indispensable regions in the 3D structure context of proteins can also provide insights into the molecular mechanisms of disease variants. We developed a novel consensus approach that overlays evolutionary, and population based genomic scores to identify 3D essential sites (Essential3D) on protein structures. After extensive benchmarking of AlphaFold predicted and experimentally solved protein structures, we generated the currently largest expert curated protein structure set for 242 NDDs and identified 14 377 Essential3D sites across 189 gene disorders associated proteins. We demonstrate that the consensus annotation of Essential3D sites improves prioritization of disease mutations over single annotations. The identified Essential3D sites were enriched for functional features such as intermembrane regions or active sites and discovered key inter-molecule interactions in protein complexes that were otherwise not annotated. Using the currently largest autism, developmental disorders, and epilepsies exome sequencing studies including >360 000 NDD patients and population controls, we found that missense variants at Essential3D sites are 8-fold enriched in patients. In summary, we developed a comprehensive protein structure set for 242 NDDs and identified 14 377 Essential3D sites in these. All data are available at https://es-ndd.broadinstitute.org for interactive visual inspection to enhance variant interpretation and development of mechanistic hypotheses for 242 NDDs genes. The provided resources will enhance clinical variant interpretation and in silico drug target development for NDD-associated genes and encoded proteins.

Details

ISSN :
14602156 and 00068950
Volume :
146
Database :
OpenAIRE
Journal :
Brain
Accession number :
edsair.doi.dedup.....8de97e10b6264507d48ee25dd5c51f65