63 results on '"Rodrigues, CHM"'
Search Results
2. piscesCSM: prediction of anticancer synergistic drug combinations.
- Author
-
AlJarf, R, Rodrigues, CHM, Myung, Y, Pires, DEV, Ascher, DB, AlJarf, R, Rodrigues, CHM, Myung, Y, Pires, DEV, and Ascher, DB
- Abstract
While drug combination therapies are of great importance, particularly in cancer treatment, identifying novel synergistic drug combinations has been a challenging venture. Computational methods have emerged in this context as a promising tool for prioritizing drug combinations for further evaluation, though they have presented limited performance, utility, and interpretability. Here, we propose a novel predictive tool, piscesCSM, that leverages graph-based representations to model small molecule chemical structures to accurately predict drug combinations with favourable anticancer synergistic effects against one or multiple cancer cell lines. Leveraging these insights, we developed a general supervised machine learning model to guide the prediction of anticancer synergistic drug combinations in over 30 cell lines. It achieved an area under the receiver operating characteristic curve (AUROC) of up to 0.89 on independent non-redundant blind tests, outperforming state-of-the-art approaches on both large-scale oncology screening data and an independent test set generated by AstraZeneca (with more than a 16% improvement in predictive accuracy). Moreover, by exploring the interpretability of our approach, we found that simple physicochemical properties and graph-based signatures are predictive of chemotherapy synergism. To provide a simple and integrated platform to rapidly screen potential candidate pairs with favourable synergistic anticancer effects, we made piscesCSM freely available online at https://biosig.lab.uq.edu.au/piscescsm/ as a web server and API. We believe that our predictive tool will provide a valuable resource for optimizing and augmenting combinatorial screening libraries to identify effective and safe synergistic anticancer drug combinations. SCIENTIFIC CONTRIBUTION: This work proposes piscesCSM, a machine-learning-based framework that relies on well-established graph-based representations of small molecules to identify and provide better predictive a
- Published
- 2024
3. LEGO-CSM: a tool for functional characterization of proteins
- Author
-
Valencia, A, Nguyen, TB, de Sa, AGC, Rodrigues, CHM, Pires, DE, Ascher, DB, Valencia, A, Nguyen, TB, de Sa, AGC, Rodrigues, CHM, Pires, DE, and Ascher, DB
- Abstract
MOTIVATION: With the development of sequencing techniques, the discovery of new proteins significantly exceeds the human capacity and resources for experimentally characterizing protein functions. Localization, EC numbers, and GO terms with the structure-based Cutoff Scanning Matrix (LEGO-CSM) is a comprehensive web-based resource that fills this gap by leveraging the well-established and robust graph-based signatures to supervised learning models using both protein sequence and structure information to accurately model protein function in terms of Subcellular Localization, Enzyme Commission (EC) numbers, and Gene Ontology (GO) terms. RESULTS: We show our models perform as well as or better than alternative approaches, achieving area under the receiver operating characteristic curve of up to 0.93 for subcellular localization, up to 0.93 for EC, and up to 0.81 for GO terms on independent blind tests. AVAILABILITY AND IMPLEMENTATION: LEGO-CSM's web server is freely available at https://biosig.lab.uq.edu.au/lego_csm. In addition, all datasets used to train and test LEGO-CSM's models can be downloaded at https://biosig.lab.uq.edu.au/lego_csm/data.
- Published
- 2023
4. DockNet: high-throughput protein-protein interface contact prediction.
- Author
-
Cowen, L, Williams, NP, Rodrigues, CHM, Truong, J, Ascher, DB, Holien, JK, Cowen, L, Williams, NP, Rodrigues, CHM, Truong, J, Ascher, DB, and Holien, JK
- Abstract
MOTIVATION: Over 300 000 protein-protein interaction (PPI) pairs have been identified in the human proteome and targeting these is fast becoming the next frontier in drug design. Predicting PPI sites, however, is a challenging task that traditionally requires computationally expensive and time-consuming docking simulations. A major weakness of modern protein docking algorithms is the inability to account for protein flexibility, which ultimately leads to relatively poor results. RESULTS: Here, we propose DockNet, an efficient Siamese graph-based neural network method which predicts contact residues between two interacting proteins. Unlike other methods that only utilize a protein's surface or treat the protein structure as a rigid body, DockNet incorporates the entire protein structure and places no limits on protein flexibility during an interaction. Predictions are modeled at the residue level, based on a diverse set of input node features including residue type, surface accessibility, residue depth, secondary structure, pharmacophore and torsional angles. DockNet is comparable to current state-of-the-art methods, achieving an area under the curve (AUC) value of up to 0.84 on an independent test set (DB5), can be applied to a variety of different protein structures and can be utilized in situations where accurate unbound protein structures cannot be obtained. AVAILABILITY AND IMPLEMENTATION: DockNet is available at https://github.com/npwilliams09/docknet and an easy-to-use webserver at https://biosig.lab.uq.edu.au/docknet. All other data underlying this article are available in the article and in its online supplementary material. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
- Published
- 2023
5. DDMut: predicting effects of mutations on protein stability using deep learning
- Author
-
Zhou, Y, Pan, Q, Pires, DE, Rodrigues, CHM, Ascher, DB, Zhou, Y, Pan, Q, Pires, DE, Rodrigues, CHM, and Ascher, DB
- Abstract
Understanding the effects of mutations on protein stability is crucial for variant interpretation and prioritisation, protein engineering, and biotechnology. Despite significant efforts, community assessments of predictive tools have highlighted ongoing limitations, including computational time, low predictive power, and biased predictions towards destabilising mutations. To fill this gap, we developed DDMut, a fast and accurate siamese network to predict changes in Gibbs Free Energy upon single and multiple point mutations, leveraging both forward and hypothetical reverse mutations to account for model anti-symmetry. Deep learning models were built by integrating graph-based representations of the localised 3D environment, with convolutional layers and transformer encoders. This combination better captured the distance patterns between atoms by extracting both short-range and long-range interactions. DDMut achieved Pearson's correlations of up to 0.70 (RMSE: 1.37 kcal/mol) on single point mutations, and 0.70 (RMSE: 1.84 kcal/mol) on double/triple mutants, outperforming most available methods across non-redundant blind test sets. Importantly, DDMut was highly scalable and demonstrated anti-symmetric performance on both destabilising and stabilising mutations. We believe DDMut will be a useful platform to better understand the functional consequences of mutations, and guide rational protein engineering. DDMut is freely available as a web server and API at https://biosig.lab.uq.edu.au/ddmut.
- Published
- 2023
6. CSM-Toxin: A Web-Server for Predicting Protein Toxicity
- Author
-
Morozov, V, Rodrigues, CHM, Ascher, DB, Morozov, V, Rodrigues, CHM, and Ascher, DB
- Abstract
Biologics are one of the most rapidly expanding classes of therapeutics, but can be associated with a range of toxic properties. In small-molecule drug development, early identification of potential toxicity led to a significant reduction in clinical trial failures, however we currently lack robust qualitative rules or predictive tools for peptide- and protein-based biologics. To address this, we have manually curated the largest set of high-quality experimental data on peptide and protein toxicities, and developed CSM-Toxin, a novel in-silico protein toxicity classifier, which relies solely on the protein primary sequence. Our approach encodes the protein sequence information using a deep learning natural languages model to understand "biological" language, where residues are treated as words and protein sequences as sentences. The CSM-Toxin was able to accurately identify peptides and proteins with potential toxicity, achieving an MCC of up to 0.66 across both cross-validation and multiple non-redundant blind tests, outperforming other methods and highlighting the robust and generalisable performance of our model. We strongly believe the CSM-Toxin will serve as a valuable platform to minimise potential toxicity in the biologic development pipeline. Our method is freely available as an easy-to-use webserver.
- Published
- 2023
7. Applying Sodium Carbonate Extraction Mass Spectrometry to Investigate Defects in the Mitochondrial Respiratory Chain
- Author
-
Robinson, DRL, Hock, DH, Muellner-Wong, L, Kugapreethan, R, Reljic, B, Surgenor, EE, Rodrigues, CHM, Caruana, NJ, Stroud, DA, Robinson, DRL, Hock, DH, Muellner-Wong, L, Kugapreethan, R, Reljic, B, Surgenor, EE, Rodrigues, CHM, Caruana, NJ, and Stroud, DA
- Abstract
Mitochondria are complex organelles containing 13 proteins encoded by mitochondrial DNA and over 1,000 proteins encoded on nuclear DNA. Many mitochondrial proteins are associated with the inner or outer mitochondrial membranes, either peripherally or as integral membrane proteins, while others reside in either of the two soluble mitochondrial compartments, the mitochondrial matrix and the intermembrane space. The biogenesis of the five complexes of the oxidative phosphorylation system are exemplars of this complexity. These large multi-subunit complexes are comprised of more than 80 proteins with both membrane integral and peripheral associations and require soluble, membrane integral and peripherally associated assembly factor proteins for their biogenesis. Mutations causing human mitochondrial disease can lead to defective complex assembly due to the loss or altered function of the affected protein and subsequent destabilization of its interactors. Here we couple sodium carbonate extraction with quantitative mass spectrometry (SCE-MS) to track changes in the membrane association of the mitochondrial proteome across multiple human knockout cell lines. In addition to identifying the membrane association status of over 840 human mitochondrial proteins, we show how SCE-MS can be used to understand the impacts of defective complex assembly on protein solubility, giving insights into how specific subunits and sub-complexes become destabilized.
- Published
- 2022
8. Structural landscapes of PPI interfaces
- Author
-
Rodrigues, CHM, Pires, DE, Blundell, TL, Ascher, DB, Rodrigues, CHM, Pires, DE, Blundell, TL, and Ascher, DB
- Abstract
Proteins are capable of highly specific interactions and are responsible for a wide range of functions, making them attractive in the pursuit of new therapeutic options. Previous studies focusing on overall geometry of protein-protein interfaces, however, concluded that PPI interfaces were generally flat. More recently, this idea has been challenged by their structural and thermodynamic characterisation, suggesting the existence of concave binding sites that are closer in character to traditional small-molecule binding sites, rather than exhibiting complete flatness. Here, we present a large-scale analysis of binding geometry and physicochemical properties of all protein-protein interfaces available in the Protein Data Bank. In this review, we provide a comprehensive overview of the protein-protein interface landscape, including evidence that even for overall larger, more flat interfaces that utilize discontinuous interacting regions, small and potentially druggable pockets are utilized at binding sites.
- Published
- 2022
9. CSM-Potential: mapping protein interactions and biological ligands in 3D space using geometric deep learning
- Author
-
Rodrigues, CHM, Ascher, DB, Rodrigues, CHM, and Ascher, DB
- Abstract
Recent advances in protein structural modelling have enabled the accurate prediction of the holo 3D structures of almost any protein, however protein function is intrinsically linked to the interactions it makes. While a number of computational approaches have been proposed to explore potential biological interactions, they have been limited to specific interactions, and have not been readily accessible for non-experts or use in bioinformatics pipelines. Here we present CSM-Potential, a geometric deep learning approach to identify regions of a protein surface that are likely to mediate protein-protein and protein-ligand interactions in order to provide a link between 3D structure and biological function. Our method has shown robust performance, outperforming existing methods for both predictive tasks. By assessing the performance of CSM-Potential on independent blind tests, we show that our method was able to achieve ROC AUC values of up to 0.81 for the identification of potential protein-protein binding sites, and up to 0.96 accuracy on biological ligand classification. Our method is freely available as a user-friendly and easy-to-use web server and API at http://biosig.unimelb.edu.au/csm_potential.
- Published
- 2022
10. CSM-peptides: A computational approach to rapid identification of therapeutic peptides
- Author
-
Rodrigues, CHM, Garg, A, Keizer, D, Pires, DE, Ascher, DB, Rodrigues, CHM, Garg, A, Keizer, D, Pires, DE, and Ascher, DB
- Abstract
Peptides are attractive alternatives for the development of new therapeutic strategies due to their versatility and low complexity of synthesis. Increasing interest in these molecules has led to the creation of large collections of experimentally characterized therapeutic peptides, which greatly contributes to development of data-driven computational approaches. Here we propose CSM-peptides, a novel machine learning method for rapid identification of eight different types of therapeutic peptides: anti-angiogenic, anti-bacterial, anti-cancer, anti-inflammatory, anti-viral, cell-penetrating, quorum sensing, and surface binding. Our method has shown to outperform existing approaches, achieving an AUC of up to 0.92 on independent blind tests, and consistent performance on cross-validation. We anticipate CSM-peptides to be of great value in helping screening large libraries to identify novel peptides with therapeutic potential and have made it freely available as a user-friendly web server and Application Programming Interface at https://biosig.lab.uq.edu.au/csm_peptides.
- Published
- 2022
11. kinCSM: Using graph-based signatures to predict small molecule CDK2 inhibitors
- Author
-
Zhou, Y, Al-Jarf, R, Alavi, A, Thanh, BN, Rodrigues, CHM, Pires, DE, Ascher, DB, Zhou, Y, Al-Jarf, R, Alavi, A, Thanh, BN, Rodrigues, CHM, Pires, DE, and Ascher, DB
- Abstract
Protein phosphorylation acts as an essential on/off switch in many cellular signaling pathways. This has led to ongoing interest in targeting kinases for therapeutic intervention. Computer-aided drug discovery has been proven a useful and cost-effective approach for facilitating prioritization and enrichment of screening libraries, but limited effort has been devoted providing insights on what makes a potent kinase inhibitor. To fill this gap, here we developed kinCSM, an integrative computational tool capable of accurately identifying potent cyclin-dependent kinase 2 (CDK2) inhibitors, quantitatively predicting CDK2 ligand-kinase inhibition constants (pKi ) and classifying different types of inhibitors based on their favorable binding modes. kinCSM predictive models were built using supervised learning and leveraged the concept of graph-based signatures to capture both physicochemical properties and geometry properties of small molecules. CDK2 inhibitors were accurately identified with Matthew's Correlation Coefficients (MCC) of up to 0.74, and inhibition constants predicted with Pearson's correlation of up to 0.76, both with consistent performances of 0.66 and 0.68 on a nonredundant blind test, respectively. kinCSM was also able to identify the potential type of inhibition for a given molecule, achieving MCC of up to 0.80 on cross-validation and 0.73 on the blind test. Analyzing the molecular composition of revealed enriched chemical fragments in CDK2 inhibitors and different types of inhibitors, which provides insights into the molecular mechanisms behind ligand-kinase interactions. kinCSM will be an invaluable tool to guide future kinase drug discovery. To aid the fast and accurate screening of CDK2 inhibitors, kinCSM is freely available at https://biosig.lab.uq.edu.au/kin_csm/.
- Published
- 2022
12. HGDiscovery: An online tool providing functional and phenotypic information on novel variants of homogentisate 1,2- dioxigenase
- Author
-
Karmakar, M, Cicaloni, V, Rodrigues, CHM, Spiga, O, Santucci, A, Ascher, DB, Karmakar, M, Cicaloni, V, Rodrigues, CHM, Spiga, O, Santucci, A, and Ascher, DB
- Abstract
Alkaptonuria (AKU), a rare genetic disorder, is characterized by the accumulation of homogentisic acid (HGA) in the body. Affected individuals lack functional levels of an enzyme required to breakdown HGA. Mutations in the homogentisate 1,2-dioxygenase (HGD) gene cause AKU and they are responsible for deficient levels of functional HGD, which, in turn, leads to excess levels of HGA. Although HGA is rapidly cleared from the body by the kidneys, in the long term it starts accumulating in various tissues, especially cartilage. Over time (rarely before adulthood), it eventually changes the color of affected tissue to slate blue or black. Here we report a comprehensive mutation analysis of 111 pathogenic and 190 non-pathogenic HGD missense mutations using protein structural information. Using our comprehensive suite of graph-based signature methods, mCSM complemented with sequence-based tools, we studied the functional and molecular consequences of each mutation on protein stability, interaction and evolutionary conservation. The scores generated from the structure and sequence-based tools were used to train a supervised machine learning algorithm with 89% accuracy. The empirical classifier was used to generate the variant phenotype for novel HGD missense mutations. All this information is deployed as a user friendly freely available web server called HGDiscovery (https://biosig.lab.uq.edu.au/hgdiscovery/).
- Published
- 2022
13. A structural biology community assessment of AlphaFold2 applications.
- Author
-
Akdel, M, Pires, DEV, Pardo, EP, Jänes, J, Zalevsky, AO, Mészáros, B, Bryant, P, Good, LL, Laskowski, RA, Pozzati, G, Shenoy, A, Zhu, W, Kundrotas, P, Serra, VR, Rodrigues, CHM, Dunham, AS, Burke, D, Borkakoti, N, Velankar, S, Frost, A, Basquin, J, Lindorff-Larsen, K, Bateman, A, Kajava, AV, Valencia, A, Ovchinnikov, S, Durairaj, J, Ascher, DB, Thornton, JM, Davey, NE, Stein, A, Elofsson, A, Croll, TI, Beltrao, P, Akdel, M, Pires, DEV, Pardo, EP, Jänes, J, Zalevsky, AO, Mészáros, B, Bryant, P, Good, LL, Laskowski, RA, Pozzati, G, Shenoy, A, Zhu, W, Kundrotas, P, Serra, VR, Rodrigues, CHM, Dunham, AS, Burke, D, Borkakoti, N, Velankar, S, Frost, A, Basquin, J, Lindorff-Larsen, K, Bateman, A, Kajava, AV, Valencia, A, Ovchinnikov, S, Durairaj, J, Ascher, DB, Thornton, JM, Davey, NE, Stein, A, Elofsson, A, Croll, TI, and Beltrao, P
- Abstract
Most proteins fold into 3D structures that determine how they function and orchestrate the biological processes of the cell. Recent developments in computational methods for protein structure predictions have reached the accuracy of experimentally determined models. Although this has been independently verified, the implementation of these methods across structural-biology applications remains to be tested. Here, we evaluate the use of AlphaFold2 (AF2) predictions in the study of characteristic structural elements; the impact of missense variants; function and ligand binding site predictions; modeling of interactions; and modeling of experimental structural data. For 11 proteomes, an average of 25% additional residues can be confidently modeled when compared with homology modeling, identifying structural features rarely seen in the Protein Data Bank. AF2-based predictions of protein disorder and complexes surpass dedicated tools, and AF2 models can be used across diverse applications equally well compared with experimentally determined structures, when the confidence metrics are critically considered. In summary, we find that these advances are likely to have a transformative impact in structural biology and broader life-science research.
- Published
- 2022
14. A structural biology community assessment of AlphaFold 2 applications
- Author
-
Pires Dev, Janet M. Thornton, Kundrotas P, Roman A. Laskowski, Jänes J, Tristan I. Croll, Rodrigues Chm, Mehmet Akdel, Sameer Velankar, Bryant P, Alistair Dunham, Durairaj J, Amelie Stein, Wensi Zhu, David F. Burke, Gabriele Pozzati, Norman E. Davey, Arthur O. Zalevsky, Alfonso Valencia, Porta Pardo E, Shenoy A, Liam Good, Sergey Ovchinnikov, Arne Elofsson, Kresten Lindorff-Larsen, Ruiz Serra, Pedro Beltrao, Bálint Mészáros, Adam Frost, David B. Ascher, and Neera Borkakoti
- Subjects
Science research ,Protein structure ,Structural biology ,Computer science ,Protein Data Bank (RCSB PDB) ,Computational biology - Abstract
Most proteins fold into 3D structures that determine how they function and orchestrate the biological processes of the cell. Recent developments in computational methods have led to protein structure predictions that have reached the accuracy of experimentally determined models. While this has been independently verified, the implementation of these methods across structural biology applications remains to be tested. Here, we evaluate the use of AlphaFold 2 (AF2) predictions in the study of characteristic structural elements; the impact of missense variants; function and ligand binding site predictions; modelling of interactions; and modelling of experimental structural data. For 11 proteomes, an average of 25% additional residues can be confidently modelled when compared to homology modelling, identifying structural features rarely seen in the PDB. AF2-based predictions of protein disorder and protein complexes surpass state-of-the-art tools and AF2 models can be used across diverse applications equally well compared to experimentally determined structures, when the confidence metrics are critically considered. In summary, we find that these advances are likely to have a transformative impact in structural biology and broader life science research.
- Published
- 2021
- Full Text
- View/download PDF
15. Exploring the structural distribution of genetic variation in SARS-CoV-2 with the COVID-3D online resource (vol 52, pg 999, 2020)
- Author
-
Portelli, S, Olshansky, M, Rodrigues, CHM, D'Souza, EN, Myung, Y, Silk, M, Alavi, A, Pires, DEV, Ascher, DB, Portelli, S, Olshansky, M, Rodrigues, CHM, D'Souza, EN, Myung, Y, Silk, M, Alavi, A, Pires, DEV, and Ascher, DB
- Published
- 2021
16. mmCSM-PPI: predicting the effects of multiple point mutations on protein-protein interactions
- Author
-
Rodrigues, CHM, Pires, DE, Ascher, DB, Rodrigues, CHM, Pires, DE, and Ascher, DB
- Abstract
Protein-protein interactions play a crucial role in all cellular functions and biological processes and mutations leading to their disruption are enriched in many diseases. While a number of computational methods to assess the effects of variants on protein-protein binding affinity have been proposed, they are in general limited to the analysis of single point mutations and have been shown to perform poorly on independent test sets. Here, we present mmCSM-PPI, a scalable and effective machine learning model for accurately assessing changes in protein-protein binding affinity caused by single and multiple missense mutations. We expanded our well-established graph-based signatures in order to capture physicochemical and geometrical properties of multiple wild-type residue environments and integrated them with substitution scores and dynamics terms from normal mode analysis. mmCSM-PPI was able to achieve a Pearson's correlation of up to 0.75 (RMSE = 1.64 kcal/mol) under 10-fold cross-validation and 0.70 (RMSE = 2.06 kcal/mol) on a non-redundant blind test, outperforming existing methods. Our method is freely available as a user-friendly and easy-to-use web server and API at http://biosig.unimelb.edu.au/mmcsm_ppi.
- Published
- 2021
17. MTR3D: identifying regions within protein tertiary structures under purifying selection
- Author
-
Silk, M, Pires, DE, Rodrigues, CHM, D'Souza, EN, Olshansky, M, Thorne, N, Ascher, DB, Silk, M, Pires, DE, Rodrigues, CHM, D'Souza, EN, Olshansky, M, Thorne, N, and Ascher, DB
- Abstract
The identification of disease-causal variants is non-trivial. By mapping population variation from over 448,000 exome and genome sequences to over 81,000 experimental structures and homology models of the human proteome, we have calculated both regional intolerance to missense variation (Missense Tolerance Ratio, MTR), using a sliding window of 21-41 codons, and introduce a new 3D spatial intolerance to missense variation score (3D Missense Tolerance Ratio, MTR3D), using spheres of 5-8 Å. We show that the MTR3D is less biased by regions with limited data and more accurately identifies regions under purifying selection than estimates relying on the sequence alone. Intolerant regions were highly enriched for both ClinVar pathogenic and COSMIC somatic missense variants (Mann-Whitney U test P < 2.2 × 10-16). Further, we combine sequence- and spatial-based scores to generate a consensus score, MTRX, which distinguishes pathogenic from benign variants more accurately than either score separately (AUC = 0.85). The MTR3D server enables easy visualisation of population variation, MTR, MTR3D and MTRX scores across the entire gene and protein structure for >17,000 human genes and >42,000 alternative alternate transcripts, including both Ensembl and RefSeq transcripts. MTR3D is freely available by user-friendly web-interface and API at http://biosig.unimelb.edu.au/mtr3d/.
- Published
- 2021
18. mCSM-membrane: predicting the effects of mutations on transmembrane proteins.
- Author
-
Pires, DEV, Rodrigues, CHM, Ascher, DB, Pires, DEV, Rodrigues, CHM, and Ascher, DB
- Abstract
Significant efforts have been invested into understanding and predicting the molecular consequences of mutations in protein coding regions, however nearly all approaches have been developed using globular, soluble proteins. These methods have been shown to poorly translate to studying the effects of mutations in membrane proteins. To fill this gap, here we report, mCSM-membrane, a user-friendly web server that can be used to analyse the impacts of mutations on membrane protein stability and the likelihood of them being disease associated. mCSM-membrane derives from our well-established mutation modelling approach that uses graph-based signatures to model protein geometry and physicochemical properties for supervised learning. Our stability predictor achieved correlations of up to 0.72 and 0.67 (on cross validation and blind tests, respectively), while our pathogenicity predictor achieved a Matthew's Correlation Coefficient (MCC) of up to 0.77 and 0.73, outperforming previously described methods in both predicting changes in stability and in identifying pathogenic variants. mCSM-membrane will be an invaluable and dedicated resource for investigating the effects of single-point mutations on membrane proteins through a freely available, user friendly web server at http://biosig.unimelb.edu.au/mcsm_membrane.
- Published
- 2020
19. COVID-3D: An online resource to explore the structural distribution of genetic variation in SARS-CoV-2 and its implication on therapeutic development
- Author
-
Portelli, S, Olshansky, M, Rodrigues, CHM, D’Souza, E, Myung, Y, Silk, M, Alavi, A, Pires, DEV, Ascher, D, Portelli, S, Olshansky, M, Rodrigues, CHM, D’Souza, E, Myung, Y, Silk, M, Alavi, A, Pires, DEV, and Ascher, D
- Abstract
SUMMARY
The emergence of the COVID-19 pandemic has spurred a global rush to uncover basic biological mechanisms, to inform effective vaccine and drug development. Despite viral novelty, global sequencing efforts have already identified genomic variation across isolates. To enable easy exploration and spatial visualization of the potential implications of SARS-CoV-2 mutations on infection, host immunity and drug development we have developed COVID-3D ( http://biosig.unimelb.edu.au/covid3d/ ).- Published
- 2020
20. Structure guided prediction of Pyrazinamide resistance mutations in pncA
- Author
-
Karmakar, M, Rodrigues, CHM, Horan, K, Denholm, JT, Ascher, DB, Karmakar, M, Rodrigues, CHM, Horan, K, Denholm, JT, and Ascher, DB
- Abstract
Pyrazinamide plays an important role in tuberculosis treatment; however, its use is complicated by side-effects and challenges with reliable drug susceptibility testing. Resistance to pyrazinamide is largely driven by mutations in pyrazinamidase (pncA), responsible for drug activation, but genetic heterogeneity has hindered development of a molecular diagnostic test. We proposed to use information on how variants were likely to affect the 3D structure of pncA to identify variants likely to lead to pyrazinamide resistance. We curated 610 pncA mutations with high confidence experimental and clinical information on pyrazinamide susceptibility. The molecular consequences of each mutation on protein stability, conformation, and interactions were computationally assessed using our comprehensive suite of graph-based signature methods, mCSM. The molecular consequences of the variants were used to train a classifier with an accuracy of 80%. Our model was tested against internationally curated clinical datasets, achieving up to 85% accuracy. Screening of 600 Victorian clinical isolates identified a set of previously unreported variants, which our model had a 71% agreement with drug susceptibility testing. Here, we have shown the 3D structure of pncA can be used to accurately identify pyrazinamide resistance mutations. SUSPECT-PZA is freely available at: http://biosig.unimelb.edu.au/suspect_pza/.
- Published
- 2020
21. Empirical ways to identify novel Bedaquiline resistance mutations in AtpE
- Author
-
Mokrousov, I, Karmakar, M, Rodrigues, CHM, Holt, KE, Dunstan, SJ, Denholm, J, Ascher, DB, Mokrousov, I, Karmakar, M, Rodrigues, CHM, Holt, KE, Dunstan, SJ, Denholm, J, and Ascher, DB
- Abstract
Clinical resistance against Bedaquiline, the first new anti-tuberculosis compound with a novel mechanism of action in over 40 years, has already been detected in Mycobacterium tuberculosis. As a new drug, however, there is currently insufficient clinical data to facilitate reliable and timely identification of genomic determinants of resistance. Here we investigate the structural basis for M. tuberculosis associated bedaquiline resistance in the drug target, AtpE. Together with the 9 previously identified resistance-associated variants in AtpE, 54 non-resistance-associated mutations were identified through comparisons of bedaquiline susceptibility across 23 different mycobacterial species. Computational analysis of the structural and functional consequences of these variants revealed that resistance associated variants were mainly localized at the drug binding site, disrupting key interactions with bedaquiline leading to reduced binding affinity. This was used to train a supervised predictive algorithm, which accurately identified likely resistance mutations (93.3% accuracy). Application of this model to circulating variants present in the Asia-Pacific region suggests that current circulating variants are likely to be susceptible to bedaquiline. We have made this model freely available through a user-friendly web interface called SUSPECT-BDQ, StrUctural Susceptibility PrEdiCTion for bedaquiline (http://biosig.unimelb.edu.au/suspect_bdq/). This tool could be useful for the rapid characterization of novel clinical variants, to help guide the effective use of bedaquiline, and to minimize the spread of clinical resistance.
- Published
- 2019
22. mCSM-PPI2: predicting the effects of mutations on protein-protein interactions
- Author
-
Rodrigues, CHM, Myung, Y, Pires, DEV, Ascher, DB, Rodrigues, CHM, Myung, Y, Pires, DEV, and Ascher, DB
- Abstract
Protein-protein Interactions are involved in most fundamental biological processes, with disease causing mutations enriched at their interfaces. Here we present mCSM-PPI2, a novel machine learning computational tool designed to more accurately predict the effects of missense mutations on protein-protein interaction binding affinity. mCSM-PPI2 uses graph-based structural signatures to model effects of variations on the inter-residue interaction network, evolutionary information, complex network metrics and energetic terms to generate an optimised predictor. We demonstrate that our method outperforms previous methods, ranking first among 26 others on CAPRI blind tests. mCSM-PPI2 is freely available as a user friendly webserver at http://biosig.unimelb.edu.au/mcsm_ppi2/.
- Published
- 2019
23. Combating mutations in genetic disease and drug resistance: understanding molecular mechanisms to guide drug design
- Author
-
Albanaz, ATS, Rodrigues, CHM, Pires, DEV, Ascher, DB, Ascher, David [0000-0003-2948-2413], and Apollo - University of Cambridge Repository
- Subjects
mutational analysis ,drug resistance ,genetic diseases ,drug design ,cancer ,genotype-phenotype association ,molecular mechanism - Abstract
INTRODUCTION: Mutations introduce diversity into genomes, leading to selective changes and driving evolution. These changes have contributed to the emergence of many of the current major health concerns of the 21st century, from the development of genetic diseases and cancers to the rise and spread of drug resistance. The experimental systematic testing of all mutations in a system of interest is impractical and not cost-effective, which has created interest in the development of computational tools to understand the molecular consequences of mutations to aid and guide rational experimentation. AREAS COVERED: Here, the authors discuss the recent development of computational methods to understand the effects of coding mutations to protein function and interactions, particularly in the context of the 3D structure of the protein. EXPERT OPINION: While significant progress has been made in terms of innovative tools to understand and quantify the different range of effects in which a mutation or a set of mutations can give rise to a phenotype, a great gap still exists when integrating these predictions and drawing causality conclusions linking variants. This often requires a detailed understanding of the system being perturbed. However, as part of the drug development process it can be used preemptively in a similar fashion to pharmacokinetics predictions, to guide development of therapeutics to help guide the design and analysis of clinical trials, patient treatment and public health policy strategies.
- Published
- 2017
- Full Text
- View/download PDF
24. DynaMut: predicting the impact of mutations on protein conformation, flexibility and stability
- Author
-
Rodrigues, CHM, Pires, DE, Ascher, DB, Rodrigues, CHM, Pires, DE, and Ascher, DB
- Abstract
Proteins are highly dynamic molecules, whose function is intrinsically linked to their molecular motions. Despite the pivotal role of protein dynamics, their computational simulation cost has led to most structure-based approaches for assessing the impact of mutations on protein structure and function relying upon static structures. Here we present DynaMut, a web server implementing two distinct, well established normal mode approaches, which can be used to analyze and visualize protein dynamics by sampling conformations and assess the impact of mutations on protein dynamics and stability resulting from vibrational entropy changes. DynaMut integrates our graph-based signatures along with normal mode dynamics to generate a consensus prediction of the impact of a mutation on protein stability. We demonstrate our approach outperforms alternative approaches to predict the effects of mutations on protein stability and flexibility (P-value < 0.001), achieving a correlation of up to 0.70 on blind tests. DynaMut also provides a comprehensive suite for protein motion and flexibility analysis and visualization via a freely available, user friendly web server at http://biosig.unimelb.edu.au/dynamut/.
- Published
- 2018
25. Kinact: a computational approach for predicting activating missense mutations in protein kinases
- Author
-
Rodrigues, CHM, Ascher, DB, Pires, DE, Rodrigues, CHM, Ascher, DB, and Pires, DE
- Abstract
Protein phosphorylation is tightly regulated due to its vital role in many cellular processes. While gain of function mutations leading to constitutive activation of protein kinases are known to be driver events of many cancers, the identification of these mutations has proven challenging. Here we present Kinact, a novel machine learning approach for predicting kinase activating missense mutations using information from sequence and structure. By adapting our graph-based signatures, Kinact represents both structural and sequence information, which are used as evidence to train predictive models. We show the combination of structural and sequence features significantly improved the overall accuracy compared to considering either primary or tertiary structure alone, highlighting their complementarity. Kinact achieved a precision of 87% and 94% and Area Under ROC Curve of 0.89 and 0.92 on 10-fold cross-validation, and on blind tests, respectively, outperforming well established tools (P < 0.01). We further show that Kinact performs equally well on homology models built using templates with sequence identity as low as 33%. Kinact is freely available as a user-friendly web server at http://biosig.unimelb.edu.au/kinact/.
- Published
- 2018
26. Assessing the predicted impact of single amino acid substitutions in MAPK proteins for CAGI6 challenges.
- Author
-
Turina P, Petrosino M, Enriquez Sandoval CA, Novak L, Pasquo A, Alexov E, Alladin MA, Ascher DB, Babbi G, Bakolitsa C, Casadio R, Cheng J, Fariselli P, Folkman L, Kamandula A, Katsonis P, Li M, Li D, Lichtarge O, Mahmud S, Martelli PL, Pal D, Panday SK, Pires DEV, Portelli S, Pucci F, Rodrigues CHM, Rooman M, Savojardo C, Schwersensky M, Shen Y, Strokach AV, Sun Y, Woo J, Radivojac P, Brenner SE, Chiaraluce R, Consalvi V, and Capriotti E
- Abstract
New thermodynamic and functional studies have been recently conducted to evaluate the impact of amino acid substitutions on the Mitogen Activated Protein Kinases 1 and 3 (MAPK1/3). The Critical Assessment of Genome Interpretation (CAGI) data provider, at Sapienza University of Rome, measured the unfolding free energy and the enzymatic activity of a set of variants (MAPK challenge dataset). Thermodynamic measurements for the denaturant-induced equilibrium unfolding of the phosphorylated and unphosphorylated forms of the MAPKs were obtained by monitoring the far-UV circular dichroism and intrinsic fluorescence changes as a function of denaturant concentration. These values have been used to calculate the change in unfolding free energy between the variant and wild-type proteins at zero concentration of denaturant ( Δ Δ G H 2 O ). The enzymatic activity of the phosphorylated MAPKs variants was also measured using Chelation-Enhanced Fluorescence to monitor the phosphorylation of a peptide substrate. The MAPK challenge dataset, composed of a total of 23 single amino acid substitutions (11 and 12 for MAPK1 and MAPK3, respectively), was used to assess the effectiveness of the computational methods in predicting the Δ Δ G H 2 O values, associated with the variants, and categorize them as destabilizing and not destabilizing. The data on the enzymatic activity of the MAPKs mutants were used to assess the performance of the methods for predicting the functional impact of the variants. For the sixth edition of CAGI, thirteen independent research groups from four continents (Asia, Australia, Europe and North America) submitted > 80 sets of predictions, obtained from different approaches. In this manuscript, we summarized the results of our assessment to highlight the possible limitations of the available algorithms., Competing Interests: Declarations. Conflict of interest: The authors declare no competing interests., (© 2025. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.)
- Published
- 2025
- Full Text
- View/download PDF
27. CSM-Potential2: A comprehensive deep learning platform for the analysis of protein interacting interfaces.
- Author
-
Rodrigues CHM and Ascher DB
- Subjects
- Binding Sites, Ligands, Computational Biology methods, Protein Conformation, Protein Interaction Mapping methods, Humans, Protein Interaction Domains and Motifs, Databases, Protein, Models, Molecular, Deep Learning, Protein Binding, Proteins chemistry, Proteins metabolism, Software
- Abstract
Proteins are molecular machinery that participate in virtually all essential biological functions within the cell, which are tightly related to their 3D structure. The importance of understanding protein structure-function relationship is highlighted by the exponential growth of experimental structures, which has been greatly expanded by recent breakthroughs in protein structure prediction, most notably RosettaFold, and AlphaFold2. These advances have prompted the development of several computational approaches that leverage these data sources to explore potential biological interactions. However, most methods are generally limited to analysis of single types of interactions, such as protein-protein or protein-ligand interactions, and their complexity limits the usability to expert users. Here we report CSM-Potential2, a deep learning platform for the analysis of binding interfaces on protein structures. In addition to prediction of protein-protein interactions binding sites and classification of biological ligands, our new platform incorporates prediction of interactions with nucleic acids at the residue level and allows for ligand transplantation based on sequence and structure similarity to experimentally determined structures. We anticipate our platform to be a valuable resource that provides easy access to a range of state-of-the-art methods to expert and non-expert users for the study of biological interactions. Our tool is freely available as an easy-to-use web server and API available at https://biosig.lab.uq.edu.au/csm_potential., (© 2023 The Authors. Proteins: Structure, Function, and Bioinformatics published by Wiley Periodicals LLC.)
- Published
- 2025
- Full Text
- View/download PDF
28. Assessing the predicted impact of single amino acid substitutions in calmodulin for CAGI6 challenges.
- Author
-
Turina P, Dal Cortivo G, Enriquez Sandoval CA, Alexov E, Ascher DB, Babbi G, Bakolitsa C, Casadio R, Fariselli P, Folkman L, Kamandula A, Katsonis P, Li D, Lichtarge O, Martelli PL, Panday SK, Pires DEV, Portelli S, Pucci F, Rodrigues CHM, Rooman M, Savojardo C, Schwersensky M, Shen Y, Strokach AV, Sun Y, Woo J, Radivojac P, Brenner SE, Dell'Orco D, and Capriotti E
- Abstract
Recent thermodynamic and functional studies have been conducted to evaluate the impact of amino acid substitutions on Calmodulin (CaM). The Critical Assessment of Genome Interpretation (CAGI) data provider at University of Verona (Italy) measured the melting temperature (T
m ) and the percentage of unfolding (%unfold) of a set of CaM variants (CaM challenge dataset). Thermodynamic measurements for the equilibrium unfolding of CaM were obtained by monitoring far-UV Circular Dichroism as a function of temperature. These measurements were used to determine the Tm and the percentage of protein remaining unfolded at the highest temperature. The CaM challenge dataset, comprising a total of 15 single amino acid substitutions, was used to evaluate the effectiveness of computational methods in predicting the Tm and unfolding percentages associated with the variants, and categorizing them as destabilizing or not. For the sixth edition of CAGI, nine independent research groups from four continents (Asia, Australia, Europe, and North America) submitted over 52 sets of predictions, derived from various approaches. In this manuscript, we summarize the results of our assessment to highlight the potential limitations of current algorithms and provide insights into the future development of more accurate prediction tools. By evaluating the thermodynamic stability of CaM variants, this study aims to enhance our understanding of the relationship between amino acid substitutions and protein stability, ultimately contributing to more accurate predictions of the effects of genetic variants., Competing Interests: Declarations. Conflict of interest: The authors declare no competing interests., (© 2024. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.)- Published
- 2024
- Full Text
- View/download PDF
29. EFG-CS: Predicting chemical shifts from amino acid sequences with protein structure prediction using machine learning and deep learning models.
- Author
-
Gu X, Myung Y, Rodrigues CHM, and Ascher DB
- Subjects
- Nuclear Magnetic Resonance, Biomolecular, Software, Protein Conformation, Amino Acid Sequence, Models, Molecular, Deep Learning, Proteins chemistry, Machine Learning
- Abstract
Nuclear magnetic resonance (NMR) crystallography is one of the main methods in structural biology for analyzing protein stereochemistry and structure. The chemical shift of the resonance frequency reflects the effect of the protons in a molecule producing distinct NMR signals in different chemical environments. Apprehending chemical shifts from NMR signals can be challenging since having an NMR structure does not necessarily provide all the required chemical shift information, making predictive models essential for accurately deducing chemical shifts, either from protein structures or, more ideally, directly from amino acid sequences. Here, we present EFG-CS, a web server that specializes in chemical shift prediction. EFG-CS employs a machine learning-based transfer prediction model for backbone atom chemical shift prediction, using ESMFold-predicted protein structures. Additionally, ESG-CS incorporates a graph neural network-based model to provide comprehensive side-chain atom chemical shift predictions. Our method demonstrated reliable performance in backbone atom prediction, achieving comparable accuracy levels with root mean square errors (RMSE) of 0.30 ppm for H, 0.22 ppm for Hα, 0.89 ppm for C, 0.89 ppm for Cα, 0.84 ppm for Cβ, and 1.69 ppm for N. Moreover, our approach also showed predictive capabilities in side-chain atom chemical shift prediction achieving RMSE values of 0.71 ppm for Hβ, 0.74-1.15 ppm for Hδ, and 0.58-0.94 ppm for Hγ, solely utilizing amino acid sequences without homology or feature curation. This work shows for the first time that generative AI protein models can predict NMR shifts nearly comparable to experimental models. This web server is freely available at https://biosig.lab.uq.edu.au/efg_cs, and the chemical shift prediction results can be downloaded in tabular format and visualized in 3D format., (© 2024 The Author(s). Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society.)
- Published
- 2024
- Full Text
- View/download PDF
30. MTR3D-AF2: Expanding the coverage of spatially derived missense tolerance scores across the human proteome using AlphaFold2.
- Author
-
Kovacs AS, Portelli S, Silk M, Rodrigues CHM, and Ascher DB
- Subjects
- Humans, Software, Models, Molecular, Proteins chemistry, Proteins genetics, Proteins metabolism, Databases, Protein, Proteome chemistry, Proteome genetics, Proteome analysis, Proteome metabolism, Mutation, Missense
- Abstract
The missense tolerance ratio (MTR) was developed as a novel approach to assess the deleteriousness of variants. Its three-dimensional successor, MTR3D, was demonstrated powerful at discriminating pathogenic from benign variants. However, its reliance on experimental structures and homologs limited its coverage of the proteome. We have now utilized AlphaFold2 models to develop MTR3D-AF2, which covers 89.31% of proteins and 85.39% of residues across the human proteome. This work has improved MTR3D's ability to distinguish clinically established pathogenic from benign variants. MTR3D-AF2 is freely available as an interactive web server at https://biosig.lab.uq.edu.au/mtr3daf2/., (© 2024 The Author(s). Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society.)
- Published
- 2024
- Full Text
- View/download PDF
31. piscesCSM: prediction of anticancer synergistic drug combinations.
- Author
-
AlJarf R, Rodrigues CHM, Myung Y, Pires DEV, and Ascher DB
- Abstract
While drug combination therapies are of great importance, particularly in cancer treatment, identifying novel synergistic drug combinations has been a challenging venture. Computational methods have emerged in this context as a promising tool for prioritizing drug combinations for further evaluation, though they have presented limited performance, utility, and interpretability. Here, we propose a novel predictive tool, piscesCSM, that leverages graph-based representations to model small molecule chemical structures to accurately predict drug combinations with favourable anticancer synergistic effects against one or multiple cancer cell lines. Leveraging these insights, we developed a general supervised machine learning model to guide the prediction of anticancer synergistic drug combinations in over 30 cell lines. It achieved an area under the receiver operating characteristic curve (AUROC) of up to 0.89 on independent non-redundant blind tests, outperforming state-of-the-art approaches on both large-scale oncology screening data and an independent test set generated by AstraZeneca (with more than a 16% improvement in predictive accuracy). Moreover, by exploring the interpretability of our approach, we found that simple physicochemical properties and graph-based signatures are predictive of chemotherapy synergism. To provide a simple and integrated platform to rapidly screen potential candidate pairs with favourable synergistic anticancer effects, we made piscesCSM freely available online at https://biosig.lab.uq.edu.au/piscescsm/ as a web server and API. We believe that our predictive tool will provide a valuable resource for optimizing and augmenting combinatorial screening libraries to identify effective and safe synergistic anticancer drug combinations. SCIENTIFIC CONTRIBUTION: This work proposes piscesCSM, a machine-learning-based framework that relies on well-established graph-based representations of small molecules to identify and provide better predictive accuracy of syngenetic drug combinations. Our model, piscesCSM, shows that combining physiochemical properties with graph-based signatures can outperform current architectures on classification prediction tasks. Furthermore, implementing our tool as a web server offers a user-friendly platform for researchers to screen for potential synergistic drug combinations with favorable anticancer effects against one or multiple cancer cell lines., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
32. DDMut-PPI: predicting effects of mutations on protein-protein interactions using graph-based deep learning.
- Author
-
Zhou Y, Myung Y, Rodrigues CHM, and Ascher DB
- Subjects
- Protein Binding, Mutation, Software, Protein Interaction Maps genetics, Humans, Proteins genetics, Proteins metabolism, Proteins chemistry, Point Mutation, Deep Learning, Protein Interaction Mapping methods
- Abstract
Protein-protein interactions (PPIs) play a vital role in cellular functions and are essential for therapeutic development and understanding diseases. However, current predictive tools often struggle to balance efficiency and precision in predicting the effects of mutations on these complex interactions. To address this, we present DDMut-PPI, a deep learning model that efficiently and accurately predicts changes in PPI binding free energy upon single and multiple point mutations. Building on the robust Siamese network architecture with graph-based signatures from our prior work, DDMut, the DDMut-PPI model was enhanced with a graph convolutional network operated on the protein interaction interface. We used residue-specific embeddings from ProtT5 protein language model as node features, and a variety of molecular interactions as edge features. By integrating evolutionary context with spatial information, this framework enables DDMut-PPI to achieve a robust Pearson correlation of up to 0.75 (root mean squared error: 1.33 kcal/mol) in our evaluations, outperforming most existing methods. Importantly, the model demonstrated consistent performance across mutations that increase or decrease binding affinity. DDMut-PPI offers a significant advancement in the field and will serve as a valuable tool for researchers probing the complexities of protein interactions. DDMut-PPI is freely available as a web server and an application programming interface at https://biosig.lab.uq.edu.au/ddmut_ppi., (© The Author(s) 2024. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2024
- Full Text
- View/download PDF
33. Exploring the effects of missense mutations on protein thermodynamics through structure-based approaches: findings from the CAGI6 challenges.
- Author
-
Rodrigues CHM, Portelli S, and Ascher DB
- Abstract
Missense mutations are known contributors to diverse genetic disorders, due to their subtle, single amino acid changes imparted on the resultant protein. Because of this, understanding the impact of these mutations on protein stability and function is crucial for unravelling disease mechanisms and developing targeted therapies. The Critical Assessment of Genome Interpretation (CAGI) provides a valuable platform for benchmarking state-of-the-art computational methods in predicting the impact of disease-related mutations on protein thermodynamics. Here we report the performance of our comprehensive platform of structure-based computational approaches to evaluate mutations impacting protein structure and function on 3 challenges from CAGI6: Calmodulin, MAPK1 and MAPK3. Our stability predictors have achieved correlations of up to 0.74 and AUCs of 1 when predicting changes in ΔΔG for MAPK1 and MAPK3, respectively, and AUC of up to 0.75 in the Calmodulin challenge. Overall, our study highlights the importance of structure-based approaches in understanding the effects of missense mutations on protein thermodynamics. The results obtained from the CAGI6 challenges contribute to the ongoing efforts to enhance our understanding of disease mechanisms and facilitate the development of personalised medicine approaches., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
34. DDMut: predicting effects of mutations on protein stability using deep learning.
- Author
-
Zhou Y, Pan Q, Pires DEV, Rodrigues CHM, and Ascher DB
- Subjects
- Mutation, Point Mutation, Deep Learning, Protein Stability, Software, Proteins chemistry, Proteins genetics
- Abstract
Understanding the effects of mutations on protein stability is crucial for variant interpretation and prioritisation, protein engineering, and biotechnology. Despite significant efforts, community assessments of predictive tools have highlighted ongoing limitations, including computational time, low predictive power, and biased predictions towards destabilising mutations. To fill this gap, we developed DDMut, a fast and accurate siamese network to predict changes in Gibbs Free Energy upon single and multiple point mutations, leveraging both forward and hypothetical reverse mutations to account for model anti-symmetry. Deep learning models were built by integrating graph-based representations of the localised 3D environment, with convolutional layers and transformer encoders. This combination better captured the distance patterns between atoms by extracting both short-range and long-range interactions. DDMut achieved Pearson's correlations of up to 0.70 (RMSE: 1.37 kcal/mol) on single point mutations, and 0.70 (RMSE: 1.84 kcal/mol) on double/triple mutants, outperforming most available methods across non-redundant blind test sets. Importantly, DDMut was highly scalable and demonstrated anti-symmetric performance on both destabilising and stabilising mutations. We believe DDMut will be a useful platform to better understand the functional consequences of mutations, and guide rational protein engineering. DDMut is freely available as a web server and API at https://biosig.lab.uq.edu.au/ddmut., (© The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2023
- Full Text
- View/download PDF
35. LEGO-CSM: a tool for functional characterization of proteins.
- Author
-
Nguyen TB, de Sá AGC, Rodrigues CHM, Pires DEV, and Ascher DB
- Subjects
- Humans, Software, Proteins chemistry
- Abstract
Motivation: With the development of sequencing techniques, the discovery of new proteins significantly exceeds the human capacity and resources for experimentally characterizing protein functions. Localization, EC numbers, and GO terms with the structure-based Cutoff Scanning Matrix (LEGO-CSM) is a comprehensive web-based resource that fills this gap by leveraging the well-established and robust graph-based signatures to supervised learning models using both protein sequence and structure information to accurately model protein function in terms of Subcellular Localization, Enzyme Commission (EC) numbers, and Gene Ontology (GO) terms., Results: We show our models perform as well as or better than alternative approaches, achieving area under the receiver operating characteristic curve of up to 0.93 for subcellular localization, up to 0.93 for EC, and up to 0.81 for GO terms on independent blind tests., Availability and Implementation: LEGO-CSM's web server is freely available at https://biosig.lab.uq.edu.au/lego_csm. In addition, all datasets used to train and test LEGO-CSM's models can be downloaded at https://biosig.lab.uq.edu.au/lego_csm/data., (© The Author(s) 2023. Published by Oxford University Press.)
- Published
- 2023
- Full Text
- View/download PDF
36. CSM-Toxin: A Web-Server for Predicting Protein Toxicity.
- Author
-
Morozov V, Rodrigues CHM, and Ascher DB
- Abstract
Biologics are one of the most rapidly expanding classes of therapeutics, but can be associated with a range of toxic properties. In small-molecule drug development, early identification of potential toxicity led to a significant reduction in clinical trial failures, however we currently lack robust qualitative rules or predictive tools for peptide- and protein-based biologics. To address this, we have manually curated the largest set of high-quality experimental data on peptide and protein toxicities, and developed CSM-Toxin, a novel in-silico protein toxicity classifier, which relies solely on the protein primary sequence. Our approach encodes the protein sequence information using a deep learning natural languages model to understand "biological" language, where residues are treated as words and protein sequences as sentences. The CSM-Toxin was able to accurately identify peptides and proteins with potential toxicity, achieving an MCC of up to 0.66 across both cross-validation and multiple non-redundant blind tests, outperforming other methods and highlighting the robust and generalisable performance of our model. We strongly believe the CSM-Toxin will serve as a valuable platform to minimise potential toxicity in the biologic development pipeline. Our method is freely available as an easy-to-use webserver.
- Published
- 2023
- Full Text
- View/download PDF
37. DockNet: high-throughput protein-protein interface contact prediction.
- Author
-
Williams NP, Rodrigues CHM, Truong J, Ascher DB, and Holien JK
- Subjects
- Humans, Proteome, Pharmacophore, Area Under Curve, Computational Biology, Neural Networks, Computer, Algorithms
- Abstract
Motivation: Over 300 000 protein-protein interaction (PPI) pairs have been identified in the human proteome and targeting these is fast becoming the next frontier in drug design. Predicting PPI sites, however, is a challenging task that traditionally requires computationally expensive and time-consuming docking simulations. A major weakness of modern protein docking algorithms is the inability to account for protein flexibility, which ultimately leads to relatively poor results., Results: Here, we propose DockNet, an efficient Siamese graph-based neural network method which predicts contact residues between two interacting proteins. Unlike other methods that only utilize a protein's surface or treat the protein structure as a rigid body, DockNet incorporates the entire protein structure and places no limits on protein flexibility during an interaction. Predictions are modeled at the residue level, based on a diverse set of input node features including residue type, surface accessibility, residue depth, secondary structure, pharmacophore and torsional angles. DockNet is comparable to current state-of-the-art methods, achieving an area under the curve (AUC) value of up to 0.84 on an independent test set (DB5), can be applied to a variety of different protein structures and can be utilized in situations where accurate unbound protein structures cannot be obtained., Availability and Implementation: DockNet is available at https://github.com/npwilliams09/docknet and an easy-to-use webserver at https://biosig.lab.uq.edu.au/docknet. All other data underlying this article are available in the article and in its online supplementary material., Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author(s) 2022. Published by Oxford University Press.)
- Published
- 2023
- Full Text
- View/download PDF
38. A structural biology community assessment of AlphaFold2 applications.
- Author
-
Akdel M, Pires DEV, Pardo EP, Jänes J, Zalevsky AO, Mészáros B, Bryant P, Good LL, Laskowski RA, Pozzati G, Shenoy A, Zhu W, Kundrotas P, Serra VR, Rodrigues CHM, Dunham AS, Burke D, Borkakoti N, Velankar S, Frost A, Basquin J, Lindorff-Larsen K, Bateman A, Kajava AV, Valencia A, Ovchinnikov S, Durairaj J, Ascher DB, Thornton JM, Davey NE, Stein A, Elofsson A, Croll TI, and Beltrao P
- Subjects
- Binding Sites, Proteins chemistry, Databases, Protein, Protein Conformation, Computational Biology methods, Furylfuramide
- Abstract
Most proteins fold into 3D structures that determine how they function and orchestrate the biological processes of the cell. Recent developments in computational methods for protein structure predictions have reached the accuracy of experimentally determined models. Although this has been independently verified, the implementation of these methods across structural-biology applications remains to be tested. Here, we evaluate the use of AlphaFold2 (AF2) predictions in the study of characteristic structural elements; the impact of missense variants; function and ligand binding site predictions; modeling of interactions; and modeling of experimental structural data. For 11 proteomes, an average of 25% additional residues can be confidently modeled when compared with homology modeling, identifying structural features rarely seen in the Protein Data Bank. AF2-based predictions of protein disorder and complexes surpass dedicated tools, and AF2 models can be used across diverse applications equally well compared with experimentally determined structures, when the confidence metrics are critically considered. In summary, we find that these advances are likely to have a transformative impact in structural biology and broader life-science research., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
39. kinCSM: Using graph-based signatures to predict small molecule CDK2 inhibitors.
- Author
-
Zhou Y, Al-Jarf R, Alavi A, Nguyen TB, Rodrigues CHM, Pires DEV, and Ascher DB
- Subjects
- Cyclin-Dependent Kinase 2 chemistry, Ligands, Drug Discovery, Protein Kinase Inhibitors pharmacology, Protein Kinase Inhibitors chemistry, Antineoplastic Agents chemistry
- Abstract
Protein phosphorylation acts as an essential on/off switch in many cellular signaling pathways. This has led to ongoing interest in targeting kinases for therapeutic intervention. Computer-aided drug discovery has been proven a useful and cost-effective approach for facilitating prioritization and enrichment of screening libraries, but limited effort has been devoted providing insights on what makes a potent kinase inhibitor. To fill this gap, here we developed kinCSM, an integrative computational tool capable of accurately identifying potent cyclin-dependent kinase 2 (CDK2) inhibitors, quantitatively predicting CDK2 ligand-kinase inhibition constants (pK
i ) and classifying different types of inhibitors based on their favorable binding modes. kinCSM predictive models were built using supervised learning and leveraged the concept of graph-based signatures to capture both physicochemical properties and geometry properties of small molecules. CDK2 inhibitors were accurately identified with Matthew's Correlation Coefficients (MCC) of up to 0.74, and inhibition constants predicted with Pearson's correlation of up to 0.76, both with consistent performances of 0.66 and 0.68 on a nonredundant blind test, respectively. kinCSM was also able to identify the potential type of inhibition for a given molecule, achieving MCC of up to 0.80 on cross-validation and 0.73 on the blind test. Analyzing the molecular composition of revealed enriched chemical fragments in CDK2 inhibitors and different types of inhibitors, which provides insights into the molecular mechanisms behind ligand-kinase interactions. kinCSM will be an invaluable tool to guide future kinase drug discovery. To aid the fast and accurate screening of CDK2 inhibitors, kinCSM is freely available at https://biosig.lab.uq.edu.au/kin_csm/., (© 2022 The Authors. Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society.)- Published
- 2022
- Full Text
- View/download PDF
40. CSM-peptides: A computational approach to rapid identification of therapeutic peptides.
- Author
-
Rodrigues CHM, Garg A, Keizer D, Pires DEV, and Ascher DB
- Subjects
- Anti-Inflammatory Agents, Computational Biology methods, Machine Learning, Peptides chemistry, Software
- Abstract
Peptides are attractive alternatives for the development of new therapeutic strategies due to their versatility and low complexity of synthesis. Increasing interest in these molecules has led to the creation of large collections of experimentally characterized therapeutic peptides, which greatly contributes to development of data-driven computational approaches. Here we propose CSM-peptides, a novel machine learning method for rapid identification of eight different types of therapeutic peptides: anti-angiogenic, anti-bacterial, anti-cancer, anti-inflammatory, anti-viral, cell-penetrating, quorum sensing, and surface binding. Our method has shown to outperform existing approaches, achieving an AUC of up to 0.92 on independent blind tests, and consistent performance on cross-validation. We anticipate CSM-peptides to be of great value in helping screening large libraries to identify novel peptides with therapeutic potential and have made it freely available as a user-friendly web server and Application Programming Interface at https://biosig.lab.uq.edu.au/csm_peptides., (© 2022 The Authors. Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society.)
- Published
- 2022
- Full Text
- View/download PDF
41. HGDiscovery: An online tool providing functional and phenotypic information on novel variants of homogentisate 1,2- dioxigenase.
- Author
-
Karmakar M, Cicaloni V, Rodrigues CHM, Spiga O, Santucci A, and Ascher DB
- Abstract
Alkaptonuria (AKU), a rare genetic disorder, is characterized by the accumulation of homogentisic acid (HGA) in the body. Affected individuals lack functional levels of an enzyme required to breakdown HGA. Mutations in the homogentisate 1,2-dioxygenase (HGD) gene cause AKU and they are responsible for deficient levels of functional HGD, which, in turn, leads to excess levels of HGA. Although HGA is rapidly cleared from the body by the kidneys, in the long term it starts accumulating in various tissues, especially cartilage. Over time (rarely before adulthood), it eventually changes the color of affected tissue to slate blue or black. Here we report a comprehensive mutation analysis of 111 pathogenic and 190 non-pathogenic HGD missense mutations using protein structural information. Using our comprehensive suite of graph-based signature methods, mCSM complemented with sequence-based tools, we studied the functional and molecular consequences of each mutation on protein stability, interaction and evolutionary conservation. The scores generated from the structure and sequence-based tools were used to train a supervised machine learning algorithm with 89% accuracy. The empirical classifier was used to generate the variant phenotype for novel HGD missense mutations. All this information is deployed as a user friendly freely available web server called HGDiscovery (https://biosig.lab.uq.edu.au/hgdiscovery/)., Competing Interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (© 2022 The Authors.)
- Published
- 2022
- Full Text
- View/download PDF
42. Structural landscapes of PPI interfaces.
- Author
-
Rodrigues CHM, Pires DEV, Blundell TL, and Ascher DB
- Subjects
- Binding Sites, Databases, Protein, Protein Binding, Proteins chemistry
- Abstract
Proteins are capable of highly specific interactions and are responsible for a wide range of functions, making them attractive in the pursuit of new therapeutic options. Previous studies focusing on overall geometry of protein-protein interfaces, however, concluded that PPI interfaces were generally flat. More recently, this idea has been challenged by their structural and thermodynamic characterisation, suggesting the existence of concave binding sites that are closer in character to traditional small-molecule binding sites, rather than exhibiting complete flatness. Here, we present a large-scale analysis of binding geometry and physicochemical properties of all protein-protein interfaces available in the Protein Data Bank. In this review, we provide a comprehensive overview of the protein-protein interface landscape, including evidence that even for overall larger, more flat interfaces that utilize discontinuous interacting regions, small and potentially druggable pockets are utilized at binding sites., (© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.)
- Published
- 2022
- Full Text
- View/download PDF
43. CSM-Potential: mapping protein interactions and biological ligands in 3D space using geometric deep learning.
- Author
-
Rodrigues CHM and Ascher DB
- Subjects
- Binding Sites, Ligands, Membrane Proteins, Protein Conformation, Deep Learning, Software, Protein Interaction Mapping methods
- Abstract
Recent advances in protein structural modelling have enabled the accurate prediction of the holo 3D structures of almost any protein, however protein function is intrinsically linked to the interactions it makes. While a number of computational approaches have been proposed to explore potential biological interactions, they have been limited to specific interactions, and have not been readily accessible for non-experts or use in bioinformatics pipelines. Here we present CSM-Potential, a geometric deep learning approach to identify regions of a protein surface that are likely to mediate protein-protein and protein-ligand interactions in order to provide a link between 3D structure and biological function. Our method has shown robust performance, outperforming existing methods for both predictive tasks. By assessing the performance of CSM-Potential on independent blind tests, we show that our method was able to achieve ROC AUC values of up to 0.81 for the identification of potential protein-protein binding sites, and up to 0.96 accuracy on biological ligand classification. Our method is freely available as a user-friendly and easy-to-use web server and API at http://biosig.unimelb.edu.au/csm_potential., (© The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2022
- Full Text
- View/download PDF
44. Applying Sodium Carbonate Extraction Mass Spectrometry to Investigate Defects in the Mitochondrial Respiratory Chain.
- Author
-
Robinson DRL, Hock DH, Muellner-Wong L, Kugapreethan R, Reljic B, Surgenor EE, Rodrigues CHM, Caruana NJ, and Stroud DA
- Abstract
Mitochondria are complex organelles containing 13 proteins encoded by mitochondrial DNA and over 1,000 proteins encoded on nuclear DNA. Many mitochondrial proteins are associated with the inner or outer mitochondrial membranes, either peripherally or as integral membrane proteins, while others reside in either of the two soluble mitochondrial compartments, the mitochondrial matrix and the intermembrane space. The biogenesis of the five complexes of the oxidative phosphorylation system are exemplars of this complexity. These large multi-subunit complexes are comprised of more than 80 proteins with both membrane integral and peripheral associations and require soluble, membrane integral and peripherally associated assembly factor proteins for their biogenesis. Mutations causing human mitochondrial disease can lead to defective complex assembly due to the loss or altered function of the affected protein and subsequent destabilization of its interactors. Here we couple sodium carbonate extraction with quantitative mass spectrometry (SCE-MS) to track changes in the membrane association of the mitochondrial proteome across multiple human knockout cell lines. In addition to identifying the membrane association status of over 840 human mitochondrial proteins, we show how SCE-MS can be used to understand the impacts of defective complex assembly on protein solubility, giving insights into how specific subunits and sub-complexes become destabilized., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2022 Robinson, Hock, Muellner-Wong, Kugapreethan, Reljic, Surgenor, Rodrigues, Caruana and Stroud.)
- Published
- 2022
- Full Text
- View/download PDF
45. pdCSM-PPI: Using Graph-Based Signatures to Identify Protein-Protein Interaction Inhibitors.
- Author
-
Rodrigues CHM, Pires DEV, and Ascher DB
- Subjects
- Protein Interaction Mapping, Machine Learning, Software
- Abstract
Protein-protein interactions are promising sites for development of selective drugs; however, they have generally been viewed as challenging targets. Molecules targeting protein-protein interactions tend to be larger and more lipophilic than other drug-like molecules, mimicking the properties of interacting interfaces. Here, we propose a machine learning approach that uses a graph-based representation of small molecules to guide identification of inhibitors modulating protein-protein interactions, pdCSM-PPI. This approach was applied to 21 different PPI targets. We developed interaction-specific models that were able to accurately identify active compounds achieving MCC and F1 scores up to 1, and Pearson's correlations up to 0.87, outperforming previous approaches. Using insights from these individual models, we developed a generic protein-protein interaction modulator predictive model, which accurately predicted IC50 with a Pearson's correlation of 0.64 on a low redundancy blind test. Importantly, we were able to accurately identify active from inactive compounds, achieving an AUC of 0.77 and sensitivity and specificity of 76% and 78%, respectively. We believe pdCSM-PPI will be an important tool to help guide more efficient screening of new PPI inhibitors; it is freely available as an easy-to-use web server and API at http://biosig.unimelb.edu.au/pdcsm_ppi.
- Published
- 2021
- Full Text
- View/download PDF
46. Machine Learning of ECG Waveforms to Improve Selection for Testing for Asymptomatic Left Ventricular Dysfunction.
- Author
-
Potter EL, Rodrigues CHM, Ascher DB, Abhayaratna WP, Sengupta PP, and Marwick TH
- Subjects
- Aged, Echocardiography, Electrocardiography, Female, Humans, Machine Learning, Male, Middle Aged, Predictive Value of Tests, Ventricular Dysfunction, Left diagnostic imaging
- Abstract
Objectives: The purpose of this study was to identify whether machine learning from processing of continuous wave transforms (CWTs) to provide an "energy waveform" electrocardiogram (ewECG) could be integrated with echocardiographic assessment of subclinical systolic and diastolic left ventricular dysfunction (LVD)., Background: Asymptomatic LVD has management implications, but routine echocardiography is not undertaken in subjects at risk of heart failure. Signal processing of the surface ECG with the use of CWT can identify abnormal myocardial relaxation., Methods: EwECG and echocardiography were undertaken in 398 participants at risk of heart failure (HF). Reduced global longitudinal strain (GLS ≤16%)), diastolic abnormalities (E/e' >15, left atrial enlargement with E/e' >10 or impaired relaxation) or LV hypertrophy defined LVD. EwECG feature selection and supervised machine-learning by random forest (RF) classifier was undertaken with 643 CWT-derived features and the ARIC (Atherosclerosis Risk In Communities) heart failure risk score., Results: The ARIC score and 18 CWT features were selected to build a RF predictive model for LVD in a training dataset (n = 287; 60% female, median age 71 [interquartile range: 68 to 74] years). Model performance was tested in an independent group (n = 111; 49% female, median age 61 years [59 to 66 years]), demonstrating 85% sensitivity and 72% specificity (area under the receiver-operating characteristic curve [AUC]: 0.83; 95% confidence interval [CI]: 0.74 to 0.92). With ARIC score removed, sensitivity was 88% and specificity, 70% (AUC: 0.78; 95% CI: 0.70 to 0.86). RF models for reduced GLS and diastolic abnormalities including similar features had sensitivities that were unsuitable for screening. Conventional candidates for LVD screening (ARIC score, N-terminal pro-B-type natriuretic peptide, and standard automated ECG analysis) had inferior discriminative ability. Integration of ewECG in screening of people at risk of HF would reduce need for echocardiography by 45% while missing 12% of LVD cases., Conclusions: Machine learning applied to ewECG is a sensitive screening test for LVD, and its integration into screening of patients at risk for HF would reduce the number of echocardiograms by almost one-half., Competing Interests: Funding Support and Author Disclosures The work was partially supported by a partnership grant (1149692) from the National Health and Medical Research Council, Canberra, the Ian Potter Foundation, Melbourne, and the Baker Heart and Diabetes Institute, Melbourne. Dr. Potter is supported by a Monash University postgraduate scholarship. All other authors have reported that they have no relationships relevant to the contents of this paper to disclose., (Copyright © 2021 American College of Cardiology Foundation. Published by Elsevier Inc. All rights reserved.)
- Published
- 2021
- Full Text
- View/download PDF
47. Structure-guided machine learning prediction of drug resistance mutations in Abelson 1 kinase.
- Author
-
Zhou Y, Portelli S, Pat M, Rodrigues CHM, Nguyen TB, Pires DEV, and Ascher DB
- Abstract
Kinases play crucial roles in cellular signalling and biological processes with their dysregulation associated with diseases, including cancers. Kinase inhibitors, most notably those targeting ABeLson 1 (ABL1) kinase in chronic myeloid leukemia, have had a significant impact on cancer survival, yet emergence of resistance mutations can reduce their effectiveness, leading to therapeutic failure. Limited effort, however, has been devoted to developing tools to accurately identify ABL1 resistance mutations, as well as providing insights into their molecular mechanisms. Here we investigated the structural basis of ABL1 mutations modulating binding affinity of eight FDA-approved drugs. We found mutations impair affinity of type I and type II inhibitors differently and used this insight to developed a novel web-based diagnostic tool, SUSPECT-ABL, to pre-emptively predict resistance profiles and binding free-energy changes (ΔΔ G ) of all possible ABL1 mutations against inhibitors with different binding modes. Resistance mutations in ABL1 were successfully identified, achieving a Matthew's Correlation Coefficient of up to 0.73 and the resulting change in ligand binding affinity with a Pearson's correlation of up to 0.77, with performances consistent across non-redundant blind tests. Through an in silico saturation mutagenesis, our tool has identified possibly emerging resistance mutations, which offers opportunities for in vivo experimental validation. We believe SUSPECT-ABL will be an important tool not just for improving precision medicine efforts, but for facilitating the development of next-generation inhibitors that are less prone to resistance. We have made our tool freely available at http://biosig.unimelb.edu.au/suspect_abl/., Competing Interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (© 2021 The Authors.)
- Published
- 2021
- Full Text
- View/download PDF
48. mmCSM-PPI: predicting the effects of multiple point mutations on protein-protein interactions.
- Author
-
Rodrigues CHM, Pires DEV, and Ascher DB
- Subjects
- Machine Learning, Mutation, Missense, Point Mutation, Protein Interaction Mapping methods, Software
- Abstract
Protein-protein interactions play a crucial role in all cellular functions and biological processes and mutations leading to their disruption are enriched in many diseases. While a number of computational methods to assess the effects of variants on protein-protein binding affinity have been proposed, they are in general limited to the analysis of single point mutations and have been shown to perform poorly on independent test sets. Here, we present mmCSM-PPI, a scalable and effective machine learning model for accurately assessing changes in protein-protein binding affinity caused by single and multiple missense mutations. We expanded our well-established graph-based signatures in order to capture physicochemical and geometrical properties of multiple wild-type residue environments and integrated them with substitution scores and dynamics terms from normal mode analysis. mmCSM-PPI was able to achieve a Pearson's correlation of up to 0.75 (RMSE = 1.64 kcal/mol) under 10-fold cross-validation and 0.70 (RMSE = 2.06 kcal/mol) on a non-redundant blind test, outperforming existing methods. Our method is freely available as a user-friendly and easy-to-use web server and API at http://biosig.unimelb.edu.au/mmcsm_ppi., (© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2021
- Full Text
- View/download PDF
49. MTR3D: identifying regions within protein tertiary structures under purifying selection.
- Author
-
Silk M, Pires DEV, Rodrigues CHM, D'Souza EN, Olshansky M, Thorne N, and Ascher DB
- Subjects
- Genomics, Humans, Neoplasms genetics, Structural Homology, Protein, Mutation, Missense, Protein Structure, Tertiary genetics, Software
- Abstract
The identification of disease-causal variants is non-trivial. By mapping population variation from over 448,000 exome and genome sequences to over 81,000 experimental structures and homology models of the human proteome, we have calculated both regional intolerance to missense variation (Missense Tolerance Ratio, MTR), using a sliding window of 21-41 codons, and introduce a new 3D spatial intolerance to missense variation score (3D Missense Tolerance Ratio, MTR3D), using spheres of 5-8 Å. We show that the MTR3D is less biased by regions with limited data and more accurately identifies regions under purifying selection than estimates relying on the sequence alone. Intolerant regions were highly enriched for both ClinVar pathogenic and COSMIC somatic missense variants (Mann-Whitney U test P < 2.2 × 10-16). Further, we combine sequence- and spatial-based scores to generate a consensus score, MTRX, which distinguishes pathogenic from benign variants more accurately than either score separately (AUC = 0.85). The MTR3D server enables easy visualisation of population variation, MTR, MTR3D and MTRX scores across the entire gene and protein structure for >17,000 human genes and >42,000 alternative alternate transcripts, including both Ensembl and RefSeq transcripts. MTR3D is freely available by user-friendly web-interface and API at http://biosig.unimelb.edu.au/mtr3d/., (© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2021
- Full Text
- View/download PDF
50. Author Correction: Exploring the structural distribution of genetic variation in SARS-CoV-2 with the COVID-3D online resource.
- Author
-
Portelli S, Olshansky M, Rodrigues CHM, D'Souza EN, Myung Y, Silk M, Alavi A, Pires DEV, and Ascher DB
- Published
- 2021
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.