Combining global and local measures for structure-based druggability predictions
- Source :
- Journal of Chemical Information and Modeling. 52(2)
- Publication Year :
- 2011
- Abstract :
- Predicting druggability and prioritizing disease-modifying targets for the drug development process is of high practical relevance in pharmaceutical research. DoGSiteScorer is a fully automatic algorithm for pocket and druggability prediction. In addition to global properties of the pocket, it takes into account local similarities shared between pockets. Druggability scores are predicted by a support vector machine (SVM) that is trained and tested on the druggability data set (DD) and its nonredundant version (NRDD). The DD consists of 1069 targets assigned to druggable, difficult, and undruggable classes. In 90% of the NRDD, the SVM model based on global descriptors correctly classifies a target as either druggable or undruggable. Nevertheless, global properties are sensitive to binding site changes due to ligand binding and to the pocket boundary definition. Therefore, local pocket properties are additionally investigated by means of a nearest neighbor search. Local similarities are described by distance-dependent histograms between atom pairs. In 88% of the DD pocket set, the nearest neighbor and the structure itself agree in their druggability type. A discriminating feature between druggable and undruggable pockets is the presence of fewer short-range hydrophilic-hydrophilic pairs and more short-range lipophilic-lipophilic pairs. Our findings for global pocket descriptors coincide with previously published methods, affirming that size, shape, and hydrophobicity are important global pocket descriptors for automatic druggability prediction. Nevertheless, the variety of pocket shapes and their flexibility upon ligand binding limit the automatic projection of druggable features onto descriptors. Incorporating local pocket properties is another step toward a reliable descriptor-based druggability prediction.
- Subjects :
- Binding Sites
Support Vector Machine
Computer science
General Chemical Engineering
Nearest neighbor search
Druggability
General Chemistry
Library and Information Sciences
Ligands
Computer Science Applications
k-nearest neighbors algorithm
Set (abstract data type)
Pharmaceutical Preparations
Drug Design
Drug Discovery
Feature (machine learning)
Structure based
Data mining
Projection (set theory)
Algorithms
Details
- ISSN :
- 1549-960X
- Volume :
- 52
- Issue :
- 2
- Database :
- OpenAIRE
- Journal :
- Journal of Chemical Information and Modeling
- Accession number :
- edsair.doi.dedup.....ac31df52d3cb0a185f7e0e0ae84c7f52