Back to Search
Start Over
PDB-scale analysis of known and putative ligand-binding sites with structural sketches
- Source :
- Proteins: Structure, Function, and Bioinformatics. 80:747-763
- Publication Year :
- 2011
- Publisher :
- Wiley, 2011.
-
Abstract
- Computational investigation of protein functions is one of the most urgent and demanding tasks in the field of structural bioinformatics. Exhaustive pairwise comparison of known and putative ligand-binding sites, across protein families and folds, is essential in elucidating the biological functions and evolutionary relationships of proteins. Given the vast amounts of data available now, existing 3D structural comparison methods are not adequate due to their computation time complexity. In this article, we propose a new bit string representation of binding sites called structural sketches, which is obtained by random projections of triplet descriptors. It allows us to use ultra-fast all-pair similarity search methods for strings with strictly controlled error rates. Exhaustive comparison of 1.2 million known and putative binding sites finished in ∼30 h on a single core to yield 88 million similar binding site pairs. Careful investigation of 3.5 million pairs verified by TM-align revealed several notable analogous sites across distinct protein families or folds. In particular, we succeeded in finding highly plausible functions of several pockets via strong structural analogies. These results indicate that our method is a promising tool for functional annotation of binding sites derived from structural genomics projects.
- Subjects :
- Models, Molecular
Proteomics
Binding Sites
Protein family
Computer science
Nearest neighbor search
Protein Data Bank (RCSB PDB)
Proteins
Computational biology
Ligands
Biochemistry
Structural genomics
Structural bioinformatics
Structural Biology
Pairwise comparison
Binding site
Databases, Protein
Molecular Biology
Time complexity
Algorithm
Algorithms
Subjects
Details
- ISSN :
- 08873585
- Volume :
- 80
- Database :
- OpenAIRE
- Journal :
- Proteins: Structure, Function, and Bioinformatics
- Accession number :
- edsair.doi.dedup.....2d02d13eb32779dc4c9033705b0bdb8a