A new bioinformatic approach to detect common 3D sites in protein structures
- Source :
- Proteins: Structure, Function, and Bioinformatics. 52:137-145
- Publication Year :
- 2003
- Publisher :
- Wiley, 2003.
Abstract
- An innovative bioinformatic method has been designed and implemented to detect similar three-dimensional (3D) sites in proteins. The approach compares protein structures or substructures and detects local spatial similarities; it is completely independent of the amino acid sequence and of the backbone structure. In contrast to existing tools, the method represents the protein structure as a set of stereochemical groups that are defined independently of the notion of amino acid. An efficient heuristic for finding similarities has been developed; it uses graphs of triangles of chemical groups to represent the protein structures. This heuristic is implemented in a software package named SuMo (Surfing the Molecules), which allows the dynamic definition of chemical groups, the selection of sites in the proteins, and the management and screening of databases. To show the relevance of this approach, we focused on two extreme examples illustrating convergent and divergent evolution. In two unrelated serine proteases, SuMo detects one common site, which corresponds to the catalytic triad. In the legume lectin family, composed of >100 structures that share similar sequences and folds but may have lost their ability to bind a carbohydrate molecule, SuMo discriminates between functional and non-functional lectins with a selectivity of 96%. Searching for a given site in a protein structure typically takes 0.1 s on a PIII 800 MHz/Linux computer; thus, in further studies, SuMo will be used to screen the PDB.
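The idea of comparing sites through triangles of chemical groups can be illustrated with a minimal sketch. This is not SuMo's implementation: the input format (a group type plus 3D coordinates), the function names, the group labels, and the `max_edge` and `tol` thresholds are all assumptions made for illustration. The sketch covers only the matching of individual triangles; it does not build the graph that links neighboring triangles, and it ignores chirality.

```python
from itertools import combinations
from math import dist  # Python 3.8+


def triangle_descriptors(groups, max_edge=8.0):
    """Describe every triangle of groups whose sides are all <= max_edge.

    `groups` is a list of (group_type, (x, y, z)) tuples (hypothetical format).
    Each descriptor pairs a vertex type with the length of the opposite side,
    sorted so that it does not depend on the input order of the groups.
    """
    descriptors = []
    for (ta, pa), (tb, pb), (tc, pc) in combinations(groups, 3):
        ab, bc, ca = dist(pa, pb), dist(pb, pc), dist(pc, pa)
        if max(ab, bc, ca) > max_edge:
            continue  # skip triangles that span too much space
        descriptors.append(tuple(sorted([(ta, bc), (tb, ca), (tc, ab)])))
    return descriptors


def similar(d1, d2, tol=1.0):
    """Two triangles match if types agree and side lengths differ by <= tol."""
    return all(
        t1 == t2 and abs(s1 - s2) <= tol
        for (t1, s1), (t2, s2) in zip(d1, d2)
    )


def common_triangles(groups_a, groups_b, max_edge=8.0, tol=1.0):
    """Return all pairs of matching triangles between two sets of groups."""
    tri_a = triangle_descriptors(groups_a, max_edge)
    tri_b = triangle_descriptors(groups_b, max_edge)
    return [(x, y) for x in tri_a for y in tri_b if similar(x, y, tol)]


# Toy data (invented coordinates, in angstroms): the same three groups appear
# in both sites, in a different order and at a different position.
site_a = [
    ("hydroxyl", (0.0, 0.0, 0.0)),
    ("imidazole", (3.0, 0.0, 0.0)),
    ("carboxylate", (3.0, 4.0, 0.0)),
]
site_b = [
    ("carboxylate", (13.0, 14.0, 10.0)),
    ("hydroxyl", (10.0, 10.0, 10.0)),
    ("imidazole", (13.0, 10.0, 10.0)),
    ("amide", (30.0, 30.0, 30.0)),  # too far away to form a triangle
]

matches = common_triangles(site_a, site_b)
```

Because the descriptors use only group types and inter-group distances, the comparison does not depend on residue names, sequence order, or backbone geometry, which is the property the abstract emphasizes.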
- Subjects :
- chemistry.chemical_classification
0303 health sciences
Heuristic
030302 biochemistry & molecular biology
Protein Data Bank (RCSB PDB)
Computational biology
Biology
Biochemistry
Amino acid
Set (abstract data type)
03 medical and health sciences
Protein structure
chemistry
Structural Biology
Catalytic triad
Representation (mathematics)
Molecular Biology
Peptide sequence
030304 developmental biology
Details
- ISSN :
- 08873585
- Volume :
- 52
- Database :
- OpenAIRE
- Journal :
- Proteins: Structure, Function, and Bioinformatics
- Accession number :
- edsair.doi...........92376d0d42d24138ae1651e7d7876402
- Full Text :
- https://doi.org/10.1002/prot.10339