Generation of a consensus protein domain dictionary
- Source :
- Bioinformatics. 27:46-54
- Publication Year :
- 2010
- Publisher :
- Oxford University Press (OUP), 2010.
Abstract
- Motivation: The discovery of new protein folds is a relatively rare occurrence even as the rate of protein structure determination increases. This rarity reinforces the concept of folds as reusable units of structure and function shared by diverse proteins. If the folding mechanism of proteins is largely determined by their topology, then the folding pathways of members of existing folds could encompass the full set used by globular protein domains.
Results: We have used recent versions of three common protein domain dictionaries (SCOP, CATH and Dali) to generate a consensus domain dictionary (CDD). Surprisingly, 40% of the metafolds in the CDD are not composed of autonomous structural domains, i.e. they are not plausible independent folding units. This finding has serious ramifications for bioinformatics studies mining these domain dictionaries for globular protein properties. However, our main purpose in deriving this CDD was to generate an updated CDD to choose targets for MD simulation as part of our dynameomics effort, which aims to simulate the native and unfolding pathways of representatives of all globular protein consensus folds (metafolds). Consequently, we also compiled a list of representative protein targets of each metafold in the CDD.
Availability and implementation: This domain dictionary is available at www.dynameomics.org.
Contact: daggett@u.washington.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
- Subjects :
- Models, Molecular
Statistics and Probability
Protein Folding
Globular protein
Computer science
Protein domain
Dictionaries as Topic
Computational biology
Biochemistry
Domain (software engineering)
Protein structure
Molecular Biology
chemistry.chemical_classification
Molecular Sequence Annotation
Folding (DSP implementation)
Original Papers
Data science
Protein Structure, Tertiary
Computer Science Applications
Structure and function
Computational Mathematics
Computational Theory and Mathematics
chemistry
Details
- ISSN :
- 1367-4811 and 1367-4803
- Volume :
- 27
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....ac22f7258171367dbb36d1982e334433
- Full Text :
- https://doi.org/10.1093/bioinformatics/btq625