Back to Search Start Over

Dynameomics: design of a computational lab workflow and scientific data repository for protein simulations.

Authors :
Simms AM
Toofanny RD
Kehl C
Benson NC
Daggett V
Source :
Protein engineering, design & selection : PEDS [Protein Eng Des Sel] 2008 Jun; Vol. 21 (6), pp. 369-77. Date of Electronic Publication: 2008 Apr 14.
Publication Year :
2008

Abstract

Dynameomics is a project to investigate and catalog the native-state dynamics and thermal unfolding pathways of representatives of all protein folds using solvated molecular dynamics simulations, as described in the preceding paper. Here we introduce the design of the molecular dynamics data warehouse, a scalable, reliable repository that houses simulation data that vastly simplifies management and access. In the succeeding paper, we describe the development of a complementary multidimensional database. A single protein unfolding or native-state simulation can take weeks to months to complete, and produces gigabytes of coordinate and analysis data. Mining information from over 3000 completed simulations is complicated and time-consuming. Even the simplest queries involve writing intricate programs that must be built from low-level file system access primitives and include significant logic to correctly locate and parse data of interest. As a result, programs to answer questions that require data from hundreds of simulations are very difficult to write. Thus, organization and access to simulation data have been major obstacles to the discovery of new knowledge in the Dynameomics project. This repository is used internally and is the foundation of the Dynameomics portal site http://www.dynameomics.org. By organizing simulation data into a scalable, manageable and accessible form, we can begin to address substantial questions that move us closer to solving biomedical and bioengineering problems.

Details

Language :
English
ISSN :
1741-0126
Volume :
21
Issue :
6
Database :
MEDLINE
Journal :
Protein engineering, design & selection : PEDS
Publication Type :
Academic Journal
Accession number :
18411223
Full Text :
https://doi.org/10.1093/protein/gzn012