학술논문

Grid warehousing of molecular dynamics protein unfolding data
Document Type
Conference
Source
CCGrid 2005. IEEE International Symposium on Cluster Computing and the Grid, 2005. Cluster Computing and the Grid Cluster Computing and the Grid, 2005. CCGrid 2005. IEEE International Symposium on. 1:496-503 Vol. 1 2005
Subject
Computing and Processing
Communication, Networking and Broadcast Technologies
Warehousing
Proteins
Computational modeling
Analytical models
Data warehouses
Explosions
Genomics
Bioinformatics
Computer simulation
Grid computing
Language
Abstract
With the increasing awareness of protein folding disorders, the explosion of genomic information, and the need for efficient ways to predict protein structure, protein folding and unfolding has become a central issue in molecular sciences research. Molecular dynamics computer simulations are increasingly employed to understand the folding and unfolding of proteins. Running protein unfolding simulations is computationally expensive and finding ways to enhance performance is a grid issue on its own. However, more and more groups run such simulations and generate a myriad of data, which raises new challenges in managing and analyzing these data. Because the vast range of proteins researchers want to study and simulate, the computational effort needed to generate data, the large data volumes involved, and the different types of analyses scientists need to perform, it is desirable to provide a public repository allowing researchers to pool and share protein unfolding data. This paper describes efforts to provide a grid-enabled data warehouse for protein unfolding data. We outline the challenge and present first results in the design and implementation of the data warehouse.