1. GrayWulf: Scalable Software Architecture for Data Intensive Computing
- Author
-
László Dobos, Roger Barga, Michael Shipway, Yogesh Simmhan, Sue Werner, Maria Nieto-Santisteban, Nolan Li, Catharine van Ingen, J. N. Heasley, and Alexander S. Szalay
- Subjects
Software ,Resource-oriented architecture ,business.industry ,Software deployment ,Computer science ,Distributed computing ,Server ,Scalability ,Big data ,Data-intensive computing ,Petabyte ,business ,Software architecture - Abstract
Big data presents new challenges to both cluster infrastructure software and parallel application design. We present a set of software services and design principles for data intensive computing with petabyte data sets, named GrayWulf . These services are intended for deployment on a cluster of commodity servers similar to the well-known Beowulf clusters. We use the Pan-STARRS system currently under development as an example of the architecture and principles in action.
- Published
- 2009
- Full Text
- View/download PDF