KOR

e-Article

An extensible, portable, scalable cluster management software architecture
Document Type
Conference
Source
Proceedings. IEEE International Conference on Cluster Computing Cluster computing Cluster Computing, 2002. Proceedings. 2002 IEEE International Conference on. :287-295 2002
Subject
Computing and Processing
Communication, Networking and Broadcast Technologies
Software architecture
Network topology
Databases
Computer architecture
Laboratories
Scalability
Hardware
Application software
Large-scale systems
Production
Language
Abstract
This paper describes an object-oriented software architecture for cluster integration and management that enables extensibility, portability, and scalability. This architecture has been successfully implemented and deployed on several large-scale production clusters at Sandia National Laboratories, the largest of which is currently 1861 nodes. This paper discusses the key features of the architecture that allow for easily extending the range of supported hardware devices and network topologies. We also describe in detail how the object-oriented structure that represents the hardware components can be used to implement scalable and portable cluster management tools.