학술논문

High performance fault-tolerance for clouds
Document Type
Conference
Source
2015 IEEE Symposium on Computers and Communication (ISCC) Computers and Communication (ISCC), 2015 IEEE Symposium on. :251-257 Jul, 2015
Subject
Communication, Networking and Broadcast Technologies
Computing and Processing
Servers
Fault tolerance
Fault tolerant systems
Virtual machine monitors
Hardware
Cloud computing
Computer architecture
cloud computing
fault-tolerance
high-performance
live-migration
resource consolidation
Language
Abstract
Cloud computing and virtualized infrastructures are currently the baseline environments for the provision of services in different application domains. While the number of service consumers increasingly grows, service providers aim at exploiting infrastructures that enable non-disruptive service provisioning, thus minimizing or even eliminating downtime. Nonetheless, to achieve the latter current approaches are either application-specific or cost inefficient, requiring the use of dedicated hardware. In this paper we present the reference architecture of a fault-tolerance scheme, which not only enhances cloud environments with the aforementioned capabilities but also achieves high-performance as required by mission critical every day applications. To realize the proposed approach, a new paradigm for memory and I/O externalization and consolidation is introduced, while current implementation references are also provided.