학술논문

Optimizing Post-Copy Live Migration with System-Level Checkpoint Using Fabric-Attached Memory
Document Type
Conference
Source
2019 IEEE/ACM Workshop on Memory Centric High Performance Computing (MCHPC) MCHPC Memory Centric High Performance Computing (MCHPC), 2019 IEEE/ACM Workshop on. :16-24 Nov, 2019
Subject
Computing and Processing
Language
Abstract
Emerging Non-Volatile Memories have byte- addressability and low latency, close to the latency of main memory, together with the non-volatility of storage devices. Similarly, recently emerging interconnect fabrics, such as Gen- Z, provide high bandwidth, together with exceptionally low latency. These concurrently emerging technologies are making possible new system architectures in the data centers including systems with Fabric-Attached Memories (FAMs). FAMs can serve to create scalable, high-bandwidth, distributed, shared, byte- addressable, and non-volatile memory pools at a rack scale, opening up new usage models and opportunities. Based on these attractive properties, in this paper we pro- pose FAM-aware, checkpoint-based, post-copy live migration mechanism to improve the performance of migration. We have implemented our prototype with a Linux open source checkpoint tool, CRIU (Checkpoint/Restore In Userspace). According to our evaluation results, compared to the existing solution, our FAM- aware post-copy can improve at least 15% the total migration time, at least 33% the busy time, and can let the migrated application perform at least 12% better during migration.