Academic Paper

Towards high performance paged memory for GPUs

Document Type

Conference

Author

Zheng, Tianhao; Nellans, David; Zulfiqar, Arslan; Stephenson, Mark; Keckler, Stephen W.

Source

2016 IEEE International Symposium on High Performance Computer Architecture (HPCA), pp. 345-357, Mar. 2016

Subject

Computing and Processing
Graphics processing units
Kernel
Bandwidth
Runtime
Memory management
Programming

ISSN

2378-203X

Abstract

Despite industrial investment in both on-die GPUs and next generation interconnects, the highest performing parallel accelerators shipping today continue to be discrete GPUs. Connected via PCIe, these GPUs utilize their own privately managed physical memory that is optimized for high bandwidth. These separate memories force GPU programmers to manage the movement of data between the CPU and GPU, in addition to the on-chip GPU memory hierarchy. To simplify this process, GPU vendors are developing software runtimes that automatically page memory in and out of the GPU on-demand, reducing programmer effort and enabling computation across datasets that exceed the GPU memory capacity. Because this memory migration occurs over a high latency and low bandwidth link (compared to GPU memory), these software runtimes may result in significant performance penalties. In this work, we explore the features needed in GPU hardware and software to close the performance gap of GPU paged memory versus legacy programmer directed memory management. Without modifying the GPU execution pipeline, we show it is possible to largely hide the performance overheads of GPU paged memory, converting an average 2x slowdown into a 12% speedup when compared to programmer directed transfers. Additionally, we examine the performance impact that GPU memory oversubscription has on application run times, enabling application designers to make informed decisions on how to shard their datasets across hosts and GPU instances.
