학술논문

Improving Locality of Irregular Updates with Hardware Assisted Propagation Blocking
Document Type
Conference
Source
2022 IEEE International Symposium on High-Performance Computer Architecture (HPCA) HPCA High-Performance Computer Architecture (HPCA), 2022 IEEE International Symposium on. :543-557 Apr, 2022
Subject
Components, Circuits, Devices and Systems
Computing and Processing
Program processors
Multicore processing
Computer architecture
Linear algebra
Performance gain
Parallel processing
Hardware
caches
locality optimization
graph analytics
sparse linear algebra
irregular workloads
Language
ISSN
2378-203X
Abstract
Many application domains perform irregular memory updates. Irregular accesses lead to inefficient use of conventional cache hierarchies. To make better use of the cache, we focus on Propagation Blocking (PB), a software-based cache locality optimization initially designed for graph processing applications. We make two contributions in this work. First, we show that PB generalizes beyond graph processing applications to any application with unordered parallelism and irregular memory updates. Second, we identify the inefficiencies of a PB execution on conventional multicore processors and propose architecture support to further improve the performance gains from PB. Our proposed architecture, COBRA, optimizes the PB execution of a range of applications with irregular memory updates, offering speedups of up to 3.78x compared to PB (1.74x on average).