학술논문

Scalable high-quality 1D partitioning
Document Type
Conference
Source
2014 International Conference on High Performance Computing & Simulation (HPCS) High Performance Computing & Simulation (HPCS), 2014 International Conference on. :112-119 Jul, 2014
Subject
Bioengineering
Communication, Networking and Broadcast Technologies
Computing and Processing
Signal Processing and Analysis
Partitioning algorithms
Vectors
Heuristic algorithms
Probes
Runtime
Benchmark testing
Atmospheric modeling
High performance computing
Dynamic load balancing
One-dimensional partitioning
Hierarchical partitioning
Scalability
Language
Abstract
The decomposition of one-dimensional workload arrays into consecutive partitions is a core problem of many load balancing methods, especially those based on space-filling curves. While previous work has shown that heuristics can be parallelized, only sequential algorithms exist for the optimal solution. However, centralized partitioning will become infeasible in the exascale era due to the vast amount of tasks to be mapped to millions of processors. In this work, we first introduce optimizations to a published exact algorithm. Further, we investigate a hierarchical approach which combines a parallel heuristic and an exact algorithm to form a scalable and high-quality 1D partitioning algorithm. We compare load balance, execution time, and task migration of the algorithms for up to 262 144 processes using real-life workload data. The results show a 300 times speed-up compared to an existing fast exact algorithm, while achieving nearly the optimal load balance.