Scalable high-quality 1D partitioning
Publikation: Beitrag zu Konferenzen › Paper › Beigetragen › Begutachtung
Beitragende
Abstract
The decomposition of one-dimensional workload arrays into consecutive partitions is a core problem of many load balancing methods, especially those based on space-filling curves. While previous work has shown that heuristics can be parallelized, only sequential algorithms exist for the optimal solution. However, centralized partitioning will become infeasible in the exascale era due to the vast amount of tasks to be mapped to millions of processors. In this work, we first introduce optimizations to a published exact algorithm. Further, we investigate a hierarchical approach which combines a parallel heuristic and an exact algorithm to form a scalable and high-quality 1D partitioning algorithm. We compare load balance, execution time, and task migration of the algorithms for up to 262 144 processes using real-life workload data. The results show a 300 times speed-up compared to an existing fast exact algorithm, while achieving nearly the optimal load balance.
Details
Originalsprache | Englisch |
---|---|
Seiten | 112-119 |
Seitenumfang | 8 |
Publikationsstatus | Veröffentlicht - 2014 |
Peer-Review-Status | Ja |
Konferenz
Titel | International Conference on High Performance Computing & Simulation (HPCS) |
---|---|
Dauer | 21 - 25 Juli 2014 |
Stadt | Bologna |
Land | Italien |
Externe IDs
Scopus | 84908655384 |
---|---|
ORCID | /0000-0003-3137-0648/work/142238862 |
Schlagworte
Schlagwörter
- parallel algorithms, resource allocation