Alpaka - An Abstraction Library for Parallel Kernel Acceleration

Erik Zenker; Benjamin Worpitz; Rene Widera; Axel Huebl; Guido Juckeland; Andreas Knuepfer; Wolfgang E. Nagel; Michael Bussmann

doi:10.1109/IPDPSW.2016.50

Alpaka - An Abstraction Library for Parallel Kernel Acceleration

Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review

Contributors

Erik Zenker - , Chair of Automata Theory, Helmholtz-Zentrum Dresden-Rossendorf (HZDR) (Author)
Benjamin Worpitz - , Helmholtz-Zentrum Dresden-Rossendorf (HZDR), TUD Dresden University of Technology (Author)
Rene Widera - , Helmholtz-Zentrum Dresden-Rossendorf (HZDR), TUD Dresden University of Technology (Author)
Axel Huebl - , Faculty of Physics, Helmholtz-Zentrum Dresden-Rossendorf (HZDR) (Author)
Guido Juckeland - , Center for Information Services and High Performance Computing (ZIH), Helmholtz-Zentrum Dresden-Rossendorf (HZDR) (Author)
Andreas Knuepfer - , Center for Information Services and High Performance Computing (ZIH) (Author)
Wolfgang E. Nagel - , Center for Information Services and High Performance Computing (ZIH), Chair of Computer Architecture (Author)
Michael Bussmann - , Helmholtz-Zentrum Dresden-Rossendorf (HZDR) (Author)

Abstract

Porting applications to new hardware or programming models is a tedious and error prone process. Every help that eases these burdens is saving developer time that can then be invested into the advancement of the application itself instead of preserving the status-quo on a new platform. The Alpaka library defines and implements an abstract hierarchical redundant parallelism model. The model exploits parallelism and memory hierarchies on a node at all levels available in current hardware. By doing so, it allows to achieve platform and performance portability across various types of accelerators by ignoring specific unsupported levels and utilizing only the ones supported on a specific accelerator. All hardware types (multi-and many-core CPUs, GPUs and other accelerators) are supported for and can be programmed in the same way. The Alpaka C++ template interface allows for straightforward extension of the library to support other accelerators and specialization of its internals for optimization.

Running Alpaka applications on a new (and supported) platform requires the change of only one source code line instead of a lot of #ifdefs.

Details

Original language	English
Title of host publication	2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)
Pages	631-640
Number of pages	10
ISBN (electronic)	978-1-5090-3682-0
Publication status	Published - 2016
Peer-reviewed	Yes

Publication series

Series	2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)
ISSN	2164-7062

Conference

Title	30th IEEE International Parallel and Distributed Processing Symposium
Abbreviated title	IPDPS 2016
Conference number	30
Duration	23 - 27 May 2016
Website	https://www.ipdps.org/archives.htm
Location	Chicago Hyatt Regency
City	Chicago
Country	United States of America

External IDs

Scopus	84991727562

Keywords

Heterogeneous computing, HPC, C plus, CUDA, OpenMP, platform portability, performance portability

Research Portal of the TU Dresden

Contributors

Abstract

Details

Publication series

Conference

External IDs

Keywords

Keywords