Analyzing Offloading Inefficiencies in Scalable Heterogeneous Applications

Robert Dietrich; Ronny Tschüter; Guido Juckeland; Andreas Knüpfer

doi:10.1007/978-3-319-67630-2_34

Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review

Contributors

Robert Dietrich, Center for Information Services and High Performance Computing (ZIH) (Author)
Ronny Tschüter, Center for Information Services and High Performance Computing (ZIH) (Author)
Guido Juckeland, Helmholtz-Zentrum Dresden-Rossendorf (Author)
Andreas Knüpfer, Center for Information Services and High Performance Computing (ZIH) (Author)

Abstract

With the rise of accelerators in high performance computing, programming models for the development of heterogeneous applications have evolved and are continuously being improved to increase program performance and programmer productivity. The concept of computation offloading to massively parallel compute devices has established itself as a new layer of parallelism in scientific applications, next to message passing and multi-threading. To optimize the execution of such a parallel heterogeneous program for a specific platform, performance analysis is crucial. This work abstracts from specific offloading APIs, such as those available with CUDA, OpenCL, OpenACC, and OpenMP, and summarizes common inefficiencies for offloading. Based on the definition of inefficiency patterns, the offloading concept can be included in generic analysis techniques such as critical-path and root-cause analysis. We implemented the detection and evaluation of inefficiency patterns as a post-mortem trace analysis, which ultimately highlights program activities with a high potential to reduce the total program runtime.

Details

Original language	English
Title of host publication	High Performance Computing
Editors	Julian M. Kunkel, Rio Yokota, Michela Taufer, John Shalf
Place of Publication	Cham
Publisher	Springer International Publishing AG
Pages	457-476
Number of pages	20
ISBN (electronic)	978-3-319-67630-2
ISBN (print)	978-3-319-67629-6
Publication status	Published - 2017
Peer-reviewed	Yes

Publication series

Series	Lecture Notes in Computer Science
Volume	10524
ISSN	0302-9743

External IDs

Scopus	85032658602

Research Portal of the TU Dresden
