Non-intrusive Performance Analysis of Parallel Hardware Accelerated Applications on Hybrid Architectures
Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/Gutachten › Beitrag in Konferenzband › Beigetragen › Begutachtung
Beitragende
Abstract
New high performance computing (HPC) applications recently have to face scalability over an increasing number of nodes and the programming of special accelerator hardware. Hybrid composition of large computing systems leads to a new dimension in complexity of software development. This paper presents a novel approach to gain insight into accelerator interaction and utilization without any changes to the application. It leverages well established methods for performance analysis to accelerator hardware, allowing a holistic view on performance bottlenecks of hybrid applications. A general strategy is presented to get dynamic runtime information about hybrid program execution with minimal impact on the program ???ow. The achievable level of detail is exemplarily studied for the CUDA environment and the OpenCL framework. Combined with existing performance analysis techniques this facilitates obtaining the full potential of hybrid computing power.
Details
Originalsprache | Englisch |
---|---|
Titel | 2010 39th International Conference on Parallel Processing Workshops |
Herausgeber (Verlag) | Wiley-IEEE Press |
Seiten | 135-143 |
Seitenumfang | 8 |
Publikationsstatus | Veröffentlicht - 1 Sept. 2010 |
Peer-Review-Status | Ja |
Externe IDs
Scopus | 78649832365 |
---|---|
ORCID | /0000-0002-5437-3887/work/154740514 |