Performance Measurement for the OpenMP 4.0 Offloading Model - Lecture Notes in Computer Science

Publikation: Beitrag zu KonferenzenPaperBeigetragenBegutachtung

Beitragende

  • R. Dietrich - (Autor:in)
  • F. Schmitt - (Autor:in)
  • A. Grund - (Autor:in)
  • D. Schmidl - (Autor:in)

Abstract

OpenMP is one of the most widely used standards for enabling thread-level parallelism in high performance computing codes. The recently released version 4.0 of the specification introduces directives that enable application developers to offload portions of the computation to massively-parallel target devices. However, to efficiently utilize these devices, sophisticated performance analysis tools are required. The emerging OpenMP Tools Interface (OMPT) aids the development of portable tools, but currently lacks the support for OpenMP 4.0 target directives. This paper presents a novel approach to measure the performance of applications utilizing OpenMP offloading. It introduces libmpti, an OMPT-based measurement library for Intel MIC target devices. For host-side analysis we extended the OPARI2 instrumenter and prototypically integrated the complete approach into the state-of-the-art tool infrastructure Score-P. We demonstrate the effectiveness of the presented method and implementation with a Conjugate-Gradient (CG) kernel on an Intel Xeon Phi coprocessor. Finally, we visualize the obtained performance data with Vampir.

Details

OriginalspracheEnglisch
Seiten291-301
Seitenumfang11
PublikationsstatusVeröffentlicht - 2014
Peer-Review-StatusJa

Externe IDs

Scopus 84920092368

Schlagworte

Ziele für nachhaltige Entwicklung

Schlagwörter

  • performance analysis, offloading, OpenMP 4.0, Intel MIC, Score-P