Optimizing OpenMP Parallelized DGEMM Calls on SGI Altix 3700

Publication: Contribution to conference, Paper, Contributed, Peer-reviewed

Abstract

Calling functions from parallelized mathematical libraries is a common way to accelerate numerical applications. Shared-memory architectures support different approaches to implementing such libraries, usually OpenMP or MPI.
This paper compares the performance of DGEMM calls (double-precision floating-point matrix multiplication) across different OpenMP-parallelized numerical libraries, namely Intel MKL and SGI SCSL, and shows how these calls can be optimized. Additionally, we examine the memory placement policy and give hints for initializing data. Our measurements focus on an SGI Altix 3700 Bx2 system, using BenchIT [1] as a convenient performance measurement suite.
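
The interplay between data initialization and memory placement mentioned in the abstract can be illustrated with a minimal C sketch. It assumes a first-touch placement policy, under which a memory page is placed on the NUMA node of the thread that first writes to it; the abstract does not name the policy, so this is an assumption. The helper names `alloc_first_touch` and `naive_dgemm` are hypothetical, and the triple loop is only a self-contained stand-in for a library DGEMM call, not the MKL or SCSL routine and not the paper's benchmark code.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical helper: allocate an n x n row-major matrix and fill it in
 * parallel. Assuming a first-touch policy, each page ends up on the NUMA
 * node of the thread that writes it first, so the initialization loop uses
 * the same static row distribution as the compute loop below. */
double *alloc_first_touch(size_t n, double value)
{
    assert(n > 0);
    double *m = malloc(n * n * sizeof *m);
    if (m == NULL)
        return NULL;

    #pragma omp parallel for schedule(static)
    for (long i = 0; i < (long)n; ++i)
        for (size_t j = 0; j < n; ++j)
            m[(size_t)i * n + j] = value;

    return m;
}

/* Naive stand-in for DGEMM on square row-major matrices:
 * C = alpha * A * B + beta * C. Rows of C are distributed over threads
 * with the same static schedule used during initialization. */
void naive_dgemm(size_t n, double alpha, const double *a, const double *b,
                 double beta, double *c)
{
    #pragma omp parallel for schedule(static)
    for (long i = 0; i < (long)n; ++i) {
        for (size_t j = 0; j < n; ++j) {
            double sum = 0.0;
            for (size_t k = 0; k < n; ++k)
                sum += a[(size_t)i * n + k] * b[k * n + j];
            c[(size_t)i * n + j] = alpha * sum + beta * c[(size_t)i * n + j];
        }
    }
}
```

Compiled without OpenMP support, the pragmas are ignored and the code runs serially with the same result. With OpenMP enabled, matching the schedules means each thread mostly works on rows of A and C that it initialized itself; a serial initialization would instead place all pages on one node under the assumed policy.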

Details

Original language: English
Pages: 145-154
Number of pages: 10
Publication status: Published - 2006
Peer-review status: Yes

Conference

Title: European Conference on Parallel Processing
Short title: EuroPar 2006
Event number: 12
Duration: 28 August - 1 September 2006
Degree of recognition: International event
City: Dresden
Country: Germany

External IDs

Scopus 33750021165
ORCID /0000-0002-8491-770X/work/141543259
ORCID /0009-0003-0666-4166/work/151475555