Optimizing OpenMP Parallelized DGEMM Calls on SGI Altix 3700
Research output: Contribution to conferences › Paper › Contributed › peer-review
Contributors
Abstract
Using functions of parallelized mathematical libraries is a common way to accelerate numerical applications. Computer architectures with shared memory characteristics support different approaches for the implementation of such libraries, usually OpenMP or MPI.
This paper’s content is based on the performance comparison of DGEMM calls (floating point matrix multiplication, double precision) with different OpenMP parallelized numerical libraries, namely Intel MKL and SGI SCSL, and how they can be optimized. Additionally, we have a look at the memory placement policy and give hints for initializing data. Our attention has been focused on a SGI Altix 3700 Bx2 system using BenchIT [1] as a very convenient performance measurement suite for the examinations.
This paper’s content is based on the performance comparison of DGEMM calls (floating point matrix multiplication, double precision) with different OpenMP parallelized numerical libraries, namely Intel MKL and SGI SCSL, and how they can be optimized. Additionally, we have a look at the memory placement policy and give hints for initializing data. Our attention has been focused on a SGI Altix 3700 Bx2 system using BenchIT [1] as a very convenient performance measurement suite for the examinations.
Details
Original language | English |
---|---|
Pages | 145-154 |
Number of pages | 10 |
Publication status | Published - 2006 |
Peer-reviewed | Yes |
Conference
Title | European Conference on Parallel Processing |
---|---|
Abbreviated title | EuroPar 2006 |
Conference number | 12 |
Duration | 28 August - 1 September 2006 |
Website | |
Degree of recognition | International event |
City | Dresden |
Country | Germany |
External IDs
Scopus | 33750021165 |
---|---|
ORCID | /0000-0002-8491-770X/work/141543259 |
ORCID | /0009-0003-0666-4166/work/151475555 |