Auto-Vectorization Techniques for Modern SIMD Architectures

Publication: Contribution to book/conference proceedings/anthology/report, Contribution to conference proceedings, Contributed, Peer-reviewed

Abstract

The rapid development of SIMD hardware promises steady performance gains for vectorized code. However, more and wider vector registers, as well as new instructions, require continual adjustment of auto-vectorization techniques in order to avoid obstacles and to exploit the provided capabilities to maximum effect. The paper introduces approaches that target particular features of modern and upcoming SIMD architectures. All presented approaches were recently implemented in our source-to-source vectorizer Scout. They address issues encountered during the vectorization of production codes, mainly from the CFD domain. Performance measurements show considerable speedups of codes auto-vectorized by Scout over conventionally vectorized codes. Thus, programs can benefit even more from modern SIMD hardware when auto-vectorization techniques are enhanced with the introduced methods.

Details

Original language: English
Title: Proceedings of the 16th Workshop on Compilers for Parallel Computing, Padova, Italy (January 2012)
Number of pages: 11
Publication status: Published - 2012
Peer-review status: Yes

Workshop

Title: 16th Workshop on Compilers for Parallel Computing
Short title: CPC 2012
Duration: 11 - 13 January 2012
Level of recognition: International event
Location: University of Padova, Palazzo del Bo
City: Padova
Country: Italy

External IDs

ORCID /0009-0009-9342-629X/work/194088087
ORCID /0000-0001-8719-5741/work/194256556

Keywords

  • Auto-Vectorization, SIMD, Scout, Pragma, CFD