Auto-Vectorization Techniques for Modern SIMD Architectures

Research output: Contribution to book/Conference proceedings/Anthology/Report, Conference contribution, Contributed, peer-review

Abstract

The rapid pace of SIMD hardware development promises steady performance gains for vectorized code. However, more and wider vector registers, together with new instructions, require continual adjustment of auto-vectorization techniques in order to avoid obstacles and to exploit the new capabilities to full effect. This paper introduces approaches that target particular features of modern and upcoming SIMD architectures. All presented approaches were recently implemented in our source-to-source vectorizer Scout. They address issues encountered during the vectorization of production codes, mainly from the CFD domain. Performance measurements show considerable speedups of codes auto-vectorized by Scout over conventionally vectorized codes. Programs can thus benefit even more from modern SIMD hardware when auto-vectorization techniques are enhanced with the introduced methods.

Details

Original language: English
Title of host publication: Proceedings of the 16th Workshop on Compilers for Parallel Computing, Padova, Italy (January 2012)
Number of pages: 11
Publication status: Published - 2012
Peer-reviewed: Yes

Workshop

Title: 16th Workshop on Compilers for Parallel Computing
Abbreviated title: CPC 2012
Duration: 11 - 13 January 2012
Degree of recognition: International event
Location: University of Padova, Palazzo del Bo
City: Padova
Country: Italy

External IDs

ORCID /0009-0009-9342-629X/work/194088087
ORCID /0000-0001-8719-5741/work/194256556

Keywords

  • Auto-Vectorization, SIMD, Scout, Pragma, CFD