Optimizing cache access: A tool for source-to-source transformations and real-life compiler tests

Publication type: Contribution to book/conference proceedings/anthology/report; contributed; peer-reviewed

Contributors

  • Ralph Müller-Pfefferkorn, Center for High Performance Computing (ZHR), Technische Universität Dresden (Author)
  • Wolfgang E. Nagel, Technische Universität Dresden (Author)
  • Bernd Trenkler, Technische Universität Dresden (Author)

Abstract

Loop transformations are well known as a useful tool for improving performance by optimizing cache access. Nevertheless, applying them automatically is a complex and challenging task, especially for parallel codes. Since the end of the 1980s, most compiler vendors have promised that these features will be implemented "in the next release". We tested current Fortran 90 compilers (on IBM, Intel and SGI hardware) for their capabilities in this field. This paper shows the results of our analysis. Motivated by this experience, we have developed the optimization environment Goofi to assist programmers in applying loop transformations to their code, thus gaining better performance for parallel codes even today.

Details

Original language: English
Title: Euro-Par 2004 Parallel Processing
Editors: Marco Danelutto, Marco Vanneschi, Domenico Laforenza
Publisher: Springer, Berlin [et al.]
Pages: 72-81
Number of pages: 10
ISBN (Print): 3540229248
Publication status: Published - 2004
Peer-reviewed: Yes
Externally published: Yes

Publication series

Series: Lecture Notes in Computer Science, Volume 3149
ISSN: 0302-9743
