Polyhedral Compilation for Racetrack Memories

Publikation: Beitrag in FachzeitschriftForschungsartikelBeigetragenBegutachtung

Beitragende

Abstract

Traditional memory hierarchy designs, primarily based on SRAM and DRAM, become increasingly unsuitable to meet the performance, energy, bandwidth, and area requirements of modern embedded and high-performance computer systems. Racetrack memory (RTM), an emerging nonvolatile memory technology, promises to meet these conflicting demands by offering simultaneously high speed, higher density, and nonvolatility. RTM provides these efficiency gains by not providing immediate access to all storage locations, but by instead storing data sequentially in the equivalent to nanoscale tapes called tracks. Before any data can be accessed, explicit shift operations must be issued that cost energy and increase access latency. The result is a fundamental change in memory performance behavior: the address distance between subsequent memory accesses now has a linear effect on memory performance. While there are first techniques to optimize programs for linear-latency memories, such as RTM, existing automatic solutions treat only scalar memory accesses. This work presents the first automatic compilation framework that optimizes static loop programs over arrays for linear-latency memories. We extend the polyhedral compilation framework Polly to generate code that maximizes accesses to the same or consecutive locations, thereby minimizing the number of shifts. Our experimental results show that the optimized code incurs up to 85% fewer shifts (average 41%), improving both performance and energy consumption by an average of 17.9% and 39.8%, respectively. Our results show that automatic techniques make it possible to effectively program linear-latency memory architectures such as RTM.

Details

OriginalspracheEnglisch
Aufsatznummer9216560
Seiten (von - bis)3968-3980
Seitenumfang13
FachzeitschriftIEEE transactions on computer-aided design of integrated circuits and systems
Jahrgang39
Ausgabenummer11
PublikationsstatusVeröffentlicht - Nov. 2020
Peer-Review-StatusJa

Externe IDs

ORCID /0000-0002-5007-445X/work/141545526

Schlagworte

Forschungsprofillinien der TU Dresden

Ziele für nachhaltige Entwicklung

Schlagwörter

  • Compiler optimization, domain wall memory, layout transformation, loop transformation, polyhedral compilation, racetrack memory (RTM), shifts optimization, tensor contraction

Bibliotheksschlagworte