An Experimental Setup to Evaluate RAPL Energy Counters for Heterogeneous Memory
Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/Gutachten › Beitrag in Konferenzband › Beigetragen › Begutachtung
Beitragende
Abstract
Power consumption of the main memory in modern heterogeneous high-performance computing (HPC) constitutes a significant part of the total power consumption of a node. This motivates energyefficient solutions targeting the memory domain as well. Practitioners need reliable energy measurement techniques for analyzing energy and power consumption of applications and performance optimizations. Running Average Power Limit (RAPL) is a common choice, as it provides uncomplicated access to the energy measurements. While RAPL's accuracy has been studied and validated on homogeneous memory platforms, no work we are aware of investigated its accuracy on heterogeneous memory platforms, specifically with high-capacity memory (HCM). This paper describes the process of measuring the memory power consumption externally using riser cards in detail. We validate RAPL's accuracy by comparing results obtained from Intel's Ice Lake-SP system equipped with DDR4 DRAM and Intel Optane Persistent Memory Modules (PMM). In addition, we verify the accuracy of our instrumentation setup by comparing the results from an older Broadwell system with the results in the literature. We show that the RAPL values on a heterogeneous memory system report a higher offset from the reference measurements. The difference is more pronounced at lower memory load for all memory types. Also, we find that RAPL readings are inconsistent between multiple sockets and over time. Based on the evaluated scenarios, we conclude that RAPL overestimates the actual power consumption on heterogeneous memory systems and provide a discussion on the possible causes of this effect.
Details
Originalsprache | Englisch |
---|---|
Titel | ICPE 2024 - Proceedings of the 15th ACM/SPEC International Conference on Performance Engineering |
Herausgeber (Verlag) | Association for Computing Machinery, Inc |
Seiten | 71-82 |
Seitenumfang | 12 |
ISBN (elektronisch) | 9798400704444 |
Publikationsstatus | Veröffentlicht - 7 Mai 2024 |
Peer-Review-Status | Ja |
Publikationsreihe
Reihe | ICPE: ACM/SPEC International Conference on Performance Engineering |
---|
Konferenz
Titel | 15th ACM/SPEC International Conference on Performance Engineering |
---|---|
Kurztitel | ICPE 2024 |
Veranstaltungsnummer | 15 |
Dauer | 7 - 11 Mai 2024 |
Webseite | |
Bekanntheitsgrad | Internationale Veranstaltung |
Ort | Imperial College London |
Stadt | London |
Land | Großbritannien/Vereinigtes Königreich |
Externe IDs
ORCID | /0000-0002-5437-3887/work/167217166 |
---|
Schlagworte
Ziele für nachhaltige Entwicklung
ASJC Scopus Sachgebiete
Schlagwörter
- Energy Efficiency, Heterogeneous Memory, HPC, Intel Optane Persistent Memory, Running Average Power Limit