Bridging the Gap between Application Performance Analysis and System Monitoring

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/GutachtenBeitrag in KonferenzbandBeigetragenBegutachtung

Abstract

Performance analysis has a long history in the high-performance computing community. On the one hand, the traditional application analysis focuses on scalable yet detailed instrumentation of parallel execution. On the other hand, per-node or cluster-wide monitoring solutions are used in data center operation. However, performance anomalies resulting from the interaction between applications, background processes and the operating system, are difficult to analyze with tools that reveal only part of the issue. In this paper, we present a novel approach that covers all aspects of individual nodes. We extend a monitoring tool to combine call stack sampling, process monitoring, and syscall recording into a symbiotic view of the application execution, background activity, and the operating system.

Details

OriginalspracheEnglisch
TitelProceedings - 2022 IEEE International Conference on Cluster Computing, CLUSTER 2022
Herausgeber (Verlag)IEEE Xplore
Seiten611-615
Seitenumfang5
ISBN (elektronisch)9781665498562
ISBN (Print)978-1-6654-9857-9
PublikationsstatusVeröffentlicht - 8 Sept. 2022
Peer-Review-StatusJa

Konferenz

Titel2022 IEEE International Conference on Cluster Computing
KurztitelCLUSTER 2022
Dauer6 - 8 September 2022
Webseite
BekanntheitsgradInternationale Veranstaltung
OrtRuprecht-Karls-Universität Heidelberg
StadtHeidelberg
LandDeutschland

Externe IDs

Scopus 85140931014
Mendeley b6698516-5a65-3643-8955-c54d2f3a7bfc
ORCID /0000-0002-5437-3887/work/154740526

Schlagworte

Schlagwörter

  • Symbiosis, Process monitoring, Data centers, Operating systems, Instruments, High performance computing, Cluster computing, system monitoring, performance analysis, performance tools