Bridging the Gap between Application Performance Analysis and System Monitoring

Research output: Contribution to book/conference proceedings/anthology/report > Conference contribution > Contributed > peer-review

Abstract

Performance analysis has a long history in the high-performance computing community. On the one hand, traditional application analysis focuses on scalable yet detailed instrumentation of parallel execution. On the other hand, per-node or cluster-wide monitoring solutions are used in data center operation. However, performance anomalies resulting from the interaction between applications, background processes, and the operating system are difficult to analyze with tools that reveal only part of the issue. In this paper, we present a novel approach that covers all aspects of individual nodes. We extend a monitoring tool to combine call stack sampling, process monitoring, and syscall recording into a symbiotic view of the application execution, background activity, and the operating system.

Details

Original language: English
Title of host publication: Proceedings - 2022 IEEE International Conference on Cluster Computing, CLUSTER 2022
Publisher: IEEE Xplore
Pages: 611-615
Number of pages: 5
ISBN (electronic): 9781665498562
ISBN (print): 978-1-6654-9857-9
Publication status: Published - 8 Sept 2022
Peer-reviewed: Yes

Conference

Title: 2022 IEEE International Conference on Cluster Computing
Abbreviated title: CLUSTER 2022
Duration: 6 - 8 September 2022
Degree of recognition: International event
Location: Ruprecht-Karls-Universität Heidelberg
City: Heidelberg
Country: Germany

External IDs

Scopus: 85140931014
Mendeley: b6698516-5a65-3643-8955-c54d2f3a7bfc
ORCID: /0000-0002-5437-3887/work/154740526

Keywords

  • Symbiosis, Process monitoring, Data centers, Operating systems, Instruments, High performance computing, Cluster computing, system monitoring, performance analysis, performance tools