Performance analysis has a long history in the high-performance computing community. On the one hand, the traditional application analysis focuses on scalable yet detailed instrumentation of parallel execution. On the other hand, per-node or cluster-wide monitoring solutions are used in data center operation. However, performance anomalies resulting from the interaction between applications, background processes and the operating system, are difficult to analyze with tools that reveal only part of the issue. In this paper, we present a novel approach that covers all aspects of individual nodes. We extend a monitoring tool to combine call stack sampling, process monitoring, and syscall recording into a symbiotic view of the application execution, background activity, and the operating system.
|Title of host publication||Proceedings - 2022 IEEE International Conference on Cluster Computing, CLUSTER 2022|
|Number of pages||5|
|Publication status||Published - 8 Sept 2022|
|Title||2022 IEEE International Conference on Cluster Computing (CLUSTER)|
|Duration||5 - 8 September 2022|
- Symbiosis, Process monitoring, Data centers, Operating systems, Instruments, High performance computing, Cluster computing, system monitoring, performance analysis, performance tools