Program analysis tools for massively parallel applications: How to achieve highest performance

Andreas Knuepfer; Dieter Kranzlmueller; Bernd W. Mohr; Wolfgang E. Nagel

doi:10.1145/1188455.1188687

Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review

Contributors

Andreas Knuepfer - Department of Interdisciplinary Application Development and Coordination (IAK) (Author)
Dieter Kranzlmueller (Author)
Bernd W. Mohr - Julich Supercomputing Centre (Author)
Wolfgang E. Nagel - Center for Information Services and High Performance Computing (ZIH), Chair of Computer Architecture (Author)

Abstract

Today's HPC environments are increasingly complex in order to achieve the highest performance. Hardware platforms introduce features such as out-of-order execution, multi-level caches, multi-core processors, non-uniform memory access, etc. Application software combines OpenMP, MPI, optimized libraries, and various types of compiler optimization to exploit the potential performance.

To reach a reasonable percentage of the theoretical peak performance, three fundamental steps need to be accomplished. First, correctness must be guaranteed, especially during the course of optimization. Second, the performance actually achieved needs to be determined; in particular, the contributions and limitations of all subsystems involved (CPU, memory, network, I/O) have to be identified. Third, actual optimization can only succeed with the knowledge obtained in the previous steps.

These steps are by no means trivial. Sophisticated tools beyond simple profiling are available to support the HPC user. The tutorial introduces a variety of such tools: it shows how they work together and how they scale with long-running, massively parallel cases.

Details

Original language	English
Title of host publication	SC '06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing
Publisher	Association for Computing Machinery (ACM), New York
ISBN (print)	978-0-7695-2700-0
Publication status	Published - 2006
Peer-reviewed	Yes

Publication series

Series	SC: The International Conference for High Performance Computing, Networking, Storage, and Analysis
ISSN	2167-4329

Research Portal of the TU Dresden
