An Interface for Integrated MPI Correctness Checking
Research output: Contribution to book/conference proceedings/anthology/report › Conference contribution › Contributed › peer-review
Contributors
Abstract
Usage errors of the widely accepted Message-Passing Interface (MPI) are common and complicate the development process of parallel applications considerably. Some of these errors are hard to track, especially when they only occur in certain application runs or on certain platforms. Runtime correctness checking tools for MPI simplify the detection of these errors. However, they usually need the MPI profiling interface for their analysis. This paper addresses two issues related to correctness tools: First, due to the exclusive usage of the MPI profiling interface, it is not possible to use such tools in conjunction with other MPI tools, which are also based on the profiling interface. Second, correctness checking tools usually lack the ability to provide a detailed history of the events leading to an error, whereas such a history is provided naturally by tracing frameworks. We introduce the Universal MPI Correctness Interface (UniMCI) to overcome the first problem. This interface provides functions that invoke correctness checking and return detected errors in a manner that is independent of the correctness checker in use. Furthermore, we demonstrate the applicability of UniMCI with an implementation that uses the Marmot correctness checker and an exemplary integration of the interface into the VampirTrace performance analysis framework. As a result, we can provide a history for detected correctness events, which provides detailed information for debugging. Finally, we present a study using the SPEC MPI2007 benchmark to demonstrate the feasibility and applicability of our approach.
Details
Original language | English |
---|---|
Title of host publication | PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE |
Editors | B Chapman, F Desprez, GR Joubert, A Lichnewsky, F Peters, T Priol |
Publisher | IOS Press, Amsterdam [u. a.] |
Pages | 693-700 |
Number of pages | 8 |
ISBN (print) | 978-1-60750-529-7 |
Publication status | Published - 2010 |
Peer-reviewed | Yes |
Publication series
Series | Advances in Parallel Computing |
---|---|
Volume | 19 |
ISSN | 0927-5452 |
Conference
Title | International Conference on Parallel Computing 2009 |
---|---|
Abbreviated title | ParCo 2009 |
Duration | 1 - 4 September 2009 |
Location | École Normale Supérieure de Lyon |
City | Lyon |
Country | France |
External IDs
Scopus | 84906502143 |
---|---|
ORCID | /0000-0001-6520-4563/work/142236628 |
Keywords
Keywords
- Correctness checking, Message-Passing Interface, Tools, Marmot, Vampir