Overhead of a Decentralized Gossip Algorithm on the Performance of HPC Applications

Research output: Contribution to conferencesPaperContributedpeer-review

Contributors

Abstract

Gossip algorithms can provide online information about the availability and the state of the resources in supercomputers. These algorithms require minimal computing and storage capabilities at each node and when properly tuned, they are not expected to overload the nodes or the network that connects these nodes. These properties make gossip interesting for future exascale systems. This paper examines the overhead of a decentralized gossip algorithm on the performance of parallel MPI applications running on up to 8192 nodes of an IBM BlueGene/Q supercomputer. The applications that were used in the experiments include PTRANS and MPI-FFT from the HPCC benchmark suite as well as the coupled weather and cloud simulation model COSMO-SPECS+FD4. In most cases, no gossip overhead was observed when the gossip messages were sent at intervals of 256ms or more. As expected, the overhead that is observed at higher rates is sensitive to the communication pattern of the application and the amount of gossip information being circulated.

Details

Original languageEnglish
Pages1-7
Number of pages7
Publication statusPublished - 2014
Peer-reviewedYes

Workshop

Title4th International Workshop on Runtime and Operating Systems for Supercomputers
Conference number
Duration10 June 2014
Website
Degree of recognitionInternational event
Location
CityMunich
CountryGermany

External IDs

Scopus 84903625032
ORCID /0000-0003-3137-0648/work/142238846

Keywords

Keywords

  • algorithm, HPC, apllication, performance