SProBench: Stream Processing Benchmark for High Performance Computing Infrastructure

Research output: Contribution to book/Conference proceedings/Anthology/ReportConference contributionContributedpeer-review

Abstract

Recent advancements in data stream processing frameworks have improved real-time data handling, however, scalability remains a significant challenge affecting throughput and latency. While studies have explored this issue on local machines and cloud clusters, research on modern high-performance computing (HPC) infrastructures is yet limited due to the lack of scalable measurement tools. This work presents SProBench, a novel benchmark suite designed to evaluate the performance of data stream processing frameworks in large-scale computing systems. Building on best practices, SProBench incorporates a modular architecture, offers native support for SLURM-based clusters, and seamlessly integrates with popular stream processing frameworks such as Apache Flink, Apache Spark Streaming, and Apache Kafka Streams. Experiments conducted on HPC clusters demonstrate its exceptional scalability, delivering throughput that surpasses existing benchmarks by more than tenfold. The distinctive features of SProBench, including complete customization options, built-in automated experiment management tools, seamless interoperability, and an open-source license, distinguish it as an innovative benchmark suite tailored to meet the needs of modern data stream processing frameworks.

Details

Original languageEnglish
Title of host publicationEuro-Par 2025: Parallel Processing
EditorsWolfgang E. Nagel, Diana Goehringer, Pedro C. Diniz
PublisherSpringer Science and Business Media B.V.
Pages268-282
Number of pages15
ISBN (electronic)978-3-031-99872-0
ISBN (print)978-3-031-99871-3
Publication statusPublished - 2026
Peer-reviewedYes

Publication series

SeriesLecture notes in computer science
Volume15902 LNCS
ISSN0302-9743

Conference

Title31st International Conference on Parallel and Distributed Computing
Abbreviated titleEuro-Par 2025
Conference number31
Duration25 - 29 August 2025
Website
LocationTechnische Universität Dresden
CityDresden
CountryGermany

Keywords

Keywords

  • Benchmark suite, HPC cluster, Slurm, Stream Processing