SProBench: Stream Processing Benchmark for High Performance Computing Infrastructure
Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review
Contributors
Abstract
Recent advancements in data stream processing frameworks have improved real-time data handling, however, scalability remains a significant challenge affecting throughput and latency. While studies have explored this issue on local machines and cloud clusters, research on modern high-performance computing (HPC) infrastructures is yet limited due to the lack of scalable measurement tools. This work presents SProBench, a novel benchmark suite designed to evaluate the performance of data stream processing frameworks in large-scale computing systems. Building on best practices, SProBench incorporates a modular architecture, offers native support for SLURM-based clusters, and seamlessly integrates with popular stream processing frameworks such as Apache Flink, Apache Spark Streaming, and Apache Kafka Streams. Experiments conducted on HPC clusters demonstrate its exceptional scalability, delivering throughput that surpasses existing benchmarks by more than tenfold. The distinctive features of SProBench, including complete customization options, built-in automated experiment management tools, seamless interoperability, and an open-source license, distinguish it as an innovative benchmark suite tailored to meet the needs of modern data stream processing frameworks.
Details
| Original language | English |
|---|---|
| Title of host publication | Euro-Par 2025: Parallel Processing |
| Editors | Wolfgang E. Nagel, Diana Goehringer, Pedro C. Diniz |
| Publisher | Springer Science and Business Media B.V. |
| Pages | 268-282 |
| Number of pages | 15 |
| ISBN (electronic) | 978-3-031-99872-0 |
| ISBN (print) | 978-3-031-99871-3 |
| Publication status | Published - 2026 |
| Peer-reviewed | Yes |
Publication series
| Series | Lecture notes in computer science |
|---|---|
| Volume | 15902 LNCS |
| ISSN | 0302-9743 |
Conference
| Title | 31st International Conference on Parallel and Distributed Computing |
|---|---|
| Abbreviated title | Euro-Par 2025 |
| Conference number | 31 |
| Duration | 25 - 29 August 2025 |
| Website | |
| Location | Technische Universität Dresden |
| City | Dresden |
| Country | Germany |
Keywords
ASJC Scopus subject areas
Keywords
- Benchmark suite, HPC cluster, Slurm, Stream Processing