Fault Tolerance of Sorting Algorithms Under Varying Input Characteristics
Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review
Contributors
Abstract
Bit flips in main memory or in the CPU can lead to silent data corruptions (SDCs) and therefore pose a threat to all modern computer systems. The selection of particularly fault-tolerant algorithms is known to have a strong impact on a system's resilience, but involves time-consuming fault-injection (FI) campaigns. This problem is exacerbated by the fact that the optimal selection also depends on input characteristics like data length or structure. In this paper, we exemplarily analyze different sorting algorithms with respect to their fault tolerance. Confirming and complementing other studies, we show that input-data length and ordering are relevant for algorithm selection. We find that existing hybrid algorithms, like Timsort or Introsort, also make favorable choices for fault tolerance and dominate the field. In a further consideration, we correlate the SDC probabilities with simpler metrics, such as the number of retired instructions. We find that these metrics can also be used to select the most resilient sorting algorithm, as the resulting algorithm ranking is strongly correlated to one obtained from an expensive FI campaign.
Details
| Original language | English |
|---|---|
| Title of host publication | 2025 20th European Dependable Computing Conference (EDCC) |
| Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
| Pages | 63-71 |
| Number of pages | 9 |
| ISBN (electronic) | 979-8-3315-1280-4 |
| ISBN (print) | 979-8-3315-1281-1 |
| Publication status | Published - 10 Apr 2025 |
| Peer-reviewed | Yes |
Publication series
| Series | European Dependable Computing Conference (EDCC) |
|---|---|
| ISSN | 2641-810X |
Conference
| Title | 20th European Dependable Computing Conference |
|---|---|
| Abbreviated title | EDCC 2025 |
| Conference number | 20 |
| Duration | 8 - 11 April 2025 |
| Website | |
| Location | University of Lisboa (FCUL) |
| City | Lisbon |
| Country | Portugal |
External IDs
| Scopus | 105015528051 |
|---|---|
| ORCID | /0000-0002-1427-9343/work/193705399 |
Keywords
ASJC Scopus subject areas
Keywords
- Europe, Fault tolerance, Fault tolerant systems, Lead, Measurement, Redundancy, Registers, Sorting, fault tolerance, bit flip, soft error, sorting algorithm, input characteristics, correlation, silent data corruption, fault injection