HAFT: Hardware-assisted Fault Tolerance
Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review
Contributors
Abstract
Transient hardware faults during the execution of a program
can cause data corruptions. We present HAFT, a fault tolerance
technique using hardware extensions of commodity CPUs to
protect unmodified multithreaded applications against such cor-
ruptions. HAFT utilizes instruction-level redundancy for fault
detection and hardware transactional memory for fault recovery.
We evaluated HAFT with Phoenix and PARSEC benchmarks.
The observed normalized runtime is 2×, with 98.9% of the in-
jected data corruptions being detected and 91.2% being corrected.
To demonstrate the effectiveness of HAFT, we applied it to real-
world case studies including Memcached, Apache, and SQLite.
can cause data corruptions. We present HAFT, a fault tolerance
technique using hardware extensions of commodity CPUs to
protect unmodified multithreaded applications against such cor-
ruptions. HAFT utilizes instruction-level redundancy for fault
detection and hardware transactional memory for fault recovery.
We evaluated HAFT with Phoenix and PARSEC benchmarks.
The observed normalized runtime is 2×, with 98.9% of the in-
jected data corruptions being detected and 91.2% being corrected.
To demonstrate the effectiveness of HAFT, we applied it to real-
world case studies including Memcached, Apache, and SQLite.
Details
Original language | English |
---|---|
Title of host publication | Proceedings of the Eleventh European Conference on Computer Systems (Eurosys 2016) |
Publisher | ACM Press |
Publication status | Published - 2016 |
Peer-reviewed | Yes |
External IDs
Scopus | 84971515768 |
---|