HAFT: Hardware-assisted Fault Tolerance

Research output: Contribution to book/conference proceedings/anthology/report > Conference contribution > Contributed > peer-review

Abstract

Transient hardware faults during the execution of a program
can cause data corruptions. We present HAFT, a fault tolerance
technique using hardware extensions of commodity CPUs to
protect unmodified multithreaded applications against such
corruptions. HAFT utilizes instruction-level redundancy for fault
detection and hardware transactional memory for fault recovery.
We evaluated HAFT with Phoenix and PARSEC benchmarks.
The observed normalized runtime is 2×, with 98.9% of the
injected data corruptions being detected and 91.2% being corrected.
To demonstrate the effectiveness of HAFT, we applied it to
real-world case studies including Memcached, Apache, and SQLite.
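
The detect-then-recover idea in the abstract can be illustrated with a small software-only sketch. It is not HAFT's implementation: HAFT works at the instruction level and relies on hardware transactional memory, whereas this sketch merely runs a step twice on private copies, compares the results, and rolls back to a snapshot and retries on a mismatch. All names (`run_hardened`, `accumulate`, `flip_once`, `fault_hook`, `max_retries`) are hypothetical and chosen for illustration only.

```python
import copy


def run_hardened(step, state, fault_hook=None, max_retries=3):
    """Run `step` redundantly; on disagreement, roll back and retry.

    Assumption: `step` takes a state and returns the updated state.
    `fault_hook` is a hypothetical test aid that may corrupt the primary
    result to imitate a transient fault.
    Returns (result, number_of_retries).
    """
    for attempt in range(max_retries + 1):
        # Snapshot stands in for the start of a transaction.
        snapshot = copy.deepcopy(state)

        # Redundant execution: primary and shadow work on private copies.
        primary = step(copy.deepcopy(snapshot))
        shadow = step(copy.deepcopy(snapshot))

        if fault_hook is not None:
            primary = fault_hook(attempt, primary)

        if primary == shadow:
            return primary, attempt  # results agree: "commit"

        # Results disagree: discard both and restart from the snapshot.
        state = snapshot

    raise RuntimeError("fault persisted after all retries")


def accumulate(state):
    """Example step: add the sum of `items` to `total`."""
    state["total"] += sum(state["items"])
    return state


def flip_once(attempt, result):
    """Imitate a transient bit flip that occurs only on the first attempt."""
    if attempt == 0:
        return dict(result, total=result["total"] ^ 0x10)
    return result


result, retries = run_hardened(
    accumulate, {"total": 0, "items": [1, 2, 3]}, fault_hook=flip_once
)
```

In this sketch the corrupted first attempt is detected by the comparison and the second attempt succeeds, so `retries` is 1. A fault that recurs on every attempt exhausts `max_retries` and raises an error, which loosely mirrors a corruption that is detected but not corrected.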

Details

Original language: English
Title of host publication: Proceedings of the Eleventh European Conference on Computer Systems (EuroSys 2016)
Publisher: ACM Press
Publication status: Published - 2016
Peer-reviewed: Yes

External IDs

Scopus: 84971515768
