Enforcing Perfect Failure Detection
Publikation: Beitrag zu Konferenzen › Paper › Beigetragen › Begutachtung
Beitragende
Abstract
Perfect failure detectors can correctly decide whether a computer is crashed. However it is impossible to implement a perfect failure detector in purely asynchronous systems. We show how to enforce perfect failure detection in timed distributed systems with hardware watchdogs. The two main system model assumptions are: each computer can measure time intervals with a known maximum error; and each computer has a watchdog that crashes the computer unless the watchdog is periodically updated. We have implemented a system that satisfies both assumptions using a combination of off-the-shelf software and hardware.
Details
Originalsprache | Englisch |
---|---|
Seiten | 350-359 |
Seitenumfang | 10 |
Publikationsstatus | Veröffentlicht - 2001 |
Peer-Review-Status | Ja |
Konferenz
Titel | 2001 21st International Conference on Distributed Computing Systems |
---|---|
Kurztitel | ICDSC 2001 |
Veranstaltungsnummer | 21 |
Dauer | 16 - 19 April 2001 |
Bekanntheitsgrad | Internationale Veranstaltung |
Stadt | Mesa |
Land | USA/Vereinigte Staaten |
Schlagworte
Forschungsprofillinien der TU Dresden
DFG-Fachsystematik nach Fachkollegium
Schlagwörter
- perfect failure detection, crash failures, asynchronous distribued systems, timed asynchronous system model, Computer crashes, detectors, time measurement, Computer errors, Fault tolerant systems, Clocks, Error correction, Fault detection, Heart beat, Fault tolerant computing, distributed processing, system recovery, purely asynchronous systems, timed distribued systems, hardware watchdogs, time intervals, off-the-shelf software