Enforcing Perfect Failure Detection

Christof Fetzer

Enforcing Perfect Failure Detection

Publikation: Beitrag zu Konferenzen › Paper › Beigetragen › Begutachtung

Beitragende

Christof Fetzer - , Professur für Systems Engineering (SE) (Autor:in)

Abstract

Perfect failure detectors can correctly decide whether a computer is crashed. However it is impossible to implement a perfect failure detector in purely asynchronous systems. We show how to enforce perfect failure detection in timed distributed systems with hardware watchdogs. The two main system model assumptions are: each computer can measure time intervals with a known maximum error; and each computer has a watchdog that crashes the computer unless the watchdog is periodically updated. We have implemented a system that satisfies both assumptions using a combination of off-the-shelf software and hardware.

Details

Originalsprache	Englisch
Seiten	350-359
Seitenumfang	10
Publikationsstatus	Veröffentlicht - 2001
Peer-Review-Status	Ja

Konferenz

Titel	2001 21st International Conference on Distributed Computing Systems
Kurztitel	ICDSC 2001
Veranstaltungsnummer	21
Dauer	16 - 19 April 2001
Bekanntheitsgrad	Internationale Veranstaltung
Stadt	Mesa
Land	USA/Vereinigte Staaten

Schlagworte

Forschungsprofillinien der TU Dresden

Informationstechnologien und Mikroelektronik

DFG-Fachsystematik nach Fachkollegium

Sicherheit und Verlässlichkeit

Schlagwörter

perfect failure detection, crash failures, asynchronous distribued systems, timed asynchronous system model, Computer crashes, detectors, time measurement, Computer errors, Fault tolerant systems, Clocks, Error correction, Fault detection, Heart beat, Fault tolerant computing, distributed processing, system recovery, purely asynchronous systems, timed distribued systems, hardware watchdogs, time intervals, off-the-shelf software

Forschungsportal der TU Dresden