Revisiting Fault-Injection Experiment-Platform Architectures

Research output: Contribution to book/Conference proceedings/Anthology/ReportConference contributionContributedpeer-review

Contributors

  • Horst Schirmeier - , Chair of Operating Systems, Dortmund University of Technology (Author)
  • Martin Hoffmann - , Friedrich-Alexander University Erlangen-Nürnberg (Author)
  • Rudiger Kapitza - , Friedrich-Alexander University Erlangen-Nürnberg (Author)
  • Daniel Lohmann - , Friedrich-Alexander University Erlangen-Nürnberg (Author)
  • Olaf Spinczyk - , Dortmund University of Technology (Author)

Abstract

Many years of research on dependable, fault-tolerant software systems yielded a myriad of tool implementations for vulnerability analysis and experimental validation of resilience measures. Trace recording and fault injection are among the core functionalities these tools provide for hardware debuggers or system simulators, partially including some means to automate larger experiment campaigns. We argue that current fault-injection tools are too highly specialized for specific hardware devices or simulators, and are developed in poorly modularized implementations impeding evolution and maintenance. In this article, we present a novel design approach for a fault-injection infrastructure that allows experimenting researchers to switch simulator or hardware back ends with little effort, fosters experiment code reuse, and retains a high level of maintainability.

Details

Original languageEnglish
Title of host publication2011 IEEE 17th Pacific Rim International Symposium on Dependable Computing
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Pages284-285
Number of pages2
ISBN (print)978-0-7695-4590-5
Publication statusPublished - 14 Dec 2011
Peer-reviewedYes

Conference

Title2011 IEEE 17th Pacific Rim International Symposium on Dependable Computing
Duration12 - 14 December 2011
LocationPasadena, CA, USA

External IDs

Scopus 84857774793
ORCID /0000-0002-1427-9343/work/167216807

Keywords

Keywords

  • Hardware, Circuit faults, Fault tolerance, Fault tolerant systems, Programming, Buffer storage, Switches