Artificial intelligence based task mapping and pipelined scheduling for checkpointing on real time systems with imperfect fault detection

Research output: Contribution to book/Conference proceedings/Anthology/ReportConference contributionContributedpeer-review

Contributors

  • Anup Das - , National University of Singapore (Author)
  • Akash Kumar - , National University of Singapore (Author)
  • Bharadwaj Veeravalli - , National University of Singapore (Author)

Abstract

Fault-tolerance is emerging as one of the important optimization objectives for designs in deep submicron technology nodes. This paper proposes a technique of application mapping and scheduling with checkpointing on a multiprocessor system to maximize the reliability considering transient faults. The proposed model incorporates checkpoints with imperfect fault detection probability, and pipelined execution and cyclic dependency associated with multimedia applications. This is solved using an Artificial Intelligence technique known as Particle Swarm Optimization to determine the number of checkpoints of every task of the application that maximizes the confidence of the output. The proposed approach is validated experimentally with synthetic and real-life application graphs. Results demonstrate the proposed technique improves the probability of correct result by an average 15% with imperfect fault detection. Additionally, even with 100% fault detection, the proposed technique is able to achieve better results (25% higher confidence) as compared to the existing fault-tolerant techniques.

Details

Original languageEnglish
Title of host publication2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)
Place of PublicationAmsterdam
PublisherIEEE Xplore
Pages134-140
Number of pages7
ISBN (electronic)978-1-4799-6155-9, 978-1-4799-6154-2
Publication statusPublished - 18 Nov 2014
Peer-reviewedYes
Externally publishedYes

Publication series

SeriesIEEE International Symposium on Defect and Fault Tolerance in VLSI Systems
ISSN1550-5774

Conference

Title27th IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems, DFT 2014
Duration1 - 3 October 2014
CityAmsterdam
CountryNetherlands

Keywords

Research priority areas of TU Dresden

ASJC Scopus subject areas