Design Optimization of Time- and Cost-Constrained Fault-Tolerant Embedded Systems with Checkpointing and Replication

Paul Pop, Viacheslav Izosimov, Petru Eles, Zebo Peng

    Research output: Contribution to journal, Journal article, Research, peer-review


    Abstract

    We present an approach to the synthesis of fault-tolerant hard real-time systems for safety-critical applications. We use checkpointing with rollback recovery and active replication for tolerating transient faults. Processes and communications are statically scheduled. Our synthesis approach decides the assignment of fault-tolerance policies to processes, the optimal placement of checkpoints and the mapping of processes to processors such that multiple transient faults are tolerated and the timing constraints of the application are satisfied. We present several design optimization approaches which are able to find fault-tolerant implementations given a limited amount of resources. The developed algorithms are evaluated using extensive experiments, including a real-life example.
    Original language: English
    Journal: IEEE Transactions on Very Large Scale Integration Systems
    Volume: 17
    Issue number: 3
    Pages (from-to): 389-402
    ISSN: 1063-8210
    Publication status: Published - 2009

    Bibliographical note

    Copyright: 2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
