Designing run-time fault-tolerance using dynamic updates

Document Type

Conference Proceeding

Publication Date



We present a framework for designing run-time fault-tolerance using dynamic program updates triggered by faults. This problem is important in the design of autonomous systems because a running program often needs to be upgraded to its fault-tolerant version once faults occur. We formally state fault-triggered program updates as a design problem. We then present a sound and complete algorithm that automates the design of fault-triggered updates, which replace a program that does not tolerate faults with a fault-tolerant version of it at run-time. We also define three classes of fault-triggered dynamic updates that tolerate faults occurring during the update itself. We demonstrate our approach with a fault-triggered update for the gate controller of a parking lot. © 2007 IEEE.
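
The general idea of a fault-triggered update can be illustrated with a small sketch. The code below is not the paper's algorithm, which designs updates formally. It is a hand-written toy in which a gate controller swaps its fault-intolerant step function for a fault-tolerant one the first time a fault is observed. All names (`State`, `basic_step`, `tolerant_step`, `UpdatableController`), the sensor-failure fault model, and the hold-the-gate-open policy are assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class State:
    """Hypothetical controller state for a parking-lot gate."""
    car_waiting: bool = False
    sensor_ok: bool = True   # False models a sensor fault (assumed fault model)
    gate: str = "closed"


def basic_step(state: State) -> None:
    """Fault-intolerant version: trusts the sensor reading unconditionally."""
    state.gate = "open" if state.car_waiting else "closed"


def tolerant_step(state: State) -> None:
    """Fault-tolerant version: holds the gate open while the sensor is faulty.

    Holding the gate open is an assumed fail-safe policy, not one taken
    from the paper.
    """
    if not state.sensor_ok:
        state.gate = "open"
    else:
        basic_step(state)


class UpdatableController:
    """Runs the basic version until a fault triggers the update."""

    def __init__(self) -> None:
        self.step = basic_step
        self.updated = False

    def run_once(self, state: State) -> str:
        # The fault itself is the trigger: swap in the tolerant version
        # before executing the next step.
        if not state.sensor_ok and not self.updated:
            self.step = tolerant_step
            self.updated = True
        self.step(state)
        return state.gate
```

In this sketch the update is a single atomic assignment, so no fault can interrupt it. The classes of updates that tolerate faults during the update, as described in the abstract, address the harder case where the replacement itself takes several steps.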

Publication Title

Proceedings - ICSE 2007 Workshops: International Workshop on Software Engineering for Adaptive and Self-Managing Systems, SEAMS 2007