Fault-tolerance embedded control systems for unreliable hardware

D. Goswami, D. Muller-Gritschneder, A.A. Basten, U. Schlichtmann, S. Chakraborty

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

7 Citations (Scopus)
1 Downloads (Pure)


Past years have seen intense research on reliability techniques for error detection recovery at various levels ranging from circuit level up to architectural level or even software level. In such scenarios, affordable techniques for error correction usually imply a timing penalty, e.g., check-pointing usually requires to repeat some part of the computation, which imposes a higher computation time. This can be problematic for real-time embedded control applications especially in the presence of intermittent hardware faults, for which delays due to re-computation are repeatedly encountered with high repetition rate. In this work, we investigate a setting where the control loops are executed on an unreliable embedded platform that may suffer from such intermittent faults. First, we characterize the impact of intermittent faults in the hardware by using an intermittent bit-flip fault model and RTL level error effect simulation. Subsequently, we look at novel fault-tolerant control algorithms that guarantee stability of the loops even in presence of repeating timing errors due to the error recovery of the unreliable hardware.
Original languageEnglish
Title of host publicationProceedings of the 2014 14th International Symposium on Integrated Circuits (ISIC), 10-12 December 2014, Singapore
Place of PublicationPiscataway
PublisherInstitute of Electrical and Electronics Engineers
Publication statusPublished - 2014
Eventconference; International Symposium on Integrated Circuits (ISIC); 2014-12-10; 2014-12-12 -
Duration: 10 Dec 201412 Dec 2014


Conferenceconference; International Symposium on Integrated Circuits (ISIC); 2014-12-10; 2014-12-12
OtherInternational Symposium on Integrated Circuits (ISIC)


Dive into the research topics of 'Fault-tolerance embedded control systems for unreliable hardware'. Together they form a unique fingerprint.

Cite this