On the probability of detecting data errors generated by permanent faults using time redundancy

Joakim Aidemark; Peter Folkesson; Johan Karlsson

doi:10.1109/OLT.2003.1214369

Paper in proceedings, 2003

Time-redundant execution of tasks, followed by comparison of the results, is a well-known technique for detecting transient faults in computer systems. However, time redundancy can also detect permanent faults that occur during or between the executions of two task replicas, provided the faults affect the results of the two tasks in different ways. In this paper, we derive an expression for estimating the probability of detecting data errors generated by permanent faults with time-redundant execution. The expression is validated experimentally by injecting permanent stuck-at faults into the multiplier unit of a microprocessor. We use the derived expression to show how tasks can be scheduled to improve the probability of detecting errors generated by permanent faults. We also show that the capability to detect permanent faults is low for the Temporal Error Masking (TEM) technique (i.e., triplicated execution and voting to mask transient faults) and may not be increased by scheduling. We therefore propose complementing TEM with special test tasks.
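
The detection condition stated in the abstract can be illustrated with a minimal sketch. It is not the paper's derived expression or its fault-injection setup. The function names `multiply` and `detected`, the 16-bit result width, and the model of a stuck-at fault as a single forced bit in the multiplier output are all assumptions made for illustration.

```python
def multiply(a, b, stuck_bit=None, stuck_value=0, width=16):
    """Multiply a and b; optionally force one output bit to a fixed value.

    Hypothetical fault model: a permanent stuck-at fault is represented as
    one bit of the result being stuck at 0 or 1.
    """
    product = (a * b) & ((1 << width) - 1)
    if stuck_bit is None:
        return product  # fault-free unit
    mask = 1 << stuck_bit
    return (product | mask) if stuck_value else (product & ~mask)


def detected(a, b, fault_in_first, fault_in_second, stuck_bit, stuck_value):
    """Run the same task twice and compare the two results.

    A mismatch means the time-redundant comparison detects the error.
    """
    r1 = multiply(a, b, stuck_bit if fault_in_first else None, stuck_value)
    r2 = multiply(a, b, stuck_bit if fault_in_second else None, stuck_value)
    return r1 != r2


# Fault appears between the two executions: only the second result is affected.
print(detected(3, 5, False, True, stuck_bit=0, stuck_value=0))

# Fault present during both executions: both results are wrong in the same way.
print(detected(3, 5, True, True, stuck_bit=0, stuck_value=0))
```

In this simplified model, a fault that arises between the two executions is caught whenever it changes the result, while a fault present during both executions of an identical computation corrupts both results identically and passes the comparison.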

Authors

Joakim Aidemark

Chalmers, Department of Computer Engineering

Peter Folkesson

Chalmers, Department of Computer Engineering

Johan Karlsson

Chalmers, Department of Computer Engineering

Proceedings of the 9th IEEE International On-Line Testing Symposium, Kos, 7-9 July 2003

Pages 68-74
0-7695-1968-7 (ISBN)

Subject Categories (SSIF 2011)

Computer Engineering

More information

Created

10/7/2017
