Study on the Level of Confidence for Roll-back Recovery with Checkpointing
(2011) 1st Intl. Workshop on Dependability Issues in Deep-submicron Technologies (DDT 2011)- Abstract
- Increasing soft error rates for semiconductor devices manufactured in later technologies enforces the use of fault tolerant techniques such as Roll-back Recovery with Checkpointing (RRC). However, RRC introduces time overhead that increases the completion (execution) time. For non-real-time systems, research have focused on optimizing RRC and shown that it is possible to find the optimal number of checkpoints such that the average execution time is minimal. While minimal average execution time is important, it is for real-time systems important to provide a high probability of meeting given deadlines. Hence, there is a need of probabilistic guarantees that jobs employing RRC complete before a given deadline. Therefore, in this paper we... (More)
- Increasing soft error rates for semiconductor devices manufactured in later technologies enforces the use of fault tolerant techniques such as Roll-back Recovery with Checkpointing (RRC). However, RRC introduces time overhead that increases the completion (execution) time. For non-real-time systems, research have focused on optimizing RRC and shown that it is possible to find the optimal number of checkpoints such that the average execution time is minimal. While minimal average execution time is important, it is for real-time systems important to provide a high probability of meeting given deadlines. Hence, there is a need of probabilistic guarantees that jobs employing RRC complete before a given deadline. Therefore, in this paper we present a mathematical framework for the evaluation of level of confidence, the probability that a given deadline is met, when RRC is employed. (Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/5050608
- author
- Nikolov, Dimitar LU ; Ingelsson, Urban ; Singh, Virendra and Larsson, Erik LU
- publishing date
- 2011
- type
- Contribution to conference
- publication status
- published
- subject
- conference name
- 1st Intl. Workshop on Dependability Issues in Deep-submicron Technologies (DDT 2011)
- conference location
- Trondheim, Norway
- conference dates
- 2011-05-27
- language
- English
- LU publication?
- no
- id
- 5ac6db00-215e-49ea-b0c8-ae3a5265dfcd (old id 5050608)
- date added to LUP
- 2016-04-04 14:02:00
- date last changed
- 2018-11-21 21:17:51
@misc{5ac6db00-215e-49ea-b0c8-ae3a5265dfcd, abstract = {{Increasing soft error rates for semiconductor devices manufactured in later technologies enforces the use of fault tolerant techniques such as Roll-back Recovery with Checkpointing (RRC). However, RRC introduces time overhead that increases the completion (execution) time. For non-real-time systems, research have focused on optimizing RRC and shown that it is possible to find the optimal number of checkpoints such that the average execution time is minimal. While minimal average execution time is important, it is for real-time systems important to provide a high probability of meeting given deadlines. Hence, there is a need of probabilistic guarantees that jobs employing RRC complete before a given deadline. Therefore, in this paper we present a mathematical framework for the evaluation of level of confidence, the probability that a given deadline is met, when RRC is employed.}}, author = {{Nikolov, Dimitar and Ingelsson, Urban and Singh, Virendra and Larsson, Erik}}, language = {{eng}}, title = {{Study on the Level of Confidence for Roll-back Recovery with Checkpointing}}, year = {{2011}}, }