Stochastic Models for Fault Tolerance: Restart, Rejuvenation and Checkpointing

As modern society relies on the fault-free operation of complex computing systems, system fault-tolerance has become an indispensable requirement. Therefore, we need mechanisms that guarantee correct service in cases where system components fail, be they software or hardware elements. Redundancy pat...


Bibliographic Details
Main Author: Wolter, Katinka. (Author)
Corporate Author: SpringerLink (Online service)
Format: Electronic
Language: English
Published: Berlin, Heidelberg : Springer Berlin Heidelberg : Imprint: Springer, 2010.
Online Access: https://ezaccess.library.uitm.edu.my/login?url=http://dx.doi.org/10.1007/978-3-642-11257-7
Table of Contents:
  • Part I: Introduction
      • 1) Basic Concepts and Problems
      • 2) Task Completion Time
  • Part II: Restart
      • 3) Applicability Analysis of Restart
      • 4) Moments of Completion Time under Restart
      • 5) Meeting Deadlines through Restart
  • Part III: Software Rejuvenation
      • 6) Practical Aspects of Preventive Maintenance and Software Rejuvenation
      • 7) Stochastic Models for Preventive Maintenance and Software Rejuvenation
  • Part IV: Checkpointing
      • 8) Checkpointing Systems
      • 9) Stochastic Models for Checkpointing
      • 10) Summary, Conclusion and Outlook
  • Appendix
      • A) Properties in Discrete Systems
      • B) Important Probability Distributions
      • C) Estimating the Hazard Rate
      • D) The Laplace and the Laplace-Stieltjes Transform