Gregory M. Kapfhammer
Details in the Textbook
Availability: ready to be use immediately
Reliability: run continuously without failure
Safety: nothing catastrophic happens on failure
Maintainability: system can be easily repaired on failure
How are these terms related to each other?
Fail: system cannot meet its promises
Error: incorrect system state that may lead to failure
Fault: the cause of an error
PIE Model: how a fault manifests itself in a failure
Fault tolerance: provide services in the presence of faults
How can systems provide fault tolerance?
Transient: occur once and then disappear
Intermittent: occurs, vanishes, and the reappears
Permanent: continues to exist until replacement
Which kinds of faults most frequently occur in software?
See Figure 8-1 for more details!