the Availability Digest
October 2006

Calculating Availability - Redundant Systems

Our logo expresses the basic availability equation for an active/active application network in a somewhat stylized form (see What's That Nerd Logo?). But what is the real relationship between the various subsystem factors and system availability? Though the relationship can become quite complex when many factors are taken into account,[1] the overriding availability equation is relatively simple. It is

$$A = 1 - f(1-a)^{s+1}$$

In this article, we will show the origin of this equation and what it means. The relationship also leads to the concept of 9s as a measure of availability as well as to some useful associated rules. We will explore these topics as well. If your eyes glaze over at some of the algebra, skip the body of this article and go right to the end to read the simple but important availability rules that come out of the analysis.

The Availability Relationship

To simplify things a bit, we will dive into this topic in small increments. First, we will look at a simple two-node system with one spare. Next, we will look at a multinode system with one spare. Finally, we will look at a multinode system with multiple spares.

First, let us define the following terms:

A  is the probability that the system will be up (its availability).
F  is the probability that the system will be down.
a  is the availability of a node.
n  is the number of nodes in the system.
s  is the number of spare nodes in the system.
f  is the number of ways that all of the spares plus one other node can fail (that is, the number of node failures that will cause the system to fail).

Clearly, A = 1-F.

[1] A more extensive derivation of the availability equation may be found in Breaking the Availability Barrier: Survivable Systems for Enterprise Computing, by Dr. Bill Highleyman, Paul J. Holenstein, and Dr. Bruce Holenstein, published by AuthorHouse, 2004.

© 2006 Sombers Associates, Inc., and W. H. Highleyman
www.availabilitydigest.com

Dual Node, Single Spare

We first consider an active/active system with two nodes, only one of which need be operational for the system to be considered available.

The availability of a single node is a. This is the probability that the node will be up. Therefore, the probability that it will be down is (1-a). Assuming that the nodes fail independently, the probability that both nodes will be down is $(1-a)^2$. This is the probability F for the failure of the system:

$$F = (1-a)^2$$

Thus, the availability of the system, A, is

$$A = 1 - F = 1 - (1-a)^2$$

For instance, if the node availability is .99, the probability that it will be down is .01. The probability that both nodes will be down, thus causing a system failure, is $.01^2$, or .0001. Thus, the system availability is (1 - .0001), or .9999. The system has an availability of four 9s.

[Figure: Dual Node, Single Spare. Node 1 and Node 2 connected by a network.]

Multiple Nodes, Single Spare

In a multinode system with one spare, it will still take only the failure of two nodes to take down the system. However, there are many ways that we can have a failure of two nodes. For instance, if there are five nodes in the system, there are ten ways that two nodes can fail (count them). Thus, in this case, the number of failure modes, f, is ten.

In general, if there are n nodes, there are n ways that one node can fail. Given a single node failure, there are (n-1) ways that a second node can fail. However, this reasoning has counted each failure mode twice; e.g., node 2 followed by node 5 and node 5 followed by node 2. Therefore, for an n node system, the number of failure modes, f, is

$$f = \frac{n(n-1)}{2}$$

The probability of failure of the system is the probability that a particular pair of nodes will fail times the number of ways that two nodes can fail:

$$F = \frac{n(n-1)}{2}(1-a)^2$$

[Figure: Multinode, Single Spare. Node 1 through Node 5 connected by a network.]

Thus, the availability of the system is

$$A = 1 - F = 1 - \frac{n(n-1)}{2}(1-a)^2$$

For instance, consider a five-node system. Using our previous example for nodes with an availability of .99, the number of failure modes is ten; and the availability of a five-node system is $[1 - 10(.01)^2]$, or .999. This is three 9s of availability.
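The count of failure modes for a singly-spared system can be checked by brute force. The following is a minimal Python sketch, not part of the original article: it enumerates every pair of nodes with the standard library's `itertools.combinations` and evaluates the single-spare relationship. The function names are illustrative only, and node failures are assumed to be independent.

```python
from itertools import combinations


def failure_modes_single_spare(n):
    """Count the distinct pairs of nodes whose joint failure
    takes down an n-node system with one spare."""
    nodes = range(1, n + 1)
    return sum(1 for _ in combinations(nodes, 2))


def availability_single_spare(n, a):
    """Evaluate A = 1 - [n(n-1)/2] * (1-a)^2 for node availability a."""
    f = n * (n - 1) // 2
    return 1 - f * (1 - a) ** 2


if __name__ == "__main__":
    # Compare the brute-force count with the closed form n(n-1)/2.
    for n in (2, 5):
        print(n, failure_modes_single_spare(n), n * (n - 1) // 2)
    # Two-node and five-node systems with a nodal availability of .99.
    print(round(availability_single_spare(2, 0.99), 6))
    print(round(availability_single_spare(5, 0.99), 6))
```

Enumerating the pairs is only practical for small n; the closed form is what one would use in practice.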
Note that this is less than the availability of four 9s for the two-node system. Here is an important rule to note:

As an active/active application network gets larger with no increased sparing, its availability goes down.

This is because of the increase in the number of failure modes. We talk about additional levels of sparing next.

Multiple Nodes, Multiple Spares

The next step to consider is the impact of having more than one spare. We have defined s as the number of spare nodes in the network. Therefore, it will take the loss of s+1 nodes to take down the network. Since the probability of losing one node is (1-a), the probability of losing a particular set of s+1 nodes is $(1-a)^{s+1}$. Note that if there is a single spare (s = 1), this reduces to $(1-a)^2$, as used above for a singly-spared network.

The next question is how many ways are there for s+1 nodes to fail? This is the number of failure modes, f, for the network and is the number of ways that s+1 nodes out of n nodes can fail. The number of such combinations is given by the rather imposing expression
$$f = \frac{n!}{(s+1)!\,(n-s-1)!}$$

[Figure: Multinode, Multiple Spares. Node 1 through Node 6 connected by a network.]

The symbol ! means factorial. For instance, 3! is 3 × 2 × 1 = 6.

Thus, the probability of failure for an n node system with s spares is

$$F = f(1-a)^{s+1}$$

and its availability is

$$A = 1 - f(1-a)^{s+1}$$

where f is given above for n nodes and s spares. This is the relationship that we promised you at the beginning of this article.

As an example, if there are two spares (s = 2), the number of failure modes, f, becomes

$$f = \frac{n(n-1)(n-2)}{6}$$

Consider a six-node system with two spares. That is, at least four nodes must be up and running in order for the system to be operational. Then f = 20 (that's right, there are twenty ways that three nodes out of six can fail; count them). Using our example above of a nodal availability of .99, the probability of failure of the system, F, is $20 \times (.01)^3$, or .00002. This yields a system availability of .99998, or almost five 9s. This compares to the similar singly-spared system above that had an availability of three 9s.
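The general relationship is easy to evaluate numerically. What follows is a small Python sketch, not taken from the original article, that computes f with the standard library's `math.comb` (available in Python 3.8 and later) and then applies the availability relationship as written above. The names `failure_modes` and `system_availability` are hypothetical, and independent node failures are assumed.

```python
from math import comb


def failure_modes(n, s):
    """Number of ways that s+1 of n nodes can fail:
    n! / ((s+1)! * (n-s-1)!)."""
    if not 0 <= s < n:
        raise ValueError("need 0 <= s < n")
    return comb(n, s + 1)


def system_availability(n, s, a):
    """Evaluate A = 1 - f * (1-a)^(s+1) for n nodes, s spares,
    and a node availability of a."""
    return 1 - failure_modes(n, s) * (1 - a) ** (s + 1)


if __name__ == "__main__":
    # The six-node, two-spare example with a nodal availability of .99.
    print(failure_modes(6, 2))
    print(round(system_availability(6, 2, 0.99), 8))
```

Setting s = 1 reproduces the single-spare count n(n-1)/2, so the same two functions cover every case discussed in this article.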

What About Those 9s?

The measure of 9s for availability is a logarithmic measure. It is like the Richter scale. An earthquake of magnitude 6 registers ten times the shaking amplitude of a magnitude 5 earthquake. Likewise, a system with an availability of four 9s is ten times more reliable than a system with an availability of three 9s; that is, its failure probability is ten times smaller.

Now there's a clue. Let's take the negative logarithm of the failure probability of an active/active system:

$$\text{9s} = -\log_{10}(F) = -\log_{10}\left[f(1-a)^{s+1}\right] = -\log_{10}(f) - (s+1)\log_{10}(1-a)$$

(The minus sign is needed because F is less than one, so its logarithm is negative, while the number of 9s in A = 1-F is positive.)

For instance, consider a five-node system (n = 5) with one spare (s = 1). In this case, f is equal to 10. Log(f) is log(10), which is 1. Further, assuming that the nodal availability, a, is .99, log(1-a) = log(.01), which is -2. The system availability measured in 9s is then -1 + 2 × 2, or three 9s (.999).

Notice that the log of the nodal failure probability, (1-a), is the measure of that node's availability in 9s, with the sign reversed. For instance, if the node has a failure probability of .01 = $10^{-2}$, the log of its failure probability is -2, equivalent to two 9s of availability. Noting that the log of the nodal failure probability is multiplied by (s+1), we come up with the following important rule:

Adding a spare node adds the number of 9s associated with that node to the system availability, but reduced by the increase in failure modes.

That is, adding an additional spare node adds the number of 9s of that node to the system availability, almost. This improvement in availability is reduced a bit by the increase in the number of failure modes in the system. More nodes mean more failure modes.
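The two terms of the 9s expression can be computed separately to see how much each contributes. The sketch below is an illustration in Python rather than anything from the original article; the function name `nines_breakdown` is an assumption, and f is computed with `math.comb` under the same independent-failure assumption as before.

```python
from math import comb, log10


def nines_breakdown(n, s, a):
    """Split the system's 9s into its two terms.

    Returns (penalty, gain, total), where
      penalty = -log10(f)              (9s lost to failure modes)
      gain    = -(s+1) * log10(1 - a)  (9s contributed by s+1 nodes)
      total   = penalty + gain         (system availability in 9s)
    """
    f = comb(n, s + 1)
    penalty = -log10(f)
    gain = -(s + 1) * log10(1 - a)
    return penalty, gain, penalty + gain


if __name__ == "__main__":
    # Five nodes, one spare, then six nodes, two spares, both with a = .99.
    for n, s in ((5, 1), (6, 2)):
        penalty, gain, total = nines_breakdown(n, s, 0.99)
        print(f"n={n} s={s}: {penalty:.2f} + {gain:.2f} = {total:.2f} 9s")
```

A fractional result such as 4.7 corresponds to the "almost five 9s" wording used for the six-node example.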

Our Logo
Let us now return to our logo. It represents the failure probability of an active/active system with one spare node, the most common of active/active systems. The first f represents the number of failure modes, that is, the number of ways that two nodes can fail in the system. The second f represents the probability of failure of any two nodes. Thus, the probability of failure of the system is $ff^2$ (if you will forgive the stylization).

Rules of Availability
We leave you with the following rules for the availability of an active/active application network:

1. The more nodes in an active/active network, the less reliable it is for a given sparing level. This is because of the increase in failure modes.

2. Adding a spare node to an active/active network adds the number of 9s associated with that node to the system availability, almost. The improved system availability is reduced somewhat by the increase in failure modes.

As an example, using the relations we derived above, if the availability of a node is .99, the availability of a two-node system in which only one node is required to be up (i.e., there is one spare) is .9999, or four 9s. If we add a third node to the system, maintaining still only one spare, there are three ways for two nodes to fail (f = 3), and the system's availability drops to .9997 (a little over three 9s).

If, however, that third node was an additional spare node, all three nodes must fail to take down the system (f = 1), and the system availability becomes .999999 (six 9s).

We hope that this leaves you with a feeling of the impact on active/active system availability as a function of system size and its sparing level.
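Both rules can be seen side by side by tabulating the availability relationship over a few configurations. This is a hedged Python sketch, not part of the original article; the helper names and the default nodal availability of .99 are assumptions chosen to match the examples in the text.

```python
from math import comb, log10


def availability(n, s, a=0.99):
    """A = 1 - f * (1-a)^(s+1), with f = C(n, s+1)."""
    return 1 - comb(n, s + 1) * (1 - a) ** (s + 1)


def nines(avail):
    """Express an availability as a number of 9s: -log10(1 - A)."""
    return -log10(1 - avail)


if __name__ == "__main__":
    # Rule 1: more nodes at a fixed sparing level (s = 1) lowers availability.
    for n in range(2, 7):
        A = availability(n, 1)
        print(f"n={n} s=1  A={A:.6f}  9s={nines(A):.2f}")
    # Rule 2: making the third node a second spare raises availability.
    A = availability(3, 2)
    print(f"n=3 s=2  A={A:.6f}  9s={nines(A):.2f}")
```

Each row with s = 1 has a lower availability than the one before it, while the final row, with the extra node used as a spare, is the highest of all.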
