Bayesian Network Approach To Assessing System Reliability For Improving System Design and Optimizing System Maintenance
Bayesian Network Approach To Assessing System Reliability For Improving System Design and Optimizing System Maintenance
by
Dongjin Lee
May 2018
ABSTRACT
herent in complicated reliability structures that may cause unexpected and undesired
results.
bility analysis tools such as Fault Tree and Reliability Block Diagram due to their
ner. This dissertation focuses on analyzing system reliability for the entire system
methodologies that evaluate system reliability on real-time basis and optimize main-
tenance schedules.
In early design stages, the research aims to predict system reliability based on
the current system design and to improve the design if necessary. The three main
challenges in this research are: 1) the lack of field failure data, 2) the complex reliabil-
ity structure and 3) how to effectively improve the design. To tackle the difficulties,
Bayesian network where the system is explicitly analyzed through the sensitivity
i
accompanies with high computational efforts, especially, when a complex and large
In summary, this dissertation studies and explores the use of Bayesian network
studies.
ii
ACKNOWLEDGMENTS
First of all, I must show my sincere gratitude to Dr. Rong Pan for mentoring me
with generosity, patient, and financial support. His knowledge and encouragement
have guided me into reliability engineering and made my graduate school experience
Dr. Teresa Wu, and Dr. Xiaoping Du. The key ingredient of writing this dissertation
is their support.
I would like thank to my parents and parents in law. Their constant support and
encouragement are the most valuable source for me to go through the challenging
I cannot find adequate words to thank my wife, Yeawon Yoo. I deeply appreciate
her dedication despite she is also having tough time in the same Ph.D. program.
iii
TABLE OF CONTENTS
Page
LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
CHAPTER
1 INTRODUCTION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.3 Methodology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.5 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
iv
CHAPTER Page
3.7 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
NETWORK APPROACH . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
4.1.1 Contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
4.1.2 Organization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
v
CHAPTER Page
4.7 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
REFERENCES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122
APPENDIX
vi
LIST OF TABLES
Table Page
(0,0,0,0) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
(1,0,1,0) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
(1,0,1,2) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
4.4 Failure Probability of Each Component at Time 40, 42, 44, and 46 . . . . 81
vii
Table Page
viii
LIST OF FIGURES
Figure Page
2.1 A BN Example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
ix
Figure Page
x
Figure Page
5.1 A BN Example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97
xi
Chapter 1
INTRODUCTION
occur where their behaviors are interdependent. To avoid any losses (cost and quality)
series or parallel systems). Consequentially, any popular reliability analysis tools such
as Fault Tree and Reliability Block Diagram are inadequate. In order to overcome
the challenge, this dissertation focuses on modeling system reliability structure using
Bayesian network (BN) and making correct decisions for complex systems.
bility analysis will also be specifically addressed and overcome. The broad overviews
including distinct research purposes and assumptions for each chapter are given in
Section 1.2.
1
1.2 Dissertation Overview and Organization
fore, PdM technique effectively saves cost over the most prevalent maintenance
strategy. In this chapter, several approaches are presented using discrete time
over time for evaluating system reliability in the future. The PdM schemes are
a later time in the system’s lifecyle will be costly and, oftentimes, impossible.
However, this early prediction is a difficult task because the absence of field
2
tackle this problem and limitation of discrete BN, nonparametric Bayesian net-
Chapter 3, the proposed approach provides more flexible and broader range of
(GA) is used to improve the system design at the minimum development cost.
inference on NPBN given evidence may require Monte Carlo simulation for a
significant amount of time. Thus, NPBN may not be the desired approach to
to minimize the sampling procedure from NPBN that asks large computational
3
• Chapter 6 concludes this dissertation by summarizing the previous chapters and
4
Chapter 2
2.1 Introduction
itoring and warning mechanisms in place. Scheduling maintenance at the right time
in the right place is one of the most important decisions to be made by system engi-
maintenance (PM), where maintenance actions are scheduled based on the analy-
sis of historical system repair data. This strategy cannot be effective in preventing
unexpected system failures and may carry out unnecessary maintenance because it
does not take into account of the current health state of a system. However, these
problems can be handled by deploying onboard sensors in the system. Using real-
time sensor signals, the future states of monitored components, as well as the whole
system, can be forecasted for planning maintenance on the fly. Thus, predictive main-
tenance (PdM), which is enabled by onboard sensors, can avoid unexpected failures
and unnecessary maintenance, and thus improve the overall system efficiency.
integrated framework of discrete time Markov chain (DTMC) model and Bayesian
network (BN) model. A complex system is defined as the system that consists of
multiple components and the dependency between system’s and components’ relia-
bility functions may not be completely known. DTMC is employed to model the
5
vian property of a Markov chain, which ignores the component’s past deterioration
history and thus may lead to large prediction errors, we propose three different ways
to define the state space of DTMC and to better approximate component degradation
been proved to be a powerful tool for modeling system reliability due to its ability of
BN model. The uncertainties in the relationships between system node and subsys-
defined between these nodes so that the probabilistic inference of reliability at any
The main purpose of PdM is to carry out maintenance actions whenever they are
implement PdM, the current- and future-time system reliability should be assessed
based on the data from onboard sensors. In this paper, we focus on the PdM of
systems are required to be highly reliable and such systems are, in general, designed
with multi-level hierarchical structures (Liu et al., 2011). To implement PdM, sensors
Using the information from these sensors, the system reliability can be predicted.
The remaining of the paper is organized as follows. In the next section, 2.2 the
and how they can be used for online system reliability assessment. We propose three
6
predictive maintenance models. In Section 2.4, a simulation study is performed to
Due to the rapid increase of complexity in engineering systems and the availability
of condition monitoring data, PdM has gained a lot of attention in the last decade.
of CBM with a focus on the general procedure of implementing CBM. Heng et al.
(2009) gave a broad overview of the prognosis of rotating machinery and suggested
some future research topics that have been indeed extensively studied these days.
Many researchers have studied system degradation for the purpose of diagnosis and
prognosis where the system has a single component or it has multiple components with
however, are not applicable in many real-world problems. The assumption of well-
defined reliability structure may lead to large errors on evaluating system reliability
Gebraeel et al. (2005) developed an exponential degradation path model for the
fashion by infusing sensor data. This model was further generalized to a bivariate
degradation path by Gebraeel (2006). Kaiser and Gebraeel (2009) provided the PdM
policy based on the degradation model in these previous studies. A residual lifetime
distributions (RLD) were derived from the degradation path and maintenance was
performed based on the desired reliability level. All these papers focused on the
7
single-component systems.
Wang and Coit (2004) provided a general approach to system reliability prediction
using multiple degradation measures. Their approach was for the case when a para-
metric degradation model was difficult to apply due to the limited understanding of
degradation mechanism. Li et al. (2011) provided a method for estimating a series sys-
and Parlikad (2014) presented a set of general models and the optimal maintenance
policy for multi-component system with the consideration of interactions among com-
The BN model has become a popular tool for modeling multi-component systems
in recent years. Langseth and Portinale (2007) argued the advantages of BN model
over traditional system reliability models such as reliability block diagrams and fault
trees. Liu et al. (2011) provided a method for modeling multi-component systems
information from many binary component nodes. Li et al. (2014) further developed
the method proposed in Liu et al. (2011) to the case where the nodes in a BN have
multiple states. Wilson and Huzurbazar (2007) used BNs to model multilevel systems
and discussed how to make probabilistic inference for different cases. Mahadevan et al.
with the consideration of the correlation of components. Zhai and Lin (2013) and
tem reliability. Ozgür-Unlüakın and Bilgiç (2006) used a dynamic Bayesian network
(DBN) for modeling multi-component systems and implemented PdM with general
temporal dimension in the network. Przytula and Choi (2008) proposed a DBN model
8
for the diagnosis and prognosis of system’s health state. McNaught and Zagorecki
The CBM of a complex system has been made possible with the advent of ad-
vanced sensor technology. For the system’s degradation process that is continuously
monitored, stochastic processes have been usually applied to modeling. Castro et al.
of minimizing the operation cost. Caballé et al. (2015) generalized the work of Cas-
tro et al. (2015) by allowing internal degradation processes to be dependent, and the
external shocks that lead to the system failure were modeled as a doubly stochas-
tic Poisson process. Do et al. (2015) provided the CBM policy that employed the
perfect and imperfect maintenances together. Their optimal maintenance policy was
proposed for minimizing the maintenance cost. Chen and Wu (2007) proposed PdM
using multivariate statistical methods and DTMC. The system’s state was evalu-
ated by a machine capability index, which was developed based on the multivariate
process capability methodology, and the system’s deterioration process was modeled
context of opportunistic maintenance. Xia et al. (2013) provided the PdM policy
that optimizes the availability and the cost effectiveness for multi-unit series systems.
Komijani et al. (2017) presented the optimal CBM model for minimizing the total
operation cost with three decision variables – the number of incomplete maintenance,
the maintenance time interval and the probability of failure, while the whole system’s
deterioration process was modeled by two independent stochastic processes for erosion
9
and external shock.
els for modeling multi-level hierarchical systems. Employing the proposed method,
maintenance decisions can be made based on the forecasted values of system reliabil-
The next section provides brief summary of DTMC, Semi-Markov chain (SMC) and
BN models and presents our proposed PdM framework. Three different models are
2.3 Methodology
A DTMC is a stochastic process {Xn , n > 0} in a discrete state and time space
ruled by the Markovian property. The Markovian property states that any future
event is independent of the past events of the process and it only depends on the
where Xn = xn indicates that the random variable X takes the state xn at time n.
Let pij > 0 represent the one-step transition probability from state i to state j.
The transition probabilities between all states can be fully specified based on this
10
The n-step TPM is calculated as the power function of one-step TPM; i.e., P n , where
(n)
its element pij denotes the probability of transitioning from state i to state j over n
steps.
If n is defined as the number of transition steps, instead of the discrete time, {Xn }
time of the nth transition, where Yn is the sojourn time at the state Xn−1 , if it satisfies
Holding Eq. (2.3), the process Z = {Zt }t∈N is called a semi-Markov chain (SMC)
Zt = XN (t) (2.4)
where N (t) := max {n ∈ N |Sn ≤ t}. The matrix q = qij (k); i, j ∈ E, k ∈ N is called
j∈E
l=1
11
2.3.2 Bayesian Networks
and edges indicate direct probabilistic dependencies between the two connected nodes
(Friedman and Koller, 2003); therefore, the joint probability distribution of all nodes
illustrates the probabilistic causal relationship between random variables and the
parents is called the root node, and a node without child is called the leaf node. Figure
2.1 is a simple BN example. In this example, X2 and X3 are root nodes and they are
also the parent nodes of X1 , while X1 is a leaf node and the child of X2 and X3 . The
probabilities of the child node conditioning on any combination of its parent nodes are
specified in its conditional probability table (CPT). The marginal probabilities are
given for the root nodes. Table 2.1 and Table 2.2 show these marginal probabilities
X1
X2 X3
The conditional independence of BN implies the chain rule that is useful to cal-
culate the joint probability. The joint distribution of all nodes is factorized by using
i=1
12
Table 2.1: Marginal Probability Tables for the Root Nodes
X2 = 0 X2 = 1 X3 = 0 X3 = 1
p2 1 − p2 p3 1 − p3
X 2 = 0 X3 = 0 p00 1 − p00
X 2 = 0 X3 = 1 p01 1 − p01
X2 = 1 X3 = 0 p10 1 − p10
X2 = 1 X3 = 1 p11 1 − p11
and vector operations. First, we define the CPT of a child node as matrix T , and T(i)
denotes the column vector corresponding to the child node value i. Define a vector V
For example, using the two parent node distributions as defined by Table 2.1, their
with the CPT of child node in Table 2.2 as a 4 × 2 matrix T , the marginal probability
P (X1 = i) = T(i)
t
V (2.10)
where T(i)
t
is the transpose of T(i) . So, P (X1 = 0) is given by
P (X1 = 0) = T(0)
t
V (2.11)
13
Using BNs to model system reliability, the uncertainties between system and com-
Langseth and Portinale (2007) compared BNs with block diagrams or fault trees as
the graphical means for modeling system reliability. Yontay et al. (2015) discussed
and component-level data, and Yontay and Pan (2016) extended the discussion to the
levels.
In this section, three PdM methods are developed. All methods employ DTMCs
by using its current sensor data so that we can make a decision of the time that a
1. Onboard sensors provide the true health states of the components being moni-
tored.
nance action.
3. The nodes at the lower hierarchy level of a BN model will affect the nodes at
14
Prediction with a Discrete Time Markov Chain (PdM-DTMC)
probability of the top node represents the system’s health state. For example, the
BN in Figure 2.1 represents a system, in which the system node, X1 , is the child
node of two component nodes, X2 and X3 . If all nodes are binary random variable
with functional state 0 and dysfunctional state 1, the system reliability is calculated
by Eq. (2.11) . Thus, we need the component’s future state to forecast the system
that changes over time, and we employ a DTMC to model this change process.
Suppose a system consists of N components, and the ith component has discretized
health states 0, 1, ..., fi where 0 is the healthy state, fi is the failure state, and any
states in between 0 and fi are degraded states. Assume the degradation is irreversible
without maintenance actions, then each component has a (fi + 1) × (fi + 1) one-step
TPM such as
p00 p01 . . . p0fi
0
p11 . . . p1fi
Pi = (2.12)
. .. . . ..
.. . . .
0 0 1
...
Given the current component’s health state that is provided by an onboard sensor,
the probability that this component will be residing in any specific state after n-step
transitions can be found from the elements in Pin . Any row of Pin plays a role of
marginal distribution for the corresponding component when its initial state is fixed.
In BN, a root node, which represents a component, has at least one child node, which
represents the system or a subsystem. Once we have collected all health states of
components from onboard sensors, the marginal distributions of the states of these
15
components after n steps can be calculated. Then, (2.10) can be used to predict the
multiple components, and they are configured (reliability-wise) in levels. The highest
level (Level 1) belongs to the state of the whole system, while the lowest level (Level l)
indicates the states of basic components. This structure is depicted in Figure 2.2. Let
Xi,j denote the health state of the j th node at the ith level. Every component in level l
has a TPM. Suppose the onboard sensors give a vector of current components’ health
state h = {h1 , h2 , ..., hN }, where hi is the health state of ith component, and we want
to forecast the system reliability after the n-step. A marginal distribution that will
the CPT of level l, and the same scheme is repeated until the marginal distribution
of the node representing the system on the top level is achieved. Mathematically,
P (Xi,j = k) = Ti,j,(k)
t
Vi+1 (2.13)
where Ti,j,(k)
t
is the column corresponding to state k in the t-step CPT of the j th node
at level i, and Vi+1 is the vector calculated by the Kronecker product using marginal
The Markovian property, which completely ignores the deterioration history of a com-
a Markov chain to better approximate degradation process, we can extend the state
definition in a DTMC to contain both current and previous health states of a com-
ponent. This model is called the higher order Markov chain (HOMC) model. A state
16
Level 1 X1,1
Level 2 X2,1
•••
X2,l2
• • •
• • •
• • •
Level l − 1 Xl−1,1
•••
Xl−1,i
•••
Xl−1,ll−1
Level l
for the k th -order Markov process at time t is defined as (xt−k+1 , xt−k+2 , ..., xt−1 , xt ),
Note that xj ≤ xi , for all i, j where j < i, due to the second assumption in Section 3.3.
Same as the transition probability in Section 3.1, P(xt−k+1 ,xt−k+2 ,...,xt−1 ,xt )(xt−k+2 ,xt−k+3 ,...,xt ,xt+1 )
is denoted as the one-step transition probability from the current state (xt−k+1 , xt−k+2 , ..., xt−1 , xt )
17
Prediction with a Semi-Markov Chain (PdM-SMC)
that the current transition probability must be the same as the previous transition
does not directly solve this problem. In this section, a discrete time semi Markov chain
et al. (2008), the semi-Markov chain is handled as a discrete time Markov chain so
that the method presented in the previous section can still be implemented. Because
DTSMC does consider the sojourn time of a component staying in any state, it will
change its health state at every transition time points. In other words, not only the
current health state but also the sojourn time in the current state will determine the
transition rate.
defined as
Ut = t − SN (t) (2.15)
Table 2.3 provides examples of backward recurrence time with an associated semi-
Markov chain. Here, Ut can be viewed as the sojourn time that a random variable
has stayed at its current state. Then, a stochastic process (Zt , Ut )t∈N is a Markov
18
Table 2.3: Examples of Backward Recurrence Time and Semi-markov Chain
t 0 1 2 3 4 5 6 7 8
Xt 0 0 0 1 1 1 2 2 3
SN (t) 0 0 0 3 3 3 6 6 8
Ut 0 1 2 0 1 2 0 1 0
Note that qij (·) and Hi (·) are defined in Section 3.1. Because it is a DTMC with two-
dimensional states (the current health state and the sojourn time), the prediction of
three proposed models. The proposed methodologies are generally applicable to any
The system to be discussed in this section was originally derived from a case study of
active vehicle suspension system, which had been previously discussed in Zhong et al.
(2010) and Yontay and Pan (2016). However, we will simply treat it as a generic
system with certain reliability structure. In order to perform PdM, onboard sensors
located at the right place in the system are needed. In addition, condition monitoring
data and various historical reliability data are critical for estimating TPMs and CPTs.
In this simulation study, transition matrices are estimated based on two degradation
datasets, which will be introduced in the next section. The CPTs and the reliability
19
10
1.6
8
1.4
degradation
degradation
6
1.2
4
1.0
2
0.8
0
For the illustration purpose, we use the laser dataset (Meeker and Escobar, 2014)
and the fatigue crack dataset (Lu and Meeker, 1993), shown in Table 2.4 and Table
plots of degradation versus time are given in Figure 2.3 and Figure 2.4. We discretize
each dataset into five states based on Table 2.6 where 0 is the healthiest state, 4 is
the failure state, and the states between 0 and 4 indicate gradual degradation.
The TPMs are estimated for DTMC, HOMC (a 2nd order Markov chain), and
SMC for each dataset. To compare performances of the TPMs, Akaike0 s Information
Criterion (AIC) and Bayesian Information Criterion (BIC) are used where AIC is
defined as
20
Table 2.4: Laser Dataset
21
Table 2.5: Fatigue Crack Dataset
22
Table 2.6: Discretization Criteria
Laser data Fatigue Crack data
is the number of data points. For p, the parameters that are estimated as zero will
where N is the number of states, xt is a value the random variable takes at time t,
xt−k . . . xt−1 is the consecutive values for k time interval, and nxt−k . . .xt is the number
of the corresponding sequence in dataset. The fit statistics of DTMC, HOMC, and
For both datasets, SMC is the most preferred model as it has the minimum AIC
23
and BIC values. This result demonstrates that the degradation process of a compo-
assumed that the components in subsystem 1 follow the fatigue crack degradation
process, while the components in subsystem 2 follow the laser degradation process.
The whole system has two states (0 for a functional system and 1 for a dysfunctional
system), but all subsystems and components have five states from 0 to 4, where 0
indicates the healthiest state and 4 is the failure state. Figure 2.5 gives the graphical
probability tables for the system and subsystem nodes are given in Appendix A.
System
Subsystem 1 Subsystem 2
Figure 2.6 plots the forecasted system reliabilities over the next thirty time stamps
when the current component states, as detected by sensors, are (0,0,0,0). The reli-
are colored as red, black and blue, respectively. Although the three different curves
24
exhibit similar patterns, PdM-SMC, which is represented by the blue line, shows the
most noticeable slope change. This is because SMC learned a sojourn time distri-
bution for each component from the given dataset. In other words, SMC will never
predict a shorter sojourn time than the minimum sojourn time in the dataset or a
longer lifetime than maximum sojourn time in the dataset for each healthy state. For
example, component 1 or 2, which is modeled after the fatigue crack dataset, should
not make a transition from state 0 to state 1 within two time stamps, but should
have a transition after six time stamps. Thus, the SMC model will predict a higher
continuous operation without failure within eight time stamps. On the other hand,
the DTMC and HOMC models provide gradually decreased reliability without the
PdM−DTMC
0.8
Forecasted Reliability
PdM−HOMC
PdM−SMC
0.6
0.4
0.2
0.0
0 5 10 15 20 25 30
Time
Figure 2.6: System Reliability Forecast
reduced to 0.88. Table 2.8 provides the forecasted and actual system reliabilities
25
when the current component health states are (0,0,0,0). According to PdM-DTMC,
stamps, but PdM-HOMC and PdM-SMC would schedule it sometime after time 4.
In addition, components 1 and 2, which are modeled by using the fatigue crack
dataset, are expected to be more deteriorated than components 3 and 4 at the time
of maintenance. Thus, under the current scenario, the maintenance crew should
monitored by sensors, the maintenance schedule can be updated based on the latest
sensor data. For example, if after a few time lags these components’ health states
become (1,0,1,0), maintenance schedule can be changed based on the updated system
reliability forecast. Table 2.9 presents estimated and actual system reliabilities when
the current component health states become (1,0,1,0). The maintenance action should
be taken after one time stamp for the PdM-DTMC model (or two and three time
stamps for PdM-HOMC and PdM-SMC, respectively), which is ahead of the previous
schedule. The actual system reliability passes the 0.88 threshold between three and
four time stamps. In this case, component 1 is expected to be the one that will
deteriorate most.
Assume two more scenarios in which PM policies are implemented and mainte-
nance actions are strictly conducted with a time interval of 3 and 5, respectively,
regardless of components’ states. Using Table 2.8 we find that the system reliability
at every maintenance time will be 0.9668 and 0.80425, respectively. If the first PM
cause the system is 96.68% reliable; while if the second PM policy is implemented, the
system reliability is considerably lower than the minimum system reliability require-
ment (88%). Starting from the healthy component state, the optimal maintenance
time interval is found to be 4, which is the same result as using the PdM-SMC model.
26
1.0
0.8
Forecasted Reliability
0.6
0.4
0.2
0.0
0 5 10 15 20 25 30
Time
Figure 2.7: System Reliability Forecast with Realizations
Table 2.8: System Reliability Forecast When the Current Component State Is
(0,0,0,0)
Time Stamp 0 1 2 3 4 5
However, the PM policy will not predict which component is the most deteriorated
component at the next scheduled maintenance time, nor will it be able to respond
future or a far enough future when system certainly fails. The forecast in the middle
component’s health states that can make a wide range of system reliability prediction.
In case that both components and subsystems are monitored by sensors, to esti-
27
Table 2.9: System Reliability Forecast When the Current Component State Is
(1,0,1,0)
Time Stamp 0 1 2 3 4 5
mate the system reliability, we only need to know the subsystem sensor data. How-
ever, for the maintenance purpose, the health state of any components that are not
directly monitored can still be inferred from the health states of the subsystem and
other components.
For the system being discussed above, if its subsystem 2 is monitored, for example,
but not component 4, the health state of component 4 can still be inferred by
where X indicates the health state of unmonitored component and e is the evidence
If the onboard sensors give the health state as (1,0,1,1) where the first three
numbers indicate the health of components 1, 2, and 3, and the last one indicates
the health of subsystem 2. By Eq. (2.20), it can be shown that the health state of
health states are known. Table 2.10 shows the predicted and actual system reliability
where component 4’s state is inferred from other sensor data. According to the Table
but it will be scheduled either before or after one time stamp by the PdM-HOMC or
28
Table 2.10: System Reliability Forecast When the Current Component State Is
(1,0,1,2)
Time Stamp 0 1 2 3 4 5
2.5 Conclusion
models for forecasting system reliability in oder to implement PdM. Our method
can be made ahead of catastrophic failure. Three PdM models were proposed and
the performance of their transition matrices were evaluated. According to the fit
process when the state of DTMC is extended to include the past deterioration history.
Among the three models, DTSMC, in which state transitions consider the sojourn
time of the current state, has more advantages in terms of fitting real data. In
the simulation study, PdM-SMC provides the most accurate prediction of system
tion processes, the increased number of states brings along some undesirable com-
29
putational burdens. To overcome this limitation, other approaches such that using
life time, etc.) will be investigated in our future research. In addition, the problem
considered in this paper can be extended by adding other failure modes. For example,
external shocks, which is not considered in this study, can be added as a factor that
tion models can also be formulated based on the results from this study. To minimize
the system operation and maintenance cost, the time intervals between consecutive
variables to be optimized so that we can maintain a high reliability level while reduce
30
Chapter 3
3.1 Introduction
hyper competitive market by enhancing overall system efficiency. Over the past two
ment of effective reliability analysis methodologies for evaluating and predicting reli-
is how to infuse the system reliability assessment into the system’s early design stage.
It is known that the quality of a product is decided, to a great extent, in its early
design stage (Clark and Paasch, 1996). Specifically, about 80% of a system’s lifecy-
cle cost is decided before the production stages (Buede and Miller, 2016). Besides,
DFR is a critical objective in system engineering for designing the system robust to
its operating environment (Wasson, 2015). However, the early assessment of system
reliability is a very challenging problem because field failure data, which is the ideal
data source for reliability analysis, can only be gathered after the product has been
manufactured and put in use. This is why reliability assessment was considered as
prediction.
design stage, embodiment design phase and detailed design phase (Cheng and Du,
31
2015). This paper focuses on assessing system reliability as early as at a product’s
embodiment design phase. Note that the embodiment design phase is still at the
been manufactured.
network (NPBN), introduced in Kurowicka (2005) for modeling and analyzing sys-
the probabilistic relationship between nodes, a BN model is well suited for a com-
plex system design. Moreover, continuous random variables, which are assigned to
the NPBN model can depict the entire reliability configuration of the system and the
strength of relationship between the system and its components. We can gain more
insights of the system’s working and failure mechanisms via sensitivity analysis. This
NPBN approach does not require defining conditional probability tables (CPTs) as
in discrete BNs.
sian BN requires that each random variable follows a normal distribution and each
can handle any type of continuous random variable. In addition, like a hybrid BN,
NPBN is also able to simultaneously model continuous and discrete random variables,
while the details of these had been described in (Hanea and Kurowicka, 2008). The
computational burden of discrete BNs. It is very effective for handling a large system
with hierarchical structures, where the system can be decomposed into many repet-
itive subsystems. OOBNs have been applied on analyzing some modern complex
systems in various domains (Molina et al., 2010; Cai et al., 2016; Weidl et al., 2005;
32
Weber and Jouffe, 2006). However, most of these applications assume that random
variables are discrete and the networks are parameterized by CPTs. Both OOBN and
but from different perspectives. OOBN attacks the problem from the perspective of
structured modeling, while NPBN makes the handling of continuous random variable
Note that NPBN has only started getting noticed for its system analysis abilities.
In the past decade, discrete BN has been intensively researched and promoted as
a powerful tool for modeling and analyzing system reliability with the probabilistic
reliability structure (Langseth and Portinale, 2007). However, due to the disadvan-
still limited. The methodology proposed in this paper expands the bandwidth of BN
back of NPBN is that it relies on Monte Carlo simulation for updating the joint and
marginal distributions in the model. But, it can be avoided by using the normal
between two nodes in a NPBN. The elicitation of expert opinions on the relationship
between two nodes has been studied in (Morales Nápoles, 2010). However, a better
The rest of the paper is organized as follows: The literature review of system
reliability is given in Section 3.2, while some backgrounds for this study are presented
in Section 3.3. In Section 3.4, we provide a brief introduction of NPBN, the sampling
procedure of NPBN, and the methods for eliciting dependency. A simple illustrative
33
example is given in Section 3.5 to show the process of employing NPBN on system
reliability assessment. Section 3.6 discusses a real industrial case. Finally, the paper
There are only a few studies on evaluating system reliability at early design stages so
far. Ormon et al. (2002) developed simulation and analytic based methods for pre-
dicting the system reliability at the early stage of design process. Tumer and Stone
(2003) presented a method to predict potential failures at the conceptual design stage
parenting process for a single component. Cheng and Du (2015) provided an opti-
mization model which gives narrower rage of reliability than traditional methods for
Recently, Bayesian network models for modeling system reliability structure have
fault tree were presented in Torres-Toledano and Sucar (1998); Bobbio et al. (2001);
large structure systems, power systems, military vehicles, and semiconductor manu-
facturing systems can be found in Mahadevan et al. (2001); Yu et al. (1999); Daemi
et al. (2012); Neil et al. (2001); Bouaziz et al. (2013), respectively. Weber et al.
eling. They mainly discussed the latest trend of BN model and its various uses on
studied the BN in which nodes have multiple states and node information may come
modeling was addressed in Sigurdsson et al. (2001). Yontay et al. (2015) discussed
34
the use of discrete BN for estimating system reliability at the conceptual design stage
through a rigorous functional analysis for complicated systems. The BN was param-
eterized by combining historical failure data (parent products) and expert opinion
solicitation using Bayesian inference. Yontay and Pan (2016) extended the discussion
for scenarios where failure data is simultaneously generated from system level and
component level.
methods of fusing subjective information into BN have been provided in some previous
studies too. However, the use of discrete BN is very restrictive in most cases because
overcome these hurdles, we will explore the utility of NPBN for system reliability
assessment.
For a new product, customers often demand not only proper product functions but
also a high reliability. In the conceptual design phase, such customer’s needs are
identified and the corresponding solutions are found from conceptually established
between functions and sub-functions to meet the customer’s needs. Any details about
physical components are not addressed in this phase. Multiple design concepts can
be created in this design phase, and candidates are evaluated and compared.
35
3.3.2 Functional Analysis
A systematic process that fulfills the creation of conceptual designs is called the
ship between functions and to identify potential design faults in the system. All
elements of system, which are main- and sub-functions (not physical components),
functional analysis could prevent considerable losses in later steps because if a func-
In the embodiment design stage, the abstract design concepts generated in the con-
ceptual design stage start being represented by actual components. For any physical
parts fulfilling the intended functions of the product, their general characteristics,
such as overall layout, geometry, preliminary production process, etc., are discussed.
In other words, before creating very detailed designs, the shape and material of the
product and design for manufacturability are determined in accordance with engi-
sign stage, as well as other early design stages, comes from the design uncertainty,
which can result in the unexpected relationship between components. This is pri-
marily the matter of the lack of knowledge. Besides, analytical tools can also be the
reason for this challenge if these tools cannot take advantage of the useful knowledge
36
extracted from any available information. For example, FT and RBD may not be
able to integrate all available information due to the limitation of Boolean logic rules.
node, thus the expert’s knowledge can be included into the probability distribution
assign to these nodes. Therefore, BN is a proper analytical tool for analyzing the
In this section, the NPBN method for evaluating a system’s reliability at its early
design stage is proposed. First, we give a brief introduction of NPBN and its prob-
abilistic inference procedure. Then, we discuss how to solicit expert opinions on the
parameters of NPBN model. For some recent applications of NPBN in system designs,
one may see Cooke et al. (2015) and Nannapaneni et al. (2017).
often limited by the immense sizes of its CPTs. It becomes an acute problem when we
a node (for convenience, we use ‘continuous random variable’ for ‘invertible univari-
37
ate continuous random variable’ to the rest of this paper). More specifically, these
original distribution can be converted back from the their inverse cumulative dis-
dependency between two nodes, CPTs are not used. Moreover, all (conditional) rank
correlations in a NPBN are algebraically independent. Fig. 3.1 shows a NPBN exam-
ple. Each node in Fig. 3.1 has a uniform random variable as stated before. On the arc
between X1 and X2 , r12 is the rank correlation that indicates their probabilistic de-
between X1 and X3 for a given value of X2 . Conditional rank correlations need not
ing to the value of its conditioning node. For example, in Fig 3.1, the relationship,
r13|2 , can be strong when X2 has a large value, and weak when X2 has a small value.
Given a NPBN, (conditional) rank correlations are decided such that if Xn has par-
ent nodes X1 , ..., Xj , (conditional) rank correlations are assigned by successively and
r(Xn , X1 ) = rn1
r(Xn , X2 ) = rn2|1
r(Xn , X3 ) = rn3|12
..
.
r(Xn , Xj ) = rnj|12···(j−1)
For each arc in a NPBN, not only a (conditional) rank correlation but also a
one parameter bivariate copula is assigned (for convenience, we use ‘copula’ for ‘one
parameter bivariate copula’ in the rest of this paper). The copula function defines
the joint distribution of two uniform random variables adjacent to the corresponding
38
arc. Because a (conditional) rank correlation is a property of a copula, it can be
realized from a parameterized copula. Any copula function can be used if a zero rank
ables X and Y , ρXY , with finite expectations, E(X) and E(Y ), and variances, σX
2
and σY2 , is
FXi (xi ) is the cumulative distribution function of Xi , is the joint distribution of con-
Y is defined as
where FX (x) and FY (y) are CDFs of X and Y , respectively, and ρ is the Pearson’s
correlation.
39
The conditional rank correlation of X and Y given Z has the same definition
except of replacing X and Y by X|Z and Y |Z, respectively. While ρXY indicates a
proportional change rate of two random variables (linear dependency) between two
random variables, X and Y , r(X, Y ) measures the strength and the direction of their
Definition 4 Vine: A vine on n variables is a nested set of trees where arcs of ith
A vine is called a regular vine if combined arcs in the ith tree, i = 1, 2, ..., n − 1,
have a common node. D-vine, which is one of regular vine classes, is a vine that
all trees are paths, and it is closely related to NPBN. Fig. 3.3 and 3.4 show the
this modeling technique may suffer in terms of accuracy and flexibility as the number
the vine copula approach and it is much more flexible than a multivariate copula. Al-
gorithms for sampling from particular vines (D-vine or other arbitrary regular vines)
are provided in (Aas et al., 2009; Kurowicka, 2011). The set of conditional indepen-
40
D-vine. In other words, the multivariate probabilistic distribution is sampled by the
vine copula approach, where the sampling procedure is simplified according to the
Once copulas and rank correlations are assigned to all arcs on a D-vine, the full
joint distribution has been fully specified. If all random variables on a vine are
Sampling a Vine
Probabilistic inference from NPBN is performed through Monte Carlo simulation and
Copula densities and marginal distributions of these random variables can also be
used to represent a joint density distribution. If the joint density distribution has a
f (x1 , x2 , ..., xn ) = f (x1 )f (x2 ) · · · f (xn )c12 c23 · · · c(n−1)n · · · c1n|2:(n−1) (3.3)
41
Figure 3.5: A D-vine on n Random Variables
where cij|k = cij|k (F (xi |xk ), F (xj |xk )) is the density copula of Cij (F (xi |xk ), F (xj |xk )).
Sampling a NPBN
variables X1 , X2 , ..., Xi and perform a D-vine sampling. The order of random variables
placed for the D-vine is: Xi , parent nodes of Xi , and remaining random variables
and X4 where all random variables are continuous, and the a set of causal relationships
is known as Fig. 3.6. Based on the structure of this network, the joint distribution
can be factorized as
(3, 1, 2) and (4, 1, 3, 2) for sampling X3 and X4 , respectively. These D-vines are shown
in Figs. 3.7 – 3.10 and we use the sampling algorithm for D-vine. Note that the
42
Figure 3.6: NPBN on X1 , X2 , X3 , and X4
implied by a BN structure. The sampling procedures for X3 and X4 in Fig. 3.7 and
In this section, the methodology for eliciting rank and conditional rank correlations
from experts and combining expert opinions and existing data is presented.
43
Eliciting Expert Opinions for (Conditional) Rank Correlations
As discussed before, reliability data for a new product do not exist in the embodiment
design stage. Thus, we have to rely on expert opinions for modeling system reliability.
In this paper, we discuss how to obtain (conditional) rank correlation values from
experts. Morales Nápoles (2010) thoroughly studied and presented a structured ap-
between random variables X and Y , the following question is designed for assessing
In what probability, Y ’s value that is above q th quantile leads to X’s value that is
above q th quantile?
Given the joint density distribution of X and Y , Eq. 3.5 can be written as (Morales
et al. (2008))
1 Z∞ Z∞
Pexpert = fXY (x, y)dydx (3.6)
1 − q FX−1 (q) FY−1 (q)
From Eq. 3.6, the parameter of copula can be determined and the rank correlation
can be realized.
Suppose Z’s value is observed above the q th quantile. In what probability, Y ’s value
Although the reliability data of a new product are unavailable at its early design stage,
it is possible that we may have some information regarding the correlations between
44
different components or different failure modes from the parent products because the
we can combine the information from parent products with expert opinions, we can
model parameters represent our beliefs and the observations (data) from the system
will form a likelihood function. Using below Bayes’ theorem, these two streams of
P (B|A)P (A)
P (A|B) = (3.7)
P (B)
where P (A), P (B|A), and P (A|B) are prior distribution, likelihood, and posterior
distribution, respectively.
ceedance, p, which is Pexpert in Section 3.4.2. We treat the information from parent
products and expert opinions as sources for prior distribution and likelihood, respec-
First, we define success and failure that is correspondent to X’s value conditioning
is treated as a failure if it is observed below q th quantile
X
To make expert opinions binary type, we can give the question below to many
experts:
Suppose Y ’s value is observed above the q th quantile. What is the expected value
45
If the answer is above (below) its q th quantile, it is considered as a success (failure).
Suppose a group of n experts were solicited their opinions, then the number of success
Given the historical information of the correlation of these two random variables,
we can fit a beta distribution, Beta(α, β), as the prior distribution for p. In general,
The first advantage is helpful as historical data from parent products may have
various distribution shapes. The second advantage finds the conjugated posterior
Beta(α + x, β + n − x) (3.8)
as mean and mode, or median can be used as the rank correlation value in the copula.
readers can refer to Liang and Mahadevan (2017) for another method of overcoming
in representing uncertainty of experts. Kay (2007) discussed DST from the reliability
point of view. Simon et al. (2008) modeled and performed probabilistic reasoning of
reliability without any data in its early design stage. Some assumptions are made
46
1. There are no irrelevant nodes in the Bayesian network model.
3. The reliability-wise configuration of the system has been established and expert
rated mechanical systems. This reason will be elaborated more at the end of this
dom variable (Xi ) in a NPBN model. In the product design, these random variables
component. Besides, a rank correlation indicates the influence that the performance
1
C(u1 , u2 ) = (u−θ
1 + u2 − 1)
−θ −θ
, θ>0 (3.11)
47
If a system consists of two components such as the NPBN in Fig. 3.1, the node X1
represents the system’s performance, and nodes X2 and X3 represent its components’
performances. Then, r12 and r13|2 are the probabilistic dependencies between the
system and the two components. The value of rank correlation may vary between
-1 to 1, where a positive (negative) value indicates that the two variables move to
same (opposite) direction in changes, and zero indicates they are independent. In a
coherent system, good performance of a component should not cause bad performance
of another component or the system. If this contradiction happens, the root cause
needs to be sought out and the current design should be adjusted. In this study, we
there are more than one conditioning variable, such as rij|kl . In other words, experts
are questioned about relationship where more than three elements (functional perfor-
mances of system, sub-systems and components) are associated. This is because such
scenario is more complicated and less likely to encounter in past experience. This
consideration affects the order of rank correlation elicitation. The most influential
parent node should be queried first and the least influential parent node should come
last.
In order to decide the rank correlation r12 , we ask the following question to ten
experts:
If component 2’s performance is higher than 0.5 quantile, what is the expected
Suppose seven experts’ answers are greater than 0.5 and Beta(78, 38) is estimated
for the probability of exceedance from parent systems. Considering experience of do-
main experts, we treat each expert opinion as equivalent to ten system tests, so these
expert opinions can be viewed as the data from a binomial distribution B(100, 0.7).
48
Beta(78 + 70, 38 + 100 − 78) = Beta(148, 60) is the posterior distribution, and we
can use the mean value, 148/(148 + 60) = 0.712, for estimating the parameter of cop-
ula and the rank correlation. This results in the parameter value of Clayton copula,
For the conditional rank correlation between X1 and X3 , the elicitation question
can be framed as
If components 2 and 3 show the performances higher than 0.5 quantile, what is
Suppose that six out of eight experts’ answers are corresponding to success and
Beta(83, 17) is the prior distribution. Then, the posterior distribution is Beta(83 +
60, 17 + 80 − 60) = Beta(143, 37). In order to find the conditional rank correlation,
we build the D-vine on X1 , X2 , and X3 , and estimate the parameter of copula that
shows
where Pposterior is derived from the posterior distribution. For Pposterior , we use the
mean, 143/(143+37) = 0.794, and then we get θ = 0.428 and r13|2 = 0.261. Note that
the conditional rank correlation r13|2 is relatively small compared to r12 despite the
posterior distribution has a larger mean value. This is because the answer to the first
question affects the answer to the second question. Fig. 3.2 shows the parameterized
NPBN.
will answer “what-if” questions for a given scenario. For example, if performances
the prognosis of the system can be made through simulation. This simulation result
is given in Fig. 3.11. Fig. 3.12 is the result of another scenario, X2 = 0.1 and
49
X3 = 0.3. As expected, the system shows high (low) performance when both of
its components are well functioning (not well functioning). From the distribution of
It is also possible to perform the what-if analysis for system diagnosis. Figs. 3.13
– 3.16 give two extreme cases for the illustration purpose. The system’s performance
is set as X1 = 0.9 or X1 = 0.1. Figs. 3.13 and 3.14 depict Components 2 and
3’s performance, respectively, in the high system performance scenario. One can see
that the distribution of X2 has been dramatically changed, while the distribution
more sensitive to the system (larger r12 ). However, from Figs. 3.15 and 3.16 one
can see that when the system performance is low, both components’ performance
example, let X2 = 0.5, then, for X1 = 0.1 and X1 = 0.9, Component 3’s performance
Figure 3.13: Component 2’s Perfor- Figure 3.14: Component 3’s Perfor-
mance Where x1 = .9 mance Where x1 = .9
Another interesting observation from these sensitivity analyses is that when the
values of conditioning variables are smaller, the distribution changes in other vari-
50
Figure 3.15: Component 2’s Perfor- Figure 3.16: Component 3’s Perfor-
mance Where x1 = .1 mance Where x1 = .1
Figure 3.17: Component 3’s Perfor- Figure 3.18: Component 3’s Perfor-
mance Where x1 = .1 and x2 = .5 mance Where x1 = .9 and x2 = .5
ables will be more noticeable. This is due to the lower tail dependence property of
Clayton copula, which is demonstrated in Fig. 3.19. This figure shows samples from
a bivariate Clayton copula with the rank correlation of 0.885 and one can see the
stronger correlation when the two variables are having smaller values. It is the lower
tail dependence of these variables. In contrast, Gumbel copula possesses an upper tail
dependence such that the dependence becomes stronger when variables have larger
values, which is shown in Fig. 3.20 with a Gumbel copula of 0.943 rank correlation.
Note that some copulas such as Frank copula and Gaussian copula do not have the
tail dependence property. In our example, the sensitivity analyses results can be ex-
plained that the impact of a relevant component of the system is stronger when it does
not function well than when it is well functioning. Thus, this lower tail dependence
property is a desirable property and it is the reason for using Clayton copula.
Note that, in this example, the marginal distributions of all nodes are distributed
in (0, 1) and they can be easily converted back to their original distributions. One of
the advantages of NPBN is that we can decouple the marginal distribution of each
node from the probabilistic model of the whole system and the decision making can
be made by using only the correlation structure in the system model. All simulation
codes used in this paper were written in R and they are available from the authors
51
Figure 3.19: Lower Tail Dependence Figure 3.20: Upper Tail Dependence
of Clayton Copula of Gumbel Copula
upon request.
automated production line of Li/MnO2 Cell described in Zhang et al. (2011). The
Li/MnO2 Cell production line is an assembly line for main parts (negative shell,
positive shell, lithium-chip, and MnO2 chip) of the battery and it requires highly
efficient process and reliability in order to meet an increasing demand. Zhang et al.
(2011) applied Fault Tree Analysis (FTA) to this automated production line and
In the automated production line, there are five main modules that may cause the
entire system failure. Among these five modules, we focus on the cut Lithium-chip
module. The reliability structure of the cut Lithium-chip module modeled by a fault
tree, which is given in (Zhang et al., 2011), is shown in Fig. 3.21. To demonstrate
the use of NPBN in this case, we assume that the reliability structure is known, (con-
ditional) rank correlations are constant, bivariate Clayton copula is appropriate, and
52
Figure 3.21: Fault Tree of the Lithium-Chip Manufacturing Process
all (conditional) rank correlations have been elicited from domain experts. To eval-
uate the current design and make proper decision for each node, we set the required
The NPBN structure of the system is shown in Fig. 3.22 and these nodes are
• X5 : Cylinder performance
• X7 : Lithium-belt performance
• X8 : Sensor performance
53
• X9 : Cut lithium-chip performance
The aim is to predict the system reliability and decide if the current design is accept-
able. In this case study, two scenarios of prognosis and diagnosis are presented. In
the first scenario, we assume the performances of all components at the mission time
T , and the distribution of system performance at time T will be given as the result.
54
For the first scenario, Table 3.1 provides the expected performance of each compo-
nent and the resulting fiftieth percentile value of system performance (X9 ). Because
the fiftieth percentile value, xT , is smaller than its target reliability 0.9, the current
design is unacceptable.
X1 X2 X3 X4 X6 X7 X9
measures not only the influence of an input to output but also its interactions with
other inputs based on its elementary effects. For each input, an elementary effect is
calculated as follow
f (X + ei ∆i ) − f (X)
EEi = i = 1...n (3.13)
∆i
where X is a start value which is a n-dimensional input vector, ei is a n-dimensional
vector with all zeros except for ith entry that is ± 1, and ∆ is an amount of variation.
One trajectory is generated when all EEi are calculated. Generating r trajectories,
where EEij indicates the elementary effect of ith input in j th trajectory. Larger values
of µi and σi imply a strong influence to the output and an interaction with other
55
The sensitivity analysis simultaneously considers, µ∗i and σi .
µ∗ σ σ/µ∗
We set r = 20 and the results are given in Table 3.2, which do not show nonlin-
earity or significant interactions as all σ/µ∗ are smaller than 1 (Menberg et al., 2016).
0.65 as in the current design, the system can reach its reliability target. However,
will be next candidates to be considered because they are second and third in line for
In the second scenario, both of the system’s target performance (X9 ) and the
sub-system 2’s expected performance (X8 ) are required to be 0.9, and component 3
and 4’s performances are expected to be 0.67 and 0.6 at the mission time T . The
simulation is run with the condition of X9 = 0.9, X8 = 0.9, X4 = 0.67 and X3 = 0.6.
Figs. 3.24 – 3.27. Utilizing the simulation result, we can set the minimum performance
56
requirements of these components. As in the first scenario, engineers also need to
consider financial and engineering constraints when they make component design
recommendations.
This case study demonstrates that at the early design stage of a system, even
though the system information is very limited and there is no any system testing
data, we are still able to quantify the performance of the system or its components
by using NPBN. This is because we can integrate the engineering knowledge from
system reliability structure and the prior information of rank correlations between
components and system into the NPBN framework. Through the simulation, we can
specify the minimum performance requirement of each component for achieving the
system reliability target. Therefore, system design and system reliability assessment
3.7 Conclusion
In this paper, we propose the methodology of evaluating and analyzing system reli-
ability at its early design stages. The objective is to decide if the current design is
Figure 3.24: Component 1’s Ex- Figure 3.25: Component 2’s Ex-
pected Performance pected Performance
Figure 3.26: Component 6’s Ex- Figure 3.27: Component 7’s Ex-
pected Performance pected Performance
57
acceptable in terms of reaching its reliability target and, if not, how to improve the
current design. Given that no physical products and product tests are available at
these stages, historical data from parent products and expert opinions on the product
design are the main sources that we can draw knowledge from.
the relationship between system performance and component functionality. The rank
experts’ opinions and by infusing the existing knowledge from parent products into
tion, the BN model defines a probabilistic relationship between system reliability and
its component functions. The use of NPBN overcomes the limitation of discrete BN,
which can have an unmanageable size of CPTs for a complex system when discretizing
continuous random variable. Thus, NPBN is very useful tool for quantifying system
performance at its early design stages. Through Monte Carlo simulation, probabilis-
tic inferences for system prognosis and diagnosis can be carried out, and appropriate
decisions can be made. All the simulation results presented in this paper are consis-
tent with intuitions; but, by quantifying them, the proposed methodology can assist
decision makers to evaluate design options and to improve the current design.
before the product is manufactured. This approach helps the designer to adjust
a complex system, we demonstrate the what-if analysis by using NPBN model for the
58
Chapter 4
NETWORK APPROACH
4.1 Introduction
To develop a new system, a designer will create a set of design specifications based
on the intended system functions. Reliability is always one of the cardinal concerns
for system design in a variety of applications, since it may directly affect the safety
devoted to achieving the desired system reliability at various system design stages.
For a system that consists of many subsystems and components, its reliability can
ity specifications to individual components such that the entire system can meet the
system and component and cooperates implicit and explicit constraints such as func-
tional requirements, technology, cost, etc. (Lamberson and Kapur, 1977). Therefore,
a mathematically rigorous reliability allocation process can assure the economical use
tem. These advantages have sparked the growing interest in developing competitive
reliability allocation strategies, which is even more challenging nowadays due to the
A typical engineering design process needs to go through three main design stages
59
– conceptual design, embodiment design, and detail design – to produce a system
with specific design specifications and concrete design materials. In the conceptual
relationship between main function and sub-function and between sub-function and
sub-sub-function, etc. Engineers identify potential functional deficits and safety haz-
ards of the new system and propose solutions. In the embodiment design stage, the
solutions found in the conceptual design stage are further developed by identifying
proper components and the assembly of these components. Then, engineers can gen-
erate the initial physical layout of the system under the bounds of economical and
engineering feasibilities. Note that most information generated in the conceptual and
embodiment design stages is qualitative and abstract. Only in the detail design stage,
the very specific physical attributes of components will be determined and the man-
be assessed as soon as the system design concept has been formulated, as the re-
liability requirement can guide system design changes in the early design stages to
avoid any costly late change. In practice, reliability allocation to components often
happen at the embodiment stage so that the reliability target to each component can
equal reliability allocation rule, ARINC (developed by the ARINC Research Corpo-
Equipment). Their popularity comes from the easiness of applying these methods
and, sometimes, reasonable results. However, these methods are only pragmatic un-
der several restrictive assumptions. For example, it is assumed that the system must
have a series reliability structure configuration and each component must have a
constant failure rate. Besides, seldom do these conventional methods deal with the
60
economical and engineering feasibility constraints in system designs. Actually, the
techniques. To overcome these limitations, there have been some new developments
on reliability allocation methodologies for the systems that have more complex relia-
minimizing the system development cost (or maximizing the system reliability) sub-
ject to meeting its targeted reliability (or not exceeding a given system development
budget). The optimization problem can become very challenging as it may involve
parameters that can be estimated from a large amount of field data. These assump-
tions make the system reliability estimation problem mathematically malleable and
early design stage, as considered in this paper, the understanding of system reliabil-
ity structure is often obscure due to the absence of field system performance data.
Another assumption that often appears in previous studies is to assume the discrete
state space for the system. For example, a binary state space is defined for a system
degradation process (Luque and Straub, 2016). However, this approach has to make
In summary, there are at least four challenges in the reliability allocation prob-
lem at an early system design stage: the lack of reliability data, the obscurity of
61
system reliability structure, the difficulty in handling continuous system condition,
and the intricacies in optimization. The first three items will hinder the accuracy of
system reliability estimation, while the last one adds more computational difficulties
4.1.1 Contributions
This paper presents a new approach to tackling these challenges. Our proposed
eling and analyzing system reliability structures. NPBN is a directed acyclic graph
(DAG) where any univariate invertible continuous random variables can be used.
Nodes in a NPBN will be associated with the operating conditions of the system and
its components in terms of their failure probabilities. Fig. ?? illustrates the change of
failure probability of a system over time due to its degradation. Note that the black
exceeds the red line (the threshold value), a failure occurs. Based on the NPBN
model evaluation, Genetic Algorithm (GA) will be used for solving an optimization
explore the dynamic behavior of a system at early design stages and can be used as
In this paper, we propose to integrate the NPBN model with component degrada-
tion to model a continuous-time system. The full description of NPBN can be found
in (Kurowicka, 2005). Like Bayesian network (BN) and dynamic Bayesian network
of conditional independence; however, unlike BN and DBN, it can handle any contin-
62
Figure 4.1: Failure Probability of Component over Time
uous distributions as its nodes. It uses (conditional) rank correlations to specify the
association between nodes, instead of CPTs. In this research, the parametric lifetime
The joint distribution of the network is defined by a set of bivariate copula functions.
Following the system reliability analysis, GA will be applied to generate new designs
is to minimize the total system development cost, while the reliability target is served
as a constraint. Through this paper, we assume that the overall layout of a new
system has been determined and the functional requirements of each component need
design stage. Fig. 4.2 illustrates our methodology when Di indicates a degradation
4.1.2 Organization
reliability and reliability allocation is given in Section 4.2. Then, Section 4.3 describes
how NPBN may be used for system reliability modeling. The methods for parameter
63
𝐷1 𝑡, 𝜃1 , 𝑋1
Historical 𝐷2 𝑡, 𝜃,2 𝑋2
degradation
…
datasets
𝐷𝑛 𝑡, 𝜃𝑛 , 𝑋𝑛 Domain experts
(Design) Construct
a reliability structure with
Bayesian network
(Reliability allocation)
NO Is the current
Generate alternative
design
design options and select
acceptable?
the best design
YES
Proceed to the
detail design stage
estimation in NPBN are presented in Section 4.4. The proposed methodology, along
with examples and a case study, is explained in Section 4.5 and Section 4.6. Finally,
For analyzing a complex multi-component system, Bayesian network (BN) and dy-
namic Bayesian network (DBN) have been proposed for modeling uncertainties in
system reliability structures. Note that traditional reliability analysis tools, such as
fault tree (FT) or reliability block diagram (RBD), can be generalized to BN models,
which greatly empower system modeling and analysis (Su et al., 2016). The detailed
comparison of these traditional tools with BNs can be found in Langseth and Portinale
(2007).
same BN at sequential discrete time points and successive BNs being connected by
64
arcs, called ‘inter-time-slice edges’. While a single BN is a multivariate distribution
For an effective representation of the random process, the Markov assumption can be
called a stationary process where the transition and observation models are invariant
in time t. The most compact DBN model is the first order DBN, and it is also
DBN, more versatile reliability analysis can be conducted such as system prognosis
Although DBN may seem to be an ideal tool for modeling the stochastic behav-
ior of a system, its use is actually quite limited in real-world problems. One of the
BN, nodes (random variables) on the network are discrete and their joint distribution
is defined by CPTs. As the nodes in the network and their number of states grow,
the size of CPTs will explode. Any DBN has to deal with this problem when dis-
cretizing continuous nodes. Eventually, there will be too many parameters in CPTs
continuous nodes. However, this approach requires that each random variable follows
systems at conceptual design stage and Yontay and Pan (2016) used BN to evaluate
Cai et al. (2016) presented Object-Oriented BN (OOBN) approach for real time fault
diganosis. Weber and Jouffe (2006) presented Dynamic OOBN (DOOBN) in modeling
65
complex systems. Lee and Pan (2018) presented a NPBN approach to analyzing
system reliability at system’s early design stages. However, BN has been of little
help when dealing with a system’s evolution over time such as degradation, corrosion,
fatigue, etc. In order to model these stochastic factors, dynamic FT (DFT), dynamic
RBD (DRBD) and dynamic BN (DBN) had been developed and applied on dynamic
systems. DFT and DRBD were compared in Distefano and Puliafito (2007). The
relationship between DRBN and DBN was discussed in Cai et al. (2017).
for time intervals that a specific event can occur. This is called the “event-based”
approach. Boudali and Dugan (2005) and Boudali and Dugan (2006) provided the
framework of “event-based” approach for modeling system where nodes take discrete
and continuous random variables, respectively. On the other hand, DBN subsequently
constructs the same BN to build a time dimension (Boudali and Dugan, 2005; We-
ber et al., 2012). This is a the “time-sliced” approach. Weber and Jouffe (2003)
compared Markov Chain (MC) and DBN in modeling dynamic systems and shows
the advantages of DBN over MC. Salem et al. (2006) provided a formalism and in-
ference techniques in modeling and analyzing system reliability using DBN. Luque
and Straub (2016) used DBN to model a deterioration process of a structural sys-
tem while the parameters in BN are updated based on inspection and monitoring
results. Su et al. (2016) modeled a multi-state system with DBN using structural
analysis and design technique (SADT) and failure mode and effect analysis (FMEA).
Cai et al. (2017) presented a methodology for fault diagnosis through fault symp-
are modeled by Markov models. Montani et al. (2008) developed a software tool for
conversion between DFT and DBN in modeling and analyzing the system reliability.
Cai et al. (2015) proposed a two phase approach for real time system diagnosis and
66
reliability evaluation using BN and DBN based on sensor data and observed informa-
tion. McNaught and Zagorecki (2009) and Ozgür-Unlüakın and Bilgiç (2006) applied
DBN to the optimal maintenance policy of multi-component systems. Lee and Pan
(2017) used discrete time Markov chain (DTMC) and BN for predictive maintenance
and Gámez (2017) developed a decision support system (DSS) to detect failures in
advance for predictive maintenance using DBN. In relation to literature above, copula
system. Wang and Pham (2012) developed the competing risk models considering a
Wilson (2017) studied the influence of dependence structure to system reliability and
lem and redundancy allocation problem. In general, each component has a non-linear
cost function for reliability improvement. The objective is to minimize the develop-
ment cost while the target reliability is treated as a constraint. Kuo and Wan (2007)
provided the comprehensive overview of these two problems for various types of sys-
lem. Yalaoui et al. (2005) studied series-parallel systems under the specific convex
condition for cost functions. Kim et al. (2013) proposed a methodology of weighted
ity and severity. Yang et al. (1999) applied GA for optimal reliability allocation of
pressurized water reactor. Guo et al. (2014) presented a reliability allocation method
67
with confidence levels for various reliability configurations. Elegbede et al. (2003)
parallel-series structure. Garg and Sharma (2013) presented a fuzzy optimization for-
mulation for overcoming ambiguity and vagueness of system complexity and human
judgement and a particle swarm optimization technique for solving the optimization
problem. Zhao et al. (2013) studied redundancy allocation problem for two compo-
nent series systems where all components and redundancies have exponential lifetime
Various approaches have been proposed for system reliability modeling and relia-
bility allocation problems in complex industrial systems. However, the four challenges
mentioned in Section 4.1 are still largely unsolved, which motivates us to develop a
more flexible and practical method. In this paper, we propose to integrate NPBN
and degradation path models to study a system’s behavior over time, and thus to
The Bayesian network model is a directed acyclic graph (DAG) representing a mul-
tivariate distribution, in which nodes and arcs indicate random variables and prob-
networks (NPBN) are suitable for continuous multivariate distributions. Any con-
tinuous random variable can be defined as a node in a NPBN, as long as its dis-
68
uniform distributions at its nodes and this is called a multivariate copula.
The simplest copula is the bivariate copula. For any two uniform random vari-
ables, Ui and Uj , connected by an arc which indicating the rank correlation, rij , or
the strength of directed probabilistic dependency between them. The rank corre-
bivariate copula function, C(FXi (xi ), FXj (xj )), defines the joint distribution of these
arc in a NPBN, we can build the multivariate distribution for a given network struc-
ture. To make probabilistic inferences from a series of bivariate copulas, Monte Carlo
Bivariate copula function is the most fundamental unit of NPBN that possesses
two uniformly distributed nodes. As the first property, the rank correlation between
where U and V , the two uniformly distributed random variables, are transformed
is the copula function defined as C(u, v) = FX,Y (FX−1 (u), FY−1 (v)).
Another property of bivariate copula is called the zero independence property; that
is, a copula becomes an independent copula (C(u, v) = uv) when its rank correlation
is zero.
trivate copula function in this example) can be a general approach as they are capable
69
of measuring dependencies between random variables for non-elliptical distributions.
it allows only one parameter to capture dependencies. In fact, most previous research
had focused on bivariate copula families and many publications have shown successful
ables. Given the network structure, NPBN utilizes a series of bivariate copulas to
model the entire joint distribution. For a node having more than one parent nodes,
we specify conditional rank correlations for some arcs so that a value of the node is
defined based on values of all its parent nodes. More specifically, if a node, Xn , has j
number of parent nodes, the probabilistic relationship within each copula is given by
Fig. 4.3 shows one example of the multivariate distribution of three nodes, X1 ,
well as their strengths, r12 and r13|2 , from the graph. Note that for modeling the
joint distribution of this NPBN, the bivariate copula function between two nodes can
be independently chosen and the multivariate distribution of the three nodes in this
𝑋1
𝑟12 𝑟13|2
𝑋2 𝑋3
70
example is then decomposed to two bivariate copulas defined with rank correlation
introduce another probabilistic graphical model called vine. A vine provides a specific
dependence structure that consists of a set of nested trees, where arcs on ith tree is
defined as nodes on i + 1th tree. If the arcs sharing a common node in the ith tree are
always linked by an arc in the i + 1th tree, it is called a regular vine (Morales et al.,
probabilistic computing.
There are several types of regular vine such as D-vine, C-vine and R-vine, but
only D-vine is adapted in NPBN where all trees in D-vine are paths. Two examples
of D-vines are shown in Fig. 4.4 and 4.5. Obviously, Fig. 4.4 is a path. In Fig. 4.5,
the 1st tree is a path consisting of random variables X1 , X2 and X3 , and the 2nd tree
𝑟12 𝑟23
𝑟12 1 2 3
1 2
𝑟13|2
Figure 4.4: D-vine on Two Random Figure 4.5: D-vine on Three Random
Variables Variables
For each arc, adjacent nodes are probabilistically related. A vine uses rank corre-
conditional rank correlations, r12 , r23 and r13|2 , are shown in the D-vine in Fig. 4.5.
71
To map Fig. 4.5 to Fig. 4.3, as Nodes X2 and X3 are independent, r23 should be set
to zero.
NPBN, where parent nodes are always sampled ahead of child nodes. The sam-
pling order of parent nodes can be derived from the factorization order of the joint
distribution of BN. For example, the NPBN in Fig. 4.3 can be factorized as:
f (X1 , X2 , X3 ) = f (X3 )f (X2 |X3 )f (X1 |X2 , X3 ) = f (X3 )f (X2 )f (X1 |X2 , X3 ) (4.2)
Then, the sampling order can be decided as X3 → X2 → X1 . With the given sampling
vine.
D-vine is built on random variables, Xi , and its parent nodes. Then, these parent
nodes are sampled first (many sampling orders may exist among parent nodes) and a
copula, where a sample of random variable can be taken from a bivariate copula as
follows:
xi = Fr−1
ij ;xj
(u) (4.3)
where u is a sample from a standard uniform distribution, U (0, 1), and xj is the
72
Generally, random variables in a vine are successively sampled conditioning on
taken from a uniform distribution, U (0, 1), and the remaining random variables are
sampled as follow:
xn = Fr−1
n(n−1) ;xn−1
(un |x1 , ..., xn−1 ) (4.4)
where un , n = 2, ..., n, is a sample from the uniform distribution, U (0, 1). The detailed
In this section, we introduce two methods for specifying conditional rank correlations
that are needed for defining a NPBN. In general, this statistic can be easily calculated
if data are available. However, as our study focuses on the embodiment design stage
where field failure data are absent, we resort to the opinions from domain experts
and the data from parent products (historical data) for correlation estimation.
The first method will be used when expert opinion is the only obtainable informa-
tion source. The structured inquiry method developed in Morales et al. (2008) can
be utilized to learn the rank correlation or conditional rank correlation between two
nodes. If historical data are also available, Bayesian inference technique can be used
noted as PEP in Equation (4.5). Then, we can determine the parameter in copula
73
function, as well as the rank correlation, by using Equation (4.1).
Suppose that the random variable Y has a value above q th quantile, what is the
probability that the random variable X also has a value above q th quantile?
Given an answer, we can calculate a rank correlation using Equations 4.5 and 4.1.
In a BN structure, the random variables X and Y are a child node and parent node,
Suppose that the random variable Y and Z have values above q th quantile, what is
the probability that the random variable X also has a value above q th quantile?
If a new system shares some part of its system reliability structure with a parent
system, the historical data should be very useful information. Besides, expert opinion
is made.
74
tional rank correlation, we treat the distributions derived from historical data and
expert opinions as the prior distribution and the likelihood function for a conditional
to statistical data by encoding the opinion into binomial type of data. An expert is
questioned the query given in Section 4.4.1 for a probability dependency between two
Suppose we have compiled n answers where Ai is an answer from the ith domain
Pn
Ai
expert. Then, the fitted Binomial distribution is Bin(n, i=1
n
).
compulsory but a Beta distribution is preferred because not only it is very pliable for
various shapes of data but also it is a conjugate prior with binomial type of data.
as
n n
Beta(α + Ai , β + n − Ai ). (4.6)
X X
i=1 i=1
Given the posterior distribution, any appropriate statistic, e.g., mean, mode, me-
rank correlation. For a conditional rank correlation, the query is accordingly modi-
fied as Section 4.4.1 shows. Note that this approach can commonly be applicable for
The parameter estimation protocol in Section 4.4 allows us to model a system reli-
ability structure without field failure data. In this section, we enlarge the model by
75
ral dimension. We assume that historical degradation data exists for some subsystems
and components.
and the component fails when it passes a certain threshold value. We assume this
parent node represents the failure probability of a component, so the failure proba-
bility at a given time will be computed from its degradation path. For a coherent
system, when its components are highly probable to fail the system itself will be more
likely to fail too. Thus, a copula function with the property of upper tail dependence
is appropriate.
As only bivariate copulas are needed for quantifying a NPBN, the Gumbel copula
Fig. 4.6 depicts an example of a joint behavior of two uniform random variables
correlation is 0.96. Due to the large rank correlation, the plot obviously shows a
positive strong relationship between the uniform random variables. One can see that
as the random variables have larger values, their correlation becomes even stronger.
This upper tail dependence property is desirable for modeling the cascading effect
that results in a high system failure probability once any of its component is severely
damaged. It suggests that system deterioration is accelerating when any of its com-
76
1.0
0.8
0.6
v
0.4
0.2
0.0
In summary, the following assumptions have been made for system reliability:
(coherent system).
• There exist domain experts to provide their opinion, and historical degradation
component/system; thus, its numerical value is ranged from zero to one. Suppose
where η(t, θ) and X are a mean degradation at time t and a random variation, re-
77
the component at time t can be derived to be (Bae et al., 2007)
In the embodiment design stage, we can have the following two scenarios:
For the first scenario, the questions given in section 4.4.1 are modified to quantify
Question 1: Suppose that the component Y has a degradation state above the q th
failure quantile. What is the probability that the component X also has a degradation
Equation (4.5) and its rank correlation can be computed using Equation (4.1). Simi-
larly, if there exists a conditioning degradation state, the following query is provided:
Question 2: Suppose that the components Z and Y have degradation states above
the q th failure quantile. What is the probability that the degradation state X also has
78
time 0 1 2 ... n-1 n
A bivariate Gumbel copula is parameterized via sampling a D-vine, and the cor-
For the second scenario, we utilize the Bayesian inference technique introduced in
Section 4.4.2. The same queries with those of the first scenario are asked to a group of
experts. However, each answer is encoded into success (failure) if it is higher (lower)
than q value. A likelihood function, Bin(n, p), can be fitted where n is the number
components as shown in Table 4.1, in which Ck and djk denote a component k and a
empirical distributions, for each component can be found, F̂k (ik ). A conditional
Y , is computed as
79
time 1 2 ... n-1 n
We choose a Beta distribution, Beta(α, β), for fitting the conditional probability
Eq. 4.6. Using any proper statistics of the posterior distribution, a bivariate Gumbel
nents where its reliability structure is the NPBN in Fig. 4.3. Note that any historical
reliability data regrading the new system itself dose not exist. For r12 and r13|2 we ask
the previous questions with q = 0.5. Suppose the answer from the group of experts is
0.7524. Then, a Gumbel copula has a parameter, θ = 2.016, and a rank correlation,
0.687. Similarly, for the conditional rank correlation r13|2 , the group of experts give
𝑋1
0.687 0.335
𝑋2 𝑋3
Suppose we are interested in the system reliability at the time points 40, 42, 44,
and 46, and the components’ degradation models are given in Table 4.3. For each time
point, Monte Carlo simulation of NPBN is performed with the failure probabilities of
the components in Table 4.4 as evidence. Each sample of the simulation indicates a
probability of system failure at the corresponding time point. The simulation results
80
are given in Figs. 4.8 – 4.11. We can observe that the system’s failure probability
increases over time and it sharply rises between the time points 44 and 46. To quantify
the system reliability, we define that the system reliability as 1 − q50 where q50 is a
median value of the sampling results. The system reliability at the time points 40,
42, 44, and 46 are 1 − 0.076 = 0.924, 1 − 0.129 = 0.871, 1 − 0.269 = 0.731 and
1 − 0.969 = 0.031. This example shows that we quantify system reliability at the
embodiment design stage for any time points, thus to predict the system behavior
Degradation Lifetime
Comp X ∼ Wei(α, β) Dif
Model Distribution
800
1500
600
Frequency
Frequency
1000
400
500
200
0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
81
3000
400
2500
300
2000
Frequency
Frequency
1500
200
1000
100
500
0
0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
subsystem of a mixing tank. The mixing tank mixes and heats flammable liquid dur-
ing general chemical processes. Khakzad et al. (2013) discussed the safety analysis
of mixing tank by applying discrete BN. The reliability of this type of tank is ex-
tremely important because the vapor cloud can explode if the tank temperature was
system is given in Fig. 4.12. Definitions of the nodes in Fig. 4.12 are provided below:
are subsystems and X2 consists of components X4 and X5 . For each component, its
degradation model, the distribution of random variation (W eibull(α, β)), the implied
lifetime distribution, and the pre-fixed threshold (Dif ) are given in Table 4.5.
Suppose there are 10 domain experts and the quantile value for the query is set
82
𝑋1
𝑋1
𝑟12 𝑟13|2
0.687 0.335
𝑋2 𝑋3
𝑋2 𝑋3
𝑟24 𝑟25|4
0.544 0.226
𝑋4 𝑋5
𝑋4 𝑋5
Degradation
Comp X ∼ W ei(α, β) Lifetime Distribution Dif
Model
0.4 exp(0.1t) +
C3 W ei(1, 1.54) exp(( 40−0.41.54 ))
exp(0.1t) 1
40
X3
0.266 exp(0.11t)+
C4 W ei(1.03, 1.59) exp(( 42−0.2661.6 ) )
exp(0.11t) 1.03
42
X4
0.667 exp(0.09t)+
C5 W ei(1, 1.42) exp(( 42−0.6671.42 ))
exp(0.09t) 1
42
X5
Table 4.5: Degradation Models and Lifetime Distributions for Components
as 0.5. First, we estimate the rank correlation (r24 ) and conditional rank correlation
(r25|4 ) between the subsystem X2 and its components X4 and X5 , respectively. Using
the historical data of X2 and X4 , we have fitted a Beta distribution for the conditional
probability of exceedance as Beta(65, 31). Seven experts gave answers higher than 0.5
for Question 1 with nodes X2 and X4 . To give more weight to expert opinion, we scale
up one expert’s answer as ten reliability tests. Then, the fitted binomial distribution
becomes Bin(100, 0.7), and the posterior distribution is Beta(65 + 70, 31 + 100 −
70) = Beta(135, 61). Using the mean value (0.69) of this posterior distribution, the
83
parameter of a bivariate Gumbel copula is calculated as 1.63 and the corresponding
For a conditional rank correlation, r25|4 , for example, suppose that we have a Beta
prior distribution, Beta(71, 31), and eight experts have answers higher than 0.5 for
Question 2. Then, the fitted Binomial distribution is Bin(100, 0.8), and the posterior
distribution becomes Beta(71 + 80, 31 + 100 − 80) = Beta(151, 51). Based on the
and the parameter of bivariate copula with r25|4 can numerically be computed by
The estimation processes of r12 and r13|2 are the same as before, but without any
historical data. Based on expert opinions only, they are estimated to be r24 = 0.608
Suppose we are interested in the system reliability at the time points 45, 45.7, and
46. Using the following sampling algorithm, Monte Carlo simulations are run where
x3 = u3
x4 = u4
x5 = u5
x2 = Fr−1
24 ;x4
Fr−1
25|4 ;Fr (x5 ) (u2 ) = Fr24 ;x4 Fr25|4 ;x5 (u2 )
−1 −1
45 ;x4
x1 = Fr−1
12 ;x2
Fr−1
13|2 ;Fr (x3 ) (u1 ) = Fr12 ;x2 Fr13|2 ;x3 (u1 )
−1 −1
23 ;x2
The simulation results are provided in Fig. 4.14. We predict that the system is
quite reliable at time 45 but very likely to fail at time 46. The state of system at time
45.7 shows the transition of the system from reliable to unreliable. Using the median
values of these samples, we quantify the system reliability at these time points are
84
time 45 45.7 46
300
2000
1000
250
800
1500
200
Frequency
Frequency
Frequency
600
150
1000
400
100
500
200
50
0
0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
Figure 4.14: Monte Carlo Simulation Results at Time 45, 45.7 and 46
Suppose that the mission time of the system is set at the time point 45, but
we would like the system to attain a higher reliability, 0.88. Our goal is to find
That is, we will decide which set of components to be upgraded and the extents of
their improvements. There are many factors to be considered, including the non-
deterministic reliability structure, various cost functions, available budget, and the
reliability allocation problem such as the total development cost will be minimized,
subject to the target reliability constraint. Cost functions of all components are
where Ci (Ri ) is the cost for the improvement effect of the ith component, Rcurrent is
85
is current reliability, and Rmax is the maximally achievable reliability with current
technology.
min Ci (Ri )
X
Rcurrent 6 Ri 6 Rmax ∀i
where f (R1 , R2 , ..., Rn ) is the system reliability function and RSys is the targeted
reliability.
The results in Table 4.8 are taken from several iterations of GA, while the associ-
C1 C2 C3
The results in Table 4.8 show that although different design solutions may provide
similar reliability, their costs can be dramatically different. For example, we can
achieve the reliability 0.8806 with a cost of 4.4544 using the first design candidate,
but if we choose the design of the last row, the system reliability is improved a little
to 0.8897, while the cost increases to a prohibited large number, 35102.2651. These
solutions are plotted in Fig. 4.15 to find a Pareto frontier with the preference to be
high reliability and low cost. A line connecting the solutions marked as a circle is
the Pareto frontier, where above (below) this line is the feasible region (the infeasible
region). This plot clearly illustrates the trade-off between reliability and cost. The
final decision may vary, depending on the user’s preference; however, the fifth solution,
which gives a reliability value of 0.9082, seems to be the best compromise solution.
86
Num C1 C2 C3 RelSys Cost
system can be quantitatively evaluated at the embodiment design stage, so the design
can be improved accordingly. The combination of NPBN and degradation models has
87
400
cost
200
● ●
0 ● ● ●
overcome the limitation of DBN and GA has been successful in generating alternative
designs.
4.7 Conclusion
In a system’s early design stage, the reliability structure of the system is often un-
reliability with such reliability structure uncertainty. This methodology can be best
system design can be quantitatively assessed over its intended lifespan, better design
options are generated if the current design fails to reach its reliability target.
certainties existed between system and components and integrated the component
veloped for dealing with continuous random nodes in the network. This approach
88
overcomes many limitations of discrete BNs and it can incorporate the degradation
behaviors of components into the system reliability analysis. Therefore, our research
has established a system design framework in which the uncertainties in system reli-
ability structure and the temporal change of system reliability can be quantitatively
assessed. It is also a novel decision-making tool for generating and comparing alter-
speed, particularly for a big, complex system, because both Monte Carlo sampling
tion algorithms.
89
Chapter 5
5.1 Introduction
ues of design variables and estimate the corresponding system reliability. This is a
continuous process to economically modify and improve a system design until a sys-
problem to find the optimal system design compromising between the minimum de-
Min: C(θ)
θ
(5.1)
Subject to: P rob[Gi (θ) ≥ 0] ≥ Ri , i = 1, . . . , m.
safety region and vice versa. The performance function should satisfy a desired prob-
the last two decades, there have been extensive studies on how to efficiently quantify
this uncertainty. Zhuang and Pan (2012a) provides various sources of uncertainty
that can arise in RBDO and points out that epistemic uncertainty may lead to either
90
There exist three main different approaches to solve the probabilistic constraint in
an RBDO problem: the double loop method, decoupled loop method, and single loop
method. The double loop method is the most conventional approach. All probabilis-
tic constraints are repeatedly executed to account for the associated uncertainties.
This process is fulfilled with a different set of design variables at every optimization
iteration. Thus, the double loop method generally requires massive amount compu-
demand, Du and Chen (2004) developed an efficient decoupled loop approach called
reliability assessment process and optimization problem so that these two indepen-
dent processes are consequentially carried out. Many previous studies have employed
the SORA method to improve computational efficiency (Zhuang and Pan, 2012a,b;
tion method on the concept of SORA. The proposed method provides close solutions
to the SORA method with a less number of performance function evaluations. Ba-
Abbad et al. (2006) presented a new RBDO approach for series systems by modifying
the SORA method. The single loop method is a technique to convert an RBDO
single loop method for the problem defined in (Ba-Abbad et al., 2006). Shan and
Wang (2008) proposed a single loop approach for RBDO problems using the gradient
information of all constraints. This approach eliminates the reliability analysis loop
RBDO becomes even more challenging when performance functions cannot be an-
91
with significant amount experiment time. To overcome the difficulty on computer
Zhuang and Pan (2012b) used a Kriging surrogate model and developed sequential
formance of the surrogate model. The sampling strategy particularly aims to precisely
find the most probable point (MPP) instead of approximating the entire surface of
the unknown constraint function. Dubourg et al. (2011) also used a kriging surrogate
solved in a double loop manner. Liang and Mahadevan (2017) used nonparametric
Bayesian network (NPBN) as a surrogate model in the context of RBDO for multi-
rates uncertainties and dependencies between objective functions. Cooke et al. (2015)
also utilized NPBN to optimize an engineering design. Choi et al. (2010) provided a
new RBDO method using copula functions for accurate estimations of marginal and
joint input distributions when we have an insufficient amount of input data. Offset-
ting the limited data available, this approach provides RBDO outputs with confidence
Beta-Binomial conjugate.
This paper proposes a new surrogate modeling approach for RBDO problems
probabilistic model of the system reliability structure and to surrogate this NPBN
model, respectively. The proposed approach aims to reduce an amount of time that
92
NPBN is first proposed in (Kurowicka, 2005). In contrast to discrete BN, NPBN
variables to uniform random variables. Therefore, NPBN can provide a more natural
way to model continuous processes such as performance, decay, and degradation than
discrete BN. In fact, many other reliability assessment tools including the Fault Tree
(FT), Dynamic FT, Reliability Block Diagram (RBD), Dynamic RBD and BN dis-
cretize continuous system’s condition states and focus on binary or multiple outcomes.
However, this discretization approach is always accompanied with the trade-off be-
ences can be made via Monte Carlo simulation. The inference produces a sampling
distribution on [0, 1] for each random variable of interest. Lee and Pan (2018) pro-
its reliability at design stages. The system reliability is quantified based on its pre-
dicted functional performance (FP) at the mission time given the expected FPs of
components. More specifically, the system’s predicted FP at the mission time is rep-
a value of sample in the k th quantile. This method obviously shows the advantages
of NPBN approach over the other reliability assessment tools in analyzing system
reliability with continuous system conditions. However, the NPBN approach is not
desirable for solving RBDO problems because of its shortcoming that probabilistic
Based on the approach for system reliability assessment presented in (Lee and
Pan, 2018), we will propose to build a quantile regression (QR) model that emulates
93
a NPBN model to overcome its computational disadvantage. Since our main interest
model should not be capable of establishing the complex relationship between system
reliability and its components’ FPs, which was originally modeled by a NPBN. In
order to handle this difficulty, a QR model will be locally fitted to a small region of
the input-space of a system’s performance function in this research with the following
two assumptions:
Based on the first assumption, we can currently achieve the highest system re-
liability at the mission time when each component shows its best FP. In addition,
we can guarantee that the current system design can be improved if any of its com-
to enhance the system design. Finally, the proposed methodology will provide the
optimal expected FPs of components in order for the system to guarantee the desired
The organization of the paper is as follow: Section 5.2 will briefly introduce NPBN
and its system reliability approach and Section 5.2 reviews QR model and its Bayesian
approach. In Section 5.4, we will propose QR surrogate model for NPBN model. It
94
is followed by a case study to demonstrate the proposed methodology in Section 5.5.
This section gives a brief introduction of NPBN and its application to system
BN’s flexible capability to model complex relationship between random variables, its
application domain is highly diverse including the system reliability analysis. Much
existing literature has utilized the classical BN in analyzing complex system reli-
ability over more the last decade (Weber et al., 2012). However, it is technically
problematic to use the classical BN if there exist continuous random variables hav-
problem, it can cost very expensive computational efforts. To overcome this short-
incorporating copulas on the same concept of BNs. Using copulas, NPBN explicitly
given evidence. One may refer to (Hanea et al., 2015) for the broad overview of NPBN
95
applications.
multivariate distribution where each marginal follows a uniform distribution on [0, 1].
the associated random variables. The strength also can be represented by a rank
dimension is a difficult task even with copulas because of the complex interactions
In NPBN, only bivariate copula family is used. A copula and its rank correlation
the adjacent uniform random variables. Thus, each copula locally models a bivari-
ate joint distribution on an arc, and all bivariate copulas on a BN are integrated to
multivariate copula where the network topology reveals a set of conditional indepen-
dence. As mentioned in the previous paragraph, each uniform random variable can
96
Figure 5.1: A BN Example
tween two random variables as zero rank correlation can be used where this is called
the ‘zero independence property’. Fig. 5.1 shows a small NPBN example representing
the particular dependability structure between the three random variables (X1 , X2 ,
and X3 ). That is, X2 and X3 are independent and they become dependent once a
copulas to two pairs of correlated random variables ((X1 , X2 ) and (X1 , X3 )). The rank
correlations, r12 and r13|2 , are the strengths of probabilistic relationships between the
random variables.
tional bivariate copulas (Monte Carlo simulation). Given evidence, an inference result
97
Figure 5.3: A D-vine Example
5.1. If the simulation result is the distribution given in Fig. 5.2, we can interpret
that X1 will mostly take values around 0.38 or 0.39 and have values closer 0 and 1
paths where each node and arc represent a random variable and a direct relationship
between the adjacent nodes, respectively. Given the first path that is composed of
random variables and arcs, the second path is constructed by defining the arcs of the
first path as its nodes. The consecutive paths are built in the same scheme. A D-vine
example is given in Fig. 5.3. This D-vine is consisting of two paths. The first path has
nodes 1, 2, and 3 and the second path has nodes a12 and a23 (aij is an arc connecting
represented by rank correlation. Note that the exactly same relationship between
random variables on BN in Fig. 5.1 is also shown in this D-vine example. Moreover,
the D-vine specifies the relationship between X2 and X3 , which is independent based
on the structure of BN. Given the D-vine, one can take samples of each random
98
variable with the following algorithm:
x3 = u3
x2 = Fr−1
23 ;x3
(u2 ) (5.3)
x1 = Fr−1
12 ;x2
(Fr−1
13|2 ;Fr
(u1 )) = Fr−1
12 ;x2
(Fr−1
13|2 ;x3
(u1 ))
23 ;x2 (x3 )
where Frij ;xj (xi ) is a CDF of conditional copula with a rank correlation, rij , and
conditioning value, xj , and ui is a sample from a uniform distribution, U (0, 1). The
Lee and Pan (2018) used NPBN for evaluating system reliability at design stages
where a system has a hierarchical reliability structure such as the BN in Fig. 5.1.
composed of directly related system’s elements (nodes) in a level i+1. In this manner,
a node at the level 1 always indicates the system itself and nodes at the lowest level
represent the most basic components. Nodes in the intermediate levels correspond to
wise configuration.
stated in Section 5.2.1, a node can take any value on [0, 1] and this value quantitatively
considered to be failed (fully performing) when the corresponding node has 0 (1) and
values between 0 and 1 represent intermediate FP states. Arcs and rank correlations
on a NPBN show direct influences between FPs and their strengths. In building
systems are considered. That is, never do FPs negatively impact each other. To
99
reflect this restriction and model the joint distributions of FPs, Clayton copulas, which
possess the lower tail dependence property, are used in (Lee and Pan, 2018). The
lower tail dependence property helps effectively model the general behavior between
related FPs that a system’s part likely has more impacts to its associated parts as
its condition is worse. However, the choice of copula can be different based on data
or opinions from domain-experts. The authors also introduced and developed the
rank correlation estimation methods in system design stages. This is out of scope for
this research and we assume that a fully parameterized NPBN model is given for this
study.
For instance of the NPBN approach, suppose we are analyzing the system reli-
ability structure given in Fig. 5.1. The network structure shows the impacts that
the components’ FPs, X2 and X3 , have on the system’s FP, X1 , as r12 and r13|2 ,
given the components’ FPs (opposite direction is also possible) at the mission time.
x1 = Fr−1
12 ;f p2
(Fr−1
13|2 ;Fr (f p3 ) (u1 )) = Fr−1
12 ;f p2
(Fr−1
13|2 ;f p3
(u1 )) (5.4)
23 ;f p2
the output distribution is given in Fig. 5.2. We can expect that the system will
moderately or poorly perform. To quantify the simulation result, the system reliability
is defined as a value of sample at 50th quantile. In this example, the system reliability
is 0.423. One may refer to (Lee and Pan, 2018) for the details of NPBN construction
100
Figure 5.4: Skewed Distributions
tional quantiles of a response variable. QRs have been widely utilized for statistical
analysis in various areas because of its advantages over ordinary least squares (OLS)
First, it is well known that the conditional mean, which is provided by an OLS
of response variable is asymmetric. Fig. 5.4 shows two skewed distributions where
both distributions have the mean = 0. If these are the conditional distributions of
the response variable given different values of the same predictor, the OLS model
fails to convey useful information. On the other hand, the median values, which are
measured by a 50th QR model, of the distributions are −0.12 (the distribution in blue)
and 0.139 (the distribution in red), respectively, and these values show the central
101
cannot provide the best fit to data. Moreover, the OLS method provides only one
always desired. Fig. 5.5 and 5.6 compare and show how QR models overcome the
shortcomings of an OLS model on the same dataset. Obviously, the scatter plot in
Fig. 5.5 does not satisfy the homoscedasticity assumption and this violation should
result in unreliable confidence intervals for the OLS model. There are five QR models
with different quantiles, 10th , 30th , 50th , 70th , and 90th , in Fig. 5.6 on the same scatter
plot. The use of many models with different quantiles can overcome the non-constant
yi = xTi β q + i (5.5)
where yi is the ith response variable, xTi is a transposed vector of ith predictors, β q
is a coefficient vector for the specified quantile (0 < q < 1), and i is an error term.
Note that the error term does not have any distributional assumption. For n samples,
i = yi − xTi β
ui = max(ei , 0)
vi = max(−ei , 0)
ei = ui − vi
102
Figure 5.5: An Example of LR Figure 5.6: Examples of LR and QRs
with the constraints ui , vi > 0 and yi = xTi β + ui − vi . Thus, the coefficients can be
i=1
ui , vi > 0, β ∈ Rp
103
Eq. 5.8 where the location parameter is set as xTi β. For BQR models, any prior
used. Otherwise, one can use the Markov Chain Monte Carlo (MCMC) approach.
Plese refer to (Benoit et al., 2017) for the technical details on MCMC.
regression model that emulates the NPBN model is expected to significantly facilitate
solving an RBDO problem because the main limitation of NPBN model for solving
probabilistic inference on a node at the level 1 based on FPs of its components, which
are indicated by nodes at the last level. In other words, an updated distribution
(evidence). Lee and Pan (2018) defines system reliability as a k th quantile value in
sets of evidence help us discover the relationship between the specific quantile values
numerical value in [0, 1], for a system consisting of n components, the input space
is [0, 1]n and output space is [0, 1]. Note that a QR model may not be appropriate
104
to represent the complex relationship between components’ FPs and system’s FP
because the relationship, [0, 1]n → [0, 1], may show a strong non-linear behavior.
To overcome this challenge, we will locally fit a QR model by narrowing the input
at the mission time to enhance the overall system’s performance, the targeted FP of
this component should lie in the interval [pi , 1]. Thus, once we obtain the optimal
Therefore, the input space of this component’s FP can be [pi , 1] instead of [0, 1]. By
applying this idea to all components, we can reduce the input space into i=1 [pi , 1]
Qn
with the assumption that the relationship becomes more linear or at least weakly
This paper employes the NPBN modeling method proposed in (Lee and Pan,
2018), but the definition of system reliability is changed. We set the system reliability
Min: C(θ)
θ
(5.9)
Subject to: P rob[Gi (θ) ≥ 0] ≥ Ri , i = 1, . . . , m.
The system performance function, G(θ), in the RBDO formulation 5.9 is defined
as follows:
where f (θ) provides a predicted system performance given components’ FPs, θ, and
105
(failure). The desired probability of success is R, and it is a target reliability. This
than h. In other words, a (1−R)-quantile value from below has to be equal to or larger
than h. For instance, assume we have the simulation results given in Fig. 5.7 and 5.8
based on different sets of solutions (Design 1 and Design 2). Because the (1 − R)-
quantile value is less than h in Fig. 5.7, a fraction of samples greater than h is smaller
than R. On the other hand, Fig. 5.8 shows the (1 − R)-quantile value that is greater
than h. Thus, a fraction of samples above h is larger than R and the probabilistic
constraint is satisfied. Note that we can evaluate the probabilistic constraint based
106
Suppose we have a system given in Fig. 5.1 and solve the RBDO problem below:
Min: C(θ)
θ
θL ≤ θi ≤ θU , i = 1, 2.
The objective is to minimize the system development cost where the decision variables
θ are a set of the expected components’ FPs at the mission time. As the system has
two components, θ is a vector of two elements. The optimal solution of this RBDO
problem, θ opt is the optimal assignment of the components’ expected FPs at the
Having the corresponding NPBN model, we can evaluate the probabilistic con-
Min: C(θ)
θ
θL ≤ θi ≤ θU , i = 1, 2.
As it is costly to repeatedly evaluate the system performance function, G(θ) =
sion in Bayesian manner is proposed by taking some samples (a sample from NPBN
sample of NPBN is called a design sample to the rest of this paper) from the NPBN
model. The QR model is updated by adding more design samples until it provides
a space-filling sampling approach can be used. Because there are two components,
the initial BQR model with 5-quantiles will have the form:
107
The above model is iteratively updated based on design samples from the NPBN
model. The sampling and updating are terminated if the following criteria is met:
Max.Coeff.Diff = |Max(β0(c)
5 5
− β0(p) 5
, ..., βn(c) 5
− βn(p) )| ≤ ω (5.14)
where βi(c)
5
and βi(p)
5
represent the estimated coefficients at the current and previous
iterations and ω is a small number. Thus, once coefficient values show negligible
changes over iterations, the surrogate model is not updated anymore. The One can
set a stricter stopping criteria by requiring the negligible coefficient changes over a
few successive iterations. Then, by substituting the probabilistic constraint for the
Min: C(θ)
θ
θL ≤ θi ≤ θU , i = 1, 2.
Note that the equation β05 + β15 θ1 + β25 θ2 provides the value of 5th quantile sample
β05 + β15 θ1 + β25 θ2 is larger than h = 0.85 given θ, it indicates that the portion of
samples larger than h = 0.85 is higher than R = 0.95. Then, the system reliability
Any optimization technique can be applied to solve the above deterministic opti-
mization problem. The proposed method is illustrated step by step in the following:
4. Update the BQR surrogate model by taking samples from NPBN model.
108
6. If the stopping criteria is satisfied, put the BQR model in the RBDO formulation
radar system is investigated, which is also used in (Zhou et al., 2006) for the reliability
assessment. The radar system is functioning to detect targets and reporting signals.
Its application areas are extremely diverse such as transportation systems, air-defense
systems, antimissile system, etc. As the radar system is embedded in various sophis-
ticated systems for purposes of safety and security, any failures of the radar system
can cause catastrophic disasters. Therefore, a very high level of reliability is required.
The general reliability structure of radar system is given in Fig. 5.9 and it is
assumed to be a coherent system in this study. For each arc, a value of rank correlation
and a copula assigned are specified (C: Clayton Copula, G: Gumbel Copula). The
node 1 represents the FP of the radar system and the nodes circled by dotted lines
are corresponding to the components. Each node represents the FP of the following
components:
• Node 5: Antennas
109
• Node 9: Basic component 1
The nodes 2,3,4,6,7, and 8 are the non-physically existing subsystems consisting of
associated components, which are directly connected from below in the network.
Given the network structure, assigned copulas, and a set of components’ FPs,
we take 2500 samples to get an output distribution because a specific quantile value
shows little variations (less than 0.1e − 2) after taking 2500 samples. The simulation
is performed on the processor Intel(R) Core(TM) i5-3337U and the memory (RAM)
Suppose we are solving an RBDO problem for the system, and the threshold, h,
and target system reliability, R, are set as 0.85 and 0.95, respectively. All components
are currently expected to show the FPs, 0.7, at the mission time where the corre-
sponding system reliability is 0.6688 (5th -quantile value is 0.6181). Fig. 5.10 shows
the distribution of predicted system performance with the current design. Since not
more than 95% samples are larger than h = 0.85, the current design does not meet
110
the target reliability. We want to improve the system design by finding the optimal
Ri −Ri.current
Ci (Ri ) = e Ri.max −Ri
(5.16)
where Ci is the cost incurred to develop the reliability of ith component from the
that can be achieved by the current technology. In this paper, Ri.max is assumed as
0.7 ≤ θi ≤ 1, i = 1, . . . , 11.
Figure 5.10: The Distribution of Predicted System Performance Based on the Cur-
rent Design
111
Note that the input space is defined as [0.7, 1]11 . The system performance function,
G(θ) = f (θ) − 0.85, can be evaluated through simulations on NPBN and the proba-
Thus, a 5th -quantile regression surrogate model will be constructed based on prior
knowledge and some design samples from the NPBN model. For the Bayesian ap-
coefficient. Based on the analysis of historical data of parent systems, the initial QR
where y and θ represent a 5th quantile value and components’ FPs. The information
We fit the BQR by using an R package, bayesQR, and the Latin Hypercube
Design (LHD) is used for sampling strategy. The surrogate model is updated in
every iteration by taking 20 additional design samples. 300 samples are taken from
each design sample using the NPBN. The posterior distributions of coefficients are
approximated by the MCMC method. During the MCMC procedures of finding the
posterior distributions, 5000 samples are drawn in every iteration and 4500 samples
are retained (500 burnin samples). The stopping criteria is that the Max.Coeff.Diff
value of Eq. 5.14 is less than 0.01 for two consecutive iterations. The stopping criteria
met at 10th iteration and the finial model is (Appendix C provides all the intermediate
112
No. Component Prior Dist
QR models):
The model summary is given in Table. 5.2. The first and second columns indicate
the coefficients and corresponding estimated values. The third and fourth columns
provide the 95% credible intervals of the coefficients. The model summary shows that
113
Bayes Est. 2.5 Quantile 97.5 Quantile
Given the final model, we can formulate the RBDO problem as follow:
11
Min:
X
C(θ)
θ
i=1
(5.20)
Subject to: BQR.model ≥ 0.85
0.7 ≤ θi ≤ 1, i = 1, . . . , 11.
114
Table 5.3: RBDO Result
COBYLA is a direct search method that can handle both non-linear equality and
Table. 5.3. Note that currently all the components’ FPs are expected to be 0.7 and
the predicted system reliability is 0.6688. It is hard to intuitively find the optimal
solution because of the complex system reliability and exponential behaviors of cost
functions. The θ values in Table. 5.3 represent the components’ FPs in order to meet
the target reliability, 0.95, at the minimum development cost, 103.815. The solution
is verified by generating 2500 samples on the NPBN and 2384 samples are larger
than the threshold, h = 0.85. Thus, the fraction of samples larger than the threshold
is 95.36% and the system reliability is 0.9536. In addition, the 5th quantile value,
115
Figure 5.11: The Verification of the Solution from BQR Model
which is 0.8554, also shows that the predicted system reliability meets the target
of the verification is given in Fig. 5.11. Although the RBDO solution with the BQR
model provides the approximate solution, further investigations may be needed for
more accurate result. However, the surrogate model approach certainly offers useful
5.6 Discussion
It is verified that the optimal solution found by the surrogate model provides the
system reliability, which is little higher than the target reliability. Thus, further inves-
tigations on the system may find better solutions with the system development cost
lower than 103.815. Although the surrogate model does not guarantee the optimal
solution, it greatly helps our decision making on the system design change.
Note that the current design is expected to show the system reliability, 0.6688,
116
with all components’ FPs, 0.7. Because the expected system reliability does not meet
the target reliability, we could improve the system reliability using the approaches
the most efficient component, and in Section 4, genetic algorithm (GA) was carried
out to consider all components together. However, the GA introduces another com-
putational burdensome and the optimal solution is not ensured too. On the other
hand, the surrogate model approach allows us to comprehensively examine all the
model, computational time to solve an RBDO problem becomes negligible (0.16 sec-
sec. If we solely rely on the NPBN model, computational time will be 192.73 sec × a
number of different solution sets until obtaining the optimal solution. Therefore, the
surrogate model provides the advantages on the fast problem-solving time and the
5.7 Conclusion
a system with a complex reliability structure is being developed. Due to the lack
modeled and analyzed through a NPBN model, and it is expressed as the implicit
by running the simulation on the NPBN model will directly evaluate the probabilistic
constraint, the accompanied computational cost can be very expensive. To ease the
Replacing the performance function with a BQR model transforms the RBDO
117
problem into a deterministic optimization problem. Thus, the proposed methodology
RBDO solving process and the optimal solution assists in improving the current
system design. As shown in the Section. 5.5, however, the optimal solutions may
need more detailed investigations because the evaluated system reliability with the
solution isn’t perfectly meet the target reliability. The errors should come from the
there exist much uncertainties in the sampling procedure of NPBN model, we need to
develop more robust and flexible surrogate models with not increasing the problem
solving-time.
118
Chapter 6
6.1 Conclusion
and proposed various reliability analysis approaches using two graphical models: dis-
crete Bayesian network (discrete BN) and nonparametric Bayesian network (NPBN).
The dissertation particularly focused on the production stage in chapter 2 and the
component. The case study shows that Semi-Markov chain model most generalizes
the Markovian property among the proposed methodologies. In other words, Semi-
Markov chain model is more data-driven than the other models (higher-order Markov
chain and discrete time Markov chain models) while more computational demand is
119
accompanied.
property. Although Semi-Markov chain and higher order Markov chain models solve
this limitation to some extent, these approaches could not fundamentally remove the
more realistic constraints. For example, minimizing the maintenance cost can be a
primary interest under some constraints such as a probability of external shocks, the
limited number of components being repaired, and minimum required reliability for
data every day. Thus, thorough data pre-processing and application of various ma-
system reliability modeling frameworks using NPBN and Bayesian inference. The
use of NPBN effectively overcome the limitation of discrete BN, thus, the continuous
In contrast to the research on the production stage, one of main challenges in the
design stage is the lack of field failure data. To handle this difficulty, we developed
the information integration technique that combines domain-expert opinion and his-
continuous time.
120
The last research presented in this dissertation overcomes the limitation of NPBN,
build a surrogate model with quantile regression (QR) model as conditional quantiles
of distributions can be calculated without Monte Carlo simulation. The case study
in chapter 5 shows that the use of QR surrogate model extremely facilitates solving
The research on embodiment design stage in this dissertation can be further devel-
oped to the earlier design stage, conceptual design stage. As conceptual design stage
comes before embodiment design stage, any decision made in the conceptual design
stage will have a stronger impact than decisions made in the embodiment design
121
REFERENCES
122
N. Caballé, I. Castro, C. Pérez, and J. M. Lanza-Gutiérrez. A condition-based mainte-
nance of a dependent degradation-threshold-shock model in a system with multiple
degradation processes. Reliability Engineering & System Safety, 134:98–109, 2015.
2.2
B. Cai, Y. Liu, Y. Ma, Z. Liu, Y. Zhou, and J. Sun. Real-time reliability evaluation
methodology based on dynamic bayesian networks: A case study of a subsea pipe
ram bop system. ISA transactions, 58:595–604, 2015. 4.2
B. Cai, H. Liu, and M. Xie. A real-time fault diagnosis methodology of complex
systems using object-oriented bayesian networks. Mechanical Systems and Signal
Processing, 80:31–44, 2016. 3.1, 4.2
B. Cai, Y. Liu, and M. Xie. A dynamic-bayesian-network-based fault diagnosis
methodology considering transient and intermittent faults. IEEE Transactions on
Automation Science and Engineering, 14(1):276–285, 2017. 4.2
F. Campolongo, J. Cariboni, and A. Saltelli. An effective screening design for sen-
sitivity analysis of large models. Environmental modelling & software, 22(10):
1509–1518, 2007. 3.6.2
I. Castro, N. Caballé, and C. Pérez. A condition-based maintenance for a system sub-
ject to multiple degradation processes and external shocks. International Journal
of Systems Science, 46(9):1692–1704, 2015. 2.2
A. Chen and G. Wu. Real-time health prognosis and dynamic preventive maintenance
policy for equipment under aging markovian deterioration. International Journal
of Production Research, 45(15):3351–3379, 2007. 2.2
Y. Cheng and X. Du. System reliability analysis with dependent component failures
during early design stage. In ASME 2015 International Mechanical Engineering
Congress and Exposition, pages V011T14A008–V011T14A008. American Society
of Mechanical Engineers, 2015. 3.1, 3.2
U. Cherubini, E. Luciano, and W. Vecchiato. Copula methods in finance. John Wiley
& Sons, 2004. 4.3
K. Choi, Y. Noh, and I. Lee. Reliability-based design optimization with confidence
level for problems with correlated input distributions. In Sixth China-Japan-Korea
Joint Symposium on Optimization of Structural and Mechanical Systems, 2010. 5.1
O. Chryssaphinou, M. Karaliopoulou, and N. Limnios. On discrete time semi-
markov chains and applications in words occurrences. Communications in Statis-
tics—Theory and Methods, 37(8):1306–1322, 2008. 2.3.3
G. Clark and R. Paasch. Diagnostic iviodeling and diagnosability evaluation of ivie-
chanical systems. Journal of Mechanical Design, 118:425, 1996. 3.1
123
R. M. Cooke, T. A. Zang, D. N. Mavris, and J. Tai. Sculpting: A fast, interactive
method for probabilistic design space exploration and margin allocation. In 16th
AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference, page 3440,
2015. 3.4, 5.1
J. Cózar and J. M. P. J. A. Gámez. An application of dynamic bayesian networks
to condition monitoring and fault prediction in a sensored system: a case study.
International Journal of Computational Intelligence Systems, 10:176–195, 2017. 4.2
T. Daemi, A. Ebrahimi, and M. Fotuhi-Firuzabad. Constructing the bayesian net-
work for components reliability importance ranking in composite power systems.
International Journal of Electrical Power & Energy Systems, 43(1):474–480, 2012.
3.2
S. Distefano and A. Puliafito. Dynamic reliability block diagrams vs dynamic fault
trees. In Reliability and Maintainability Symposium, 2007. RAMS’07. Annual,
pages 71–76. IEEE, 2007. 4.2
P. Do, A. Voisin, E. Levrat, and B. Iung. A proactive condition-based maintenance
strategy with both perfect and imperfect maintenance actions. Reliability Engi-
neering & System Safety, 133:22–32, 2015. 2.2
X. Du and W. Chen. Sequential optimization and reliability assessment method for
efficient probabilistic design. Transactions-American Society of Mechanical Engi-
neers Journal of Mechanical Design, pages 225–233, 2004. 5.1
V. Dubourg, B. Sudret, and J.-M. Bourinet. Reliability-based design optimization
using kriging surrogates and subset simulation. Structural and Multidisciplinary
Optimization, 44(5):673–690, 2011. 5.1
A. C. Elegbede, C. Chu, K. H. Adjallah, and F. Yalaoui. Reliability allocation through
cost minimization. IEEE Transactions on reliability, 52(1):106–111, 2003. 4.1, 4.2
N. Friedman and D. Koller. Being bayesian about network structure. a bayesian
approach to structure discovery in bayesian networks. Machine learning, 50(1-2):
95–125, 2003. 2.3.2
H. Garg and S. Sharma. Multi-objective reliability-redundancy allocation problem
using particle swarm optimization. Computers & Industrial Engineering, 64(1):
247–255, 2013. 4.2
N. Gebraeel. Sensory-updated residual life distributions for components with expo-
nential degradation patterns. Automation Science and Engineering, IEEE Trans-
actions on, 3(4):382–393, 2006. 2.2
N. Z. Gebraeel, M. A. Lawley, R. Li, and J. K. Ryan. Residual-life distributions
from component degradation signals: A bayesian approach. IiE Transactions, 37
(6):543–557, 2005. 2.2
124
S. Gunawan and P. Y. Papalambros. A bayesian approach to reliability-based op-
timization with incomplete information. Journal of Mechanical Design, 128(4):
909–918, 2006. 5.1
H. Guo, M. Jiang, and W. Wang. A method for reliability allocation with confidence
level. In Reliability and Maintainability Symposium (RAMS), 2014 Annual, pages
1–1. IEEE, 2014. 4.2
A. Hanea and D. Kurowicka. Mixed non-parametric continuous and discrete bayesian
belief nets. Advances in Mathematical Modeling for Reliability, (1):9–16, 2008. 3.1
A. Hanea, O. M. Napoles, and D. Ababei. Non-parametric bayesian networks: Im-
proving theory and reviewing applications. Reliability Engineering & System Safety,
144:265–284, 2015. 5.2.1
A. M. Hanea. Algorithms for non-parametric Bayesian belief nets. PhD thesis, TU
Delft, Delft University of Technology, 2008. 3.1, 3.4.1
A. Heng, S. Zhang, A. C. Tan, and J. Mathew. Rotating machinery prognostics: State
of the art, challenges and opportunities. Mechanical systems and signal processing,
23(3):724–739, 2009. 2.2
A. K. Jardine, D. Lin, and D. Banjevic. A review on machinery diagnostics and
prognostics implementing condition-based maintenance. Mechanical systems and
signal processing, 20(7):1483–1510, 2006. 2.2
H. Joe. Families of m-variate distributions with given margins and m (m-1)/2 bivariate
dependence parameters. Lecture Notes-Monograph Series, pages 120–141, 1996.
3.4.1
K. A. Kaiser and N. Z. Gebraeel. Predictive maintenance management using sensor-
based degradation models. Systems, Man and Cybernetics, Part A: Systems and
Humans, IEEE Transactions on, 39(4):840–849, 2009. 2.2
R. U. Kay. Fundamentals of the dempster-shafer theory and its applications to system
safety and reliability modelling. Reliab. Theory Appl, 3:173–185, 2007. 3.4.2
N. Khakzad, F. Khan, and P. Amyotte. Dynamic safety analysis of process systems
by mapping bow-tie into bayesian network. Process Safety and Environmental
Protection, 91(1):46–53, 2013. 4.6
K. O. Kim, Y. Yang, and M. J. Zuo. A new reliability allocation weight for reducing
the occurrence of severe failure effects. Reliability Engineering & System Safety,
117:81–88, 2013. 4.2
H. Komijani, M. Shahin, and A. Jabbarzadeh. Condition-based maintenance con-
sidering shock and degradation processes. Decision Science Letters, 6(2):151–164,
2017. 2.2
125
J. Koochaki, J. A. Bokhorst, H. Wortmann, and W. Klingenberg. Condition based
maintenance in the context of opportunistic maintenance. International Journal of
Production Research, 50(23):6918–6929, 2012. 2.2
W. Kuo and R. Wan. Recent advances in optimal reliability allocation. IEEE Trans-
actions on Systems, Man, and Cybernetics-Part A: Systems and Humans, 37(2):
143–156, 2007. 4.2
D. Kurowicka. Distribution-free continuous bayesian belief. Modern statistical and
mathematical methods in reliability, 10:309, 2005. 3.1, 3.4.1, 3.4.1, 4.1.1, 5.1, 5.2.1
D. Kurowicka. Dependence modeling: vine copula handbook. World Scientific, 2011.
3.4.1
L. Lamberson and K. Kapur. Reliability in engineering design, 1977. 4.1
H. Langseth and L. Portinale. Bayesian networks in reliability. Reliability Engineering
& System Safety, 92(1):92–108, 2007. 2.2, 2.3.2, 3.1, 3.2, 4.2
D. Lee and R. Pan. Predictive maintenance of complex system with multi-level reli-
ability structure. International Journal of Production Research, pages 1–17, 2017.
4.2
D. Lee and R. Pan. A nonparametric bayesian network approach to assessing system
reliability at early design stages. Reliability Engineering & System Safety, 2018.
4.2, 4.4, 5.1, 5.2, 5.2.2, 5.2.2, 5.4, 1
J. Li, D. W. Coit, and E. A. Elsayed. Reliability modeling of a series system with
correlated or dependent component degradation processes. In Quality, Reliabil-
ity, Risk, Maintenance, and Safety Engineering (ICQR2MSE), 2011 International
Conference on, pages 388–393. IEEE, 2011. 2.2
M. Li, J. Liu, J. Li, and B. U. Kim. Bayesian modeling of multi-state hierarchical
systems with multi-level information aggregation. Reliability Engineering & System
Safety, 124:158–164, 2014. 2.2, 3.2
C. Liang and S. Mahadevan. Pareto surface construction for multi-objective opti-
mization under uncertainty. Structural and Multidisciplinary Optimization, 55(5):
1865–1882, 2017. 3.4.2, 5.1
J. Liang, Z. P. Mourelatos, and E. Nikolaidis. A single-loop approach for system
reliability-based design optimization. Journal of Mechanical Design, 129(12):1215–
1224, 2007. 5.1
J. Liu, J. Li, and B. U. Kim. Bayesian reliability modeling of multi-level system with
interdependent subsystems and components. In Intelligence and Security Infor-
matics (ISI), 2011 IEEE International Conference on, pages 252–257. IEEE, 2011.
2.1, 2.2
C. J. Lu and W. O. Meeker. Using degradation measures to estimate a time-to-failure
distribution. Technometrics, 35(2):161–174, 1993. 2.4.1
126
J. Luque and D. Straub. Reliability analysis and updating of deteriorating systems
with dynamic bayesian networks. Structural Safety, 62:34–46, 2016. 4.1, 4.2
S. Mahadevan, R. Zhang, and N. Smith. Bayesian networks for system reliability
reassessment. Structural Safety, 23(3):231–251, 2001. 2.2, 2.3.2, 3.2
K. R. McNaught and A. Zagorecki. Using dynamic bayesian networks for prognostic
modelling to inform maintenance decision making. In Industrial Engineering and
Engineering Management, 2009. IEEM 2009. IEEE International Conference on,
pages 1155–1159. IEEE, 2009. 2.2, 4.2
W. Q. Meeker and L. A. Escobar. Statistical methods for reliability data. John Wiley
& Sons, 2014. 2.4.1
K. Menberg, Y. Heo, and R. Choudhary. Sensitivity analysis methods for building en-
ergy models: Comparing computational costs and extractable information. Energy
and Buildings, 133:433–445, 2016. 3.6.2
A. Mettas. Reliability allocation and optimization for complex systems. In Reliability
and Maintainability Symposium, 2000. Proceedings. Annual, pages 216–221. IEEE,
2000. 4.2, 4.6, 5.5
J. Molina, J. Bromley, J. Garcı́a-Aróstegui, C. Sullivan, and J. Benavente. Integrated
water resources management of overexploited hydrogeological systems using object-
oriented bayesian networks. Environmental Modelling & Software, 25(4):383–397,
2010. 3.1
S. Montani, L. Portinale, A. Bobbio, and D. Codetta-Raiteri. Radyban: A tool for
reliability analysis of dynamic fault trees through conversion into dynamic bayesian
networks. Reliability Engineering & System Safety, 93(7):922–932, 2008. 4.2
O. Morales, D. Kurowicka, and A. Roelen. Eliciting conditional and unconditional
rank correlations from conditional probabilities. Reliability Engineering & System
Safety, 93(5):699–710, 2008. 3.4.2, 3.4.2, 4.3.1, 4.4
O. Morales Nápoles. Bayesian belief nets and vines in aviation safety and other
applications. doctoral thesis, 2010. 3.1, 3.4.2
S. Nannapaneni, C. Liang, and S. Mahadevan. Bayesian network approach to
multidisciplinary, multi-objective design optimization under uncertainty. In 18th
AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference, page 3825,
2017. 3.4
M. Neil, N. Fenton, S. Forey, and R. Harris. Using bayesian belief networks to predict
the reliability of military vehicles. Computing & Control Engineering Journal, 12
(1):11–20, 2001. 3.2
S. W. Ormon, C. R. Cassady, and A. G. Greenwood. Reliability prediction models to
support conceptual design. IEEE Transactions on Reliability, 51(2):151–157, 2002.
3.2
127
D. Ozgür-Unlüakın and T. Bilgiç. Predictive maintenance using dynamic probabilistic
networks. In proceedings of the 3rd European Workshop on Probabilistic Graphical
Models PGM’06, pages 141–148, 2006. 2.2, 4.2
M. J. Powell. A direct search optimization method that models the objective and con-
straint functions by linear interpolation. In Advances in optimization and numerical
analysis, pages 51–67. Springer, 1994. 5.5
K. W. Przytula and A. Choi. An implementation of prognosis with dynamic bayesian
networks. In Aerospace Conference, 2008 IEEE, pages 1–8. IEEE, 2008. 2.2
W. Qian, L. Xie, D. Huang, and X. Yin. Systems reliability analysis and fault diag-
nosis based on bayesian networks. In Intelligent Systems and Applications, 2009.
ISA 2009. International Workshop on, pages 1–4. IEEE, 2009. 2.2
N. Rasmekomen and A. K. Parlikad. Optimising maintenance of multi-component
systems with degradation interactions. In In World Congress, pages 7098-7103,
2014. 2.2
A. B. Salem, A. Muller, and P. Weber. Dynamic bayesian networks in system relia-
bility analysis. IFAC Proceedings Volumes, 39(13):444–449, 2006. 4.2
L. M. Sanchez and R. Pan. An enhanced parenting process: Predicting reliability in
product’s design phase. Quality Engineering, 23(4):378–387, 2011. 3.2
S. Shan and G. G. Wang. Reliable design space and complete single-loop reliability-
based design optimization. Reliability Engineering & System Safety, 93(8):1218–
1230, 2008. 5.1
J. Sigurdsson, L. Walls, and J. Quigley. Bayesian belief nets for managing expert
judgement and modelling reliability. Quality and Reliability Engineering Interna-
tional, 17(3):181–190, 2001. 3.2
C. Simon, P. Weber, and A. Evsukoff. Bayesian networks inference algorithm to
implement dempster shafer theory in reliability analysis. Reliability Engineering &
System Safety, 93(7):950–963, 2008. 3.4.2
C. Su, N. Lin, and Y. Fu. Multi-state reliability assessment for hydraulic lifting
system based on the theory of dynamic bayesian networks. Proceedings of the
Institution of Mechanical Engineers, Part O: Journal of Risk and Reliability, page
1748006X16660488, 2016. 4.2
J. G. Torres-Toledano and L. E. Sucar. Bayesian networks for reliability analysis of
complex systems. In Ibero-American Conference on Artificial Intelligence, pages
195–206. Springer, 1998. 3.2
I. Y. Tumer and R. B. Stone. Mapping function to failure mode during component
development. Research in Engineering Design, 14(1):25–33, 2003. 3.2
128
P. Wang and D. W. Coit. Reliability prediction based on degradation modeling for
systems with multiple degradation measures. In Reliability and Maintainability,
2004 Annual Symposium-RAMS, pages 302–307. IEEE, 2004. 2.2
Y. Wang and H. Pham. Modeling the dependent competing risks with multiple degra-
dation processes and random shock using time-varying copulas. IEEE Transactions
on Reliability, 61(1):13–22, 2012. 4.2
C. S. Wasson. System engineering analysis, design, and development: Concepts,
principles, and practices. John Wiley & Sons, 2015. 3.1
P. Weber and L. Jouffe. Reliability modelling with dynamic bayesian networks. In
In 5th IFAC Symposium on Fault Detection, Supervision and Safety of Technical
Processes (SAFEPROCESS’03), Washington, DC, USA, pages 57–62. IFAC, 2003.
4.2
P. Weber and L. Jouffe. Complex system reliability modelling with dynamic object
oriented bayesian networks (doobn). Reliability Engineering & System Safety, 91
(2):149–162, 2006. 3.1, 4.2
P. Weber, G. Medina-Oliva, C. Simon, and B. Iung. Overview on bayesian networks
applications for dependability, risk analysis and maintenance areas. Engineering
Applications of Artificial Intelligence, 25(4):671–682, 2012. 3.2, 4.2, 5.2.1
G. Weidl, A. L. Madsen, and S. Israelson. Applications of object-oriented bayesian
networks for condition monitoring, root cause analysis and decision support on
operation of complex continuous processes. Computers & chemical engineering, 29
(9):1996–2009, 2005. 3.1
A. G. Wilson and A. V. Huzurbazar. Bayesian networks for multilevel system relia-
bility. Reliability Engineering & System Safety, 92(10):1413–1420, 2007. 2.2
T. Xia, L. Xi, X. Zhou, and J. Lee. Condition-based maintenance for intelligent
monitored series system with independent machine failure modes. International
Journal of Production Research, 51(15):4585–4596, 2013. 2.2
A. Yalaoui, C. Chu, and E. Châtelet. Reliability allocation problem in a series–parallel
system. Reliability engineering & system safety, 90(1):55–61, 2005. 4.2
J.-E. Yang, M.-J. Hwang, T.-Y. Sung, and Y. Jin. Application of genetic algorithm
for reliability allocation in nuclear power plants. Reliability Engineering & System
Safety, 65(3):229–238, 1999. 4.2
P. Yi, Z. Zhu, and J. Gong. An approximate sequential optimization and reliabil-
ity assessment method for reliability-based design optimization. Structural and
Multidisciplinary Optimization, 54(6):1367–1378, 2016. 5.1
P. Yontay and R. Pan. A computational bayesian approach to dependency assessment
in system reliability. Reliability Engineering & System Safety, 152:104–114, 2016.
2.3.2, 2.4, 3.2, 4.2
129
P. Yontay, L. M. Sanchez, and R. Pan. Bayesian network for reliability prediction
in functional design stage. In Reliability and Maintainability Symposium (RAMS),
2015 Annual, pages 1–6. IEEE, 2015. 2.3.2, 3.2, 4.2
D. C. Yu, T. C. Nguyen, and P. Haddawy. Bayesian network model for reliability
assessment of power systems. IEEE Transactions on Power Systems, 14(2):426–
432, 1999. 3.2
K. Yu and R. A. Moyeed. Bayesian quantile regression. Statistics & Probability
Letters, 54(4):437–447, 2001. 5.1, 5.3.2
W. Y. Yun, Y. M. Song, and H.-G. Kim. Multiple multi-level redundancy allocation
in series systems. Reliability Engineering & System Safety, 92(3):308–313, 2007.
4.2
S. Zhai and S. Z. Lin. Bayesian networks application in multi-state system reliability
analysis. In Applied Mechanics and Materials, volume 347, pages 2590–2595. Trans
Tech Publ, 2013. 2.2
C.-x. Zhang, J.-j. Sun, and S.-z. Lin. Reliability analysis and improvement for li/mno
2 cell production line based on fault tree analysis. In Industrial Engineering and
Engineering Management (IE&EM), 2011 IEEE 18Th International Conference
on, pages 1132–1135. IEEE, 2011. 3.6, 3.6.1
X. Zhang and A. Wilson. System reliability and component importance under de-
pendence: A copula approach. Technometrics, 59(2):215–224, 2017. 4.2
P. Zhao, P. S. Chan, L. Li, and H. K. T. Ng. On allocation of redundancies in
two-component series systems. Operations Research Letters, 41(6):690–693, 2013.
4.2
X. Zhong, M. Ichchou, and A. Saidi. Reliability assessment of complex mechatronic
systems using a modified nonparametric belief propagation algorithm. Reliability
engineering & system safety, 95(11):1174–1185, 2010. 2.4
Z. Zhou, G. Jin, D. Dong, and J. Zhou. Reliability analysis of multistate systems
based on bayesian networks. In Engineering of Computer Based Systems, 2006.
ECBS 2006. 13th Annual IEEE International Symposium and Workshop on, pages
6–pp. IEEE, 2006. 5.5
X. Zhuang and R. Pan. Epistemic uncertainty in reliability-based design optimization.
In Reliability and Maintainability Symposium (RAMS), 2012 Proceedings-Annual,
pages 1–6. IEEE, 2012a. 5.1
X. Zhuang and R. Pan. A sequential sampling strategy to improve reliability-based
design optimization with implicit constraint functions. Journal of Mechanical De-
sign, 134(2):021002, 2012b. 5.1
130
APPENDIX A
131
Conditional Probability Tables (CPTs) of the BN used in the simulation study
are given below.
132
Table A.2: The CPT for Subsystem 2
Subsystem 2
C3 C4 0 1 2 3 4
0 0 0.998 0.002 0 0 0
0 1 0.9 0.1 0 0 0
0 2 0.6 0.25 0.15 0 0
0 3 0.3 0.3 0.3 0.1 0
0 4 0 0.15 0.5 0.35 0
1 0 0.9 0.1 0 0 0
1 1 0.6 0.25 0.15 0 0
1 2 0.3 0.3 0.3 0.1 0
1 3 0 0.15 0.5 0.35 0
1 4 0 0.15 0.25 0.4 0.2
2 0 0.6 0.25 0.15 0 0
2 1 0.3 0.3 0.3 0.1 0
2 2 0 0.15 0.5 0.35 0
2 3 0 0.15 0.25 0.4 0.2
2 4 0 0 0.25 0.5 0.25
3 0 0.3 0.3 0.3 0.1 0
3 1 0 0.15 0.5 0.35 0
3 2 0 0.15 0.25 0.4 0.2
3 3 0 0 0.25 0.5 0.25
3 4 0 0 0 0.2 0.8
4 0 0 0.25 0.4 0.35 0
4 1 0 0.15 0.25 0.4 0.2
4 2 0 0 0.25 0.5 0.25
4 3 0 0 0 0.2 0.8
4 4 0 0 0 0.001 0.999
133
Table A.3: The CPT for System
System
Sub1 Sub2 0 1
0 0 0.999 0.001
0 1 0.99 0.01
0 2 0.9 0.1
0 3 0.8 0.2
0 4 0.7 0.3
1 0 0.99 0.01
1 1 0.9 0.1
1 2 0.8 0.2
1 3 0.7 0.3
1 4 0.6 0.4
2 0 0.9 0.1
2 1 0.8 0.2
2 2 0.7 0.3
2 3 0.6 0.4
2 4 0.5 0.5
3 0 0.8 0.2
3 1 0.7 0.3
3 2 0.6 0.4
3 3 0.5 0.5
3 4 0.2 0.8
4 0 0.7 0.3
4 1 0.6 0.4
4 2 0.5 0.5
4 3 0.2 0.8
4 4 0.001 0.999
134
APPENDIX B
135
Sampling algorithms for some nodes in Figures 3.3, 3.7, and 3.9
The second equality holds because the network structure shows that X1 and X2
are independent.
• Sampling algorithm for X4 in Fig. 3.9
x4 = Fr−1
41 ;x1
(Fr−1
43|1 ;Fr
(Fr−1 (Fr32 ;x3 (x2 )) (u4 )))
13 ;x1 (x3 ) 42|31 ;Fr
12|3 ;Fr13 ;x3 (x1 )
= Fr−1
41 ;x1
(Fr−1
43|1 ;Fr
(u4 )))
13 ;x1 (x3 )
Again, the second equality holds because the network structure implies X4 and
X2 are independent once values of X1 and X3 are given.
136
APPENDIX C
137
BQR models for each iteration and Max coefficient differences.
Initial Model
138
Iteration 9 (Max coefficient difference from the Iteration5 Model: 0.00893)
139