Programmable Controller With Flexible Redundancy For Safety Functions in A Nuclear Power Plant
Programmable Controller With Flexible Redundancy For Safety Functions in A Nuclear Power Plant
Abstract: This paper presents the redundancy architecture of the Programmable Logic
Controller called the Safety PLC(SPLC) for the safety functions such as reactor protection in
a nuclear power plant. The architecture of the SPLC is designed to switch flexibly redundancy
model between the Dual Modular Redundancy(DMR) and Triple Modular Redundancy(TMR).
Using this flexible redundancy architecture, the controller can be optimally configured to the
application area, and the reliability and availability of the overall system can be increased
because redundancy model varies as failures occur. The operating system of the SPLC is also
specially designed to guarantee the strict real-time operation using the non-preemptive state-
based scheduler and the supervisory task that manages timing violation of each task. The
data communication of the SPLC uses the deterministic state-based protocol based on the
Guaranteed Time Slot(GTS) protocol. The reliability analysis results show that MTTF of SPLC
is 41,630 hours, which is about 15% and 50% more reliable than the TMR or DMR architecture,
respectively.
On the contrary, the Tricon control system from Invensys Deterministic operation is another key feature for the
Inc. is based on the triple redundancy architecture as controllers to meet. From the IEEE standard, tasks of
shown in Fig. 1(Invensys Systems Inc. (2007)). There are safety-related functions should perform their own func-
three active processing modules in operation that calculate tions without any interruption to the pre-defined schedule.
control output simultaneously and determine output val- Following design criteria were selected for the SPLC with
ues using the majority voting algorithm. To help voting, consideration for the above mentioned requirements.
high speed serial bus, TriBus, is provided between three • The redundancy of the input/output module and
processing modules for high speed data exchange among processor module used in the SPLC should be flexible.
the processors. To provide the redundancy in I/O modules, • 2 out of 3 voting and hot-standby policy should
each I/O module has triple I/O legs through which input be used in the TMR(Triple Modular Redundancy)
values are distributed to three different processors and and DMR(Dual Modular Redundancy) configuration,
output values are again voted in the output module. The respectively.
triple redundancy in I/O module provides a very high level • The fault detection and fail-safe function should be
of reliability and lessens the probability of common cause performed by the data receiving module.
failure. • The Operating System(OS) of the SPLC should have
Since both dual and triple redundancy have their own the non-interruptable and deterministic task schedul-
advantage over each other, this paper introduces a new ing.
programmable controller for the safety-related functions in • The scan time of safety critical applications should
a nuclear power plant, called SPLC(PLC for safety func- be less than 25 msec.
tion), based on the flexible architecture that can switch • The data communication of the SPLC should be
dual or triple redundancy upon specific circumstances. deterministic and satisfy the independence among the
separated channels.
• The safety data communication should support upto
2. REQUIREMENT FOR NUCLEAR POWER PLANT 64 nodes with at least 20 Mbps of network bandwidth.
CONTROLLER
3. STRUCTURE OF SPLC
Since the controller for a safety-related function should be
strictly designed to meet industry standards and regula- To the design constraints described in the previous section,
tions including IEEE standard and US Regulatory Guide, SPLC is designed to have the dual and triple redundant
the requirement to design them should be clearly identi- architecture as shown in Fig. 2.
fied. Since IEEE standard recommends eliminating the risk
of Common Cause Failure(CCF) at the module and system
3.1 Flexible Redundancy
levels(IEEE (1994)), redundancy should be implemented
at the processing modules, I/O modules, and communica-
tion networks. The redundancy concept of SPLC is using the triple redun-
dancy in active module such as the processor module and
The other key feature is the independence that means each I/O module that needs independent decision and active
module or sub-system should independently operate from control functions and dual redundancy in passive compo-
each other and a failure in a part should not propagate nents including back-plane, communication network, and
to the other. For this, communication between safety- power supply that requires minimum redundancy to avoid
related controllers should be uni-directional and have a Common Cause Failure(CCF).
696
19th IFAC World Congress
Cape Town, South Africa. August 24-29, 2014
697
19th IFAC World Congress
Cape Town, South Africa. August 24-29, 2014
network but the other serial buses can be used in the of overall controller should be analyzed with considering
implementation phase. this redundant mode changes. With reflecting this failure
mode change of the flexible TMR model, the Fault Tree
3.4 Operating System Analysis(FTA) chart of SPLC is shown in Fig. 5. In the
chart, system state are classified into four categories: Total
Even though there are a few commercial operating systems Success(TS), Minimum Anticipated Failure(MAF), Maxi-
that support hard real-time operations, most of the com- mum Tolerable Failure(MTF), and Complete Failure(CF).
mercial operating systems hardly guarantee very strict de- For example, since the processor module is configured as
terministic operation that complied to the regulations and the TMR, if one processor module fails, it goes to the
standards of the nuclear system. Hence, to meet the strict MAF state. If two processor modules fail, system state
determinism defined in regulations, a new real-time kernel falls into the MTF state because further failure causes the
for SPLC, Safety Real-time Kernel(SRK), was designed system failure(CF). The state transition in the FTA-tree
to ensure the deterministic operation of safety function is caused by a failure occurred in each module(processor,
without any interruption by other functions, including the I/O card, bus, etc). Assuming these failures are indepen-
system diagnostics and communications. Fig. 4 shows the dent, this state transition can be modeled as a Markov
internal blocks of SRK. process. Fig. 6 shows the state transition of SPLC to the
failure occurs and the probability of each state transition,
To guarantee the execution time of the safety function, the PM AF , PM T F 1 , PM T F 2 , and PCF , depends on the redun-
scheduler of SRK operates using a non-preemptive state- dancy structure and model. Using these probability values,
based scheduling policy. In addition to the non-preemptive the reliability of overall system is calculated as in (1).
scheduler, a special supervisory task module that takes
care of time management, security management, and sys- R(t) = (1 − PM AF − PM T F 1 )RT S (t)
tem diagnostics was used to guarantee the hard-real-time +(PM AF − PM T F 2 )RM AF (t)
operation. The maximum resolution of the context switch- (1)
ing time and the scan time of the application task are 5 +(PM T F 1 + PM T F 2 − PCF )RM T F (t)
msec and 25 msec, respectively. +PCF RCF (t)
Another key feature of SRK is to handle the redundancy Based on the actual failure rate of each component module,
structure according to the failure occurrence. Since SPLC such as the processor module, communication module, I/O
is based on the flexible TMR redundancy, when a compo- modules, and etc, the probability of state transition is
nent fails, it automatically reconfigures the redundancy calculated as in (2). The earlier author’s work showed the
structure and, accordingly, the voting policy of the in- details of the failure rate and the FTA analysis results(Noh
put/output and application tasks should be changed as et al. (2013)).
shown in Fig. 3.
R(t) = 0.999953 · RT S (t) − {3.92 · RM AF (t)
(2)
4. RELIABILITY OF SPLC +3.92 · RM T F (t) + 4.70 · RCF (t)} × 10−5
As described in the previous section, since the redun- By integrating (2), the Mean Time To Failure(MTTF) of
dancy of SPLC changes as a fault occurs, the reliability SPLC is calculated as 41,630 hours that is about 97%
698
19th IFAC World Congress
Cape Town, South Africa. August 24-29, 2014
699
19th IFAC World Congress
Cape Town, South Africa. August 24-29, 2014
700