Stochastic_Maintenance_Schedules_of_Active_Distribution_Networks_Based_on_Monte-Carlo_Tree_Search
Stochastic_Maintenance_Schedules_of_Active_Distribution_Networks_Based_on_Monte-Carlo_Tree_Search
5, SEPTEMBER 2020
Abstract—The integration of volatile distributed energy re- ΩP Set of scenarios for stochastic variables
sources (DERs) brings new challenges for the active distribu-
tion network maintenance scheduling (DN-MS). Conventionally, Constants and Parameters
the DN-MS is formulated as a deterministic optimization model T Maintenance planning horizon
without considering the uncertainties of DERs. In this paper, bfj Constant of maintenance budget for equip-
the DN-MS is formulated as a multistage stochastic optimization
problem, which is cast as a stochastic mixed-integer nonlinear ment j in feeder f
programming model. It aims to reduce the total maintenance cost PMfj Preventive maintenance budget of equip-
constrained by the reliability indices. To capture the operational ment j in feeder f
characteristics of active distribution networks, the uncertainties CMfj Corrective maintenance budget of equip-
of DERs and post-outage operation strategies of switching devices ment j in feeder f
are incorporated into the model. In general, this type of model is
intractable and mainly solved by heuristic search methods with low MBt Stage-t maintenance budget
P,Mat C,Mat
efficiency. Recently, Monte-Carlo tree search (MCTS) is emerging Cj , Cj Cost of materials required for the repair of
as a scalable and promising reinforcement learning approach. We the equipment j in preventive or corrective
propose a stochastic MCTS solution to this problem. In the tree maintenance
search procedure, a sample average approximation technique is
developed to estimate multistage maintenance costs considering CjP,Wor , CjC,Wor Cost per working hour necessary for the
uncertainties. To speed up the MCTS, the complicated constraints repair of equipment j in preventive or cor-
of the original problem are transformed to penalty or heuris- rective maintenance
tics functions. This approach can asymptotically approximate the HjP,Wor , HjC,Wor Number of working hours for the repair of
optimum with promising computation efficiency. Numerical test equipment j in preventive or corrective main-
results demonstrate the superiority of the proposed method over
benchmark methods. tenance
Lf , TRf Length of line segments or number of trans-
Index Terms—Distribution network, distributed generation,
electric vehicles, maintenance, Monte-Carlo tree search. formers in feeder f
NC Number of customers
NOMENCLATURE
ρP U
i , ρi Prices of planned or unplanned unsupplied
Sets energy at load point i (¥/kWh)
Ωfl , Ωftr Set of line sections or distribution transform- λbase
j Base failure probability of equipment j
ers in feeder f λincre
j Incremental failure probability of equip-
Ωfnn Set of load points in feeder f ment j
P,Mai C,Mai
ΩSWI Set of switching devices ri,j , ri,j Restoration time experienced by load point i
ΩF Set of feeders due to preventive or corrective maintenance
of equipment j
Manuscript received August 28, 2019; revised January 2, 2020; accepted δSAIDI , δSAIFI Thresholds of SAIDI or SAIFI
February 9, 2020. Date of publication February 13, 2020; date of current version tfAS , tfM S Operation time of the automatic or manual
August 24, 2020. This work was supported in part by the National Science
Foundation of China under Grant 51725703. (Corresponding author: Wenchuan switching devices in feeder f
Wu.) Variables
Yuwei Shang and Wenchuan Wu are with Tsinghua University, Beijing
100084, China (e-mail: [email protected]; [email protected]). LTi,j Binary variable that is 1 if load point i can
Jianbo Guo, Jian Su, and Wei Liu are with China Electric Power Research be transferred to other feeders by operating
Institute, Beijing 100192, China (e-mail: [email protected]; sujian@
epri.sgcc.com.cn; [email protected]). switching devices due to the failure of equip-
Jiawei Liao and Yu Huang are with Peking University, Beijing 100871, China ment j, and 0 otherwise
(e-mail: [email protected]; [email protected]). N CjP,f , N CjU,f Number of customers experiencing planned
Color versions of one or more of the figures in this article are available online
at http://ieeexplore.ieee.org. or unplanned outages due to maintenance of
Digital Object Identifier 10.1109/TPWRS.2020.2973761 equipment j
0885-8950 © 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
horized licensed use limited to: NUST School of Electrical Engineering and Computer Science (SEECS). Downloaded on November 18,2024 at 09:11:56 UTC from IEEE Xplore. Restrictions ap
SHANG et al.: STOCHASTIC MAINTENANCE SCHEDULES OF ACTIVE DN BASED ON MONTE-CARLO TREE SEARCH 3941
Pi,j Unsupplied power for node i during the scenario is adopted for evaluation. This over-simplified
maintenance of equipment j model may not be applicable to active DNs.
SAIDI System Average Interruption Duration Index To describe DER uncertainties and their influence, the DN-
SAIFI System Average Interruption Frequency MS is formulated as a multistage stochastic programming model
Index for active DNs. The solutions involve the maintenance budgeting
λfj Failure probability of equipment j in feeder f and time allocation that minimize the expected costs while
CPM , CCM Labor and material cost of preventive or satisfying the reliability constraints of the distribution network.
corrective maintenance If only one operational scenario (i.e., one realization of DER
CUE Cost of unsupplied energy uncertainties) is considered, the solution space to this problem
CP,UE , CU,UE Cost of unsupplied energy in planned or contains approximately k Ne ·T candidate solutions, where k is
unplanned outage the number of maintenance decisions for each piece of equip-
afj Binary decision variable that is 1 if preven- ment, Ne is the number of pieces of equipment, and T is the
tive maintenance is allocated for equipment maintenance planning horizon. The problem is computation-
j in feeder f, otherwise 0 ally challenging. The Monte-Carlo tree search (MCTS) method
In the Markov decision process and Monte-Carlo tree search may provide a viable solution to the problem, as an artificial
intelligence approach that has achieved superior performance in
A, S Set of all actions or states several computationally intensive computer games. We develop
a, s Action or state MCTS-based solutions to solve the DN-MS.
v, v Node or chance node in the search tree
Q(s, a) Reward function for taking action a in state s
γ Discount rate B. Related Work
Many works have focused on maintenance scheduling in DNs.
I. INTRODUCTION Reference [2] established a quantitative model to analyze the
impact of different PM strategies on system reliability and costs.
A. Background
References [3] and [4] presented a practical framework for
ISTRIBUTION networks are one of the most implementing a reliability-centered maintenance procedure in
D maintenance-intensive parts of power systems.
Distribution network companies devote considerable efforts to
DNs. Reference [5] used the network reliability and maintenance
cost as evaluation criteria, and the fuzzy analytical hierarchical
allocate their limited resources (e.g., labors and materials) to process (AHP) was adopted to assign their weights. Reference
maintain the power assets (e.g., lines and transformers) [1]. In [6] formulated a mixed-integer linear programming (MILP)
general, maintenance strategies can be classified into corrective method and employed an efficient decision-tree solution for
maintenance (CM) and preventive maintenance (PM). The PM. They used a linearized cost model to describe the EENS
CM is performed after equipment failure occurs, while the and maintenance costs, etc. Reference [7] presented a MILP
PM is performed before equipment failure occurs. The goal formulation in conjunction with time-dependent deterioration
of distribution network maintenance scheduling (DN-MS) is failure probabilities of equipment. The aim was to identify a
to enhance system reliability while reducing total costs over cost-effective PM strategy while satisfying system reliability
a multistage planning horizon. The integration of large-scale requirements. Reference [8] presented a two-step approach to
distributed energy resources (DERs), such as distributed optimize the maintenance strategy for feeders equipped with
generators (DGs) and electric vehicles (EVs), distinguishes normally open switches. If the budget is limited and does not
DN-MS from existing models in terms of the cost formulation allow performance of the required maintenance, the loading of
and reliability estimation: the worst-performing equipment is reduced. In [9] a decision
1) In general, expenditure due to PM and CM mainly involves tree was used to express state transitions and dynamic program-
the costs of labor and materials, and the cost of unsupplied ming (DP) was employed to optimize the maintenance strategy.
energy (also known as the expected energy not supplied, However, DP for DN-MS may suffer from dimensionality [6].
EENS). Most existing works formulate the EENS as an This issue can be addressed by the approximate DP, as evidenced
expected loss function of energy consumption. However, by references [10] and [11] for solving maintenance scheduling
in active DNs, the generation losses of DGs should be problems in related fields. Some recent works, i.e., [12] and [13],
considered in EENS, as well as their uncertainties. incorporated the operation strategies of automatic switching
2) During DN-MS formulation, the reliability indices are devices, including sectionalizers and tie switches, into the DN-
incorporated as constraints; reliability estimation is in- MS. They formulated mixed-integer non-linear programming
dispensable. For reliability estimation, post-outage op- (MINP) models that were solved by a heuristic search algorithm.
eration of switching devices, such as sectionalizers and To date, no study has used the stochastic optimization
tie switches, is necessary for transferring the outage load model in the context of DN-MS. However, some recently pub-
to adjacent feeders, which is constrained by the feeders’ lished works, i.e., [14], [15] and [16], incorporated different
available capacity, in turn determined by the power flow. types of uncertainties into the maintenance scheduling prob-
The DER uncertainties are always ignored and the load de- lem for power plants and transmission networks, and demon-
mand is assumed to be constant, i.e., only one deterministic strated remarkable performance improvements compared with
horized licensed use limited to: NUST School of Electrical Engineering and Computer Science (SEECS). Downloaded on November 18,2024 at 09:11:56 UTC from IEEE Xplore. Restrictions ap
3942 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 35, NO. 5, SEPTEMBER 2020
horized licensed use limited to: NUST School of Electrical Engineering and Computer Science (SEECS). Downloaded on November 18,2024 at 09:11:56 UTC from IEEE Xplore. Restrictions ap
SHANG et al.: STOCHASTIC MAINTENANCE SCHEDULES OF ACTIVE DN BASED ON MONTE-CARLO TREE SEARCH 3943
alternative power supply for all load points in F1. If there is uncertainties of DERs. Here superscript · is used to distinguish
insufficient capacity and only LD can be transferred to F2, then stochastic variables from deterministic variables,
T1 and S3 will be closed while S1 and S2 will be opened. UE P,UE,f U,UE,f
Ct = C i,j,t + C i,j,t (6)
A. Objective Function f ∈ΩF j∈{Ωf ∪Ωftr } i∈Ωfnn
l
maintenance costs through optimizing the maintenance budget- where C i,j,t and C i,j,t are the planned and unplanned un-
ing and time allocation for the correlated equipment. Let T be supplied energy costs for load point i due to stage-t maintenance
the length of the maintenance scheduling horizon, and t the time of equipment j.
P,UE,f
index. Let C(at ) be the stage-t costs, i.e., the cost incurred in C i,j,t is given by
the period [t, t + 1),
P,UE,f
where CtPM and CtCM are the labor and material costs of PM and ∀j ∈ {Ωfl ∪ Ωftr }, ∀f ∈ ΩF (7)
CM, respectively. CtUE is the cost of the EENS in both planned
and unplanned outages. at is the maintenance decisions for all where P i,j,t is the stochastic power and ri,j,t is the restoration
equipment at stage t, i.e., at = {afj,t |j ∈ Ωl ∪ Ωtr , f ∈ ΩF }. time of load point i during stage-t maintenance of equipment j,
CtPM can be obtained based on the scheduled PM budget for where,
the distribution equipment by the distribution utility, ri,j,t
P
= LTi,j,t · (pAS tfAS + (1 − pAS )tfM S )
CtPM = afj,t · PMfj,t (2) P,Mai
+ (1 − LTi,j,t )ri,j , ∀i ∈ ΩfLD , ∀j ∈ {Ωfl ∪ Ωftr }
f ∈ΩF j∈{ Ωl ∪Ωtr }
(8)
CtCM is obtained according to the number of working hours LTi,j,t is the binary variable denoting whether the load can
and the amount of materials needed for CM activities, be transferred to alternative feeders. If load point i can be
CtCM = CM,f
Cj,t (3) transferred to a backup feeder, LTi,j,t equals 1; otherwise, it
f ∈ΩF j∈{Ωl ∪Ωtr }
is 0. As a result, if load point i cannot be transferred, it will
experience a restoration time equal to the maintenance duration
CM,f
Cj,t = λfj,t · (Cjwor · Hjwor + Cjmat ), ∀j ∈ {Ωfl ∪ Ωftr }, of equipment j, which usually lasts from several hours to several
days. If load point i is transferable, we further consider the
∀f ∈ ΩF (4) possibility of automatic switch malfunctions in practice. Let pAS
where λfj,t
is the failure probability of equipment j. be the probability of correct operation of automatic switches; if
In general, equipment failures can be classified as random or switches always operate correctly in automatic mode, pAS = 1
deteriorating failures (i.e., ageing). Random failures are com- and load point i has a restoration time that is equal to the
monly modelled by exponential probability distributions, while automatic switching time. In contrast, if the switches do not
deteriorating failures are commonly modeled by the Weibull or operate correctly in automatic mode, pAS = 0, which implies
normal probability distributions, whose failure probability may load point i has the manual switching time [13].
P,UE,f U,UE,f
increase over time. PM can reduce the probability of deterio- Similar to C i,j,t , C i,j,t is calculated by
rating failures [9], and the failure probability of the equipment U,LD,f
could be decreased in proportion to their allocated maintenance C i,j,t = λfj,t · P i,j,t · ri,j,t
U
· ρU f
i,j,t , ∀i ∈ Ωnn ,
budget [12]. Hence the failure probability in a time interval is
given as, ∀j ∈ {Ωfl ∪ Ωftr }, ∀f ∈ ΩF (9)
f f where,
λfj,t+1 = afj,t · λfj,t · e−(bj ·PMj,t ) + (1 − afj,t ) · λfj,t + λf,incre
j,t ,
ri,j,t
U
= LTi,j,t · (pAS tfAS + (1 − pAS )tfM S )
∀j ∈ {Ωfl ∪ Ωftr }, ∀f ∈ ΩF (5)
C,Mai
+ (1 − LTi,j,t )ri,j , ∀i ∈ ΩfLD , ∀j ∈ {Ωfl ∪ Ωftr } (10)
where bfj and λf,incre
j,t can be determined based on the historical
maintenance data [12]. Note that (5) is a general model to Let Pr denote the probability distribution of stochastic power
describe the deteriorating failure probability. In short-term main- P on a common and complete probability space P , and let
tenance scheduling problems (e.g., monthly, weekly), λf,incre
j,t vector ω t ≡ {P i,t : i ∈ Ωnn } ∈ ΩP t denote stage-t stochastic
can be simply set to 0. Besides, as more sensors and smart power for all load points. Assume the initial stage is stage 0,
meters are deployed, condition based information can be used we refer to a scenario ωtT ∈ ΩT,P t ≡ ΩP P
t × · · · × ΩT −1 as a
to replace (5) for capturing the time-varying failure probability realization (or sampling trajectory) of the stochastic process
more accurately [22]. T
{ω t : 0 ≤ t ≤ T − 1} (see [17] for details of the sampling
CtUE is estimated based on the interrupted energy during out-
UE
procedure). For the DN-MS problem over the multistage plan-
age. We use a stochastic variable denoted as C t to incorporate ning horizon, its objective can be finally formulated as a nested
horized licensed use limited to: NUST School of Electrical Engineering and Computer Science (SEECS). Downloaded on November 18,2024 at 09:11:56 UTC from IEEE Xplore. Restrictions ap
3944 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 35, NO. 5, SEPTEMBER 2020
multistage expectation minimization function: The decision variables of (17) are denoted by {a0 , …,aT-1 }.
Take a0 as an example, it is a set of binary variables repre-
z = min C(a0 ) + γE ω1
min C(a1 ) + γE
ω 2| ω 1
min C(a2 ) senting the maintenance decisions of all equipment at stage
a0 a1 a2
0, i.e., a0 = {afj,0 |j ∈ Ωl ∪ Ωtr , f ∈ ΩF }. In Section III we
describe the MCTS-based approach for obtaining high-quality
+ · · · + γE
ω T −1 | ω T −2
[min C(aT −1 )] . . . (11)
aT −1 approximate solutions to this problem.
where E
ω t+1 | ω t
is the expectation following the conditional
III. PROPOSED METHOD
probability Pr(ω t+1 |ω t ). C(·) denotes the cost formulated in
(1). γ is the discount factor for future maintenance costs. For A. Preliminaries of the Computation Framework
short-term maintenance scheduling, γ can be set to 1. The problems solved by the MCTS are commonly formalized
by the Markov decision process (MDP), in which S and A denote
B. Constraints the state space and action space, respectively, F : S × A → S
The maintenance decision is subject to constraints (12)–(16). denotes the transition function from a state-action pair to the
1) The allocated stage-t maintenance budget should be less next state, Q(s) denotes the reward of a state s ∈ S, and Af (s)
than the maximum maintenance budget, denotes the set of available actions in state s. In the DN-MS
problem, we use Q as the negative of the maintenance cost C
CtPM + CtCM ≤ M Bt (12) formulated in (1), the transition function F is defined by (5).
2) The reliability indices System Average Interruption Fre- The state and action are
quency Index (SAIFI) and System Average Interruption
Duration Index (SAIDI) are used to represent the degree s = {λfj |j ∈ Ωl ∪ Ωtr , f ∈ ΩF } (18)
of customer satisfaction. They should be less than the a = {afj |j ∈ Ωl ∪ Ωtr , f ∈ ΩF } (19)
allowable thresholds [6],
where s and a are vectors comprising failure probabilities and
f
U,f
N Cj,t f
P,f
N Cj,t
λj,t · + aj,t · ≤ δSAIFI maintenance decisions for all equipment, respectively. Note
F l tr
NC NC thatλfj ∈ [0, 1] is a real-valued variable, and afj ∈ {0, 1} is a
f ∈Ω j∈{Ω ∪Ω }
(13) binary variable that, when equal to 1, denotes the scheduling PM
for j, and vice versa when it is 0. We use t = 0 to represent the
i∈Ωfnn ri,j,t
U
i∈Ωfnn ri,j,t
P
initial stage and λfj,0 to represent the stage-0 failure probability
λfj,t + afj,t
NC NC of equipment j; the corresponding state vector is s0 = {λfj,0 |j ∈
f ∈ΩF j∈{Ωl ∪Ωtr }
Ωl ∪ Ωtr , f ∈ ΩF }. If the stage-0 maintenance strategy is a0 =
≤ δSAIDI (14)
{afj,0 |j ∈ Ωl ∪ Ωtr , f ∈ ΩF }, we can derive the corresponding
where the reliability indices are estimated by incorporating state vector s1 by (5). Similarly, the state vector st can be derived
the post-outage events and network reconfiguration through based on {st−1 , at−1 }, where t ∈ {1, . . . , T }.
switches [23]. A Monte-Carlo search tree consists of nodes and edges. Let
3) The failure probability of the equipment should not be v be a tree node corresponding to a state s. The directed tree
lower than that of the minimum failure probability in its edge connecting a parent node to its child node represents an
useful life period [10], action a leading to state transitions. Each node v contains the
following statistics: state s(v), selected action a(v), expected
λfj,t ≥ λbase f f
j,t , ∀j ∈ {Ωl ∪ Ωtr }, ∀f ∈ Ω
F
(15) reward Q(v) and visit count N(v). For example, in our problem,
where λbase is the minimum failure probability of equipment j v0 is the root node of the tree with statistics s(v0 ) denoting the
j
during its service life. For simplicity, λbase can be set to 0. state vector s0 defined in (18), and a(v0 ) denoting a0 defined
j
4) The thermal capacity of the equipment restricts the maxi- in (19). v1 is one of the child nodes of v0 , whose statistics
mum current that can be carried through, s(v1 ) and a(v1 ) represent stage-1 state and action vectors s1
and a1 , respectively. If the total number of pieces of equipment
f f,max
, ∀j ∈ {Ωfl ∪ Ωftr }, ∀f ∈ ΩF (16)
Ij,t (ω t ) ≤ Ij,t to be repaired is Ne , then the number of child nodes for v0 is
2N e . The Q(v) and N(v) of each node v are updated during the
which determines the operation strategies of switching de-
implementation of the MCTS. When the iterative computation
vices for supply restoration or network reconfiguration accord-
budget of the MCTS is reached (e.g., constraint of time, iteration
ing to the power flow distribution.
or memory), we can identify a complete path of the search tree
The DN-MS problem is formulated as a multi-stage SP:
denoted {v0 , v1 , . . . , vT }, with the tree edges representing the
min (11) set of optimal maintenance strategies {a0 , a1 , . . . , aT −1 } and
the resulting failure probability vectors {s1 , s2 , . . . , sT }.
s.t. (12)–(16) (17)
The above process offers the general computation framework
which is different from the existing two-stage SP models, which of the MCTS based methods for solving the DN-MS. In subsec-
provide only a single recourse opportunity under uncertainty. tion B, we briefly introduce the computation procedure of the
horized licensed use limited to: NUST School of Electrical Engineering and Computer Science (SEECS). Downloaded on November 18,2024 at 09:11:56 UTC from IEEE Xplore. Restrictions ap
SHANG et al.: STOCHASTIC MAINTENANCE SCHEDULES OF ACTIVE DN BASED ON MONTE-CARLO TREE SEARCH 3945
basic MCTS and its gaps for solving (17). To bridge the gaps, in
subsection C we propose the S-MCTS, which is customized
from the B-MCTS by introducing two novel computation
strategies.
ω t ∈ΩP
(Q(wt )) and approximated by M
t
C. Stochastic MCTS Method T
realizations of the random process ω t . A detailed example is
In this subsection, we provide the customizations that we provided in Appendix B to show how the multistage uncertainty
made to extend B-MCTS into S-MCTS. The implementation is incorporated for node reward estimation.
horized licensed use limited to: NUST School of Electrical Engineering and Computer Science (SEECS). Downloaded on November 18,2024 at 09:11:56 UTC from IEEE Xplore. Restrictions ap
3946 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 35, NO. 5, SEPTEMBER 2020
horized licensed use limited to: NUST School of Electrical Engineering and Computer Science (SEECS). Downloaded on November 18,2024 at 09:11:56 UTC from IEEE Xplore. Restrictions ap
SHANG et al.: STOCHASTIC MAINTENANCE SCHEDULES OF ACTIVE DN BASED ON MONTE-CARLO TREE SEARCH 3947
TABLE IV
PERFORMANCE COMPARISON OF INCORPORATING NETWORK
TABLE II RECONFIGURATION STRATEGY AND DER UNCERTAINTIES
PREVENTIVE MAINTENANCE SCHEDULES
horized licensed use limited to: NUST School of Electrical Engineering and Computer Science (SEECS). Downloaded on November 18,2024 at 09:11:56 UTC from IEEE Xplore. Restrictions ap
3948 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 35, NO. 5, SEPTEMBER 2020
TABLE V TABLE VI
COMPUTATION TIME OF DIFFERENT ALGORITHMS COMPARISON OF ALGORITHMS IN THE FULL-SCALE TEST SYSTEM
horized licensed use limited to: NUST School of Electrical Engineering and Computer Science (SEECS). Downloaded on November 18,2024 at 09:11:56 UTC from IEEE Xplore. Restrictions ap
SHANG et al.: STOCHASTIC MAINTENANCE SCHEDULES OF ACTIVE DN BASED ON MONTE-CARLO TREE SEARCH 3949
schedule involves only 1 equipment over two stages (i.e., stage DEFAULTPOLICY starts from nodev1,1 , action a1 is set to 0
0 and stage 1). randomly, which yields the terminal nodev2,2 and concludes
Figure 5 depicts two trees, i.e., the S-MCTS and a scenario this trajectory. Then, the expected rewards of expanded nodes
tree. The S-MCTS originates from the root node v0 and (in this case, node v1,1 ) are estimated based on the SAA in
ends at a terminal node (e.g., v2,2 ). v0 corresponds to the a backward direction. So, for v1,1 its reward is approximated
equipment failure probability λ0 . The tree branches represent as minus the average costs taking actiona = {1, 0}under 6 sce-
different PM decisions. Take a0 as an example, a0 = 1 leads to narios, i.e., 16 6n=1 Q(ω n ; a). Note that according to (27), the
the state transition from v0 to v1,1 , while a0 = 0 leads to ln N (vp )
term Cp N (vc ) is added into the reward value for address-
v1,2 . Totally, there are four possible maintenance schedules
ing the exploration-exploitation dilemma. If some constraints
that yield four state transition trajectories. The scenario tree
are violated the estimated value is further biased by the term
provides possible realizations of uncertainties. We assume 6
− 3l=1 max(0, penaltyl ). After that the newly estimated re-
scenarios over the two stages, i.e.,
ward value is compared to the stored value in the previous
{(ω1,1 , ω2,1 ), (ω1,1 , ω2,2 ), (ω1,2 , ω2,3 ), (ω1,2 , ω2,4 ), (ω1,3 , iteration, the bigger will be saved.
horized licensed use limited to: NUST School of Electrical Engineering and Computer Science (SEECS). Downloaded on November 18,2024 at 09:11:56 UTC from IEEE Xplore. Restrictions ap
SHANG et al.: STOCHASTIC MAINTENANCE SCHEDULES OF ACTIVE DN BASED ON MONTE-CARLO TREE SEARCH 3951
TABLE VIII [20] D. Silver et al., “Mastering the game of go with deep neural networks and
BUDGETS AND COSTS PER CM/PM ACTIVITY tree search,” Nature, vol. 529, no. 7587, pp. 484–489, 2016.
[21] D. Whitehouse, E. Powley, and P. Cowling, “Determinization and informa-
tion set Monte Carlo tree search for the card game Dou Di Zhu,” in Proc.
IEEE Conf. Comput. Intell. Games, Seoul, South Korea, 2011, pp. 87–94.
[22] M. Yildirim, X. A. Sun, and N. Z. Gebraeel, “Sensor-driven condition
based generator maintenance scheduling—Part I: Maintenance problem,”
IEEE Trans. Power Syst., vol. 31, no. 6, pp. 4253–4262, Nov. 2016.
[23] C. Chen, W. Wu, B. Zhang, and C. Singh, “An analytical adequacy
evaluation method for distribution networks considering protection strate-
gies and distributed generators,” IEEE Trans. Power Del., vol. 30, no. 3,
pp. 1392–1400, Jun. 2015.
REFERENCES [24] A. Shapiro, “Analysis of stochastic dual dynamic programming method,”
Eur. J. Oper. Res., vol. 209, pp. 63–72, 2011.
[1] J. Zhong, W. Li, C. Wang, J. Yu, and R. Xu, “Determining optimal
[25] E. Arriagada, E. López, C. Roa, M. López, and J. Vannier, “A stochastic
inspection intervals in maintenance considering equipment aging failures,”
economic dispatch model with renewable energies considering demand
IEEE Trans. Power Syst., vol. 32, no. 2, pp. 1474–1482, Mar. 2017.
and generation uncertainties,” in Proc. IEEE Grenoble Conf., Grenoble,
[2] L. Bertling, R. Allan, and R. Eriksson, “A reliability-centered asset
2013, pp. 1–6.
maintenance method for assessing the impact of maintenance in power
[26] N. Abdel-Karim, E. J. Nethercutt, J. N. Moura, T. Burgess, and T. C.
distribution systems,” IEEE Trans. Power Syst., vol. 20, no. 1, pp. 75–82,
Ly, “Effect of load forecasting uncertainties on the reliability of North
Feb. 2005.
American bulk power system,” in Proc. IEEE PES General Meeting |
[3] P. Dehghanian, M. Firuzabad, F. Aminifar, and R. Billinton, “A compre-
Conference & Exposition, National Harbor, MD, 2014, pp. 1–5.
hensive scheme for reliability centered maintenance in power distribution
[27] V. K. Koumousis and C. P. Katsaras, “A Saw-tooth genetic algorithm
systems—Part I: Methodology,” IEEE Trans. Power Del., vol. 28, no. 2,
combining the effects of variable population size and reinitialization to
pp. 761–770, Apr. 2013.
enhance performance,” IEEE Trans. Evol. Comput., vol. 10, no. 1, pp. 19–
[4] P. Dehghanian, M. Firuzabad, F. Aminifar, and R. Billinton, “A Compre-
28, Feb. 2006.
hensive scheme for reliability-centered maintenance in power distribution
[28] J. C. Stacchini, M. B. Filho, and M. L. R. Roberto, “A genetic-based
systems—Part II: Numerical Analysis,” IEEE Trans. Power Del., vol. 28,
methodology for evaluating requested outages of power network ele-
no. 2, pp. 771–778, Apr. 2013.
ments,” IEEE Trans. Power Syst., vol. 26, no. 4, pp. 2442–2449, Nov. 2011.
[5] M. Khodaei Tehrani, A. Fereidunian, and H. Lesani, “Financial planning
[29] E. K. Burke and A. J. Smith, “Hybrid evolutionary techniques for the
for the preventive maintenance of power distribution systems via fuzzy
maintenance scheduling problem,” IEEE Trans. Power Syst., vol. 15, no. 1,
AHP,” Complexity, vol. 21, pp. 36–46, Oct. 2014.
pp. 122–128, Feb. 2000.
[6] H. M. Shourkaei, A. A. Jahromi, and M. F. Firuzabad, “Incorporating
[30] J. Zhu, P. Xuan, P. Xie, C. Hong, and W. Yan, “Generation and transmission
service quality regulation in distribution system maintenance strategy,”
equipment maintenance scheduling with load transfer,” in Proc. IEEE
IEEE Trans. Power Del., vol. 26, no. 4, pp. 2495–2504, Oct. 2011.
Power Energy Soc. General Meeting, Chicago, IL, 2017, pp. 1–5.
[7] A. Abiri-Jahromi, M. Fotuhi-Firuzabad, and E. Abbasi, “An efficient
[31] A. M. Eldurssi and R. M. O’Connell, “A fast nondominated sorting guided
mixed-integer linear formulation for long-term overhead lines mainte-
genetic algorithm for multi-objective power distribution system reconfig-
nance scheduling in power distribution systems,” IEEE Trans. Power Del.,
uration problem,” IEEE Trans. Power Syst., vol. 30, no. 2, pp. 593–601,
vol. 24, no. 4, pp. 2043–2053, Oct. 2009.
Mar. 2015.
[8] V. Aravinthan and W. Jewell, “Optimized maintenance scheduling for
[32] J. D. Foster, A. M. Berry, N. Boland, and H. Waterer, “Comparison of
budget-constrained distribution utility,” IEEE Trans. Smart Grid, vol. 4,
mixed-integer programming and genetic algorithm methods for distributed
no. 4, pp. 2328–2338, Dec. 2013.
generation planning,” IEEE Trans. Power Syst., vol. 29, no. 2, pp. 833–843,
[9] A. D. Janjic and D. S. Popovic, “Selective maintenance schedule of
Mar. 2014.
distribution networks based on risk management approach,” IEEE Trans.
Power Syst., vol. 22, no. 2, pp. 597–604, May 2007.
[10] K. Ahadi and K. M. Sullivan, “Approximate dynamic programming for
selective maintenance in series–Parallel systems,” IEEE Trans. Rel., to be
published, 2019, doi: 10.1109/TR.2019.2916898.
[11] S. K. Abeygunawardane, P. Jirutitijaroen, and H. Xu, “Power system
maintenance planning using value function approximation,” in Proc. Int.
Conf. Probabilistic Methods Appl. Power Syst., 2014, pp. 1–7. Yuwei Shang received the B.E. degree in electri-
[12] H. Mirsaeedi, A. Fereidunian, S. M. Hosseininejad, and H. Lesani, “Elec- cal engineering from Zhejiang University, Zhejiang,
tricity distribution system maintenance budgeting: A reliability-centered China, in 2013 and the M.E. degree in electri-
approach,” IEEE Trans. Power Del., vol. 33, no. 4, pp. 1599–1610, cal engineering in 2015 from Tsinghua University,
Aug. 2018. Beijing, China, where he is currently working toward
[13] H. Mirsaeedi, A. Fereidunian, S. M. Hosseininejad, P. Dehghanian, and H. the Ph.D. degree. His research interests include active
Lesani, “Long-term maintenance scheduling and budgeting in electricity distribution system analysis & control, power distri-
distribution systems equipped with automatic switches,” IEEE Trans. Ind. bution asset management, machine learning and its
Informat., vol. 14, no. 5, pp. 1909–1919, May 2018. application in energy system.
[14] H. G. Williams and E. Patelli, “Maintenance strategy optimization for
complex power systems susceptible to maintenance delays and operational
dynamics,” IEEE Trans. Rel., vol. 66, no. 4, pp. 1309–1330, Dec. 2017.
[15] B. Basciftci, S. Ahmed, N. Z. Gebraeel, and M. Yildirim, “Stochastic opti-
mization of maintenance and operations schedules under unexpected fail-
ures,” IEEE Trans. Power Syst., vol. 33, no. 6, pp. 6755–6765, Nov. 2018.
[16] R. Rocchetta et al., “A reinforcement learning framework for optimal Wenchuan Wu (Senior Member, IEEE) received the
operation and maintenance of power grids,” Appl. Energy, vol. 241, B.S., M.S., and Ph.D. degrees from Tsinghua Univer-
pp. 291–301, 2019. sity, Beijing, China. He is currently a Professor and
[17] A. Bhattacharya, J. P. Kharoufeh, and B. Zeng, “Managing energy storage the Director of Electric Power System Research Insti-
in microgrids: A multistage stochastic programming approach,” IEEE tute, Department of Electrical Engineering, Tsinghua
Trans. Smart Grid, vol. 9, no. 1, pp. 483–496, Jan. 2018. University. His research interests include energy man-
[18] R. Coulom, “Efficient selectivity and backup operators in Monte-Carlo agement system, active distribution system operation
tree search,” in Proc. 5th Int. Conf. Comput. Games, Turin, Italy, 2006, and control, machine learning and its application in
pp. 72–83. energy system. He is an IET Fellow, and an Associate
[19] C. Browne et al., “A survey of Monte Carlo tree search methods,” IEEE Editor for IET Generation, Transmission & Distribu-
Trans. Comput. Intell. AI Games, vol. 4, no. 1, pp. 1–49, Mar. 2012. tion and IET Energy Systems Integration.
horized licensed use limited to: NUST School of Electrical Engineering and Computer Science (SEECS). Downloaded on November 18,2024 at 09:11:56 UTC from IEEE Xplore. Restrictions ap
3952 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 35, NO. 5, SEPTEMBER 2020
Jiawei Liao received the B.S. degree in Internet Jian Su received the M.S. degree in power electric
of Things from Northeastern University, Shenyang, system and automation from China Agricultural Uni-
China, in 2013. He is currently working toward versity, Beijing, China, in 1996. He is currently a
the master’s degree in software engineering with Senior Engineer with China Electric Power Research
Peking University, Beijing, China. His research inter- Institute, Beijing, China. He is the Convener of IEC
ests include object detection and deep reinforcement SC 8B WG4, and the Vice Chairman of CIGRE C6
learning. National Committee of China. His research interests
include planning and automation of power distri-
bution system, integration of distributed renewable
generations.
horized licensed use limited to: NUST School of Electrical Engineering and Computer Science (SEECS). Downloaded on November 18,2024 at 09:11:56 UTC from IEEE Xplore. Restrictions ap