Full Text
Full Text
Abstract—In this paper, a novel dynamic multi-objective opti- behaviors, the DMOAs are required to timely update the
mization algorithm (DMOA) is proposed based on a designed obtained Pareto solutions to ensure the convergence in each
hierarchical response system (HRS). Named as HRS-DMOA, environment. In order to track the time-varying Pareto front,
the proposed algorithm mainly aims at integrating merits from
the mainstream ideas of dynamic behavior handling (i.e., the it is of vital significance for the DMOAs to make effective
diversity-, memory-, and prediction-based methods) so as to make responses to the environmental changes, which is quite a
flexible responses to environmental changes. In particular, by two challenging issue.
pre-defined thresholds, the environmental changes are quantified To address above basic and important issue in handling
as three levels. In case of a slight environmental change, the DMOPs, the existing DMOAs can be generally divided into the
previous Pareto set-based refinement strategy is recommended,
while the diversity-based re-initialization method is applied in memory- [45], [59], prediction- [42], [48]–[50], [57], [70], and
case of a dramatic environmental change. For changes occur- diversity-based methods [12], [35], [36]. Additionally, some
ring in a medium level, the transfer-learning-based response is hybrid algorithms have also been proposed [8], [21], [31], [53],
adopted to make full use of the historical searching experiences. [69], and in particular, a novel trend of developing DMOAs
The proposed HRS-DMOA is comprehensively evaluated on a has emerged recently, which combines the memory mechanism
series of benchmark functions, and the results show an improved
comprehensive performance as compared with four popular and the prediction method, where the transfer learning (TL)
baseline DMOAs in terms of both convergence and diversity, technique has been adopted to make full use of history knowl-
which also outperforms other two state-of-the-art DMOAs in edge to accelerate convergence in a new environment, see [25],
10 out of 14 testing cases, exhibiting the competitiveness and [26], [33], [39], [55], [60] for some successful applications.
superiority of the algorithm. Finally, extensive ablation studies For a clear inspection of above mainstream algorithms, their
are carried out, and from the results, it is found that as compared
with randomly selecting the response methods, the proposed HRS main ideas are summarized in Table I (see Section II-B for
enables more reasonable and efficient responses in most cases. more discussions of their advantages and disadvantages).
In addition, the generalization ability of the proposed HRS as
a flexible plug-and-play module to handle dynamic behaviors is TABLE I: Overview of the mainstream DMOAs
proven as well.
Mainstream algorithms Main idea
Index terms— Dynamic multiobjective optimization algo- Enrich or maintain diversity of
rithm (DMOA); transfer learning (TL); hierarchical response Diversity-based
the population for sufficient search
system; evolutionary algorithm Recall the useful historical Pareto
Memory-based
solutions in the new environment
I. I NTRODUCTION Predict the varying Pareto set
Prediction-based
in advance to accelerate convergence
Dynamic multiobjective optimization problems (DMOPs) Apply several different strategies
composed of conflicting objective functions are inevitably Hybrid methods
to collaboratively handle changes
encountered in many real-world scenes [9], [14], [62], [66],
[67], where both the objectives and constraints may change It should be highlighted that all of the existing mainstream
with time [5], [12], and this has attracted wide research DMOAs have already been proven effective in many cases,
attention to the design of effective dynamic multiobjective which are popular and reliable in coping with the environ-
optimization algorithms (DMOAs) [6], [15], [20], [32], [34], mental changes. In [35], a novel coevolutionary multi-swarm
[44]. The population-based evolutionary algorithms have been particle swarm optimizer has been proposed to solve DMOPs,
proven to be effective under various optimization scenarios where once a change is detected, 20% of the swarm is gen-
in searching for the optimal solutions [2], [4], [7], [38], [40], erated randomly to enhance the population diversity, and the
[54], [56]. Particularly, owing to the wide existence of dynamic experimental results have shown that the proposed algorithm
This work was supported in part by the Natural Science Foundation of performs well in the rapidly changing environment. In [45], an
China under Grant 62073271, the National Science and Technology Major explicit memory has been adopted to store non-dominated so-
Project of China under Grant J2019-I-0013-0013, the Independent Innovation lutions, where a novel minimum distance search-based updat-
Foundation of AECC of China under Grant ZZCX-2018-017, and the Korea
Foundation for Advanced Studies. (Corresponding author: Nianyin Zeng) ing technique is used. When there is an environmental change,
H. Li, C. Lan, P. Wu and N. Zeng are with the Department of Instrumental the stored solutions are reused in later stages for population
and Electrical Engineering, Xiamen University, Fujian 361005, China. Email: initialization, and from the results, the proposed algorithm has
[email protected]
Z. Wang is with the Department of Computer Science, Brunel University shown its competitiveness in tracking the true Pareto front.
London, Uxbridge UB8 3PH, U.K. Email: [email protected] An individual-based self-learning prediction method has been
Copyright © 2023 Institute of Electrical and Electronics Engineers (IEEE). Personal use of this material is permitted. Permission from IEEE must be
obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes,
creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works by
sending a request to [email protected]. For more information, see https://www.ieee.org/publications/rights/rights-policies.html
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final
publication. Citation information: DOI10.1109/TCSS.2023.3293331, IEEE Transactions on Computational Social Systems
FINAL VERSION 2
proposed in [42], whose major innovation is the employment to some other algorithms.
of adjustable reference points, which can effectively alleviate A noticeable issue is that the most existing methods have
the situation of inaccurate prediction caused by the non- paid a great many of attention to designing effective response
uniform Pareto front. From the experiments, it is found that strategies, while the in-depth analysis is relatively limited on
the proposed algorithm can well balance the convergence the dynamic behaviors (e.g., the changing severity), whose
and diversity. Similarly, an adaptive reference vector-based precise characterization is conducive to tracking the varying
adjustment strategy has been introduced in [70] along with a Pareto front [18], [46]. Even in those emerging hybrid al-
linear prediction strategy, whose effectiveness is demonstrated gorithms that have applied different novel response methods,
on twelve functions with diverse dynamic characteristics. few attention has been paid to the quantification of the envi-
Aiming at recovering diversity in a short time, the authors ronmental changes, which motivates us to cover this research
in [21] have devised a subspace-based diversity maintenance gap. Additionally, in case of dynamic behaviors, it would be
strategy, which can identify gaps between population distri- tough to figure out the useful historical experiences in the
bution to maintain the diversity regardless of the environ- absence of further analysis about response-making; in cases
mental changes. Moreover, another layered prediction strategy of some slight changes, introducing the new individuals seems
has been proposed in [21], which benefits making prompt unnecessary and moreover, the prediction-based responses
responses and improving the accuracy of predicted evolution- might lead the evolution to an inappropriate direction when
ary direction. Similarly, a hybrid DMOA of prediction- and dramatic changes exist in the environment. In this regard, the
diversity-based strategy has been developed in [69], where it is DMOAs should be sensitive to the environmental changes to
regarded that too much reliance on the prediction method may enable both timely and appropriate responses. Accounting for
reduce the diversity. Therefore, the authors have employed the respective merits of the previously mentioned methods,
several mechanisms to generate different seed sets, which a promising way to endow the DMOAs with comprehensive
obtains satisfactory results on series of benchmark evalua- performances is to make efficient and rational integration of
tions, and the proposed algorithm is also promising to handle diverse response mechanisms.
the unpredictable DMOPs. Additionally, the memory- and Motivated by above discussions, in this study, we aim to
prediction-based strategy have been combined in [31], where design a novel DMOA capable of quantitatively measuring
the similarity among the environmental changes is compared. the environmental changes so as to take appropriate response
If the current detected change is dissimilar to any historical strategies accordingly. To be specific, in case of a slight en-
records, then individuals in the new environment are predicted vironmental change, the previous Pareto set-based refinement
based on the differential population center in the previous two strategy is recommended to save unnecessary computations
environments. Otherwise, memory mechanism is adopted to and track the almost unchanged Pareto front. On the contrary,
deal with the similar changes, and the experimental results in case of a dramatic change, an intuitive idea (of introducing
have shown the robustness of the proposed hybrid strategy. individuals in the new environment) is adopted due to the fact
Recently, more and more advanced DMOAs have been that searching for Pareto solutions in a totally different envi-
proposed. To cope with the irregular patterns in stochastic ronment can be directly regarded as solving a new problem. In
changes, a Mahalanobis distance-based approach has been addition to above two extreme cases, the TL-based response
developed in [22] to estimate the correlation between the strategy is applied to accelerate the convergence, whose main
current environment and the previous ones, which performs idea is to transfer useful history knowledge to provide a
significantly better than some latest algorithms when handling high-quality initial population in the new environment. It is
the stochastic DMOPs. In [61], a knowledge-guided Bayesian noticeable that different from those hybrid methods where
(KGB) classification method has been proposed to make several types of advanced strategies have been designed, the
robust prediction, which follows the same idea of TL-based main innovation of this study is to rationally adopt appropriate
DOMAs to sufficiently exploit the history information, and responses based on the quantification of the changes, which is
the effectiveness of the proposed KGB is demonstrated on supposed to encourage and promote more in-depth investiga-
different test suites. tions of the dynamic behaviors in DMOPs.
In [60], the authors have developed a clustering difference- In this paper, a novel dynamic multi-objective optimization
based TL method to solve DMOPs, where the k-means al- algorithm is proposed based on a hierarchical response system
gorithm is applied to divide the population into five clusters, (HRS), where the environmental changes are quantified and
whose centroid is then adopted to construct the target domain divided into three levels. The proposed algorithm (named
by the first-order difference. By increasing the similarity be- as the HRS-DMOA) is, in essence, a two-stage algorithm
tween source and target domains, the phenomenon of negative where 1) half of the non-dominated solutions in the previous
transfer is expected to be alleviated. For the same purpose, environment are firstly selected as the sensors to quantify the
a knowledge reconstruction method has been proposed in dynamic behaviors; and accordingly, based on two predefined
[17], where the fuzzy neural network is applied to extract thresholds, 2) three response strategies can be later adopted
domain knowledge from two successive Pareto sets, which to generate the initial population in the new environment to
is evaluated and screened via a pre-designed mechanism. accelerate the convergence. In particular, the diversity-based
Consequently, the suitable knowledge can be selected to guide strategy is adopted in case of severe changes, and when the
the evolution, and the proposed algorithm has presented both changes are in a medium level, the novel frontier of TL-based
better convergence and diversity performance in comparison response method is considered, which combines the popular
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final
publication. Citation information: DOI10.1109/TCSS.2023.3293331, IEEE Transactions on Computational Social Systems
FINAL VERSION 3
responses with the stored solutions in the memory, which mo- A. Environmental Change Quantification
tivates the development of the prediction-based methods due As previously mentioned, one of the most important issues
to correlations between consecutive environments may benefit in handling DMOPs is to make effective responses to the
further exploration on the stored solutions. For example, in dynamic behaviors therein, of which the premise is to detect
[72] and [68], the knee- and center-points of the population the environmental changes. The popular change detection
have been employed to predict the trace of PS, respectively. manner contains the population- and sensor-based methods [1],
In the predictive strategy proposed in [28], different types [44], and in this study, we hold the belief that it is important to
of special points have been sufficiently utilized with three not only detect but also quantify the environmental changes,
mechanisms. An ensemble learning-based prediction strategy which is conducive to adopting appropriate response strategies.
has been proposed in [52], which aims at overcoming the In this regard, half of the non-dominated solutions in the
shortcomings of inaccurate and unstable prediction. Similarly, previous environment are selected to form the sensor set S,
several sub-prediction models have been integrated in [16], which is used to estimate the change degree (denoted by CD)
and the proposed ensemble method is applied to estimate of objective function as [46]:
the fitness of individuals in the new environment, which has
X fi,j (t) − fi,j (t − 1)
presented better robustness than the single prediction model. CDi = , i = 1, 2, ..., m (5)
Above methods have concentrated on developing various novel fi,j (t − 1) + µ
j∈S
prediction strategies, which can accelerate the convergence in
the new environment and accordingly save some searching where m is the number of objective functions, fi,j (t) denotes
efforts, while as pointed out in some work, the prediction the i-th fitness value of sensor j in environment t, and
model may also suffer from the inaccurate tracking of PS and µ = 0.001 is a smoothing value that avoids the denominator
the extra computational costs [16], [52], especially when the equaling to zero. Then, the overall change degree of the
change is either too severe or slight. environment is defined as:
Considering that those historical experiences may still have CD = λ max {CDk } (6)
1≤k≤m
some reference values to search in the new environment,
the TL technique has paved a feasible path in solving the where λ is an amplification factor, which is set to m − 1 as
DMOPs by combining the memory- and prediction-based the same in [46].
strategy, whose idea is to transfer the useful knowledge to
assist learning in similar tasks. Under the assumption that there
B. Hierarchical Response System
may be inherent associations among individuals in successive
environments, developing TL-based DMOAs is an emerging In the proposed HRS-DMOA, the essence of response is to
and promising direction [51], where an important concern is re-initialize the population to start searching in a new environ-
to alleviate the negative transfer phenomenon, which may lead ment, where the major concern is how to improve reliability
the evolution towards wrong directions. of the initialized population. Hence, a hierarchical response
Based on above discussions, it is clear that each kind mode is taken to realize tailored and appropriate reactions
of the mainstream strategy in solving DMOPs has the own to environmental changes in different extents. To be specific,
advantages and disadvantages without a certain one always based on the change degree CD of environment in Eq. (6), two
being the best choice. More importantly, it seems feasible predefined thresholds LT and HT are used to divide CD into
and promising to realize the complementary performance by three levels, and accordingly, the response modes of refinement
quantifying the environmental changes. As previously men- (if CD < LT ), TL (if LT ≤ CD ≤ HT ), and re-initialization
tioned, if the environment is dramatically changed, it might be (if CD > HT ) are adopted.
better to re-initialize the population to restart searching than When CD is fewer than the lower-threshold LT , it is
to figure out the useful history solutions to accomplish the deemed that the change is negligible, that is, the new envi-
initialization. On the contrary, in case of a very slight change, ronment (denoted as t) is similar to the previous one (denoted
it might be unnecessary to apply any advanced prediction as t − 1). Therefore, P St−1 is encouraged to keep improving
models. Consequently, in this paper, we aim to realize an the convergence and searching for the Pareto solutions, where
efficient and rational integration of different response strategies mutation operators are adopted to further supplement the
based on the quantification of environmental changes, which diversity. If CD is larger than the higher-threshold HT , then
provides a novel idea to develop a competitive DMOA with the change is regarded so dramatic that most of the previous
comprehensive performance. searching experience is no longer useful. As a result, such
situation is treated as a new optimization problem and the
evolution is re-started, where only a few solutions in P St−1
are reserved in the re-initialized population. In particular, if
III. M ETHODOLOGY CD falls between LT and HT , then the TL-based response is
applied, where knowledge acquired in previous environments
In this section, the proposed HRS-DMOA is elaborated, is used to train a prediction model, and more details are
whose main idea is to grade the environmental changes via presented in the next subsection.
some quantification procedure so that hierarchical responses It is noticeable that, in essence, LT and HT determine the
can be adopted accordingly. probability of selecting above three response strategies, and
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final
publication. Citation information: DOI10.1109/TCSS.2023.3293331, IEEE Transactions on Computational Social Systems
FINAL VERSION 5
in this paper, (LT, HT ) is set to (0.03, 0.78). In addition, Algorithm 2 Construction of the target domain
the framework of the proposed hierarchical response system Require:
is summarized in Algorithm. 1. Two populations P1 = {x1i }N 2 N
i=1 and P2 = {xi }i=1 in the
new environment, mutation probability pm
Algorithm 1 Hierarchical response system Ensure:
Require: Target domain Dt
Predefined lower-threshold LT and higher-threshold HT 1: Initialize Dt = ∅
Ensure: 2: For n from 1 to N
Response strategy 3: Crossover x1n and x2n to generate x̃1n and x̃2n
1: Calculate change degree (CD) according to 4: Define competition group Gc = {x1n , x2n , x̃1n , x̃2n }
Eqs. (5) and (6) 5: Calculate Q(x), x ∈ Gc according to Eq. (7)
2: If CD < LT 6: Dt ← arg max Q(x)
x∈Gc
3: Refinement 7: EndFor
4: else if CD > HT 8: For each individual x̃ ∈ Dt
5: Re-initialization 9: Generate a random number rand
6: else 10: If rand < pm
7: TL 11: x̃′ = mutation(x̃)
8: EndIf 12: If Q(x̃′ ) ≥ Q(x̃)
13: Substitute x̃ with x̃′
14: EndIf
C. TL-based Population Initialization 15: EndIf
16: EndFor
The essence of transfer learning is to apply the learned
17: Return Dt
knowledge from the source domain Ds to assist solving related
but not the same tasks in the target domain Dt . In the pro-
posed HRS-DMOA, the main idea of the TL-based response
the implementation details of above TL-based initialization
is to make full use of the historical searching experiences
strategy are presented.
to initialize population in the new environment, which is
required to have high quality to accelerate the convergence
[26]. Consequently, PS in the previous environment with some D. Overall Framework of HRS-DMOA
mutations is selected as the source domain Ds . In addition, to The overall flowchart of the proposed HRS-DMOA is shown
guarantee knowledge is transferred towards a correct direction, in Fig. 1, where the main contributions (i.e., the proposed
another group of transfer reference points (TRPs) is screened HRS) are illustrated in the red dashed box. In addition, the
in the new environment to form the target domain Dt . To support vector machine (SVM) [10] is adopted as the base
be specific, a local search strategy is applied to assign each learner in the TL-based response strategy, and the MOEA/D
individual with a quality factor Q(·) as [3]: [64] is employed as the static optimizer, which is a rep-
resentative multi-objective problem solver that decomposes
Q(x) = min max fj (z, t)−fj (x, t) , j ∈ {1, 2, · · · , m} the problem into several scalar sub-problems. Since that the
z∈P \{x}
(7) MOEA/D has been successfully applied in various situations,
where P stands for the population, and the individuals with it is deemed that MOEA/D is competent in searching for the
larger Q(·) value are deemed to have better quality. Then, Pareto solutions in each individual environment when solving
based on Eq. (7), tournament is performed between two a DMOP.
populations in the new environment to obtain a set of TRPs to Explanations of the proposed hierarchical response modes
form the Dt . The details are displayed in Algorithm. 2, where are further summarized as follows.
simulated binary crossover and polynomial mutation operators (1) Refinement. At time t, polynomial mutation is performed
are applied to further enhance the diversity. on P St−1 to enhance the diversity, and Pini is obtained
Let T = {X, Y } denote the training set, where X = by selection from the augmented P St−1 .
Ds ∪ Dt and Y = {0, 1} is the ground-truth label determined (2) TL. At time t, the TL-based initialization is applied to
by the domination relationship at the new time t (Y = 1 generate Pini , where the augmented P St−1 is adopted
for non-dominated individuals). In the proposed HRS-DMOA, as the source domain, and a group of TRPs is screed to
the transfer learning process is realized via the sample-based form the target domain (see Section III-C for details).
TrAdaboost technique [11], where several base learners are (3) Re-initialization. At time t, only a few of the individuals
trained by approximating the mapping from X to Y , and in P St−1 are reserved in Pini , whereas the rest of Pini
the weight of training samples is updated according to the is directly generated by random initialization.
weighted errors on Dt of the trained classifier. Then, the It is noticeable that the TL technique is only adopted
ensemble learning method is adopted to form a strong classifier when the environmental changes are regarded in a medium
H(·) which is later used to generate an initial population level. The merits of this setting contain mainly two aspects.
with high quality in the new environment. In Algorithm. 3, On one hand, if the environment slightly changes, it seems
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final
publication. Citation information: DOI10.1109/TCSS.2023.3293331, IEEE Transactions on Computational Social Systems
FINAL VERSION 6
Fig. 1: Flowchart of HRS-DMOA, where the proposed HRS (in the red dashed box) is responsible for generating the initial
population (Pini ) to the static optimizer, which accomplishes the searching in each environment and can be any
multi-objective problem solver (the MOEA/D is employed in this study). By comparing the calculated change degree (CD)
with the two thresholds, the responses based on TL, refinement, and re-initialization are adopted when LT ≤ CD ≤ HT ,
CD < LT , and CD > HT , respectively. Additionally, the “TRP Acquisition” refers to the construction of target domain Dt .
unnecessary to introduce the prediction model, which will no 5) In the TL-based initialization, SVM is employed as the
doubts burden the computation. On the other hand, dramatic base learner, whose computational complexity is O(N 2 mn)
environmental changes may lead to few correlations between [10].
the successive environments, and in this case, the negative 6) In the re-initialization response mode, the computational
transfer phenomenon easily occurs. As a result, the advantages complexity is O(N mn).
of different strategies are effectively integrated in the proposed 7) In the refinement response mode, only polynomial mu-
hierarchical response system, which is, in fact, a rational tation operation and a selection process are involved. Hence,
combination of them. In Algorithm. 4, a brief pseudocode of the computational complexity is O(N n) + O(N ).
the proposed HRS-DMOA is presented for a clear view. In general, N is far larger than m and n. Consequently,
Remark 1: In the first environment t = 1, the initial the computational complexity of the proposed HRS-DMOA is
population Pini is directly generated by the static optimizer O(N 2 mn).
Opt(·). In other cases, Pini is provided to Opt(·) by the
adopted response strategy. IV. E XPERIMENTAL R ESULTS AND D ISCUSSIONS
In this section, benchmark evaluations are performed to
validate the effectiveness of the proposed HRS-DMOA in
E. Complexity Analysis solving DMOPs, and sufficient ablation studies are carried out
Given that N is the population size, m is the number to verify the superiority of the developed HRS.
of objectives and n is the dimension of decision vector.
The following cases are mainly considered to estimate the A. Experimental Environment
computational complexity of the proposed HRS-DMOA. The proposed HRS-DMOA is comprehensively evaluated
1) MOEA/D is used to search for Pareto solutions in each on 14 DMOPs in the CEC2018 test suites [27], where the
environment, whose computational complexity is O(N mT ) provided benchmark DMOPs can well characterize the prop-
[64], where T is the size of applied neighborhood. erties of dynamic problems in various real-world optimization
2) The computational complexity is O(n) for both polyno- scenarios, such as time-varying PS and disconnected PF, etc.
mial mutation and simulated binary crossover. The 14 benchmark functions are named as DF1 to DF14,
3) The computational complexity is O(N ) when calculating and one can find more information of the applied test suites
the fitness value of the sensors in a new environment. in [27]. For the evaluation metrics, the inverted generational
4) The computational complexity is O(N m) when calculat- distance (IGD) and the hypervolume (HV) are adopted. To
ing the quality factor Q(·) to select transfer reference points be specific, IGD reflects the convergence of algorithms by
to form Dt . measuring distance between the obtained PF and the real one.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final
publication. Citation information: DOI10.1109/TCSS.2023.3293331, IEEE Transactions on Computational Social Systems
FINAL VERSION 7
and severity of change, respectively. On each benchmark, three worse than the KT-DMOEA, which shows the competitiveness
groups of evaluation are performed based on different dynamic of our method.
parameters, including (nt = 5, τt = 10), (nt = 10, τt = 5) 2) Diversity: According to Table IV, HRS-DMOA achieves
and (nt = 10, τt = 10). For a fair comparison, all algorithms half of the best results in terms of MHV, and the sub-optimal
are evaluated with the same parametric conditions on each algorithm Tr-DMOEA wins in 14 cases, which implies the
benchmark, where the dimension of test problems is set to advantage of the proposed algorithm regarding to the compre-
10, the population size for bi- and tri-objective optimization hensive performance. Furthermore, as displayed in Table V,
problem is set to 100 and 150, respectively. 10 epochs of it is found that the proposed HRS-DMOA obtains 22 best
search in each individual environment is fixed for all DMOAs, MMS out of the 42 results, and in totally 106 cases, our
that is, the searching times for Pareto solutions by the applied method performs significantly better than other comparison
static optimizer in all algorithms are the same and, by doing so, models. Combining the results reported in both Table IV and
the effectiveness of the responses to the environmental changes Table V, it can be concluded that the proposed algorithm yields
can be reflected to some extents. Moreover, considering that considerable diversity as well. While on the problem DF6, the
while applying the TL technique to make responses, it is MHV of our method is not so satisfactory as compared with
inevitable to consume some evaluations in advance at the other algorithms, which may due to that the time-varying PF
new environment to establish the target domain, for those of DF6 has the geometric property of long-tails and knee-
algorithms that do not involve extra evaluations, six more regions. By observing the response manner of our method
epochs of search for the static optimizer are added to promote when solving DF6, it is found that in almost 65.3% of the total
the fairness of comparison. 50 environment, the initial population is randomly generated
All algorithms have run 20 times individually on each within the re-initialization mode. Simultaneously, it can be
benchmark to alleviate the influence of randomness, and inferred that the negative transfer phenomenon may occur in
results in average level are reported along with the Wilcoxon the left 34.7% cases where the TL-based response is adopted,
rank sum test at the significance level of 0.05 [13]. Comparison as in the unstable environment it is hard to accurately identify
results in terms of MIGD, MHV, and MMS are presented in the useful knowledge. Consequently, the random initialization
Table III, Table IV, and Table V, respectively, where “+/-” is performed in most cases, and recalling the historical experi-
indicates that the proposed HRS-DMOA is significantly bet- ences makes few contributions, which collaboratively lead to
ter/worse than the corresponding algorithm, and “=” denotes a poor performance.
there is no significant difference between the two algorithms It is also noticeable that the PF of problem DF2 keeps
in a statistical sense. unchanged in a period of time (while the PS changes), which
requires the algorithms to not only monitor environmental
B. Benchmark Evaluations Results changes, but also maintain the population diversity to obtain
1) Convergence: As reported in Table III, HRS-DMOA considerable result. From the reported results, with different
achieves 26 out of 42 best results in terms of MIGD, which dynamic parameter settings, the proposed HRS-DMOA ob-
performs significantly better than KT-DMOEA, Tr-DMOEA, tains almost all the best MHV and MMS on DF2, exhibiting
CR-DMOEA and DNSGA-II-B in 32, 35, 29 and 39 cases, the superiority of HRS in handling dynamic behaviors with
respectively, reflecting the reliability of the proposed method multiple concerns.
in adapting to the varying environments in DMOPs. According To sum up, the proposed HRS-DMOA ranks first in most
to the Wilcoxon rank sum test, there are 135 cases where HRS- testing cases, which is competent in handling complex dy-
DMOA performs significantly better than other algorithms, namic behaviors in DMOPs with satisfactory comprehensive
which shows that the proposed method is a competitive DMOA performance. Moreover, according to the above results, it is
with outstanding convergence. Notice that on the problems found that the DNSGA-II-B has achieved few best results in
DF7, the DNSGA-II-B performs slightly better than the pro- terms of MIGD although a competitive diversity is presented,
posed HRS-DMOA in two situations, which may mainly which indicates that the diversity-based response strategy does
due to the time-varying PF ranges of DF7 are in dissimilar have some limitations in practice. Meanwhile, in various dy-
scales, and simultaneously the corresponding PS center is namic situations, it is also hard to guarantee the performances
fixed, thereby making it extremely hard to realize efficient of DMOAs with only TL-based strategy. Hence, it can be
knowledge transfer towards a proper direction. concluded that the designed HRS is effective and practical,
In addition, for the tri-objective problem DF11, not only the which successfully integrates the merits of different response
centroid of its PF oscillates by expanding and shrinking with strategies.
multiple scales, but also the density of solutions changes with
time, which brings great challenges to the algorithms. On one C. Comparisons with Other Hybrid DMOA
hand, the population is required to rapidly converge with a In this subsection, two additional hybrid methods SVM-
considerable diversity so that the obtained PF can well cover DMOA [24] and PPS-RM-MEDA [71] are employed for
the true one. On the other hand, the population should also comparisons. In Table VI, the results on DF series testing
realize timely escape from the local optimum. Such a high problems are presented, where the number in parenthesis
requirement on maintaining diversity in the objective-space denotes the rank (from 1st to 3rd) of corresponding algorithm,
makes DF11 quite hard to handle, whereas the proposed HRS- and the data of above two comparison models are cited from
DMOA still ranks second on DF11 and performs only slightly [37] and [58], respectively.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final
publication. Citation information: DOI10.1109/TCSS.2023.3293331, IEEE Transactions on Computational Social Systems
FINAL VERSION 9
As can be seen from Table VI, the proposed HRS-DMOA results, and according to our algorithm configuration, it is
ranks first and second in 13 and 14 cases, respectively. In difficult for the static optimizer to always obtain sufficient
particular, on the complex tri-objective benchmark problems high-quality Pareto solutions within only 10 epochs of search
DF13 and DF14, our method yields considerable convergence in each environment, which accounts for that our algorithm
performance, which owes to that the changing severity is may present performance declination as compared with the
taken into account when making response to the environmental optimal one in some situations.
changes, and it enables the generated initial population well In addition, the proposed HRS-DMOA is further compared
adapt to the new environment. It should be pointed out that with the MoE [43], which is a state-of-the-art DMOA that
the static optimizer can also make great contribution to the has employed multiple prediction mechanism. To make a fair
comparison, the basic experimental settings are made the same
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final
publication. Citation information: DOI10.1109/TCSS.2023.3293331, IEEE Transactions on Computational Social Systems
FINAL VERSION 10
as mentioned in [21], and the comparison results on JY series to realize rational and efficient integration of the mainstream
problems [23] are displayed in Table VII, where data of the strategies, thus only several basic response methods are applied
algorithm MoE are cited from [21]. in the proposed HRS-DMOA, which may limit the algorithm
performance as compared with the meticulously designed
According to Table VII, the proposed HRS-DMOA wins
advanced strategies in MoE.
the state-of-the-art MoE in 15 cases, and on problems of JY1
and JY5, the MoE yields better performance than our method. It is also noticeable that on the rest five JY problems,
Particularly, on these two problems, the results obtained by the proposed HRS-DMOA obtains the best results under all
the proposed HRS-DMOA merely reach the level of 10−1 , dynamic parameter settings, which is a promising and inspiring
whose major reason is that the main focus of this study is result as it implies that even the basic methods could realize
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final
publication. Citation information: DOI10.1109/TCSS.2023.3293331, IEEE Transactions on Computational Social Systems
FINAL VERSION 11
TABLE VI: Comparisons with other hybrid algorithms on two state-of-the-art methods, namely KGB [61] and TCD [60],
DF series problems in terms of MIGD respectively, which are integrated to the NSGA-II algorithm
Algorithms as well to form the DMOAs.
Problems nt , τt
SVM-DMOA [24] PPS-RM-MEDA [71] HRS-DMOA According to the results reported in Table VIII, it is found
10, 10 0.4310±4.85e-02(3) 0.0365±7.34e-03(1) 0.0884±1.80e-02(2)
DF1 that our HRS-NSGA-II yields 10 best MIGD out of the 14
10, 5 1.3278±8.55e-02(3) 0.0949±2.11e-02(2) 0.0859±1.94e-02(1)
10, 10 0.2962±2.26e-02(3) 0.0408±8.58e-03(1) 0.0482±1.37e-02(2) benchmarks, which is also an inspiring result. On one hand,
DF2
10, 5 0.8960±6.07e-02(3) 0.0917±7.86e-03(2) 0.0419±1.31e-02(1) it exhibits the superiority of the proposed HRS in comparison
DF3
10, 10 0.5159±5.47e-02(2) 0.2012±6.02e-03(1) 0.5916±1.57e-01(3) to other advanced response approaches to the environmental
10, 5 1.2482±2.07e-01(3) 0.2809±3.57e-02(1) 0.5680±1.44e-01(2)
changes; on the other hand, the proposed HRS is demonstrated
10, 10 0.1219±3.83e-03(1) 1.0118±1.78e-02(3) 0.4310±7.83e-02(2)
DF4
10, 5 0.1990±4.03e-02(1) 1.0861±1.48e-02(3) 0.4296±6.97e-02(2) a flexible plug-and-play module with considerable general-
DF5
10, 10 0.1482±4.32e-02(2) 1.1942±1.30e-02(3) 0.0366±5.65e-03(1) ization ability, which can be integrated with any other static
10, 5 1.2815±2.70e-01(2) 1.3787±6.25e-02(3) 0.0367±9.04e-03(1) optimizer to effectively solve the DMOPs.
10, 10 3.9768±5.77e-01(3) 3.0892±3.07e-01(2) 1.2066±6.06e-01(1)
DF6
10, 5 9.5840±1.09e+00(3) 6.5176±5.38e-01(2) 1.7784±6.44e-01(1)
10, 10 0.5220±5.70e-02(1) 2.9774±2.83e-01(3) 1.1298±1.53e-01(2) E. Sensitivity Analysis of Thresholds LT and HT
DF7
10, 5 0.6959±1.22e-01(1) 5.7018±5.14e-01(3) 1.2286±1.74e-01(2) In the proposed HRS-DMOA, the thresholds LT and HT
10, 10 0.0752±1.12e-02(1) 0.8673±7.93e-03(3) 0.1108±3.15e-02(2)
DF8
10, 5 0.1940±1.42e-02(2) 0.7860±2.10e-02(3) 0.1232±3.16e-02(1)
play important roles in grading the change degree to adopt dif-
10, 10 0.4151±6.63e-02(2) 1.5801±1.13e-02(3) 0.1965±2.64e-02(1) ferent response strategies. Hence, in this subsection, influences
DF9
10, 5 1.0983±1.63e-01(2) 1.6691±3.88e-02(3) 0.1993±3.56e-02(1) of the two thresholds are investigated.
10, 10 0.6037±3.84e-02(3) 0.1389±1.84e-03(1) 0.3230±2.76e-02(2)
DF10
10, 5 0.6188±3.34e-02(3) 0.1874±2.68e-03(1) 0.3283±2.28e-02(2)
10, 10 0.6717±2.15e-03(3) 0.1565±1.09e-02(1) 0.3666±2.20e-03(2)
DF11
10, 5 0.6933±3.06e-03(3) 0.1783±7.75e-03(1) 0.3677±2.78e-03(2)
10, 10 0.3829±3.86e-03(1) 1.1756±5.61e-03(3) 0.6236±1.69e-01(2)
DF12
10, 5 0.4262±6.10e-03(1) 1.1760±5.61e-03(3) 0.4643±1.81e-01(2)
10, 10 0.5733±5.07e-02(2) 1.3815±2.58e-02(3) 0.2527±1.26e-02(1)
DF13
10, 5 1.4883±2.57e-01(2) 1.6325±3.88e-02(3) 0.2556±1.20e-02(1)
10, 10 0.2556±5.73e-02(2) 0.8579±7.83e-03(3) 0.1231±3.97e-03(1)
DF14
10, 5 0.8887±2.44e-01(2) 1.0334±4.22e-02(3) 0.1216±3.71e-03(1)
TABLE VIII: Comparisons against other DMOAs with NSGA-II serving as the static optimizer in terms of MIGD
Algorithms
Problems
KGB-NSGA-II TCD-NSGA-II DNSGA-II-B HRS-NSGA-II
DF1 0.3113±1.39e-03 0.1887±4.93e-02 0.7182±1.54e-01 0.2257±4.71e-02
DF2 0.2198±1.64e-03 0.1146±1.74e-02 0.4638±1.12e-01 0.1380±1.91e-02
DF3 0.9593±1.19e-02 0.5371±6.67e-02 1.0556±2.24e-01 0.4983±2.02e-01
DF4 3.1503±2.84e-01 0.1487±4.20e-02 2.0526±2.28e-01 0.1812±1.76e-02
DF5 0.4004±3.05e-03 0.3427±1.46e-01 0.6813±6.67e-02 0.0232±4.40e-03
DF6 19.8500±4.97e+00 6.6381±6.96e-01 6.0010±1.07e+00 3.3446±9.41e-01
DF7 0.4575±5.85e-03 8.0032±3.77e+00 1.1040±1.23e-01 0.5021±5.62e-02
DF8 0.2130±2.51e-04 0.3057±2.54e-02 0.4431±1.08e-01 0.0441±8.75e-03
DF9 0.5932±6.34e-03 0.5954±1.42e-01 2.2331±5.79e-01 0.2155±3.39e-02
DF10 0.1892±4.71e-04 0.1814±1.70e-02 1.3771±3.14e-01 0.1317±8.44e-03
DF11 10.3910±3.06e-02 0.2670±4.09e-02 1.0163±2.24e-01 0.2108±1.73e-02
DF12 0.5650±3.03e-03 4.4655±5.09e+00 0.7770±6.64e-02 0.2290±2.53e-02
DF13 0.5854±2.49e-03 0.7456±2.24e-01 0.6401±1.10e-01 0.2603±1.30e-02
DF14 0.2267±5.72e-04 0.8972±5.40e-01 0.5049±1.23e-01 0.1907±3.05e-03
investigated, including 10% : 80% : 10%, 15% : 70% : 15% wise by:
and 20% : 60% : 20%, where the corresponding (LT, HT ) fi,j (t) − fi,j (t − 1)
takes value from {(0.01, 0.97), (0.03, 0.78), (0.05, 0.68)}, Scd = max , i = 1, 2, ..., m (12)
fi,j (t − 1) + µ
respectively. Using IGD as the evaluation metric, algorithm
performances with different settings of (LT, HT ) are shown where j stands for the sensor individual.
in Fig. 3, where the dynamic parameters of all 14 benchmark
functions are fixed at τt = 10, nt = 10.
Accordingly, if it satisfies that Scd ≥ Ecd , it can be regarded
As is shown, when LT and HT are set to 0.03 and 0.78, the
that the corresponding individual is greatly influenced by
algorithm presents a more stable performance. To be specific,
the fluctuation in environment, which is not quite a reliable
poor convergence is found on problems DF3 and DF6 when
solution in the new environment. By calculating the proportion
LT = 0.01, HT = 0.97. It may because that the two thresh-
of those unreliable sensors to the whole sensor set, severity
olds are too marginalized to enable the hierarchical response
of the changes can be reflected. Then, considering the three
system timely adapt to the varying environments such as the
optional response modes, the interval (0, 1) is uniformly
concave/convex change on a certain objective function. On the
divided into three sub-intervals, thereby realizing the adaptive
contrary, if LT and HT are set to 0.05 and 0.68, the designed
selection of different response modes. (1) If more than two
response system becomes highly sensitive to the environmental
thirds of sensors are unreliable, then the change is deemed
changes, which can be reflected from the sharp variation
severe and the re-initialization strategy is applied; (2) in
tendency of the IGD value on problems DF12 and DF13.
case of fewer than one third of the sensors are unreliable,
Under this circumstance, once a disconnected time-varying PF
the refinement mode is adopted; (3) otherwise, the TL-based
occurs, the algorithm is likely to treat it as a dramatic change
response is taken.
and accordingly take the re-initialization strategy, which may
cause adverse effects on the convergence. Therefore, it is also
inappropriate to set too centralized thresholds, which cannot Benchmark valuation results obtained by the HRS with
take full advantages of the hierarchical response system. As fixed thresholds (LT, HT ) = (0.03, 0.78) and the HRS
a result, LT = 0.03 and HT = 0.78 are recommended and with adaptive thresholds (denoted as HRS*) are displayed in
they are adopted in other reported experiments in this study. Table IX, where it is found that the adaptive thresholds also
In addition, we also make another attempt to adaptively enable the proposed HRS to effectively handle the dynamic
select the response modes (rather than based on the fixed behaviors with satisfactory convergence. Moreover, according
thresholds), which benefits further strengthening the con- to the rank sum test results, HRS* presents equivalent per-
nections between the response manner and the severity of formance as compared to HRS in six cases, and the former
environmental changes. To be specific, by taking the average even outperforms the latter on DF3, which is a promising
value of CD (in Eq. (5)) regarding to the cardinality of finding that motivates us to explore other adaptive threshold
sensor set S, an environmental changing degree is obtained setting manners and, by doing so, the subjectivity in deter-
1
as Ecd = |S| max{CDi } (i = 1, 2, ..., m), which is deemed mining response modes can be effectively eliminated so as to
as the objective-wise influences of the current change to all establish a highly generalized model that can adapt to diverse
sensors in average level. Meanwhile, one can also obtain the changing situations by connecting the change severity with the
maximum changing degree for each sensor (Scd ) in objective- responses.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final
publication. Citation information: DOI10.1109/TCSS.2023.3293331, IEEE Transactions on Computational Social Systems
FINAL VERSION 14
0.35
0.12 1
0.3
IGD
IGD
IGD
0.2 0.08 0.6
0.15
0.06 0.4
0.1
0.04 0.2
0.05
0 0.02 0
0 5 10 15 20 25 30 35 40 45 50 0 5 10 15 20 25 30 35 40 45 50 0 5 10 15 20 25 30 35 40 45 50
Environment Environment Environment
0.8 0.8 8
IGD
IGD
IGD
0.6 0.6 6
0.4 0.4 4
0.2 0.2 2
0 0 0
0 5 10 15 20 25 30 35 40 45 50 0 5 10 15 20 25 30 35 40 45 50 0 5 10 15 20 25 30 35 40 45 50
Environment Environment Environment
3 0.16
0.2
2.5 0.14
IGD
IGD
IGD
0.12
2
0.1 0.15
1.5
0.08
1
0.06 0.1
0.5 0.04
0 0.02
0 5 10 15 20 25 30 35 40 45 50 0 5 10 15 20 25 30 35 40 45 50 0 5 10 15 20 25 30 35 40 45 50
Environment Environment Environment
0.7 0.2
0.8
0.6 0.18
IGD
IGD
IGD
0.4 0.14
0.4
0.3 0.12
0.2 0.1
0.2
0.1 0.08
0 0.06 0
0 5 10 15 20 25 30 35 40 45 50 0 5 10 15 20 25 30 35 40 45 50 0 5 10 15 20 25 30 35 40 45 50
Environment Environment Environment
DF13 DF14
0.5 0.3
(LT,HT)=(0.01,0.97) (LT,HT)=(0.01,0.97)
(LT,HT)=(0.03,0.78) (LT,HT)=(0.03,0.78)
0.45 (LT,HT)=(0.05,0.68) (LT,HT)=(0.05,0.68)
0.25
0.4
0.2
0.35
IGD
IGD
0.15
0.3
0.1
0.25
0.05
0.2
0.15 0
0 5 10 15 20 25 30 35 40 45 50 0 5 10 15 20 25 30 35 40 45 50
Environment Environment
TABLE IX: Fixed thresholds vs. adaptive thresholds in terms DF1, DF2, DF3 and DF12, the proposed HRS-DMOA spends
of MIGD where (nt , τt , τ ) = (10, 10, 200) less time and achieves better performance than the variant SR-
Problems HRS(fixed thresholds) HRS*(adaptive thresholds) DMOA, whose reason lies in that if the environment only has
DF1 0.0778±4.95e-02 0.0889±4.85e-02(=) slight changes, taking TL-based response will increase the
DF2 0.0562±1.85e-02 0.0586±2.80e-02(=) computational burden with little performance enhancement.
DF3 0.5146±1.88e-01 0.2561±7.48e-02(-) On the contrary, if the TL method is used in case of dramatic
DF4 0.3584±8.47e-02 0.6212±2.35e-01(+) changes, the negative transfer phenomenon may occur that can
DF5 0.0417±8.90e-03 0.0438±2.09e-02(=) lead to poor convergence. Consequently, it can be concluded
DF6 1.3116±7.17e-01 2.2021±1.63e+00(=) that the designed HRS can enable flexible responses to the
DF7 0.4297±5.99e-02 0.5761±2.01e-01(+) varying environments, which effectively combines the merits
DF8 0.1268±6.17e-02 0.1423±1.14e-01(=) of different mainstream strategies to pursue comprehensive
DF9 0.2216±4.22e-02 0.2687±5.79e-02(+) performance improvement.
DF10 0.2659±5.77e-02 0.3966±8.04e-02(+)
DF11 0.1586±1.44e-02 0.5913±4.04e-02(+)
DF12 0.3370±1.22e-01 0.2974±4.21e-03(=) G. Ablation Study on HRS
DF13 0.2506±2.24e-02 0.2905±1.55e-02(+) In this subsection, extensive ablation studies are carried
DF14 0.1043±9.09e-03 0.2141±4.88e-03(+) out to validate the effectiveness of the proposed HRS. In
+/-/= \ 7/1/6 particular, five bi- (DF1-DF5) and tri-objective (DF10-DF14)
problems are selected for performance evaluation in this group
of experiments, where the dynamic parameters are set as
F. Investigation on Single TL-based Response Mode (nt , τt ) = (5, 10), and each algorithm is run 20 times to report
Recently, developing the TL-based DMOAs has become the MIGD results in average level.
the novel frontier of solving DMOPs [51], which can realize 1) Influences of the Quantification-based Hierarchy:
the seamless combination of the memory- and prediction- Firstly, in the proposed HRS, selection criterion of the different
based response strategy. In this regard, the proposed HRS response modes is based on the quantification of environ-
further integrates the merits of diversity enhancement when mental changes. To validate whether the quantification-based
making responses. To validate whether such designing is response-making is effective, another variant of the proposed
useful, an algorithm variant with single response namely SR- algorithm is designed, which is named RRS-DMOA that takes
DMOA is investigated, where only the TL-based response is the same three response modes with random probability.
adopted regardless of the change degree of environment. The
comparison results in terms of both the MIGD indicator and TABLE XI: Effectiveness of the quantification-based
the average runtime are reported in Table X, where dynamic hierarchical response modes in terms of MIGD
parameters are set to τt = 10, nt = 10.
Algorithms
Problems
TABLE X: Performance comparison against single TL-based HRS-DMOA RRS-DMOA
response in terms of MIGD and average runtime DF1 0.0657±8.28e-03 0.8467±4.68e-01
(τt = 10, nt = 10) DF2 0.0474±1.55e-02 0.5827±1.72e-01
DF3 0.2795±1.32e-01 0.6866±2.29e-01
MIGD Average runtime (seconds) DF4 0.6621±3.52e-02 1.2031±1.48e-01
Problems
HRS-DMOA SR-DMOA HRS-DMOA SR-DMOA DF5 0.0639±1.78e-02 1.8872±4.68e-01
DF1 0.1336 0.1653 2.4201 4.2164 DF10 0.1820±9.42e-03 0.2170±2.25e-02
DF2 0.0597 0.1148 1.2067 3.025 DF11 0.1486±2.84e-03 0.1899±1.10e-02
DF3 0.1871 0.1888 1.5432 2.4524 DF12 0.4715±1.15e-01 0.5050±3.26e-02
DF4 0.6758 0.9439 2.192 1.9164 DF13 0.2710±1.16e-02 2.0003±3.20e-01
DF5 0.3324 0.2464 2.4921 3.3526 DF14 0.1342±3.63e-03 1.2288±2.74e-01
DF6 1.6638 0.9891 3.352 3.482
DF7 31.4148 28.246 2.4963 2.728 According to the results displayed in Table XI, it is found
DF8 0.8603 1.3746 3.9365 3.6527 that the proposed HRS-DMOA outperforms the variant RRS-
DF9 1.521 1.0455 2.708 3.064 DMOA on all of the adopted test problems with overwhelming
DF10 0.2062 0.2168 6.2245 5.4015 advantages. In particular, only on the four problems of DF3
DF11 0.1722 0.2034 6.0723 5.6432 and DF10-DF12 can random response modes obtain similar
DF12 0.8925 0.9125 2.4647 2.7542 results as the hierarchical ones. In other cases, the proposed
DF13 0.2813 0.2832 7.5568 5.7128 HRS presents significant convergence improvement, which
DF14 0.1367 0.1613 7.2126 5.7605 may due to that although the RRS can also exploit merits of
different response strategies, the randomness-based response-
According to Table X, SR-DMOA only achieves 4 best making lacks analysis on the specific environment. Hence,
MIGD results out of 14 problems, and in none of the cases that it can lead to the inappropriate responses, like to transfer
SR-DMOA can outperform the original HRS-DMOA in terms previous knowledge in the extremely fluctuating situations.
of both MIGD and runtime. It is noticeable that on problems Particularly, on problem DF5, an improvement on MIGD by
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final
publication. Citation information: DOI10.1109/TCSS.2023.3293331, IEEE Transactions on Computational Social Systems
FINAL VERSION 16
28.53% is achieved by the quantification-based hierarchical DF4 are illustrated in Fig. 5, and the PF obtained by the four
responses, which indicates that it is necessary to analyze the algorithms is visualized in Fig. 6.
environment so as to make flexible and efficient responses.
In addition, from the scatter plot illustrated in Fig. 4, it can
be observed that the proposed HRS-DMOA can track the
time-varying PF better than RRS-DMOA, which demonstrates
the superiority of the quantification-based response-making
manner.
1.4
True PF(t=21)
1.2 HRS-DMOA(t=21)
RRS-DMOA(t=21)
(a) DF1 (b) DF2
True PF(t=37)
1
HRS-DMOA(t=37)
RRS-DMOA(t=37)
0.8
0.6
0.4
0.2
0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
(c) DF3 (d) DF4
Fig. 4: Scatter plots on DF1 of the two algorithms Fig. 5: Box plots on four benchmark problems of the four
(HRS-DMOA vs. RRS-DMOA) at the 21- and 37-th algorithms, where the red circles refer to some outliers, the
environments. red solid and blue dashed lines represent the median and
average value, respectively.
2) Effectiveness of the Response Modes: Secondly, in the
proposed HRS, three response modes are adopted, and to According to the above presented results, we have following
verify the effectiveness of them, three variants of the proposed findings:
algorithm are designed by removing each mode from the 1) On most benchmark problems, the original HRS yields
HRS respectively, which are termed as HRS v1 (without the best convergence performance, which validates that the
refinement), HRS v2 (without TL), and HRS v3 (without proposed HRS can flexibly handle different dynamic behav-
iors, and some variances in 10−3 level also present a stable
! %
re-initialization). The comparison results are reported in Ta-
ble XII, where “ ” and “ ” denote whether a certain performance.
response mode is contained in corresponding algorithm or not. 2) When the TL-based response is removed, the HRS v2
Moreover, the box plots regarding to MIGD of problems DF1- presents the most severe performance declination in many
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final
publication. Citation information: DOI10.1109/TCSS.2023.3293331, IEEE Transactions on Computational Social Systems
FINAL VERSION 17
V. C ONCLUSIONS
cases such as on problem DF1, where the MIGD is as nearly
five times as that obtained by HRS, which indicates the
In this paper, a novel DMOA has been proposed based on
importance of the TL-based response mode.
a hierarchical response system, whose main idea is to realize
3) On tri-objective problems DF10 and DF11, the four
rational and efficient integration of the mainstream methods so
algorithms obtain similar results, while slight convergence
as to enhance comprehensive performance. The environmental
improvement can be achieved by the original HRS, which
changes have been quantified as three levels and, by doing
implies that the complementary performance is realized in the
so, different response modes can be adopted accordingly to
proposed algorithm.
flexibly handle various dynamic behaviors in DMOPs.
4) The response made by the proposed HRS may not
always be the best choice, as can be seen that on DF3 and Benchmark evaluations have been carried out on 14
DF4, the variants HRS v1 and HRS v2 obtain the best result, DMOPs, and the results show that the proposed HRS-DMOA
respectively. The reason may be that the thresholds of HRS outperforms other four popular baseline DMOAs in terms
are manually set. of both convergence and diversity. In addition, extensive
ablation studies have been carried out to validate the su-
5) Without the re-initialization mode, none of the best
periority of the proposed HRS. On one hand, as compared
MIGD is obtained by the HRS v3, while in some cases like
with randomly taking different responses, the quantification-
DF2 and DF14, the HRS v3 can slightly outperform the
based hierarchical response-making has shown overwhelming
HRS v1 and HRS v2. Hence, it can be concluded that the
advantages, which proves the necessity of in-depth analysis
diversity-based response can still be helpful.
on the dynamic behaviors in DMOPs. On the other hand,
6) From the results on DF4, it can be inferred that the
performance declination to different extents has been observed
TL-based response may even cause adverse effects due to the
when some certain response mode is removed from the HRS,
negative transfer phenomenon, which simultaneously implies
which demonstrates the effectiveness of integrating different
the superiority of the rational integration of diverse strategies
mainstream methods.
as none of the single response can always be the best.
Based on above discussions, it can be concluded that the Although some satisfactory results have been obtained,
proposed HRS does realize the systematical integration of several important issues of the proposed HRS-DMOA deserve
different mainstream response strategies, which is effective in further attention, which mainly include (1) insufficient con-
handling various dynamic behaviors in DMOPs. Moreover, cerns on integrating novel state-of-the-art response strategies;
the quantification of environmental changes is also proven (2) manual intervention on the threshold parameters of HRS.
necessary to some extents, which motivates us to develop In future, we tend to propose some novel response methods
other novel schemes to comprehensively quantify the dynamic with other advanced techniques, and to develop a learnable
behaviors in our future work. optimizer that can avoid the manual parameterization is also a
feasible and innovative direction. Then, how to comprehen-
sively quantify the dynamic behaviors from the aspects of
H. Outlook for Future Work both changing severity and frequency can provide us with a
In addition to above satisfactory results, it is worth pointing more thorough understanding on the essence of the DMOPs.
out that there are still spaces for improvements on the proposed In addition, we are prone to integrate the proposed HRS
HRS-DMOA, where the most important issue is that some to other swarm-intelligence-based static optimizer, and it is
state-of-the-art novel strategies are not well considered in the also promising to apply the HRS-DMOA to more real-world
proposed HRS. optimization scenes to validate its practicality.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final
publication. Citation information: DOI10.1109/TCSS.2023.3293331, IEEE Transactions on Computational Social Systems
FINAL VERSION 18
R EFERENCES [22] Y. Hu, J. Zheng, S. Jiang, J. Zou and R. Wang, A Mahalanobis distance-
based approach for dynamic multi-objective optimization with stochastic
[1] L. Altin and H. Topcuoglu, Impact of sensor-based change detection changes, IEEE Transactions on Evolutionary Computation, 2023. https:
schemes on the performance of evolutionary dynamic optimization //doi.org/10.1109/TEVC.2023.3253850
techniques, Soft Computing, vol. 22, no. 14, pp. 4741-4762, 2018. [23] S. Jiang and S. Yang, Evolutionary dynamic multiobjective optimization:
[2] A. Anand and A. Singh, Hybrid nature-inspired optimization and benchmarks and algorithm comparisons, IEEE Transactions on Cyber-
encryption-based watermarking for E-healthcare, IEEE Transactions netics, vol. 47, no. 1, pp. 198-211, 2017.
on Computational Social Systems, 2022. https://doi.org/10.1109/TCSS. [24] M. Jiang, W. Hu, W. Qiu, M. Li, H. Ming and K. Tan, Solving
2022.3140862 dynamic multi-objective optimization problems via support vector ma-
[3] R. Azzouz, S. Bechikh and L. Said, A multiple reference point-based chine, Proceedings of the 10th International Conference on Advanced
evolutionary algorithm for dynamic multi-objective optimization with Computational Intelligence, pp. 819-824, 2018.
undetectable changes, 2014 IEEE Congress on Evolutionary Computa- [25] M. Jiang, Z. Huang, L. Qiu, W. Huang and G. Yen, Transfer learning-
tion, pp. 3168-3175, 2014. based dynamic multiobjective optimization algorithms, IEEE Transac-
[4] X. Cai, H. Hu, M. Li, Y. Xiao, H. Ishibuchi and Z. Fan, A grid-based tions on Evolutionary Computation, vol. 22, no. 4, pp. 501-514, 2017.
inverted generational distance for multi/many-objective optimization, [26] M. Jiang, Z. Wang, H. Hong and G. Yen, Knee point-based imbalanced
IEEE Transactions on Evolutionary Computation, vol. 25, no. 1, pp. 21- transfer learning for dynamic multiobjective optimization, IEEE Trans-
34, 2021. actions on Evolutionary Computation, vol. 25, no. 1, pp. 117-129, 2020.
[5] B. Chen, Y. Chen, G. Zeng and Q. She, Fractional-order convolutional [27] S. Jiang, S. Yang, X. Yao, K. Tan, M. Kaiser and N. Krasnogor, Bench-
neural networks with population extremal optimization, Neurocomput- mark problems for CEC2018 competition on dynamic multiobjective
ing, vol. 477, pp. 36-45, 2022. optimisation, 2018 CEC Competition, pp. 1-18, 2018.
[6] D. Chen, F. Zou, R. Lu and X. Wang, A hybrid fuzzy inference [28] Q. Li, S. Yang, J. Zheng and G. Ruan, A predictive strategy based on
prediction strategy for dynamic multi-objective optimization, Swarm and special points for evolutionary dynamic multi-objective optimization,
Evolutionary Computation, vol. 43, pp. 147-165, 2018. Soft Computing, vol. 23, pp. 3723-3739, 2019.
[7] J. Chen, Y. Wu, X. Xu, H. Zheng, Z. Ruan and Q. Xuan, PSO-ANE: [29] H. Li, J. Li, P. Wu, Y. You and N. Zeng, A ranking-system-based
Adaptive network embedding with particle swarm optimization, IEEE switching particle swarm optimizer with dynamic learning strategies,
Transactions on Computational Social Systems, vol. 6, no. 4, pp. 649- Neurocomputing, vol. 494, pp. 356-367, 2022.
659, 2019. [30] H. Li, P. Wu, N. Zeng, Y. Liu and F. E. Alsaadi, A survey on
[8] M. Chen, Y. Guo, Y. Jin, S. Yang, D. Gong and Z. Yu, An environment- parameter identification, state estimation and data analytics for lateral
driven hybrid evolutionary algorithm for dynamic multi-objective op- flow immunoassay: from systems science perspective, International
timization problems, Complex and Intelligent Systems, vol. 9, no. 1, Journal of Systems Science, vol. 53, no. 16, pp. 3556-3576, 2022.
pp. 659-675, 2023. [31] Z. Liang, S. Zheng, Z. Zhu and S. Yang, Hybrid of memory and pre-
[9] H. Cheng, Z. Wang, Z. Wei, L. Ma and X. Liu, On adaptive learning diction strategies for dynamic multiobjective optimization, Information
framework for deep weighted sparse autoencoder: A multiobjective Sciences, vol. 485, pp. 200-218, 2019.
evolutionary algorithm, IEEE Transactions on Cybernetics, vol. 52,
[32] Z. Liang, T. Wu, X. Ma, Z. Zhu and S. Yang, A dynamic multiobjective
no. 5, pp. 3221-3231, 2022.
evolutionary algorithm based on decision variable classification, IEEE
[10] C. Cortes and V. Vapnik, Support-vector networks, Machine Learning, Transactions on Cybernetics, vol. 52, no. 3, pp. 1602-1615, 2022.
vol. 20, no. 3, pp. 273-297, 1995.
[33] Z. Liang, X. Xu, L. Liu, Y. Tu and Z. Zhu, Evolutionary many-task op-
[11] W. Dai, Q. Yang, R. Xue and Y. Yu, Boosting for transfer learning,
timization based on multisource knowledge transfer, IEEE Transactions
International Conference on Machine Learning, 2007.
on Evolutionary Computation, vol. 26, no. 2, pp. 319-333, 2022.
[12] K. Deb, N. Rao and S. Karthik, Dynamic multi-objective optimization
[34] J. Lin, H. Liu, K. Tan and F. Gu, An effective knowledge transfer ap-
and decision-making using modified NSGA-II: a case study on hydro-
proach for multiobjective multitasking optimization, IEEE Transactions
thermal power scheduling, International Conference on Evolutionary
on Cybernetics, vol. 51, no. 6, pp. 3238-3248, 2021.
Multi-Criterion Optimization, pp. 803-817, 2007.
[13] J. Derrac, S. Garcia, D. Molina and F. Herrera, A practical tutorial [35] R. Liu, J. Li, C. Mu and L. Jiao, A coevolutionary technique based on
on the use of nonparametric statistical tests as a methodology for multi-swarm particle swarm optimization for dynamic multi-objective
comparing evolutionary and swarm intelligence algorithms, Swarm and optimization, European Journal of Operational Research, vol. 261, no. 3,
Evolutionary Computation, vol. 1, no. 1, pp. 3-18, 2011. pp. 1028-1051, 2017.
[14] J. Eaton, S. Yang and M. Gongora, Ant colony optimization for [36] R. Liu, P. Yang and J. Liu, A dynamic multi-objective optimization
simulated dynamic multi-objective railway junction rescheduling, IEEE evolutionary algorithm for complex environmental changes, Knowledge-
Transactions on Intelligent Transportation Systems, vol. 18, no. 11, Based Systems, vol. 216, article no. 106612, 2021.
pp. 2980-2992, 2017. [37] X. Liu, X. Xu, Z. Zhan, Y. Fang and J. Zhang, Interaction-based
[15] L. Feng, W. Zhou, W. Liu, Y. Ong and K. Tan, Solving dynamic prediction for dynamic multiobjective optimization, IEEE Transactions
multiobjective problem via autoencoding evolutionary search, IEEE on Evolutionary Computation, 2023. doi:10.1109/TEVC.2023.3234113
Transactions on Cybernetics, vol. 52, no. 5, pp. 2649-2662, 2022. [38] X. Liu, Y. Du, M. Jiang and X. Zeng, Multiobjective particle swarm
[16] Y. Guo, H. Yang, M. Chen, J. Cheng and D. Gong, Ensemble prediction- optimization based on network embedding for complex network com-
based dynamic robust multi-objective optimization methods, Swarm and munity detection, IEEE Transactions on Computational Social Systems,
Evolutionary Computation, vol. 48, pp. 156-171, 2019. vol. 7, no. 2, pp. 437-449, 2020.
[17] H. Han, Y. Liu, L. Zhang, H. Liu, H. Yang and J. Qiao, Knowledge [39] X. Liu, Z. Zhan, T. Gu, S. Kwong, Z. Lu, H. Duh and J. Zhang,
reconstruction for dynamic multi-objective particle swarm optimization Neural network-based information transfer for dynamic optimization,
using fuzzy neural network, International Journal of Fuzzy Systems, IEEE Transactions on Neural Networks and Learning Systems, vol. 31,
2023. https://doi.org/10.1007/s40815-023-01477-2 no. 5, pp. 1557-1570, 2020.
[18] M. Helbig and A. Engelbrecht, Performance measures for dynamic [40] Y. Liu, N. Zhu and M. Li, Solving many-objective optimization problems
multi-objective optimisation algorithms, Information Sciences, vol. 250, by a Pareto-based evolutionary algorithm with preprocessing and a
pp. 61-81, 2013. penalty mechanism, IEEE Transactions on Cybernetics, vol. 51, no. 11,
[19] J. Hu, C. Jia, H. Liu, X. Yi and Y. Liu, A survey on state estimation of pp. 5585-5594, 2021.
complex dynamical networks, International Journal of Systems Science, [41] L. Ma, Z. Shao, X. Li, Q. Lin, J. Li, V. Leung and A. Nandi, Influence
vol. 52, no. 16, pp. 3351-3367, 2021. maximization in complex networks by using evolutionary deep rein-
[20] L. Hu, W. Naeem, E. Rajabally, G. Watson, T. Mills, Z. Bhuiyan, forcement learning, IEEE Transactions on Emerging Topics in Computa-
C. Raeburn, I. Salter and C. Pekcan, A multiobjective optimization tional Intelligence, 2021. https://doi.org/10.1109/TETCI.2021.3136643
approach for COLREGs-compliant path planning of autonomous surface [42] J. Ou, M. Li, L. Xing, J. Lv, Y. Hu, N. Dong and G. Zhang, Individual-
vehicles verified on networked bridge simulators, IEEE Transactions on based self-learning prediction method for dynamic multi-objective opti-
Intelligent Transportation Systems, vol. 21, no. 3, pp. 1167-1179, 2020. mization, Information Sciences, vol. 613, pp. 401-418, 2022.
[21] Y. Hu, J. Zheng, S. Jiang, S. Yang and J. Zou, Handling dynamic [43] R. Rambabu, P. Vadakkepat, K. Tan and M. Jiang, A mixture-of-
multiobjective optimization environments via layered prediction and experts prediction framework for evolutionary dynamic multiobjective
subspace-based diversity maintenance, IEEE Transactions on Cybernet- optimization, IEEE Transactions on Cybernetics, vol. 50, pp. 5099-5112,
ics, vol. 53, pp. 2572-2585, 2021. 2019.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final
publication. Citation information: DOI10.1109/TCSS.2023.3293331, IEEE Transactions on Computational Social Systems
FINAL VERSION 19
[44] S. Sahmoud and H. Topcuoglu, Sensor-based change detection schemes [67] H. Zhang, D. Yue, X. Xie, S. Hu and S. Weng, Multi-elite guide hybrid
for dynamic multi-objective optimization problems, 2016 IEEE Sympo- differential evolution with simulated annealing technique for dynamic
sium Series on Computational Intelligence, pp. 1-8, 2016. economic emission dispatch, Applied Soft Computing, vol. 34, pp. 312-
[45] S. Sahmoud and H. Topcuoglu, A memory-based NSGA-II algorithm for 323, 2015.
dynamic multi-objective optimization problems, European Conference [68] J. Zheng, Z. Peng, J. Zou and R. Shen, A prediction strategy based
on the Applications of Evolutionary Computation, pp. 296-310, 2016. on guide-individual for dynamic multi-objective optimization, Acta
[46] S. Sahmoud and H. Topcuoglu, Exploiting characterization of dynamism Electronica Sinica, vol. 43, no. 9, pp. 1816-1825, 2015.
for enhancing dynamic multi-objective evolutionary algorithms, Applied [69] J. Zheng, F. Zhou, J. Zou, S. Yang and Y. Hu, A dynamic multi-
Soft Computing, vol. 85, article no. 105783, 2019. objective optimization based on a hybrid of pivot points prediction
[47] B. Song, H. Miao and L. Xu, Path planning for coal mine robot via and diversity strategies, Swarm and Evolutionary Computation, vol. 78,
improved ant colony optimization algorithm, Systems Science & Control article no. 101284, 2023.
Engineering, vol. 9, no. 1, pp. 283-289, 2021. [70] J. Zheng, Q. Wu, J. Zou, S. Yang and Y. Hu, A dynamic multi-
[48] H. Sun, A. Cao, Z. Hu, X. Li and Z. Zhao, A novel quantile-guided objective evolutionary algorithm using adaptive reference vector and
dual prediction strategies for dynamic multi-objective optimization, linear prediction, Swarm and Evolutionary Computation, vol. 78, article
Information Sciences, vol. 579, pp. 751-775, 2022. no. 101281, 2023.
[71] A. Zhou, Y. Jin and Q. Zhang, A population prediction strategy for
[49] H. Sun, X. Ma, Z. Hu, J. Yang and H. Cui, A two stages prediction
evolutionary dynamic multi-objective optimization, IEEE Transaction
strategy for evolutionary dynamic multi-objective optimization, Applied
on Cybernetics, vol. 33, no. 1, pp. 40-53, 2012.
Intelligence, vol. 53, pp. 1115-1131, 2023.
[72] J. Zou, Q. Li, H. Bai and J. Zheng, A prediction strategy based on
[50] J. Sun, X. Gan, D. Gong, X. Tang, H. Dai and Z. Zhong, A self-
center points and knee points for evolutionary dynamic multi-objective
evolving fuzzy system online prediction-based dynamic multi-objective
optimization, Applied Soft Computing, vol. 61, pp. 806-818, 2017.
evolutionary algorithm, Information Sciences, vol. 612, pp. 638-654,
2022.
[51] K. Tan, L. Feng and M. Jiang, Evolutionary transfer optimization-a
new frontier in evolutionary computation research, IEEE Computational
Intelligence Magazine, vol. 16, no. 1, pp. 22-33, 2021.
[52] F. Wang, Y. Li, F. Liao and H. Yan, An ensemble learning based
prediction strategy for dynamic multi-objective optimization, Applied
Soft Computing, vol. 96, article no. 106592, 2020.
[53] Y. Wang, K. Li and G. Wang, Combining key-points-based transfer
learning and hybrid prediction strategies for dynamic multi-objective
optimization, Mathematics, vol. 10, no. 12, article no. 2117, 2022.
[54] X. Wang, K. Zhang, J. Wang and Y. Jin, An enhanced competitive swarm
optimizer with strongly convex sparse operator for large-scale multiob-
jective optimization, IEEE Transactions on Evolutionary Computation,
vol. 26, no. 5, pp. 859-871, 2021.
[55] X. Wang, Y. Jin, S. Schmitt and M. Olhofer, Transfer learning based co-
surrogate assisted evolutionary bi-objective optimization for objectives
with non-uniform evaluation times, Evolutionary Computation, vol. 30,
no. 2, pp. 221-251, 2022.
[56] Z. Wang, Z. Zhan, W. Yu, Y. Lin, J. Zhang, T. Gu and J. Zhang, Dynamic
group learning distributed particle swarm optimization for large-scale
optimization and its application in cloud workflow scheduling, IEEE
Transactions on Cybernetics, vol. 50, no. 6, pp. 2715-2729, 2020.
[57] F. Wu, J. Chen and W. Wang, A dynamic multi-objective evolutionary
algorithm based on prediction, Journal of Computational Design and
Engineering, vol. 10, no. 1, pp. 1-15, 2023.
[58] D. Xu, M. Jiang, W. Hu, S. Li, R. Pan and G. Yen, An online prediction
approach based on incremental support vector machine for dynamic
multiobjective optimization, IEEE Transactions on Evolutionary Com-
putation, vol. 26, no. 4, pp. 690-703, 2021.
[59] X. Xu, Y. Tan, W. Zheng and S. Li, Memory-enhanced dynamic multi-
objective evolutionary algorithm based on Lp decomposition, Applied
Sciences, vol. 8, no. 9, pp. 1673-1695, 2018.
[60] F. Yao and G. Wang, Transfer learning based on clustering difference for
dynamic multi-objective optimization, Applied Science, vol. 13, no. 8,
article no. 4795, 2023.
[61] Y. Ye, L. Li, Q. Lin, K. Wong, J. Li and Z. Ming, Knowledge
guided Bayesian classification for dynamic multi-objective optimization,
Knowledge-Based Systems, vol. 250, article no. 109173, 2022.
[62] N. Yu, R. Yang and M. Huang, Deep common spatial pattern based
motor imagery classification with improved objective function, Inter-
national Journal of Network Dynamics and Intelligence, vol. 1, no. 1,
pp. 73-84, 2022.
[63] N. Zeng, Z. Wang, W. Liu, H. Zhang, K. Hone and X. Liu, A dynamic
neighborhood-based switching particle swarm optimization algorithm,
IEEE Transactions on Cybernetics, vol. 52, no. 9, pp. 9290-9301, 2022.
[64] Q. Zhang and H. Li, MOEA/D: A multiobjective evolutionary algorithm
based on decomposition, IEEE Transactions on Evolutionary Computa-
tion, vol. 11, no. 6, pp. 712-731, 2007.
[65] W. Zhang, L. Sheng and M. Gao, Finite-horizon H∞ state estimation for
time-varying complex networks based on the outputs of partial nodes,
Systems Science & Control Engineering, vol. 9, no. s2, pp. 48-59, 2021.
[66] G. Zhao, Y. Li and Q. Xu, From emotion AI to cognitive AI, Inter-
national Journal of Network Dynamics and Intelligence, vol. 1, no. 1,
pp. 65-72, 2022.