1 s2.0 S1474667016430557 Main
1 s2.0 S1474667016430557 Main
Abstract: In this paper, the minimization of an unknown but measurable cost function using
extremum-seeking control is considered. The main contribution of the paper is to formulate
the extremum-seeking problem as a time-varying estimation problem. The concepts of invariant
manifold and adaptive parameter update law are used for adaptive estimation of the time-
varying gradient of the unknown cost function. The proposed approach is shown to avoid the
need for averaging analysis which minimizes the sensitivity of the closed-loop performance to
the choice of dither signal.
The main contribution of this work is to remove the need If one has access to the gradient θ(t), then it follows
for averaging of the quasi steady-state system, a crucial that the gradient descent u̇ = −k1 θ(t) with k1 > 0, will
element of conventional ESC approach. It also avoids the converge the optimum u∗ .
need to use the frequency of the dither signal as a singu- Lemma 1. Consider the cost function subject to the all
lar perturbation parameter. The proposed ESC algorithm assumptions. The gradient descent update u̇ = −k1 θ(t) is
can improve the transient performance of the closed-loop such that the objective function decreases monotonically
system. and reaches to its minimum at u∗ .
The paper is organized as follows. The problem description
is given in Section 2. In Section 3, the proposed ESC Proof. By convexity of the cost, it is shown in the
controller is presented for the case of process described following that the cost will decrease until a value of u is
by a static map. The extension of the algorithm to an reached such that the gradient of the cost be zero. Let
unknown dynamical system is presented in Section 4. This ũ = u − u∗ and consider the following Lyapunov function
is followed by a simulation example in Section 5 and a brief candidate for the input dynamics:
conclusion in Section 6. Vũ = 12 ũT ũ
2. PROBLEM DESCRIPTION By considering Assumption 1, and differentiation with
respect to t, the following inequality is obtained
Consider a nonlinear system 2
V̇ũ = −k1 ũT ∂`(u)
∂u ≤ −k1 α2 kũ(t)k
ẋ = f (x, u) (1)
y = h(x) (2) As a result, the system converges to the unknown mini-
where x ∈ Rn is the state, u is the control input taking mizer u∗ .
values in U ⊂ Rm and y ∈ R is the unknown and
The design of the extremum seeking scheme is based on
measurable cost function to be minimized. The functions
the unknown dynamics (4). The first step consists in
f (x, u) and h(x) are assumed to be C ∞ in all of their
the estimation of the time-varying parameter θ(t). In the
arguments.
second step, we define a suitable adaptive controller that
The objective of ESC is to bring the closed-loop control accomplishes the extremum seeking task.
system to the unknown equilibrium x∗ and u∗ that min-
imizes the cost function y. The equilibrium (or steady- 3.1 Parameter Estimation
state) map is the n dimensional vector-valued function
π(u) which is such that f (π(u), u) = 0. The steady-state In this section, an alternative technique based on the idea
cost function is given by y = h(π(u)) = `(u). Thus, of invariant manifolds is proposed for adaptive estimation
at steady-state, the problem is reduced to finding the of the unknown time-varying parameters. The geometric
minimizer u∗ of the y = `(u). The following assumptions concept of invariance has been widely used in nonlinear
are required. control theory (see Tian and Yu [2000], Astolfi and Ortega
Assumption 1. The equilibrium cost is such that [2003]). This algorithm utilizes a number of high gain
∂`(u∗ ) ∂ 2 `(u) estimators and filters. An almost invariant manifold is
= 0, > α1 I ∀u ∈ U generated from the filters, which allows to exploit implicit
∂u ∂u∂uT functions relating implicitly the known variables and the
∂`(u) 2
(u − u∗ )T ≥ α2 ku − u∗ k , ∀u ∈ U unknown variables. A parameter update law is assigned
∂u using the almost invariant manifold.
where matrix I is an identity matrix with suitable dimen-
sion, and α1 , α2 are strictly positive constants. Invariant Manifold Design: The basic idea is to find a
Assumption 2. The steady-state map is such that mapping from known variables to the unknown variables,
which has an almost invariance property for sufficiently
∂`(u) ∂ 2 `(u)
kyk ≤ Y, ∂u ≤ L1 , ∂u∂uT
≤ L2 large value of an assignable design gain. This implicit
mapping is used to facilitate the estimation the unknown
∀u ∈ U with positive constants Y , L1 and L2 . parameters.
3. STATIC MAP The estimator model for (4) is defined as
ŷ˙ = −k 2 (ŷ − y), k > 0 (5)
Since the minimization of y is performed in real-time, the
input u is taken as a time-varying signal. That is, and the filter is described along with the structure of the
system by
y(t) = `(u(t)) (3)
φ̇ = −k 2 (φ − u̇) (6)
Assumption 3. The input signal u(t) is such that u(t) ∈ U
and ku̇(t)k ≤ α3 , ∀t ≥ t0 ≥ 0. Assumption 4. The vector-valued function φ(t) is bounded
with a known bound as kφk ≤ λ, ∀t ≥ t0 ≥ 0.
The differentiation of (3) with respect to time results in Proposition 2. Consider the estimator (5) and filter (6),
ẏ = ( ∂`(u) T
∂u ) u̇. Following the time-varying approach in the following implicit manifold
Guay et al. [2013], we define the unknown gradient and lim [k 2 (ŷ − y) + φT θ(t)] = 0 (7)
the cost dynamic as k→∞
∂`(u) is such that the manifold is invariant and internally ex-
θ(t) = , ẏ = θ(t)T u̇(t) (4) ponentially stable. Furthermore, the desired manifold is
∂u
9130
19th IFAC World Congress
Cape Town, South Africa. August 24-29, 2014
bounded and almost invariant for bounded and sufficiently An adaptive estimator is considered as
large values of k. Σ̇(t) = k[bI − φφT (Σ(t))], Σ(t0 ) = bI (15)
Proof. First, it is shown that the manifold (7) has an in- Based on (14) and (15), the proposed parameter update
variance feature. The off-the-manifold coordinate variable law is given by
z(t) is defined as ˙
θ̂(t) = k 2 Σ(t)[δ̇ + kδ] (16)
z(t) = k 2 (ŷ − y) + φT θ(t). (8) where b > 1 and δ = p − q.
The derivative of z(t) along the trajectories of the system Assumption 5. There exists constants α > 0 and T > 0
is given by such that
Z t+T
ż(t) = k 2 (ŷ˙ − ẏ) + φ̇T θ(t) + φT θ̇(t). (9)
φ(τ )φT (τ )dτ ≥ αI, ∀t > t0 . (17)
It follows from (4)-(6) that t
T
φ θ̇(t) The condition of the Assumption 5 is equivalent to the
ż(t) = −k 2 z(t) + φT θ̇(t) = −k 2 (z(t) − ). (10)
k2 standard persistent excitation (PE) condition required for
∂ 2 `(u) parameter convergence.
By the fact that θ̇ = ∂u∂u T u̇, and in view of Assumptions 2
φT θ̇(t)
and 3, it follows that θ̇ ≤ L2 α3 = Lθ , so k2 → 0 as 3.2 Controller Design
k → ∞. Suppose there exists a time tc such that z(tc ) = 0,
The proposed extremum seeking control is given by
then
u̇ = −k1 θ̂(t) + d(t) (18)
z(tc ) = 0 ⇒ ż(t) ≡ 0 ⇒ z(t) ≡ 0, ∀t ≥ tc
where d(t) is a bounded dither signal with kd(t)k ≤ D. The
which shows that z(t) = 0 is an invariant manifold for dither signal d(t) can be chosen arbitrary, and it only need
nonlinear dynamics (4)-(6) as k → ∞, and ż(t) = −k 2 z(t) to satisfy the PE condition of the Assumption 5. A one
is globally exponentially stable. proper choice of d(t) signals are sinusoidal waves, because
For the case where k is bounded, a quadratic Lyapunov of their orthogonality feature (Tan et al. [2008]). Although,
function can be defined as in our algorithm convergence to the optimal value is not
1 sensitive to the choice of amplitude and frequency of
Vz(t) = z 2 (t) (11)
2 the sinusoidal signal. Note that, since θ̂(t) and d(t) are
The first time derivative of Vz(t) along its trajectories, assumed to be bounded then the controller (18) is such
result in that ku̇k ≤ k1 L1 + D ≤ α3 .
V̇z(t) = −k 2 z 2 (t) + z(t)φT θ̇(t) Theorem 3. Let Assumptions 1 to 5 hold. Then the pa-
rameter update law (16) and the control law (18) are
As a result of Assumption 4, and applying Young’s In- such that the closed-loop extremum seeking control system
equality, one obtains the following inequality: converges to a neighborhood of the minimizer u∗ of the
λk 2 λ T static nonlinear optimization problem (4). The size of this
V̇z(t) ≤ −k 2 z 2 (t) + z (t) + θ̇ (t)θ̇(t) neighborhood is adjustable by the gains k and k1 .
2 2k (12)
λk λ 2
≤ − k2 − Vz(t) + Lθ Proof. By construction of parameter estimation error as
2 2k
θ̃(t) = θ(t) − θ̂(t), the parameter update law (16) can be
where k λ2 . Hence, the manifold normal coordinate approximated implicitly in the form
variables z(t) enter a small neighborhood of the origin. ˙
The size of this neighborhood depends on the choice of θ̂(t) = k 2 Σ(t)φφT θ̃(t). (19)
the high gain k. Therefore, an almost invariant manifold The quadratic Lyapunov function is defined as
is obtained by considering sufficiently large value for k. 1
V(θ̃,ũ) = θ̃T (t)θ̃(t) + Vũ (20)
Remark 1. It will be shown that the parameter estima- 2
tion can be achieved by considering the proposed almost By differentiating of (20) along (19) and (18), we have
invariant manifold. Thus, unlike the sliding mode-based
techniques, satisfactory performance can be attained with- V̇(θ̃,ũ) = −k 2 θ̃T (t)(Σ(t)φφT )θ̃(t) + θ̃T (t)θ̇(t)
out requiring that the system states reach the reference −k1 ũT θ(t) + k1 ũT θ̃(t) + ũT d(t)
manifold.
By Assumption 1, one can write the following inequality
Adaptive Estimation: The invariant manifold (7) can be
V̇(θ̃,ũ) ≤ −k 2 θ̃T (t)(Σ(t)φφT )θ̃(t) + θ̃T (t)θ̇(t)
re-written as
k 2 (ŷ − y) = −φT θ(t). (13) −k1 α2 ũT ũ + k1 ũT θ̃(t) + ũT d(t)
This equation provides an implicit relationship from Applying Young’s inequality to all indefinite terms of the
known variables (φ, ŷ, y) to the unknown variable θ(t). last inequality, there exists a positive constant k2 such that
This mapping provides direct information about the pa- k
rameter estimation error without requiring a priori knowl- V̇(θ̃,ũ) ≤ −k 2 θ̃T (t)(Σ(t)φφT )θ̃(t) + θ̃T (t)θ̃(t)
edge of the time-varying parameters. This is achieved by 2
1 kk1 T
defining the auxiliary variables p and q with the dynamics + θ̇T (t)θ̇(t) − k1 α2 ũT ũ + θ̃ (t)θ̃(t)
2k 2
ṗ = −kp − φφT θ̂(t) k1 k2 1 T
(14) + ũT ũ + ũT ũ + d (t)d(t)
q̇ = −kq + φ(k 2 (ŷ − y)) 2k 2 2k2
9131
19th IFAC World Congress
Cape Town, South Africa. August 24-29, 2014
With collecting the similar terms, one can rewrite the depends on the choice of gains k, k1 and the magnitude of
above inequality as follows the dither signal. This completes the proof of Theorem 3.
kk1 + k Remark 2. By construction, one can decrease the contri-
V̇(θ̃,ũ) ≤ − k 2 Σ(t)φφT − θ̃T (t)θ̃(t) bution from d(t) and θ̇(t) by increasing the optimization
2
gain k1 and the estimation gain k, respectively. Since,
k1 + kk2 1 T 1 T
− k1 α2 − ũT ũ + θ̇ (t)θ̇(t) + d (t)d(t) the rate of change of the parameter
2k 2k 2k2 is proportional
to
Next we claim the boundedness of the matrix Σ(t) as the optimization gain, θ̇(t) ≤ L2 k1 θ̂(t) + D , the
follows. By integration, one gets convergence to the small neighborhood is achieved by
Z t ensuring that k k1 .
T
Σ(t) = exp −kφ(τ )φ (τ )dτ Σ(t0 )
Z t Zt0 t 4. OPTIMIZATION IN DYNAMICAL SYSTEMS
+kb exp −kφ(ξ)φT (ξ)dξ dτ >
In this section, we consider the initial extremum control
Z t t0 Z t τ (21) system which consists in steering the unknown dynamical
kb exp −kφ(ξ)φT (ξ)dξ dτ ≥ system (1) to the equilibrium that minimizes the measure
tZ0 t τ
cost function (2). The closed-loop extremum seeking con-
b(1 − e−kβ(t−t0 ) )
−kβ(t−τ ) trol system is given by (Guay et al. [2013]):
kb e dτ I = I
t0 β
1 ẋ = f (x, u)
where the last inequality is achieved from the boundedness u̇ = −k1 θ̂(t) + d(t)
of matrix φφT . As a result of Assumption 5, we obtain ˙
Z t Z t θ̂(t) = k 2 Σ(t)[δ̇ + kδ]
(26)
Σ(t) ≤ Σ(t0 ) + kb exp −kφ(ξ)φT (ξ)dξ dτ ≤ ŷ˙ = −k 2 (ŷ − y)
Z t t0 τ φ̇ = −k 2 (φ − u̇)
bα + b(1 − e−kα(t−t0 ) )
b + kb e −kα(t−τ )
dτ I ≤ I Σ̇(t) = k[bI − φφT (Σ(t))]
t0 α
As in other works on extremum-seeking control, the closed-
(22) loop dynamics of the system are written in error form in
It follows from (21) and (22) that a two time-scale system where t is the slow time-scale and
b(1 − e−kβ(t−t0 ) ) T the system’s dynamics are assumed to evolve over a fast
φφ ≤ Σ(t)φφT time-scale τ = t1 . The parameter 1 > 0 is a small strictly
β (23)
bα + b(1 − e−kα(t−t0 ) ) T positive parameter to be assigned.
≤ φφ
α Let us define the deviation variables x̃ = x − π(u) and ũ =
and u − u∗ where u∗ is the local minimizer of the steady-state
b(1 − e−kβ(t−t0 ) ) ≤ Σ(t)φφT map y = `(u). The auxiliary variable is defined as above.
bα + b(1 − e−kα(t−t0 ) ) (24) However, one must take into account the measurement of
≤ β. the cost function over the fast and the slow time-scale, as
α
Consider the function V̇(θ̃,ũ) and inequality (24), the gain h(x̃ + π(ũ + u∗ )). Therefore, new dynamics are defined for
the off-the-manifold coordinate variable z(t), as
should be chosen such that
k1 + 1 2 ˙ 1 ∂h(x) ∂h(x) ∂π(u)
(1 − e−kβ(t−t0 ) )k ≥ ż = k ŷ − f (x, u) + u̇
2 1 ∂x ∂x ∂u (27)
k1 + kk2 T T
+φ̇ θ(t) + φ θ̇(t)
k1 α2 − >0
2k Substituting for ŷ˙ and φ̇, one obtains
or equivalently
2 2 ∂h(π(u)) ∂π(u)
N ż(t) = −k z(t) + k u̇
X k1 + 1 ∂x ∂u
kβ(t − t0 ) = kβ (ti − ti−1 ) ≥ ln( ) (28)
2k 1 ∂h(x) ∂h(x) ∂π(u)
i=1 −k 2 f (x, u) + u̇ + φT θ̇(t)
kk2 1 ∂x ∂x ∂u
k1 >
2kα2 − 1 As a result, the z(t) dynamic is affected by the fast and
Based on the last inequality, kβN T 0 ≥ ln(k1 + 1/2k), slow dynamics. One can write the system (26) and implicit
where T 0 is the sampling time and N is an integer. If the manifold (28) in deviation form as follows
estimation gain is chosen large enough as k ≥ ln(k1βT +1/2k)
, ∂π(u) ˙
0 1 x̃˙ = f (x̃ + π(ũ + u∗ ), ũ + u∗ ) − 1 ũ
then (1 − e −kβ(t−t0 ) k1 +1
)k ≥ 2 , ∀t > t0 ≥ 0. For the ∂u
∂h(x)
given k, there exist strictly positive constants ka , kb and 1 ż(t) = −k 2 f (x, u) − k 2 1 z(t) + 1 φT θ̇(t)
k 0 = min {ka , kb } such that ∂x
2 ∂h(π(u)) ∂π(u) ∂h(x) ∂π(u) (29)
1 2 1 2 +k 1 u̇ − u̇
V̇(θ̃,ũ) ≤ −ka θ̃T (t)θ̃(t) − kb ũT ũ + L + D ∂x ∂u ∂x ∂u
2k θ 2k2 (25) ũ˙ = −k1 θ̂(t) + d(t)
1 1 ˙
≤ −k 0 V(θ̃,ũ) + L2θ + D2 θ̃(t) = −k 2 Σ(t)[δ̇ + kδ] + θ̇(t)
2k 2k2
It is assumed that the gain k is such that
It follows that θ̃ and ũ converge exponentially to a small
1 2
neighborhood of the origin. The size of this neighborhood k 2 = ks2 + 1 kf
9132
19th IFAC World Congress
Cape Town, South Africa. August 24-29, 2014
u1
operates following the slower time-scale with gain ks . 2.2
˙ 2
θ̃(t) = −k Σ(t)[δ̇ + kδ] + θ̇(t)
u2
4
The boundary layer system is given by
dx̃ 3.5
= f (x̃ + π(ũ + u∗ ), ũ + u∗ ) 0 0.5 1 1.5 2 2.5
Time (sec)
3 3.5 4 4.5 5
dτ
dz ∂h(x̃ + π(ũ + u∗ ))
= −kf2 z − k 2 Fig. 1. Trajectories of the control inputs
dτ ∂x (31)
∂h(π(ũ + u∗ ))
f (x̃ + π(ũ + u∗ ), ũ + u∗ ) + kf2 ( optimization problem. The size of the neighborhood de-
∂x pends on the choice of the gains k, k1 and the magnitude
∂π(ũ + u∗ ) ˙ ∂h(x̃ + π(ũ + u∗ )) ∂π(ũ + u∗ ) ˙
ũ − ũ) of the dither signal d(t).
∂u ∂x ∂u
Assumption 6. The origin of the nonlinear system (1) is The proof is given in Appendix A.
locally exponentially stable ∀u ∈ U. Let χr = {x̃ ∈
Rn | kx̃k ≤ r} for r > 0, a positive constant. Similarly, 5. SIMULATION EXAMPLE
let Er = {z ∈ R | |z| ≤ r}.
Assumption 7. The vector field f (x̃ + π(ũ + u∗ ), ũ + u∗ ) is The unknown static input-output map is given by Ghaffari
such that et al. [2012] as
kf (x̃ + π(ũ + u∗ ), ũ + u∗ )k ≤ Lf kx̃k
T
1 2 100 30 2
y = 100 + u− u−
∀x̃ ∈ χ and ∀u ∈ U where Lf > 0 is a positive constant. 2 4 30 20 4
Assumption 8. The cost function h(x) is such that The objective is to minimize the output y with respect to
∂h(x̃ + π(ũ + u∗ )) input u. We apply the proposed extremum-seeking
√ control
≤ Lh algorithm with b = 1.1, k = 10 10, k1 = 0.3, and
∂x arbitrary dither signal
∂h(x̃ + π(u)) ∂h(ỹ + π(u))
− ≤ Lh kx̃ − ỹk sin(10t)
∂x ∂x d(t) = 0.002 .
sin(20t)
∀x̃, ỹ ∈ χ and ∀u ∈ U where Lh > 0 is a positive constant.
The initial conditions are chosen as u(0) = [2.5, 5]T and
Finally, we make the following assumption concerning the θ̂(0) = [10, 10]T , where θ̂(t) is estimation of the unknown
steady-state map, π(u). gradient.
Assumption 9. The steady-state map π(u) is such that
The simulation results are shown in Figs. 1-3. The
∂π(u) extremum-seeking control input trajectories re shown in
≤ Lπ
∂u Fig. 1. The changes of the unknown cost function y
∀u ∈ U where Lπ > 0 is a positive constant. is depicted in Fig. 2. The results demonstrate that the
proposed extremum-seeking control algorithm provides a
By Assumptions 7, 8 and 9, it follows that there exists rapid progression to the unknown minimizer of the opti-
a kf such that the origin of the boundary layer (31) is mization problem. In addition, the control system provides
locally exponentially stable. It then follows that there satisfactory transient behaviour for both the inputs and
exists a Lyapunov function and positive constants, βi for the objective function.
i = 1, ..., 6 such that
2 2 2 2 The normal manifold coordinate variable z(t) is shown in
β1 (kx̃k + |z| ) ≤ V (x̃, z) ≤ β2 (kx̃k + |z| ) Fig. 3. As confirmed in this figure, the implicit manifold
dV dV dx̃ dV dz 2 2 z(t) converges to the small neighbourhood of zero. It is
= + ≤ −β3 kx̃k − β4 |z| important to point out that the value of this variable is
dτ dx̃ dτ dz dτ
not known.
dV dV 2 2
, ≤ β5 kx̃k + β6 |z|
dx̃ dz 6. CONCLUSION
∀u ∈ U, x̃ ∈ χr .
Theorem 4. Consider the nonlinear system (1) and the In this paper, an alternative time-varying ESC technique
cost function (2). Let Assumptions 1 to 9 be fulfilled then was proposed. The technique is based on the time-varying
the time-varying parameter estimation scheme and the estimation of the unknown gradient which relies on defi-
extremum-seeking controller (18) is such that for every nition of an invariant manifold principle. The ESC algo-
1 ∈ (0, ∗ ), the closed-loop system converges to a neigh- rithm is shown to provide local exponential convergence
borhood of the unknown local minimizer of the nonlinear of the closed-loop system to the unknown optimum. The
9133
19th IFAC World Congress
Cape Town, South Africa. August 24-29, 2014
9134