14 - Int.J.Control 2014
14 - Int.J.Control 2014
Haibo Gao, Xingguo Song, Liang Ding, Kerui Xia, Nan Li & Zongquan Deng
To cite this article: Haibo Gao, Xingguo Song, Liang Ding, Kerui Xia, Nan Li & Zongquan Deng
(2014) Adaptive motion control of wheeled mobile robot with unknown slippage, International
Journal of Control, 87:8, 1513-1522, DOI: 10.1080/00207179.2013.878038
As a major representative nonholonomic system, wheeled mobile robot (WMR) is often used to travel across off-road
environments that could be unstructured environments. Slippage often occurs when WMR moves in slopes or uneven terrain,
and the slippage generates large accumulated position errors in the vehicle, compared with conventional wheeled mobile
robots. An estimation of the wheel slip ratio is essential to improve the accuracy of locomotion control. In this paper, we
propose an improved adaptive controller to allow WMR to track the desired trajectory under unknown longitudinal slip,
where the stabilisation of the closed-loop tracking system is guaranteed by the Lyapunov theory. All system states use
neural network online weight tuning algorithms, which ensure small tracking errors and no loss of stability in robot motion
with bounded input signals. We demonstrate superior tracking results using the proposed control method in various Matlab
simulations.
Keywords: nonholonomic systems; wheeled mobile robot (WMR); radial basis function (RBF); neural networks (NN); slip
ratio; Lyapunov theory
∗
Corresponding author. Email: [email protected]
C 2014 Taylor & Francis
1514 H. Gao et al.
According to (2) and (3), it is possible to find an auxiliary Substituting (9) into (6), we can rewrite the kinematic
vector time function v(t) ∈ R 2 such that for all t equation by
⎡ ⎤ ⎡ ⎤
q̇ = S(q)v(t). (4) ẋc (1 − s1 )cθ (1 − s2 )cθ
⎢ ⎥ 1⎢ ⎥ vR
q̇ = ⎣ ẏc ⎦ = ⎣ (1 − s1 )sθ (1 − s2 )sθ ⎦ ,
2 vL
It follows that S(q) can be given by: θ̇ 1 − s1 /L 1 − s2 /L
(11)
⎡ ⎤ where cθ = cos θ , sθ = sin θ ,
cos θ 0
⎢ ⎥ ⎡ ⎤
S(q) = ⎣ sin θ 0⎦. (5) (1 − s1 )cθ (1 − s2 )cθ
1⎣
0 1 S̄ = (1 − s1 )sθ (1 − s2 )sθ ⎦ . (12)
2 1 − s /L 1 − s2 /L
1
vR (1 − s1 ) + vL (1 − s2 )
v= , (7) 2.2 Structural properties of a mobile robot
2
vR (1 − s1 ) − vL (1 − s2 ) The dynamics of system (1) is now transformed into a
ω= , (8) more appropriate representation for control purposes. Using
2L
⎡ ⎤ Equation (12), the dynamics Equation (1) can be rewritten
v 1 − s1 1 − s2 v by (Fierro & Lewis, 1998):
1⎣
1 − s1 1 − s2 ⎦
R
= , (9)
ω 2 vL
L −L q̇ = S̄(q)υ(t)
M̄(q)υ̇ + V̄ (q, q̇)υ + F̄(υ) + Ḡ(υ) + τ̄ d = τ̄ ,
where vector υ = [vR vL ]T , vR and vL are the driving veloc- (14)
ities of the right and left wheels, respectively, which can be
measured by encoders. 2L is the right- and left-wheel tread,
where M̄ = S̄ M S̄, V̄ = S̄ (M S̄˙ + V S̄), F̄ = S̄ F, Ḡ =
T T T
and s1 and s2 denote the slip ratios defined as follows:
T T T
S̄ G, τ̄ d = S̄ τ d , τ̄ = S̄ Bτ and τ̄ d ≤ bd with bd as an
upper bound of system disturbances.
vR − vR vL − vL
s1 = , s2 = , (10)
vR vL Property 1: M̄ is a symmetric positive definite, and the
˙ − 2 V̄ is skew-symmetric (Fierro & Lewis, 1998):
matrix M̄
where vR and vL are the factual/current velocity of the i.e.,
right and left wheels, respectively, while s1 and s2 are the
slip ratios of the right and left wheels, respectively. If no
slippage occurs, the current velocity is equal to the driving
˙ − 2 V̄ )x = 0, ∀x = 0.
x T ( M̄
velocity (vR = vR , vL = vL ), and slip ratios s1 = s2 = 0.
1516 H. Gao et al.
where wi are the adjustable weights that link the output An auxiliary velocity control law input that achieves track-
nodes with the appropriate hidden neurons. These weights ing for (3) is given by (Kanayama et al., 1990):
in the output layer can be learnt using the least-squares
method.
vr cos θe + k1 xe
In this study, the RBF neural network is employed for vc = (20)
ωr + k2 vr ye + k3 vr sin θe
nonlinear function approximation. We assume a special case
of the linear regression model as follows:
where (k1 , k2 , k2 ) > 0 are the feedback gains of xe , ye , and
θ e , respectively.
g(θ) = W φ(θ ) + ε(θ ), (16) However, we need to compensate for the loss of velocity
caused by wheel slippage while maintaining the desired
W = [w0 , w1 , . . . , wL ] , velocity of the WMR when we design the controller.
Therefore, according to (7), (8) and (20), based on
φ(θ) = [1, φ1 (θ), . . . , φL (θ)]T , (17) the estimation of slip ratios, we can recover an auxiliary
International Journal of Control 1517
velocity control law input υ c = [v1 , v2 ]T by where w ∈ R (L+1)×2 is the vector of the ideal threshold and
v +ω L their weights. The bounds for w and ε(θ ) are expressed as
c c
v 1−ŝ1
υc = 1 = vc −ωc L
, (21) w ≤ bw and |ε(θ )| ≤ bε ∀θ . (28)
v2
1−ŝ2
where v1 and v2 are the auxiliary velocity control law input Then, an estimate of g(θ ) can be given by
of the right and left wheels, respectively; ŝ1 and ŝ2 are the
estimated slip ratios of the right and left wheels, which can ĝ(θ) = ŵ T φ(θ ) (29)
be detected by equality (13). From literature (Ding et al.,
2010), we know slip ratio is defined by s ∈ [−1, 1], when where ŵ ∈ R (L+1)×2 is the vector of the estimated threshold
wheels are slipping on original location and not able to and weights.
move forward/backward, in this case, slip ratio s = 1 or −1. The control law is given by
Therefore, for tracking object, we should assume ŝ = ±1.
After slip-compensation, the velocity tracking errors of τ̄ = ĝ + K ec + lsgn(ec ), (30)
the wheels are defined as follows:
where ĝ is regarded as the output torque of NN controller;
e Kec is the torque controller, and, K = diag[k4 , k5 ] is a
ec = 1 = υ c − υ. (22) positive definite matrix; lsgn(ec ) is a robust term to suppress
e2
the effect of disturbance and approximate errors.
Differentiating (22), using (14), the mobile robot dynamics Let the NN weights be further adjusted to minimise the
may be written in terms of the tracking errors as velocity tracking error. The adaptive law of ŵ is designated
as:
M̄(q)ėc = − V̄ (q, q̇)ec − τ̄ + g(x) + τ̄ d , (23)
ŵ˙ i = φeTc − κ ec ŵi , i = 1, 2, . . . , L, (31)
where the important nonlinear mobile robot function is de-
fined as where ŵ = [ŵ1 . . . ŵL ], L is the number of NN hidden
layer, and learning law κ is a positive constant. The simul-
g(x) = M̄ υ̇ c + V̄ (q, q̇)υ c + F̄(υ) + Ḡ(υ). (24) taneous updates of the adaptive law may be suitable for
non-stationary conditions or online settings.
Here, the vector x required to compute g(x) can be defined The control law (30) can be rewritten as:
as
τ̄ = ŵ T φ(θ) + K ec + lsgn(ec ) (32)
T T
x≡ υ Tc υ̇ Tc υ . (25)
and the parameter l is defined as
Function g(x) contains all the mobile robot parameters, such
as mass, moments of inertia, and friction coefficients. How- 1
ever, nonlinear function g(x) is often imperfectly known in l ≥ bε + bd + κbw2 +
, (33)
4
applications and it is difficult to determine. In this paper,
RBFNN is used to approximate it. which is related to the bounds described by (28), the pa-
rameter κ in (31), and a strictly positive constant ε.
The neural network (15) for (24) is redefined as g(θ ): R6 → where g̃ = g − ĝ is the function estimation error. This es-
R2 , which is rewritten as: timation error is expressed according to (27) and (29) as
where w̃ is the vector of the threshold and weight estimation ≤ −K ec 2 − l ec + ec (ε + τ̄ d )
errors, defined as − κ ec w̃2 + κ ec w̃ bw
= −K ec 2 − l ec + ec (bε + bd )
w̃ = w − ŵ. − κ ec (w̃2 − w̃ bw )
1 1
Therefore, (33) can be written as follows: − κbw2 ec + κbw2 ec
4 4
= −K ec 2 − l ec + ec (bε + bd )
M̄ėc = − (K + V̄ )ec +w̃ T φ(θ )
(36) 1 1
+ε(θ )+τ̄ d − lsgn(ec ). − κ ec (w̃2 − w̃ bw + bw2 ) + κbw2 ec
4 4
It is necessary to develop the adaptive NN control (NNC)
= −K ec 2 − ec l − bε − bd
given in Equation (32) by considering the system stability,
so that both the position and the velocity tracking errors 2
converge to zero. Consider the following Lyapunov func- 1 1 2
+ κ w̃ − bw − bw
tional candidate for the proposed controller: 2 4
1
V = V1 (ec , t) + V2 (em , t) ≤ −K ec 2 − ec
+κ w̃ − bw )2 , (38)
2
and ˙ − 2 V̄ )x = 0,
where inequality (33) and property 2, x T ( M̄
1 T 1 ∀x = 0 are used, and tr[] denotes the trace of matrix (Horn
V1 (ec , t) = ec M̄ ec + [(w − ŵ)T (w − ŵ)], (37) & Johnson, 1985).
2 2
With (38), V̇1 ≤ 0 is guaranteed to be negative. This
where matrix M̄ is a symmetric positive definite, so that shows V1 →0 and implies ec (t) → 0 and ėc (t) → 0 as
V1 ≥ 0. Differentiating Equation (37) with respect to time t → ∞. The tracking velocity convergence is guaranteed by
as the control law (32), which implies υ → υ c and υ̇ → υ̇ c
as t → ∞.
V̇1 (ec , t) We consider the other positive definite Lyapunov func-
1 ˙ tional candidate:
= eTc M̄ ėc + eTc M̄ ˙
ec +tr[(w − ŵ)T ŵ]
2
1 2
= eTc (− V̄ ec + w̃T φ − K ec − lsgn(ec ) + ε + τ̄ d ) V2 (em , t) = (x + ye2 ) + (1 − cos θe )/k2 . (39)
2 e
1 ˙
+ eTc M̄ ec − tr w̃T φeTc − κ ec ŵ
2
1 T ˙
= ec ( M̄ − 2 V̄ )ec + eTc w̃T φ − eTc K ec
2
− leTc sgn(ec ) + eTc (ε + τ̄ d )
− tr(w̃T φeTc − κ w̃T ec ŵ)
= eTc w̃T φ − K ec 2 − l ec + eTc (ε + τ̄ d )
− tr(eTc w̃T φ) + tr[κ w̃ ec (w − w̃)] Figure 4. Reference circle trajectory.
International Journal of Control 1519
Differentiation yields (Kanayama et al., 1990): 1991), both the system stability and tracking convergence
are guaranteed by the control law (32) driving the system
V̇2 (em , t) = −k1 xe2 − k3 sin2 θe /k2 ≤ 0. (40) (14), which closely tracks the desired motion trajectories.
Figure 5. Artificial NNC without compensation (left). Adaptive NNC with compensation (right).
1520 H. Gao et al.
Figure 5. (Continued)
wheel velocities are influenced by the slippage, necessi- NN controller with slip-compensation under slippage. We
tating the analysis using an estimated slip ratio for move- adopt vehicle parameters (Figure 1) as follows: m = 5 kg,
ment control. According to the definition of the slip ratio I = 4 kgm2 , L = 0.2 m, r = 0.15 m, under the time varying
(Ding et al., 2010), we know that the range of slip ratios is external disturbance τ d = (sin t, cos t, 1)T N, vr = 1 m/s,
from −1 to 1, the change of slip ratios can resemble a sine ωr = 1/3rad/s. The objective is to track the trajectory such
curve, and the slip ratios will increase with increasing slope. that the errors in the position and velocity tend to zero. The
Therefore, we can assume as follows. controller gains were selected so that the closed-loop sys-
Let the desired trajectory be a space circular ring, xr = tem exhibits a critical damping behaviour: k1 = 2, k2 =9,
3 cos(t/3), yr = 3 sin(t/3), zr = −0.4 cos(2t/3), and t ∈ k3 = 6, k4 = 20, k5 = 10. In the neural network, we selected
[0, 30]s and where the initial position is Q = (2, −1, −0.4) a radial basis function with Nh = 6 hidden-layer neurons
(Figure 4). and κ = 0.35.
The slip ratio could be considered as the time variable
function ‘0.2 sin (2t/3)’, which increases or decreases with
the slope angle of the reference trajectory. 5.1 Artificial NN control under slippage
We now demonstrate the adaptive NN control shown The response of artificial NN control (NNC) is described
in Figure 3. Here, we compare its performance with slip- under the tangential slip of the wheels, as shown (a1-g1)
compensation and without slip-compensation when slip in Figure 5. If we do not compensate the slippage or even
exits between the wheels and the ground. Two control neglect the effect of slippage in velocity control, these re-
performances were implemented and tested using Mat- sults of the tracking trajectory present large position and
lab Simulink models: A. Artificial NN controller (Jolly, velocity error changes. Obviously, these results are not the
Kumar, & Vijayakumar, 2009) under slippage; B. Adaptive desired precision for tracking control.
International Journal of Control 1521
Figure 6. Adaptive NNC with compensation. (i1) Real slip ratio of left wheel. (i2) Real slip ratio of right wheel.
Li, Z.J., Li, J.X., & Kang, Y. (2010). Adaptive robust coordinated Ray, L.R., Brande, D.C., & Lever, J.H. (2009). Estimation of net
control of multiple mobile manipulators interacting with rigid traction for differential-steered wheeled robots. Journal of
environments. Automatica, 46, 2028–2034. Terramechanics, 46, 75–87.
Li, Z.J., & Xu, C.Q. (2009). Adaptive fuzzy logic control of dy- Slotine, J-J.E., & Li, W. (1991). Applied nonlinear control. Engle-
namic balance and motion for wheeled inverted pendulums. wood Cliffs, NJ: Prentice-Hall.
Fuzzy Sets and Systems, 160, 1787–1803. Sun, T., Pei, H.L., Pan, Y.P., & Zhang, C.H. (2013). Ro-
Li, Z.J., & Yang, C.G. (2012). Neural-adaptive output feedback bust adaptive neural network control for environmen-
control of a class of transportation vehicles based on wheeled tal boundary tracking by mobile robots. International
inverted pendulum models. IEEE Transactions on Control Journal of Robust and Nonlinear Control, 23, 123–
Systems Technology, 20, 1583–1591. 136.
Liaw, H.C., Farrell, J.A., & Schaal, S. (2009). Robust neural Theodoridis, D., Boutalis, Y., & Christodoulou, M. (2011). A
network motion tracking control of piezoelectric actuation new adaptive Neuro-Fuzzy controller for trajectory tracking
systems for Micro/Nanomanipulation. IEEE Transactions on of robot manipulators. International Journal of Robotics and
Neural Networks, 20, 356–367. Automation, 26, 1–12.
Moosavian, S.A.A., & Kalantari, A. (2008). Experimental slip Wang, Z.P., Ge, S.S. Lee, T.H., & Lai, X.C. (2006). Adaptive smart
estimation for exact kinematics modeling and control of a neural network tracking control of wheeled mobile robots.
tracked mobile robot. Proc. IEEE/RSJ International Confer- ICARCV.
ence on Intelligent Robots and Systems, Nice, France, Sept Wong, J.Y. (2001). Theory of ground vehicles. Hoboken, NJ: John
22-26, 95–100. Wiley & Sons.