Chapter 13

Consensus in Multi-agent Systems
13.1 Introduction
This chapter brings together game theory and consensus in multi-agent systems. A multi-
agent system involves n dynamic agents; these can be vehicles, employees, or computers,
each one described by a differential or difference equation. The interaction is modeled
through a communication graph. In a consensus problem the agents implement a dis-
tributed consensus protocol, i.e., distributed control policies based on local information.
The goal of a consensus problem is to make the agents reach consensus, that is, to converge
to the same value, called the consensus value.
The core message in this chapter is that the consensus problem can be turned into a
noncooperative differential game, where the dynamic agents are the players. To do this, we
formulate a mechanism design problem where a supervisor “designs” the objective func-
tions such that if the agents are rational and use their best-response strategies, then they
converge to a consensus value. We illustrate the results by simulating the vertical align-
ment maneuver of a team of unmanned aerial vehicles (UAVs).
Unfortunately, solving the mechanism design problem is a difficult task, unless the
problem can be modeled as an affine quadratic game. Given such a game, the main idea
is then to translate it into a sequence of more tractable receding horizon problems. At
each discrete time tk , each agent optimizes over an infinite planning horizon T → ∞
and executes the controls over a one-step action horizon δ = tk+1 − tk . The neighbors’
states are kept constant over the planning horizon. At time tk+1 each agent reoptimizes its
controls based on the new information on its neighbors' states that has become available.
We then take the limit for δ → 0.
This chapter is organized as follows. Section 13.2 formulates the consensus problem
(Problem 13.1) and the mechanism design problem (Problem 13.2). Section 13.3 provides
a solution to the consensus problem. Section 13.4 addresses the mechanism design problem.
Section 13.5 illustrates the results on a case study involving a team of UAVs performing
a vertical alignment maneuver. Finally, Section 13.6 provides notes and references on
the topic.
13.2 Consensus and mechanism design problems

Consider a network of n dynamic agents described by an undirected and connected graph G = (Γ, E), where Γ = {1, ..., n} is the set of vertices, each representing an agent, and E is the set of edges; such a network describes the interactions between pairs of agents. By undirected we mean that if (i, j) ∈ E, then (j, i) ∈ E. By connected we mean that for any vertex i ∈ Γ there exists a path in E that connects i with any other vertex j ∈ Γ. Recall that a path from i to j is a sequence of edges (i, k_1)(k_1, k_2) ... (k_r, j) in E. Note that in general the network G is not complete; that is to say, each vertex i has a direct link only to a subset of other vertices, denoted by N_i = { j : (i, j) ∈ E }. This subset is referred to as the neighborhood of i.

The interpretation of an edge (i, j) in the edge set E is that the state of vertex j is available to vertex i. As the network is undirected, communication is bidirectional; namely, the state of agent i is also available to agent j.
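As a small illustration of this representation, here is a minimal sketch in Python of how the neighborhoods N_i can be built from an undirected edge list; the 4-vertex ring used below is purely illustrative.

```python
from collections import defaultdict

def neighborhoods(vertices, edges):
    """Build N_i = {j : (i, j) in E} from a list of undirected edges;
    each edge is stored in both directions, so (i, j) in E implies (j, i) in E."""
    N = defaultdict(set)
    for i, j in edges:
        N[i].add(j)
        N[j].add(i)  # undirected: state information flows both ways
    return {i: sorted(N[i]) for i in vertices}

# Illustrative 4-agent ring; any connected topology works.
print(neighborhoods(range(4), [(0, 1), (1, 2), (2, 3), (3, 0)]))
# -> {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [0, 2]}
```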
Let x_i be the state of agent i. The evolution of x_i is determined by the following first-order differential equation, driven by a distributed and stationary control policy:

    ẋ_i(t) = u_i(x_i(t), x^{(i)}(t)),   i ∈ Γ,                      (13.1)

where x^{(i)} represents the vector collecting the states of only the neighbors of i. In other words, for the jth component of x^{(i)} we have

    x^{(i)}_j = x_j  if j ∈ N_i,
    x^{(i)}_j = 0    otherwise.
The control policy is distributed, as the control ui depends only on xi and x (i ) . The
control policy is stationary, as there is no explicit dependence of ui on time t . Occasion-
ally, we also call such a control policy time invariant. Let the state of the collective system
be defined by the vector x(t ) = {xi (t ), i ∈ Γ }, and let the initial state be x(0). Similarly,
denote by u(x) = {ui (xi , x (i ) ) : i ∈ Γ } the collective control vector, which we occasionally
call simply protocol. Fig. 13.1 depicts a possible network of dynamic agents. In the graph,
for some of the vertices, we indicate the corresponding dynamics.
The consensus problem consists in determining how to make the players reach agreement on a so-called consensus value. To give a precise definition of such a value, consider a function χ̂ : ℝ^n → ℝ. This function is a generic continuous and differentiable function of n variables x_1, ..., x_n which is permutation invariant. In other words, for any permutation σ(.) from the set Γ to the set Γ, the function satisfies

    χ̂(x_1, ..., x_n) = χ̂(x_{σ(1)}, ..., x_{σ(n)}).                  (13.2)

Reaching consensus means that the collective system converges to χ̂(x(0))·1, where 1 denotes the n-dimensional vector of all ones, and where the consensus value satisfies

    min_{i∈Γ} x_i(0) ≤ χ̂(x(0)) ≤ max_{i∈Γ} x_i(0).

In other words, the consensus value is a point in the interval from the minimum to the maximum values of the agents' initial states.
In preparation for the formulation of the consensus problem as a game, let us also introduce a cost functional for agent i, displayed below:

    J_i(x_i, x^{(i)}, u_i) = lim_{T→∞} ∫_0^T [ F(x_i, x^{(i)}) + ρ u_i² ] dt,        (13.3)

where ρ is a positive weight on the control effort and the penalty F(x_i, x^{(i)}) accounts for the deviation of player i from his neighbors. With the above cost functional in mind, a protocol is said to be optimal if each control u_i minimizes the corresponding cost functional J_i. Fig. 13.2 depicts a network of dynamic agents and the cost functionals corresponding to different agents.
Figure 13.2. Network of dynamic agents with the cost functionals J_i(x_i(t), x^{(i)}(t), u_i(t)) assigned to the players.
After this preamble, the problems under study can be stated as follows.

Problem 13.1 (Consensus). Given the network G = (Γ, E) and the dynamics (13.1), find a distributed and stationary protocol u(x) such that the agents asymptotically reach consensus on the value χ̂(x(0)), i.e., lim_{t→∞} x(t) = χ̂(x(0))·1.

Problem 13.2 (Mechanism design). Given the network G = (Γ, E) and the dynamics (13.1), find a penalty function F(.) and a protocol u(.) such that u(.) is optimal for the cost functionals (13.3) and solves Problem 13.1.

Note that a pair (F(.), u(.)) which is a solution to Problem 13.2 must guarantee that all cost functionals in (13.3) converge to a finite value. For this to be true, the integrand in (13.3) must be null at the consensus state χ̂(x(0))·1.
13.3 A solution to the consensus problem

Assumption 13.1 (Structure of χ̂(.)). Assume that the agreement function χ̂(.) verifies (13.2) and is such that χ̂(x) = f( Σ_{i∈Γ} g(x_i) ) for some f, g : ℝ → ℝ with dg(x_i)/dx_i ≠ 0 for all x_i.
It is worth noting that the class of agreement functions contemplated in the above assumption covers any value in the range between the minimum and the maximum of the initial states. This is clear if we look at Table 13.1 and note that, to span the whole interval, we can simply consider the mean of order p and let p vary between −∞ and ∞.
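For instance, following the structure of Assumption 13.1 with g(x) = x^p and f(s) = (s/n)^{1/p} (for positive initial states), the mean of order p reads

    χ̂(x) = [ (1/n) Σ_{i∈Γ} x_i^p ]^{1/p},

which gives the harmonic mean for p = −1, the geometric mean in the limit p → 0, the arithmetic mean for p = 1, and tends to min_{i∈Γ} x_i and max_{i∈Γ} x_i as p → −∞ and p → +∞, respectively.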
Theorem 13.1 (Solution to the Consensus Problem). The following protocol is a solution to the consensus problem:

    u_i(x_i, x^{(i)}) = α (1/(dg/dx_i)) Σ_{j∈N_i} φ̂( ϑ(x_j) − ϑ(x_i) )   for all i ∈ Γ,    (13.4)

where

• the parameter α > 0, and the function φ̂ : ℝ → ℝ is continuous, locally Lipschitz, odd, and strictly increasing;

• the function ϑ : ℝ → ℝ is differentiable, with dϑ(x_i)/dx_i locally Lipschitz and strictly positive;

• the function g(.) is strictly increasing, that is, dg(y)/dy > 0 for all y ∈ ℝ.
Proof. For each i ∈ Γ, define the disagreement variable η_i = g(x_i) − (1/n) Σ_{j∈Γ} g(x_j(0)), collect these in the vector η, and consider the candidate Lyapunov function V(η) = ½ Σ_{i∈Γ} η_i². We have V(η) = 0 if and only if η = 0. In addition, V(η) > 0 for all η ≠ 0. Our goal is to show that V̇(η) < 0 for all η ≠ 0. To this purpose, let us first rewrite V̇(η) as follows:

    V̇(η) = Σ_{i∈Γ} η_i η̇_i = Σ_{i∈Γ} η_i (dg(x_i)/dx_i) ẋ_i.                      (13.5)
Now, from (13.4) we can rewrite (13.5) as

    V̇(η) = Σ_{i∈Γ} η_i (dg(x_i)/dx_i) u_i
         = Σ_{i∈Γ} η_i (dg(x_i)/dx_i) α (1/(dg/dx_i)) Σ_{j∈N_i} φ̂(ϑ(x_j) − ϑ(x_i))    (13.6)
         = α Σ_{i∈Γ} Σ_{j∈N_i} η_i φ̂(ϑ(x_j) − ϑ(x_i)).

Now, by noting that j ∈ N_i if and only if i ∈ N_j for each i, j ∈ Γ, and pairing the terms associated with (i, j) and (j, i) (so that each undirected edge is counted once), from (13.6) we can rewrite

    V̇(η) = −α Σ_{(i,j)∈E} ( g(x_j) − g(x_i) ) φ̂( ϑ(x_j) − ϑ(x_i) ).              (13.7)
From (13.7) we conclude that V̇ (η) ≤ 0 for all η and, more specifically, V̇ (η) = 0 only
for η = 0. To see this, observe that for any (i, j ) ∈ E, x j > xi implies g (x j ) − g (xi ) > 0,
ϑ(x j ) − ϑ(xi ) > 0, and φ̂(ϑ(x j ) − ϑ(xi )) > 0. This is true, as α > 0 and g (.), φ̂(.), and
ϑ(.) are strictly increasing. Therefore we have α( g (x j ) − g (xi ))φ̂(ϑ(x j ) − ϑ(xi )) > 0 if
x j > xi . A similar argument can be used if x j < xi .
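It is worth making explicit why the agents can only agree on χ̂(x(0)). Since φ̂(.) is odd and the network is undirected, the quantity Σ_{i∈Γ} g(x_i) is invariant along the trajectories of (13.4):

    (d/dt) Σ_{i∈Γ} g(x_i) = Σ_{i∈Γ} (dg(x_i)/dx_i) ẋ_i = α Σ_{i∈Γ} Σ_{j∈N_i} φ̂(ϑ(x_j) − ϑ(x_i)) = 0,

because the terms associated with the edge pairs (i, j) and (j, i) cancel by the oddness of φ̂. Hence χ̂(x(t)) = f( Σ_{i∈Γ} g(x_i(t)) ) is constant, and since a consensus state c·1 yields χ̂(c·1) = c, the only value on which the agents can agree is χ̂(x(0)).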
13.4 A solution to the mechanism design problem

Let the following update times be given: t_k = t_0 + δk, where k = 0, 1, .... Let x̂_i(τ, t_k) and x̂^{(i)}(τ, t_k), τ ≥ t_k, be the predicted states of agent i and of his neighbors, respectively. The problem we wish to solve is the following one.
Problem 13.3 (Receding horizon). For all agents i ∈ Γ and times t_k, k = 0, 1, ..., given the initial states x_i(t_k) and x^{(i)}(t_k), find

    û_i(τ, t_k) = arg min 𝒥_i( x_i(t_k), x^{(i)}(t_k), û_i(τ, t_k) ),

where

    𝒥_i( x_i(t_k), x^{(i)}(t_k), û_i(τ, t_k) ) = lim_{T→∞} ∫_{t_k}^T [ ℱ( x̂_i(τ, t_k), x̂^{(i)}(τ, t_k) ) + ρ û_i²(τ, t_k) ] dτ    (13.8)

subject to the following constraints:

    dx̂_i(τ, t_k)/dτ = û_i(τ, t_k),   τ ≥ t_k,                       (13.9)
    dx̂^{(i)}(τ, t_k)/dτ = 0,         τ ≥ t_k,                       (13.10)
    x̂_i(t_k, t_k) = x_i(t_k),                                       (13.11)
    x̂^{(i)}(t_k, t_k) = x^{(i)}(t_k).                               (13.12)
The above set of constraints involves the predicted state dynamics of agent i and of
his neighbors; see (13.9) and (13.10), respectively.
The constraints also involve the boundary conditions at the initial time tk ; see condi-
tions (13.11) and (13.12). Note that, by setting x̂ (i ) (τ, tk ) = x (i ) (tk ) for all τ > tk , agent i
restrains the states of his neighbors to be constant over the planning horizon.
At t_{k+1}, new information on x^{(i)}(t_{k+1}) becomes available. Then the agents update their best-response strategies, which we refer to as receding horizon control policies. Consequently, for all i ∈ Γ, we obtain the closed-loop system

    ẋ_i(t) = û_i(t, t_k),   t ∈ [t_k, t_{k+1}),   k = 0, 1, ....
The complexity reduction introduced by the method derives from turning Problem 13.3 into n one-dimensional problems. This is a consequence of constraint (13.10), which forces x̂^{(i)} to be constant in (13.8). Further evidence of this derives from rewriting ℱ(.) so as to highlight its dependence on the state x̂_i(τ, t_k) alone. By doing this, the cost functional (13.8) takes the form

    𝒥_i = lim_{T→∞} ∫_{t_k}^T [ ℱ(x̂_i(τ, t_k)) + ρ û_i²(τ, t_k) ] dτ.              (13.13)

Consequently, the problem simplifies, as it involves only the computation of the optimal control û_i(τ, t_k) that minimizes (13.13).
Fig. 13.3 illustrates the receding horizon formulation. Given the dynamics of x_j(t) for all j ∈ N_i (solid line), agent i replaces it by the value measured at time t_k (small circles) and keeps it constant from t_k on (thin horizontal lines).
Figure 13.3. Receding horizon formulation for agent i: at each sampling time (circles) the estimated state of neighbor j, x̂_j(.), is maintained constant over the horizon (thin solid); the actual state x_j(.) changes with time (thick solid).
Let us now use the Pontryagin Minimum Principle to prove that the control û_i(τ, t_k) is a best-response strategy. Before doing this, let the Hamiltonian function be given by

    H(x̂_i, û_i, p_i) = ℱ(x̂_i) + ρ û_i² + p_i û_i,                   (13.14)

where p_i is the co-state. In the above we have dropped the dependence on τ and t_k. After doing this, the Pontryagin necessary conditions yield the following set of equalities:
    Optimality condition:   ∂H(x̂_i, û_i, p_i)/∂û_i = 0 ⇒ p_i = −2ρ û_i.            (13.15)

    Multiplier condition:   ṗ_i = −∂H(x̂_i, û_i, p_i)/∂x̂_i.                        (13.16)

    State equation:         x̂˙_i = ∂H(x̂_i, û_i, p_i)/∂p_i ⇒ x̂˙_i = û_i.           (13.17)

    Minimality condition:   ∂²H(x̂_i, û_i, p_i)/∂û_i² |_{x̂_i=x̂_i*, û_i=û_i*, p_i=p_i*} ≥ 0 ⇒ ρ ≥ 0.    (13.18)

    Transversality condition:   H(x̂_i, û_i, p_i) |_{x̂_i=x̂_i*, û_i=û_i*, p_i=p_i*} = 0 for all τ ≥ t_k.    (13.19)
The boundary condition (13.19) restrains the Hamiltonian to be null along any optimal path {x̂_i*(t), t ≥ 0} (see, e.g., [52, Sect. 3.4.3]).

Recall from Section 9.2.1 that the Pontryagin Minimum Principle yields conditions that are, in general, necessary but not sufficient (see also [52]). However, sufficiency is guaranteed under the following additional assumption of Mangasarian type:

    the Hamiltonian H(x̂_i, û_i, p_i*) is jointly convex in (x̂_i, û_i).             (13.20)
If we impose further restraints on the structure of ℱ(x̂_i), we obtain sufficient conditions that yield a unique optimal control policy û_i(.). This is established in the next result.

Theorem 13.2. Let agent i evolve according to the first-order differential equation (13.1). At times t_k, k = 0, 1, ..., let the agents be assigned the cost functional (13.8), where the penalty is given by

    ℱ(x̂_i(τ, t_k)) = ρ [ (1/(dg/dx_i)) Σ_{j∈N_i} ( ϑ(x_j(t_k)) − ϑ(x̂_i(τ, t_k)) ) ]²,    (13.21)

and where g(.) is increasing, ϑ(.) is concave, and (dg(y)/dy)^{−1} is convex.
Then the control policy

    û_i(τ, t_k) = u_i(x_i(τ)) = α (1/(dg/dx_i(τ))) Σ_{j∈N_i} ( ϑ(x_j(t_k)) − ϑ(x_i(τ)) ),   α = 1,    (13.22)

is the unique best-response strategy of agent i.

Proof. Differentiating (13.15) with respect to time and using (13.16), we obtain

    2ρ û̇_i = ∂H(x̂_i, û_i, p_i)/∂x̂_i.                               (13.23)
Also, from (13.17), we have û̇_i = (∂û_i/∂x̂_i) x̂˙_i = (∂û_i/∂x̂_i) û_i. Then (13.23) yields 2ρ (∂û_i/∂x̂_i) û_i = ∂H(x̂_i, û_i, p_i)/∂x̂_i. After integration, and from (13.19), we have that the solution of (13.23) must satisfy

    ρ û_i² = ℱ(x̂_i).                                               (13.24)

(Indeed, substituting p_i = −2ρ û_i from (13.15) into (13.14) gives H = ℱ(x̂_i) − ρ û_i², so the null-Hamiltonian condition (13.19) is precisely (13.24).) Then it suffices to note that û_i(τ, t_k) = (1/(dg/dx̂_i)) Σ_{j∈N_i} ( ϑ(x_j(t_k)) − ϑ(x̂_i(τ, t_k)) ) verifies the above condition.
To prove uniqueness, let us prove that ℱ(x̂_i) is convex. To this purpose, we can write ℱ = ℱ_3(ℱ_1(x̂_i), ℱ_2(x̂_i)), where ℱ_1(x̂_i) = (∂g/∂x̂_i)^{−1}, ℱ_2(x̂_i) = Σ_{j∈N_i} ( ϑ(x_j(t_k)) − ϑ(x̂_i) ), and ℱ_3 = ( ℱ_1(x̂_i) · ℱ_2(x̂_i) )². As ℱ_3(.) is nondecreasing in each argument, the function ℱ_3(.) is convex if both functions ℱ_1(.) and ℱ_2(.) are convex [64]. Function ℱ_1(.) is convex, as (dg/dx̂_i)^{−1} is convex by hypothesis. Analogously, ℱ_2(.) is convex, as ϑ(.) is concave, and this concludes the proof.
The above theorem also holds for α = −1 if dg/dx_i < 0 for all x_i(0).
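To make the receding horizon scheme concrete, the following is a minimal Python sketch of the closed loop, assuming forward-Euler integration and the closed-form best response (13.22) specialized to g(x) = x and ϑ(x) = x (the arithmetic-mean case of Section 13.5); the topology, step sizes, and horizons are illustrative.

```python
import numpy as np

def best_response(i, xi_hat, snapshot, neighbors):
    # (13.22) with g(x) = x and theta(x) = x:
    # u_i(tau) = sum_{j in N_i} (x_j(t_k) - x_i(tau)), neighbors frozen at t_k.
    return sum(snapshot[j] - xi_hat for j in neighbors[i])

def receding_horizon(x0, neighbors, delta=0.05, updates=200, substeps=10):
    """At each update time t_k, freeze the neighbors' states (constraint (13.10))
    and apply the best response over the action horizon [t_k, t_{k+1})."""
    x = np.asarray(x0, dtype=float)
    h = delta / substeps
    for _ in range(updates):
        snapshot = x.copy()            # x^{(i)}(t_k), measured at time t_k
        for _ in range(substeps):      # integrate x_i' = u_i over one delta
            u = np.array([best_response(i, x[i], snapshot, neighbors)
                          for i in range(x.size)])
            x = x + h * u
    return x

ring = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [0, 2]}  # illustrative topology
print(receding_horizon([5.0, 5.0, 10.0, 20.0], ring))  # ~ the arithmetic mean
```

As the corollary below makes precise, letting δ → 0 recovers the protocol (13.27) exactly; for finite δ the scheme is only an approximation of it.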
From the above theorem, we can derive the following corollary.
Corollary 13.3. Let a network of dynamic agents G = (Γ, E) be given. Assume that the agents evolve according to the first-order differential equation (13.1). At times t_k, k = 0, 1, ..., let the agents be assigned the cost functional (13.8), where the penalty is given by

    ℱ(x̂_i(τ, t_k)) = ρ [ (1/(dg/dx_i)) Σ_{j∈N_i} ( ϑ(x_j(t_k)) − ϑ(x̂_i(τ, t_k)) ) ]²,    (13.25)

and where g(.) is increasing, ϑ(.) is concave, and (dg(y)/dy)^{−1} is convex. If we take δ → 0, then we have
(i) the penalty function

    ℱ(x̂_i(τ, t_k)) → F(x_i, x^{(i)}) = ρ [ (1/(dg/dx_i)) Σ_{j∈N_i} ( ϑ(x_j) − ϑ(x_i) ) ]²    (13.26)

and

(ii) the applied receding horizon control law

    u_i^{RH}(τ) → u_i(x_i, x^{(i)}) = (1/(dg/dx_i)) Σ_{j∈N_i} ( ϑ(x_j) − ϑ(x_i) ).    (13.27)
The above corollary provides a solution to the mechanism design problem (Problem 13.2). To see this, imagine that a game designer wishes the agents to asymptotically reach consensus on the consensus value χ̂(x) = f( Σ_{i∈Γ} g(x_i) ). He can accomplish this by assigning the agents the cost functional (13.3), where the penalty is as in (13.26), and where g(.) is increasing, (dg(y)/dy)^{−1} is convex, and δ is "sufficiently" small.
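As a concrete instance of this recipe (anticipating the second scenario of Section 13.5), to target the geometric mean the designer can take g(x) = ln x and f(s) = e^{s/n}, so that χ̂(x) = ( Π_{i∈Γ} x_i )^{1/n}. Since 1/(dg/dx_i) = x_i, choosing ϑ(x) = x (concave) gives the penalty and best response

    F(x_i, x^{(i)}) = ρ [ x_i Σ_{j∈N_i} (x_j − x_i) ]²,   u_i(x_i, x^{(i)}) = x_i Σ_{j∈N_i} (x_j − x_i),

which agree with protocol (13.29) below.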
13.5 Numerical example: Team of UAVs

Figure 13.4. Communication network of the team of four UAVs, with vertices v1, v2, v3, v4.
Let us now illustrate the results on a team of four UAVs. The UAVs are initially at different heights, and they are performing a vertical alignment maneuver in longitudinal flight. Each vehicle controls its vertical rate on the basis of the neighbors' heights. The UAVs interact as described by the communication network depicted in Fig. 13.4. The goal of the mission is to make the UAVs reach consensus on the formation center. We analyze four different vertical alignment maneuvers, where the formation center is the (i) arithmetic mean, (ii) geometric mean, (iii) harmonic mean, and (iv) mean of order 2 of the initial heights of all UAVs. Set the initial heights as x(0) = (5, 5, 10, 20)^T.
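For these initial heights the four target formation centers are readily computed: the arithmetic mean is (5 + 5 + 10 + 20)/4 = 10; the geometric mean is (5 · 5 · 10 · 20)^{1/4} = 5000^{1/4} ≈ 8.41; the harmonic mean is 4/(1/5 + 1/5 + 1/10 + 1/20) = 4/0.55 ≈ 7.27; and the mean of order 2 is √((5² + 5² + 10² + 20²)/4) = √137.5 ≈ 11.73.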
Simulations are performed using the following algorithm.
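A minimal Python sketch of such a simulation integrates the closed-loop dynamics ẋ_i = u_i under the best-response protocols (13.28)–(13.31) by forward Euler; the ring topology, step size, and horizon below are illustrative assumptions (the actual network is the one of Fig. 13.4, and the time scales differ across protocols; cf. Fig. 13.5).

```python
import numpy as np

ring = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [0, 2]}  # illustrative topology

def protocol(kind, x, i, neighbors):
    """Best responses (13.28)-(13.31), as printed below."""
    s = sum(x[j] - x[i] for j in neighbors[i])
    if kind == "arithmetic":
        return s                      # (13.28)
    if kind == "geometric":
        return x[i] * s               # (13.29)
    if kind == "harmonic":
        return -x[i] ** 2 * s         # (13.30)
    if kind == "order2":
        return s / (2.0 * x[i])       # (13.31)
    raise ValueError(kind)

def simulate(kind, x0=(5.0, 5.0, 10.0, 20.0), delta=1e-4, horizon=5.0):
    x = np.array(x0)
    for _ in range(int(horizon / delta)):
        u = np.array([protocol(kind, x, i, ring) for i in range(x.size)])
        x = x + delta * u             # forward-Euler step of x_i' = u_i
    return x

for kind in ("arithmetic", "geometric", "order2"):
    print(kind, simulate(kind))       # compare with the means computed above
```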
In the first simulation, the UAVs are assigned the cost functional (13.3), where the penalty is F(x_i, x^{(i)}) = [ Σ_{j∈N_i} (x_j − x_i) ]². The UAVs use their best responses

    u(x_i, x^{(i)}) = Σ_{j∈N_i} (x_j − x_i),                        (13.28)

and, as a result, they asymptotically reach consensus on the arithmetic mean of x(0). We illustrate this in Fig. 13.5(a).
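Indeed, under (13.28) the sum Σ_{i∈Γ} x_i is invariant, since d/dt Σ_{i∈Γ} x_i = Σ_{i∈Γ} Σ_{j∈N_i} (x_j − x_i) = 0 by the symmetry of E, so the common limit must be the arithmetic mean (here 10). This is the g(x) = x instance of the general invariance argument of Section 13.3.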
In the second simulation, the UAVs are assigned a cost functional where the penalty is F(x_i, x^{(i)}) = [ x_i Σ_{j∈N_i} (x_j − x_i) ]². By using their best responses

    u(x_i, x^{(i)}) = x_i Σ_{j∈N_i} (x_j − x_i),                    (13.29)

they asymptotically reach consensus on the geometric mean of x(0). A graphical illustration of this is available in Fig. 13.5(b).
In the third simulation scenario, the UAVs are assigned a cost functional where for the penalty we have F(x_i, x^{(i)}) = [ x_i² Σ_{j∈N_i} (x_j − x_i) ]². The implementation of their best responses

    u(x_i, x^{(i)}) = −x_i² Σ_{j∈N_i} (x_j − x_i)                   (13.30)

leads them to asymptotically reach consensus on the harmonic mean of x(0). A sketch of the resulting dynamics is given in Fig. 13.5(c).
In the fourth simulation scenario, the UAVs are assigned cost functionals where the penalty is F(x_i, x^{(i)}) = [ (1/(2x_i)) Σ_{j∈N_i} (x_j − x_i) ]². The UAVs' best responses

    u(x_i, x^{(i)}) = (1/(2x_i)) Σ_{j∈N_i} (x_j − x_i)              (13.31)

lead them to asymptotically reach consensus on the mean of order 2 of x(0). This is sketched in Fig. 13.5(d).
Finally, Fig. 13.6 depicts a vertical alignment maneuver when the UAVs use the protocol

    u(x_i, x^{(i)}) = ( max_{i∈Γ} {x_i(0)} / (2x_i) ) Σ_{j∈N_i} (x_j − x_i).       (13.32)

The above protocol is obtained by scaling the protocol (13.31) by an upper bound of the initial heights, max_{i∈Γ} {x_i(0)}.
Figure 13.5. Longitudinal flight dynamics converging to (a) the arithmetic mean under protocol (13.28); (b) the geometric mean under protocol (13.29); (c) the harmonic mean under protocol (13.30); (d) the mean of order 2 under protocol (13.31). Reprinted with permission from Elsevier [30].
Figure 13.6. Vertical alignment to the mean of order 2 on the vertical plane. Reprinted with permission from Elsevier [30].
13.6 Notes and references

This chapter shows how to turn a consensus problem into a noncooperative differential game. Consensus is the result of a mechanism design in which a game designer imposes individual objective functions. The agents then asymptotically reach consensus as a side effect of the optimization of their own individual objectives. The results of this chapter are important, as they shed light on the game-theoretic nature of a consensus problem.
We refer the reader to a few classical references on consensus [128, 194, 193, 205, 206, 252].
Consensus arises in several application domains, such as autonomous formation flight
[94, 102], cooperative search of UAVs [46], swarms of autonomous vehicles or robots
[100, 161], and joint replenishment in multi-retailer inventory control [31, 32]. More
details on mechanism design, or inverse game theory, can be found in [196, Chap. 10]. For
more details on receding horizon control we refer the reader to [89] and [158].
Part of the material contained in this chapter is borrowed from [30]. We refer the
reader to the original work for further details on invariance properties of the consensus
value. In this chapter, the presentation of the topic has been tailored to emphasize the
game theory perspective on the problem. Additional explanatory material and figures
have been added to help the reader gain a better insight and physical interpretation of the
different concepts.