Distributed Algorithms For Stochastic Source Seeking With Mobile Robot Networks
Distributed Algorithms For Stochastic Source Seeking With Mobile Robot Networks
Atanasov1
Distributed Algorithms
Department of Electrical and
Systems Engineering,
University of Pennsylvania,
for Stochastic Source Seeking
Philadelphia, PA 19104
e-mail: [email protected]
With Mobile Robot Networks
Jerome Le Ny Autonomous robot networks are an effective tool for monitoring large-scale environmen-
Department of Electrical Engineering and GERAD, tal fields. This paper proposes distributed control strategies for localizing the source of a
Ecole Polytechnique de Montreal, noisy signal, which could represent a physical quantity of interest such as magnetic force,
Montreal, QC H3T-1J4, Canada heat, radio signal, or chemical concentration. We develop algorithms specific to two sce-
e-mail: [email protected] narios: one in which the sensors have a precise model of the signal formation process
and one in which a signal model is not available. In the model-free scenario, a team of
George J. Pappas sensors is used to follow a stochastic gradient of the signal field. Our approach is distrib-
Department of Electrical and uted, robust to deformations in the group geometry, does not necessitate global localiza-
Systems Engineering, tion, and is guaranteed to lead the sensors to a neighborhood of a local maximum of the
University of Pennsylvania, field. In the model-based scenario, the sensors follow a stochastic gradient of the mutual
Philadelphia, PA 19104 information (MI) between their expected measurements and the expected source location
e-mail: [email protected] in a distributed manner. The performance is demonstrated in simulation using a robot
sensor network to localize the source of a wireless radio signal.
[DOI: 10.1115/1.4027892]
Journal of Dynamic Systems, Measurement, and Control MARCH 2015, Vol. 137 / 031004-1
C 2015 by ASME
Copyright V
Journal of Dynamic Systems, Measurement, and Control MARCH 2015, Vol. 137 / 031004-3
Journal of Dynamic Systems, Measurement, and Control MARCH 2015, Vol. 137 / 031004-5
1 X k
xi;tþ1 ¼ xi;t þ ct pt ðzfig[V i ;t ; xfig[V i ;t Þ (22)
xit ðkÞÞÞ
g^i;t ðkÞ :¼ coli ðWð^ zi;t ðsÞ (20)
k þ 1 s¼0
This update is still not completely distributed as it requires knowl-
In order to obtain an approximation to g(mt, y) as in Eq. (19) in a edge of xV i ;t and the pdf pt.8 We propose to distribute the computa-
distributed manner, we use a high-pass dynamic consensus filter tion of pt via the distributed particle filter (Algorithm 1). Then,
[37] to have the sensors agree on the value of the sum each sensor maintains its own estimate of the source pdf, pi,t, rep-
! resented by a particle set fwm m
i;t ; yi;t g. Given a new measurement,
1X n
zi,tþ1, sensor i averages its prior, pi,t, with the priors of its neigh-
g^t ðkÞ :¼ n g^i;t ðkÞ
n i¼1 bors and updates it using Bayes rule. Finally, to obtain xV i ;t we use
a flooding algorithm (Algorithm 2). The convergence analysis of
Each node maintains a state qi,k, receives an input lik, and pro- the gradient ascent scheme in the distributed case (22) remains the
vides an output rik with the following dynamics: same as in Sec. 4 because each sensor i computes the complete
X X MI gradient. This is possible because due to Theorem 5 the states
qi;kþ1 ¼ qi;k þ b ðqj;k qi;k Þ þ b ðlj;k li;k Þ and measurements of distant sensors are not needed, while Algo-
j2Ni j2Ni (21) rithm 2 provides the information from the nearby sensors.
ri;k ¼ qi;k þ li;k
Algorithm 2: State exchange algorithm at sensor i
where b > 0 is a step-size. For a connected network, P Ref. [37, 1: Input: Communication radius rc, sensing radius rs, state xi
Theorem 1] guarantees that ri,k converges to 1=n i li;k as k 2: Output: Array ai with ai[j] ¼ xj if j 2 V i [ fig and
! 1. The following result can be shown by letting li;k :¼ g^i;t ðkÞ ai[j] ¼ empty else
and is proved in the Appendix.
THEOREM 4. Suppose that the communication graph G is 3: ai ½i xi ; ai ½j empty; j 6¼ i ä Holds the required sen-
strongly connected. If the sensor nodes estimate their states x* from sor states
the relative measurements (18) using algorithm (16), compute the 4: b minfceilð2rs =rc Þ; ng ä Number of rounds needed
FD weights (7) using the state estimates, and run the dynamic con- 5: for k ¼ 1…b do
sensus filter (21) with input li;k :¼ g^i;t ðkÞ, which was defined in 6: Send ai to neighbors Ni , receive {aj} from j 2 Ni
Eq. (20), then the output ri,k of the consensus filter satisfies 7: for j 2 Ni do
8: for l ¼ 1…n do
n lim E½ri;k ¼ gðm ; yÞ þ b; 8i 2 f1; …; ng 9: if ðai ½l¼ emptyÞ&&ðaj ½l6¼ emptyÞ then ai ½l aj ½l
k!1
P 6 Applications
where g(m*, y) is the true signal gradient at m :¼ ni¼1 xi =n and
b is the error in the FD approximation (5). The performance of the source-seeking algorithms is demon-
After this procedure, the agents agree on a centroid for the for- strated in simulation using a team of ten sensors to localize the
mation and a gradient estimate, which can be used to compute the source of a wireless radio signal. A radio signal is suitable for
next formation centroid according to Eq. (9). Since the FD comparing the two algorithms because it is very noisy and diffi-
weights are recomputed at every t, the formation need not be cult to model and yet most approaches for wireless source seeking
maintained accurately. This allows the sensors to avoid obstacles are model-based. We begin by modeling the received signal
and takes care of the motion uncertainty. strength (RSS), which is needed for the model-based algorithm.
5.3 Distributed Model-Based Algorithm. In this section, we 6.1 RSS Model. Let the positions of a wireless source and re-
aim to distribute the model-based source-seeking algorithm (13). ceiver in 2D be y and x, respectively. The RSS (dBm) at x is mod-
We assume that sensors which are sufficiently far from each other eled as
receive independent information. This is justified because when
the sensing footprints of two sensors do not overlap, their sensed Prx ðx; yÞ ¼ Ptx þ Gtx Ltx þ Grx Lrx
signals (if any) will not be coming from the same source. As a
result, computing the MI gradient in Eq. (12) with respect to xi is Lfs ðx; yÞ Lm ðx; yÞ Rðx; yÞ
decoupled from the states of the distant sensors.
THEOREM 5. Let V i denote the set of sensors (excluding i) whose where Ptx is the transmitter output power (18 dBm in our experi-
sensing footprints overlap with that of sensor i. Let V i denote the ments), Gtx is the transmitter antenna gain (1.5 dBi), Ltx is the
rest of the sensors. Suppose that sensor i’s measurements, zi, are transmitter loss (0 dB), Grx is the receiver antenna gain (1.5 dBi),
independent (not conditionally on y, as before) of the measure- Lrx is the receiver loss (0 dB), Lfs is the free space loss (dB), Lm is
ments, zVi , obtained by the sensors Vi , i.e., pt ðzi ; zVi jxi ; xVi Þ the multipath loss (dB), and R is the noise. The free space loss is
¼ pt ðzi jxi Þpt ðzVi jxVi Þ. Then, modeled as
@ @ Lfs ðx; yÞ ¼ 27:55 þ 20 log10 ðÞ þ 20 log10 ðkx yk2 Þ
Iðy; zi ; zV i ; zV i jxi ; xV i ; xV i Þ ¼ Iðy; zi ; zV i jxi ; xV i Þ
@xi @xi
where is the frequency (2400 MHz). The model from Ref. [38]
Proof. By the chain rule of MI and then the independence of zi
is used for the multipath loss
and zV i
(
Iðy; zi ; zV i ; zV i jxi ; xV i ; xV i Þ a þ bkðx; yÞ; if kðx; yÞ > 0
Lm ðx; yÞ ¼
0; else
¼ Iðy; zi ; zV i jxi ; xV i Þ þ Iðy; zV i jzi ; zV i ; xi ; xV i ; xV i Þ
Fig. 3 The paths followed by the sensors after 30 iterations of the model-based source-seeking algorithm in an environment
without obstacles (left) and with obstacles (right). The white circles indicate sensor 1’s estimates of the source position over
time. The plots show the average error of the source position estimates and its standard deviation averaged over 50 inde-
pendent repetitions. The evolution of sensor 1’s distributed particle filter is shown in each scenario (bottom row).
Journal of Dynamic Systems, Measurement, and Control MARCH 2015, Vol. 137 / 031004-7
dius was 10 m, while the sensing radius was infinite. The sensors 2d2 KiU
maintained distributed particle filters with 4000 particles and used j¼1
Journal of Dynamic Systems, Measurement, and Control MARCH 2015, Vol. 137 / 031004-9