A Two-Level Scheme For Efficient Data Gathering in Mobile-Sink Wireless Sensor
A Two-Level Scheme For Efficient Data Gathering in Mobile-Sink Wireless Sensor
Research Article
Abstract: This study addresses the problems of energy and delay in wireless sensor networks equipped with mobile sinks. The
authors jointly consider the compressive sensing (CS) theory, cluster-based routing, and sink mobility to propose a data
collection method named ‘weighted data aggregation trees with optimal mobile sink(s) (WDAT-OMS)’. The proposed scheme
relies on a two-level architecture in which sensors are clustered at the first level. WDAT-OMS uses the CS theory along with
load-balanced data aggregation trees to route packets from sensors to the corresponding cluster heads (CHs). In this regard,
they present an efficient metric named ‘energy-and distance-aware CH selection’ to fairly distribute the energy consumption
among different sensors. At the second level, one or more sinks traverse the network to collect the aggregated data of CHs. As
an advantage, WDAT-OMS not only balances the energy consumption among different sensors but also increases the network
scalability. Numerical results demonstrate that the proposed algorithm reduces energy consumption in comparison with
‘centralised clustering algorithm’, ‘energy-aware CS-based data aggregation’, and ‘energy-balanced high-level data aggregation
tree ‘by 66%, 62%, and 63% for an average number of clusters, respectively. It also decreases the sink delay in comparison with
the ‘single-hop data-gathering problem’ by 10%.
Nomenclature concentration in the air [1, 2]. Energy consumption is one of the
main challenges in WSNs, which affects the network's lifetime.
B number of polling points Owing to the redundancy in WSNs, data of different sensors should
b a constant parameter be compressed and aggregated [1, 3]. In this regard, the
C number of clusters compressive sensing (CS) theory is an efficient solution used to
CH set of cluster heads (CHs) reconstruct the sparse signals at a rate less than the Nyquist rate
CM( j) members of jth cluster [4]. It combines the sampling and compression processes in one
d(i, j) Euclidean distance between sensors i and j step to minimise the number of samples of signals. This
E0 initial energy of sensors significantly reduces the amount of data to send, and subsequently,
ERx‐elec electrical energy required to run the circuitry in decreases energy consumption.
receiver On the other hand, traffic distribution in WSNs is naturally
ETx‐elec electrical energy required to run the circuitry in unbalanced. The sensors near the sink act as relay nodes and
transmitter receive/transmit more data than the other sensors. As a result, these
Er(i) residual energy of sensor i sensors quickly lose their energy. This is identified as ‘energy hole
ERx(L) energy consumption for receiving an L-bit packet problem’, which reduces the performance of the network.
ETx(L, d) energy consumption for transmitting an L-bit packet Therefore, an efficient routing protocol should be considered to
over distance d improve the load-balancing in the network. In this regard, cluster-
L packet length based routing protocols divide the network into several clusters to
M number of compressive sensing measurements locally balance the traffics among different sensors [5]. Over the
N number of sensors in the network last decade, several methods have been proposed for cluster head
NB(CH j) set of neighbour polling points of the jth CH (CH) selection, which generally use the residual energy and
NP(i) nearest neighbour polling point located at the distance factors [6–10]. However, unlike the traditional methods,
transmission range of sensor i we propose a new CH selection strategy, which not only uses both
P set of polling points distance and residual energy metrics but also considers the sink
Pr(i, r) punishment factor of sensor i at round r mobility and a punishment factor to more efficiently control the
R transmission range of sensors energy consumption in the network.
RSP reduced set of polling points In traditional WSNs, a multi-hop routing is used to deliver the
S number of mobile sinks data of sensors to the sink node. This increases the number of
V speed of mobile sinks transmissions and consequently, intensifies the energy consumption
ρ compressive ratio in the network. On the other hand, two-hop routing protocols, e.g.
ϵamp energy consumption of the transmission amplifier low-energy adaptive clustering hierarchy (LEACH), have also been
proven to be inefficient in comparison with the modern schemes. In
1 Introduction recent years, a number of research studies have been conducted to
more efficiently collect data in WSNs, including the use of
Wireless sensor networks (WSNs) consist of small sensors with multiple constant sinks and mobile sinks [11, 12]. In particular, the
sensing, limited computing, and wireless communication abilities. use of a mobile sink for data collection introduces a trade-off
These nodes sense the environment and send the collected data to between energy consumption and delay. Although the mobile sink
the sink. WSNs have many applications in environmental is not suitable for some real-time applications in terms of delay, it
monitoring including the information collection of soil humidity, is an efficient technique to reduce the energy consumption in non-
monitoring the coastline changes, and checking the gas real time applications, which generally collect data of an
N 1/ p
where ∥ X ∥ p = ∑i = 1 xi p represents the lp-norm of vector
X . The existence of RIP for the random matrices, e.g. Gaussian
matrix with uniformly and independently distributed elements and
Bernoulli matrix with ±1 elements, has been proved [20]. A k-
sparse signal can be reconstructed from only M measurements by
utilising the following l1 optimisation expression [43]:
Fig. 1 Typical example for the applied two-level architecture consisted of ^
the first-level data aggregation trees and sink tour Θ = arg min ∥ Θ ∥ 1 subject to Y = Φ Ψ Θ, (4)
N ^
assumed that the duration of a round is such that the mobile sink where ∥ Θ ∥ 1 = ∑i = 1 ∣ θi ∣ and X = Ψ Θ.
has the opportunity to traverse the network and collects the It should be noted that the practical performance of the CS
buffered data of CHs. However, we aim to minimise this time. theory depends on the sparseness of signals and reconstruction
In general, the energy consumed by sensors is influenced by algorithms. Certainly, increasing the number of CS measurements
three factors, including data sensing, data processing, and wireless leads to a better quality of data recovery.
communication. However, the first two factors are negligible
compared to the energy consumption for wireless communication
[3, 20]. We use the first-order radio model, in which the transmitter
4 Weighted data aggregation trees with optimal
dissipates the energy to run the radio electronics and the power mobile sink(s) (WDAT-OMS)
amplifier, and the receiver consumes a different amount of energy In this section, we first present the problem description and then,
to run the radio electronics. According to this model, the energy introduce the WDAT-OMS algorithm in detail.
consumption for transmitting an L-bit packet over distance d is
modelled as follows [5, 20]: 4.1 Problem description
ETx(L, d) = ETx‐elec(L) + ETx‐amp(L, d) Given the explained model in Section 3, the goal is to reduce the
(1) energy consumption of sensors and tour delay of the mobile sink in
= L × ETx‐elec + L × ϵamp × d 2,
WSNs. For this purpose, we use a two-level hybrid architecture in
which the first level organises sensors into C clusters. Each sensor
where ETx‐elec is the electrical energy required to run the circuitry in
transmits its data to the corresponding CH via a weighted data
the transmitter and ϵamp shows the energy consumption of the aggregation tree. At the second level, one (or multiple) sink(s)
transmission amplifier. The electrical energy factor in the traverses the network to collect the aggregated data from CHs. The
transmitter, i.e. ETx‐elec, depends on different factors such as digital main issues to be discussed include
coding, modulation, and spreading of the signal, whereas the term
of amplifier energy, i.e. ϵampd 2, uses the square of the distance • Employing an efficient method for clustering and CH selection to
between transmitter and receiver to consider the acceptable bit fairly distribute the energy consumption among different sensors.
error rate. • Developing a load-balanced data aggregation routing tree in each
In addition, the energy for receiving an L-bit packet can be cluster that connects the sensors to the corresponding CH.
computed as follows [5, 20]: • Using the hybrid CS theory along with the first-level routing
process.
ERx(L) = L × ERx‐elec, (2) • Determining an appropriate path for the sink tour to jointly
reduce both energy consumption of sensors and sink delay in the
where ERx‐elec is the electrical energy required to run the circuitry in network.
the receiver. Following the other references [5, 20, 41], we ignore
the energy consumption of baseband signal processing blocks such 4.2 Algorithm description
as source coding and pulse shaping. In fact, these factors are quite
small compared to the energy consumption of the radio frequency In this subsection, the WDAT-OMS algorithm is proposed to
circuitry. jointly address the problems of CS theory, load-balanced cluster-
Nomenclature summarises the notations that will be used in this based routing, and sink mobility in WSNs. Unlike sensors that
paper. have a simple structure, the mobile sink can be a mobile robot or a
vehicle equipped with a rechargeable battery, powerful central
processing unit (CPU), large memory, and global positioning
3.2 CS theory
system (GPS) device. Thus, in this work, the decision-making
CS theory presents a novel technique to efficiently recover a sparse centre of the network is the sink node. In fact, the basic
signal from its CS measurements at a rate less than the Nyquist rate calculations, including the CH selection, optimal tour etc., are
[17]. Consider a data vector collected by the sensors, i.e. made in the sink node.
X = [x1, x2, …, xN ]T ∈ RN . [ ⋅ ]T indicates the transpose operator. Similar to the other protocols, some preliminary signalling is
This vector has a representation as X = Ψ Θ in the proper basis of required at the initial deployment of sensors and before monitoring
the environment. For example, in the beginning, the sink node gets
Ψ = [ψ i j] ∈ RN × N , in which Θ = [θ1, θ2, …, θN ]T is the coefficient the location of sensors by one of the conventional localisation
vector of X in the basis of Ψ. X is a k-sparse vector, if it has at most services [44]. Given the grid layout of polling points, it is also well
k non-zero elements (k ≪ N) in Θ. According to the CS theory, a k- aware of their position. Owing to the static nature of sensors and
where CH( j) represents the selected CH for cluster j, Er(i) denotes Certainly, by reducing the energy of sensors at different rounds,
the residual energy of sensor i, and NP(i) is the nearest polling different routing trees are created in each cluster. At each round,
point located at the transmission range of sensor i. Also, d i, NP(i) considering the new CHs and current residual energy of sensors,
shows the Euclidean distance between sensor i and NP(i). In fact, WDAT-OMS provides the routing trees in such a way that not only
among the members of cluster j, i.e. CM( j), a sensor with more balances the energy among different sensors but also decreases the
EDCS is selected as the next CH. energy consumption in the network.
In EDCS, the residual energy factor causes the high-energy In this way, each sensor transmits its data to the corresponding
sensors to have more chance to be selected as CH. This leads the parent sensor. Intermediate sensors aggregate the received data by
sensors to lose their energy fairly, and subsequently, the network their own sensed data and hierarchically send the result towards the
lifetime will be increased. Unlike the works, which use the inter- sink node. For this purpose, WDAT-OMS uses the hybrid CS
Fig. 3 A typical example for sink tour considering 20 CHs and 36 polling points (6 × 6) formed by
(a) WDAT-OMS algorithm, (b) SHDGP algorithm [16]
Table 1 List of selected polling points and corresponding sensors. Once the mobile sink reaches a selected polling point, it
CHs in Fig. 4b assuming 20 CHs, 36 polling points, and two stops and sends a beep to wake up the transceiver of the
mobile sinks neighbouring CHs. It collects the buffered data and control
Selected polling Corresponding Corresponding information from CHs, and at the same point calculates the new
point sink CHs configuration of the corresponding clusters for the next round.
Subsequently, the mobile sink locally returns the result to the
11 1 8, 18
corresponding CHs. Each CH notifies its sensors of the new
5 1 1, 16 configuration through the existing first-level routing tree. Finally,
10 1 2, 11 the mobile sink moves to the next points to complete its trajectory.
8 1 4, 13 At the beginning of the next round, it determines its new optimal
16 1 7 trajectory based on the presented scheme and starts to aggregate
18 1 3, 19 data of new CHs.
23 2 15
21 2 12 5 Simulation results
19 2 10 In this section, several scenarios are presented to compare the
26 2 5 performance of the proposed algorithm with ‘centralised clustering
34 2 14 algorithm (CCA)’ [14], ‘ECDA with constant sink in centre’ [13],
29 2 6, 9, 17, 20 ‘EHDT with constant sink in (0,0)’ [13], and SHDGP [16]. With
the aim of a fair comparison, the CCA algorithm is modified to be
run round by round. Furthermore, to evaluate the described strategy
for providing the first-level data aggregation trees, we use the hop-
distance factor is required for determining CHs. This is because all
count metric once again for constructing these trees. In the
sensors have approximately the same initial energy and their
following, this is referred to as ‘data aggregation trees with optimal
punishment factor is also equal to one. At the next rounds, CHs are
mobile sink(s) (DAT-OMS)’ to be distinct from WDAT-OMS. In
selected based on the aggregated control information at the
fact, in DAT-OMS, each sensor transmits its data to the
previous round. In fact, at round r, CHs are selected based on the
corresponding CH through the pure SPT and by utilising the hybrid
aggregated control information at round r − 1. For this purpose,
CS method. It should be noted that the first and third phases of
during the data gathering process at round r − 1, the information of
DAT-OMS are completely similar to the WDAT-OMS algorithm.
all sensors is also collected locally. This information includes the
The performance of the algorithms is evaluated in terms of
residual energy of each sensor and the round numbers in which that
energy consumption, energy consumption variance, and tour delay.
sensor is selected as CH. Certainly, the mobile sink is well aware
Both energy consumption and energy consumption variance are
of the location of sensors and polling points by the described
calculated after the 10th round. From the perspective of statistics
preliminary signalling at the initial deployment of sensors. Each
theory, the variance is a suitable metric that shows how far a set of
sensor knows the length of packets it must send to its parent. It is
numbers are spread out from their average value. In fact, a
also well aware of its distance to its parent. Thus, each sensor can
easily calculate how much energy it will have at the end of the reduction in the energy consumption variance (σE2 ) demonstrates a
current round. The number of times a sensor is selected as CH is better load-balancing in the network. This metric is defined as
also used to calculate its punishment factor. Accordingly, each follows:
sensor transmits its information along with the sensed data to the
N
corresponding CH via the first-level routing tree at round r − 1. 1
N i∑
2
σE2 = E0 − Er(i) − Ec , (18)
This information is very small compared to the sensed data. =1
However, we consider a fractional of each packet for this control
information. The current CH of each cluster buffers this where E0 and Ec are the initial energy and average energy
information and sends it along with the aggregated data to the
consumption of sensors, respectively. Also, since the speed of sink
mobile sink.
(V ) is assumed to be 1 m/s, tour delay is calculated as the sum of
As mentioned before, the clusters are constant during the
network activity and only the role of CH is rotated between the
Fig. 13 Delay of sink as a function of parameter α for 400 sensors Fig. 15 Time complexity as a function of the number of clusters for 100
(N = 400) and ten clusters (C = 10) in scenario 4 sensors (N = 100) and 300 sensors (N = 300) in scenario 6
first-order radio model, energy consumption is reduced (Fig. 12). Scenario 6: In the last scenario, we study the time complexity
This is associated with an increase in the sink delay (Fig. 13). For of DAT-OMS and WDAT-OMS for a different number of clusters.
α = 0.5, the most energy consumption and the least delay is The results are presented in Fig. 15 for two cases of N = 100 and
observed. N = 300. All measurements are made on a computer equipped with
On the other hand, by increasing the number of mobile sinks, an Intel core i7 2.40 GHz processor and 6 GB of memory. As the
the number of partitions in the network is increased. This divides a number of clusters increases, the time complexity of both
single tour into multiple parallel sub-tours. Since all sinks algorithms intensifies. Increasing the number of clusters leads to an
simultaneously traverse the corresponding area, the total delay is increase in the number of CHs. This aggrandises the dimensions of
equal to the maximum delay of sub-tours. Thus, the sink delay of the adopted ILP optimisation model in the third phase of the
‘DAT-OMS with S = 4’ and ‘WDAT-OMS with S = 4’ are less algorithms.
than the other cases. It is worth noting that both DAT-OMS and Unlike DAT-OMS, which uses the hop-count metric for
WDAT-OMS have the same method for CH selection and sink constructing the first-level data aggregation trees, WDAT-OMS
trajectory formation. This causes both algorithms to have the same adopts the metric introduced in (8) to efficiently weight different
sink delay (Fig. 13). links of the network. Although a weighted data aggregation tree
Scenario 5: In this scenario, we compare the tour delay in the provides better load-balancing in the network, it increases the time
proposed algorithms and SHDGP [16] for a different number of complexity. Thus, the WDAT-OMS algorithm has more time
clusters and a different number of sinks. With the aim of a fair complexity than the DAT-OMS algorithm. Furthermore, as the
comparison, SHDGP is used at the second level of the network to number of sensors increases, the time required for calculating the
directly collect the aggregated data of CHs. The results are shown weighted graph and constructing the first-level routing trees is also
in Fig. 14. In this figure, due to the similar delay of both DAT- increased. Accordingly, as shown in Fig. 15, the time complexity
OMS and WDAT-OMS algorithms, we show them as ‘DAT/ of the algorithms for N = 300 is more than N = 100.
WDAT-OMS’. The proposed algorithms use an improved objective
function, which not only aims to reduce the length of the sink tour,
but also minimises the number of selected polling points. Thus, in
6 Conclusion
the same conditions, i.e. using a single sink, DAT/WDAT-OMS has In this study, we proposed an efficient algorithm named WDAT-
less delay than the SHDGP algorithm. Furthermore, as the number OMS to jointly address the problems of CS theory, cluster-based
of clusters increases, the number of CHs also increases. This leads routing, and sink mobility in WSNs. At the first level, WDAT-
to a longer trajectory for the sink. As a result, its delay increases. OMS organises the sensors into several clusters. In each cluster, a
On the other hand, as justified before, increasing the number of weighted routing tree is used which results in fair energy
sinks significantly reduces the delay for a different number of consumption by different sensors. Thus, the energy consumption
clusters. variance is reduced. At the second level, one or more mobile sinks
traverse the network to collect the aggregated data of CHs through