Data Analytics Platform For The Optimization of Waste Management Procedures
Data Analytics Platform For The Optimization of Waste Management Procedures
net/publication/333627447
CITATIONS READS
14 3,946
6 authors, including:
All content following this page was uploaded by Thanasis Vafeiadis on 06 March 2020.
Abstract—Today, the use of IoT devices, the interconnectivity support and optimization of planning to the waste management
of machines and systems, and the development of novel analytic company. The scalable design of the platform enables its
algorithms which are main characteristics of Industry 4.0 are incorporation by further waste managements companies. This
also transferred to other domains connected to the industrial
one. The advances on industrial domain have impact on waste paper describes this platform's functionalities by emphasizing
management one as these domains are strictly connected. The on data analytic tools and algorithms.
connection with sensors on industrial partners premises, the big The remainder of this paper is structured as follows. In
data availability and the significant advances on data analytics, Section II, related work regarding waste management is given
enable the waste management companies to smarten their domain while in Section III a brief description of the waste man-
and automate many of their solutions and processes. In this work,
we present a data analytics platform for the optimization of agement platform and the embedded tools (fill level analysis,
waste management procedures. By using this platform, a waste tonnage forecasting, price forecasting, state prediction engine,
management company is able to monitor and analyze sensors optimal routes and statistical analysis tools) is provided. Fi-
data from industrial partners bins and optimize its planning nally, in Section IV the conclusions are drawn.
based on different analytic tools and historical data. Besides
the platform description, special focus is given on data analytics II. L ITERATURE R EVIEW
algorithms and methods.
Index Terms—data analytics, Industry 4.0 time series, smart A growing body of literature has been examined towards
waste management, deep learning, IoT platform smart waste management and leading-edge solutions. As a first
consideration, waste management monitoring is the starting
I. I NTRODUCTION point of waste management initiation and optimization. A
Smart waste management is a very important procedure in monitoring system provides, among others, transparency of
Industry 4.0 and it can be a profitable asset of both waste waste spend, access to historical data and the capability of
producers and waste management providers. Based on reports evaluating and improving the waste management through an-
of World Bank1 the total volume of generated wastes is going alytics. In the work of [1] a smart solid waste monitoring and
to get doubled by 2025. Moreover, Smart Waste Management collection system is presented, able to monitor the solid waste
Market was estimated over 1 billion USD in 2016 and it is collection and control the transportation system of disposal.
projected to reach 4 billion USD by 2025 based on the same Moreover, the advent of IoT technology is definitely im-
resources. These numbers indicate the importance of the smart proving waste management operations. Sensor-based bins and
waste management procedures as they can reduce the overall dedicated waste management applications are forcefully re-
transport and collection cost by 50%. placing obsolete methods. A monitoring system deploying
Today the smart waste management procedures are based IoT technology is presented in [2]. It is a smart garbage
on IoT sensor devices and data analytics tools. Variant factors management system which also makes the garbage collection
such as the continuously decreasing cost of IoT sensors, more efficient by monitoring the dust bin level. Likewise,
the growth of fast machine-to-machine connections and the [3] is a study for the development of a recycle management
constantly increasing performance of data analytic tools have information platform based on Internet of Things technology.
enable the adoption of smart waste management solutions. A more thorough solution is the one proposed in [4]. With
This work presents a data analytics platform regarding op- the use Smart-M3 platform, real-time monitoring of the level
timization problems of a waste management company, which of bins fullness through sensors is achieved and the incoming
participates as a pilot partner of the COMPOSITION2 EU data are provided to decision support algorithms. Last but
project. The platform aims to provide smart solutions that foremost, if we direct the attention to waste management, the
will enable monitoring processes based on sensors, decision IBM Intelligent Waste Management Platform [5] cannot go
unmentioned. This platform is addressed to both municipality
1 http://www.worldbank.org and waste management companies and provides them with
2 https://www.composition-project.eu analytics to improve operational processes.
III. DATA A NALYTICS P LATFORM • MQTT protocol for data transmission.
A. Platform Overview The use of the aforementioned devices provides services for
different type of bins located on industrial plants through
The proposed data analytics platform offers a variety of
the platforms interfaces monitoring. The solution is based on
analytic services to end-users related to waste management
activities in order to enhance the decision-making and optimize
planning.
The platform was designed as a completely web-based
tool. It is developed in AngularJS3 framework, which takes
care of all the basic and content management features of
the application. Moreover, the MongoDB4 is used for the
tools storing requirements. The tool offers a wide variety
of advanced visualizations and it can be considered as a
Fig. 1. Bins Fill Level Monitoring.
visual analytics tool as well. The graphical representations are
enabled by the use of the Chart.js5 and D3.js6 visualization low power hardware offers over the air firmware update and
libraries. These libraries are ideal for producing dynamic, enable remote and real time monitoring. The monitoring is
interactive data visualizations in web browsers. In contrast a valuable asset for waste management companies as they
to many other, they allow great control over the final visual are continuously informed of their customers bins' fill level
results by using the widely implemented SVG7 , HTML58 and and they are able to act quickly, re-allocate resources(such as
CSS9 standard. From communication perspective both MQTT drivers and trucks) and re-organize their planning.
and HTTP communication protocols are supported. By using
these protocols the platform is able to connect to IoT devices C. Fill Level Analysis
and different analytic services. Both authentication and the Fill Level Analysis is a tool that utilizes a methodology
authorization services are supported by the platform for the called Slope Statistic Profile, denoted hereafter as SSP [6].
secure data exchange. Users log-in services are supported as SSP is a methodology that aims to estimate the change point
well. T from the profile of a linear trend test statistic, computed
The introduced platform supports different technologies as on consecutive overlapping time windows along the time
well as different types of features to the end-users. The main series. The test statistic for linear trend estimation is selected
platform features are: based on its high power compared to other test statistics for
• Monitoring of bins fill level based on IoT sensors. both correlated and white noise residuals. Modifications and
• Analysis of the bins fill level. extensions of SSP methodology has been widely used to deal
• Forecasting about the tonnage of wastes that is going to with anomaly detection problems in industrial sector. The real
be transported by a waste management company. time approach of SSP (Real Time - Slope Statistic Profile)
• Calculator for optimal pair of routes and tonnage. utilizes the information of two interdependent time series, real
• Price forecasting for various waste types/materials. time classification of their linear trend profiles signals and a
• Statistical analysis and visualization for better data ex- self-adaptive sliding window [7], [8], [9].
ploration and understanding of data.
All these main features are analyzed in the following sub-
section especially from data analytics and algorithms perspec-
tive.
D. Tonnage Forecasting
This algorithm helps us to forecast new observations based Fig. 3. Time Series Forecasting tool that utilizes moving average models.
on a time series. This algorithm uses smoothing methods. The
moving average, denoted hereafter as MA, algorithm is used
only on time series that does not have a trend. This method is actually very simple to find out. Given a transition matrix
is by far the easiest and it consists of making the arithmetic P , this can be determined by calculating the value of entry
mean of the last n observations contained by the time series to (i, j) of the matrix obtained by raising P to the power of
forecast the next observation. We use the following formula: M . For the time series of median tonnage per month for all
Pt customers of the waste management factory, we define two
xi
M At + 1 = i=t−n possible states: “Up” and “Down”, which means that a value
n of the time series at its current time can be followed by a
where t is the period of time (or the size of the time series) larger (“Up”) or a smaller (“Down”) value. We could also
and xi the actual time series. In some cases, we need to find the define the a third state when the current and the future value
optimal number n of observations to be used in the forecast. are the same (Steady), but the form of the time series does not
We can find it by checking the square error mean of multiple provide any clues for the existence of this state. Thus, based
n observations. Basically, the range for forecasting elements on the assumptions described above, the transition matrix P
is to start at two observations up to half of the data set size for the median tonnage per month time series is calculated
plus one. The moving average model has been applied on the and given below:
time series of tonnage weights per month for n = 2, 3, 4, 5,
0.222 0.778
which means that the predictions based on moving average P =
0.857 0.143
model will take place from 2 to 5 months ahead (see Fig. 3).
From the transition matrix P one can see that the probability
E. State Prediction Engine of being at state “Up” and the next state to be the same is 0.222
Markov chains are a fairly and simple common way to and the probability of being at state “Up” and the next state
statistically model random processes. Markov Chains are con- to be “Down” is 0.778. Moreover, the probability of being
ceptually quite intuitive, and are very accessible in that they at state “Down” and the next state to be “Up” is 0.857 and
can be implemented without the use of any advanced statistical the probability of being at state “Down” and the next state to
or mathematical concepts. Formally, a Markov chain model be “Down” is 0.143. Based on transition matrix P it is easy
is a probabilistic automaton. The probability distribution of to calculate the probabilities of futures states over M steps.
state transitions is typically represented as the Markov chains For example, the probability of being at state “Up” and after
transition matrix P . If the Markov chain has N possible one month to be at state “Down” the probability is 0.777 (see
states, the matrix will be an N xN matrix, such that entry Fig. 4).
(i, j) is the probability of transitioning from state i to state j.
Additionally, the transition matrix must be a stochastic matrix, F. Price Forecasting
a matrix whose entries in each row must add up to exactly one. A Deep Learning algorithm is implemented at this point,
This makes complete sense, since each row represents its own able to derive the latest prediction on the price per ton
probability distribution. at which users are likely to accept to buy/sell scrap metal
Markov chain models can be used to determine the proba- within a fixed timeframe in the future. The Price Forecasting
bility of moving either from state i to state j for a single step is a prediction model based on historical price data. The
or from state i to state j over M steps. As it turns out, this initial dataset consists of aggregated sales statistics spanning
prices pattern with the help of continuous learning. Hence,
we come to the conclusion that if the circumstances in which
the network has been trained are the same with the ones used
to predict or to apply continuous learning, the network will
converge. The Price Forecasting of the presented platform is
shown in Fig. 5. A price forecasting analysis is presented for
the future including the prediction values and the coefficient
of determination [10] expressed as accuracy rate in the same
diagram.
Fig. 4. State Prediction Engine tool that utilizes Markov Chain approach.