100% found this document useful (1 vote)
619 views

IOT Unit 4 Data and Analytics For IoT by Dr.M.K.Jayanthi Kannan

This document provides an overview of unit 4 on data and analytics for IoT. It discusses the vast amounts of data generated by IoT systems and how traditional data management is unprepared for big data. A new approach to IoT data analytics is needed involving machine learning, big data tools, edge streaming analytics, and network analytics. It also covers structured vs unstructured data, semi-structured data, and data in motion vs data at rest in IoT networks.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
100% found this document useful (1 vote)
619 views

IOT Unit 4 Data and Analytics For IoT by Dr.M.K.Jayanthi Kannan

This document provides an overview of unit 4 on data and analytics for IoT. It discusses the vast amounts of data generated by IoT systems and how traditional data management is unprepared for big data. A new approach to IoT data analytics is needed involving machine learning, big data tools, edge streaming analytics, and network analytics. It also covers structured vs unstructured data, semi-structured data, and data in motion vs data at rest in IoT networks.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 41

Internet of Things ( Elective -2 )

Sub Code: 17CSDE663

Unit 4 : Data and Analytics


for IoT
Course Specification

By
Staff Room: 324- 8. Dr. M.K. Jayanthi Kannan, M.E.,MS.,MBA., M.Phil., Ph.D.,
Office Hours : 8.30 AM -4 PM Professor,
Department of Computer Science Faculty of Engineering & Technology,
and Engineering, JAIN Deemed To-Be University,
FET Block, Bengaluru.

Internet of Things ( Elective 2) by Dr.M.K.Jayanthi Kannan 2


Internet of Things
Unit 4 : Data and Analytics for IoT

 4.1 Data and Analytics for IoT,


 4.2 An Introduction to Data Analytics for IoT,
 4.3 Machine Learning,
 4.4 Big Data Analytics Tools and Technology,
 4.5 Edge Streaming Analytics,
 4.6 Network Analytics,
 4.7 Securing IoT,

Internet of Things(Elective 2) Prepared by Dr.M.K.Jayanthi Kannan 3


IoT data analytics

• IoT systems are producing vast volumes of data—far more


than has ever been available in the past.
• The business value of IoT is not just in the ability to connect
devices but comes from understanding the data these devices
create.
• A new form of data management has therefore emerged: IoT
data analytics.
• Traditionally data management was performed by relational
databases, which cared for well-structured data in tables
where the relationships between tables and data structures
were well understood and could be easily accessed through.

Internet of Things(Elective 2) Prepared by


4
Dr.M.K.Jayanthi Kannan
4.1 Data and Analytics for IoT
• IOT has more devices and many more devices are added to
IoT networks, the data generated by these systems
becomes overwhelming.
• Not only does this data begin to consume precious network
bandwidth but server resources are increasingly taxed in
their attempt to process, sort, and analyze the data.
• Traditional data management systems are simply
unprepared for the demands of what has come to be
known as “big data.”
• The real value of IoT is not just in connecting things but
rather in the data produced by those things, the new
services you can enable via those connected things, and the
business insights that the data can reveal.
Internet of Things(Elective 2) Prepared by
5
Dr.M.K.Jayanthi Kannan
IOT Data Analytics
• To use the data from IOT, it should be be handled in a
way that is organized and controlled.
• Thus, a new approach to data analytics is needed for the
Internet of Things.
• This Unit provides an overview of the field of data
analytics from an IoT perspective, including the following
sections:
• An Introduction to Data Analytics for IoT: It discusses
the analytics for IoT and discusses the differences
between structured and unstructured data. It also
discusses how analytics relates to IoT data.

Internet of Things(Elective 2) Prepared by Dr.M.K.Jayanthi Kannan 6


IOT Data Analytics
• Machine Learning: Once you have the data, what do you
do with it, and how can you gain business insights from
it? This section delves into the major types of machine
learning that are used to gain business insights from IoT
data.
• Big Data Analytics Tools and Technology: Big data is one
of the most commonly used terms in the world of IoT.
This section examines some of the most common
technologies used in big data today, including Hadoop,
NoSQL, MapReduce, and MPP.

Internet of Things(Elective 2) Prepared by


7
Dr.M.K.Jayanthi Kannan
• Edge Streaming Analytics: IoT requires that data be
processed and analyzed as close to the endpoint as
possible, in real-time. This section explores how
streaming analytics can be used for such processing and
analysis.
• Network Analytics: The final section of this chapter
investigates the concept of network flow analytics using
Flexible NetFlow in IoT systems. NetFlow can help you
better understand the function of the overall system and
heighten security in an IoT network.

Internet of Things(Elective 2) Prepared by


8
Dr.M.K.Jayanthi Kannan
An Introduction to Data Analytics for IoT
• In the world of IoT, the creation of massive amounts of
data from sensors is common and one of the biggest
challenges—not only from a transport perspective but
also from a data management standpoint.
• A great example of the deluge of data that can be
generated by IoT is found in the commercial aviation
industry and the sensors that are deployed throughout
an aircraft.
• Aircraft today have thousands of other sensors
connected to the airframe and other systems. In fact, a
single wing of a modern jumbo jet is equipped with
10,000 sensors.
Internet of Things(Elective 2) Prepared by
9
Dr.M.K.Jayanthi Kannan
An Introduction to Data Analytics for IoT

• Modern jet engines are fitted with thousands of sensors


that generate a whopping 10GB of data per second.
• For example, modern jet engines, similar to the one
shown in Figure 7-1, may be equipped with around 5000
sensors.
• Therefore, a twin engine commercial aircraft with these
engines operating on average 8 hours a day will generate
over 500 TB of data daily, and this is just the data from
the engines.

Internet of Things(Elective 2) Prepared by


10
Dr.M.K.Jayanthi Kannan
Internet of Things(Elective 2) Prepared by
11
Dr.M.K.Jayanthi Kannan
Structured Versus Unstructured Data
• Structured data and unstructured data are important
classifications as they typically require different toolsets
from a data analytics perspective.
• Structured data means that the data follows a model or
schema that defines how the data is represented or
organized, meaning it fits well with a traditional
relational database management system (RDBMS).
• Example:
 spreadsheet where data occupies a specific cell and can be
explicitly defined and referenced.
 Banking transaction and invoices to computer log files and
router configurations.
 IoT sensor data often uses structured values, such as
temperature, pressure,
Internet ofhumidity,
Things(Elective 2) Prep
12
Dr.M.K.Jayanthi Kannan
• Because of the highly organizational format of structured data, a
wide array of data analytics tools are readily available for
processing this type of data.
• From custom scripts to commercial software like Microsoft Excel
and Tableau, most people are familiar and comfortable with
working with structured data
• Unstructured data lacks a logical schema for understanding and
decoding the data through traditional programming means.
• Examples of this data type include text, not fit neatly into a
predefined data model is classified as unstructured data.
• Around 80% of a business’s data is unstructured.
• Because of this fact, data analytics methods that can be
applied to unstructured data, such as cognitive computing and
machine learning, are deservedly garnering a lot of attention.

13
Internet of Things(Elective 2) Prepared by Dr.M.K.Jayanthi Kannan
Internet of Things(Elective 2) Prepared by
14
Dr.M.K.Jayanthi Kannan
Semi-Structured Data
• A third data classification, semi-structured data, is sometimes
included along with structured and unstructured data.
• semi-structured data is a hybrid of structured and
unstructured data and shares characteristics of both.
• While not relational, semi-structured data contains a certain
schema and consistency.
• Email is a good example of semi-structured data as the fields
are well defined but the content contained in the body field
and attachments is unstructured.
• JavaScript Object Notation (JSON) and Extensible Markup
Language (XML), which are common data interchange formats
used on the web and in some IoT data exchanges.
Data in Motion Versus Data at Rest
• As in most networks, data in IoT networks is either in transit
(“data in motion”) or being held or stored (“data at rest”).
• Examples of data in motion include traditional client/server
exchanges, such as web browsing and file transfers, and email.
• Data saved to a hard drive, storage array, or USB drive is data
at rest.
• From an IoT perspective, the data from smart objects is
considered data in motion as it passes through the network
en route to its final destination.
• This is often processed at the edge, using fog computing.
When data is processed at the edge, it may be filtered and
deleted or forwarded on for further processing and possible
storage at a fog node or in the data center.
Internet of Things(Elective 2) Prepared by
16
Dr.M.K.Jayanthi Kannan
Data at rest

• Data at rest in IoT networks can be typically found in IoT


brokers or in some sort of storage array at the data center.
• Myriad tools, especially tools for structured data in relational
databases, are available from a data analytics perspective.
• The best known of these tools is Hadoop. Hadoop not only
helps with data processing but also data storage.

Internet of Things(Elective 2) Prepared by


17
Dr.M.K.Jayanthi Kannan
IoT Data Analytics Overview
• The true importance of IoT data from smart objects is realized
only when the Analysis of the data leads to actionable
business intelligence and insights.
• Data analysis is typically broken down by the types of results
that are produced.
• As shown in Figure 7-3, there are four types of data analysis
results:
1. Descriptive: Descriptive data analysis tells you what is
happening, either now or in the past. For example, a
thermometer in a truck engine reports temperature values
every second.

Internet of Things(Elective 2) Prepared by


18
Dr.M.K.Jayanthi Kannan
2. Diagnostic: When you are interested in the “why,” diagnostic
data analysis can provide the answer. Continuing with the
example of the temperature sensor in the truck engine, you
might wonder why the truck engine failed.
3. Predictive: Predictive analysis aims to foretell problems or
issues before they occur. For example, with historical values of
temperatures for the truck engine, predictive analysis could
provide an estimate on the remaining life of certain
components in the engine.
4. Prescriptive: Prescriptive analysis goes a step beyond
predictive and recommends solutions for upcoming
problems. A prescriptive analysis of the temperature data
from a truck engine might calculate various alternatives to
cost-effectively maintain our truck.

Internet of Things(Elective 2) Prepared by


19
Dr.M.K.Jayanthi Kannan
Internet of Things(Elective 2) Prepared by Dr.M.K.Jayanthi Kannan 20
Figure 7-4 Application of Value and Complexity Factors to the Types of
Data Analysis
Internet of Things(Elective 2) Prepared by Dr.M.K.Jayanthi Kannan 21
IoT Data Analytics Challenges
That traditional data analytics solutions were not always adequate to
process IOT
For example, traditional data analytics typically employs a standard RDBMS
and corresponding tools, but the world of IoT is much more demanding.
Scaling problems: Due to the large number of smart objects in most IoT
networks that continually send data, relational databases can grow
incredibly large very quickly. This can result in performance issues that
can be costly to resolve, often requiring more hardware and
architecture changes.
• Volatility of data: With relational databases, it is critical that the
schema be designed correctly from the beginning. Changing it later can
slow or stop the database from operating. IoT data, however, is volatile
in the sense that the data model is likely to change and evolve over
time. A dynamic schema is often required so that data model. changes
can be made daily or even hourly.

Internet of Things(Elective 2) Prepared by


22
Dr.M.K.Jayanthi Kannan
Machine Learning
• One of the core subjects in IoT is how to makes sense of the
data that is generated.
• Because muchspecialized tools and algorithms of this data can
appear incomprehensible to the naked eye, are needed to find
the data relationships that will lead to new business insights.
• This brings us to the subject of machine learning (ML).
• Machine learning, deep learning, neural networks, and
convolutional networks are words you have probably heard in
relation to big data and IoT.
• ML is indeed central to IoT.
• Data collected by smart objects needs to be analyzed, and
intelligent actions need to be taken based on these analyses.

Internet of Things(Elective 2) Prepared by Dr.M.K.Jayanthi Kannan 23


Machine Learning Overview
• Machine learning is, in fact, part of a larger set of
technologies commonly grouped under the term artificial
intelligence (AI).
• AI includes any technology that allows a computing system to
mimic human intelligence using any technique, from very
advanced logic to basic “if-then else” decision loops.
• Any computer that uses rules to make decisions belongs to
this realm.
• A simple example is an app that can help you find your
parked car. When you park and disconnect from the when the
disconnection happens. This is where your car is parked.
• A GPS reading of your position at regular intervals calculates
your speed.
Internet of Things(Elective 2) Prepared by
24
Dr.M.K.Jayanthi Kannan
Machine Learning Types
• ML is a vast field but can be simply divided in two main
categories: supervised and unsupervised learning.
• The program is configured to recognize the audio pattern of
each word in a dictionary, but it does not know your voice’s
specifics—your accent, tone, speed, and so on.
• You need to record a set of predetermined sentences to help
the tool match well-known words to the sounds you make
when you say the words.
• This process is called machine learning. ML is concerned with
any process where the computer needs to receive a set of
data that is processed to help perform a task with more
efficiency.
Internet of Things(Elective 2) Prepared by
25
Dr.M.K.Jayanthi Kannan
Supervised Learning
• In supervised learning, the machine is trained with input for
which there is a known correct answer.
• For example you are training a system to recognize when
there is a human in a mine tunnel.
• A sensor equipped with a basic camera can capture shapes
and return them to a system that is responsible for
determining whether the shape is a human or something else
such as a vehicle, a pile of ore, a rock, a piece of wood, and so
on.
• With supervised learning techniques, hundreds or thousands
of images are fed into the machine, and each image is labeled
• This is called the training set.

Internet of Things(Elective 2) Prepared by


26
Dr.M.K.Jayanthi Kannan
• An algorithm is used to determine common parameters and
common differences between the images.
• The comparison is usually done at the scale of the entire image, or
pixel by pixel.
• Images are resized to have the same characteristics resolution,
color depth, position of the central figure, and so on), and each
point is analyzed.
• Human images have certain types of shapes and pixels in certain
locations which correspond to the position of the face, legs, mouth,
and so on).
• Each new image is compared to the set of known “good images,”
and a deviation is calculated to determine how different the new
image is from the average human image and, therefore, the
probability that what is shown is a human figure.
• This process is called classification.
• After training, the machine should be able to recognize human
shapes. Internet of Things(Elective 2) Prepared by Dr.M.K.Jayanthi Kannan 27
Unsupervised Learning
• In some cases, supervised learning is not the best method for
a machine to help with a human decision.
• Suppose that you are processing IoT data from a factory
manufacturing small engines.
• You know that about 0.1% of the produced engines on
average need adjustments to prevent later defects, and your
task is to identify them before they get mounted into
machines and shipped away from the factory.
• With hundreds of parts, it may be very difficult to detect the
potential defects, and it is almost impossible to train a
machine to recognize issues that may not be visible.
• However, you can test each engine and record multiple
parameters, such as sound, pressure, temperature of key
parts, and so on.
Internet of Things(Elective 2) Prepared by Dr.M.K.Jayanthi Kannan 28
• Once data is recorded, you can graph these elements in relation to
one another (for example, temperature as a function of pressure,
sound versus rotating speed over time).
• You can then input this data into a computer and use mathematical
functions to find groups.
• For example, you may decide to group the engines by the sound
they make at a given temperature.
• A standard function to operate this grouping, K-means clustering,
finds the mean values for a group of engines ,for example, mean
value for temperature, mean frequency for sound.
• Grouping the engines this way can quickly reveal several types of
engines that all belong to the same category for example, small
engine of chainsaw type, medium engine of lawnmower type.
• All engines of the same type produce sounds and emperatures in
the same range as the other members of the same group.
Internet of Things(Elective 2) Prepared by
29
Dr.M.K.Jayanthi Kannan
Unsupervised Learning
• The computing process associated with this determination is
called unsupervised learning.
• This type of learning is unsupervised because there is not a
“good” or “bad” answer known in advance.
• It is the variation from a group behavior that allows the
computer to learn that something is different.
• The example of engines is, of course, very simple.
• In most cases, parameters are multidimensional. In other
words, hundreds or thousands of parameters are computed,
and small cumulated deviations in multiple dimensions are
used to identify the exception.
• Figure 7-5 shows an example of such grouping and deviation
identification logic.
Internet of Things(Elective 2) Prepared by
30
Dr.M.K.Jayanthi Kannan
Internet of Things(Elective 2) Prepared by Dr.M.K.Jayanthi Kannan 31
Neural Networks
• Processing multiple dimensions requires a lot of computing power.
• It is also difficult to determine what parameters to input and what
combined variations should raise red flags.
• Similarly, supervised learning is efficient only with a large training set;
larger training sets usually lead to higher accuracy in the prediction.
• This requirement is partly what made ML fade away somewhat in the
1980s and 1990s and complicated.
• The computer can recognize that humans have distinct shapes and
that vehicles do not.
• Distinguishing a human from another mammal is much more difficult.
• The same goes for telling the difference between a pickup truck and a
van. This is where neural networks come into the picture.

Internet of Things(Elective 2) Prepared by


32
Dr.M.K.Jayanthi Kannan
• Neural networks are ML methods that mimic the way the human
brain works.
• When you look at a human figure, multiple zones of your brain are
activated to recognize colors, movements, facial expressions, and so
on.
• Your brain combines these elements to conclude that the shape you
are seeing is human.
• Neural networks mimic the same logic.
• The information goes through different algorithms each of which is
in charge of processing an aspect of the information.
• The resulting value of one unit computation can be used directly or
fed into another unit for further processing to occur.
• In this case, the neural network is said to have several layers.

Internet of Things(Elective 2) Prepared by


33
Dr.M.K.Jayanthi Kannan
• For example, a neural network processing
human image recognition may have two units
in a first layer that determines whether the
image has straight lines and sharp angles
because vehicles commonly have straight lines
and sharp angles, and human figures do not.

Internet of Things(Elective 2) Prepared by


34
Dr.M.K.Jayanthi Kannan
Figure 7-6 Neural Network Example
Internet of Things(Elective 2) Prepared by
35
Dr.M.K.Jayanthi Kannan
Figure 7-6 Neural Network Example
Internet of Things(Elective 2) Prepared by Dr.M.K.Jayanthi Kannan 36
Figure 7-6 Neural
Network Example

Internet of Things(Elective 2) Prepared by


37
Dr.M.K.Jayanthi Kannan
Neural Network Example
• Neural networks rely on the idea that information is divided
into key components, and each component is assigned a
weight.
• The weights compared together decide the classification of
this information (no straight lines + face + smile = human).
• When the result of a layer is fed into another layer, the
process is called deep learning (“deep” because the learning
process has more than a single layer).
• One advantage of deep learning is that having more layers
allows for richer intermediate processing and representation
of the data.
• At each layer, the data can be formatted to be better utilized
by the next layer. This process increases the efficiency of the
overall result.
Internet of Things(Elective 2) Prepared by
38
Dr.M.K.Jayanthi Kannan
Unit 4: Summary

This chapter discused


 Data and Analytics for IoT,
 An Introduction to Data Analytics for IoT,
 Data in Motion Versus Data at Rest
 Structured Versus Unstructured Data
 Machine Learning.
 Two main ML categories like supervised and
unsupervised learning.
 The Need for Neural Network and its functionality.
Internet of Things(Elective 2) Prepared by
39
Dr.M.K.Jayanthi Kannan 39
Text Books
1. Fundamentals: Networking Technologies,
Protocols, and Use Cases for the Internet of
Things”, David Hanes, Gonzalo Salgueiro, Patrick
Grossetete, Robert Barton, Jerome Henry,
1stEdition, Pearson Education (Cisco Press Indian
Reprint, ISBN: 978- 9386873743.

2. “Internet of Things”, Srinivasa K G, 2017,


CENGAGE Leaning, India.

Internet of Things(Elective 2) Prepared by


40
Dr.M.K.Jayanthi Kannan
Reference Books

1. “Internet of Things (A Hands-on-


Approach)”,Vijay Madisetti and ArshdeepBahga,
1st Edition, VPT, 2014. ISBN: 978-8173719547.

2. “Internet of Things: Architecture and Design


Principles”, Raj Kamal, 1st Edition, McGraw Hill
Education, 2017, ISBN: 978-9352605224.

Internet of Things(Elective 2) Prepared by Dr.M.K.Jayanthi Kannan 41


41

You might also like