Big Data (Analytics) in Power Systems
Big Data (Analytics) in Power Systems
1|P age
Abstract:
Power system has faced different challenges among the years, technologies and innovations
within last few years helped with growing the whole system and improving the methodology of
each concept. Starting from producing large amount of data which called concept of Big Data to
how to manage storing those data’s and the availability in real-time needed for further
operations within electrical power system.
2|P age
Table of Contents
Abstract:................................................................................................................................................................ 2
I. Introduction: ................................................................................................................................................. 4
Characteristics: ............................................................................................................................................. 5
V. Conclusion:.................................................................................................................................................. 19
References .......................................................................................................................................................... 20
3|P age
I. Introduction:
“Big data is not a fad. We are just at the beginning of a revolution that will touch every business
and every life on this planet.” [1]
Due to the rapid growth of the world and daily new innovations, keeping up is mandatory to
ensure reserving the data for generations to come, this is how the concept of big data or -
information explosion as Oxford dictionary- came.
With all the details around us, unseen data transferred between smart phones, cars, even
houses, and human interactions itself. Requiring complicated analysis and massive storage,
where the collected data are on exponential growth due to the world development. All the new
methods required for this amount and volume of data is a new challenge to face it. But a new
opportunity as well to this century.
The great progress for IT team with collecting data, provides new vision and future for
engineers to ensure the ability of growing the capacity of data receiving.
4|P age
II. Big data:
What is big data:
To describe what is it exactly, think of one single smartphone user how many data can be
generate of (texting, phone calls, emails, photos, videos, ...etc.) approximately 50 exabytes.
Now imagine this number is multiplied by 5 billion smartphone users, that is a lot to process
only by smartphones, think of machines and mother nature that process data as well. This
amount of data is quite large to process by one normal computer. so, this giant amount of data
we refer to it by big data.
It is a technology evolution, to ensure that each user got the right information at the right
needed time from all the available data that keep growing from a long time till now.
The challenge is not how to deal with big massive data, but how to manage and control the
diverse data with diverse information in addition to real time access.
Characteristics:
As big data refer to massive amount representation of data set (Volume), the speed of
generating this information (Velocity), and more branched out to include structure, semi
structure, and unstructured (Variety).
- Volume:
The amount of data generated and represented
each day, and it increased exponentially by the
information analytics.
- Velocity:
Represent where data is captured, shared and
generated in the real time.
- Variety:
Diversity of the type and source of data
managed by system and this leads to sort the
data in structured manner and links the
relationship of each.
Figure 2 - The 3 V's of big data
5|P age
Due to importance of the concept how crucial has been added to 3 fundamental V’s, and it
might increase more depend on the need to each company, group, …etc.
- Validity or Veracity:
Is to assure of data quality, and authenticity. As working with huge amount of data
results kind of less quality.
- Value:
Adding value to each user. Establish each data platform from company it might be
without real value.
Most popular and established one is Apache Hadoop, which is an open source framework for
saving and processing / analyzing data.
Another one is Apache Spark, which can store a big part of data processing in a memory and
disk. Hence it is much faster. Other advantage it can run on one single local machine.
6|P age
One more is Apache Kafka, that allows to publish and subscribe to real time data. This help with
bringing reliability to messaging system.
There are three types of Big Data, Structure, Semi Structure, and Un Structure:
- Structured data:
Its fixed format and handled by machines. Consists of information already managed by
the organization in database.
- Un-structured data:
Is unorganized information, no specific format. Can be gathered from anywhere such as
data from social media sources.
- Semi-structured data:
Contain both the forms of data. As an example sensor’s data entered by the developer
and web server logs.
7|P age
Big Data Analytics:
Big Data Analytic refers to collecting, organizing, and analyzing of different information to
achieve the purpose. Mainly focusing on solving new problems or old problems in a new better
way. Here are types of Big Data Analytics:
- Descriptive Analytics
(What is happening?), first stage of data analytics that creates a history for a data. Help
to uncover pattens that offer perception and provides prospects and trends.
- Diagnostic Analytics:
(Why did it happen?), it looks and search for the root cause of the problem. This uses to
identify and understand the cause of events and behaviors.
- Predictive Analytics:
(What is likely to happen?), uses many techniques like artificial intelligence and data
mining to investigate and make scenarios of what will happen.
- Prescriptive Analytics:
(What should be done?), provide historical data and predictive analytics to find the right
action and the best solution to take.
Big data can help the organization to come up and create a whole new growth. Each
organization uses data in its own way, the more efficiently uses data the possibility to achieve
and growth is high.
The ability to operate big data in an efficient way brings many benefits for different sectors
such as, health, education, industry, and much more. How exactly it is important?
8|P age
In business large amount of data to be stored, some kind of big data tools like (cloud-
based analytics, and Hadoop) can help in identifying more efficient way of doing
business and bring coast advantages as well.
- Time reduction:
Due to high speed of big data tools can easily check and find new sources of data and
this led to immediately analyzing data and making new quick discussions based on past
learning.
- Customer care:
In a business wise customer behavior is very important to trigger loyalty, where any
business asset is customers. It allows to observe the various of customer related
patterns and trends.
And much more reasons why big data is important to our life. Here is an example in figure
bellow, a study by Business Application Research Center (BARC) how some of biggest company
of the world utilizing big data analytics.
9|P age
The future:
Nowadays everything is heading to next level of development and future to come, controlling
everything with one device such as IoT concept (Internet of Things). Once everything starting to
use IoT the possibility of using big data will be giant. Not only the amount of data that will
increase the analytics techniques will variety as well. Here is a report of global big data market
forecast 2019-2027 -bellow in the figure-.
10 | P a g e
III. Power System
Electrical Power System is a network consist of three phases (Generation, Transmission,
Distribution). Uses one kind of energy to convert it into Electrical Energy. Most of electricity
generated in UAE using natural gas. In addition, UAE is developing to achieve strategic
objectives of the Dubai Integrated Energy Strategy 2030 to diversity energy resources and
improve efficiency of electricity and water usage. [4]
11 | P a g e
IV. Big Data (Analytics) in power system:
Big data technologies for smart grid:
- Data sources:
Varity of data is based on how the values are extracted, as we have Operational Data
related to electrical data of the grid, which represent real and reactive power flows, voltage,
…etc. Non-Operational Data not related to grid power, but it refers to main data, which is
on power quality and reliability, …etc. Meter Usage Data other type of data related to
consumer on power usage and demand values as average usage, peak and time of the day,
…etc. Event Message Data which is related to smart grid devices as fault detection, voltage
loss, …etc. finally Metadata which is related to explain and design all other types of data
from several sources as example sensors, devices, mobile data, substations, …etc.
- Data integration
To ensure data integration, several technologies and approach used in latest
communication technology and advanced operations methods are to improve smart grid
reliability, efficiency and performance. Such as:
12 | P a g e
▪ Common Information Models (CIM) is critical specially in failure or success of data
management in energy management systems in term of time, coast, and data
integration. Which helps to exchange data with technical grid infrastructure.
▪ Enterprise Service Bus (EBS) reduces coast and time in terms of monitoring and
management. Which is achieve great approach to manage communication between
different kind of systems as GIS, CIS, OMS, …etc.
▪ Messaging which responsible on communication systems based on exchanging
messages include data and some information.
▪ Service Oriented Architecture (SOA) makes data integration flexible and easier by
using single approach software communicate together. Which solve the problem of
how to maintain such amount of systems provided to the user.
- Data storage
Data storage works as critical role in smart grid, as collecting data from many sources and
delivering it to analytics tools. Storage system need to be developed to meet big data
requirements.
13 | P a g e
Voldermort, 2. Column-oriented solutions as Cassandra and HBase, 3. Documents
databases solutions as MongoDB and CouchDB.
- Data analytics
Smart grid does collect data from many sources and stored it in huge quantity of dataset
that should be easier for analyzing. It is essential role to make the grid more efficient and
intelligent:
Descriptive model is used in describe customers behaviors. Diagnostic model understands their
behavior and analyze it. Predictive model is to predict customers decision in the future. Finally,
prescriptive model high level of analytics in smart grid to affect marketing and decision making.
14 | P a g e
Two ways to process big data, first is batch processing which process data without high
requirements on time. Second, is stream processing which is used in real-time applications.
- Data visualization
Based on different high dimension visualization, 2D and 3D is used by the system. But due
to massive amount of data required data presenting such as 3D power map, scatter
diagram, …etc.
- Data transmission
Due to importance of data transmission’s role, maintaining is required for high bandwidth
capacity, speed, data security and privacy, …etc.
15 | P a g e
▪ Electric device state monitoring:
A single failure in power transformers may cause huge outage in power system.
Therefore, management of life cycle of power transformer is extremely important.
Where the existing methods focused on limited state parameters, while the potential
risk problem and health condition can be predicted with the help of Proportional Hazard
Model (PHM) which developed to process and classify lifecycle data.
16 | P a g e
get the forecasting results data of wind energy. Later by using vector regression method
to predict the wind speed and timeline.
▪ Cyber-Security
Is a main issue facing this generation to deal with security issues as availability, privacy,
integrity, auditability, authentication, authorization, confidentiality, nondeducibility,
…etc.
17 | P a g e
▪ Real-time big data intelligence
Massive amount of available data in operation makes the process for such data is
challenging as well while the real time operations/responses of monitoring and
analyzing real time big data energy demand are required.
▪ Data quality
Due to massive data resources available, databases include data with all the
characteristics as incompleteness, inconsistency, and inaccuracy.
V. Case study:
17 years ago, 2003, Aug 14th a huge blackout hit Northeast America, which caused more
than 40 million people in 8 state and 10 million people in Canada to lose their electricity up
to days.
After investigation found that software bug in alarm’s system of Ohio, which collapsed and
failed to redirect power from an overloaded power line. Then the wire got heated and
dropped down into a tree near to Cleveland, which tripped a circuit and caused power to
redirect to other line. Hence it got overloaded and set off a line of failures that resulted a
huge blackout in history.
Later more than decade, electrical grid still having some kind of failures, but due to new
data monitoring system it has the potential to transform the grid by providing real-time
data and solving any issues caused by the weather.
Which is the concept of smart grid, a system connected to a bunch of sensors to secure
two-way communications and analytics.
Like a human body, smart grid can be thought of self-healing. It has the huge ability to
identify and solve problems.
As GTM Research global utility data analytics market of 20$ billion between 2013-2020. The
investment includes sensors, hence US has installed over 1000 sensors all over the country,
funded by the Recovery Act Smart Grid Investments.
18 | P a g e
In 2012 during Hurricane, having PMUs (Phasor Measurement Units) installed all over, did
reduced the storm’s impact. Where the physical damage was done, but the sensor did stop
it before spreading to other near places, to prevent repeating the history of 2003.
Smart grid does not only monitor the grid in real-time it also reducing theft energy and
knowing when to rely on renewable energy, which accounted for 13% of US electricity in
2014. It also will be able to integrate with smart building and smart home technologies.
In 2009, US utilities had 194 Petabytes of stored data, to make it clear if we will compare it
to entire digital collection library of Congress its just 3 Petabytes.
The question is how much of all stored data you really used? and with all that massive
amount of data security is essential. However, if the stored data is cleaned it will remain
secure. [5]
VI. Conclusion:
The future approach for big data is increasing wisely, starting from monitoring to analyzing then
will start to act smartly and make decisions.
It can be applied by combining all knowledges from Artificial intelligence to IoT and of-course
concept of big data we can head to next step. Letting the machines apply self-learning concept
and start making decisions for further steps. In addition, predicting to predicting the future will
prevent faults and increase efficiency. This is how the future journey begins from yesterday.
19 | P a g e
References
[9] Yuanjun Gueo, Kang Li, Wenxiong Mo, "IEEE Explore," 02 SEP 2016. [Online]. Available:
https://ieeexplore-ieee-org.ezproxy.rit.edu/stamp/stamp.jsp?tp=&arnumber=7737581.
20 | P a g e