Adaptive Segmentation and Machine Learning Based Potential DR Capacity Analysis
Adaptive Segmentation and Machine Learning Based Potential DR Capacity Analysis
Abstract—By shedding their electricity consumption or getting introducing models of DR aggregator, more studies are shifted
ready to be shed, the Demand Response (DR) program to analyze the consumption behavior among low voltage users.
compensates the shortage of generation or acts as spinning Based on the data collected by low voltage advanced metering
reserve. Depending on how the end-users join in the program. the infrastructure (LVAMI), consuming behavior clustering [5],
potential capacity of DR is, therefore, a key issue no matter to consumption prediction [6], direct load control, and price-based
system operator or DR aggregator. The proposed method load control [7]-[11] are proposed. A. Asadinejad et al. [11]
employs adaptive k-means approach to evaluate the potential consider the probability of changing the consumption behavior
consumer as candidate participants of DR. The consumption to schedule the appliance usage. All these mentioned studies
prediction models of controllable appliances are constructed by
assume the appliances controllable, perfectly obeying the
Gaussian Processes for Machine Learning (GPML). Through
combining the candidates’ data and prediction models, the
schedule, and their historical consumptions well known.
potential capacity is then achieved. Case studies evaluate the However, except some demo sides, to most of consumers with
accuracy and efficacy of the proposed method with practical low just AMI installed, the consumption data of each appliance
voltage advanced metering infrastructure (LVAMI) data cannot be obtained. The capacity of individual consumer’s DR
achieved from Taiwan. The results show good efficiency and potential is, therefore, difficult to be investigated.
practicability of the proposed method. There are some studies deliberating the classification of
AMI data for even further analysis. K-means, hierarchical
Index Terms-- Demand Response, LVAMI, Adaptive K-means,
Machine Learning
clustering, and finite mixture modeling are traditional
clustering algorithms. References [12] and [13] compare these
I. INTRODUCTION algorithms and divide them as direct clustering and indirect
clustering categories. A suitable clustering method groups
Owing to growth of energy demand and environmental similar data within same bunch, and also has the ability to
protection issues, distributed renewable energy sources (RESs) distinguish dissimilar data. Some indices can be used to
have been introduced to worldwide distribution systems. The evaluate the clustering quality [12]. K-means is often used for
penetration of DGs, such as cogeneration, photovoltaic, gas AMI data clustering [6], [14]. Reference [6] uses the clustering
turbines, wind power, and fuel cells, is meanwhile fast scheme to find similarities between customers’ behaviors to
increasing. Adequate controllable and fast-reacting resources improve the accuracy of load forecast. Y. Zhang et al. propose
are required to maintain system’s security and reliability [1]. a method to calculate the baseline for DR based on the
energy storage system (ESS) and demand response (DR) are clustering result.
thus attracting public’s attention as the compensation for
traditional generation units. However, the number of groups k must be determined in
advance, and how to find a suitable k is an issue. To solve this
With the popularity of advanced metering infrastructure problem, adaptive k-means and Bayesian information criterion
(AMI), the real-time power consumption is fully aware by users are employed by References [15] and [16]. These two methods
and enable them to participate in active bidding programs and determine the number k according to error-tolerant rate and
managements [2]. Many countries have been pushing DR threshold [15] or Bayes Information Criterion [16]. The
program, while some of them can provide 5%-10% of total strategy employed by Reference [15] evaluates the potential a
consumption as DR capability. The advantages of DR include consumer has to participate in DR or energy efficiency (EE)
peak shaving, RES efficiency enhancement, short action time, program based on adaptive k-means algorithm.
and low cost.
This paper proposes a method to deal with large quantities
From previous researches, the analysis of DR is mainly of AMI data for the total consumption. The potential capacity
focused on medium or high-voltage level consumers with high to join DR is obtained with potential segmentation, prediction
quantity and stable usage of consumption [3],[4]. By
Authorized licensed use limited to: IEEE Xplore. Downloaded on July 28,2020 at 03:43:50 UTC from IEEE Xplore. Restrictions apply.
Consumer with DR program
The potential duration (peak
LVAMI Data Adaptive K-means stable consumption
users’ data time) data
behavior
Final Prediction
Segmentation&Screening Daytype Result
Controllable
Potential Weather Data
Appliances’ Data
Capacity
Sensitivity Analysis
models construction and capacity estimation. The details are dividing the maximum consumption L of the day. Let s (t ) be
described as follows. the normalized load profile and i(s) a tentative representative
II. PROPOSED STRATEGY load pattern. Euclidean distance E(s, i(s)) is computed in (2) to
assign the element to each cluster. E(s, i(s)) is minimized to
The research proposes a method to evaluate the users with determine k representative patterns i*(s), which is the
DR participating potential and the related capacity for DR. The representative load profile determined from (3) for minimal
capacity is calculated based on some kinds of appliances. The square error (MSE) and called centroid to signify the s sample
categories of appliances are determined by 1) the fixed and curves of data [5].
unadjustable load, 2) controllable load, and 3) adjustable load.
The second and third kinds of loads may contribute to DR By defining a threshold θ in (4), the number of i*(s) is
program. Limited by diverse facts, such as environment, updated. θ is the threshold which is normally limited within
location, weather, season, family structure and so on, prediction 0-2 [2],[15]. A near-to-2 value of θ denotes a loose
of some appliance consumption is complex and difficult classification environment with relatively less dictionaries. On
especially due to lack of historical consumption data. the contrary, a smaller θ leads to a stricter segmentation.
In the view of data mining, to evaluate the potential of DR Compared with the traditional k-means algorithm, the adaptive
capacity, the consumers with DR-participating potential are one generates optimal ko within a given boundary (kmin to kmax)
firstly screened. In other words, the consumer with stable and determine it referring to the MSE within the limitation by a
consumption behavior is preferred as a candidate to be further certain proportion θ of centroid value’s summation.
evaluated. The load data evaluation of candidate’s second and
The algorithm proceeds with the steps described in Figure
third categories of appliances (as defined previously) is then
2. It follows a simple principle that if a clustering process results
made. The quantities of energy that can be shed is computed
in violation of condition in (4), an additional 2-means algorithm
through the prediction models of individual appliances. The
is used to partition the violating segmentation and two new
overall flowchart of the proposed strategy is described as in
centroids are obtained. The updated result is then checked again
Figure 1 with the details given in the sections that follow.
with the re-defined boundary. By introducing equation (4), the
A. Potential Segmentation classification error tolerance becomes stricter, while the
K-means algorithm is traditionally used for classification. segmentation number is adaptable dynamically.
The value k is also be treated as the dictionary of s data and L = max .(l (1),..., l (t ),..., l (24)),
must be determined prior to the start of the algorithm. While
l (t ) (1)
dealing with large quantity of data, how to determine a suitable s(t ) =
k is a debatable issue. The target of the research is to find the L
consumer with stable consumption behavior through 24
classifying different patterns of power-consumption behavior. E ( s, i) = ∑t =1 ( s(t ) − i( s))2 (2)
That is, the value k should be different as for users with different
consumption data and behavior.
i* (s) = arg min E (s, i) (3)
Adaptive k-means is employed to solve the problems i
mentioned above. It starts with an initial kmin and forms the
seeds of centroid of clusters. Normalization of the data is very 24
important to avoid being dominated by one or few properties. E (s, i* (s)) ≤ θ ∑t =1 i * (s) 2 (4)
Equation (1) is used to normalize the daily profile l(t) by
Authorized licensed use limited to: IEEE Xplore. Downloaded on July 28,2020 at 03:43:50 UTC from IEEE Xplore. Restrictions apply.
Start
HVAC Consumption
Set k= kmin
Traditional k-means
algorithm
Temperature Dew Point Total Consumption
Training
Yes Data
Nv=0?
Temperat
No ure
Temperat
Historical Data
Well-Known
Dew
predicted
Point Prediction
GPML Model of Dew Point
Yes Total HVAC
k>kmax? Consump
Total
tion
No Consump
tion
For the clusters violating the Consump HVAC
tion of consumpt
threshold limitation do 2-means HVAC ion
clustering
Figure 4. GPML prediction model for HVAC construction
Update the cluster and
check the violation
Authorized licensed use limited to: IEEE Xplore. Downloaded on July 28,2020 at 03:43:50 UTC from IEEE Xplore. Restrictions apply.
Gaussian Distribution (GD). Meanwhile, any combination B. Potencial Capacity Reuslt
ofthe input vectors also follows GD. GP turns to Gaussian In the case study, the load data are segmented by adaptive
random field, when the vector is multi-dimensional. Similar to k-means method. The parameter of threshold θ is determined
GD, GP is also described as a distribution function.
as 2 according to [15], while kmin and kmax are set to be 5 and 105,
respectively. Sample consumers with stable consumption
k ( xm , xn ) = φ ( xm )T φ ( xn ) (6) behavior and with nonstable behavior are shown by load profile
number and entropy in Table I. Obviously, the stable ones have
Where xm is the prediction input, while xn is the exiting less segmentations of daily profiles. However, the number of
training data , such like total consumption,
temperature and dew segmentation is not the index that can represent the variability
point are the input of HVAC model. It is noted that all the input completely. Entropy (function (5)) is calculated according to
attributes’ categories and format should be the same. With the the probability of consumption behavior being divided to each
result obtained from Kernel function (6), the similarity between load pattern. As presented in Table I, entropy varies even with
prediction data and training data is calculated. The predicted the same load pattern number. The table lists part of the
output ym is thus computed with GP distribution function consumers from Taiwanese LVAMI dataset. Some of them
according to the relationship between ym and training data have larger load pattern number. However, the user M00249
output yn, HVAC’s historical consumption for HVAC model with 49 load patterns (meaning 49 consumption behavior types)
for instance. With GP and the features selected by previous has higher entropy than the one M00052 with 64 load patterns,
sensitivity screening, every appliance has its individual a fact which means M00249 actually has more nonstable
forecasting model. The same, HVAC model is taken as an consumption behavior than M00052. Overall, the candidates
example shown in Figure 4. The historical training data as with DR potential are selected with the entropy value lower
shown are associated with the HVAC power consumption at the than 25% of the highest entropy. 77.4% of the consumers are
same time interval. The more training data we have, the more thus concluded as the candidates.
accurate prediction model can be achieved. It should be pointed The users from two locations, M and T, are employed as
out that the input of prediction model for future time points examples. With the previous assortment, the numbers of the
must have the same categories of appliance, same units, and candidates are 213 and 496, separately, accounting for 89.5%
same time interval as the training data used to construct the and 84.9% of M and T consumer groups. Figure 6 is the
model. summation of predicted HVAC consumption of candidates
III. CASE STUDY AND DISCUSSION TABLE I. SOME PRESENTATIVE USERS’ LOAD PATTERN NUMBER AND
ENTROPY
The proposed method is applied to the LVAMI data
LOAD LOAD
provided by Industrial Technology Research Institute of CUSTOMER CUSTOMER
PATTERN ENTROPY PATTERN ENTROPY
Taiwan. The data contain 9,588 residential AMIs’ data for ID ID
NUMBER NUMBER
every 15 minutes in year 2016. For demonstration of training
M00012 50 5.6739 M00001 5 0.4712
the models, the biggest database for family energy consumption,
M00050 55 5.2924 M00002 5 0.3708
Pecan Street, is utilized [18]. The data of Pecan Street are
M00052 64 3.5292 M00003 5 0.2913
collected from communities of the cities in US. The database
M00082 53 4.3213 M00004 5 0.3189
provides the consumption data of different appliances per 15
minutes. All the data and codes are implemented in Matlab M00126 62 3.3015 M00005 5 0.2908
2017 for the model construction and potential DR capacity M00221 53 3.4485 M00006 5 0.2727
1.5
Air Consumption
1.5
same day. Compared to the actual data, most trends and 0.5
0.5
Prediction Real
5(c) and (d). Tested on the data for yearly appliance 2.5 2.5
Air Consumption
1.5
1
1
0
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
MAPE =
(d)
∑ (7) (c)
Authorized licensed use limited to: IEEE Xplore. Downloaded on July 28,2020 at 03:43:50 UTC from IEEE Xplore. Restrictions apply.
45 kWh
40
total consumption predicted air condition consumption
200
kWh
180
Total Consumption Predicted Air Condition Consumption
candidates and prediction models, the potential capacity can
35
30
160
140
thus be calculated. Realistic Taiwanese LVAMI data has been
120
tested to evaluate the applicability of proposed strategies. The
Air Consumption
Air Consumption
25
100
20
15
80
60
proposed scheme is believed to be a promising tool for
10
5
40 investigating the potential DR participants and their related
capacity for system operator or DR aggregators.
20
0 0
1 4 7 10 13 16 19 22 25 28 31 34 37 40 43 46 49 52 55 58 61 64 67 70 73 76 79 82 85 88 91 94 1 4 7 10 13 1619 22 25 28 31 34 3740 43 46 49 52 55 5861 64 67 70 73 76 7982 85 88 91 94
-5 Time per 15 mins per 15 mins
Time
(a) (b)
Authorized licensed use limited to: IEEE Xplore. Downloaded on July 28,2020 at 03:43:50 UTC from IEEE Xplore. Restrictions apply.