FCM Yunlu 2016 Adaptive Weighted Fuzzy Clustering Algorithm For
FCM Yunlu 2016 Adaptive Weighted Fuzzy Clustering Algorithm For
Abstract—With the rapid development of the smart grid, a (FCM) [9] and Fuzzy K-Means [10]) and model-based algo-
large volume of smart meter data are collected in the form of rithms (such as Finite Mixture Modeling (FMM) [11], Self-
time series, which is called load profiles. This paper investigates Organizing Map (SOM) [12] and Support Vector Machines
the load profile clustering of smart grid customers, which is
significant for many applications. An adaptive weighted fuzzy (SVM) [13]). Partition algorithm is more widely used because
clustering algorithm is proposed to cluster load profiles, where of its less complexity and faster convergence speed compared
Principle Component Analysis (PCA) is used to reduce the to the model-based algorithm. Some combining algorithms are
data dimension, and then weighted Fuzzy C-Means (FCM) is also proposed to make up the weakness of single algorithm. A
adopted to cluster the big data. The optimal number of clusters hybrid clustering algorithms combining partitioning methods
is determined adaptively by integrating a clustering validity
function into the clustering algorithm. The simulation results and hierarchical methods is proposed in [14], which is suitable
show that the proposed algorithm could achieve considerable for large scale load data clustering.
improvement both in time complexity and clustering accuracy FCM is a soft partitioning method which is superior to the
through comparing four clustering validity indexes. traditional crisp partitioning method like K-Means, because
most objects have ambiguous attributes and may belong to
Index Terms—Load profiles, customer classification, PCA, more than one cluster [9]. However, there is a shortcoming of
weighted FCM, cluster validity
FCM that the number of clusters should be pre-defined. Many
I. I NTRODUCTION efforts have been made to determine the optimal number of
clusters, a popular way of determining the optimal number of
With the development of the smart grid, sensors and meter- clusters is to find the “knee” of the curve [9], [10], which is
ing devices such as smart meters are increasingly installed still a challenge.
into distribution networks [1]. The way to use and extract Due to the massive characteristic of the load data, many
valuable information from these big data is a serious challenge studies have attached great importance to reducing the data
faced by the electricity companies. The time-varying power dimension before clustering analysis, aiming to lower the
consumption data collected from smart meters form load computational complexity. There are three different dimension
profiles, and different load profiles of customers show their reduction methods are discussed in [14], respectively Sammon
different consumption behaviors. Conducting load profile clus- Mapping, SOM and PCA. Results show that PCA is superior
tering is an important task in grouping electricity customers to other methods because of the minimum computation time.
into classes and obtaining the typical load profiles (TLPs), In order to assess the performance of different algorithms,
which is significant for many applications, such as load fore- a measure of adequacy should be used. Research in this area
casting [2], abnormal electricity consumption detection [3], has proposed many different indices [10], [14] according to a
load control [4], designing electricity tariff offers [5], real- metric that samples within one partition should be compacted
time demand side response policy [6] and developing market and samples within different clusters should be separated, such
strategies [7]. as the clustering dispersion index (CDI), the modified Dunn
Many methods or techniques for clustering load profiles index (MDI), the scatter index (SI), the Davies-Bouldin index
have been proposed in literatures. They investigate clustering (DBI), and the sum of squared error (SSE).
analysis by using the existed classical clustering methods Existing literatures of load clustering analysis mainly focus
including partitioning methods, hierarchical methods, density- on the selection of clustering algorithm, but lack of considera-
based methods, grid-based methods and model-based methods. tion on different importance of features. In this paper, in order
Considering the large scale characteristic of load data, cluster- to cover the issues above, we propose an adaptive weighted
ing methods applied to load profile clustering mainly focus on Fuzzy C-Means clustering algorithm based on PCA (PCA-
partitioning algorithm (such as K-means [8], Fuzzy C-Means WFCM) to cluster load profiles and determine TLPs of smart
grid customers. Based on the conventional FCM algorithm,
This work is supported by the National Natural Science Foundation of
China (No.61461029), and Fundamental Research Funds for the Central we add different weights to the reduced features by variance
Universities (2014ZD03-01). contribution rate obtained from the PCA processing to improve
clustering accuracy. Moreover, the clustering validity function Step 4) Cluster validity analysis and TLPs generate. Four
Davies-Bouldin (DB) is integrated into the clustering algo- cluster validity indexes are used to evaluate the
rithm to determine the optimal number of clusters adaptively. clustering results, which are PC, PE, SSE and DB
The data sets based on the real smart meter readings collected respectively, and the TLPs are generated by the center
in Ireland are applied to evaluate the proposed PCA-WFCM profile of every cluster after clustering.
algorithm.
This paper is organized as follows. The load profile clus-
tering procedure is described in Section II. The PCA-WFCM
clustering algorithm is presented in Section III. The simulation
results are given in Section IV, and then conclusions are
presented in Section V.
features. Fig. 1. Two-stage clustering approach.
We describe clustering data as follows. Load data are
denoted as X = {x1 , x2 , · · · , xN }, where xj is a daily
load profile, and xj = {xj1 , xj2 , · · · , xjS } demonstates that III. PCA-WFCM A LGORITHM
each load profile has S features, where S ≤ 48. Clustering
analysis is to divide X into K classes, 2 ≤ K ≤ N , and In this paper, an adaptive weighted FCM clustering algorith-
V = {v1 , v2 , · · · , vK } is the cluster center vector. m based on PCA is proposed. The PCA is adopted to reduce
The classification of customers and generation of their TLPs the feature dimension of the original load data and far less
are achieved by applying clustering techniques, which include than 48 features are obtained. Then, the variance contribution
the following basic steps. of each principal components is used as the feature weight
and the weighted FCM clustering is performed. The cluster
Step 1) Data cleaning. Abnormal load data may occur due to number is adaptively selected based on the clustering validity
the network failure or meter error, and the values that function DB combined with the clustering algorithm.
are obviously wrong should be modified or deleted
before clustering analysis. For example, we remove A. PCA
those daily load profiles with 0 values or less than 48 The PCA [16] is a popular linear reduction method that
records. projects a data set embedded in an S-dimension space into an
Step 2) Data preprocessing. In this paper, we cluster the L-dimension subspace, with L S. Mathematically, the pro-
load profiles based on the shape of a load profile cedure requires computing the eigenvector of the covariance
instead of by absolute values, therefore the data matrix of the data. The zero-mean data points X = {x i } =
should be normalized, that is, scaled to a specific {xi − xi }, i = 1, 2, · · · , N , and the covariance matrix C is
range such as [0.0, 1.0]. Here we apply the min-max defined as
normalization method, which is defined as normij = 1 T
N
xij −min xij C= x ix i , (1)
1 ≤ i ≤ K, 1 ≤ j ≤ N .
j
max xij −min xij , N i=1
j j
Step 3) Load clustering. As illustrated in Fig.1, A two-stage and the eigenvector of the covariance matrix is computed as
load pattern clustering method is used in the clustering
procedure. In the first stage, clustering analysis is
S
1
K
0.6
IDB = Ri , (11)
K i=1 0.4
0.2
d(xi ) + d(xj )
Ri = max , (12)
i=j d(ci , cj ) 0
0 200 400 600 800 1000 1200 1400 1600 1800 2000
data set size
where d(xi ) is the intra-cluster distance of cluster i, and Fig. 2. Computing time for each clustering algorithm.
d(ci , cj ) is the distance between cluster center i and j. The
optimal number of clusters should minimize the value of the
index. We conduct the three clustering algorithms by varying the
We conclude a brief summary of 4 selected cluster validity data set size from 100 to 2000, and the time of each algorithm
functions in Table I. consumed is recorded. The time consumption of three different
clustering algorithms is illustrated in Fig. 2. It shows that PCA-
TABLE I: Cluster Validity Functions FCM and PCA-WFCM obviously outperform FCM with the
Validity function Functional description Optimal growth of the data size due to the PCA dimension-reduction
1
K
N
2
processing. It can also be found that the time consumed
Partition Coefficient IPC = N (uij ) Max(IPC )
i=1 j=1 by the proposed algorithm PCA-WFCM is very close to
K N
Partition Entropy 1
IPE = − N uij log(uij ) Min(IPE ) PCA-FCM. Hence, the weighted processing in PCA-WFCM
i=1 j=1
K does not consume too much time but can improve clustering
Sum of Squares Error ISSE = d2 (vi , x) Min(ISSE ) performance much more in the following figure.
i=1 x∈Xi
1
K d(xi )+d(xj ) Fig. 3 shows the performance of three different clustering
Davies-Bouldin IDB = max d(c Min(IDB )
K
i=1 i=j i ,cj )
algorithms. We use four clustering validity index to assess the
performance between the three clustering algorithm, namely
1 2.5 3.5
FCM FCM PCA−WFCM
0.8 PCA−FCM 2 PCA−FCM
PCA−WFCM PCA−WFCM
0.6 1.5
PC
PE
0.4 1
0.2 0.5 3
0 0
5 10 15 20 5 10 15 20
number of clusters number of clusters
DB
4
x 10
6 120
FCM FCM
5 PCA−FCM 100 PCA−FCM 2.5
PCA−WFCM PCA−WFCM
80
4
SSE
DB
60
3
40
2 20
1 0 2
5 10 15 20 5 10 15 20 2 4 6 8 10 12 14 16 18 20
number of clusters number of clusters number of clusters
1 2.5 that the PCA dimension reduction processing can not only
0.8 2 lower the time complexity, but also can improve clustering
0.6
1.5 performance by removing noises. The weighted processing can
PC
PE
0.4
1 improve clustering performance further but without too much
0.2
0.5
time consumption.
0
FCM PCA−FCM PCA−WFCM FCM PCA−FCM PCA−WFCM In Fig. 3, we can see the optimal number of clusters should
4
be 8, which can be chosen by the “knee” of SSE curve.
x 10
100
The optimal number of clusters determined by PCA-WFCM
5
80 algorithm adaptively is also 8, which is chosen at the minimum
4 60 value of DB index, as we can see in Fig. 5.
SSE
DB
40
3
20
The total 2046 load profiles are clustered into 8 clusters
2 0 by the proposed PCA-WFCM clustering algorithm in terms
FCM PCA−FCM PCA−WFCM FCM PCA−FCM PCA−WFCM of the optimal number of clusters. Each center of the clusters
represents the typical load profile of that cluster of customers.
Fig. 4. Validity index for each clustering algorithm on boxplot.
The final 8 typical load profiles are presented in Fig. 6. The
horizontal axis means daily time, and the vertical axis is power
consumption, whose unit is kWh. Fig. 6 shows that the 8
typical load profiles are in big difference. Some have only
PC, PE, SSE and DB, which are described in section III. one peak, and some have two or more. Some consume more
Repeated executions of the clustering algorithms have been power at night, and some use power all the day. In a word, the
performed by varying the number of clusters from 2 to 20 load profiles encode the behaviors of the customers and can be
and computing the clustering validity index for all algorithms. utilized to provide the demand-oriented services for different
The best clustering is achieved at the minimum value of types of users.
PE, SSE, DB and maximum value of PC. In Fig. 3, we
can see the proposed PCA-WFCM algorithm always exhibits