Problems of Fuzzy C-Means Clustering and Similar Algorithms With High Dimensional Data Sets
Abstract Fuzzy c-means clustering and its derivatives are very successful on many
clustering problems. However, fuzzy c-means clustering and similar algorithms have
problems with high dimensional data sets and a large number of prototypes. In particular, we discuss hard c-means, noise clustering, fuzzy c-means with polynomial
fuzzifier function and its noise variant. A special test data set that is optimal for clustering is used to show weaknesses of said clustering algorithms in high dimensions.
We also show that a high number of prototypes influences the clustering procedure
in a similar way to a high number of dimensions. Finally, we show that the negative
effects of high dimensional data sets can be reduced by adjusting the parameter of
the algorithms, i.e. the fuzzifier, depending on the number of dimensions.
1 Introduction
Clustering high dimensional data has many interesting applications, for example
clustering similar music files, semantic web applications, image recognition or biochemical problems. Many tools today are not designed to handle hundreds of dimensions or, as they might better be called in this context, degrees of freedom. Many
clustering approaches work quite well in low dimensions, but the fuzzy
c-means algorithm (FCM) [4, 2, 8, 10] in particular seems to fail in high dimensions. This paper
is dedicated to giving some insight into this problem and into the behaviour of FCM as
well as its derivatives in high dimensions.
Roland Winkler
German Aerospace Center Braunschweig e-mail: [email protected]
Frank Klawonn
Ostfalia, University of Applied Sciences e-mail: [email protected]
Rudolf Kruse
Otto-von-Guericke University Magdeburg e-mail: [email protected]
The algorithms that are analysed and compared in this paper are hard c-means
(HCM), fuzzy c-means (FCM), noise FCM (NFCM), FCM with polynomial fuzzifier function (PFCM) and PFCM with a noise cluster (PNFCM), which extends
PFCM in the same way as NFCM extends FCM. All these algorithms are prototype based and are gradient descent algorithms. Prior to this paper,
an analysis of FCM in high dimensions was presented in [12], which provides a more
extensive view of the high dimension problem but solely analyses the behaviour
of FCM. Not included in this paper is the extension by Gustafson and Kessel [7],
because this algorithm is already unstable in low dimensions. Also not included is
the competitive agglomeration FCM (CAFCM) [6], because this algorithm is not a gradient
descent algorithm in the strict sense.
A very good analysis of the influence of high dimensions on nearest neighbour search is given in [1]. The nearest neighbour approach cannot be applied directly to clustering problems, but the basic problem is similar and can thus be used
as a starting point for the analysis of the effects of high dimensional data on FCM
as it is presented in this paper.
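The distance concentration effect analysed in [1] is easy to observe empirically: for random points, the relative difference between the farthest and the nearest neighbour distance shrinks as the dimension grows. A small illustration in Python (the sample sizes and dimensions below are arbitrary choices, not values from the paper):

```python
import numpy as np

rng = np.random.default_rng(0)
for n_dims in (2, 10, 50, 200):
    X = rng.random((1000, n_dims))   # data objects, uniform in the unit hypercube
    q = rng.random(n_dims)           # a query point
    d = np.linalg.norm(X - q, axis=1)
    # relative contrast (d_max - d_min) / d_min shrinks as the dimension grows
    print(n_dims, (d.max() - d.min()) / d.min())
```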
We approach the curse of dimensionality for the above mentioned clustering algorithms because they seem very similar but perform very differently. The main
motivation lies more in observing the effects of high dimensionality than in producing a solution to the problem. First, we give a short introduction to the algorithms
and present a way to test them in a high dimensional environment in
the next section. In Section 3, the effects of a high dimensional data set are presented. A way to use the parameters of the algorithms to cope with high dimensions
is discussed in Section 4. We close this paper with some final remarks in Section 5,
followed by a list of references.
All the algorithms considered here are based on minimising an objective function of the form

J = Σ_{i=1}^{c} Σ_{j=1}^{m} f(u_ij) d_ij²    (1)

where u_ij is the membership degree of the j-th data object to the i-th prototype, d_ij is the distance between them, and f is the fuzzifier function that distinguishes the algorithms.

The fuzzifier function for FCM [4, 2] is a power function, f_FCM(u) = u^ω with ω ∈ R and ω > 1. In Figure 1, the prototypes are represented as filled circles; their
tails show the paths the prototypes took from their initial to their final locations.
The devastating effect of a high dimensional data set on FCM is obvious: the prototypes run straight into the centre of gravity of the data set, independently of their
initial locations, and therefore find no clusters at all. NFCM [3] is one of the two
algorithms considered in this paper that are able to detect noise. Its fuzzifier function is identical to that of FCM: f_NFCM = f_FCM. Apart from the fact that all data
objects have their highest membership value for the noise cluster, the behaviour of the
algorithm does not change compared to FCM. PFCM [9] is a mixture of HCM and
FCM, as the definition of its fuzzifier function shows:

f_PFCM(u) = ((1 - β)/(1 + β)) u² + (2β/(1 + β)) u.

This fuzzifier function creates an area of crisp membership values around each prototype, while outside of these areas fuzzy membership values are
assigned. The parameter β controls the size of the crisp areas: a low value of β
means a small area of crisp membership values.
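For concreteness, the two fuzzifier functions can be written down as follows (a minimal Python sketch using the symbols ω and β from the formulas above; the default values are arbitrary illustrations):

```python
def f_fcm(u, omega=2.0):
    """Standard FCM fuzzifier: f(u) = u**omega with omega > 1."""
    return u ** omega

def f_pfcm(u, beta=0.5):
    """Polynomial fuzzifier of PFCM:
    f(u) = (1 - beta)/(1 + beta) * u**2 + 2*beta/(1 + beta) * u.
    For beta -> 0 it reduces to the quadratic FCM fuzzifier (omega = 2),
    for beta -> 1 it approaches the identity, i.e. crisp HCM-like behaviour;
    a larger beta means a larger crisp region around each prototype."""
    return (1.0 - beta) / (1.0 + beta) * u ** 2 + 2.0 * beta / (1.0 + beta) * u
```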
An algorithm that fails on D cannot be expected to succeed on other data sets, because there is no easier data set than D, especially if additional problems of high dimensional data occur, like overlapping clusters or very unbalanced cluster
sizes.
As the example in Figure 1-right has shown, the prototypes end up in the centre of gravity for FCM and NFCM. To understand why this behaviour occurs
(and why it does not occur for PFCM), the clustering algorithms are tested in a rather
artificial way. The prototypes are all initialised in the centre of gravity (COG) and
then moved towards the data objects, ignoring the update procedure
of the clustering algorithms. Let α ∈ [0, 1] control the location of the prototypes:
with x_i ∈ R^n the i-th data object and cog(D) ∈ R^n the centre of gravity of
data set D, define y_i : [0, 1] → R^n with y_i(α) = α x_i + (1 - α) cog(D) and finally
d_ij(α) = d(y_i(α), x_j). Since the membership values are functions of the distance
values, and the objective function is a function of membership values and distance
values, the objective function can be plotted as a function of α.
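The curves in Figure 3 can be reproduced with a few lines of code. The sketch below assumes that each prototype is paired with one representative data object (e.g. the centre of one cluster) and uses the standard FCM membership update; the fuzzifier value ω = 2 and all names are illustrative choices, not taken from the paper.

```python
import numpy as np

def fcm_objective_along_alpha(X, targets, omega=2.0, steps=101):
    """Evaluate the FCM objective J(alpha) = sum_ij f(u_ij) * d_ij**2 while the
    prototypes move on straight lines from the centre of gravity (alpha = 0)
    towards their target data objects (alpha = 1); the memberships are set to
    the usual optimal FCM values for the current prototype positions."""
    cog = X.mean(axis=0)                      # centre of gravity of the data set
    alphas = np.linspace(0.0, 1.0, steps)
    J = np.empty(steps)
    for k, a in enumerate(alphas):
        Y = a * targets + (1.0 - a) * cog     # y_i(alpha) = alpha*x_i + (1-alpha)*cog(D)
        # squared distances d_ij^2 between prototypes y_i and data objects x_j
        d2 = ((Y[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1) + 1e-12
        # optimal FCM memberships: u_ij ~ d_ij^(-2/(omega-1)), normalised per data object
        w = d2 ** (-1.0 / (omega - 1.0))
        u = w / w.sum(axis=0, keepdims=True)
        J[k] = ((u ** omega) * d2).sum()
    return alphas, J / J[0]                   # normalised to 1 at alpha = 0
```

Plotting J against α for increasing dimensions reproduces the qualitative picture discussed below: a local maximum between the centre of gravity (α = 0) and the clusters (α = 1).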
Fig. 3 Objective function plots for FCM (left) and NFCM (right)
However, for FCM and NFCM, these factors have a strong influence on the objective function. In Figure 3, the objective functions of these two algorithms are
plotted for a variety of dimensions, as functions of α. For convenience, the objective
function values are normalised to 1 at α = 0. The plots show a strong local maximum between α = 0.5 and α = 0.9. Winkler et al. showed in [12] that the number of
dimensions affects the height of this local maximum. The
number of prototypes, however, influences its location: the higher
the number of prototypes, the further to the right the local maximum is observed.
Since these are gradient descent algorithms, the prototypes will run into the centre
of gravity if they are initialised to the left of the local maximum, which is exactly what
is shown in Figure 1-right. Since the volume of an n-dimensional hypersphere
is proportional to the n-th power of its radius, it is almost hopeless to initialise a prototype
close enough to a cluster for the prototype to converge to that cluster. For this example, in 50 dimensions and with 100 prototypes, the radius within which a prototype converges to a cluster
is 0.3 times the feature space radius, which means the corresponding hypervolume is only 0.3^50 ≈ 7.2 · 10^-27
times the volume of the feature space.
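Because the covered fraction of the feature space is just the relative radius raised to the power of the dimension, the quoted number can be checked in one line:

```python
print(0.3 ** 50)  # ~7.18e-27: fraction of the feature space volume covered in 50 dimensions
```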
Fig. 4 Objective function plots for PFCM (left) and PNFCM (right)
As presented in Figure 4-left, PFCM does not create such a strong local maximum as FCM, and the local maximum that can be observed lies very far to the left. That is
the reason why PFCM can be applied successfully to a high dimensional data set.
The situation is quite different for PNFCM, see Figure 4-right. The fixed noise distance is
chosen appropriately for the size of the clusters, but the distance of the prototypes to
the clusters is much larger. Therefore, all data objects have membership value 0 for
the prototypes, which explains the constant objective function value.
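The effect can be illustrated with the membership update of Davé's noise clustering [3], in which the noise cluster acts like an additional prototype at a fixed distance δ to every data object. The following sketch uses the standard (NFCM-style) update with an assumed fuzzifier ω = 2 and illustrative numbers; with the polynomial fuzzifier of PNFCM, the memberships of far-away prototypes do not merely become small but exactly 0, which is what makes the objective function constant in Figure 4-right.

```python
import numpy as np

def noise_fcm_memberships(d2, delta, omega=2.0):
    """Membership degrees in noise clustering (NFCM-style update): the noise
    cluster behaves like an extra prototype at fixed squared distance delta**2
    to every data object. d2 has shape (c, m): squared distances of the c real
    prototypes to the m data objects."""
    w = d2 ** (-1.0 / (omega - 1.0))
    w_noise = (delta ** 2) ** (-1.0 / (omega - 1.0))
    denom = w.sum(axis=0) + w_noise
    return w / denom, w_noise / denom   # memberships to prototypes, to noise cluster

# If the prototypes are far away compared to the noise distance (d >> delta),
# almost all membership goes to the noise cluster:
d2 = np.array([[100.0 ** 2]])           # one prototype, one data object, d = 100
u, u_noise = noise_fcm_memberships(d2, delta=1.0)
print(u, u_noise)                        # u ~ 1e-4, u_noise ~ 0.9999
```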
Fig. 5 Objective function plots for FCM (left), NFCM (middle) and PNFCM (right) with dimension dependent parameters
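As a purely hypothetical sketch of such a dimension dependent parameter, the fuzzifier ω could, for instance, be moved towards 1 (i.e. towards crisper memberships) as the number of dimensions n grows; the functional form 1 + a/n and the constant a below are illustrative assumptions, not the adjustment rule evaluated in Table 1.

```python
def dimension_adjusted_fuzzifier(n_dims, a=2.0):
    """Hypothetical dimension-dependent FCM fuzzifier: omega shrinks towards 1
    (crisper memberships) as the dimension grows. The constant a and the
    functional form are illustrative assumptions only."""
    return 1.0 + a / n_dims
```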
To test the effect of the dimension dependent parameters, we apply each algorithm 100 times to T50; the results are presented
in Table 1 as mean and sample standard deviation (in parentheses). The found-clusters column is the most important one; the other two only measure the performance in
recognising noise data objects. The test clearly shows the improvement obtained by adjusting
the parameters according to the number of dimensions.
Algorithm    Found clusters    Noise recognition (see text)
HCM          42.35 (4.65)
FCM              0 (0)
NFCM             0 (0)
PFCM         90.38 (2.38)
PNFCM            0 (0)
Adjusted Parameter
FCM AP       88.09 (3.58)      (0)        0 (0)
NFCM AP      88.5  (3.37)      (0.58)     1136.0 (344.82)
PNFCM AP     92.7  (2.67)      (3.12)     96.0 (115.69)
Table 1 Performance overview on T50 with 100 data objects for each cluster and 1000 noise data
objects; each algorithm is applied 100 times. The mean value and (sample standard
deviation) are displayed.
5 Conclusions
The two algorithms HCM and FCM do not work properly in high dimensions. It
is therefore very odd that a combination of them, in the form of PFCM, works quite
well. We have shown that the reason for this effect is a very small local minimum of
PFCM, compared to FCM, at the COG. We have also shown that FCM, NFCM and PNFCM
can be tuned in such a way that their objective functions show a behaviour similar to PFCM
in our test, in which case the clustering result on the test data set
T50 is also similar. The question remains why this local minimum occurs. A possible explanation
is presented in [1, 5], which identify the effect of distance concentration as the
most problematic one for obtaining meaningful nearest neighbour searches. Further work
will be devoted to investigating this connection.
References
1. Kevin Beyer, Jonathan Goldstein, Raghu Ramakrishnan, and Uri Shaft. When is nearest neighbor meaningful? In Database Theory - ICDT'99, volume 1540 of Lecture Notes in Computer Science, pages 217-235. Springer Berlin / Heidelberg, 1999.
2. James C. Bezdek. Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York, 1981.
3. Rajesh N. Davé. Characterization and detection of noise in clustering. Pattern Recognition Letters, 12(11):657-664, 1991.
4. J. C. Dunn. A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. Cybernetics and Systems: An International Journal, 3(3):32-57, 1973.
5. Robert J. Durrant and Ata Kabán. When is 'nearest neighbour' meaningful: A converse theorem and implications. Journal of Complexity, 25(4):385-397, 2009.
6. Hichem Frigui and Raghu Krishnapuram. A robust clustering algorithm based on competitive agglomeration and soft rejection of outliers. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), page 550, 1996.
7. Donald E. Gustafson and William C. Kessel. Fuzzy clustering with a fuzzy covariance matrix. In Proceedings of the 17th IEEE Conference on Decision and Control, pages 761-766, 1978.
8. F. Höppner, F. Klawonn, R. Kruse, and T. Runkler. Fuzzy Cluster Analysis. John Wiley & Sons, Chichester, England, 1999.
9. Frank Klawonn and Frank Höppner. What is fuzzy about fuzzy clustering? Understanding and improving the concept of the fuzzifier. In Advances in Intelligent Data Analysis V (IDA 2003), volume 2810 of Lecture Notes in Computer Science, pages 254-264. Springer Berlin / Heidelberg, 2003.
10. Rudolf Kruse, Christian Döring, and Marie-Jeanne Lesot. Advances in Fuzzy Clustering and its Applications, chapter Fundamentals of Fuzzy Clustering, pages 3-30. John Wiley & Sons, 2007. ISBN: 978-0-470-02760-8.
11. Hugo Steinhaus. Sur la division des corps matériels en parties. Bull. Acad. Pol. Sci., Cl. III, 4:801-804, 1957.
12. Roland Winkler, Frank Klawonn, and Rudolf Kruse. Fuzzy c-means in high dimensional spaces. International Journal of Fuzzy System Applications (to appear), 2011.