Literature Survey On Genetic Algorithm Approach For Fuzzy Rule-Based System
Literature Survey On Genetic Algorithm Approach For Fuzzy Rule-Based System
=
e
=
m
p
classT Xp
Xp j
Xp j
ClassT Aj conf
1
) (
) (
) (
Hence from (3), the consequent class Cj is determined using
the maximum confidence, shown using (4).
} ,...., 1 ) ( max{ arg M T classT Aj conf Cj = =
Input partitioning of the pattern space is given, there are many
approaches to generate fuzzy classification rules from data.
Using heuristic methods candidate rules can be evaluated
easily [3]. Their basic criterion, which is a fuzzy version of the
difference between the number of true positives and false
positives, is specified as in (5).
e e
=
classCj Xp classCj Xp
Xp j XP j classCj Aj f ) ( ) ( ) (
(5)
(2)
(3)
(4)
International Journal of Engineering Research (ISSN : 2319-6890)
Volume No.2, Issue No.2, pp : 29-32 01 April 2013
IJER@2013 Page 31
Single winner rule is the most popular and simple reasoning
method in fuzzy rule-based classifiers. Using this method, new
pattern Xt = [xt 1, xt 2, xtn] is classified according to the
consequent class of the winner rule Rw. In fact, the winner rule
has the maximum compatibility grade with Xt among the fired
rules. This can be stated as
} ,...., 2 , 1 ), ( max{ ) ( N j Xt j Xt w = = (6)
IV. FUZZY RULE-BASED CLUSTERING
ALGORITHM
Our anticipated approach uses supervised classification
approach to do the unsupervised cluster analysis by the
addition of some auxiliary data patterns to the main data and
usage of a fuzzy classifier to solve this new problem.
For extracting each cluster FRBC considers all unlabeled
data pattern as main data pattern and label them as class 1.Then
generation of random data pattern takes place which are
uniformly distributed data patterns. This random data pattern
can be treated as auxiliary data pattern.
For high dimensional problem, it is impossible to generate
specific number of uniform pattern, so they are produced
randomly and added to pattern space as a class 2 to form two-
class problem. Auxiliary data generation is controlled by
number of main data and their distribution.
For appropriately estimation of the number of auxiliary data
pattern should be added, the summation of a within-cluster
point to-point scatter matrix is used. This value for main data is
defined
<
=
j i
Xi Xj d q ) , (
(7)
Where Xi is one of the mM main data patterns, and d (.,.) is a
distance metric, which is usually the Euclidean norm.
Similarly, for auxiliary data {X
i, i
=1, mA}
<
=
j i
j X i X d q ) ' , ' ( '
(8)
Auxiliary random patterns are added incrementally using q and
q until q exceeds q. In this regard, the distribution of main
data patterns influences the size of auxiliary instances.
After the preparation of the two-class problem, the FRBC
make use of SGERD for a fuzzy rule generation to solve two-
class problem. Then rule evaluation measures can be applied
for the rule R
j
.
j
p X j Xp j
class Aj f
class Xp class Xp
'
) ' ( ) (
) 1 (
2 1
q
e e
=
(9)
Where
= j ' q
Number of auxiliary patterns that are covered by
the rule Rj
After generation of the fuzzy rules, the FRBC sets best rule
as the initial members of the first cluster and removes them
from the problem space to not be reconsidered for other
clusters.
In this manner FRBC explore all potential clusters with
assignment of distinct class labels to the consequent of fuzzy
rules that represent the explored clusters [8]. It then uses the
fuzzy rules simultaneously to classify the main data patterns
and, therefore, identify the clusters boundaries. Since the
FRBC uses the single winner-rule reasoning method, it selects
fuzzy rule with highest compatibility grade as in (6).To
increase the clustering accuracy [5], centroids can be calculated
as in (10)
e
e
=
Gj Xp
Gj Xp
XP j
Xp Xp j
Cj
) (
) (
(10)
Algorithm: Fuzzy rule-based clustering algorithm.
Inputs: mI unlabeled data patterns and threshold .
Outputs: The number of clusters J and their members that
are identified by J fuzzy classification rules.
1. Name the unlabeled patterns {Xp , p = 1, . . . , mI } of
the problem as the main data.
2. Let j = 1 (j: the cluster number).
3. Let mM = mI (mM : number of main data
(non clustered) patterns).
4. Generate some uniformly random patterns and append
them incrementally to the main data patterns until q_ in
(8) exceeds q in (7) (call these added patterns {Xp, p
=1, mA} as auxiliary data).
5. Label mM main data patterns as Class 1 and mA
auxiliary data as Class 2.
6. By the usage of SGERD with the criterion in (9),
generate fuzzy rules to classify this two-class problem
(with mM + mA data patterns).
7. Among the rules that are generated for Class 1, choose
the best one and name it Rj.
8. If e (Rj) < goto Step 12.
9. Set aside the main data patterns in the fuzzy subspace
of Rj as the initial members of cluster Gj.
10. Remove the members of Gj from the problem space,
and let mM = mM |Gj |.
11. Let j = j + 1, then goto Step 4.
12. Let J = j 1 as the number of explored clusters and
R1, RJ as their representing fuzzy rules, where Rj
identifies cluster Gj.
13. Replace the consequent of the rule Rj with j (for
j =1,. , J) so that the fuzzy rule base, which represents
clusters {G1, GJ}, would be {R1, . . . , RJ } with the
consequents {1, . . . , J}.
14. By the usage of the fuzzy rule base, classify the mI
unlabeled patterns to identify the actual members of
J explored clusters.
15. Compute the centroid of J clusters from (10).
International Journal of Engineering Research (ISSN : 2319-6890)
Volume No.2, Issue No.2, pp : 29-32 01 April 2013
IJER@2013 Page 32
16. Regroup the mI unlabeled patterns according to the
nearest centroid of J clusters.
17. Stop.
V. FUZZY RULE GENERATION METHOD
Method used in generating fuzzy rules is SGERD, which is
discussed bellow.
A. Steady state genetic algorithm for extracting fuzzy
classification rules from data:-
SGERD generates a prespecified number of Q rules per class
(R = M Q rules in total at most). Input for this algorithm is
labeled pattern and projected output is fuzzy rules.
Algorithm: SGERD.
I nputs: m labeled patterns of an n-dimensional M-class
problem and Q.
Outputs: Possibly R = MQ fuzzy classification rules.
1. i = 1 (i: generation number).
2. Generate all fuzzy rules having only one active
antecedent variable (at most C = 14 n
candidate rules would be generated).
3. Determine the consequent class of each candidate rule
using (4).
4. Divide the candidate rules into M groups according to
their consequent class.
5. Rank, in descending order of their fitness values, the
Candidate rules in each group.
6. Choose the best Q rules from each class (i.e., possibly
R = M Q rules in total) as the population in the i
th
generation. In the first generation only, choose the second best
R rules as the auxiliary population and put away for mutation.
7. Increment i, if i > n, goto step 11.
8. Use all individuals in the previous generation (i.e., R
rules) as parents and do reproduction (i.e., crossover, mutation,
or elitism) on them. That is, for each parent rule, generate as
offspring all fuzzy rules having one more active antecedent
variable than its parent, provided each new offspring is fitter
than its parent. In
this case, the number of offspring will totally be R 14 at
most.
9. If no offspring fitter than the parents is produced in
step (8), goto step 11.
10. Consider both parents and offspring in step 8 as
Candidate rules (at most C = R + R 14 rules in total) and goto
step 3.
11. Use R = M Q rules (obtained in step 6 for the i
th
generation) as the final population and stop. The actual length
of these rules is i or less.
As mentioned before, the rule selection scheme in SGERD
only considers the evaluation measure of each rule to select the
best ones through competition.
V.CONCLUSSION
In this paper, we proposed FRBC is a novel fuzzy rule-
based clustering algorithm to automatically explore the
potential clusters in the datasets. It looks at the clustering
issue as a classification problem by the addition of some
auxiliary data patterns to the main data and then generation
of some fuzzy rules to classify the new pattern space.
The generated fuzzy rules, which represent the clusters, are
human understandable with acceptable accuracy. SGERD is a
steady-state genetic algorithm to extract fuzzy classification
rule from data. Best rule is generated in SGRED.
REFERENCES
I. Eghbal G. Mansoori,FRBC: A Fuzzy Rule-Based
Clustering Algorithm, IEEE Trans. Fuzzy Syst., vol. 19, NO.
5, October 2011
II. L. Wang, C. Leckie, K. Ramamohanarao, and J.
Bezdek, Automatically determining the number of clusters in
unlabeled data sets, IEEE Trans. Knowl. Data Eng., vol. 21,
no. 3, pp. 335350, Mar. 2009.
III. E. G. Mansoori, M. J. Zolghadri, and S. D. Katebi,
SGERD: A steady state genetic algorithm for extracting
fuzzy classification rules from data, IEEE Trans. Fuzzy Syst.,
vol. 16, no. 4, pp. 10611071, Aug. 2008.
IV. H. Ishibuchi and T. Yamamoto, Comparison of
heuristic criteria for fuzzy rule selection in classification
problems, Fuzzy Optim. DecisionMaking, vol. 3, no. 2, pp.
119139, 2004.
V. R.Xu and D.Wunsch,Survey of clustering
algorithms, IEEE Trans.Neural Netw., vol.16, no.3, pp.645-
678, May 2005.
VI. H. Ishibuchi and T. Yamamoto, Fuzzy rule selection
by multi-objective genetic local search algorithms and rule
evaluation measures in data mining, Fuzzy Sets Syst., vol.
141, no. 1, pp. 5988, 2004.
VII. G. J. Klir and B. Yuan, Fuzzy sets and fuzzy logic:
Theory and Applications, in Pattern Recognition. Englewood
Cliffs, NJ: Prentice-Hall, 1995.
VIII. R. Dubes and A. K. Jain, Validity studies in
clustering methodologies, Pattern Recognit., vol. 11, pp.
235254, 1979.
IX. J. C. Bezdek, Cluster validity with fuzzy sets, J.
Cybern., vol. 3, pp. 58 71, 1974.
X. L. Zadeh, Fuzzy sets, Inf. Control, vol. 8, pp. 338
353, 1965.