Comparing The Performance of Frequent Pattern Mini
Comparing The Performance of Frequent Pattern Mini
net/publication/272864559
CITATIONS READS
33 16,840
2 authors, including:
Kanwal Garg
Kurukshetra University
22 PUBLICATIONS 187 CITATIONS
SEE PROFILE
All content following this page was uploaded by Kanwal Garg on 16 March 2016.
29
International Journal of Computer Applications (0975 – 8887)
Volume 69– No.25, May 2013
backtracking.. Frequent patterns are stored in a bit matrix Figure 1. Comparison of Apriori, Eclat and FP
structure. Eclat is memory efficient because it uses prefix tree. Growth algorithm on artificial dataset.
The algorithm has good scalability due to the compact
representation.
3. METHODOLOGY
Figure 3. Comparison of apriori, éclat and FP
Growth algorithm on artificial dataset when the
The above mentioned three algorithms were implemented in number of attributes are made three times the
java and there performance was compared on synthetic dataset original.
by varying number of attributes and instances. The performance
comparing parameter is execution time.
4. COMPARATIVE ANALYSIS
30
International Journal of Computer Applications (0975 – 8887)
Volume 69– No.25, May 2013
Figure 4. Comparison of Apriori, Eclat and FP the between the two. Also from figure 2 and figure 3 it is clear
Growth algorithm on a dataset with the number that increasing the number of attributes affects more any
of transactions three times the original and individual algorithm than increasing the number of tuples by
number of attributes three times the original. same factor. When the number of transactions are made three
times and the number of tuples are also increased three times,
there is sharp increase in time taken by all algorithms. At this
stage FP Growth performs the best and Apriori performs the
worst as shown in figure 4. Figure 5 shows the scaling of FP
Growth algorithms under different conditions. Figure 6 and
figure 7 shows scaling of éclat and apriori under different
conditions. From the comparison of figure 5, figure 6 and figure
7 it is clear that FP Growth performs well under all kinds of
variations and therefore is the best among three algorithms.
Apriori performs the worst among three algorithms and thus
shows least scalability, Where as éclat lies in the middle of FP
Growth and Apriori.
Figure 5. Scaling of the FP Growth with respect
to the number of varying attributes and
transactions. 5. CONCLUSION
Frequent pattern mining is the most important step in association
rules which finally helps us in many applications like market
basket analysis, clustering, series analysis, games, decision
making, object mining, website navigation etc. In this paper the
researcher surveyed the pattern mining algorithms namely
apriori, Eclat and FP Growth. It is found that apriori uses join
and prune method, Ecalt works on vertical datasets and FP
Growth constructs the conditional frequent pattern tree which
satisfy the minimum support.
The major weakness of Apriori algorithm is producing large
number of candidate itemsets and large number of database
scans which is equal to maximum length of frequent itemset [5].
Figure 6. Scaling of Eclat with number of It is very much expensive to scan large database[11]. A true
varying attributes and transactions. reason of apriori failure is it lacks efficient processing method
on database [7]. FP Growth is the best among the three
algorithms and is thus most scalable. Eclat performs poorer than
FP Growth and the Apriori performs the worst. In the future
improvements must be taken care to enhance the performance of
Apriori and Eclat using a better layout to store the data.
6. REFERENCES
[1] Agarwal, R.C., Agarwal, C.C. and Prasad, V.V.V. (2001) A
tree projection algorithm for generation of frequent item
sets. Journal of Parallel and Distributed Computing, 61(3),
Pp. 350–371.
Figure 7. Scaling of the Apriori with number of [2] Bhadoria et. al. Analysis of Frequent Itemset Mining on
varying attributes and transactions.
Variant Datasets published in int.J.comp. Tech.appl.,
vol(2)5, ISSN:2229-6093, Pp. 1328-1333.
[3] http://en.wikipedia.org/wiki/Database_transaction [on 11th
nov 2012].
[4] C.Borgelt. “Efficient Implementations of Apriori and Eclat”.
In Proc. 1st IEEE ICDM Workshop on Frequent Item Set
Mining Implementations, CEUR Workshop Proceedings
90, Aachen, Germany 2003.
[5] Goswami D.N et. al. “An Algorithm for Frequent Pattern
Mining Based On Apriori ” (IJCSE) International Journal
on Computer Science and Engineering Vol. 02, No. 04,
4. RESULT AND DISCUSSION 2010, Pp. 942-947.
On standard dataset FP Growth performs the best and Apriori [6] Rahul Mishra et. al. “Comparative Analysis of Apriori
takes the maximum time. When the number of transactions are Algorithm and Frequent Pattern Algorithm for Frequent
made three times the original, there is more increment in time Pattern Mining in Web Log Data.” (IJCSIT) International
for both FP Growth and Eclat than Apriroi. When the number of Journal of Computer Science and Information
attributes are made three time the original, FP Growth performs Technologies, Vol. 3 (4) , 2012, Pp. 4662 – 4665.
the best and Apriori shows a sharp increase while Eclat lies in
31
International Journal of Computer Applications (0975 – 8887)
Volume 69– No.25, May 2013
32
IJCATM : www.ijcaonline.org
View publication stats