
FP-Growth Algorithm and FP-Tree Construction

November 28, 2024

1 Introduction
The FP-Growth algorithm is an efficient method for frequent pattern mining in
large datasets, which avoids the candidate generation step used in algorithms
like Apriori. It uses a compact data structure called the FP-tree (Frequent
Pattern tree) to represent the database in a compressed form. This allows for
efficient mining of frequent itemsets.

2 Step 1: Calculate Item Frequencies


Before building the FP-tree, we need to calculate the frequency (support) of
each item in the dataset. Support refers to the number of transactions that
contain the item.

2.1 Example Dataset


Consider the following transactions:

Transaction ID Items Purchased


T1 A, B, D, E
T2 B, C, D, E
T3 A, B, D, E, F
T4 B, C, D, E
T5 A, B, D, E, F

2.2 Item Frequency Calculation


The frequency (support) of each item is calculated as follows:

• A: Appears in T1, T3, T5 (3 times)


• B: Appears in T1, T2, T3, T4, T5 (5 times)
• C: Appears in T2, T4 (2 times)
• D: Appears in all transactions (5 times)

• E: Appears in all transactions (5 times)
• F: Appears in T3, T5 (2 times)
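The frequency counts above can be reproduced with a short sketch (a minimal example; the transaction lists match the table in Section 2.1):

```python
from collections import Counter

# Transactions T1-T5 from the example dataset
transactions = [
    ["A", "B", "D", "E"],
    ["B", "C", "D", "E"],
    ["A", "B", "D", "E", "F"],
    ["B", "C", "D", "E"],
    ["A", "B", "D", "E", "F"],
]

# Support of each item: the number of transactions containing it
support = Counter(item for t in transactions for item in t)
print(support)  # Counter({'B': 5, 'D': 5, 'E': 5, 'A': 3, 'C': 2, 'F': 2})
```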

3 Step 2: Build the FP-Tree


Once the item frequencies are calculated, items are sorted in descending order of
their frequencies. Items whose support falls below a chosen minimum support
threshold are removed from the dataset before constructing the tree; in this
example all six items are kept (assuming a minimum support of 2).

3.1 Sort Items by Frequency


The items are sorted in descending order of frequency:

B : 5, D : 5, E : 5, A : 3, C : 2, F : 2

3.2 Insert Transactions into the FP-Tree


Now, we insert each transaction into the FP-tree, sorting the items according
to the frequency list and building the tree. We either increment the count of an
existing node or create a new node.

3.2.1 Inserting Transactions


• T1: A, B, D, E → Sorted: B, D, E, A
• T2: B, C, D, E → Sorted: B, D, E, C

• T3: A, B, D, E, F → Sorted: B, D, E, A, F
• T4: B, C, D, E → Sorted: B, D, E, C
• T5: A, B, D, E, F → Sorted: B, D, E, A, F
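The reordering step can be sketched as follows (the rank table is an assumption derived from the frequency list in Section 3.1, with ties kept in the order shown there):

```python
# Global frequency order from Step 1: B, D, E, A, C, F
rank = {"B": 0, "D": 1, "E": 2, "A": 3, "C": 4, "F": 5}

def sort_transaction(items):
    """Reorder a transaction by descending global item frequency."""
    return sorted(items, key=lambda item: rank[item])

print(sort_transaction(["A", "B", "D", "E", "F"]))  # ['B', 'D', 'E', 'A', 'F']
```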

3.3 FP-Tree Structure


After inserting all transactions, the FP-tree looks like this:

[ROOT]
  |
(B:5)
  |
(D:5)
  |
(E:5)
  |
+-----+-----+
|           |
(A:3)     (C:2)
  |
(F:2)

Here:
• ROOT is the root node.
• Every sorted transaction begins with the prefix B, D, E, so the nodes B, D and E each have a count of 5.
• Below E the tree branches: A (count 3) covers T1, T3 and T5, while C (count 2) covers T2 and T4.
• F (count 2) hangs below A, since F appears only in T3 and T5.
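The construction can be sketched with a minimal node class (an illustrative implementation, not the only possible one; the header table built alongside is used in Step 3):

```python
class FPNode:
    def __init__(self, item, parent=None):
        self.item = item        # item label, None for the root
        self.count = 0          # number of transactions passing through this node
        self.parent = parent
        self.children = {}      # item -> FPNode

def build_fp_tree(sorted_transactions):
    """Insert each frequency-sorted transaction, incrementing counts
    on shared prefixes and branching where transactions diverge."""
    root = FPNode(None)
    header = {}                 # item -> list of nodes holding that item
    for transaction in sorted_transactions:
        node = root
        for item in transaction:
            if item not in node.children:
                child = FPNode(item, parent=node)
                node.children[item] = child
                header.setdefault(item, []).append(child)
            node = node.children[item]
            node.count += 1
    return root, header

sorted_txns = [
    ["B", "D", "E", "A"],
    ["B", "D", "E", "C"],
    ["B", "D", "E", "A", "F"],
    ["B", "D", "E", "C"],
    ["B", "D", "E", "A", "F"],
]
root, header = build_fp_tree(sorted_txns)
print(root.children["B"].count)                  # 5
print(root.children["B"].children["D"].count)    # 5
```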

4 Step 3: Header Table


The header table is an essential part of the FP-tree. It stores the items and
their frequencies and provides links to the nodes in the tree. The header table
facilitates efficient mining by allowing easy traversal of the FP-tree.

4.1 Header Table Example


For the FP-tree above, the header table looks like this:

Item Frequency Linked Nodes

B 5 (B : 5)
D 5 (D : 5)
E 5 (E : 5)
A 3 (A : 3)
C 2 (C : 2)
F 2 (F : 2)

Because every transaction shares the prefix B, D, E, each item occurs at
exactly one node in this tree, so each header entry links to a single node.
In general, when an item appears at several nodes, the header entry chains
those nodes together with node links.

5 Step 4: Mining the FP-Tree


The mining process involves recursively extracting frequent itemsets from the
FP-tree by examining the header table and conditional pattern bases.

5.1 Conditional Pattern Base
Mining proceeds bottom-up, starting from the least frequent item in the
header table. The conditional pattern base of an item is the set of prefix
paths in the FP-tree that lead to that item's nodes, each carrying the count
of the corresponding node.

Take item F as an example. F occurs in the frequency-sorted transactions:

T3 : B, D, E, A, F
T5 : B, D, E, A, F

In both cases the path from the root to F is B, D, E, A, so the conditional
pattern base for F is:

{B, D, E, A} : 2

(Item B, sitting directly below the root, has an empty conditional pattern
base.) We then build a conditional FP-tree from this pattern base and
recursively mine it for frequent itemsets ending in F, such as {B, F} and
{B, D, E, A, F}, each with support 2.
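Because each sorted transaction corresponds to a path in the FP-tree, the conditional pattern base can also be read off directly from the sorted transactions: it is the prefix that precedes the item in each transaction containing it. A sketch, using F as the example item:

```python
def conditional_pattern_base(item, sorted_transactions):
    """For each frequency-sorted transaction containing the item,
    collect the prefix that precedes it (equivalent to tracing the
    item's FP-tree node back toward the root)."""
    base = []
    for t in sorted_transactions:
        if item in t:
            prefix = t[:t.index(item)]
            if prefix:              # an empty prefix contributes nothing
                base.append(prefix)
    return base

sorted_txns = [
    ["B", "D", "E", "A"],
    ["B", "D", "E", "C"],
    ["B", "D", "E", "A", "F"],
    ["B", "D", "E", "C"],
    ["B", "D", "E", "A", "F"],
]
print(conditional_pattern_base("F", sorted_txns))
# [['B', 'D', 'E', 'A'], ['B', 'D', 'E', 'A']]
```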

6 Step 5: Recursive Mining and Frequent Itemsets
The mining process continues recursively for each item in the header table, and
we extract frequent itemsets. The frequent itemsets for the given dataset could
include:

{B} : Support = 5
{D} : Support = 5
{E} : Support = 5
{A} : Support = 3
{B, D} : Support = 5
{B, E} : Support = 5
{B, D, E} : Support = 5
{A, B} : Support = 3
{A, D} : Support = 3
{A, B, D} : Support = 3
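These supports can be verified against the original transactions by brute force (a check for this small example only; FP-Growth exists precisely to avoid such exhaustive counting on large datasets):

```python
# Transactions T1-T5 as sets, for fast membership tests
transactions = [
    {"A", "B", "D", "E"},
    {"B", "C", "D", "E"},
    {"A", "B", "D", "E", "F"},
    {"B", "C", "D", "E"},
    {"A", "B", "D", "E", "F"},
]

def itemset_support(itemset):
    """Number of transactions containing every item in the set."""
    return sum(1 for t in transactions if set(itemset) <= t)

print(itemset_support({"B", "D", "E"}))  # 5
print(itemset_support({"A", "B", "D"}))  # 3
```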

7 Advantages of FP-Growth
• No Candidate Generation: Unlike Apriori, FP-Growth does not generate candidate itemsets, which reduces computation time.
• Efficient Memory Use: The FP-tree is a compressed representation of the dataset, saving memory.
• Scalability: FP-Growth scales well with large datasets due to its efficient use of memory and reduced I/O operations.

8 Conclusion
The FP-Growth algorithm is a powerful tool for frequent pattern mining. By using the FP-tree and header table, FP-Growth efficiently mines frequent itemsets without the need for candidate generation, making it faster and more scalable than other algorithms like Apriori.
