0% found this document useful (0 votes)

11 views31 pages

Data Mining Practical (1)

The document outlines practical implementations of various machine learning algorithms using Weka, including Naïve Bayes, Decision Tree, Clustering, and Apriori. Each section provides step-by-step instructions for loading datasets, applying algorithms, configuring parameters, and interpreting results. The document serves as a comprehensive guide for users to effectively utilize Weka for data analysis and machine learning tasks.

Uploaded by

Sahil Sayyad

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views31 pages

Data Mining Practical (1)

Uploaded by

Sahil Sayyad

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 31

INDEX

Sr. No Aim Date Sign

1
Show the implementation of Naïve Bayes algorithm.

2
Show the implementation of Decision Tree.

3
Show the implementation of Clustering Algorithm.

4
Show the implementation of Apriori Algorithm

5
Show the implementation of Time Series Algorithm.
Practical No : 1
Aim: Show the implementation of Naïve Bayes algorithm.

Step 1:

Step 1: Install Weka

1. Download Weka from Weka’s official website.

2. Install the application by following the on-screen instructions.

Step 2: Load Your Dataset

1. Open Weka.
2. Click on the "Explorer" button to launch the Weka Explorer interface.
3. In the "Preprocess" tab, click on "Open file".
4. Select your dataset file (usually in .arff or .csv format) and click "Open".
Step 3: Apply the Naïve Bayes Algorithm

1. Go to the "Classify" tab.

2. Click on the "Choose" button to select a classifier.
3. Navigate to:

rust
Copy
bayes -> NaiveBayes

Select NaiveBayes.
Step 4: Configure Naïve Bayes (Optional)

 If you need to tweak settings, click on the NaïveBayes classifier (after selecting it),
and the options will appear.
 Adjust parameters if needed, although Naïve Bayes generally requires little tuning.
Step 5: Train and Evaluate the Model

1. Choose an evaluation method:

o Use training set (evaluates on the same data).
o Cross-validation (recommended, e.g., 10-fold).
o Percentage split (e.g., 70% training, 30% testing).
2. Click "Start" to run the algorithm.
Step 6: Interpret Results

 Once the process is complete, Weka will display the results:

o Classifier output: Shows the model’s performance, accuracy, precision,
recall, etc.
o Confusion matrix: Helps in understanding classification errors.
Step 7: Save the Model (Optional)

 If satisfied with the model, you can save it:

o Right-click on the classifier output and select "Save model".
o Choose the location and name the file (usually .model format).
Step 8: Visualize the Data

 If satisfied with the model, you can save it:

o Right-click on the classifier output and select "Save model".
 The data must be clean.
 It should not contain null values.
 Visualize option allows you to visualize your processed data for
analysis.
 This is because the raw data collected from the field may contain null
values, irrelevant columns and so on.
 The data that is collected from the field contains many unwanted things
that leads to wrong analysis. For example, the data may contain null
fields, it may contain columns that are irrelevant to the current analysis,
and so on.
Practical No.- 2

Aim: Show the implementation of Decision Tree.

Step 1: Load Your Dataset

1. Click "Explorer" on the main Weka interface.

2. Go to the "Preprocess" tab.
3. Click "Open file" and select your dataset (preferably in .arff or .csv format).
4. Click "Open" to load the dataset.
Step 3: Apply the Decision Tree Algorithm

1. Switch to the "Classify" tab.

2. Click "Choose" to open the list of classifiers.
3. Navigate to:

rust
Copy
trees -> J48

o J48 is Weka's implementation of the C4.5 algorithm, commonly used for

Decision Trees.
Step 4: Configure J48 (Optional)

 Click on J48 after selecting it to open the configuration window.

 Here, you can adjust parameters:
o -C (confidence factor): Controls pruning (default is 0.25).
o -M (minimum number of instances per leaf): Defines the minimum data
required to create a leaf.
 Example settings:
o Confidence factor: 0.1 (more aggressive pruning).
o Minimum instances per leaf: 5 (fewer rules, more generalization).
Step 5: Train and Evaluate the Model

1. Choose an evaluation method:

o Use training set (quick but may overfit).
o Cross-validation (e.g., 10-fold) – recommended for better generalization.
o Percentage split (e.g., 70% train, 30% test).
2. Click "Start" to run the classifier.

Step 6: Interpret the Results

After execution, Weka will display:

 Classifier output: Shows accuracy, precision, recall, etc.

 Confusion matrix: Displays true positives, false positives, etc.
 Decision tree structure: A readable tree showing how decisions are made (e.g., if-
else conditions).
Practical No- 3
Aim: Show the implementation of Clustering Algorithm.

Step 1: Load Your Dataset

1. Click "Explorer" on the main Weka interface.

2. Go to the "Preprocess" tab.
3. Click "Open file" and select your dataset (.arff or .csv file).
4. Click "Open" to load the dataset.

Note: For clustering, the dataset should not have a class attribute because clustering
algorithms are unsupervised.

*Ensure the time attribute (e.g., a date or index) is set correctly.

Step 2: Prepare Time Series Data

 In Weka, time series data is treated as sequential data.

 Ensure that:
o The data is sorted chronologically.
o The target variable (the one you want to forecast) is set as the class attribute
(if applicable).

• Under Fields to forecast, select the attribute you want to predict (e.g., "Year" or "Pop").
Step 3: Apply the Time Series Algorithm (ARIMA)

1. Go to the "Classify" tab (since ARIMA is a supervised model).

2. Click "Choose" to open the list of classifiers.
3. Navigate to:

rust
Copy
timeSeries -> ARIMA
3. Fine-tune the parameters of the learning algorithm if needed.

Step 4: Configure Time Series Data(Optional)

 Click on ARIMA to open the configuration window.

 Adjust the parameters:
o p (autoregressive order): Number of lag observations included in the model.
o d (differencing order): Number of times the raw observations are
differenced.
o q (moving average order): Size of the moving average window.
 Example settings:
o p: 1
o d: 1
o q: 1

1. Visualize Predictions:
o Weka provides a graph comparing actual values and predicted
values for better.
OUTPUT
interpretability.
Practical No- 4
Aim: Show The Implementation of Clustering Algorithm.

To implement a Clustering Algorithm in Weka, we'll use the Explorer GUI.

Weka provides several clustering algorithms like k-means, EM (Expectation-
Maximization), and DBSCAN. Here’s how to apply a basic clustering algorithm,
such as k-means,

Step 2: Load Your Dataset

1. Click "Explorer" on the main Weka interface.

2. Go to the "Preprocess" tab.
3. Click "Open file" and select your dataset (.arff or .csv file).
4. Click "Open" to load the dataset.

Note: For clustering, the dataset should not have a class attribute because clustering
algorithms are unsupervised.

1) K-MEANS:
Step 3: Apply the Clustering Algorithm

1. Go to the "Cluster" tab (next to the "Classify" tab).

2. Click "Choose" to open the list of clustering algorithms.
3. Select SimpleKMeans (for k-means clustering):

rust
Copy
cluster -> SimpleKMeans
Step 4: Configure the Clustering Algorithm (Optional)

 Click on SimpleKMeans to open the configuration window.

 Adjust the parameters:
o -N (number of clusters): Specify the number of clusters (e.g., 3).
o -I (max iterations): Set the maximum number of iterations (default is 500).
o -t (random seed): For reproducibility.
 Example settings:
o Number of clusters: 3
o Max iterations: 100
o Seed: 10
Step 5: Run the Clustering Algorithm

1. Click "Start" to execute the algorithm

2. Weka will cluster the data based on the specified parameters.
B) Hierarchical Clustering

Hierarchical clustering is an unsupervised learning algorithm that is used to

group together the unlabeled data points having similar characteristics.

 Step 1 − Treat each data point as single cluster. Hence, we will be having say
K clusters at start. The number of data points will also be K at start.
 Step 2 − Now, in this step we need to form a big cluster by joining two closet
datapoints. This will result in total of K-1 clusters.
 Step 3 − Now, to form more clusters we need to join two closet clusters. This
will result in total of K-2 clusters.
Practical NO- 5

Aim: Show the implementation of Apriori Algorithm

The Apriori algorithm is commonly used for mining frequent itemsets and
association rule learning. Weka provides an easy-to-use interface to apply the
Apriori algorithm. Here’s a step-by-step guide on how to implement the Apriori
algorithm in Weka:

Step 1: Prepare Your Dataset

 Format: Your data should be in the ARFF format or CSV. The dataset must be
transactional, where each transaction contains a list of items.

Step 2: Load Dataset in Weka

1. Open Weka GUI Chooser.

2. Click on "Explorer".
3. Load your dataset by clicking "Open file" and selecting your ARFF or CSV file.
Step 3: Apply the Apriori Algorithm

1. In the Weka Explorer, go to the "Associate" tab.

2. In the "Associator" section, choose "Apriori" from the drop-down menu.
3. Configure the parameters:
o Support: Minimum support threshold (e.g., 0.5 for 50%).
o Confidence: Minimum confidence level (e.g., 0.8 for 80%).
o Search Method: You can choose from "Best First", "A* Search", etc.
Step 4: Run the Algorithm

 Click "Start" to run Apriori.

 Weka will process the data and display the frequent itemsets and association rules in
the output panel.

HY-TTC 32: Quick Start Guide For CODESYS
100% (1)
HY-TTC 32: Quick Start Guide For CODESYS
29 pages
How To Create Manual: Learn The Secrets of Creating Subliminal Audios
100% (2)
How To Create Manual: Learn The Secrets of Creating Subliminal Audios
50 pages
edfd5afa-6f5f-4484-a938-42da781139ad
No ratings yet
edfd5afa-6f5f-4484-a938-42da781139ad
21 pages
DWDM Lab 2
No ratings yet
DWDM Lab 2
3 pages
DMLB 1
No ratings yet
DMLB 1
3 pages
Wekappt
No ratings yet
Wekappt
58 pages
DMW lab Print
No ratings yet
DMW lab Print
21 pages
Dataware Practical 5
No ratings yet
Dataware Practical 5
4 pages
Weka Tool
No ratings yet
Weka Tool
12 pages
Data Mining Lab File
No ratings yet
Data Mining Lab File
20 pages
Dataminingg
No ratings yet
Dataminingg
22 pages
Lab04
No ratings yet
Lab04
7 pages
Data Mining - Lab - Manual
No ratings yet
Data Mining - Lab - Manual
20 pages
OS journal
No ratings yet
OS journal
28 pages
DWM1
No ratings yet
DWM1
19 pages
DWM1 Riya
No ratings yet
DWM1 Riya
16 pages
DW Lab
No ratings yet
DW Lab
85 pages
Priyadarshini J. L. College of Engineering, Nagpur: Session 2022-23 Semester-V
No ratings yet
Priyadarshini J. L. College of Engineering, Nagpur: Session 2022-23 Semester-V
31 pages
Exp 6
No ratings yet
Exp 6
9 pages
Weka (20030421-Version1 by Kdelab)
No ratings yet
Weka (20030421-Version1 by Kdelab)
51 pages
Weka Overview Slides
No ratings yet
Weka Overview Slides
31 pages
WEKA Practical Protocol
No ratings yet
WEKA Practical Protocol
40 pages
Data Mining Term Project Machine Learning With WEKA: Weka Explorer Tutorial For Version 3.4.3
No ratings yet
Data Mining Term Project Machine Learning With WEKA: Weka Explorer Tutorial For Version 3.4.3
42 pages
Data Mining Lab Questions
100% (1)
Data Mining Lab Questions
47 pages
Weka Tutorial
No ratings yet
Weka Tutorial
2 pages
Unidad I Tarea 3 Minería de Datos. Trabajar Con Weka Usando Archivo Weather Nominal
No ratings yet
Unidad I Tarea 3 Minería de Datos. Trabajar Con Weka Usando Archivo Weather Nominal
13 pages
Dwh Manual Merged
No ratings yet
Dwh Manual Merged
47 pages
Wa0002.
No ratings yet
Wa0002.
21 pages
DATA WAREHOUSING -TO WRITE
No ratings yet
DATA WAREHOUSING -TO WRITE
23 pages
Data Mining Unit 5
No ratings yet
Data Mining Unit 5
12 pages
DWDM LAB MANUAL
No ratings yet
DWDM LAB MANUAL
55 pages
DWDM Lab File
No ratings yet
DWDM Lab File
29 pages
DWM_Exp8_10080
No ratings yet
DWM_Exp8_10080
9 pages
DataMiningManual_Sawan
No ratings yet
DataMiningManual_Sawan
30 pages
Data Mining (WEKA) en Formatted
No ratings yet
Data Mining (WEKA) en Formatted
52 pages
3
No ratings yet
3
6 pages
Final Weka Lab Tutorial
No ratings yet
Final Weka Lab Tutorial
142 pages
dwdm_file-final_ver3.pdf_20241230_172003_0000
No ratings yet
dwdm_file-final_ver3.pdf_20241230_172003_0000
54 pages
CS-703 (B) Data Warehousing and Data Mining Lab
No ratings yet
CS-703 (B) Data Warehousing and Data Mining Lab
50 pages
Weka-: Data Warehousing and Data Mining Lab Manual-Week 9
100% (1)
Weka-: Data Warehousing and Data Mining Lab Manual-Week 9
8 pages
Unit-7 Tools of AI (April 9, 2024)
No ratings yet
Unit-7 Tools of AI (April 9, 2024)
88 pages
DA_LabFile
No ratings yet
DA_LabFile
63 pages
Weka Activity Report
No ratings yet
Weka Activity Report
30 pages
NOTES
No ratings yet
NOTES
45 pages
DMBI Exp1: Introduction To WEKA Tool
No ratings yet
DMBI Exp1: Introduction To WEKA Tool
6 pages
Lecture 7 - Weka
No ratings yet
Lecture 7 - Weka
69 pages
Part I - Installing Weka: HW Assignment 1
No ratings yet
Part I - Installing Weka: HW Assignment 1
3 pages
More Data Mining With Weka: Ian H. Witten
No ratings yet
More Data Mining With Weka: Ian H. Witten
61 pages
Weka Software Manuala
No ratings yet
Weka Software Manuala
20 pages
Exp 6
No ratings yet
Exp 6
12 pages
Weka - Launching Explorer3
No ratings yet
Weka - Launching Explorer3
3 pages
Data Warehousing Lab Exp 1-3
No ratings yet
Data Warehousing Lab Exp 1-3
24 pages
Datawarehouse Pract 2
No ratings yet
Datawarehouse Pract 2
7 pages
DMW_LabFile_0901CS243D11_swastik
No ratings yet
DMW_LabFile_0901CS243D11_swastik
25 pages
Data Base Management Key Points
No ratings yet
Data Base Management Key Points
8 pages
AI32 Guide To Weka PDF
No ratings yet
AI32 Guide To Weka PDF
6 pages
Machine Learning with Python: A Comprehensive Guide with a Practical Example
From Everand
Machine Learning with Python: A Comprehensive Guide with a Practical Example
MARTIN NEEL
No ratings yet
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
Scala Data Analysis Cookbook (new): Navigate the world of data analysis, visualization, and machine learning with over 100 hands-on Scala recipes
From Everand
Scala Data Analysis Cookbook (new): Navigate the world of data analysis, visualization, and machine learning with over 100 hands-on Scala recipes
Arun Manivannan
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
IGNOU PGDCA MCS 206 Object Oriented Programming using Java Previous Years solved Papers
From Everand
IGNOU PGDCA MCS 206 Object Oriented Programming using Java Previous Years solved Papers
Manish Soni
No ratings yet
HPC 2025 (1)
No ratings yet
HPC 2025 (1)
16 pages
NGD
No ratings yet
NGD
9 pages
Big_Data_Unit1_Long_Answers
No ratings yet
Big_Data_Unit1_Long_Answers
7 pages
BDA ANSWERS (1)
No ratings yet
BDA ANSWERS (1)
18 pages
Big_Data_Data_Science_QA_Detailed
No ratings yet
Big_Data_Data_Science_QA_Detailed
2 pages
MongoDB_Detailed_Answers
No ratings yet
MongoDB_Detailed_Answers
3 pages
DA Resume
No ratings yet
DA Resume
2 pages
721482177-DATA-ANALYST-INTERNSHIP-CERTIFICATE 2025 (1) (1) (1) (1)
No ratings yet
721482177-DATA-ANALYST-INTERNSHIP-CERTIFICATE 2025 (1) (1) (1) (1)
1 page
ngd unit 1-4
No ratings yet
ngd unit 1-4
43 pages
Ch7-Image Segmentation (E-next.in)
No ratings yet
Ch7-Image Segmentation (E-next.in)
27 pages
NGD Practical Edited 1
No ratings yet
NGD Practical Edited 1
36 pages
DFS With Example
No ratings yet
DFS With Example
8 pages
CD Unit-V questions_MSWEC
No ratings yet
CD Unit-V questions_MSWEC
2 pages
HEF4060B: 1. General Description
No ratings yet
HEF4060B: 1. General Description
14 pages
Multicycle Approach Part 2
No ratings yet
Multicycle Approach Part 2
25 pages
Practical No 15
No ratings yet
Practical No 15
3 pages
Java Programming17
No ratings yet
Java Programming17
7 pages
Compaq Presario CQ50-106AU Drivers For Windows XP
No ratings yet
Compaq Presario CQ50-106AU Drivers For Windows XP
3 pages
CarWash MGT WBS
No ratings yet
CarWash MGT WBS
14 pages
Scheduling Question & Solutions
No ratings yet
Scheduling Question & Solutions
2 pages
Modeling and Analyzing Software Defect Prevention Using ODC: A Preliminary Dissertation On
No ratings yet
Modeling and Analyzing Software Defect Prevention Using ODC: A Preliminary Dissertation On
15 pages
11 Incredible Excel Conditional Formatting Tricks
No ratings yet
11 Incredible Excel Conditional Formatting Tricks
44 pages
BMAX Manual English
No ratings yet
BMAX Manual English
16 pages
Preleaf by Masai Data Analytics Curriculum
No ratings yet
Preleaf by Masai Data Analytics Curriculum
6 pages
NIACL Information Handout in English For Phase I Administrative Officers
No ratings yet
NIACL Information Handout in English For Phase I Administrative Officers
6 pages
CSE101 L12-13 Recurrence
No ratings yet
CSE101 L12-13 Recurrence
61 pages
Multisens Picture Error in Rlink 2: Ddt4All
No ratings yet
Multisens Picture Error in Rlink 2: Ddt4All
16 pages
Compact Expanded Synthetic Division - Wikipedia
No ratings yet
Compact Expanded Synthetic Division - Wikipedia
2 pages
Introduction To HedEx Lite V200R001 V6.1
No ratings yet
Introduction To HedEx Lite V200R001 V6.1
33 pages
Online Crime Detection and Reporting Using Fuzzy Logic Techniques
No ratings yet
Online Crime Detection and Reporting Using Fuzzy Logic Techniques
3 pages
Bidirectional_associative_memory
No ratings yet
Bidirectional_associative_memory
3 pages
Prime Numbers PDF
No ratings yet
Prime Numbers PDF
16 pages
Invisalign Glossary B10024-00 Rev D
No ratings yet
Invisalign Glossary B10024-00 Rev D
210 pages
Softland India Limited Dealership Policy and Product Portfolio For Hilipines
No ratings yet
Softland India Limited Dealership Policy and Product Portfolio For Hilipines
14 pages
Blackmagic2110IPConvertersManual
No ratings yet
Blackmagic2110IPConvertersManual
73 pages
"Referring To Local Data Type
No ratings yet
"Referring To Local Data Type
21 pages
AME2 2 Four Language V1.2 20230207 Compressed
No ratings yet
AME2 2 Four Language V1.2 20230207 Compressed
1 page
Creo Tips 1 3
No ratings yet
Creo Tips 1 3
4 pages
HP Prodesk 400 G4 Desktop Mini: Support and Service Considerations
No ratings yet
HP Prodesk 400 G4 Desktop Mini: Support and Service Considerations
20 pages

Data Mining Practical (1)

Uploaded by

Data Mining Practical (1)

Uploaded by

INDEX

Sr. No Aim Date Sign

Step 1: Install Weka

1. Download Weka from Weka’s official website.

Step 2: Load Your Dataset

1. Go to the "Classify" tab.

1. Choose an evaluation method:

 Once the process is complete, Weka will display the results:

 If satisfied with the model, you can save it:

 If satisfied with the model, you can save it:

Aim: Show the implementation of Decision Tree.

Step 1: Load Your Dataset

1. Click "Explorer" on the main Weka interface.

1. Switch to the "Classify" tab.

o J48 is Weka's implementation of the C4.5 algorithm, commonly used for

 Click on J48 after selecting it to open the configuration window.

1. Choose an evaluation method:

Step 6: Interpret the Results

After execution, Weka will display:

 Classifier output: Shows accuracy, precision, recall, etc.

Step 1: Load Your Dataset

1. Click "Explorer" on the main Weka interface.

*Ensure the time attribute (e.g., a date or index) is set correctly.

 In Weka, time series data is treated as sequential data.

1. Go to the "Classify" tab (since ARIMA is a supervised model).

Step 4: Configure Time Series Data(Optional)

 Click on ARIMA to open the configuration window.

To implement a Clustering Algorithm in Weka, we'll use the Explorer GUI.

Step 2: Load Your Dataset

1. Click "Explorer" on the main Weka interface.

1. Go to the "Cluster" tab (next to the "Classify" tab).

 Click on SimpleKMeans to open the configuration window.

1. Click "Start" to execute the algorithm

Hierarchical clustering is an unsupervised learning algorithm that is used to

Aim: Show the implementation of Apriori Algorithm

Step 1: Prepare Your Dataset

Step 2: Load Dataset in Weka

1. Open Weka GUI Chooser.

1. In the Weka Explorer, go to the "Associate" tab.

 Click "Start" to run Apriori.

You might also like