0% found this document useful (0 votes)

72 views

Lab 04 Extra Weka Experimenter

According to Department for Transport, road traffic accidents are responsible for more than 3000 deaths per year in the UK. Although progress is being made in a number of areas, the number of vehicles involved in accidents have not been falling in line over the year. This study focus on identifying the factor contributing the cause of UK traffic accidents severity. Dataset of UK traffic accidents from kaggle form 2005 to 2007, 2009 to 2011, and 2012 to 2014 that have 1.6 million instances with 34 attributes was chosen for the analysis. A nominal multinomial logistic regression model was built. This particular model type of regression analysis was used due to the mixed nature of data. Multinomial regression was used to compare accident severity of fatal injury, injury, and Property Damage Only. The influential factors include Light Conditions, Day of Week, Road Type, Road Class, Road surface condition, and weather conditions that affect accident severity. The analysis show different factors having a statistically significant impact on the accident severity.

Uploaded by

MuhdHusaini

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

72 views

Lab 04 Extra Weka Experimenter

Uploaded by

MuhdHusaini

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

CDS503: Machine Learning

Lab 4 Extra: Weka Experimenter

Overview
We have been running one experiment at a time using Weka Explorer. Weka Experimenter
enables us to create, run, modify, and analyse experiments in a more convenient manner than is
possible when processing the schemes individually. For example, we can create an experiment
that runs several schemes against a series of datasets and then analyse the results to determine
if one of the schemes is (statistically) better than the other schemes.
Let’s design a small experiment to evaluate a suite of standard classification algorithms on the
problem.
1. Close the Weka Explorer.
2. Click the “Experimenter” button on the Weka GUI Chooser to launch the Weka Experiment
Environment.

3. Click “New” to start a new experiment.

4. In the “Experiment Type” pane change the “Number of folds” from “10” to “5”.

1 | CDS503: Lab 04 Extra (JLSY)

Semester 2, 2017/2018
5. In the “Datasets” pane click “Add new…” and select iris.arff data set.
6. In the “Algorithms” pane click “Add new…” and add the following 6 classification algorithms
(Click the “Choose” button on the weka.gui.GenericObjectEditor dialog box to select the machine
learning algorithms):

• rules.ZeroR (simply predicts the majority class – usually called the majority baseline)
• bayes.NaiveBayes
• bayes.BayesNet
• functions.SMO
• lazy.IBk
• trees.J48
7. Select IBK in the list of algorithms and click the “Edit selected…” button.
8. Change “KNN” from “1” to “3” and click the “OK” button to save the settings.

2 | CDS503: Lab 04 Extra (JLSY)

Semester 2, 2017/2018
9. Click on “Run” at the top of the window to open the Run tab and click the “Start” button to run
the experiment. The experiment should complete in just a few seconds.

10. Click on “Analyse” to open the Analyse tab. Click the “Experiment” button to load the results
from the experiment.

3 | CDS503: Lab 04 Extra (JLSY)

Semester 2, 2017/2018
11. Click the “Perform test” button to perform a pairwise test comparing all of the results to the
results for ZeroR.

Test configuration information

Statistical test (T-test) results

Key to test sets

What does the test output mean?

Test configuration information

• Tester: Information about what statistical tests are used to compare the machine learning
results
• Analyzing: The field (selected performance metric) we run statistical tests on
• Datasets: Number of datasets (we have only one data set iris.arff)
• Resultsets: Number of result sets (we set up 6 experiments, so we have 6 result sets)
• Confidence: Confidence level of the statistical T-test
Statistical test results
The matrix shows average percent_correct for each experiment or result set. For example, result
set (1) shows average percentage of correctly classified instances (accuracy) as 33.33% (result
set 1 is used as the baseline for comparison with every other result set). Note that the “v” beside
the result sets (2) to (6) means that the accuracy of result sets (2), (3), (4), (5) and (6) are
significantly better than baseline result (1). Symbol “v” means significantly better, and symbol “*”
means significantly worse. If we do not see “v” or “*” beside its percent_correct, it means that the
result set is not significantly better or worse than result set (1).
The parenthesis below the percent_correct numbers is another way to represent statistical
significance information. The (x, y, z) style means: x = whether this result set is significantly better

4 | CDS503: Lab 04 Extra (JLSY)

Semester 2, 2017/2018
than baseline; y = whether it is inconclusive to get significance conclusion; z = whether it is
significantly worse than baseline. For example, (1/0/0) for result set (2) means (significantly better
than baseline, not inconclusive, not significantly worse than baseline) compared to baseline result
set (1).
The (50) in front of the percent_correct row is the number instances in the test set.
Key information
It records numbering of the 6 result sets, by numbering each experiment (1), (2), (3), (4), (5) and
(6).
The Experimenter analysis can make comparison across more than one experiment based on a
single performance metric whereas Explorer will show more detailed performance metrics
(including precision and recall by class and the confusion matrix) for only a single experiment.
Results
The results suggest SVM (SMO) achieved the highest accuracy.
12. Since SMO achieved the best performance, we can also compare SMO to every other
experiment to see if its performance is significantly better than the rest. Click “Select” for the “Test
base”, select “functions.SMO” and click the “Select” button to choose the new test base. Click the
“Perform test” button again to perform the new analysis.

Although the results for SMO look better, the analysis suggests that the difference between these
results and the results from all of the other algorithms (except ZeroR) are not statistically
significant.

5 | CDS503: Lab 04 Extra (JLSY)

Semester 2, 2017/2018

History of Job Satisfaction
88% (16)
History of Job Satisfaction
2 pages
Hajj Guide
No ratings yet
Hajj Guide
22 pages
Spirent Testcenter Results Reporter
No ratings yet
Spirent Testcenter Results Reporter
53 pages
Free Fall Acceleration and Error Analysis
100% (1)
Free Fall Acceleration and Error Analysis
3 pages
Lab 03
No ratings yet
Lab 03
10 pages
Defeating The Mammon Spirit
No ratings yet
Defeating The Mammon Spirit
35 pages
Ausubel: Meaningful Reception Learning
No ratings yet
Ausubel: Meaningful Reception Learning
18 pages
Ancient Greek Ideas On Elements and Atom
93% (59)
Ancient Greek Ideas On Elements and Atom
23 pages
Chapter 1 - Software Testing (Lecture 1 & 2)
No ratings yet
Chapter 1 - Software Testing (Lecture 1 & 2)
68 pages
Pairwise Testing: A Best Practice That Isn't
No ratings yet
Pairwise Testing: A Best Practice That Isn't
17 pages
A Model For Spectra-Based Software Diagnosis
No ratings yet
A Model For Spectra-Based Software Diagnosis
37 pages
Chapter 7 New
No ratings yet
Chapter 7 New
39 pages
workshop notes
No ratings yet
workshop notes
16 pages
Defect Reduction Through Test Case Design at Black Box (SRS)
No ratings yet
Defect Reduction Through Test Case Design at Black Box (SRS)
49 pages
Data Warehousing and Data Mining Lab
No ratings yet
Data Warehousing and Data Mining Lab
53 pages
TestBench PC
No ratings yet
TestBench PC
19 pages
Swansea University: Computer Science Department CS 339 Supervisors: Dr. M. Roggenbach, Prof. Holger Schlingloff
No ratings yet
Swansea University: Computer Science Department CS 339 Supervisors: Dr. M. Roggenbach, Prof. Holger Schlingloff
15 pages
6.034 Design Assignment 2: 1 Data Sets
No ratings yet
6.034 Design Assignment 2: 1 Data Sets
6 pages
SoftwareQualityAssurance-1
No ratings yet
SoftwareQualityAssurance-1
41 pages
What Is A Test Case
100% (1)
What Is A Test Case
9 pages
3-4 Types of Test Data 2
No ratings yet
3-4 Types of Test Data 2
25 pages
Module01 LimitsAndObjectivesOfTesting
No ratings yet
Module01 LimitsAndObjectivesOfTesting
37 pages
52) Statistical Analysis
No ratings yet
52) Statistical Analysis
11 pages
MLRD 5
No ratings yet
MLRD 5
20 pages
DOC-2024085.
No ratings yet
DOC-2024085.
7 pages
Dependent T Test
No ratings yet
Dependent T Test
38 pages
5. Design_Studies
No ratings yet
5. Design_Studies
21 pages
GUIDELINES FOR MACHIE LEARNING EXPERIMENTS - PDF (Lakshan)
No ratings yet
GUIDELINES FOR MACHIE LEARNING EXPERIMENTS - PDF (Lakshan)
11 pages
CS440: HW3
No ratings yet
CS440: HW3
7 pages
Test Levels Exercise 2 - 1 Match Statements With Test Levels
No ratings yet
Test Levels Exercise 2 - 1 Match Statements With Test Levels
14 pages
Research Lab Ass. Sem2
No ratings yet
Research Lab Ass. Sem2
27 pages
Chapter 7 - Software Testing-1
No ratings yet
Chapter 7 - Software Testing-1
29 pages
JAMOVI 2017 Statistics For Psychologists Section - JAMOVI Chapter - Using The Software - 1
No ratings yet
JAMOVI 2017 Statistics For Psychologists Section - JAMOVI Chapter - Using The Software - 1
25 pages
Software Testing
No ratings yet
Software Testing
2 pages
Software Testing Techniques
No ratings yet
Software Testing Techniques
36 pages
Following Is The Strategy We Used in One of My Projects:: Define Brain Stromming and Cause Effect Graphing? With Eg?
No ratings yet
Following Is The Strategy We Used in One of My Projects:: Define Brain Stromming and Cause Effect Graphing? With Eg?
10 pages
Assign1 s2 2024
No ratings yet
Assign1 s2 2024
5 pages
The Testing Stage: Questions
No ratings yet
The Testing Stage: Questions
4 pages
Exp 9 and 10 SE LAb
No ratings yet
Exp 9 and 10 SE LAb
5 pages
DMLB 1
No ratings yet
DMLB 1
3 pages
5 Improve
No ratings yet
5 Improve
51 pages
Dynamic White-Box Testing: Highlights of This Chapter Include
No ratings yet
Dynamic White-Box Testing: Highlights of This Chapter Include
27 pages
Design of Experiments (DOE) Tutorial
No ratings yet
Design of Experiments (DOE) Tutorial
11 pages
Testing Machine Learning Systems - Code, Data and Models - Made With ML
No ratings yet
Testing Machine Learning Systems - Code, Data and Models - Made With ML
33 pages
DWDM Lab 2
No ratings yet
DWDM Lab 2
3 pages
Classifying Generated White-Box Tests: An Exploratory Study: D Avid Honfi Zolt An Micskei
No ratings yet
Classifying Generated White-Box Tests: An Exploratory Study: D Avid Honfi Zolt An Micskei
42 pages
Shivansh Exp 9 and 10 SE LAb
No ratings yet
Shivansh Exp 9 and 10 SE LAb
5 pages
Ept Lab 14
No ratings yet
Ept Lab 14
6 pages
Parametric Stat Excel MS2007 Prez
100% (1)
Parametric Stat Excel MS2007 Prez
146 pages
DWM - Exp No 5
No ratings yet
DWM - Exp No 5
7 pages
TEST CASES
No ratings yet
TEST CASES
9 pages
Exercise 6
No ratings yet
Exercise 6
2 pages
Beck Testing Framework
No ratings yet
Beck Testing Framework
13 pages
Statistical PERT Normal Edition Quick Start Guide For Version 5.0
No ratings yet
Statistical PERT Normal Edition Quick Start Guide For Version 5.0
21 pages
Model-Based Testing in Practice
No ratings yet
Model-Based Testing in Practice
10 pages
Sneha Research Methodology File
No ratings yet
Sneha Research Methodology File
52 pages
TESSY An Overall Unit Testing Tool
No ratings yet
TESSY An Overall Unit Testing Tool
35 pages
What Is Design of Experiments (DOE) ?
No ratings yet
What Is Design of Experiments (DOE) ?
8 pages
SPSS Anova
No ratings yet
SPSS Anova
20 pages
2003 Linkman An Evaluation of Systematic Functional Testing Using Mutation Testing
No ratings yet
2003 Linkman An Evaluation of Systematic Functional Testing Using Mutation Testing
15 pages
Excel: Statistics in Microsoft Excel
No ratings yet
Excel: Statistics in Microsoft Excel
39 pages
17 Software Testing - introduction 2024
No ratings yet
17 Software Testing - introduction 2024
94 pages
Fractorial Design For Computers
No ratings yet
Fractorial Design For Computers
23 pages
Factor Analysis Using SPSS: Example
No ratings yet
Factor Analysis Using SPSS: Example
16 pages
10 Minute Guide to Orthogonal Array Test Strategy
From Everand
10 Minute Guide to Orthogonal Array Test Strategy
Rajeev Nair Raman
No ratings yet
Laboratory Practice, Testing, and Reporting: Time-Honored Fundamentals for the Sciences
From Everand
Laboratory Practice, Testing, and Reporting: Time-Honored Fundamentals for the Sciences
Dwayne Phillips
No ratings yet
Formalizing Supervised Learning Model Selection
No ratings yet
Formalizing Supervised Learning Model Selection
1 page
Kernel Type: Libsvm-Sigmoid Libsvm - Polynomial C RBF Kernel
No ratings yet
Kernel Type: Libsvm-Sigmoid Libsvm - Polynomial C RBF Kernel
1 page
3.2.3 Target Market
No ratings yet
3.2.3 Target Market
2 pages
Ordered by Love An Introduction To John Duns Scotus (Thomas M. Ward) (Z-Library)
100% (2)
Ordered by Love An Introduction To John Duns Scotus (Thomas M. Ward) (Z-Library)
152 pages
1 All-In
No ratings yet
1 All-In
8 pages
Discourse Analysis: The Questions Discourse Analysts Ask and How They Answer Them
No ratings yet
Discourse Analysis: The Questions Discourse Analysts Ask and How They Answer Them
3 pages
QC Exercises 3
No ratings yet
QC Exercises 3
4 pages
Allport's Eight Stages of Self (Proprium) Development - The Mouse Trap
No ratings yet
Allport's Eight Stages of Self (Proprium) Development - The Mouse Trap
19 pages
Types of Speech Context
No ratings yet
Types of Speech Context
67 pages
Critical Analysis of The Catcher in The Rye
No ratings yet
Critical Analysis of The Catcher in The Rye
4 pages
Performance Appraisal in An NHS Hospital
0% (1)
Performance Appraisal in An NHS Hospital
15 pages
Design Analysis and Testing of Sand Muller For Fou
No ratings yet
Design Analysis and Testing of Sand Muller For Fou
6 pages
Under Down Under
100% (4)
Under Down Under
698 pages
Honesty and Personal Relationship
No ratings yet
Honesty and Personal Relationship
43 pages
Modernism
0% (1)
Modernism
3 pages
Plato The Midwife s Apprentice I. M. Crombie all chapter instant download
100% (6)
Plato The Midwife s Apprentice I. M. Crombie all chapter instant download
72 pages
09 Mo1517
No ratings yet
09 Mo1517
2 pages
Cum Funct Just Restaur
100% (1)
Cum Funct Just Restaur
476 pages
MG214 - Week 6 - Public Policy and Wicked Problems
No ratings yet
MG214 - Week 6 - Public Policy and Wicked Problems
40 pages
UCD Michael Smurfit School of Business, University College Dublin
No ratings yet
UCD Michael Smurfit School of Business, University College Dublin
53 pages
Impact of OB On Performance
100% (1)
Impact of OB On Performance
6 pages
UCSP Essay
No ratings yet
UCSP Essay
1 page
Permission Is Granted To Reprint. 1.15.15. Many Voices Were Involved in The Writing of This Letter. For Questions, Please Contact or
100% (3)
Permission Is Granted To Reprint. 1.15.15. Many Voices Were Involved in The Writing of This Letter. For Questions, Please Contact or
2 pages
Psychology 101 - Review
No ratings yet
Psychology 101 - Review
7 pages
Correlation Between Relative Pitch and Age
100% (1)
Correlation Between Relative Pitch and Age
4 pages
Mandaya: Etymology and Geographic Location
No ratings yet
Mandaya: Etymology and Geographic Location
2 pages
Gatelevel Modeling
No ratings yet
Gatelevel Modeling
13 pages
Training Evaluation Model
No ratings yet
Training Evaluation Model
397 pages

Lab 04 Extra Weka Experimenter

Uploaded by

Lab 04 Extra Weka Experimenter

Uploaded by

CDS503: Machine Learning

Lab 4 Extra: Weka Experimenter

3. Click “New” to start a new experiment.

1 | CDS503: Lab 04 Extra (JLSY)

2 | CDS503: Lab 04 Extra (JLSY)

3 | CDS503: Lab 04 Extra (JLSY)

Test configuration information

Statistical test (T-test) results

Key to test sets

What does the test output mean?

4 | CDS503: Lab 04 Extra (JLSY)

5 | CDS503: Lab 04 Extra (JLSY)

You might also like