Lab 04 Extra Weka Experimenter
Lab 04 Extra Weka Experimenter
• rules.ZeroR (simply predicts the majority class – usually called the majority baseline)
• bayes.NaiveBayes
• bayes.BayesNet
• functions.SMO
• lazy.IBk
• trees.J48
7. Select IBK in the list of algorithms and click the “Edit selected…” button.
8. Change “KNN” from “1” to “3” and click the “OK” button to save the settings.
10. Click on “Analyse” to open the Analyse tab. Click the “Experiment” button to load the results
from the experiment.
• Tester: Information about what statistical tests are used to compare the machine learning
results
• Analyzing: The field (selected performance metric) we run statistical tests on
• Datasets: Number of datasets (we have only one data set iris.arff)
• Resultsets: Number of result sets (we set up 6 experiments, so we have 6 result sets)
• Confidence: Confidence level of the statistical T-test
Statistical test results
The matrix shows average percent_correct for each experiment or result set. For example, result
set (1) shows average percentage of correctly classified instances (accuracy) as 33.33% (result
set 1 is used as the baseline for comparison with every other result set). Note that the “v” beside
the result sets (2) to (6) means that the accuracy of result sets (2), (3), (4), (5) and (6) are
significantly better than baseline result (1). Symbol “v” means significantly better, and symbol “*”
means significantly worse. If we do not see “v” or “*” beside its percent_correct, it means that the
result set is not significantly better or worse than result set (1).
The parenthesis below the percent_correct numbers is another way to represent statistical
significance information. The (x, y, z) style means: x = whether this result set is significantly better
Although the results for SMO look better, the analysis suggests that the difference between these
results and the results from all of the other algorithms (except ZeroR) are not statistically
significant.