Unit 6-7: Issues
Underfitting:
Underfitting means the model has a low accuracy score on both the training data and the test data.
An underfit model fails to capture the relationship between the input values and the target variable.
Underfitting happens when the algorithm used to build a prediction model is too simple to learn the patterns present in the training data. In that case, accuracy will be low on the seen training data as well as on unseen test data. It typically happens with simple linear algorithms.
Underfitting
For example, in the figure the model is trained to classify circles and crosses, but the straight-line decision boundary it learns fails to separate either of the two classes properly.
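A minimal sketch of this behaviour, assuming scikit-learn is available; the synthetic data set and parameters are purely illustrative. A linear (straight-line) classifier fitted to data with a nonlinear class boundary scores poorly on both the training and the test split:

```python
# Sketch: a too-simple (linear) classifier underfits a nonlinear problem.
# Dataset and parameters are illustrative, not from the slides.
from sklearn.datasets import make_circles
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_circles(n_samples=500, noise=0.1, factor=0.4, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

linear_clf = LogisticRegression().fit(X_train, y_train)       # straight-line boundary
print("train accuracy:", linear_clf.score(X_train, y_train))  # low
print("test accuracy: ", linear_clf.score(X_test, y_test))    # also low -> underfitting
```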
Overfitting:
Overfitting means the model has a high accuracy score on the training data but a low score on the test data.
An overfit model has essentially memorized the data set it has seen and is unable to generalize what it has learned to an unseen data set. That is why an overfit model results in very poor test accuracy. This typically occurs when the model is highly complex, i.e., it uses a large number of input feature combinations and has too much flexibility.
Overfitting
For example, as shown in the figure below, the model is again trained to classify circles and crosses, but unlike last time it learns too well: it even fits the noise in the data by creating an excessively complex decision boundary (right).
Overfitting
◼ Overfitting occurs when a statistical model describes random error or noise instead of the underlying relationship.
◼ Overfitting generally occurs when a model is excessively complex, such as having too many parameters relative to the number of observations.
◼ A model which has been overfit will generally have poor predictive performance.
◼ Overfitting depends not only on the number of parameters and the amount of data but also on the conformability of the model structure.
◼ To avoid overfitting, it is necessary to use additional techniques, e.g. cross-validation or pruning (pre- or post-pruning); see the sketch below.
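A minimal sketch of spotting overfitting with one of these techniques (cross-validation), assuming scikit-learn; the data set and polynomial degrees are illustrative assumptions. A very flexible model scores high on its own training data but low under cross-validation:

```python
# Sketch: training score vs. cross-validated score as model complexity grows.
# A large gap between the two signals overfitting.
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(60, 1))
y = np.sin(X).ravel() + rng.normal(scale=0.3, size=60)    # noisy nonlinear target

for degree in (1, 3, 15):                                 # simple -> very complex
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    train_score = model.fit(X, y).score(X, y)             # R^2 on the data it was fit on
    cv_score = cross_val_score(model, X, y, cv=5).mean()  # 5-fold cross-validated R^2
    print(f"degree {degree:2d}: train R^2 = {train_score:.2f}, CV R^2 = {cv_score:.2f}")
```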
◼ Reason:
◼ Noise in the training data.
Model Comparison
◼ i. Confusion Matrix
◼ ii. ROC Analysis
◼ iii. Others, such as Gain and Lift Charts and K-S Charts
i. Confusion Matrix (Contingency Table):
◼ A confusion matrix contains information about the actual and predicted classifications made by a classifier.
◼ The performance of such a system is commonly evaluated using the data in the matrix.
◼ Also known as a contingency table or an error matrix, it is a specific table layout that allows visualization of the performance of an algorithm.
◼ Each column of the matrix represents the instances in a predicted class, while each row represents the instances in an actual class.
Classifier Evaluation Metrics: Confusion Matrix
Confusion Matrix:

Actual class \ Predicted class | Predicted C1         | Predicted ¬C1
Actual C1                      | True Positives (TP)  | False Negatives (FN)
Actual ¬C1                     | False Positives (FP) | True Negatives (TN)

◼ FPR = 1 − TNR (specificity)
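For reference, the standard rates derived from this table (not spelled out on the slide; these are the usual textbook definitions):

```latex
\begin{align*}
\text{Accuracy} &= \frac{TP + TN}{TP + TN + FP + FN} \\
\text{Sensitivity (TPR)} &= \frac{TP}{TP + FN} \\
\text{Specificity (TNR)} &= \frac{TN}{TN + FP} \\
\text{FPR} &= \frac{FP}{FP + TN} = 1 - \text{TNR}
\end{align*}
```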
Classifier Evaluation Metrics: Precision and Recall, and F-measures
◼ Precision: exactness – what % of tuples that the classifier labeled as positive are actually positive
◼ Recall: completeness – what % of the actually positive tuples the classifier labeled as positive
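The corresponding formulas, added here for completeness since the slide gives only the verbal descriptions (these are the standard definitions of precision, recall, F1, and the weighted Fβ measure):

```latex
\begin{align*}
\text{Precision} &= \frac{TP}{TP + FP} \\
\text{Recall} &= \frac{TP}{TP + FN} \\
F_1 &= \frac{2 \cdot \text{Precision} \cdot \text{Recall}}{\text{Precision} + \text{Recall}} \\
F_\beta &= \frac{(1+\beta^2) \cdot \text{Precision} \cdot \text{Recall}}{\beta^2 \cdot \text{Precision} + \text{Recall}}
\end{align*}
```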
Classifier Evaluation Metrics: Example
ii. ROC Analysis
◼ A Receiver Operating Characteristic (ROC) curve is a graphical plot that illustrates the performance of a binary classifier system as its discrimination threshold is varied.
◼ The curve is created by plotting the true positive rate against the false positive rate at various threshold settings.
◼ The ROC curve therefore plots sensitivity (TPR) versus FPR.
◼ ROC analysis provides tools to select possibly optimal models and to discard suboptimal ones independently of (and prior to specifying) the cost context or the class distribution.
◼ ROC analysis is related in a direct and natural way to
cost/benefit analysis of diagnostic decision making.
Model Selection: ROC Curves
◼ ROC (Receiver Operating Characteristic) curves: for visual comparison of classification models
◼ Originated from signal detection theory
◼ Shows the trade-off between the true positive rate and the false positive rate
◼ Vertical axis represents the true positive rate; horizontal axis represents the false positive rate
◼ The area under the ROC curve is a measure of the accuracy of the model
◼ Rank the test tuples in decreasing order: the one that is most likely to belong to the positive class appears at the top of the list
◼ A model with perfect accuracy will have an area of 1.0
◼ The closer the curve is to the diagonal line (i.e., the closer the area is to 0.5), the less accurate the model
The figure shows the ROC curves of two classification models. The diagonal line, which represents random guessing, is also shown: the closer the ROC curve of a model is to the diagonal line, the less accurate the model.

If the model is really good, initially we are more likely to encounter true positives as we move down the ranked list, so the curve rises steeply from zero. Later, as we start to encounter fewer and fewer true positives and more and more false positives, the curve eases off and becomes more horizontal.

To assess the accuracy of a model, we can measure the area under the curve. Several software packages can perform this calculation. The closer the area is to 0.5, the less accurate the corresponding model; a model with perfect accuracy will have an area of 1.0.
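A minimal sketch of tracing such a curve from a ranked list of classifier scores and measuring the area under it; the labels and scores below are made up for illustration:

```python
# Sketch: build ROC points by sweeping a threshold down the ranked score list,
# then estimate the area under the curve with the trapezoidal rule.
import numpy as np

y_true  = np.array([1, 1, 0, 1, 0, 1, 0, 0, 1, 0])                    # 1 = positive class
y_score = np.array([.95, .9, .8, .7, .65, .6, .5, .4, .3, .2])        # classifier confidence

order = np.argsort(-y_score)                  # rank tuples, most likely positive first
y_ranked = y_true[order]
P, N = y_ranked.sum(), len(y_ranked) - y_ranked.sum()

tpr = np.concatenate(([0.0], np.cumsum(y_ranked) / P))      # true positive rate
fpr = np.concatenate(([0.0], np.cumsum(1 - y_ranked) / N))  # false positive rate

auc = float(np.sum(np.diff(fpr) * (tpr[1:] + tpr[:-1]) / 2.0))  # trapezoidal area
print(f"AUC = {auc:.3f}")                     # 1.0 = perfect, 0.5 = random guessing
```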
Validation
◼ Validation techniques are motivated by two fundamental problems in pattern recognition:
◼ model selection, and
◼ performance estimation
◼ Validation Approaches:
◼ One approach is to use the entire training data to build the classifier and estimate its error rate.
Training Dataset

Class C1: buys_computer = 'yes'
Class C2: buys_computer = 'no'

Data sample: X = (age <= 30, income = medium, student = yes, credit_rating = fair), class (buys_computer) = ?

age    | income | student | credit_rating | buys_computer
<=30   | high   | no      | fair          | no
<=30   | high   | no      | excellent     | no
31…40  | high   | no      | fair          | yes
>40    | medium | no      | fair          | yes
>40    | low    | yes     | fair          | yes
>40    | low    | yes     | excellent     | no
31…40  | low    | yes     | excellent     | yes
<=30   | medium | no      | fair          | no
<=30   | low    | yes     | fair          | yes
>40    | medium | yes     | fair          | yes
<=30   | medium | yes     | excellent     | yes
31…40  | medium | no      | excellent     | yes
31…40  | high   | yes     | fair          | yes
>40    | medium | no      | excellent     | no
◼ Approach 1: Random Subsampling
◼ Random subsampling performs K data splits of the entire dataset.
◼ Each split randomly selects a (fixed) number of examples without replacement.
◼ For each data split we retrain the classifier from scratch, as sketched below.
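A minimal sketch of random subsampling (repeated random train/test splits), assuming scikit-learn; the data set, classifier, and split size are illustrative choices:

```python
# Sketch: random subsampling - K independent random splits, classifier retrained
# from scratch on each split, error estimated as the average test error.
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
K = 10
errors = []
for k in range(K):
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=k)
    clf = DecisionTreeClassifier(random_state=0).fit(X_tr, y_tr)  # retrain from scratch
    errors.append(1.0 - clf.score(X_te, y_te))                    # test error of this split
print(f"estimated error rate: {np.mean(errors):.3f}")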
◼ Approach 2: K-Fold Cross-Validation
◼ K-fold cross-validation is similar to random subsampling.
◼ Create a K-fold partition of the dataset. For each of the K experiments, use K−1 folds for training and the remaining fold for testing; the error estimate is the average of the K error rates (see the sketch below).
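A minimal K-fold cross-validation sketch, assuming scikit-learn; K = 5, the data set, and the classifier are illustrative choices:

```python
# Sketch: K-fold cross-validation - each fold serves once as the test set while
# the other K-1 folds train the model; errors are averaged over the K folds.
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import KFold
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
kf = KFold(n_splits=5, shuffle=True, random_state=0)

fold_errors = []
for train_idx, test_idx in kf.split(X):
    clf = DecisionTreeClassifier(random_state=0).fit(X[train_idx], y[train_idx])
    fold_errors.append(1.0 - clf.score(X[test_idx], y[test_idx]))
print(f"5-fold estimated error rate: {np.mean(fold_errors):.3f}")
```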
◼ Approach 3: Leave-One-Out Cross-Validation
◼ Leave-one-out is the degenerate case of K-fold cross-validation, where K is chosen as the total number of examples, so each test fold contains a single example.
Example: 5-Fold Cross-Validation
Can we Reject H0?
◼ We want to study the effect of drug A and drug B for pain.
◼ We recruit 400 individuals with pain and form 200 pairs, pairing individuals with similar pain score, gender, and age.
◼ We then randomly assign drug A to one of the individuals in each pair, and drug B to the other.

The paired outcomes are summarized in a 2×2 table of counts:

n00   n01
n10   n11

◼ Note: the total count of 200 here is the number of pairs, not the number of individuals.
A single test set: McNemar's test
◼ McNemar's test
• It is the corresponding chi-square test for paired data.
• It compares classifiers A and B on a single test set.
• It considers the number of test items where either A or B makes errors:
◼ n11: number of items classified correctly by both A and B
◼ n00: number of items misclassified by both A and B
◼ n01: number of items misclassified by A but not by B
◼ n10: number of items misclassified by B but not by A
Null hypothesis:
◼ A and B have the same error rate; then n01 = n10.
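The test statistic referred to on the next line is not reproduced in the text; presumably it is the standard McNemar chi-square statistic with continuity correction:

```latex
\chi^2 = \frac{\left(\,|n_{01} - n_{10}| - 1\,\right)^2}{n_{01} + n_{10}}, \qquad \text{with 1 degree of freedom}
```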
NOTE: Some books do not use the −1 (continuity correction) in the formula above.
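A minimal sketch of applying this test, assuming SciPy is available; the counts below are made up for illustration, not taken from the slides:

```python
# Sketch: McNemar's test for two classifiers on the same test set.
# n01 = items A got wrong but B got right; n10 = items B got wrong but A got right.
from scipy.stats import chi2

n01, n10 = 25, 10                                 # illustrative disagreement counts
stat = (abs(n01 - n10) - 1) ** 2 / (n01 + n10)    # continuity-corrected statistic
p_value = chi2.sf(stat, df=1)                     # upper tail of chi-square, 1 d.o.f.
print(f"chi2 = {stat:.2f}, p = {p_value:.4f}")    # small p -> reject H0: n01 = n10
```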
McNemar’s test is used to compare the
performance of two classifiers on the same
test set.
Example using R:
Q. We are interested in whether the proportion of stroke patients unable to walk without an assistive device changes after completing a physical therapy (PT) program. The paired counts are cross-tabulated as "Before PT" versus "After PT".

Solution:
Here the p-value (0.00796) < 0.05 (the alpha value), so the null hypothesis (H0: n01 = n10) is rejected.
This means the program had an effect: the outcome after PT differs from that before PT.
Issues Affecting Model Selection
◼ Accuracy
◼ classifier accuracy: predicting class label
◼ Speed
◼ time to construct the model (training time)
◼ time to use the model (classification/prediction time)
◼ Robustness: handling noise and missing values
◼ Scalability: efficiency in disk-resident databases
◼ Interpretability
◼ understanding and insight provided by the model
◼ Other measures, e.g., goodness of rules, such as decision tree
size or compactness of classification rules
Summary (I)
◼ Classification is a form of data analysis that extracts models
describing important data classes.
◼ Effective and scalable methods have been developed for decision
tree induction, Naive Bayesian classification, rule-based
classification, and many other classification methods.
◼ Evaluation metrics include: accuracy, sensitivity, specificity, precision, recall, F measure, and Fβ measure.
◼ Stratified k-fold cross-validation is recommended for accuracy estimation.
Summary (II)
◼ Significance tests and ROC curves are useful for model selection.
◼ There have been numerous comparisons of the different
classification methods; the matter remains a research topic
◼ No single method has been found to be superior over all others
for all data sets
◼ Issues such as accuracy, training time, robustness, scalability, and interpretability must be considered and can involve trade-offs, further complicating the quest for an overall superior method