The document discusses techniques for assessing the performance of data mining models, including R-squared for linear regression, AUC and the Gini coefficient for categorical response models, and confusion matrices. Higher R-squared, AUC, and Gini values indicate better model performance.

Section 5: Measuring Models

1
Main objective of this session

Aim

• To present different techniques used to assess the performance of data mining models.

Learning outcomes

1. Understand how the performance of linear regression models is measured: R-squared.

2. Understand how the discriminatory power of scorecards is measured: ROC and AUROC, and the Gini coefficient.
2
1.4 SEMMA Process

SAMPLE
• Acquire an unbiased sample of the data which describes the situation.
• Define the "target" variables which capture the response of the situation.

EXPLORE
• For each variable: get a feel for typical values.
• Outlier detection.
• Inter-relationships between different variables.

MODIFY
• Data adjustment.
• Outlier treatments.
• Adjust/take functions of the data to put it in its most useful form.

MODEL
• Statistical techniques and models on the data to undertake the required data mining task.

ASSESS
• Determine how well the model fits the data.
• What confidence you should have in the results obtained (measurement function).

3
6.1 Assessing linear models

• Coefficient of determination R2

– Says how much of the variation in y is explained by the model.

– Is a goodness-of-fit measure.

• Use the adjusted R2 for multivariate linear models.

• R2 is a number between 0 and 1.

– High values mean that most of the variation in y is explained by the model; low values mean that little variation is explained in this way.
4
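As a minimal sketch (with made-up data), R2 and the adjusted R2 for a simple least-squares fit can be computed directly from the residual and total sums of squares:

```python
import numpy as np

# Hypothetical example: simulated data, not from the slides.
rng = np.random.default_rng(0)
x = rng.uniform(0, 10, size=50)
y = 3.0 * x + 2.0 + rng.normal(0.0, 2.0, size=50)

# Least-squares fit y ~ a*x + b
a, b = np.polyfit(x, y, 1)
y_hat = a * x + b

ss_res = np.sum((y - y_hat) ** 2)        # residual sum of squares
ss_tot = np.sum((y - np.mean(y)) ** 2)   # total sum of squares
r2 = 1 - ss_res / ss_tot                 # coefficient of determination

n, p = len(y), 1                         # n observations, p predictors
adj_r2 = 1 - (1 - r2) * (n - 1) / (n - p - 1)
```

The adjusted R2 penalises the plain R2 for the number of predictors, which is why it is the preferred figure for multivariate models.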
6.2 Assessing categorical response models – scorecards

• Discriminatory power

– Given a cut-off, the model can make the classification. Example, credit card fraud detection: we have a model that assigns a probability P(Y = 1 | X1, ..., Xn) that a credit card transaction is fraudulent. Transactions scoring above the cut-off are classified as "Fraud"; the rest as "No Fraud".
5
6.2 Assessing categorical response models – scorecards

• Discriminatory power

– Confusion Matrix: given a particular cut-off

                                            True values
  Test result (from the model)   FRAUD                 NO FRAUD               TOTAL
  Model said "Fraud"             TRUE POSITIVE (TP)    FALSE POSITIVE (FP)    TOTAL POSITIVES
  Model said "No Fraud"          FALSE NEGATIVE (FN)   TRUE NEGATIVE (TN)     TOTAL NEGATIVES
  TOTAL                          TOTAL FRAUDS          TOTAL NO FRAUDS        TOTAL CASES

– True Positive: correct detection that the event has happened.

6


6.2 Assessing categorical response models – scorecards

• Discriminatory power

– From the same confusion matrix, given a particular cut-off:

   Classification accuracy = (TP + TN) / (TOTAL CASES)

   Error rate = (FP + FN) / (TOTAL CASES)

7
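A small sketch (labels and scores are made up for illustration) of building the confusion matrix counts for one cut-off and deriving accuracy and error rate:

```python
# Hypothetical data: 1 = fraud, 0 = no fraud.
y_true = [1, 0, 1, 1, 0, 0, 1, 0, 0, 1]
scores = [0.9, 0.2, 0.7, 0.4, 0.6, 0.1, 0.8, 0.3, 0.5, 0.95]
cut_off = 0.5

# Classify: score at or above the cut-off -> "Fraud" (1).
y_pred = [1 if s >= cut_off else 0 for s in scores]

# The four cells of the confusion matrix.
tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)

accuracy = (tp + tn) / len(y_true)       # (TP+TN) / TOTAL CASES
error_rate = (fp + fn) / len(y_true)     # (FP+FN) / TOTAL CASES
```

By construction the two measures always sum to 1, since every case is either classified correctly or incorrectly.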
6.2 Assessing categorical response models – scorecards

• Discriminatory power

– For each cut-off we obtain a new confusion matrix and one pair of (sensitivity, 1 - specificity), where sensitivity = TP / (TP + FN) and 1 - specificity = FP / (FP + TN). Sweeping the cut-off over the range of P(Y = 1 | X1, ..., Xn) produces all such pairs.
8
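The sweep above can be sketched as follows (same made-up labels and scores as before; one ROC point per distinct cut-off):

```python
# Hypothetical data: 1 = fraud, 0 = no fraud.
y_true = [1, 0, 1, 1, 0, 0, 1, 0, 0, 1]
scores = [0.9, 0.2, 0.7, 0.4, 0.6, 0.1, 0.8, 0.3, 0.5, 0.95]

roc_points = []
for cut_off in sorted(set(scores), reverse=True):
    y_pred = [1 if s >= cut_off else 0 for s in scores]
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    sensitivity = tp / (tp + fn)
    one_minus_specificity = fp / (fp + tn)
    roc_points.append((one_minus_specificity, sensitivity))
```

Plotting `roc_points` with 1 - specificity on the x-axis and sensitivity on the y-axis gives the ROC curve of the next slide.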
6.2 Assessing categorical response models – scorecards

• Discriminatory power

– Plotting all (sensitivity, 1 - specificity) pairs, we obtain the Receiver Operating Characteristic (ROC) curve.

– The diagonal represents a "random" classifier model.

[Figure: ROC curve, sensitivity against 1 - specificity]
9
6.2 Assessing categorical response models – scorecards

• Discriminatory power

– Interpretation of the ROC and AUROC

 • The Area Under the ROC curve (AUROC, or C) is a measure of discriminatory power. Notice that 0.5 ≤ AUROC ≤ 1.

 • An intuitive interpretation of the AUROC is that it provides an estimate of the probability that a randomly chosen instance of class 1 is correctly ranked higher than a randomly chosen instance of class 0 (Hanley and McNeil, 1983).

 • The higher, the better!

10
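The pairwise-ranking interpretation above can be turned directly into a sketch of the AUROC computation (ties count one half; data is made up):

```python
import itertools

# Hypothetical data: 1 = class 1 (fraud), 0 = class 0 (no fraud).
y_true = [1, 0, 1, 1, 0, 0, 1, 0, 0, 1]
scores = [0.9, 0.2, 0.7, 0.4, 0.6, 0.1, 0.8, 0.3, 0.5, 0.95]

pos = [s for s, t in zip(scores, y_true) if t == 1]
neg = [s for s, t in zip(scores, y_true) if t == 0]

# Proportion of (class-1, class-0) pairs ranked correctly.
wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
           for p, n in itertools.product(pos, neg))
auroc = wins / (len(pos) * len(neg))
```

This pair-counting estimate coincides with the area obtained by integrating under the empirical ROC curve.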
6.2 Assessing categorical response models – scorecards

• Discriminatory power

[Figure: three ROC curves with AUROC = 0.74, AUROC = 0.85 and AUROC = 0.94]

– Models with AUROC larger than 0.7 are acceptable (Mays, 2004).

– Models with AUROC larger than 0.95: warning, "too good to be true".

11
6.2 Assessing categorical response models – scorecards

• Discriminatory power

– The Gini coefficient:

   GINI = 2 × (area between the ROC curve and the diagonal)
        = (area between the ROC curve and the random scorecard curve) / (area between the perfect scorecard curve and the random scorecard curve)

 • If GINI = 1, then perfect discrimination.

 • If GINI = 0, then no discrimination.

 • Relationship between AUROC and GINI:

   GINI = 2(AUROC − 0.5) = 2 AUROC − 1

[Figure: scorecard curves plotting F(s | B) against F(s | G), with areas A, B, C]

12
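Given the relationship above, the Gini coefficient follows immediately from any AUROC value (the AUROC below is made up for illustration):

```python
# Hypothetical AUROC value, e.g. from the pair-counting estimate.
auroc = 0.85

# GINI = 2(AUROC - 0.5) = 2*AUROC - 1
gini = 2 * auroc - 1

# Sanity checks on the two extremes:
# a random scorecard (AUROC = 0.5) gives GINI = 0;
# a perfect scorecard (AUROC = 1.0) gives GINI = 1.
```

Because the mapping is linear, ranking models by Gini and ranking them by AUROC always give the same order.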