0% found this document useful (0 votes)

6 views9 pages

lesson-3.2-introduction-to-regression-structured-projects

The document discusses a structured data project focused on predicting bulldozer sale prices using regression techniques. It covers important concepts such as cross-validation, training and test splits, and various regression metrics like R2, MAE, and MSE. The document emphasizes the significance of understanding model generalization and choosing appropriate evaluation metrics for performance assessment.

Uploaded by

soulopp27

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views9 pages

lesson-3.2-introduction-to-regression-structured-projects

Uploaded by

soulopp27

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Structured Data Project 2:

Predicting the sale price of

Bulldozers (regression)
Data

🕰
🚜 💰
Where can you get help?

• Follow along with the code

• Try it for yourself
• Press SHIFT + TAB to read the docstring
• Search for it
• Try again
• Ask
Cross-validation
5-fold Cross-validation
100 patient records
Normal Train & Test Split

100 patient records

Split 20 80 patient records

80 patient records 20

Training split (80%) Test split (20%)

Model is trained on training data, and evaluated on the test

data.

Model is trained on 5 diﬀerent versions of training data, and

evaluated on 5 diﬀerent versions of the test data.
The most important concept in
machine learning
(the 3 sets)

Course materials Practice exam Final exam

(training set) (validation set) (test set)

The ability for a machine learning model to perform

Generalization well on data it hasn’t seen before.
Classification and Regression
metrics
Classification Regression

Accuracy R2 (r-squared)

Precision Mean absolute error (MAE)

Recall Mean squared error (MSE)

F1 Root mean squared error (RMSE)

Bold = default evaluation in Scikit-Learn

Which regression metric should you
use?
• R2 is similar to accuracy. It gives you a quick indication of how well your model might be doing.
Generally, the closer your R2 value is to 1.0, the better the model. But it doesn't really tell exactly
how wrong your model is in terms of how far off each prediction is.
• MAE gives a better indication of how far off each of your model's predictions are on average.
• As for MAE or MSE, because of the way MSE is calculated, squaring the differences between
predicted values and actual values, it amplifies larger differences. Let's say we're predicting the
value of houses (which we are).
• Pay more attention to MAE: When being $10,000 off is twice as bad as being $5,000 off.
• Pay more attention to MSE: When being $10,000 off is more than twice as bad as being
$5,000 off.

The Coefficient of Determination R-Squared Is More Informative Than SMAPE, MAE, MAPE, MSE, and RMSE in Regression Analysis Evaluation
No ratings yet
The Coefficient of Determination R-Squared Is More Informative Than SMAPE, MAE, MAPE, MSE, and RMSE in Regression Analysis Evaluation
28 pages
Model Evaluation Metrics
No ratings yet
Model Evaluation Metrics
21 pages
Unit 2
No ratings yet
Unit 2
80 pages
Lesson 2.4.1 What is Scikit Learn Keynote
No ratings yet
Lesson 2.4.1 What is Scikit Learn Keynote
21 pages
1-Linear Regression
No ratings yet
1-Linear Regression
22 pages
COMP1801 - Copy 1
No ratings yet
COMP1801 - Copy 1
18 pages
Regression Metrics
No ratings yet
Regression Metrics
26 pages
Session 6 - Gross Validation
No ratings yet
Session 6 - Gross Validation
26 pages
02-MLR For Prediction
No ratings yet
02-MLR For Prediction
24 pages
2. Performance Measures
No ratings yet
2. Performance Measures
19 pages
An Introduction To Model Accuracy and Metrics (Slides)
No ratings yet
An Introduction To Model Accuracy and Metrics (Slides)
13 pages
PS Notes (Machine Learning
No ratings yet
PS Notes (Machine Learning
14 pages
03 Regression
No ratings yet
03 Regression
39 pages
The Coefficient of Determination R-Squared Is More Informative Than SMAPE, MAE, MAPE, MSE and RMSE in Regression Analysis Evaluation
No ratings yet
The Coefficient of Determination R-Squared Is More Informative Than SMAPE, MAE, MAPE, MSE and RMSE in Regression Analysis Evaluation
25 pages
7 Regression
No ratings yet
7 Regression
15 pages
Regression
No ratings yet
Regression
19 pages
Module 1
No ratings yet
Module 1
19 pages
Book of Textile Engineering
0% (1)
Book of Textile Engineering
122 pages
Lecture-18 - Evaluation Metrics For Different Model
No ratings yet
Lecture-18 - Evaluation Metrics For Different Model
27 pages
Mid-1 ML
No ratings yet
Mid-1 ML
12 pages
Linear Regression Summary
No ratings yet
Linear Regression Summary
57 pages
MECH4403 LR Week04
No ratings yet
MECH4403 LR Week04
25 pages
Chapter2 1 55ppt
No ratings yet
Chapter2 1 55ppt
8 pages
Model Evaluation
No ratings yet
Model Evaluation
18 pages
L4b - Perfomance Evaluation Metric - Regression
No ratings yet
L4b - Perfomance Evaluation Metric - Regression
6 pages
Machine Learning
No ratings yet
Machine Learning
19 pages
Week 6 - Lecture 12-1
No ratings yet
Week 6 - Lecture 12-1
34 pages
Intermediate Analytics-Chai Square and ANOA-Week 2-1
No ratings yet
Intermediate Analytics-Chai Square and ANOA-Week 2-1
45 pages
DA U3
No ratings yet
DA U3
10 pages
MLA Manual
No ratings yet
MLA Manual
25 pages
Module 3 - ML
No ratings yet
Module 3 - ML
101 pages
Ml Algo Terms
No ratings yet
Ml Algo Terms
11 pages
Model Evalution
No ratings yet
Model Evalution
6 pages
Regression
No ratings yet
Regression
35 pages
Predictive ModellingAnalytics
No ratings yet
Predictive ModellingAnalytics
27 pages
ML
No ratings yet
ML
6 pages
2_DataPreProcessing_code
No ratings yet
2_DataPreProcessing_code
46 pages
NON-Linear SVM and Evaluation metrices
No ratings yet
NON-Linear SVM and Evaluation metrices
13 pages
Assesing Performance of Regression-Error Measures
No ratings yet
Assesing Performance of Regression-Error Measures
5 pages
Evaluation Metrics for Your Regression Model - Analytics Vidhya
No ratings yet
Evaluation Metrics for Your Regression Model - Analytics Vidhya
6 pages
UNIT 3 Regression
No ratings yet
UNIT 3 Regression
5 pages
Metrix in ML
No ratings yet
Metrix in ML
7 pages
Assignment - 01
No ratings yet
Assignment - 01
1 page
Regression Metrics
No ratings yet
Regression Metrics
3 pages
ML exp5
No ratings yet
ML exp5
7 pages
Common metrics used to evaluate the performance of regression models
No ratings yet
Common metrics used to evaluate the performance of regression models
3 pages
performance evaluation
No ratings yet
performance evaluation
24 pages
Module_2
No ratings yet
Module_2
5 pages
Metric
No ratings yet
Metric
6 pages
chapter 1 capstone project ai class 12
No ratings yet
chapter 1 capstone project ai class 12
5 pages
Regression v33
No ratings yet
Regression v33
81 pages
Unit2 ML Notes
No ratings yet
Unit2 ML Notes
19 pages
Lecture-05 ML - Regression Model training
No ratings yet
Lecture-05 ML - Regression Model training
9 pages
Machine learning notes
No ratings yet
Machine learning notes
12 pages
L4b - Perfomance Evaluation Metric - Regression
No ratings yet
L4b - Perfomance Evaluation Metric - Regression
6 pages
Machine Learning Model Evaluation | Zero To Mastery Academy
No ratings yet
Machine Learning Model Evaluation | Zero To Mastery Academy
1 page
Data Science Statistics Mathematics Cheat Sheet
100% (1)
Data Science Statistics Mathematics Cheat Sheet
13 pages
Inmetro 对电池的要求
No ratings yet
Inmetro 对电池的要求
7 pages
A Guide On How To Compare Different Models in Linear Progression
No ratings yet
A Guide On How To Compare Different Models in Linear Progression
8 pages
StatementOfAccount_3134583396_Jan14_153418
No ratings yet
StatementOfAccount_3134583396_Jan14_153418
211 pages
Scientech 2808
No ratings yet
Scientech 2808
138 pages
KS Schadensbroschüre Englisch PDF
No ratings yet
KS Schadensbroschüre Englisch PDF
92 pages
Defectos y Soluciones de Monitores
100% (1)
Defectos y Soluciones de Monitores
355 pages
Eegame Logcat
No ratings yet
Eegame Logcat
20 pages
7 Nov Science
No ratings yet
7 Nov Science
19 pages
Ueb 3112 (Ks-Teori)
No ratings yet
Ueb 3112 (Ks-Teori)
21 pages
Waves Lab
0% (1)
Waves Lab
6 pages
Servo
No ratings yet
Servo
24 pages
Spike-NLOS Multi-Purpose Missile System
No ratings yet
Spike-NLOS Multi-Purpose Missile System
2 pages
Tesys T LTM R Ethernet/Ip With A Third-Party PLC: Quick Start Guide
No ratings yet
Tesys T LTM R Ethernet/Ip With A Third-Party PLC: Quick Start Guide
28 pages
Gnupro Userguide
No ratings yet
Gnupro Userguide
74 pages
Simulation of The Catalytic Partial Oxidation of Methane To Synthesis Gas by D.groote, Froment
100% (1)
Simulation of The Catalytic Partial Oxidation of Methane To Synthesis Gas by D.groote, Froment
20 pages
Orthogonal Frequency - Division Multiplexing: Mathematical Description
No ratings yet
Orthogonal Frequency - Division Multiplexing: Mathematical Description
16 pages
Music Prep Chart 2017-2018
No ratings yet
Music Prep Chart 2017-2018
6 pages
Boiler Annual Check List
100% (2)
Boiler Annual Check List
4 pages
Saral: Compact, Accurate and Reliable
No ratings yet
Saral: Compact, Accurate and Reliable
2 pages
Design of Air Bearing For High Speed Micro Gas Turbine: M.Muruganandam
No ratings yet
Design of Air Bearing For High Speed Micro Gas Turbine: M.Muruganandam
10 pages
Formula For Aerodynamic Heating
No ratings yet
Formula For Aerodynamic Heating
3 pages
Radio Communication
No ratings yet
Radio Communication
7 pages
FMM 100datk
No ratings yet
FMM 100datk
2 pages
FM09016A NV Series HD Upgrade NAE100 101 Exciters
No ratings yet
FM09016A NV Series HD Upgrade NAE100 101 Exciters
9 pages
Candidate Hall Ticket
No ratings yet
Candidate Hall Ticket
2 pages
Online Application: How Do I Apply at Henkel?
No ratings yet
Online Application: How Do I Apply at Henkel?
7 pages
Meherwan P Boyce - Gas Turbine Engineering Handbook-Elsevier Butterworth-Heinemann (2012) 28
No ratings yet
Meherwan P Boyce - Gas Turbine Engineering Handbook-Elsevier Butterworth-Heinemann (2012) 28
5 pages
Design Examples
No ratings yet
Design Examples
7 pages
Lyrics Finder 11 - Help File
No ratings yet
Lyrics Finder 11 - Help File
2 pages
Process Heater Efficiency Testing Results & Annual Cost Savings Calculations
No ratings yet
Process Heater Efficiency Testing Results & Annual Cost Savings Calculations
13 pages
Random Sample Consensus: Robust Estimation in Computer Vision
From Everand
Random Sample Consensus: Robust Estimation in Computer Vision
Fouad Sabry
No ratings yet
Machine Learning Interview Questions
From Everand
Machine Learning Interview Questions
Tech Interviews
4.5/5 (2)
Sample Size for Analytical Surveys, Using a Pretest-Posttest-Comparison-Group Design
From Everand
Sample Size for Analytical Surveys, Using a Pretest-Posttest-Comparison-Group Design
Joseph George Caldwell
No ratings yet

lesson-3.2-introduction-to-regression-structured-projects

Uploaded by

lesson-3.2-introduction-to-regression-structured-projects

Uploaded by

Structured Data Project 2:

Predicting the sale price of

• Follow along with the code

100 patient records

Split 20 80 patient records

Training split (80%) Test split (20%)

Model is trained on training data, and evaluated on the test

Model is trained on 5 diﬀerent versions of training data, and

Course materials Practice exam Final exam

The ability for a machine learning model to perform

Precision Mean absolute error (MAE)

Recall Mean squared error (MSE)

F1 Root mean squared error (RMSE)

Bold = default evaluation in Scikit-Learn

You might also like