Feature Engineering and Polynomial Regression
Goals
In this lab you will:
• explore feature engineering and polynomial regression, which allow you to use the machinery of linear regression to fit complicated, even highly non-linear, functions.
Tools
You will utilize the functions developed in previous labs as well as matplotlib and NumPy.
import numpy as np
import matplotlib.pyplot as plt
from lab_utils_multi import zscore_normalize_features, run_gradient_descent_feng
np.set_printoptions(precision=2)  # reduced display precision on numpy arrays
What if your features/data are non-linear or are combinations of features? For example, housing prices do not tend to be linear with living area; the market penalizes very small and very large houses, resulting in the curves shown in the graphic above. How can we use the machinery of linear regression to fit this curve? Recall, the 'machinery' we have is the ability to modify the parameters $\mathbf{w}$, $b$ in the linear model $f_{\mathbf{w},b}(x) = w_0x_0 + b \tag{1}$ to 'fit' the equation to the training data. However, no amount of adjusting of $\mathbf{w}$, $b$ in (1) will achieve a fit to a non-linear curve.
Polynomial Features
Above we were considering a scenario where the data was non-linear. Let's try using what we know so far to fit a non-linear curve. We'll start with a simple quadratic: $y = 1 + x^2$.
You're familiar with all the routines we're using. They are available in the lab_utils.py file for review. We'll use np.c_[..], a NumPy routine to concatenate along the column boundary.
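As a sketch of that first attempt, assuming run_gradient_descent_feng takes (X, y, iterations, alpha) and returns the fitted w and b (the exact signature lives in lab_utils_multi):
# create target data: a simple quadratic
x = np.arange(0, 20, 1)
y = 1 + x**2
X = x.reshape(-1, 1)  # gradient descent expects a 2-D matrix

# fit a plain linear model; no engineered features yet
model_w, model_b = run_gradient_descent_feng(X, y, iterations=1000, alpha=1e-2)

plt.scatter(x, y, marker='x', c='r', label="Actual Value")
plt.plot(x, X @ model_w + model_b, label="Predicted Value")
plt.title("no feature engineering")
plt.xlabel("X"); plt.ylabel("y"); plt.legend(); plt.show()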
Well, as expected, not a great fit. What is needed is something like $y = w_0x_0^2 + b$, or a polynomial feature. To accomplish this, you can modify the input data to engineer the needed features. If you swap the original data with a version that squares the $x$ value, then you can achieve $y = w_0x_0^2 + b$. Let's try it. Swap X for X**2 below:
# Engineer features
X = x**2              #<-- added engineered feature
X = X.reshape(-1, 1)  # X should be a 2-D matrix
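A fit with the squared feature might then look like the following sketch; the learning rate and iteration count here are illustrative guesses, not prescribed values:
model_w, model_b = run_gradient_descent_feng(X, y, iterations=10000, alpha=1e-5)

plt.scatter(x, y, marker='x', c='r', label="Actual Value")
plt.plot(x, X @ model_w + model_b, label="Predicted Value")
plt.title("Added x**2 feature")
plt.xlabel("x"); plt.ylabel("y"); plt.legend(); plt.show()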
Great! Near perfect fit. Notice the values of $\mathbf{w}$ and $b$ printed right above the graph: w,b found by gradient descent: w: [1.], b: 0.0490. Gradient descent modified our initial values of $\mathbf{w}$, $b$ to be (1.0, 0.049), or a model of $y = 1 \cdot x_0^2 + 0.049$, very close to our target of $y = 1 \cdot x_0^2 + 1$. If you ran it longer, it could be a better match.
Selecting Features
Above, we knew that an $x^2$ term was required. It may not always be obvious which features are required. One could add a variety of potential features to try and find the most useful. For example, what if we had instead tried: $y = w_0x_0 + w_1x_1^2 + w_2x_2^3 + b$?
# engineer features .
X = np.c_[x, x**2, x**3] #<-- added engineered feature
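A sketch of the full cell, under the same assumed helper signature; the very small learning rate is illustrative, chosen because the un-normalized x**3 feature has a large range:
# create target data (as in the earlier cells)
x = np.arange(0, 20, 1)
y = x**2
X = np.c_[x, x**2, x**3]   # the engineered features from above

model_w, model_b = run_gradient_descent_feng(X, y, iterations=10000, alpha=1e-7)

plt.scatter(x, y, marker='x', c='r', label="Actual Value")
plt.plot(x, X @ model_w + model_b, label="Predicted Value")
plt.title("x, x**2, x**3 features")
plt.xlabel("x"); plt.ylabel("y"); plt.legend(); plt.show()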
Note the value of $\mathbf{w}$, [0.08 0.54 0.03], and $b$, 0.0106. This implies the model after fitting/training is:
$0.08x + 0.54x^2 + 0.03x^3 + 0.0106$
Gradient descent has emphasized the data that is the best fit to the $x^2$ data by increasing the $w_1$ term relative to the others. If you were to run for a very long time, it would continue to reduce the impact of the other terms.
Gradient descent is picking the 'correct' features for us by emphasizing its associated parameter:
• Initially, the features were re-scaled so they are comparable to each other
• a smaller weight value implies a less important/correct feature; in the extreme, when the weight becomes zero or very close to zero, the associated feature is not useful in fitting the model to the data
• above, after fitting, the weight associated with the $x^2$ feature is much larger than the weights for $x$ or $x^3$ as it is the most useful in fitting the data
An Alternate View
Above, polynomial features were chosen based on how well they matched the target data.
Another way to think about this is to note that we are still using linear regression once we have
created new features. Given that, the best features will be linear relative to the target. This is
best understood with an example.
# create target data
x = np.arange(0, 20, 1)
y = x**2
# engineer features .
X = np.c_[x, x**2, x**3] #<-- added engineered feature
X_features = ['x','x^2','x^3']
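Plotting each engineered feature against the target makes this visible; a minimal sketch using the arrays defined above:
fig, ax = plt.subplots(1, 3, figsize=(12, 3), sharey=True)
for i in range(len(ax)):
    ax[i].scatter(X[:, i], y)        # one panel per engineered feature
    ax[i].set_xlabel(X_features[i])
ax[0].set_ylabel("y")
plt.show()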
Above, it is clear that the $x^2$ feature mapped against the target value $y$ is linear. Linear regression can then easily generate a model using that feature.
Scaling features
As described in the last lab, if the data set has features with significantly different scales, one should apply feature scaling to speed gradient descent. In the example above, there are $x$, $x^2$ and $x^3$, which will naturally have very different scales. Let's apply Z-score normalization to our example.
# apply z-score normalization to the engineered features
X = zscore_normalize_features(X)
print(f"Peak to Peak range by column in Normalized X: {np.ptp(X,axis=0)}")

# re-create the target data for the normalized fit
x = np.arange(0, 20, 1)
y = x**2
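With normalized features, gradient descent can use a much larger learning rate. A minimal sketch of the fit, assuming the run_gradient_descent_feng signature used earlier; the alpha and iteration values are illustrative:
X = np.c_[x, x**2, x**3]
X = zscore_normalize_features(X)

model_w, model_b = run_gradient_descent_feng(X, y, iterations=100000, alpha=1e-1)

plt.scatter(x, y, marker='x', c='r', label="Actual Value")
plt.plot(x, X @ model_w + model_b, label="Predicted Value")
plt.title("Normalized x, x**2, x**3 features")
plt.xlabel("x"); plt.ylabel("y"); plt.legend(); plt.show()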
Complex Functions
With feature engineering, even quite complex functions can be modeled:
x = np.arange(0,20,1)
y = np.cos(x/2)
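One hedged sketch of such a model: engineer a long list of polynomial features and normalize them before fitting. The choice of powers (up to x**13) and the hyperparameters are illustrative, not the only ones that work:
X = np.c_[x, x**2, x**3, x**4, x**5, x**6, x**7,
          x**8, x**9, x**10, x**11, x**12, x**13]
X = zscore_normalize_features(X)

model_w, model_b = run_gradient_descent_feng(X, y, iterations=1000000, alpha=1e-1)

plt.scatter(x, y, marker='x', c='r', label="Actual Value")
plt.plot(x, X @ model_w + model_b, label="Predicted Value")
plt.title("Normalized polynomial features vs. cos(x/2)")
plt.xlabel("x"); plt.ylabel("y"); plt.legend(); plt.show()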
Congratulations!
In this lab you:
• learned how linear regression can model complex, even highly non-linear functions using
feature engineering
• recognized that it is important to apply feature scaling when doing feature engineering