0% found this document useful (0 votes)

24 views

Exp 1

The document describes experiments implementing linear regression in Python to predict stock prices and other variables. It shows how to divide datasets into training and test sets, fit linear regression models, calculate errors, and evaluate the performance of polynomial regression models of different degrees on noisy data. The highest degree polynomial model of 9 was found to best represent the corrupted signal, while the first degree model underfit the data, demonstrating the importance of model selection.

Uploaded by

jay

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views

Exp 1

Uploaded by

jay

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

Machine Learning 21BEC505

Experiment-1
Objective: To implement of Linear Regression in Python
Task 1: Implementing Linear Regression in Python
Code:
import numpy as np
from sklearn.linear_model import LinearRegression
x = np.array([5,15,25,35,45,55]).reshape((-1,1))
y = np.array([5,20,14,32,22,38])
print(x)
print(y,"\n")

model = LinearRegression()
model.fit(x,y)
r_sq = model.score(x,y)
print("coefficeient of determination: ",r_sq,"\n")
print("intercept w0: ",model.intercept_)
print("Slope w1: ",model.coef_,"\n")

new_model = LinearRegression().fit(x,y.reshape((-1,1)))
print("intercept w0: ",new_model.intercept_)
print("Slope w1: ",new_model.coef_,"\n")

y_pred = model.predict(x)
print('predicted response: ',y_pred,sep='\n')
print('\n')
y_pred = model.intercept_ + model.coef_ * x
print('predicted response: ',y_pred,sep='\n')
print('\n')

x_new = np.arange(6).reshape((-1,1))
print(x_new,'\n')
y_new = model.predict(x_new)
print(y_new,'\n')
Machine Learning 21BEC505

Output:

Task 2: Multiple Linear Regression With scikit-learn

Code:
import numpy as np
from sklearn.linear_model import LinearRegression
x = np.array([[0,1],[5,1],[15,2],[25,5],[35,11],[45,15],[55,34],[60,35]])
y = np.array([4,5,20,14,32,22,38,43])
print(x,'\n')
print(y,"\n")

model = LinearRegression().fit(x,y)
r_sq = model.score(x,y)
print("coefficeient of determination: ",r_sq,"\n")
print("intercept w0: ",model.intercept_)
Machine Learning 21BEC505

print("Slope w1: ",model.coef_,"\n")

y_pred = model.predict(x)
print('predicted response: ',y_pred,sep='\n')
print('\n')
y_pred = model.intercept_ + np.sum(model.coef_ * x, axis=1)
print('predicted response: ',y_pred,sep='\n')
print('\n')

x_new = np.arange(10).reshape((-1,2))
print(x_new,'\n')
y_new = model.predict(x_new)
print(y_new,'\n')
Output:
Machine Learning 21BEC505

Task 3: Write a program to predict the salary of an employee using Linear Regression.
Code:

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

dataset = pd.read_csv(r'E:\Jay\NIRMA\Sem6\ML\Exp1\Salary_Data.csv')
X = dataset.iloc[:,:-1].values
y = dataset.iloc[:,1].values

from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X,y,test_size=1/3,random_state=0)

from sklearn.linear_model import LinearRegression

regressor = LinearRegression()
regressor.fit(X_train, y_train)
y_pred = regressor.predict(X_test)

plt.scatter(X_train, y_train, color='red')

plt.plot(X_train,regressor.predict(X_train),color='blue')
plt.title('Salary vs Experience(Training set)')
plt.xlabel('Years of Experience')
plt.ylabel('Salary')
plt.show()

plt.scatter(X_test, y_test, color='red')

plt.plot(X_train,regressor.predict(X_train),color='blue')
plt.title('Salary vs Experience(Test set)')
plt.xlabel('Years of Experience')
plt.ylabel('Salary')
plt.show()

model = LinearRegression().fit(X,y)
r_sq = model.score(X,y)
Machine Learning 21BEC505

print("coefficeient of determination: ",r_sq,"\n")

print("intercept w0: ",model.intercept_)
print("Coef w1: ",model.coef_,"\n")
Output:
Machine Learning 21BEC505

Exercise:
1. Apply Linear Regression to predict the stock market price.
Code:
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

dataset = pd.read_csv(r'E:\Jay\NIRMA\Sem6\ML\Exp1\prices-split-adjusted.csv')
y = dataset.iloc[:30,3].values
dataset.drop('close', inplace=True, axis=1)
X = dataset.iloc[:30,2:].values
print('Y: ',y)
print("\n")
print('X: ',X)
print("\n")
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X,y,test_size=1/3,random_state=0)

from sklearn.linear_model import LinearRegression

regressor = LinearRegression()
regressor.fit(X_train, y_train)
y_pred = regressor.predict(X_test)
np.set_printoptions(precision=2)
print('y_predicted: \n')
print(np.concatenate((y_pred.reshape(len(y_pred),1), y_test.reshape(len(y_test),1)),1))

Output:
Machine Learning 21BEC505

2. Create a regression model for an oscillating sinusoidal function corrupted with Gaussian noise of 0
mean and 0.25 variance as the output.
 Generate polynomial model for all degrees starting 1 through 9 for training set data.
 Get the predicted output for each fit model.
 Analyze the error on the training data for each polynomial degree feature.
 Investigate the best fit model to obtain the coefficients and constants and test it on the test data
set.
 Evaluate the error of test set.
 Conclude which regression model is best torepresent the corrupted signal.
 Identify the underfit and overfit model, if there is any.
Machine Learning 21BEC505

Code:
import numpy as np
import matplotlib.pyplot as plt
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import PolynomialFeatures
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

x = np.linspace(-5, 5, 1000)
y = np.sin(x) + np.random.normal(0, 0.25, len(x))
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=1/3, random_state=0)
plt.scatter(x_train, y_train, s=5, label='Training data')
plt.scatter(x_test, y_test, s=5, label='Test data')
plt.legend()
plt.show()

# Generate polynomial models of degrees 1 through 9 and fit to training data

train_errors = []
test_errors = []
degrees = range(1, 10)
for degree in degrees:
poly = PolynomialFeatures(degree)
x_poly_train = poly.fit_transform(x_train.reshape(-1, 1))

# Fit model to training data

model = LinearRegression()
model.fit(x_poly_train, y_train)

# Get predicted outputs for training and test data

y_pred_train = model.predict(x_poly_train)
y_pred_test = model.predict(poly.fit_transform(x_test.reshape(-1, 1)))

# Calculate errors on training and test data

train_error = mean_squared_error(y_train, y_pred_train)
Machine Learning 21BEC505

test_error = mean_squared_error(y_test, y_pred_test)

train_errors.append(train_error)
test_errors.append(test_error)

# Print coefficients and constants for polynomial regression model

print('Degree:', degree)
print('Coefficients:', model.coef_)
print('Constants:', model.intercept_)
print('Training error:', train_error)
print('Test error:', test_error)
print()

# Plot the training and test errors for each polynomial degree
plt.plot(degrees, train_errors, label='Training error')
plt.plot(degrees, test_errors, label='Test error')
plt.legend()
plt.show()

# Find the degree with the lowest test error

best_degree = np.argmin(test_errors) + 1
print('Best fit model has degree:', best_degree)

Output:
Machine Learning 21BEC505
Machine Learning 21BEC505

Conclusion:
This experiment taught us how to divide a dataset into training and testing sets for machine learning
applications and how to use linear regression to predict stock prices. However, it is essential to keep in mind
that, despite the fact that linear regression can be a useful model for predicting stock prices, there are numerous
other factors that can influence stock prices, and accurate predictions may necessitate the application of
additional machine learning models and strategies. Additionally, the test data's lowest error indicated that the
polynomial model with degree 9 was the most accurate representation of the corrupted signal. This suggests
that, in comparison to the lower degree polynomial models, the higher degree polynomial models were better
able to capture the underlying pattern in the data. When a model is too simple and unable to capture the
underlying pattern in the data, it is considered to be an underfit model, resulting in high test and training errors.
Due to its high training and test error, the first degree polynomial model appears to be underfitting the data in
this instance. When a model is too complex and fits the data's noise, an overfit model has a low test error but
a high training error. We didn't see any overfitting in this case.

Staticschapter 3
83% (29)
Staticschapter 3
205 pages
19BCS2059 DL1
No ratings yet
19BCS2059 DL1
4 pages
Wa0002.
No ratings yet
Wa0002.
5 pages
ML Lab Manual
100% (1)
ML Lab Manual
37 pages
2 Linear Regression
No ratings yet
2 Linear Regression
5 pages
ML Lab Manual
No ratings yet
ML Lab Manual
29 pages
Linear Regression - Numpy and Sklearn
No ratings yet
Linear Regression - Numpy and Sklearn
7 pages
Machine Learning With Python Algorithms
No ratings yet
Machine Learning With Python Algorithms
28 pages
Linear Regression Code
No ratings yet
Linear Regression Code
5 pages
Lab Experiments Vi Sem-1
No ratings yet
Lab Experiments Vi Sem-1
10 pages
ML Lab 07
No ratings yet
ML Lab 07
4 pages
sahil_ml
No ratings yet
sahil_ml
21 pages
Lect03 Linear Model ML
No ratings yet
Lect03 Linear Model ML
93 pages
ML-Lab07-Building and Evaluating Multivariate Regression Models in Python
No ratings yet
ML-Lab07-Building and Evaluating Multivariate Regression Models in Python
5 pages
ML Activity Kalyan
No ratings yet
ML Activity Kalyan
21 pages
Zerox Ready
No ratings yet
Zerox Ready
21 pages
ml2020 Pythonlab02
No ratings yet
ml2020 Pythonlab02
3 pages
ML manoj
No ratings yet
ML manoj
51 pages
MLR Example 2predictors
No ratings yet
MLR Example 2predictors
5 pages
Machine Learning Hands-On
100% (1)
Machine Learning Hands-On
18 pages
ML Experiment No 1 Linear Regression Analysis
No ratings yet
ML Experiment No 1 Linear Regression Analysis
3 pages
Matlab Homework Experts 2
No ratings yet
Matlab Homework Experts 2
10 pages
lab mannual of ML
No ratings yet
lab mannual of ML
43 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
43 pages
CSL0777 L15
No ratings yet
CSL0777 L15
24 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
Unit 2 Regression Analysis
No ratings yet
Unit 2 Regression Analysis
16 pages
Exercise 03
No ratings yet
Exercise 03
5 pages
Unit 5
No ratings yet
Unit 5
171 pages
Simple Linear Regression: Math Behind
No ratings yet
Simple Linear Regression: Math Behind
6 pages
2-(9-3) Regression Classifiers
No ratings yet
2-(9-3) Regression Classifiers
35 pages
Today: - Calculus
No ratings yet
Today: - Calculus
61 pages
Lab Manual 05
No ratings yet
Lab Manual 05
13 pages
Lect03 Linear Model ML
No ratings yet
Lect03 Linear Model ML
100 pages
LP III Lab Manual
100% (1)
LP III Lab Manual
8 pages
Practical # 10
No ratings yet
Practical # 10
5 pages
AIML PRACTICALS
No ratings yet
AIML PRACTICALS
22 pages
Regression Analysis
No ratings yet
Regression Analysis
16 pages
Week2 BBM406 Lec2.1 LinearRegression
No ratings yet
Week2 BBM406 Lec2.1 LinearRegression
49 pages
Regression Model
No ratings yet
Regression Model
6 pages
Dflyw9x3wm16 ML B1
No ratings yet
Dflyw9x3wm16 ML B1
9 pages
II YEAR AM3403 ML Concepts- Applications QB.docx
No ratings yet
II YEAR AM3403 ML Concepts- Applications QB.docx
18 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
23 pages
2.1 ML (Implementation of Simple Linear Regression in Python)
No ratings yet
2.1 ML (Implementation of Simple Linear Regression in Python)
8 pages
Machine Learning 2
No ratings yet
Machine Learning 2
45 pages
LR-LogReg
No ratings yet
LR-LogReg
53 pages
Home Ai Machine Learning Dbms Java Blockchain Control System Selenium HTML Css Javascript Ds
No ratings yet
Home Ai Machine Learning Dbms Java Blockchain Control System Selenium HTML Css Javascript Ds
11 pages
Ml Cyber Lab
No ratings yet
Ml Cyber Lab
16 pages
06 Regression With Simple Data Preparation
No ratings yet
06 Regression With Simple Data Preparation
2 pages
Exp 1
No ratings yet
Exp 1
6 pages
Vishal AIML 2.2
No ratings yet
Vishal AIML 2.2
4 pages
2.3 ML (Implementation of Polynomial Regression Using Python)
No ratings yet
2.3 ML (Implementation of Polynomial Regression Using Python)
9 pages
Lab01 Linear Regression
No ratings yet
Lab01 Linear Regression
4 pages
Regression Dataset Example
No ratings yet
Regression Dataset Example
14 pages
Linear Regression
No ratings yet
Linear Regression
6 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
5 pages
Data Science Chapitre 2
No ratings yet
Data Science Chapitre 2
98 pages
Regression
No ratings yet
Regression
16 pages
B-56 Sanket Jambhulkar MLA-2
No ratings yet
B-56 Sanket Jambhulkar MLA-2
8 pages
MA TH 183: L-lff-2/NAME Date: 10/04/2019
No ratings yet
MA TH 183: L-lff-2/NAME Date: 10/04/2019
2 pages
Dr. Joon-Yeoul Oh: IEEN 5335 Principles of Optimization
No ratings yet
Dr. Joon-Yeoul Oh: IEEN 5335 Principles of Optimization
27 pages
DSPLab2 Sampling Theorem
No ratings yet
DSPLab2 Sampling Theorem
8 pages
Lesson Plan
100% (2)
Lesson Plan
3 pages
Rectilinear Translation Four-Bar Flexure Mechanism Based On Four Remote Center Compliance Pivots
No ratings yet
Rectilinear Translation Four-Bar Flexure Mechanism Based On Four Remote Center Compliance Pivots
4 pages
Best Book For Each Subject
No ratings yet
Best Book For Each Subject
8 pages
Strength of Materials 2024GC. ASTU Final Exam
0% (1)
Strength of Materials 2024GC. ASTU Final Exam
13 pages
20ad41e2 - Data Science
No ratings yet
20ad41e2 - Data Science
2 pages
Unsolved Problems - Mathematics Edition: August 2020
No ratings yet
Unsolved Problems - Mathematics Edition: August 2020
28 pages
Digital Logic Design Lab Report 02
No ratings yet
Digital Logic Design Lab Report 02
6 pages
CAT-02_Paper-1_Question_29-Oct-2023_BEAS
No ratings yet
CAT-02_Paper-1_Question_29-Oct-2023_BEAS
18 pages
AI Lab
No ratings yet
AI Lab
11 pages
Pensiero Computazionale Informazioni Pratiche
No ratings yet
Pensiero Computazionale Informazioni Pratiche
29 pages
Gauss-Seidel Iterative Method: = + sin (5 Ct) cos (Ct) −ρ A 2 gh
No ratings yet
Gauss-Seidel Iterative Method: = + sin (5 Ct) cos (Ct) −ρ A 2 gh
4 pages
Work Power & Energy Past Paers 2023
No ratings yet
Work Power & Energy Past Paers 2023
46 pages
10 as Statistics and Mechanics Practice Paper E Mark Scheme
No ratings yet
10 as Statistics and Mechanics Practice Paper E Mark Scheme
10 pages
Activity 1: Medians of A Triangle Are Concurrent
No ratings yet
Activity 1: Medians of A Triangle Are Concurrent
5 pages
UGEB2530C Homework 1
No ratings yet
UGEB2530C Homework 1
8 pages
ME 372 (Chapter-4) - Extended Surfaces (Fins)
No ratings yet
ME 372 (Chapter-4) - Extended Surfaces (Fins)
38 pages
MATH 105 SAC Syllabus
No ratings yet
MATH 105 SAC Syllabus
3 pages
Report On Mastery Level
No ratings yet
Report On Mastery Level
8 pages
Yield Line Method Applied
No ratings yet
Yield Line Method Applied
144 pages
NCERT Solutions For Class 12 Maths Chapter 13 Probability Exercise 13.1
No ratings yet
NCERT Solutions For Class 12 Maths Chapter 13 Probability Exercise 13.1
22 pages
Evolution of Operations Management Past, Present and Future
100% (1)
Evolution of Operations Management Past, Present and Future
30 pages
MBK 85.e
No ratings yet
MBK 85.e
184 pages
Lab 1 Report: Rmit University Vietnam School of Science and Technology
No ratings yet
Lab 1 Report: Rmit University Vietnam School of Science and Technology
15 pages
Class 31 - Statistics 2
No ratings yet
Class 31 - Statistics 2
17 pages
Fundamentals Of Highdimensional Statistics With Exercises And R Labs Johannes Lederer pdf download
No ratings yet
Fundamentals Of Highdimensional Statistics With Exercises And R Labs Johannes Lederer pdf download
89 pages

Exp 1

Uploaded by

Exp 1

Uploaded by

Machine Learning 21BEC505

Task 2: Multiple Linear Regression With scikit-learn

print("Slope w1: ",model.coef_,"\n")

from sklearn.model_selection import train_test_split

from sklearn.linear_model import LinearRegression

plt.scatter(X_train, y_train, color='red')

plt.scatter(X_test, y_test, color='red')

print("coefficeient of determination: ",r_sq,"\n")

from sklearn.linear_model import LinearRegression

# Generate polynomial models of degrees 1 through 9 and fit to training data

# Fit model to training data

# Get predicted outputs for training and test data

# Calculate errors on training and test data

test_error = mean_squared_error(y_test, y_pred_test)

# Print coefficients and constants for polynomial regression model

# Find the degree with the lowest test error

You might also like