Assignment 1

The document outlines an assignment to develop a Linear Regression model using the Least Squares Estimation method to predict salaries based on years of experience. It includes code for data preprocessing, model training, and performance evaluation using Mean Squared Error (MSE). The implementation features data visualization and error calculation for both training and testing datasets.

Uploaded by

Yash Shirsat


Assignment 1

Name: Satyajit Shinde

Div: TY AI C Roll No.: 41
PRN: 12211701

Develop and implement a Linear Regression model using the Least Squares Estimation method to predict a target variable based on a given dataset. Calculate the sum of squared differences between the actual and predicted values. The implementation should include dataset preprocessing, model training, and performance evaluation using metrics such as Mean Squared Error (MSE).
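For reference, the standard closed-form least squares estimates for a line $\hat{y} = a + b x$ fitted to $N$ points, together with the two error measures named in the task, can be written as follows. The symbols $a$, $b$, and $N$ follow the variable names in the code; SSE is a label introduced here for the sum of squared differences.

```latex
\begin{align}
b &= \frac{N \sum x_i y_i - \sum x_i \sum y_i}{N \sum x_i^2 - \left(\sum x_i\right)^2}, \\
a &= \frac{\sum y_i - b \sum x_i}{N}, \\
\mathrm{SSE} &= \sum_{i=1}^{N} \left(y_i - (a + b x_i)\right)^2, \\
\mathrm{MSE} &= \frac{\mathrm{SSE}}{N}.
\end{align}
```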

Code:

import pandas as pd

import numpy as np

import matplotlib.pyplot as plt

df = pd.read_csv("C:\\Users\\user\\Desktop\\Sem 6\\SI\\salary_data.csv")

df.head()

df.shape

# Split the data into X and y,

# where X is the independent variable and y is the dependent variable.

X = df.loc[:,"YearsExperience"]

y = df.loc[:,"Salary"]

# Split X and y into X_train, y_train, X_test, y_test:
# the first 21 rows are used for training, the remaining rows for testing.

X_train = X.iloc[:21]

y_train = y.iloc[:21]

X_test = X.iloc[21:]

y_test = y.iloc[21:]

X_train,y_train

# Calculating Line Equation

N = len(X_train)

sum_X = sum(X_train)

sum_Y = sum(y_train)

sum_XY = sum(X_train*y_train)

sum_X_square = sum(X_train**2)

b = ((N * sum_XY) - (sum_X * sum_Y))/((N*sum_X_square)-(sum_X**2))

a = (sum_Y - (b*sum_X))/N
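As a sanity check on the closed-form expressions above, the sketch below applies the same formulas to a small synthetic dataset and compares the result with `numpy.polyfit`. The function name `fit_line` and the synthetic numbers are illustrative assumptions, not part of the assignment or its dataset.

```python
import numpy as np


def fit_line(x, y):
    """Return (a, b) for the least squares line y = a + b*x."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = len(x)
    sum_x, sum_y = x.sum(), y.sum()
    sum_xy = (x * y).sum()
    sum_x_square = (x ** 2).sum()
    b = (n * sum_xy - sum_x * sum_y) / (n * sum_x_square - sum_x ** 2)
    a = (sum_y - b * sum_x) / n
    return a, b


# Synthetic data (hypothetical): an exact line plus a small fixed perturbation.
x_demo = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y_demo = 30000.0 + 9000.0 * x_demo + np.array([150.0, -200.0, 50.0, 100.0, -100.0])

a_demo, b_demo = fit_line(x_demo, y_demo)

# np.polyfit returns coefficients from highest degree down: [slope, intercept].
slope_ref, intercept_ref = np.polyfit(x_demo, y_demo, 1)

print(f"closed form: a={a_demo:.2f}, b={b_demo:.2f}")
print(f"np.polyfit : a={intercept_ref:.2f}, b={slope_ref:.2f}")
```

If the two lines of output agree, the hand-written formulas and the library fit describe the same line.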

# Predicting Value

def pred(a,b,x):
    return a + b*x

for x in X_train:
    print(f"Experience : {x} and expected salary is : {pred(a,b,x)}")

for x in X_test:
    print(f"Experience : {x} and expected salary is : {pred(a,b,x)}")

c = pred(a,b,6)

c

# Predicting on the test and train sets

pred_test = pred(a,b,X_test)

pred_train =pred(a,b,X_train)

pred_test

pred_train

# plotting Scatter Plot

plt.plot(X_train,pred_train,color="yellow")

plt.scatter(X_train,y_train)

plt.show()

# Plotting Predicted values and Actual values

plt.plot(X_test, pred_test, label='Model Prediction')

plt.scatter(X_test, pred_test, color='red', label='Predicted')

plt.scatter(X_test, y_test, label='Actual')

# Label each predicted point (red) and each actual point with its value.
# The loop variables are named so that they do not overwrite the Series y.
for x_val, y_val in zip(X_test, pred_test):
    plt.annotate(f'{y_val:.2f}', (x_val, y_val), textcoords="offset points", xytext=(5, 5), ha='left', color='red')

for x_val, y_val in zip(X_test, y_test):
    plt.annotate(f'{y_val:.2f}', (x_val, y_val), textcoords="offset points", xytext=(5, 5), ha='right')

# Dashed vertical lines show the gap between each prediction and its actual value.
for x_val, y_pred, y_actual in zip(X_test, pred_test, y_test):
    plt.plot([x_val, x_val], [y_pred, y_actual], color='gray', linestyle='--')

plt.xlabel('X_test')

plt.ylabel('Values')

plt.legend()

plt.show()

# Calculating Mean Squared Error

error_list = []

def mean_squared_error(true,pred):
    # Element-wise squared differences between actual and predicted values
    squared_error = (true - pred)**2
    error_list.append(squared_error)
    mse = sum(squared_error) / len(true)
    return mse

mse_test = mean_squared_error(y_test,pred_test)

mse_train = mean_squared_error(y_train,pred_train)

print(f"Mean Squared Error for testing set is : {mse_test}")

print(f"Mean Squared Error for training set is : {mse_train}")
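The assignment also asks for the sum of squared differences, which the listing does not report separately. A minimal sketch, assuming array-like inputs of actual and predicted values; the names `sse_and_mse`, `actual`, and `predicted` and the sample numbers are hypothetical:

```python
import numpy as np


def sse_and_mse(actual, predicted):
    """Return the sum of squared errors and the mean squared error."""
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    residuals = actual - predicted
    sse = float(np.sum(residuals ** 2))
    mse = sse / len(actual)
    return sse, mse


# Hypothetical values, for illustration only.
actual = [3.0, 5.0, 7.0, 9.0]
predicted = [2.0, 5.0, 8.0, 11.0]

sse, mse = sse_and_mse(actual, predicted)
print(f"SSE = {sse}, MSE = {mse}")  # SSE = 6.0, MSE = 1.5
```

With the variables from the listing, the same function could be called as `sse_and_mse(y_test, pred_test)`.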

# Calculating Mean Absolute Error

def abs_error(true,pred):
    error = abs(true -pred)
    print(f"Error is:\n{error}")
    final = sum(error)
    ae = final/len(true)
    return ae

error_list_mse = []

error_list_mae = []

# Note: a and b are fixed, so every iteration computes the same MAE and MSE.
for i in range(N):
    y_pred = np.array([pred(a,b,x) for x in X_train])
    mae = abs_error(y_train,y_pred)
    mse = mean_squared_error(y_train,y_pred)
    error_list_mse.append(mse)
    error_list_mae.append(mae)
