Practical # 10

Department of Software Engineering

Mehran University of Engineering and Technology, Jamshoro

Course: SWE – Data Analytics and Business Intelligence


Instructor: Ms Sana Faiz        Practical/Lab No.: 10
Date:                           CLOs: 04
Signature:                      Assessment Score:

Topic: To understand the basics of Python


Objectives: To become familiar with Linear Regression using scikit-learn

Lab Discussion: Theoretical concepts and Procedural steps

Linear regression
 Linear regression is a basic and commonly used type of predictive analysis. The overall
idea of regression is to examine two things:
(1) Does a set of predictor variables do a good job of predicting an outcome
(dependent) variable?
(2) Which variables in particular are significant predictors of the outcome
variable?
Simple linear regression
 1 dependent variable (interval or ratio), 1 independent variable
 These regression estimates are used to explain the relationship between one dependent
variable and one independent variable. The simplest form of the regression equation,
with one dependent and one independent variable, is y = c + b*x, where y is the estimated
dependent variable score, c is the constant (intercept), b is the regression coefficient
(slope), and x is the score on the independent variable.
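As a minimal sketch of this formula (using made-up values for c and b, not estimates fitted from the lab's dataset), the equation can be evaluated directly in Python:

# Illustrative values only: c and b are NOT fitted from salaryData.csv
c = 26000   # constant (intercept): estimated salary at 0 years of experience
b = 9500    # regression coefficient (slope): salary increase per extra year of experience

def predict_salary(x):
    # Estimated dependent variable score for an independent variable score x
    return c + b * x

print(predict_salary(5))   # 26000 + 9500*5 = 73500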
Regression variables
 Naming the variables: there are many names for a regression's dependent variable. It may
be called an outcome variable, criterion variable, endogenous variable, or regressand.
 The independent variables can be called exogenous variables, predictor variables, or
regressors.
Import libraries and read data from csv files

# Import the necessary libraries


import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
# Import the dataset
dataset = pd.read_csv('salaryData.csv')
X = dataset.iloc[:, :-1].values # Assuming the feature is in the first column
y = dataset.iloc[:, -1].values # Assuming the target is in the second column

Train the regressor and predict outcomes

# Split the dataset into the training set and test set.
# We're using a 1/3 test split, so out of 30 rows, 20 rows go into the training set
# and 10 rows go into the test set.
xTrain, xTest, yTrain, yTest = train_test_split(X, y, test_size=1/3, random_state=0)

# Optional: check the split data (this step depends on your needs)
show_data = pd.DataFrame({'Training Set': xTrain.flatten(), 'Training Target': yTrain})
print(show_data)

# Creating a LinearRegression object and fitting it on our training set.


linearRegressor = LinearRegression()
linearRegressor.fit(xTrain.reshape(-1, 1), yTrain)

# Predicting the test set results


yPrediction = linearRegressor.predict(xTest.reshape(-1, 1))

# Flattening the prediction to match original test set format (if needed)
yPrediction = yPrediction.flatten()
print(yPrediction)
Visualizing the training set and training targets

Showing actual and predicted data and visualizing it

# Showing test set and predicted values side by side


results = pd.DataFrame({
'Test Set': xTest.flatten(),
'Actual Value': yTest,
'Predicted Values': yPrediction
})
print(results)

# Visualising the training set results


plt.scatter(xTrain, yTrain, color='red')
plt.plot(xTrain, linearRegressor.predict(xTrain.reshape(-1, 1)), color='blue')
plt.title('Salary vs Experience (Training set)')
plt.xlabel('Years of Experience')
plt.ylabel('Salary')
plt.show()

Actual and predicted values


Plotting test set data

# Visualising the test set results


plt.scatter(xTest, yTest, color='red')
# Use the training data to plot the regression line
plt.plot(xTrain, linearRegressor.predict(xTrain.reshape(-1, 1)), color='blue')
plt.title('Salary vs Experience (Test set)')
plt.xlabel('Years of Experience')
plt.ylabel('Salary')
plt.show()

Regression metrics

from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

# Calculating and printing the performance metrics


print("Mean Absolute Error:", mean_absolute_error(yTest, yPrediction))
print("Mean Squared Error:", mean_squared_error(yTest, yPrediction))
print("Variance Score (R^2):", r2_score(yTest, yPrediction))

Regression metrics
 Mean Absolute Error: the mean absolute error (MAE) is a measure of the difference between two
continuous variables; here, it is the average absolute difference between the predicted and
actual values.
 Mean Squared Error: the mean squared error (MSE), or mean squared deviation, of an estimator
measures the average of the squares of the errors, that is, the average squared difference
between the estimated values and what is estimated. MSE is a risk function, corresponding to
the expected value of the squared error loss.
 Variance Score (R²): the coefficient of determination, which indicates the proportion of the
variance in the dependent variable that is explained by the model.
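As a minimal sketch (using made-up actual and predicted values, not the lab's salary data), both metrics can be computed by hand with NumPy and checked against scikit-learn's functions:

# Illustrative values only, not taken from salaryData.csv
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error

actual = np.array([40000.0, 50000.0, 60000.0])
predicted = np.array([42000.0, 48000.0, 61000.0])

errors = actual - predicted
print(np.mean(np.abs(errors)))                 # MAE = (2000 + 2000 + 1000) / 3 ≈ 1666.67
print(np.mean(errors ** 2))                    # MSE = (4e6 + 4e6 + 1e6) / 3 = 3e6
print(mean_absolute_error(actual, predicted))  # matches the manual MAE
print(mean_squared_error(actual, predicted))   # matches the manual MSE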
Class Tasks
Submission Date: --

 Perform linear regression on the student dataset uploaded on the drive.


 Perform linear regression on the dataset of your own choice.
