LinearRegression_Iris
A linear model is defined by: y = b0 + b1*x1 + ... + bn*xn, where y is the target variable, the x's are the feature columns of the data X, and the b's are the coefficients (b0 being the intercept).
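As a minimal sketch of that equation, here is the single-feature case with arbitrarily chosen coefficients (b0 and b1 are made-up illustration values, not fitted ones):

```python
import numpy as np

# Hypothetical intercept b0 and slope b1, chosen for illustration only
b0, b1 = 2.0, 0.5
x = np.array([0.0, 2.0, 4.0])

# y = b0 + b1 * x, applied element-wise across the feature vector
y = b0 + b1 * x
print(y)  # [2. 3. 4.]
```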
The Salary dataset consists of two variables [YearsExperience, Salary]; the goal there is to predict the salary one will earn from years of experience.
We will kick off with a very famous dataset loved by machine learning practitioners...
In [2]:
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

iris_df = pd.read_csv('iris.csv')
iris_df.head()
Out[2]:
Id  SepalLengthCm  SepalWidthCm  PetalLengthCm  PetalWidthCm  Species
In [3]:
from sklearn.datasets import load_iris
data = load_iris()  # load the bundled iris dataset; the cells below use `data`
data.feature_names  # names of the feature (independent) columns
Out[3]:
['sepal length (cm)',
'sepal width (cm)',
'petal length (cm)',
'petal width (cm)']
In [4]:
data.target_names # and over here we have the names of the species, i.e. our target (dependent) values
Out[4]:
array(['setosa', 'versicolor', 'virginica'], dtype='<U10')
In [5]:
data.target # over here we see that by calling target on the dataset we get the integer
            # (dummy) representations of the values in the dependent column
Out[5]:
array([0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2,
2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2,
2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2])
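To see how those integer codes map back to the species names, you can index `target_names` with the `target` array (a small check, assuming the standard scikit-learn iris loader):

```python
from sklearn.datasets import load_iris

data = load_iris()

# Indexing the names array with the integer codes decodes every sample's label
labels = data.target_names[data.target]
print(labels[0])   # 'setosa'  (code 0)
print(labels[-1])  # 'virginica' (code 2)
```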
In [102]:
X = data.data # data.data refers to the values in the independent columns
X.shape # check the shape... happy with that
Out[102]:
(150, 4)
In [103]:
y = data.target # collecting the integer representation of the dependent values
y.shape # check the shape... not happy; let's reshape to 2D
Out[103]:
(150,)
In [104]:
# because sklearn doesn't like 1D arrays or vectors we're going to reshape it
y = y.reshape(-1, 1)
y.shape # get it to 2D
Out[104]:
(150, 1)
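The effect of `reshape(-1, 1)` is easiest to see on a tiny array: it turns a 1D vector of length n into a 2D column matrix of shape (n, 1), with `-1` telling NumPy to infer the row count:

```python
import numpy as np

v = np.arange(5)        # shape (5,): a 1D vector
col = v.reshape(-1, 1)  # shape (5, 1): a 2D column vector; -1 infers the 5
print(v.shape, col.shape)  # (5,) (5, 1)
```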
[Scatter plot cell: features plotted against the target, x-axis labeled 'sepal length']
We can't really see how the irises are grouped, but we can clearly see that there is a linear relationship here.
In [105]:
from sklearn.model_selection import train_test_split # the tool for splitting the data
from sklearn.linear_model import LinearRegression # we know we're going to use linear regression for our prediction, so we import the class as well
# over here we split the data into the train (X, y) and test (X, y) sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.20)
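A quick sanity check on what the split produces, on a toy array (the `random_state` argument is an addition for reproducibility; the cell above leaves it unset, so its split differs between runs):

```python
import numpy as np
from sklearn.model_selection import train_test_split

X = np.arange(40).reshape(10, 4)  # toy feature matrix: 10 samples, 4 features
y = np.arange(10).reshape(-1, 1)  # matching 2D column of targets

# test_size=0.20 holds out 2 of the 10 samples for testing
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.20, random_state=0)
print(X_train.shape, X_test.shape)  # (8, 4) (2, 4)
```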
In [106]:
lr = LinearRegression() #create our linear model
#fit the model on the training data, then predict on X_test
iris_model = lr.fit(X_train, y_train)
predictions = iris_model.predict(X_test)
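After fitting, the learned coefficients and intercept can be inspected directly on the model object. A sketch, fitting on the full dataset for illustration rather than the author's train split:

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LinearRegression

data = load_iris()
model = LinearRegression().fit(data.data, data.target)

# One coefficient per feature (the b1..b4 in the equation), plus one intercept (b0)
print(model.coef_.shape)  # (4,)
print(model.intercept_)
```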
In [48]:
#plotting the error in our predictions (yerr must be non-negative, so take the absolute value)
plt.errorbar(range(1, len(y_test)+1), y_test.ravel(),
             yerr=np.abs(y_test - predictions).ravel(), fmt='^k', ecolor='red')
Out[48]:
<ErrorbarContainer object of 3 artists>
In [50]:
from sklearn.metrics import r2_score # this function will help us calculate the score of our predictions
r2_score(y_test, predictions)
Out[50]:
0.904901491129183
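R² can also be computed by hand as 1 - SS_res / SS_tot, which is what `r2_score` does under the hood; a small check on made-up numbers (the y arrays here are illustration values, not the notebook's split):

```python
import numpy as np
from sklearn.metrics import r2_score

y_true = np.array([1.0, 2.0, 3.0, 4.0])
y_pred = np.array([1.1, 1.9, 3.2, 3.8])

ss_res = ((y_true - y_pred) ** 2).sum()        # residual sum of squares
ss_tot = ((y_true - y_true.mean()) ** 2).sum() # total sum of squares
manual = 1 - ss_res / ss_tot

print(manual)  # 0.98
assert np.isclose(manual, r2_score(y_true, y_pred))
```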
In [51]:
#so to get the RMSE we first take the difference between y_test and the predictions,
#then raise it to the power of 2,
#then take the mean, and finally apply the numpy square root function
np.sqrt(((predictions - y_test)**2).mean())
Out[51]:
0.24520071494252943
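The hand-rolled RMSE above matches taking the square root of scikit-learn's `mean_squared_error`; a quick equivalence check on toy values (the y arrays are illustration data, not the notebook's predictions):

```python
import numpy as np
from sklearn.metrics import mean_squared_error

y_true = np.array([1.0, 2.0, 3.0])
y_pred = np.array([1.0, 2.0, 5.0])

# Manual RMSE, exactly as in the cell above
rmse_manual = np.sqrt(((y_pred - y_true) ** 2).mean())

# Same quantity via sklearn's MSE helper
rmse_sklearn = np.sqrt(mean_squared_error(y_true, y_pred))

assert np.isclose(rmse_manual, rmse_sklearn)
```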