Experiment - 8

Uploaded by

gowrishchhabra


EXPERIMENT – 8

Aim: Exercises to draw a scatter diagram and residual plots, and to identify outliers,
leverage and influential data points in R

Theory and Technique:

Scatter Plot - A scatter plot is a set of points, each representing one observation, with
the value of one variable on the horizontal (X) axis and the value of the other on the
vertical (Y) axis. The pattern of the resulting points shows whether, and how strongly,
the two variables are correlated.

In R, a scatter plot is drawn with the plot() function.

Syntax: plot(x, y, main, xlab, ylab, xlim, ylim, axes)

Code:
x <- c(50,50,55,60,65,65,65,60,60,50)
y <- c(11,13,14,16,16,15,15,14,13,13)
plot(x, y, main="Scatter Plot", xlab="Sales", ylab="Expenses")
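
The correlation that the scatter plot shows visually can also be expressed as a number. As a minimal sketch, the same x and y vectors can be passed to cor(), which returns the Pearson correlation coefficient by default. The variable name r is chosen here for illustration only.

```r
# Same data as the scatter plot above
x <- c(50,50,55,60,65,65,65,60,60,50)
y <- c(11,13,14,16,16,15,15,14,13,13)

# Pearson correlation coefficient (a value between -1 and 1)
r <- cor(x, y)
print(r)
```

A value close to 1 or -1 corresponds to points lying near a straight line; a value near 0 corresponds to no linear pattern.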

A residual is the difference between an observed value and the value predicted by the
regression. Residual plots are often used to assess whether or not the residuals are
normally distributed and whether or not they exhibit heteroscedasticity (non-constant
variance).

Code:
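x <- c(6,7,7,8,10,10,11,12,14,15,16)
y <- c(55,40,50,41,35,28,38,32,28,18,13)
mod <- lm(y ~ x)   # fit the simple linear regression of y on x
summary(mod)
plot(x, y, main="Size of Data Vs Requests", xlab="Gigabytes", ylab="Processed Requests",
     pch=16, col="blue")
# Draw the fitted line directly from the model (intercept about 70.16, slope about -3.39)
abline(mod, col="red")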
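
The listing above draws the data with the fitted line, but it does not plot the residuals themselves. The following is a minimal sketch of one way to do so, assuming the same data: residuals against fitted values (to look for heteroscedasticity) and a normal Q-Q plot (to look for departures from normality). The name res and the plot titles are illustrative choices, not part of the original exercise.

```r
x <- c(6,7,7,8,10,10,11,12,14,15,16)
y <- c(55,40,50,41,35,28,38,32,28,18,13)
mod <- lm(y ~ x)

# Residuals = observed y minus fitted y
res <- resid(mod)

# Residuals vs fitted values: look for a funnel shape or a curve
plot(fitted(mod), res, main="Residuals vs Fitted",
     xlab="Fitted values", ylab="Residuals", pch=16, col="blue")
abline(h = 0, lty = 2, col = "red")   # reference line at zero

# Normal Q-Q plot: points near the line suggest roughly normal residuals
qqnorm(res)
qqline(res, col = "red")
```

If the residuals scatter evenly around the zero line with no visible pattern, the constant-variance assumption looks reasonable for these data.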

Outliers: Outliers are points that are distinct and deviant from the bulk of the dataset. In
general, outliers have large residuals, meaning that the difference between the observed
and predicted values is large.

Code:
data <- data.frame(x, y)
plot(data$x, data$y)
# Example: Detecting outliers
# Identify observations with high residuals (more than 2 standard deviations)
outliers <- which(abs(resid(mod)) > 2 * sd(resid(mod)))
# Repeat on simulated data: a linear trend plus random noise
X <- 1:100
Y <- 2 * X + rnorm(100, mean = 0, sd = 10)
data <- data.frame(X = X, Y = Y)
model <- lm(Y ~ X, data = data)
outliers <- which(abs(resid(model)) > 2 * sd(resid(model)))
plot(data$X, data$Y)
points(data$X[outliers], data$Y[outliers], col = "red", pch = 19)
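
A common variant of the rule above uses standardized residuals, which R provides through rstandard(). The sketch below is an illustration under stated assumptions: the seed, the planted outlier at observation 50, and the cutoff of 2 are all choices made here, not part of the original exercise.

```r
set.seed(1)                      # assumption: fixed seed for reproducibility
X <- 1:100
Y <- 2 * X + rnorm(100, mean = 0, sd = 10)
Y[50] <- Y[50] + 80              # assumption: plant one obvious outlier

model <- lm(Y ~ X)

# Standardized residuals: each residual divided by its estimated standard error
std_res <- rstandard(model)

# Flag observations whose standardized residual exceeds 2 in absolute value
outliers <- which(abs(std_res) > 2)

plot(X, Y)
points(X[outliers], Y[outliers], col = "red", pch = 19)
```

Because each residual is scaled by its own standard error, this version accounts for the fact that residuals at the ends of the X range have smaller variance than those in the middle.
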

Influential Points:
An influential point is a point that has a large impact on the fitted regression: removing
it changes the estimated coefficients noticeably. Outliers and influential points are not
the same thing. A point can be an outlier without being influential. A point can be
influential without being an outlier. A point can be both or neither.

Code:
# Cook's distance for each observation in the fitted model
influential <- cooks.distance(model)
threshold <- 3 / length(data$X)
influential_obs <- which(influential > threshold)

# Highlight influential observations in the scatterplot
plot(data$X, data$Y)
points(data$X[influential_obs], data$Y[influential_obs], col = "orange",
       pch = 19)
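
The aim also mentions leverage, which none of the listings above computes. As a minimal sketch, leverage can be read from the diagonal of the hat matrix with hatvalues(). The data, the seed, and the 2p/n cutoff (a common rule of thumb, where p is the number of coefficients and n the number of observations) are assumptions made for illustration.

```r
set.seed(42)                     # assumption: fixed seed for reproducibility
X <- c(1:30, 60)                 # last point lies far from the others in X
Y <- 2 * X + rnorm(31, mean = 0, sd = 10)

model <- lm(Y ~ X)

# Leverage: diagonal entries of the hat matrix
lev <- hatvalues(model)

p <- length(coef(model))         # number of estimated coefficients (2 here)
n <- length(X)                   # number of observations
high_lev <- which(lev > 2 * p / n)

plot(X, Y)
points(X[high_lev], Y[high_lev], col = "purple", pch = 19)
```

Leverage depends only on the X values, so a high-leverage point is not necessarily an outlier or influential; it becomes influential when its Y value also pulls the fitted line away from the trend of the other points.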
