Regression Correlation

The document discusses regression and correlation analysis techniques. It defines dependent and independent variables, and explains how to interpret correlation coefficients and regression lines. Examples are provided to demonstrate computing correlation coefficients and linear regression equations from data.

Uploaded by

Shafqat Ullah

Research Methodologies and Biostatistics
Somia Bakhtiar Lone
Lecture 3: Regression & Correlation
Regression & Correlation
1. Scatter diagram
2. Correlation coefficient
3. Straight line regression model
4. Interpretation of the regression coefficient and the correlation coefficient
Dependent and Independent Variable
• Regression and correlation measure the relationship between two or more variables, written as
Y = f(X)
• One variable is always the response or dependent variable, that is, the variable
to be predicted from or explained by the other variables. In other words, the
dependent variable is the one which the investigator is trying to estimate
or predict (denoted y).
• The other variables are called predictors, explanatory variables, or
independent variables. In other words, the independent variable is the
variable which is under the investigator's control (denoted x).
• In a variety of applications, the dependent variable (y) of interest is a
continuous variable that we can assume, possibly after an appropriate
transformation, to be normally (symmetrically) distributed.
• Can a relationship be used to predict what happens to y as x changes (i.e.
what happens to the dependent variable as the independent variable
changes)?
Applications
In analyzing data for the health sciences disciplines, sometimes we
may, for example, be interested in studying the relationship between
1. blood pressure and age,
2. height and weight,
3. the concentration of an injected drug and heart rate,
4. the consumption level of some nutrient and weight gain,
5. total family income and medical care expenditures.
The nature and strength of the relationships between variables such as
these may be examined using linear models such as regression and
correlation analysis.
Scatter Plots
• Scatterplots play a crucial role in regression and correlation analysis by
visually representing the relationships between variables. They provide a
clear and intuitive way to observe patterns, trends, and associations
between two continuous variables.
• Visualizing Relationships between Variables: Scatterplots allow
researchers to see how one variable changes with respect to another
variable. By plotting data points on a graph, scatterplots help in
identifying the direction, strength, and form of relationships between
variables. They are essential for detecting linear or non-linear patterns in
the data.
Scatter Diagram
• The diagrammatic way of representing bivariate data is called a scatter diagram.
Suppose (x1, y1), (x2, y2), ..., (xn, yn) are n pairs of observations. If the values of
the variables x and y are plotted along the x-axis and y-axis respectively in the
xy-plane, the diagram of dots so obtained is known as a scatter diagram.
• For example, data (n = 55) on age and systolic BP were collected.
[Figure: scatter plot of systolic BP versus age.]
Example: Two weeks ago Ben started a new job as a car salesman. His
supervisor advised him that the more test drives per day he gets his
customers to take, the more sales he will make per day. He records the
following data over the past week.
[Table and scatter plot: test drives per day (x) versus sales per day (y).]

This clearly shows there is a relationship between the two variables.
As x increases, y also increases. This is called a positive linear
correlation between the two variables.
Correlation
Correlation measures the extent to which two variables are related. It
quantifies the strength and direction of the relationship between variables.
There are three types of correlation: positive, negative, and zero correlation.
• Positive Correlation: A positive correlation exists when both variables
move in the same direction. This means that as one variable increases, the
other variable also increases. An example is the relationship between
height and weight, where taller individuals tend to weigh more.
• Negative Correlation: In a negative correlation, one variable increases as
the other decreases. An example is the relationship between height above
sea level and temperature, where as altitude increases, temperature
decreases.
• Zero Correlation: Zero correlation indicates no relationship between two
variables. For instance, there is no correlation between the amount of tea
consumed and intelligence level.
Pearson's Correlation
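• Pearson's Correlation Coefficient: Pearson's correlation coefficient (r)
is a numerical measure that quantifies the strength and direction of a
linear (straight-line) relationship between two continuous variables.
Let (x1, y1), (x2, y2), ..., (xn, yn) be n pairs of observations. Then the
correlation coefficient between x and y is denoted by r and defined
as
$$ r = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2 \, \sum_{i=1}^{n} (y_i - \bar{y})^2}} $$
• or, equivalently,
$$ r = \frac{n\sum x_i y_i - \sum x_i \sum y_i}{\sqrt{\left[n\sum x_i^2 - \left(\sum x_i\right)^2\right]\left[n\sum y_i^2 - \left(\sum y_i\right)^2\right]}} $$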
Interpretation of correlation coefficient
• It ranges from -1 to +1, where +1 indicates a perfect positive linear
relationship, -1 indicates a perfect negative linear relationship, and 0
indicates no linear relationship.
• The correlation is high if observations lie close to a straight line (i.e.,
values close to +1 or -1) and low if observations are widely scattered
(correlation value close to 0).
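As a minimal sketch of how r behaves at its two extremes, the function below computes Pearson's r from deviations about the means, using only the Python standard library. The function name `pearson_r` and the data are illustrative assumptions, not taken from the lecture.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient, deviation-from-mean form."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length samples with at least 2 points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sums of squares of the deviations.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("r is undefined when one variable is constant")
    return sxy / math.sqrt(sxx * syy)

# Hypothetical data, chosen so every point lies exactly on a straight line.
x = [1, 2, 3, 4, 5]
y_up = [2, 4, 6, 8, 10]    # increases with x
y_down = [10, 8, 6, 4, 2]  # decreases with x

r_up = pearson_r(x, y_up)
r_down = pearson_r(x, y_down)
print(round(r_up, 6), round(r_down, 6))
```

Points that fall exactly on a rising line give r = +1 and points on a falling line give r = -1; real data scatter around the line and give values in between.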
Interpretation of correlation coefficient
• 1 = Perfect positive correlation
• 0.7 < r < 1 = Strong positive correlation
• 0.4 < r < 0.7 = Moderate positive correlation
• 0 < r < 0.4 = Weak positive correlation
• 0 = No correlation
• 0 > r > -0.4 = Weak negative correlation
• -0.4 > r > -0.7 = Moderate negative correlation
• -0.7 > r > -1 = Strong negative correlation
• -1 = Perfect negative correlation
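The scale above can be sketched as a small lookup function. The name `describe_r` is hypothetical, and the middle band (0.4 to 0.7) is labelled "moderate" here. The slide's ranges use strict inequalities, so the values exactly at 0.4 and 0.7 are unassigned; this sketch assumes they belong to the stronger category.

```python
def describe_r(r):
    """Map a correlation coefficient to a verbal label."""
    if not -1.0 <= r <= 1.0:
        raise ValueError("r must lie between -1 and +1")
    if r == 0:
        return "no correlation"
    direction = "positive" if r > 0 else "negative"
    size = abs(r)
    if size == 1:
        strength = "perfect"
    elif size >= 0.7:   # assumption: boundary 0.7 counts as strong
        strength = "strong"
    elif size >= 0.4:   # assumption: boundary 0.4 counts as moderate
        strength = "moderate"
    else:
        strength = "weak"
    return f"{strength} {direction} correlation"

for value in (0.95, 0.55, -0.2, -1.0, 0.0):
    print(value, "->", describe_r(value))
```

Such cut-offs are conventions rather than rules, so the labels are best read as a rough guide.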
Properties of correlation coefficient
1. The correlation coefficient lies between -1 and +1, i.e., -1 ≤ rxy ≤ 1.
2. The correlation coefficient is symmetric, i.e., rxy = ryx.
3. For two independent variables, the correlation coefficient is zero.
4. It is always unit free.
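Two of these properties, symmetry and freedom from units, can be checked numerically. The sketch below uses made-up height and weight values (an assumption for illustration only) and a hypothetical helper `pearson_r`.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient, deviation-from-mean form."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical measurements, not data from the lecture.
height_cm = [150, 160, 165, 172, 180]
weight_kg = [52, 58, 63, 70, 79]

r_xy = pearson_r(height_cm, weight_kg)
r_yx = pearson_r(weight_kg, height_cm)     # swap the roles of x and y

height_in = [h / 2.54 for h in height_cm]  # same heights, in inches
r_inches = pearson_r(height_in, weight_kg)

print(f"r_xy = {r_xy:.4f}, r_yx = {r_yx:.4f}, r (inches) = {r_inches:.4f}")
```

Swapping x and y, or changing centimetres to inches, leaves r unchanged, because the units cancel between the numerator and the denominator.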
Advantages: It summarizes the relationship in a single value, giving both the
degree and the direction of correlation.
Limitations:
1. It always assumes a linear relationship.
2. Interpreting the value of r can be difficult.
3. The value of the correlation coefficient is affected by extreme values.
4. Computing it by hand is time-consuming.
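The sensitivity to extreme values can be illustrated with a small sketch. The data are invented for the purpose: five points on a perfect line, then the same points plus one extreme observation. The helper name `pearson_r` is an assumption.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient, deviation-from-mean form."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical data lying exactly on the line y = x.
x_clean = [1, 2, 3, 4, 5]
y_clean = [1, 2, 3, 4, 5]
r_clean = pearson_r(x_clean, y_clean)

# The same data with one extreme observation (10, 0) appended.
x_outlier = x_clean + [10]
y_outlier = y_clean + [0]
r_outlier = pearson_r(x_outlier, y_outlier)

print(f"without extreme point: r = {r_clean:.2f}")
print(f"with extreme point:    r = {r_outlier:.2f}")
```

In this constructed case a single extreme point is enough to turn a perfect positive correlation into a negative one, which is why a scatter plot is worth inspecting before r is interpreted.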
• Example: Let's reconsider our previous example with Ben the car
salesman. Compute the correlation coefficient r for the data set.
• We have 7 data points, so n = 7. Let's compute r.

Since the correlation coefficient r is nearly 1, there is a strong linear
correlation between x and y. The sign of r is positive, which also indicates
that when the number of test drives per day increases, the sales of cars
also increase.
Spearman Rank correlation coefficient
• It is a non-parametric measure of correlation. This procedure makes
use of the two sets of ranks that may be assigned to the sample
values of x and y.
• The Spearman rank correlation coefficient can be computed in the
following cases:
• Both variables are quantitative.
• Both variables are qualitative ordinal.
• One variable is quantitative and the other is qualitative ordinal.
Procedure:
1. Rank the values of X from 1 to n where n is the sample size.
2. Rank the values of Y from 1 to n.
3. Compute the differences di = (rank of Xi) - (rank of Yi).
4. Apply the following formula:
$$ r_s = 1 - \frac{6 \sum_{i=1}^{n} d_i^2}{n(n^2 - 1)} $$
5. The value of rs denotes the magnitude and nature of the association,
with the same interpretation as simple r.
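The procedure above can be sketched in a few lines, using the standard rank-difference formula rs = 1 - 6 Σ di² / (n(n² - 1)). The function names and the scores are illustrative assumptions, and the sketch assumes no tied values, since that formula is exact only without ties.

```python
def ranks(values):
    """Rank values from 1 (smallest) to n; this sketch assumes no ties."""
    if len(set(values)) != len(values):
        raise ValueError("tied values are not handled in this sketch")
    position = {v: i + 1 for i, v in enumerate(sorted(values))}
    return [position[v] for v in values]

def spearman_rs(xs, ys):
    """Spearman rank correlation via the rank-difference formula."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length samples with at least 2 points")
    n = len(xs)
    rank_x, rank_y = ranks(xs), ranks(ys)                        # steps 1 and 2
    sum_d2 = sum((a - b) ** 2 for a, b in zip(rank_x, rank_y))   # step 3
    return 1 - 6 * sum_d2 / (n * (n ** 2 - 1))                   # step 4

# Hypothetical scores for five subjects.
score_a = [86, 97, 99, 100, 101]
score_b = [2, 20, 28, 27, 50]

rs = spearman_rs(score_a, score_b)
print(round(rs, 2))  # ranks differ only for the 3rd and 4th subjects -> 0.9
```

Because only the ranks enter the calculation, any monotone increasing relationship (linear or not) gives rs = 1.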
Introduction to Regression
Regression analysis is a statistical method used to examine the relationship between one dependent
variable and one or more independent variables. It aims to understand how the value of the
dependent variable changes when one or more independent variables are varied.
Purpose: The primary purpose of regression analysis is to predict the value of the dependent
variable based on the values of one or more independent variables. It helps in understanding the
strength and direction of the relationship between variables, making it a valuable tool in forecasting
and decision-making.
Types of Regression: There are many types of regression. Two main types are:
• Linear Regression: Linear regression is a type of regression analysis where the relationship
between the dependent variable and independent variable(s) is modeled as a linear equation. It
is used when there is a linear relationship between the variables.
• Multiple Regression: Multiple regression involves predicting the value of a dependent variable
based on two or more independent variables. It extends the concepts of simple linear regression
to more complex relationships.
Linear Regression
• In linear regression, the relationship between the dependent variable (Y) and
independent variable (X) is represented by the equation
Y=a+bX
• Here, "a" represents the intercept of the line with the Y-axis, and "b"
represents the slope of the line, indicating how much Y changes for a unit
change in X.
• Interpretation of Coefficients (a and b):
Intercept (a): The intercept "a" in the linear regression equation represents the
value of Y when X is zero. It indicates where the regression line crosses the Y-
axis.
Slope (b): The slope "b" in the linear regression equation signifies how much Y
changes for a one-unit change in X. It reflects the rate of change in Y with
respect to X.
Straight line Regression Model
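The equation of the regression line is y = b + mx, where
$$ m = \frac{n\sum x_i y_i - \sum x_i \sum y_i}{n\sum x_i^2 - \left(\sum x_i\right)^2}, \qquad b = \frac{\sum y_i - m\sum x_i}{n} $$
(Here b is the intercept and m is the slope; they play the roles of a and b
in Y = a + bX above.)
• The model above is referred to as simple because it contains only one
independent variable (simple linear regression model).
• It is linear because the independent variable appears only in the first
power; if we graph the mean of Y versus X, the graph is a straight line
with intercept b and slope m.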
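A minimal sketch of fitting y = b + mx by ordinary least squares, using the standard formulas m = (nΣxy - ΣxΣy) / (nΣx² - (Σx)²) and b = (Σy - mΣx) / n. The function name `fit_line` and the data are assumptions for illustration; they are not the lecture's data set.

```python
def fit_line(xs, ys):
    """Return (m, b) for the least-squares line y = b + m*x."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length samples with at least 2 points")
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)
    denominator = n * sum_x2 - sum_x ** 2
    if denominator == 0:
        raise ValueError("slope is undefined when all x values are equal")
    m = (n * sum_xy - sum_x * sum_y) / denominator  # slope
    b = (sum_y - m * sum_x) / n                     # intercept
    return m, b

# Hypothetical data lying exactly on y = 1 + 2x.
x = [1, 2, 3, 4, 5]
y = [3, 5, 7, 9, 11]

m, b = fit_line(x, y)
print(f"y = {b:.2f} + {m:.2f}x")
```

With data that lie exactly on a line, the fit recovers that line; with scattered data the same formulas give the line that minimises the sum of squared vertical distances.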
Example: Find the equation of the regression line. Round the slope m
and the intercept b to two decimals.
• We have 6 data points, so n = 6. Let's compute the slope m of the regression line.

• So m = 0.84: increasing the number of test drives by one unit increases sales by
0.84 units. Now we compute the y-intercept b.

This gives b = -1.53. The regression line for this data set is
y = -1.53 + 0.84x
Now we can substitute values of x into the above equation to compute the fitted
values of y and plot the regression line.
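As a short sketch of that last step, the fitted line y = -1.53 + 0.84x can be evaluated at a few x values. The coefficients come from the example; the x values below are hypothetical test-drive counts, since the original data table is not available here.

```python
INTERCEPT = -1.53  # b, from the worked example
SLOPE = 0.84       # m, from the worked example

def fitted_sales(test_drives):
    """Fitted value of y (sales per day) for a given x (test drives per day)."""
    return INTERCEPT + SLOPE * test_drives

# Hypothetical x values, chosen only to illustrate the calculation.
x_values = [2, 4, 6]
fitted = [round(fitted_sales(x), 2) for x in x_values]

for x, y_hat in zip(x_values, fitted):
    print(f"x = {x}: fitted y = {y_hat}")
```

Plotting these (x, fitted y) pairs and joining them gives the regression line; each extra test drive raises the fitted value by the slope, 0.84.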
