Topic:-Regression: Name: - Teotia Nidhi Class: - M.SC Biotechnology
Topic:-Regression: Name: - Teotia Nidhi Class: - M.SC Biotechnology
TOPIC :- REGRESSION
INTRODUCTION
• REGRESSION
• REGRESSION LINES
• REGRESSION COEFFICIENT
TYPES OF REGRESSION
METHODS OF STUDYING
REGRESSION
PROPERTIES OF REGRESSION
SOLVED EXAMPLES
INTRODUCTION
2) Regression Equation Y on X :-
Y-𝑌̅ = 𝑏𝑦𝑥 (X-𝑋̅)
𝜎𝑦
=r (X-𝑋̅) [It estimate Y for a given value of X]
𝜎𝑥
Where X=Value of x
𝑋̅=Mean of x
σ𝑥= Standard deviation of x series
r = Correlation coefficient
Y= Value of Y
𝑌̅= Mean of Y
σy = Standard deviation of y series
b = Slope or coefficient of regression
Regression Lines :-
If a bivariate data are plotted as points on graph paper , it will be found that
the concentration point follows a certain patterns showing the relationship
between the variables. When the trend points are found to be linear, we
determine the best fitting straight line by Least Square Method.
Such straight lines which are used to obtain best estimates of one variables
for given value of the other ,are called regression lines.
If two variables are linearly related ,then relation can be expressed as Y=bx
= a. where b =slope of the line relating Y to X and ‘a’ is the ‘Y’ intercept
of the line.
A line regression is the straight line which gives the best fit in the least
square sense to given sets of data.
Regression coefficient:-
1) The regression coefficient (b) is an expression of how much ( on the
average) one dependent variable (Y) may be expected to change per unit
change in some other independent variable (X).
2) It is denoted by letter ‘b’.
3) The regression coefficient of Y on X =
𝜎𝑦 𝑆.𝐷.𝑜𝑓 𝑌𝑠𝑒𝑟𝑖𝑒𝑠
= byx = r .
𝜎𝑥 𝑆.𝐷.𝑜𝑓 𝑋 𝑠𝑒𝑟𝑖𝑒𝑠
The regression coefficient of X on Y =
𝜎𝑥 𝑆.𝐷.𝑜𝑓 𝑋 𝑠𝑒𝑟𝑖𝑒𝑠
= bxy = r .
𝜎𝑦 𝑆.𝐷.𝑜𝑓 𝑌 𝑠𝑒𝑟𝑖𝑒𝑠
Types of regression:-
a) SIMPLE REGRESSION :-
I. Here the dependent variable (criterion) is a function of single
independent variable (predictor) .
II. The score of the dependent variable is predicted from the given scores
of single predictor.
EXAMPLE : Height of person on his weight
Simple liner regression model-
The regression model describes the mean of that normally distributed dependent
variable Y as a function of the predictor or independent variable X:
𝑌𝑖 = 𝛽𝑜 + 𝛽1 𝑋𝑖 + 𝜀𝑖
The scatter diagram is a useful diagnostic tool for checking out the validity of
features of the simple linear regression model.
Testing for independence –
In addition to being able to predict the mean at various levels of the independent
variables , regression data can also used to test for the independence between the
two variables under investigation. Such a statistical test can be viewed in two
ways: through the coefficient of correlation or through the slope.
1) The correlation coefficient r measures the strength of the relation between
two variables it is an estimate of an unknown population correlation
coefficient ρ (rho),the same way the sample mean 𝑥̅ is used as an estimate
of an unknown population mean µ.
t= r√(𝑛 − 2)/(1 − 𝑟 2 )
the procedure is often performed as two sided ,i.e
𝐻𝐴 : ρ≠0
And it is t test with n-2 degrees of freedom
2) The role of the slope 𝛽1 ,since the regression model describes the mean of
the dependent variable Y as a function of the predictor or the independent
variable X, µ𝑦 =𝛽0 =𝛽 1 x
If 𝛽1 =0 ,Y and X would be independent. The test for 𝐻0 : 𝛽1 = 0.
b) MULTIPLE REGRESSION:-
i. Here the dependent variable is a function of two or more predictors
ii. The scores are predicted from the scores of more than one predictors
iii. It may be linear or non linear.
EXAMPLE : Thyroid calcitonin on combination of thyroxine secretion and
serum calcium.
Regression model with several independent variables-
Suppose that we want to consider k independent variables simultaneously-
𝑘
µ𝑖 = 𝛽0 = ∑ 𝛽𝑗 𝑥𝑗𝑖
𝑗=1
The model above is referred to as multiple linear regression model.
Effective modification -consider a multiple regression model involving
two independent variables :
Polynomial regression:-
Consider the multiple regression model involving one independent
variable:-
C) LINEAR REGRESSION-
i. here the dependent variable is linearly correlated with the predictor (independent
variable)
ii. the scores of dependent variables are predicted by working out an equation for a
straight line, depending on the linear association between the two.
The statistical analysis is to find out the exact position of the straight line is known as
linear regression analysis.
ii. Its equation is y = a = bx
iii. The slope of the line b in the equation is known as the regression coefficient
it shows that y changes b times as fast as x.
iv. Symbolically the regression coefficient of y on x is 𝑏𝑦𝑥 .
(B) i. If the line of regression is so chosen that the sum of square of deviation
parallel to the axis of x is minimized fig 14.2 (b), it is called the line of
regression of X on Y and it gives the best estimate of x for any value of y.
ii. The regression equation in this case is x = a = by.
Computation of linear equation
Properties of regression –
Solved examples
r= 0.6 𝑋̅ = 10 𝑌̅ =20
𝜎𝑦
byx= r = 0.6×20/15 = 0.8
𝜎𝑥
15
bxy=o.6 × = 0.45
20