BCOR 3750 Linear Regression Models
BCOR 3750 Linear Regression Models
Covariance
• Measure of linear association between two variables, X and Y.
• Average of the x-deviations and y-deviations from their respective
means
• As absolute value of the covariance increases, the strength of the
linear association between X and Y increases
• A positive covariance indicates a direct relationship, while a
negative covariance indicates an inverse relationship between X
and Y
◦ Excel function: = COVARIANCE.P(data range 1, data range 2) or
COVARIANCE.S(data range 1, data range 2)
Measures of Association
Correlation
• Measure of linear association between two variables, X and Y.
• Measured by the Pearson correlation coefficient
• Does not depend upon units of measurement: -1<=ρxy<=+1
◦ Covar(X,Y) = ρxy σxσy
4
Assessing the Fit of a Simple Linear Regression
Model
❖The Coefficient of Determination:
• Proportion of variation in Y for sample that can be explained by the linear
relationship in the regression equation
• 0 <= R2 <= 1
❖How is R2 calculated?
• R2 = (ρxy)2
• Use the ratio SSR/SST to evaluate the goodness of fit for the estimated
regression equation
Least Squares Method
◦ A procedure for using sample data to find the
estimated regression equation:
ith residual: The error made using the regression model to estimate the
mean value of the dependent variable for the ith observation, denoted as
ei = 𝑦𝑖 − 𝑦ො𝑖 :
𝑛 2 𝑛
min 𝑖=1 𝑦𝑖 − 𝑦ො𝑖 = min 𝑖=1 𝑒𝑖 2
= =
.2 .2
.1 .1
0
X
0 _
A B C D 18 19 20 21 22 23 24 X
(18) (20) (22) (24)
Properties of Summary Measures
X =
◦ i.e. X is unbiased
Standard error (standard deviation) of the sampling distribution
when sampling with replacement:
X =
n
◦ As n increases, X decreases
◦ Sampling more decreases the uncertainty in the estimate for
Significant Coefficients
If the dependent variable y does not change when x1
changes, the true value of the slope would be 0
H0: 1=0
To test to see if this is true, look at the p-value for an
independent variable’s coefficient: