Residual Analysis and test_02
Residual Analysis and test_02
5
REGRESSION EQUATION WITH
ESTIMATES
• If we actually knew the population parameters, Bo and
B1, we could use the Simple Linear Regression
Equation.
E(y) = + x
• In reality we almost never have the population
parameters. Therefore we will estimate them using
sample data. When using sample data, we have to
change our equation a little bit.
• ŷ, pronounced "y-hat“ is the point estimator of E(y)
ŷ= +x
• ŷ, is the mean value of y for a given value of x.
ICS 223 Compiler Design 6
Least square criterion
• = observed value of dependent variable (tip amount)
• = estimated (predicted) value of the dependent
variable (predicted tip amount)
• The goal is to minimize the sum of the squared
differences between the observed value for the
dependent variable () and the estimated/predicted
value of the dependent variable () that is provided by
the regression line. Sum of the squared residuals.
If one or more of these assumptions are violated, then the results of our
linear regression may be unreliable or even misleading.
14
15
Best case residual distribution
• Evenly distributed left to right, up to down, all over the graph
16
• Residuals are not evenly distributed
17
Points observed
21
Sum of Squared Error
28
Confidence Interval
• 95% confidence that the actual mean for the population
falls within this interval
t-value calculation
•±
Where is standard deviation of the slope,
is margin of error, is point estimator for the slope
29
Standard Deviation of the slope
•=
• =2.742/sqrt(4206)
• =0.04228
• Hypothesis :