TEST1
TEST1
is. What do you expect to find if you would fit a regression line to
these data?
Sales
35 y = -0,3246x + 29,627
5 6 23 R² = 0,027
6 9 25 30
7 15 27 25
8 6 25 20
9 11 26 5 6 7 8 9 10 11 12 13 14 15 16 17
10 16 27 Advertising
11 11 25
Data Linéaire (Data)
12 6 50
13 13 26
14 11 23
15 13 26 Normally we would expect to find out a positive relationship between advertising and
16 7 23 sales.
17 8 23
However, due to the extreme value (6;50) of the 12th week, the regression results
18 8 24
are skewed by it.
19 12 26
20 9 24
b) Estimate the coefficients a and b in the simple regression model with sales as dependent variable and advertising as explanatory factor. Also compute the
standard error and t-value of b. Is b significantly different from 0?
a = 29.63
b = -0.3246
Standard error of b = 0.4589
Statistiques de la régression t-value of b = -0.707
Coefficient de p-value of b = 0.4885 > 0.05 Thus b is not significantly different from 0.
détermination multiple 0,1644364
Therefore, in this model, it appears that the number of advertisements does not
Coefficient de
détermination R^2 0,02703933 affect the number of sales.
Coefficient de -
détermination R^2 0,027014041
Erreur-type 5,836474462
Observations 20
ANALYSE DE VARIANCE
Degré de Somme des Moyenne
liberté carrés des carrés F Valeur critique de F
Régression 1 17,04018547 17,04018547 0,500233921 0,488454014
Résidus 18 613,1598145 34,06443414
Total 19 630,2
20
10
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
-5
-10
Residuals squared
600
500
400
300
200
100
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
d) Apparently, the regression result of part (b) is not satisfactory. Once you realize that the large residual corresponds to the week with opening hours during
the evening, how would you proceed to get a more satisfactory regression model?
In order to obtain a more satisfactory regression model, we should delete the 12th
week's values. Therefore, we will have only 19 observations, but with fewer
observations, we can achieve a more accurate forecast.
e) Delete this special week from the sample and use the remaining 19 weeks to estimate the coefficients a and b in the simple regression model with sales as
dependent variable and advertising as explanatory factor. Also compute the standard error and t-value of b. Is b significantly different from 0?
4
25
y = 0,375x + 21,125 3
24
R² = 0,5154
23 2
22
1
5 6 7 8 9 10 11 12 13 14 15 16 17
0
Advertising
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19
Residuals
2
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19
-1
-2
-3
Statistiques de la régression
Coefficient de
détermination multiple 0,717893879
Coefficient de
détermination R^2 0,515371622
Coefficient de
détermination R^2 0,48686407
Erreur-type 1,053704948
Observations 19
ANALYSE DE VARIANCE
Degré de Somme des Moyenne
liberté carrés des carrés F Valeur critique de F
Total 18 38,94736842
Before After
a 29,627 21,125
b -0,3246 0,375 Comparing the different outcomes that have been computed in Excel between
R² 0,027 0,5154 the cases with 20 and 19 observations.
standard error 0,459 0,088
t-value -0,707 4,252
p-value 0,489 0,00054
We can see that by removing this special week, the number of sales is better
explained by the number of advertisements made (51.54% explanatory power).