Regression Analysis,.Final
Regression Analysis,.Final
There is a significant relationship between the dependent variable (5 year average return %)
and the independent variables FundDE and FundIE
b)
R-squared (R²)
Consequently, the independent factors in the regression model help to explain 58.68% of the
variance in the dependent variable. Stated differently, 58.68% of the variability in the dependent
variable can be explained by the model.
c)
The estimated regression equation that could be used to predict the 5-year average return
given the type of fund, the net asset value and the expense ratio is:
5 year average return % = 1.19 + 6.90 FundDE + 17.68 FundIE – 0.0283 Net Asset
Value + 6.46 Expense Ratio
Null hypothesis:
H₀: β₁ = β₂ = β₃ = β₄ = 0
That is, there is not a significant relationship between dependent variable (5 year average
return %) and independent variables FundDE , FundIE , the net asset value and the expense
ratio .
Alternative hypothesis:
That is, there is a significant relationship between dependent variable (y) and independent
variables FundDE (x₁), FundIE (x₂), the net asset value (x₃) and the expense ratio (x₄).
Level of significance:
Rejection rule:
Conclusion:
Null hypothesis:
H₀: β₂ = 0
Alternative hypothesis:
Hₐ: β₂ ≠ 0
From the output, it is found that the t-statistic corresponding to FundIE is 5.3315 with
df of 40 and the p-value is 0.000.
Level of significance:
Rejection rule:
Conclusion:
Therefore, the data provide sufficient evidence to conclude that the addition of the
independent variable FundIE
is significant.
Therefore, the data provide sufficient evidence to conclude that the addition of the dummy variable
FundIE is significant.
Null hypothesis:
H₀: β₃ = 0
That is, the addition of the Net Asset Value is not significant.
Alternative hypothesis:
Hₐ: β₃ ≠ 0
From the output, it is found that the t-statistic corresponding to Net Asset Value is 0.695 with df of
40 and the p-value is 0.695.
Level of significance:
Rejection rule:
Conclusion:
Therefore, the data do not provide sufficient evidence to conclude that the addition of the Net Asset
Value is significant.
Null hypothesis:
H₀: β₄ = 0
Alternative hypothesis:
Hₐ: β₄ ≠ 0
From the output, it is found that the t-statistic corresponding to Expense Ratio is 2.3399 with df of 40
and the p-value is 0.024.
Level of significance:
Rejection rule:
Conclusion:
Therefore, the data provide sufficient evidence to conclude that the addition of the Expense Ratio is
significant.
Thus, Net Asset value is not significant and it should be deleted from the estimated regression
equation.
D)
5 year average return = -4.61 + 8.17 FundDE + 19.52 FundIE + 5.52 Expense Ratio + 5.92 3StarRank
+8.244 4StarRank +6.62 5StarRank
None of the independent variables are deleted from the regression equation.
State the test hypotheses:
Null hypothesis:
H₀: β₁ = 0
That is, the addition of the independent variable FundDE is not significant.
Alternative hypothesis:
Hₐ: β₁ ≠ 0
Level of significance:
Rejection rule:
Conclusion:
The data provide sufficient evidence to conclude that the addition of the independent variable
FundDE is significant.
Null hypothesis:
H₀: β₂ = 0
Alternative hypothesis:
Hₐ: β₂ ≠ 0
From the output, it is found that the t-statistic corresponding to FundIE is 7.02 with df of 38 and the
p-value is 0.000
Level of significance:
Rejection rule:
Conclusion:
Therefore, the data provide sufficient evidence to conclude that the addition of the dummy variable
FundIE is significant.
H₀: β₃ = 0
Alternative hypothesis:
Hₐ: β₃ ≠ 0
From the output, it is found that the t-statistic corresponding to Expense Ratio is 2.13 with df of 38
and the p-value is 0.039.
Level of significance:
Rejection rule:
Conclusion:
Therefore, the data provide sufficient evidence to conclude that the addition of the Expense Ratio is
significant.
Null hypothesis:
H₀: β₄ = 0
Alternative hypothesis:
Hₐ: β₄ ≠ 0
From the output, it is found that the t-statistic corresponding to 3StarRank is 2.10 with df of 38 and
the p-value is 0.043.
Level of significance:
Rejection rule:
Conclusion:
Here, the p-value is less than the level of significance.
Therefore, the data provide sufficient evidence to conclude that the addition of the 3StarRank is
significant.
Null hypothesis:
H₀: β₅ = 0
Alternative hypothesis:
Hₐ: β₅ ≠ 0
From the output, it is found that the t-statistic corresponding to 4StarRank is 2.11 with df of 38 and
the p-value is 0.042.
Level of significance:
Rejection rule:
Conclusion:
Therefore, the data provide sufficient evidence to conclude that the addition of the 4StarRank is
significant.
Null hypothesis:
H₀: β₆ = 0
Alternative hypothesis:
Hₐ: β₆ ≠ 0
Level of significance:
Rejection rule:
Conclusion:
Therefore, the data provide sufficient evidence to conclude that the addition of the 5StarRank is
significant.
Thus, all the independent variables are significant and no independent variables are deleted from
the regression equation.
E)
The predicted the 5-year average return for a domestic equity fund with an expense ratio of
1.05% and a 3-star Morningstar Rank is 56.057%.
5 year average return = -4.61 + 8.17 FundDE + 19.52 FundIE + 5.52 Expense
Ratio + 5.92 3StarRank +8.244 4StarRank +6.62 5StarRank
The 5-year average return for a domestic equity fund with an expense ratio of 1.05% and a 3-
star Morningstar Rank implies that,
Thus, the predicted the 5-year average return for a domestic equity fund with an expense ratio
of 1.05% and a 3-star Morningstar Rank is,
5 year average return = -4.61 + 8.17 (1) + 19.52 (0) + 5.52 (1) + 5.92 (1)
+8.244(0) +6.62 (0)
=15.28
PART B
a)
The data related to the wet, noise, buy again and purchase variables for 68 season tires. The
independent variable x1 defines the variable labelled wet, x2 defines the variables labelled
noise and the dependent variable y denoting purchase which is defined as,
y = {
1 if the value of the Buy-Again variable is 7 or greater
0 if the value of the Buy-Again variable is less than 7
E(y) = 1/(1+e^-(b0+b1x1+...+bpxp))
If the two values of the dependent variable y are coded as 0 or 1, then the logistic regression
equation can be defined as,
Thus the logistic regression equation relating x1 = Wet performance rating, x2 = Noise
performance rating to y = Purchase is E(y)=1/(1+e−(b0+b1x1+b2x2)).
b)
Odds:
Logit:
g(x_1, x_2, ..., x_p) = ln(odds) = b_0 + b_1x_1 + b_2x_2 + ... + b_px_p
c)
Calculation:
If tires have a Wet performance rating of 8 then x1=8 and if tires have a Noise performance
rating of 8 then x2=8.
= 1.38
ŷ = e^(g(x)) / (1 + e^(g(x)))
= e^(1.38) / (1 + e^(1.38))
≈ 0.798991
Thus, the estimated probability that a customer will probably or definitely purchase a
particular tire again with a wet performance rating of 8 and a Noise performance rating of 8 is
0.798991
d)
If tires have a Wet performance rating of 7 then x1=7 and if tires have a Noise performance
rating of 7 then x2=7.
= -2.52
ŷ = e^(g(x)) / (1 + e^(g(x)))
= e^(-2.52) / (1 + e^(-2.52))
≈ 0.07446
Thus, the estimated probability that a customer will probably or definitely purchase a
particular tire again with a wet performance rating of 7 and a Noise performance rating of 7 is
0.07446.
e)
Given that wet and noise performance appear to have a significant impact on customer
purchase decisions, as CEO, I will think about giving priority to the development and
promotion of tires with higher performance ratings.