Final
Final
The final is open book and open note. Your are also permitted use of a calculator. Multiple
choice problems are worth 5 points each. It is possible to get partial credit for an incorrect multiple
choice problem answer, but only if you show your work or provide an explanation for your answer.
The following table presents data collected in the 1960s for 21 countries on X = Annual Per Capita
Cigarette Consumption (“Cigarette”), and Y = Deaths from Coronary Heart Disease per 100,000
persons of age 35–64 (“Coronary”).
Scatterplot of Coronary vs. Cigarette Consumption Residuals Versus the Fitted Values
(response is Coronary)
250 100
200
Coronary
150
Residual
0
100
50
0
-100
1000 2000 3000 4000
100 150 200 250
Cigarette
Fitted Value
Analysis of Variance
Model Summary
Coefficients
Regression Equation
(a) Based on the scatterplot of Y versus X, does there appear to be a linear relationship between
cigarette consumption and heart disease? If so, does the relationship appear to be negative or
positive?
(b) What patterns or problems, if any, do you see in the residuals versus fitted values plot? Would
you feel reasonably comfortable in fitting a simple linear regression model to this data set?
(c) Write the equation for the fitted model.
(d) Give an interpretation of the fitted slope, β̂1 .
2
(e) How much natural variability is associated with β̂0 ? (In other words, approximately what is
the standard deviation of the random variable β̂0 ?)
.........
(a) Based on the Minitab output, is it plausible that the true intercept β0 is zero? Explain. What
would be the practical interpretation of the result that β0 = 0? Is there any contradiction here?
(b) Do you think that natural variability alone could account for such a large value of β̂1 as actually
found here? Explain.
(c) Using the Minitab output, determine whether sufficient statistical evidence exists to conclude
that there is a linear relationship between X and Y at the 1% level of significance.
(d) Based on R2 , assess the strength of the linear relationship between X and Y .
(e) Do the p-value for β̂1 and the value of R2 provide contradictory evidence on the strength of
the linear relationship between smoking and heart disease? Explain.
.........
The weights of ten $100 casino chips (selected at random from a large batch of new $100 chips at
the Trump Castle Casino) averaged 0.8 ounces, with a sample standard deviation of 0.03 ounces.
(a) Assuming that the weights of the chips in the batch are normally distributed, construct a 95%
confidence interval for the mean weight of the entire batch.
(b) Does the interval you got in part (a) have a 95% chance of containing the mean weight of the
entire batch? Explain.
.........
Problem 4 (5 points)
For the situation described in Problem 3, if µ is the mean weight for the entire batch, test H0 : µ =
.83 versus Ha : µ 6= .83 at level .05.
.........
3
Problem 5 (25 points)
One hundred randomly selected milk cows were observed for one week and then given a genetically
engineered drug designed to increase milk production. The increase in milk production (second
week minus first week) averaged to 11 gallons with a sample standard deviation of 50 gallons.
(a) State the appropriate null and alternative hypotheses for this problem, in terms of µ.
(c) What do the null and alternative hypotheses imply about the effectiveness of the drug?
(d) Give all values of α at which the null hypothesis can be rejected.
(e) Suppose the drug had no effect. Then out of 1000 random samples of 100 cows, how many
samples would be expected to yield an increase in milk production at least as large as what
was found in our sample?
.........
4
Questions 6–9 concern the following situation. A random sample of 50 adults were asked how
much they spend on lottery tickets, and were interviewed about various socioeconomic variables.
The variables are
Model Summary
Coefficients
Regression Equation
PercLott = 15.1 - 0.591 YrsEdu + 0.0065 Age + 0.082 Kids - 0.0666 Income
Problem 6
Based on the output, is there statistical evidence to suggest that relatively educated people spend
a different amount on lotteries than relatively uneducated people?
(a) Yes
(b) No
.........
5
Problem 7
(a) All of the true slope coefficients in the model are nonzero
(b) At least one of the true slope coefficients in the model is nonzero
.........
Problem 8
(c) (-1,1)
.........
Problem 9
Performing a two-tailed hypotesis test for the null hypothesis that the true coefficient of YrsEdu
is -1, at the 5% level of significance, we:
.........
6
Problem 10
Let’s return to the simple regression described in Problem 1. The residual for Greece is:
(a) 1800
(b) 29.45
(c) 31.74
(d) 1768.26
(e) -88.474
.........
Problem 11
A sample of size 100 is going to be taken from a population with mean 3 and variance 25. The
probability that the sample mean will exceed 4 is approximately:
(a) .0456
(b) .4207
(c) .0793
(d) .3446
(e) .0228
.........
Problem 12
Suppose that X and Y are independent random variables with P (X > 4) = 0.8 and P (Y > 5) = 0.6.
The probability that X exceeds 4 and Y exceeds 5 is
(a) 1.4
(b) 0.92
(c) 0
(d) 0.48
.........