Hypothesis Testing Exam
Hypothesis Testing Exam
Objective Questions
4. What is the assumption made for performing the hypothesis test with T-distribution?
(a) the distribution is non-symmetric (b) the distribution has more than one modal class (c) the distribu-
tion has a constant variance (d) the distribution follows a normal distribution
Solution: the distribution follows a normal distribution
6. In a two-tailed test when a Null Hypothesis is rejected for a True Alternative Hypothesis then it has
(a) Type 1 error (b) Type 2 error (c) No error (d) Many errors
Solution: No error
Page 1
(A) Select the level of significance
(B) Setup null and alternative hypothesis
(C) Establish the decision rule
(D) Performance computations
(E) Select test statistics
(F) Draw conclusion
10. Match the items of List - II with the items of List - I and select the code of correct matching Some matchings from
List - I List - II
(a) Chi-square Test (i) Testing the significance of the differences of the mean values among
more than two sample groups.
(b) ANOVA (F-test) (ii) Testing the goodness of fit of a distribution
(c) Z-test (iii) Testing the significance of the difference of the mean values between
two large sized samples
(d) t - test (iv) Testing the significance of the difference of the mean values between
two small sized samples when population standard deviation is not available
Column A and Column B are given below. Select the correct matching?
(A) (a) - (i), (b) - (ii), (c) - (iii), (d) - (iv) (B) (a) - (iv), (b) - (iii), (b) - (ii), (d) - (i) (C) (a) - (i), (b) -
(ii), (c) - (iv), (d) - (iii) (D) (a) - (ii), (b) - (i), (c) - (iii), (d) - (iv)
Solution: (a) - (ii), (b) - (i), (c) - (iii), (d) - (iv)
Page 2
Subjective Questions
Problem 1.
A drug company is testing a drug intended to increase heart rate. A sample of 100 yielded a mean increase of 1.4 beats
per minute, with a standard deviation known to be 3.6. Since the company wants to avoid marketing an ineffective drug,
it proposes a 0.001 significance level. Should it market the drug?
Solution:
In this problem we want to test, H0 : µ = 0 vs H1 : µ > 0, i.e., a right tailed test. Here, the significant level
α = 0.001, n = 100, σ = 3.6.
x̄−µ
√ 0 ∼ N (0, 0).
Under H0 : µ = 0, the test statistic Z = σ/ n
1.4−0
Here, Z = √3.6
= 3.8889.
100
Since, Zα = z0.001 = 3.09 we have Zcal > zα . ⇒ It falls in the rejection region, hence reject H0 .
At the α = 0.001 level of significance, there exists enough evidence to conclude that the drug increases heart rate.
Therefore, the company should market this drug.
Problem 2.
The accompanying data on the breakdown voltage of electrically stressed circuits was read from a normal probability
plot that appeared in the article ”Damage of Flexible Printed Wiring Boards Associated with Lightning-Induced Voltage
Surges” (IEEE Transactions on Components, Hybrids, and Manuf. Tech., 1985: 214-220). The straightness of the plot
gave strong support to the assumption that breakdown voltage is approximately normally distributed.
1470, 1510, 1690, 1740, 1900, 2000, 2030, 2100, 2190, 2200, 2290, 2380, 2390, 2480, 2500, 2580, 2700
X̄−2126.471 2000−2126.471
(b) P (X̄ ≤ 2000) = P ( 370.5729/ √
17
≤ √
370.5729/ 17
) = P (Z ≤ −1.4072) = 0.0793.
Problem 3.
A company manufacturing automobile tyres finds that tyre-life is normally distributed with a mean of 40,000 km and
a standard deviation of 3000 km. It is believed that a change in the production process will result in a better product and
the company has developed a new tyre. A sample of 100 new tyres has been selected. The company has found that the
mean life of these new tyres is 40,900 km. Can it be concluded that the new tyre is significantly better than the old one,
using the significant level of 0.01?
Solution:
In this problem we want to test, H0 : µ = 40000 vs H1 : µ > 40000, i.e., a right tailed test. Here, the significant level
α = 0.01, n = 100, σ = 3000.
x̄−µ
√ 0 ∼ N (0, 0).
Under H0 : µ = 140, the test statistic Z = σ/ n
40,900−40,000
Here, Z = 3000
√
= 3.
100
Since, Zα = z0.01 = 2.33 we have Zcal > zα . ⇒ It falls in the rejection region, hence reject H0 .
Hence, we can conclude that the new tyre is significantly better than the old one.
Page 3
Problem 4.
The Environmental Protection Department requires a mill to aerate its effluent so that the mean dissolved oxygen
(DO) level is above 6.0 mg/L . To maintain the compliance, the Department collects air samples at 12 randomly selected
dates. The data collected is given as
5.85, 6.28, 6.50, 6.21, 5.94, 6.12, 6.65, 6.14, 6.34, 6.19, 6.29, 6.40
The Department requires a strong evidence that the mean DO is high. It is required to test it with significance level,
α = 1%. Answer the following questions. Assume that the population data follows normal distribution.
(a) What should be the null and alternate hypothesis?
(b) Which test statistics: z, t or χ2 test is applicable in this case?
(c) According to your test statistics, calculate the critical value.
(a) The null hypothesis, H0 and alternate hypothesis, H1 are given below. H0 : µ = 6.0; H1 : µ > 6.0
(b) For the given data, the population mean is to be inferred and population variance is not known. Hence, it comes
under t-test.
(c) With degree of freedom = 11 and α = 0.01, from the t-distribution table, the critical t-value, tc = 2.7181 for one
tailed test.
x̄−µ
(d) From the sample data, x̄ = 6.2425, s2 = 0.04957 and n = 12. Hence t = √
s n
= 3.773
(e) Since t > tc , the null hypothesis is rejected. This means that the mill does not violate the Department’s claim.
Problem 5.
It is claimed that an automobile is driven on average more than 20,000 kilometers per year. To test the claim, a
random sample of 25 automobile owners is asked to keep a record of the kilometers they travel. The random sample
showed an average of 23500 kilometers and a standard deviation of 3900 kilometers. It is planned to test the above with
parametric-based hypothesis testing. Assume a 5% confidence level.
(a) What should be the null and alternate hypothesis?
Solution:
(a) The null hypothesis, H0 and alternate hypothesis, H1 are given below. H0 : µ = 20000; H1 : µ > 20000
(b) For the given data, the population mean is to be inferred and population variance is not known. Hence, it comes
under t-test.
(c) With degree of freedom = 24 and α = 0.05, from the t-distribution table, the critical t-value, tc = 1.711 for one
tailed test.
x̄−µ 23500−20000
(d) The test value from the sample statistics is given below: t = √
s/ n
= √
3900/ 25
= 4.48
(e) Since t > tc , the null hypothesis is rejected. The decision from the hypothesis testing is: H0 is rejected that means
an automobile is driven on average more than 20,000 kilometers per year.
Page 4
Problem 6. A manufacturer of electric batteries claims that the average capacity of a certain type of battery that the
company produces is at least 140 ampere-hours with a standard deviation of 2.66 ampere-hours. An independent sample
of 20 batteries gave a mean of 138.47 ampere-hour. Test at 5% significance level H0 = 140 against H1 < 140. Can the
manufacturer’s claim be sustained on the basis of this sample?
Solution:
In this problem we want to test, H0 : µ = 140 vs H1 : µ < 140, i.e., a left tailed test. Here, the significant level is 5%
i.e., α = 0.05, n = 20, x̄ = 138.47, s = 2.66.
Under H0 : µ = 40000, the test statistic t = x̄−µ√ 0 ∼ tn−1 .
s/ n
138.47−140
Here, t = 2.66
√
= −2.57.
20
Since, −tα,n−1 = −t0.05,19 = −1.729 we have tcal < −tα,n−1 . ⇒ It falls in the rejection region, hence reject H0 .
Thus, we can conclude that the average capacity of a certain type of battery is less than 140 ampere-hours.
Problem 7. A manufacturer of car batteries claims that the life of the company’s batteries is approximately normally
distributed with a standard deviation equal to 0.9 per year. If a random sample of 10 of these batteries has a standard
deviation of 1.2 years, do you think that σ > 0.9 year? Use a 0.05 level of significance.
Solution:
In this problem we want to test, H0 : σ 2 = 0.81 vs H1 : σ 2 > 0.81, i.e., a right tailed test. Here, the significant level is
5% i.e., α = 0.05, n = 10, s = 1.2.
(n−1)s2
Under H0 : σ 2 = 0.81, the test statistic χ2 = σ02
∼ χ2n−1 , where σ02 = 0.81.
2
Here, χ2 = (10−1)×1.2
0.81 = 16.
Since, χ2α,n−1 = χ20.05,9 = 16.919 we have χ2cal < χ2α,n1 +n2 −2 . ⇒ It do not fall in the rejection region, hence do not reject
H0 .
Thus, we are unable to conclude that the standard deviation of battery life exceeds 0.9.
Problem 8.
The alkalinity, in milligrams per litre, of water in the upper reaches of rivers in a particular region is known to be
normally distributed with a standard deviation of 10mg/l. Alkalinity readings in the lower reaches of rivers in the same
region are also known to be normally distributed, but with a standard deviation of 25mg/l. Ten alkalinity readings are
made in the upper reaches of a river in the region and fifteen in the lower reaches of the same river with the following
results.
Upper reaches 91 75 91 88 94 63 86 77 71 69 - - - - -
Lower reaches 86 95 135 121 68 64 113 108 79 62 143 108 121 85 97
Investigate, at the 1% level of significance, the claim that the true mean alkalinity of water in the lower reaches of this
river is greater than in the upper reaches.
Solution:
In this problem we want to test, H0 : µ1 = µ2 vs H1 : µ1 > µ2 , i.e., a right tailed test. Here, the significant level
α = 0.01, n1 = 15, n2 = 10, σ1 = 25, σ2 = 10, x̄1 = 1485/15 = 99.0, x̄2 = 805/10 = 80.5.
Problem 9. A chemical engineer claims that the population mean yield of a certain batch process is 500 grams per
milliliter of raw material. To check this claim he samples 25 batches each month. If the computed t-value falls between
−t0.05 and t0.05 , he is satisfied with this claim. What conclusion should he draw from a sample that has a mean x̄ = 518
grams per milliliter and a sample standard deviation s = 40 grams? Assume the distribution of yields to be approximately
normal.
Solution:
Page 5
We want to test the null hypothesis H0 : µ = 500 against the alternative hypothesis H1 : µ 6= 500. We are given:
n = 25, x̄ = 9, s = 40.
Under the null hypothesis, H0 : σx = σy , the statistic t = (x̄−µ)
√
s/ n
follows t-distribution with (n − 1)df. Therefore,
t = (518−500)
√
40/ 25
= 2.25.
The tabulated significant values are: t24 (0.05) = 1.711 and −t24 (0.05) = −1.711. So if the t-statistic falls in the range
(−1.711, 1.711), the engineer is satisfied. Since the calculated value of t = 2.25 lies outside between −1.711 and 1.711, the
engineer might conclude his process yield is different from 500 grams per milliliter.
Problem 10.
An experiment was performed to compare the abrasive wear of two different laminated materials. Twelve pieces of
material 1 were tested by exposing each piece to a machine measuring wear. Ten pieces of material 2 were similarly tested.
In each case, the depth of wear was observed. The samples of material 1 gave an average (coded) wear of 85 units with a
sample standard deviation of 4, while the samples of material 2 gave an average of 81 with a sample standard deviation
of 5. Can we conclude at the 0.05 level of significance that the abrasive wear of material 1 exceeds that of material 2 by
more than 2 units? Assume the populations to be approximately normal with equal variances.
Solution:
In this problem we want to test, H0 : µ1 − µ2 = 2 vs H1 : µ1 − µ2 > 2, i.e., a right tailed test. Here, the significant
level is 5% i.e., α = 0.05, n1 = 12, n2 = 10, s1 = 4, s2 = 5, x̄1 = 85, x̄2 = 81.
q
−x̄2 −µ0 (n1 −1)s21 +(n2 −1)s22
Under H0 : µ1 − µ2 = 2(µ0 ), the test statistic t = x̄1q 1 1
∼ tn1 +n2 −2 , where Sp = n1 +n2 −2 .
Sp n + n
1 1
q
= 4.478 and t = (85−81)−2
2 +9×52
Here, Sp = 11×4 12+10−2
√ 1 1 = 1.04.
4.478 12 + 10
Since, tα,n1 +n2 −2 = t0.05,20 = 1.725 we have tcal < tα,n1 +n2 −2 . ⇒ It do not fall in the rejection region, hence do not reject
H0 .
Thus, we are unable to conclude that the abrasive wear of material 1 exceeds that of material 2 by more than 2 units.
Problem 11. Pumpkins were grown under two experimental conditions. Two random samples of 11 and 9 pumpkins
show the sample standard deviations of their weights as 0.8 and 0.5 respectively. Assuming that the weight distributions
are normal, test the hypothesis that the true variances are equal, against the alternative that they are not, at the 10%
level.
Solution:
We want to test the null hypothesis H0 : σx = σy against the alternative hypothesis H1 : σx 6= σy .
We are given: n1 = 11, n2 = 9, sx = 0.8, sy = 0.5.
2
Sx
Under the null hypothesis, H0 : σx = σy , the statistic F = Sy2 follows F-distribution with (n1 − 1, n2 − 1) df.
n1 11 n2
Now, Sx2= 2
=
n1 −1 sx 10 (0.8)
2
= 0.704. Similarly, = Sy2 = 98 (0.5)2 = 0.28125. Therefore, F = 0.28125
2
n2 −1 sy
0.704
= 2.5
The significant values of F for two tailed test at level of significance α = 0.10 are:
α α
F > F10,8 = F10,8 (0.05) and F < F10,8 1 − = F10,8 (0.95)
2 2
The tabulated significant values are: F10,8 (0.05) = 3.35 and F10,8 (0.95) = 0.326
Since the calculated value of F (= 2.5) lies between 0.326 and 3.35, it is not significant and hence null hypothesis of
equality of population variance can not be rejected at 10% level of significance.
Problem 12. In testing for the difference in the abrasive wear of the two materials in Problem 10. we assumed that the
two unknown population variances are equal. Were we justified in making this assumption? Use a 0.10 level of significance.
Solution:
In this problem we want to test, H0 : σ12 = σ22 vs H1 : σ12 6= σ22 , i.e., a two tailed test. Here, the significant level is 10%
i.e., α = 0.1, α2 = 0.05, n1 = 12, n2 = 10, s1 = 4, s2 = 5.
n1 2
S12 n1 −1 s1
Under H0 : σ12 = σ22 , the test statistic F = S22
∼ Fn1 −1,n2 −1 . Here, the test statistic is F = n2
s2 = 0.6284 In this two
n2 −1 2
sample test we reject H0 when F < F1−α/2,n1 −1,n2 −1 or F > Fα/2,n1 −1,n2 −1 .
1
The critical values are F0.05,11,9 = 3.11 and F0.95,11,9 = F0.05,9,11 = 0.34. Since, we have Fcal > F0.95,11,9 and Fcal <
F0.05,11,9 . ⇒ It do not fall in the rejection region, hence do not reject H0 .
Thus, we conclude that there is insufficient evidence that variances differ.
Page 6
Problem 7. In a sale of 300 units of an item the following preferences for colors are observed for customers:
Problem 14.
A survey was conducted among 500 students who are studying either in “government funded collages” (GVT) or
“privately funded colleges” (PVT). The objective of the survey is to see the choice of “classroom-based learning” (C) over
“Internet-based learning” (I). It is proposed to apply the χ2 test to verify if there exists any association between “colleges”
and “learning”.
Categories C I Total
GVT 75 125 200
PVT 60 240 300
(a) Decide the null and alternate hypotheses in this case. Justify your answer.
(b) Calculate the χ2 -value from the sample data.
(c) Test the hypothesis with 5% confidence level.
Solution:
(a) The hypothesis of the χ2 -test is given below. H0 : There is no association between the attributes College and Learning
vs H1 : There is an association between the attributes College and Learning.
(b) Here we have,
(75 − 54)2 (125 − 146)2 (60 − 81)2 (240 − 219)2
χ2 = + + +
54 146 81 219
= 8.16 + 3.02 + 5.44 + 2.01 = 18.63
Categories C I Total
GVT 75 (54) 125 (146) 200
PVT 60 (81) 240 (219) 300
Total 135 365 500
(c) Critical value is = 3.841. As χ2 > 3.841. Reject the null hypothesis i.e., there exists association between “colleges”
and “learning”.
Problem 15. Let X be the number of defects in printed circuit boards. A random sample of n = 60 printed circuit
boards is taken and the number of defects is recorded. The results were as follows: Does the assumption of a Poisson
distribution seem appropriate as a model for these data?
Solution:
To test H0 : X ∼ Poisson vs H1 : X does not follow a Poisson distribution.
The mean of the (assumed) Poisson distribution is unknown so must be estimated from the data by the sample mean:
µ̂ = 0.75. Using the Poisson distribution with µ = 0.75 we can compute pi , the hypothesised probabilities associated with
each class. From these we can calculate the expected frequencies (under the null hypothesis) as follows:
Page 7
Number of Defects Observed Frequency
0 32
1 15
2 9
3 4
e−0.75 (0.75)0
p0 = P (X = 0) = = 0.472 ⇒ E0 = 0.472 × 60 = 28.32
0!
e−0.75 (0.75)1
p1 = P (X = 1) = = 0.354 ⇒ E1 = 0.354 × 60 = 21.24
1!
e−0.75 (0.75)2
p2 = P (X = 2) = = 0.133 ⇒ E2 = 0.133 × 60 = 7.98
2!
p3 = P (X ≥ 3) = 1 − (p0 + p1 + p2 ) = 0.041 ⇒ E3 = 0.041 × 60 = 2.46
Note: The chi-squared goodness of fit test is not valid if the expected frequencies are too small. There is no general
agreement on the minimum expected frequency allowed, but values of 3,4, or 5 are often used. If an expected frequency
is too small, two or more classes can be combined. In the above example the expected frequency in the last class is less
than 3, so we should combine the last two classes to get:
Page 8