Fstats ch3
Fstats ch3
3 HYPOTHESIS
TESTING:
ONE SAMPLE
TESTS
Objectives
After studying this chapter you should
3.0
Introduction
Alternative
hypothesis (H1):
Type 1 error:
37
Type 2 error:
Test statistic:
Critical region:
Critical value:
Two-tailed test:
Example
A consumer group, concerned about the mean fat content of a
certain grade of steakburger submits to an independent laboratory
a random sample of 12 steakburgers for analysis. The percentage
of fat in each of the steakburgers is as follows.
21 18 19 16 18 24 22 19 24 14 18 15
The manufacturer claims that the mean fat content of this grade
of steakburger is less than 20%. Assuming percentage fat
content to be normally distributed with a standard deviation of 3,
carry out an appropriate hypothesis test in order to advise the
consumer group as to the validity of the manufacturer's claim.
Solution
For this problem, denoting the percentage fat content by X, then
0.05
5%
Accept HH00
z < 1.645
3
X ~ N 20,
12
2
Under H0,
38
1.645
Critical region
reject HH00
z=
Calculation gives
x=
and thus
z=
x 228
=
= 19
12
n
19 20
= 1.15
3
12
This value does not lie in the critical region. Thus there is no
evidence, at the 5% level of significance, to support the
manufacturer's claim.
3.1
n
is a t statistic with degrees of freedom v = n 1, where
2
1
1 2 ( x )
2
2 =
(x x) =
x
.
n
n 1
n 1
39
2
X ~ N 20,
12
1
2282
with x = 19 and 2 = 4448
= 10.545 and so
12
11
= 3.25.
Hence
t=
19 20
= 1.07
3.25
12
This value does not lie in the critical region. Thus there is no
evidence, at the 5% level of significance, to support the
manufacturer's claim.
Activity 1
40
0.05
5%
Accept HH00
1.796
Critical region
H00
reject H
Exercise 3A
1. During a particular week, 13 babies were born in
a maternity unit. Part of the standard procedure
is to measure the length of the baby. Given
below is a list of the lengths, in centimetres, of
the babies born in this particular week.
49 50 45 51 47 49 48 54 53 55 45 50 48
30.8
28.1
24.8
27.4
22.1
24.7
27.3
27.5
27.8
23.9
23.2
146.2
127.9
92.0
91.1
109.8
114.9
115.3
108.5
99.0
112.8
130.7
141.7
122.6
119.9
3.2
( n 1) 2 ~ 2
2
41
The statistic
Example
A user of a certain gauge of steel wire suspects that the standard
deviation of its breaking strength, in newtons (N), is different
from the value of 0.75 as specified by the manufacturer.
Consequently the user tests the breaking strength of each of a
random sample of nine lengths of wire and obtains the following
results.
72.1 74.5 72.8 75.0 73.4 75.4 76.1 73.5 74.1
5%
0.05
Degrees of freedom, v = 8
0 2.733
( x )2
( n 1) 2 = x 2
n
= 49430. 49
2 =
666.9 2
= 13.2
9
( n 1) 2 = 13.2 = 23.47
2
0.5625
This value does lie in the critical region. Thus there is evidence,
at the 10% level of significance, to dispute the manufacturer's
specification as regards variability of breaking strength.
42
15.507
Calculation gives
Under H 0 ,
5%
0.05
Accept H00
Critical region
reject H00
Activity 2
Exercise 3B
1. Given the data in Question 1 of Exercise 3A,
test, at the 5% level of significance, the
hypothesis that the population standard deviation
of length is 2.5 cm.
(AEB)
2. Referring to Question 2 of Exercise 3A,
investigate the claim that the population standard
deviation is greater than 2 kg. (AEB)
3. Using the data given in Question 3 of
Exercise 3A, test the hypothesis that the standard
deviation of yield for the herd at Brookfield
Farm is 20 kg.
(AEB)
3.3
(AEB)
Binomial population
proportion
43
then
X ~ B( n, p )
and
n
n x
P( X = x ) = p x (1 p )
x
where
n
n!
=
x x!( n x )!
x = 0, 1, 2, K, n
Example
Until recently, an average of 60 out of every 100 patients have
survived a particularly severe infection. When a new drug was
administered to a random sample of 15 patients with the
infection, 12 survived. Does this provide evidence that the new
drug is effective?
Solution
Let X denote the number of patients who survive.
Then X ~ B(15, p) , and it is required to test
H0: p = 0.6
(not effective)
(effective; one-tailed)
44
15 patients surviving
= 0.00047
14 or 15 patients surviving
= 0.00517
= 0.02711
= 0.09050
but
The number of patients who actually survived was 12. This value
does not lie in the critical region. Thus there is no evidence, at the
5% level of significance, to suggest that the drug is effective.
Note that rather than actually determine the critical region, it is
equally valid to simply argue that since
P( X 12 ) = 0.09050 > 0.05 , then H0 cannot be rejected.
Remember also that tables of the Cumulative Binomial
Distribution Function may often provide an easier and quicker
alternative for calculating probabilities or finding critical regions.
Activity 3
Can you tell HP Baked Beans from a supermarket brand? Can you
tell Coca Cola from a supermarket brand? Can you tell Stork from
butter?
You are going to set up an experiment to determine whether people
really can tell the difference between two similar foods or drinks.
Each person taking part in the test is given 3 samples; 2 of one
product and 1 of another (so they may have 2 beakers containing
Coca Cola (say) and 1 beaker containing a supermarket brand or
vice-versa).
Ask each person to try to identify the sample which is different
from the other two.
Note that there are 6 possible groupings of the 3 samples,
ABB
BAB
BBA
BAA
ABA
AAB
45
1
3
1
3
Example
A machine which manufactures black polythene dustbin bags, is
known to produce 3% defective bags. Following a major
breakdown of the machine, extensive repair work is carried out
which may result in a change in the percentage of defective bags
produced. To investigate this possibility, a random sample of
200 bags is taken from the machine's production and a count
reveals 12 defective bags. What may be concluded?
Solution
Here n = 200, p = population proportion of defective bags
produced, and it is required to test
H0: p = 0.03
(no change)
H1: p 0.03
(change; two-tailed)
46
Example
Company A proposes the take-over of Company B. The latter's
Chief Executive claims that her Company's shareholders are
equally divided for and against the take-over on the basis of the
terms offered. However, the Chairman of Company A claims that
more than half of Company B's shareholders are in favour of
accepting his Company's offer.
To investigate these two rival claims, the view of each of 400
randomly selected shareholders of Company B is sought. A
subsequent count reveals that 219 are in favour of the offer; the
remainder are against.
Does this provide evidence, at the 1% significance level, that the
claim made by Company B's Chairman is valid?
Solution
Let p denote the actual proportion of Company B's shareholders
who are in favour of the offer. Then
H0: p = 0.50 (claim of Company B's Chief Executive)
H1: p > 0.50 (claim of Company A's Chairman; one-tailed)
Significance level, = 0.01
If X denotes the number of Company B's shareholders in the
sample who are in favour of the offer, then under H0
X ~ B( 400, 0.50 )
200 100
,
P ~ N
400 400 2
i.e.
P ~ N ( 0.50, 0.000625)
So
H00
Accept H
0.01
1%
2.326
Critical region
reject H00
p =
219
= 0.5475
400
so test statistic,
z=
0.5475 0.50
= 1.90
0.000625
This value does not lie in the critical region. Thus there is no
evidence, at the 1% level of significance, to support the claim of
Company A's Chairman.
Exercise 3C
1. A pharmacist claims that more than 60% of all
customers simply collect a prescription. One of
her assistants notes that, in a random sample of
12 customers, 10 simply collected a prescription.
Does this provide sufficient evidence, at the 5%
level, to support the pharmacist's claim?
48
(AEB)
3.4
xe
x!
x = 0,1,2,3, K,
Example
The number of faults in one metre of a particular type of thread is
known to have a Poisson distribution. It is claimed that the
average number of faults is 0.02 per metre. A random sample of
100 one metre lengths of the thread reveals a total of 6 faults.
Does this information support the claim?
Solution
Remembering that if A ~ Po( a ) independent of B ~ Po( b ) then
C = A + B ~ Po( a + b ) , and denoting the total number of faults in
the 100 one metre lengths by X, then if the claim is valid
X ~ Po(100 0.02 = 2 )
(two-tailed)
49
Activity 4
x=0
549 y 877
878 y 976
977 y 996
997 y 999
x =1
x=2
x=3
x=4
Example
Prior to extensive modernisation of its forecourt, the average
number of vehicles calling for fuel at a small garage has been 5 per
hour. Following the modernisation, a total of 543 vehicles called
for fuel in a random sample of 100 one-hour intervals. Has the
modernisation of the garage's forecourt significantly increased the
average number of vehicles calling for fuel?
Solution
If X denotes the total number of vehicles calling for fuel in the 100
one-hour intervals, then under the null hypothesis of no increase
X ~ Po(100 5 = 500 )
50
N (500, 500 )
(one-tailed)
5%
0.05
H00
Accept H
1.645
Critical region
H00
reject H
543 500
= 1.92
Test statistic, z =
500
Exercise 3D
1. In the manufacture of commercial carpet on a
particular machine, small faults occur at random
in the carpet at an average rate of 0.925 per
25 m 2 . Following an overhaul of the machine, an
inspection of a random sample of four 5 m 5 m
squares of carpet reveals only 2 small faults. Is
there evidence, at the 5% level of significance,
that, following the overhaul, the mean number of
small faults has been reduced?
23
19
17
20
25
32
22
26
3.5
Population median
n + 1
defined as the
th item of data; or, in other words, the
2
middle item.
51
or
Example
The values below are the scores (maximum 20) obtained in an
aptitude test by a random sample of 11 graduates. It is known
that for the non-graduate population the median score is 12. Is
there evidence, at the 10% significance level, that graduates
achieve a higher median score than the non-graduate population?
14 15
9 10 10 13 14 19 12 16 13
Solution
H0: = 12
H1: > 12
(one-tailed)
+ +
+ 0 +
Activity 5
Using your data collected for Activity 1, test the hypothesis that
the true median value for the contents is equal to the value
stated as contents.
Explain the reasons for the similarity, or difference, between
your conclusions for the two activities.
Exercise 3E
1. The following values are the annual salaries,
in , of a random sample of 10 recent statistics'
graduates.
13 250
7 485 15 136 12 258 11 019
14 268 19 536 14 326 16 326 17 984
5 6
61 54 65 18 12
9 31
C B A B B C D C F B A B B
30 32 42 56 35
41 38 38 30 29
72 38 40 30 62
53
3.6
Miscellaneous Exercises
56.4
82.1
67.2
78.7
75.1
53.3
55.5
61.5
60.6
55.2
63.1
48.0
4 13 27 22 13
54
64
77
82
66
94
72
83
61
72
65
89
54
73
55
67
71
68
15
17
18
15
18
12
14
19
14
17
9.89, 10.06,
9.81, 10.01,
9.99,
9.98, 10.09,
9.87, 10.09.
263
284
277
281
269
271
262
275
266
272
281
274
279
277
267
269
275
266
Number of times
observed
12
27
26
12
35
47
180
39
58
40
39
51
55
209
534
198
789
633.
95
98
96
95
seconds,
56
218 207 214 189 211 206 203 217 183 186
219 213 207 214 203 204 195 197 213 212
90
(AEB)
188 221 217 184 186 216 198 211 216 200
2456
2389
2361
2360
2332
2402
32
18
29
11
21
19
3
4
4
3
3
1
4
4
1
2
1
3
3
3
5
3
3
3
3
2
2
2
3
5
4
4
2
1
3
5
3
4
2
3
3
4
57
58