P & S (25)
P & S (25)
Sam
pli
n g
D i
str
ib
uti
onofX
Mea
na ndSt
a n
dar
d Devi
ati
ono
f
t
heSa
m pl
ing D
is
tr
ibut
ion fX
o
Cen
tr
alLimitThe
orem
TOPICS FOR TODAY
Sampling Distribution of p̂
Sampling Distribution of
X1 X2
EXAMPLE: A construction company has 310
employees who have an average annual salary
of Rs.24,000 and standard deviation Rs.5,000.
Suppose that the employees of this
company launch a demand that the government
should institute a law by which their average
salary should be at least Rs. 24500, and,
suppose that the government decides to check
the validity of this demand by drawing a
random sample of 100 employees of this
company, and acquiring information regarding
their present salaries.
What is the probability that, in a random
sample of 100 employees, the average salary
will exceed Rs.24,500 (so that the government
decides that the demand of the employees of
this company is unfounded, and hence does not
pay attention to the demand(although, in
reality, it was justified))?
SOLUTION
The sample size (n = 100) is large enough
to assume that the sampling distribution ofX is
approximately normally distributed with the
following mean and standard deviation:
x Rs.24,000.
N n 5000 310 100
x .
n N 1 100 310 1
Rs. 412.20
NOTE: Here we have used finite population
correction factor (fpc), because the sample size
0 1.21 Z
2
2 10 1
0.05.
60 20 20
To verify the given relations, we first
calculate the population proportion p.
Thus :
X
p , where X represents the number of even
N
numbers in the population.
3
In other words, p 0.5 ,
6
H
en
ce
,wef
ind
tht
a p̂
0.
5
p,
pqNn 0
. 3
25
6
. .
nN 1 3 6
1
a
nd
0.
25
0.
05
Vp̂
ar
5
H
ence
,twop
ro
pert
ieso
fthesa
mpl
ing
d
is
tr
ibu
ti
onfp̂
o arever
if
ied
.
PROPERTIES OF THE SAMPLING
DISTRIBUTION OF p̂
Property No. 1:
The mean of the sampling distribution of
proportions, denoted by p̂ , is equal to the
population proportion p, that is p̂ p.
Property No. 2:
The standard deviation (standard error) of
p̂ and denoted by p̂ , is given as:
pq
a) p̂ ,
n
when the sampling is performed with
replacement pq N n
b) p̂ ,
n N 1
when sampling is done without replacement
from a finite population.
(As in the case of the sampling distribution
of X, N n , is known as the finite population
N 1
correction factor (fpc).)
Property 3: SHAPE OF THE DISTRIBUTION
The sampling distribution of p̂ is the binomial
distribution. However, for sufficiently large
sample sizes, this sampling distribution is
approximately normal.
p̂ p. p̂
pq
p̂ ,
n
As a rule of thumb, the sampling distribution of
p̂ will be approximately normal whenever both
np and nq are equal to or greater than 5.
Let us apply this concept to
a real-world situation:
EXAMPLE-2: Ten percent of the 1-kilogram
boxes of sugar in a large warehouse are
underweight. Suppose a retailer buys a random
sample of 144 of these boxes. What is the
probability that at least 5 percent of the sample
boxes will be underweight?
SOLUTION: Here the statistic is the sample
proportion
The sample size (n = 144) is large enough
to assume that the sample proportion is
approximately normally distributed with mean
M e a n o f th e s a m p lin g d is tr ib u tio n
o f p̂ :
p̂ p 0 . 10 ,
and
S t a n d a r d E r r o r o f p̂ :
p̂
pq
0 . 10 0 . 90
n 144
0 .3
0 . 025 .
12
Therefore, the sampling distribution of p̂ is
approximately N(0.10, 0.025) And, hence:
p̂ p̂ p̂ p
Z
p̂ pq / n
p̂ 0.10
0.025
is approximately N(0, 1).
We are required to find the probability that
the proportion of underweight boxes in the
sample is equal to or greater than 5% i.e., we
require Pp̂ 0.05.
In this regard, a very important point to be
noted is that, just as we use a continuity
correction of + ½ whenever we consider the
normal approximation to the binomially
distributed random variable X, in this situation,
since X
p̂ ,
n
therefore, we need to use a continuity
1
correction of 2n in the case of the sampling
distribution of p̂ .
A pplying the continuity correction
in this problem , w e have:
1
P p̂ 0 . 05 P p̂ 0 . 05
2 144
1
P p̂ 0 . 05
288
P
pˆ 0 . 10
0 . 05 1 / 288 0 . 10
0 . 025 0 . 025
P Z 2 . 14
P 2 . 14 Z 0 P 0 Z
0 . 4838 0 . 5 0 . 9838
(using the area table of the standard normal
distribution)
0.4838 0.5
0.10 p̂
-2.14 0 Z
Hence, the probability that at least 5% of
the sample boxes are under-weight is as high
as 98% !
The sampling distributions of X and p̂
pertain to the situation when we are drawing all
possible samples of a particular size from one
particular population.
Next, we will discuss the case when we
are dealing with all possible samples drawn
from two populations, such that the samples
from the two populations are independent.
In this regard, we will consider the
sampling distributions of X1 X 2 and p̂1 p̂ 2 :
S A M P L IN G D IS T R IB U T IO N O F
D IF F E R E N C E S B E T W E E N M E A N S :
S u p p o se we have tw o d istin c t
p o p u la tio n s w ith m e a n s 1 and 2 a n d
2 2
1
v a r ia n c e s and 2
r e sp e c tiv e ly .
Let independent random samples of sizes
n1 and n 2 be selected from the respective
populations, and the differences x1 x 2
between the means of all possible pairs of
samples be computed.
Then, a probability distribution of the
differences X1 X 2 can be obtained.
Such a distribution is called the
sampling distribution of the
differences of sample means X1 X 2 .
We illustrate the sampling distribution of X1 X2
2
4 6 2
6 6 2
8 6 2
8
.
1
3 3
The mean and variance of the second
population are:
1 2 3
2 2 , and
3
2
1 2 2
2 2 2
3 2 2
2
.
2
3 3
Now x1 x 2 4 6 2 1 2 , and
2 2
1 28 1 2 1
. .
n1 n 2 3 2 3 2
4 1 5
3 3 3
1.67
2
x1 x 2
Hence, two properties of the
sampling distribution of X1 X 2
are satisfied.
PROPERTIES OF THE SAMPLING
DISTRIBUTION OF X1 X 2
Property No. 1:
The mean of the sampling distribution of
X1 X 2 , denoted by X X , is equal to the
1 2
difference between population means, that is
X1 X2 1 2
P ro p erty N o . 2 :
In ca se o f sa m p lin g w ith o r w ith o u t
re p la ce m e n t fr o m tw o in fin ite
p o p u la tio n s, th e sta n d a r d d e v ia tio n o f
th e sa m p lin g d istr ib u tio n o f X 1 X 2
(i.e. sta n d a rd e rro r o f X 1 X 2 ),
d en o te d b y X 1 X 2 , is g iv en b y
2 2
1 2
X1 X2
n1 n2
The above expression for the Standard
Error of X1 X 2 also holds for finite
population when sampling is performed
with replacement.
In case of sampling without replacement from a
finite population, the formula for the standard
error of X1 X2 will be suitably modified.
Property No. 3:
Shape of the distribution:
a) If the POPULATIONS are
normally distributed, the sampling
distribution of X1 X2 , regardless of
sample sizes, will be normal with
2 2
1 2
mean 1 2 and variance .
n1 n2
In other words, the variable
Z
X1 X 2 1 2
2 2
1 2
n1 n2