0% found this document useful (0 votes)
5 views

Unit-3 by EasePDF

The document discusses stratified random sampling, a method used when populations are heterogeneous to improve representation and reduce variability. It outlines the principles of stratification, properties of stratified sampling, and methods for sample size allocation, comparing it to simple random sampling. Key concepts include the unbiased estimation of population mean and variance, as well as the calculation of proportions within strata.

Uploaded by

nageh71441
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as RTF, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views

Unit-3 by EasePDF

The document discusses stratified random sampling, a method used when populations are heterogeneous to improve representation and reduce variability. It outlines the principles of stratification, properties of stratified sampling, and methods for sample size allocation, comparing it to simple random sampling. Key concepts include the unbiased estimation of population mean and variance, as well as the calculation of proportions within strata.

Uploaded by

nageh71441
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as RTF, PDF, TXT or read online on Scribd
You are on page 1/ 29

UNIT 3 STRATIFIED RANDOM Stratified Random

Sampling
SAMPLING
Structure
3.1Introduction
Objectives
3.2Principles of Stratification
Notations and Terminology
3.3Properties of Stratified Random Sampling
3.4Mean and Variance for Proportions
3.5Allocation of Sample Size
Equal Number of Units from Each Stratum
Proportional Allocation
Neyman’s Allocation
Optimum Allocation
3.6 Stratified Sampling versus Simple Random Sampling
Proportional Allocation Versus Simple Random Sampling
Neyman’s Allocation Versus Proportional Allocation
Neyman’s Allocation Versus Simple Random Sampling
Merits and Demerits of Stratified Random Sampling
3.7 Summary
3.8 Solutions/Answers

3.1 INTRODUCTION
When the units of the population are scattered and not completely
homogeneous in nature, then simple random sample does not give proper
representation of the population. So if the population is heterogeneous the
simple random sampling is not found suitable. In simple random sampling the
variance of the sample mean is proportional to the variability of the sampling
units in the population. So, in spite of increasing the sample size n or
sampling fraction n/N, the only other way of increasing the precision is to
device a sampling which will effectively reduce the variability of the sample
units, the population heterogeneity. One such method is stratified sampling
method.

In stratified sampling the whole population is to be divided in some


homogeneous groups or classes with respect to the characteristic under study
which are known as strata. That means, we have to do the stratification of the
population. Stratification means division into layers. The auxiliary
information related to the character under study may be used to divide the
population into various groups or strata in such a way that units within each
stratum are as homogeneous as possible and the strata are as widely different
as possible.

Thus, all strata would comprise the population. Then from each stratum
sample would be drawn and lastly all samples would be combined to get the
ultimate sample. For example, let us consider that population consists of N
units and these are distributed in a heterogeneous structure. Now first of all
45
Statistical Techniques
we divide the population into ‘k’ non overlapping strata of sizes N1, N2, N3,
..., Nk such that each stratum becomes homogeneous. Evidently N = N1 + N2 +
N3 + ... + Nk. Then from first stratum a sample of size n1 would be drawn by
simple random sampling method. Similarly, from the second stratum a sample
of n2 units would be drawn and so on, up to kth stratum. Now all these k
samples would be combined to get the ultimate sample. So, the ultimate size
of sample would be n  n1  n2  n3 ...  nk . This method of sampling is
known as Stratified random sampling because here stratification is done first
to make population homogeneous and then samples are drawn randomly by
simple random sampling from each stratum.
The principles of stratification are explained in Section 3.2. The properties of
stratified random sampling are described in Section 3.3, whereas Section 3.4
provides the derivation of the mean and variance of proportions in stratified
random sampling. The allocation of sample size with the help of different
techniques is described in Section 3.5. The comparative study between
stratified random sampling and simple random sampling is given in Section
3.6.
Objectives
After studying this unit, you would be able to
 define the stratified random sampling;
 explain the principles of stratification;
 describe the properties of stratified random sampling;
 derive the mean and variance of proportions in stratified random
sampling;
 describe the allocation of sample size with the help of different
techniques; and
 calculate the estimate of population mean and variance of sample mean.

3.2 PRINCIPLES OF STRATIFICATION

The principles to be kept in mind while stratifying a population are given


below:

1. The strata should not be overlapping and should together comprise the
whole population.

2. The strata should be homogeneous within themselves and heterogeneous


between themselves with respect to characteristic under study.

3. If a investigator is facing difficulties in stratifying a population with


respect to the characteristic under study, then he/she has to consider the
administrative convenience as the basis for stratification.

4. If the limit of precision is given for certain sub-population then it should


be treated as stratum.

46
3.2.1 Notations and Terminology Stratified Random
Sampling
N = Population size
n = Sample size
k = Number of strata
Ni = Size of ith stratum
k

Then N   N i
i1

ni = Size of sample drawn from ith


stratum
k

Then n   n i
i1

Xij
= Value of character under study for jth unit of ith stratum
Ni
Xi  Population mean
i
th stratum 
1 X ij
of j1

Ni
k Ni
1
X  Mean 
N
X ij

Population
i1 j1
k k
1

N i1
 N i X i   Wi X i i1
is called the weight of i
where, Wi N

stratum
N
Si
th i
2
= Population mean square
Ni
of the i th stratum
1
    2   
Ni 1 Xij Xi , j 1, 2,..., & i = 1, 2, ..., k
j1
Ni
x ij
= Value of jthsample unit taken from ith stratum

n
x i = 1  xij = Mean of sample units selected from i stratum
i

ni j1 2

ni
1
 x  xi  ,
2
si  ij
i  1, 2,..., k 
ni 1 j1

= Sample mean square of the i th


th
stratum
Let us consider the following sample means to estimate the populations mean
of ith stratum X which are:
k
1
x n   n i xi
n i1
k
xst  1  Ni xi 
k
and Wi xi
N i1  i1

where, xst
is the weighted mean of the strata sample means, weights being
equal to strata sizes. These two will be identical if ni  Ni
47
Statistical Techniques
3.3 PROPERTIES OF STRATIFIED RANDOM
SAMPLING

Theorem 1: xst is an unbiased estimate of the population mean X i.e.


E x st   X
Proof: We have
k
1
x st   N i x i
N i1
Therefore,

E x st
k
1 
  E  i
1
Ni x i

1 k
N Ex 
Ni1 i i

Since the sample units selected from each of stratum are simple random
sample, then we have
E x i   Xi
Therefore,

1 k
E   N X
xst N i1 i i
k Ni
1 1
=
N
N i1
i X
i j1
ij

k Ni
N
1
=
N
 Xi1 j1
ij
 X

Hence proved

Theorem 2: Prove that


k Si k
1 1  2 2
1
Var x st   2  N N i i
 ni    Wi S i i
Ni 
N n
 
i1 i i1 
Proof: We have
Var x  
 k Var Wx
  
st i i
 i1 
 W 2
Var x 
k i i
i1

The covariance term vanish since the samples from different strata are
independent and the sample units in each stratum are the simple random
sample without replacement, we have
48
1 1  2
Var x    S
Stratified Random
Sampling
 
n N 
1 1  2
or Ni 
Var (xi ) =  i

 in

Therefore, S k
2 1 1  2
Si N i 
Var x st   
 ni
W i
i1

1
 2 Ni
N i1 k 2

  Ni S
ni n i
 i

From the above result the variance depends on Si2 the heterogeneity within the
strata. Thus, if Si2 are small i.e. strata are homogeneous then stratified
sampling schemes provides estimates with greater precision.

Theorem 3: If Si2 is not known then prove that estimate of the variance of the
sample mean of the stratified random sample is given by
k
1
1  2 2

EVarx    Wi S i i
st Ni 
n

i1 

Proof: In general Si2are not known. A simple random sample is drawn from each stratum. If we assume a
individual stratum as a population then the sample, drawn from it, would be a simple random sample. If the sample
is drawn from ith stratum, the sample mean square s 2would be an estimate of population mean square Si
i. e.

E s2
i 1, 2,..., k
… (1)
i

2
 S

i i

Accordingly an unbiased estimate of the variance is given by


k
1 1  2 2

Varx    Wi si i
Ni 

Therefore, st i1 

 n
1 1  2 2

EVar x st  E   Wi i si 
N
n i 
  
 i1 
k
 1 1  2
    Wi
E is
2
 
n N
i1  i i
Substituting from equation (1), we get
k
1 1  2 2

EVar x st    Wi Si i
n Ni 
  i1 
49
Statistical Techniques
3.4 MEAN AND VARIANCE FOR PROPORTIONS
As in simple random sampling, we can divide a population into two classes
with respect to a attribute. Hence the units in the population are classified in
these two classes accordingly as it possesses or does not possess the given
attribute. After taking a sample of size n, we may be interested in estimating
the population proportion of the defined attribute.
If a unit possesses the attribute, it receives the code value 1 and if an unit
does not possesses the attribute, it receives the value 0. Let the number of
units belonging to A in the i2
th
stratum of size Ni be Mi and if the sample of size ni taken from ith stratum, the number of units belonging to A be
mi. Denoting the proportion of units belonging to A in the population, in the ithstratum and sample from the ith
stratum by , i and pi respectively, various formula for mean and variance are as follows:
M m
π  i and p  i
i i
Ni ni

and k Ni
k

π  πi  W i πi
N for i =1, 2,…, k

i1

i1

The estimated proportion pst under stratified sampling for the units belonging
to A is
k
pst
  Wi
pi i1

Mean p st 
Wi pi    Wi E p i 
k k

 E
 i1  i1
since we draw SRS from each stratum so by Theorem 10 of Section 2.4 of
Unit 2 we have
E p i   π i

By putting this value in above formula, we have


k
Mean p st    Wi π i  π
i1

Hence sample proportion under stratified sampling is unbiased estimate for


population proportion.
Now variance of pst is given by
Var   
  k 1 N 2 V p 



k
pst Var  Wi pi  2
 i1  N i1 i i

since we draw SRS from each stratum so by Theorem 11 of Section 2.4 of


Unit 2 we have

50
Ni  n i π i  1  π i  Stratified Random
Var pi   . Sampling
N1 i
n
i

By putting this value in above formula, we have


1 k
N 2
N  n π 1  π 
Var p st  

2
i i

N i 1 n i
i i i

N i1

If Ni is large enough, consider 1/ Ni as negligible and Ni-1~Ni, formula for


Var (pst ) reduces to
Var p  i 1   i 
1 N N  n 
k
 n
i i i
st 2
N i1 i

and if ni / Ni is negligible, therefore


Varp
 2  i 1   i 
W
st
 k
i
i1 n
i

An unbiased estimate of Var (pst ) is given by


E Var p 1
 
k
Ni  n i
 N p q
st
N i1
i
n i 1 i i

2
where, qi = 1− pi

3.5 ALLOCATION OF SAMPLE SIZE


In stratified sampling, the allocation of the sample to different strata is done
by considering the following factors:
1. The total number of units in the stratum, i.e. stratum size;
2. The variability within the stratum; and
3. The cost in taking observations per sampling unit in the stratum.
A good allocation is one where maximum precision is obtained with minimum
resources. In other words, the criterion for allocation is to minimize the cost
for a given variation or minimize the variance for a fixed cost, thus making
the most effective use of the available resources.

Types of Allocation of Sample Size


It is evident from the formula for variance of xst that it depends on ni the
number of units selected at random from ith stratum. Hence, the problem
arises, what optimum value of ni ( i  1, 2,..., k) can be chosen out of n, so
that, the variance is as small as possible. Four types of allocations are
considered here:
3.5.1 Equal Number of Units from Each Stratum
This is a situation of considerable practical interest for reasons of
administrative convenience. In this allocation method, the total sample size n
is divided equally among all the strata i.e. if the population is divided in k
strata then the size of sample for each stratum would be

51
Statistical Techniques n 
n for all i =1, 2, …, k
i
k

3.5.2 Proportional Allocation


This allocation was originally proposed by Bowley in 1926. This procedure of
allocation is very common in practice because of its simplicity. As its name
indicates, proportional allocation means that we select a small sample from a
small stratum and a large sample from a large stratum. The sample size in
each stratum is fixed in such a way that for all the strata, the ratio ni Ni is
equal to n N i.e.
ni n
Ni N
n
or n N
i i
N
ni  n Wi
or ni  Ni … (2)

In other words, the allocation of a sample of size n to different strata is to be


done in proportional to their sizes. We have variance of xst
k
 n  W
2
  1 i
 i
Varx
2
S Ni  n i
st i
i1 

Thus, substituting the values of ni and ni Ni as n Wi and n N respectively in


variance formula then we get variance under proportional allocation
 n k
W
S2

i i

Var
x st


1 
PROP
 N  i1 … (3)
n
1 1 
W Sk  
2
N

n   i i
  i1

In case, the sampling fraction (n/ N) is negligible


k
W
S2
 i i

Var x st  PROP
n
i1
... (4)

3.5.3 Neyman’s
Allocation
This allocation of the total sample size n to the different stratum is
called minimum variance allocation and is due to Neyman (1934).
This result was first discovered by Tchuprow (1923) but remained
unknown until it was rediscovered independently by Neyman. This
allocation of samples among different strata is based on a joint
consideration of the stratum size and the stratum variance. In this
allocation, it is assumed that the sampling cost per unit among
different strata is same and the size of the sample is fixed. Sample
sizes are allocated by

52
nin Wi S Ni Si
i n Stratified Random
k Sampling
k
WS
i1
i i
N i
Si … (5)
i1

A formula for the minimum variance with fixed n is obtained by substituting


the value of ni in variance formula, then we get
2
 k
 k

  Wi Si  W i i
2
S
  i1 
 i1
NEY
n N … (6)
Var 
xst

3.5.4 Optimum Allocation


The variance of estimated mean depends on ni which can arbitrarily be fixed.
One more factor, which is none the less important, also influences the
variance of estimated mean. The allocation problem is two fold:
1. We attain maximum precision for the fixed cost of the survey; and
2. We attain the desired degree of precision for the minimum cost.
Thus, the allocation of the sample size in various strata, in accordance with
these two objectives, is known as optimum allocation.

In any stratum the cost of survey per sampling unit cannot be the same. That
is, in one stratum the cost of transportation may be different from the other.
Hence, it would not be wrong to allocate the cost of the survey in each stratum
differently.

Let ci be the cost per unit of survey in the ithstratum from which a sample of
size ni is stipulated. Also suppose c0 as the over head fixed cost of the survey.
In this way the total cost C of the survey comes out to be
k

C  c 0   ci
ni … (7)
i1
c0 and ci are beyond our control. Hence we will determine the optimum
value of ni which minimizes the variance of stratified sample mean.
To determine the optimum value of ni, we consider the function

  Var x st   C
k
1 1  2 2  k

     Wi Si  λ c 0  ci ni 
i 1  ni Ni   i 1  ... (8)
where  is constant and known as Lagrange’s multiplier.

Using the method of Lagrange’s multiplier we select ni and the constant 


to minimize  . Differentiating  with respect to ni , we have
53
Statistical Techniques
2 2
 Wi S i  λ c  … (9)
i
n2
0 i
or
Wi Si
ni  … (10)
 ci

Since λ is an unknown quantity, it has to be determined interms of known


values. So, we take the sum over all in equation (10) and thus obtain

k
1 Wi Si
n  k


i1
i
 i1
ci

1 Wi Si
or n
k  ci
 i1

1 n i1
  Wi Si
  ci
Substituting the value of  in equation (10), we get the value of ni

W S ci  N S
k ci 

i i
ni 
n k i i
… (11)
  n 
i1
Wi Si ci  NS i i ci
i1

Thus, the relation (11) leads to the following important conclusions that we
have to take a larger sample in a given stratum if
1. The stratum size Ni is larger;
2. The stratum has larger variability (Si); and
3. The cost per unit is lower in the stratum.

3.6 STRATIFIED SAMPLING VERSUS SIMPLE


RANDOM SAMPLING
Now, we shall make a comparative study of simple random sampling without
replacement and stratified random sampling under different kinds of
allocations i.e. Proportional allocation and Neyman’s allocation.
3.6.1 Proportional Allocation versus Simple Random
Sampling
The variance of the estimate of stratified sample mean with proportional
allocation and variance of the sample mean of simple random sampling is
given respectively by
Var x  1 1 k
 
S st PROP  W i i
n N
  i1
… (12)

54
X  X i 2
N 1 
where, S2i  1 Ni
ij
Stratified Random
Sampling
i j1
and Var x 1 1
   S2
SRSWOR

k Ni
n N  … (13)
where, S  1  X ij  X 
2 2

N 1 i1 j1
In order to comparing (12) and (13) we shall first express Sk
2


2
in terms of S2i we have 
2


N
S
N1 Xij Xi Xi X
    k i

k  Ni
i1 j
1  2
   2

   N
N 1 S2    Xij Xi
i1  j1
  Xi X
 i1 j1 k
 Ni


2 Xi  X  Xij  Xi  
i1  j1 

N 1S2  kN 1 S2  N  X  X


k

 2

i i i i
i1 i1

The product term vanishes since

Ni

 X  X  0 ij
i


j1

being the sum of square of deviation from the stratum mean. If we assume that
Ni and consequently N are sufficiently large so that we can put Ni-1= Ni and
N-1 = N, then we get

N X  X 
k k 2
NS2  
2
i
Si  i
i1
N
i1  k i

S2  
k W S2  W  X
2
… (14)
 X
i i i i
i1 i1

Substituting in equation (13), we get


Varx  1 1  1 1  k W X  X 
k 2

    Wi Si2   
 i i
SRSWOR n N  i1  n N
 i1

Var    
 k1

1    2

x SRSWOR
Var xst x  n PROP
N
  Wi Xi X
st
  i1

 
Var x
SRSWOR
 Var
 PROP
… (15)

3.6.2 Neyman’s Allocation versus Proportional Allocation


Considering the variances of estimated sample mean in stratified random
sampling with proportional allocation and Neyman’s allocation respectively
we have
55
 1Statistical
1  Techniques
k   … (16)


Varx  
W
S
st PROP i 2
i1  1
n N
and

2
1
 W S  
W S
k
k

Var x 
st NEY i i
n i i N … (17)
i1
 i1

By subtracting equation (17) from equation (16) we get


Varx  Varx  1 1 k
 
WS
st PROP st NEY   i i
 n N  i1
 1  2
2

2
k  1 k 
 W S  
 W S 2

i i

n
  i1
i i
2
Ni1 
 k  k
 
1
 W S2    WS  
ii
n  i1 i
 i1  
k
1
 n W )(S S i
i … (18)
i1

where, S   1
W kS  N S is the weighted mean of the stratum sizes N
k
 i i i i i
N
i1 i1

Hence from equation (18) we can say

Var x st PROP  Var x st NEY

because, R.H.S. of equation (18) is non-negative.

3.6.3 Neyman’s Allocation versus Simple Random Sampling

From the relationship between the proportional allocation and simple random
sampling and the relation between proportional and Neyman allocation we
have
Var x    x
Var 1 1 2
 
X
2

k
W X 
st PROP   i i
SRSWOR
n N  i1
… (19)
 x PRO x st NEY
P
and Var st
 Var
1 
n S  ... (20)
k 2 i i
i1 W S
By substituting the value of the variance under proportional allocation in
equation (19) from equation (20), we have
 
Var x Varx 1
k W S
2

 
 S 
SRSWOR st NEY i i
n i1
1 1
  X
  2
…(21)
 N
 W X
i i
n  i1
That means
Var x SRSWOR  Var x st NEY
because both the terms in R.H.S. of equation (21) are positive. From the
results of the relations of variance of simple random sample mean and the
56
variance of stratified sample means with proportional and Neyman Stratified Random
Sampling
allocations, we can reach on the conclusion that
Var x SRSWOR  Var x st PROP  Var x st NEY
3.6.4 Merits and Demerits of Stratified Random Sampling
Merits

1. More Representative
Stratified random sampling ensures any desired representation in the
sample of the various strata in the population. It overruled the probability
of any essential group of the population being completely excluded in the
sample.

2. Greater Accuracy
Stratified random sampling provides estimate of parameters with
increased precision in comparison to simple random sampling. Stratified
random sampling also enables us to obtain the results of known precision
for each of the stratum.

3. Administrative Convenience
The stratified random samples would be more concentrated geographically
in comparison to simple random samples. Therefore, this method needs
less time and money involved in interviewing the supervision of the field
work can be done with greater case and convenience.
Demerits
However, stratified random sampling has some demerits too, which are:

1. May Contain Error due to Subjectiveness


In stratified random sampling the main objective is to stratify the
population in homogeneous strata. But stratification is a subjective issue
and so it may contain error.

2. Lower Efficiency
If the sizes of samples from different stratum are not properly determined
then stratified random sampling may yield a larger variance that means
lower efficiency.
Example 1: A sample of 60 persons is to be drawn from a population
consisting of 600 belonging to two villages A and B. The means and standard
deviations of their marks are given below:

Villages Strata Sizes Means Standar d Deviation


Ni   
Xi  i 
Village A 400 60 20
Village B 200 120 80

Draw a sample using proportional allocation techniques.

57
Statistical Techniques
Solution: If we regard the villages A and B as representing two different
strata then the problem is to draw a stratified random sample of size 30 using
technique of proportional allocation. In proportional allocation, we have
n
n  N
Therefore,
i i
N
n1  60 
 40
400
n2  600
 20
60

200
600
Thus, the required sample sizes for the villages A and B are 40 and 20
respectively.

E1) A sample of 100 employees is to be drawn from a population of collages


A and B. The population means and population mean squares of their
monthly wages are given below:
Village Ni Xi 2
Si
Collage A 300 25 25

Collage B 200 50 100

Draw the samples using proportional and Neyman allocation technique


and compare.

E2) Obtain the sample mean and estimate of the population mean for the
given information in Example 1 discussed above.

3.7 SUMMARY
In this unit, we have discussed:
1. The definition and procedure of stratified random sampling;
2. The principles of stratification;
3. The properties of stratified random sampling;
4. The mean and variance of proportions in stratified random sampling;
5. The allocation of sample size with the help of different techniques; and
6. Calculation of the estimate of population mean and variance of sample
mean.

3.8 SOLUTIONS /ANSWERS

E1) If we regard the collages A and B representing two different strata then
the problem is to draw as stratified sample of 100 employees using
technique of proportional allocation and Neyman’s allocation.
58
In proportional allocation we have Stratified Random
Sampling
n
n 
100
N  N
500
i i
N
k
i
 60

100
n    40
300
1
500
100
n  
200
2
500
In Neyman’s allocation, we have

ni  n 
NiSi
k

NS i i
i1

NS i = 300  5 + 200  10


i
i1 = 1500 + 2000 = 3500
n  100
NiSi i
3500
300  5
n  100   42.85  43
1
3500

200  5
n  100   57.14  57
2
3500
Therefore, the samples regarding the colleges A and B for both
allocations are obtained as:

Proportional Neyman
Collage A 60 43
Collage B 40 57
Total 100 100

E2) We have the following data:

Ni σi N N S2 N σ2 2 2
Village Xi S2  i i i i Xi Ni Xi
σ N1
2 i i

A 400 60 20 400.67 160267.11 160000 3600 3440000

B 200 120 80 6432.16 1286432.16 1280000 14400 880000


Totat 600 1446699.27 1440000 180000 4320000
k
1
N
X  Ni Xi
i1

59
Statistical Techniques 1
 600 400  60  200 120
1 48000
 24000  24000  80
600 600

1 1 N
X
i 2
Var x
i
S

  
i1


PROP
n N N

  600  60  1446699.27 1446699.27


 60  600  600  40000
 

= 36.1675
Var x 1 1
SRSWOR
    S2
 n N
where, 1 k

 
i
2
S2  

  n Xij X
i1 j1

1  k 2


 N 1 
 N  2  k N X  X
i i i i   
 i1 i1 
k
1  
  N 2 
k
N X 2  NX 2
 
N  1  i1
i i
i
i1 

1
 599 1440000  4320000  600  80  80

1 1920000
 599 5760000  3840000 
599
Varx
600  60 1920000
SRSWOR  
600  599
60
Then the conclusion is
Var x st PROP 
36.1675 Var x SRSWOR
 48.08
Therefore, precision of
can be obtained by x st
Var x SRS  Var x st PROP
Gain in precision = 100
Varx st PROP

48.04  36.1675
 36.1675 100

= 32.8 %
60

You might also like