0% found this document useful (0 votes)
153 views45 pages

Sampling Fundamentals Modified

1. Sampling is a method of obtaining information about a population by examining only a subset of its members. It allows estimates of population parameters to be made with less cost and effort than a full census. 2. There are two main types of sampling: probability sampling, where every member of the population has a known, non-zero chance of being selected; and non-probability sampling, where selection is non-random. 3. Common probability sampling designs include simple random sampling, stratified sampling, systematic sampling and cluster sampling. Non-probability methods include convenience sampling and snowball sampling.

Uploaded by

Arjun Khosla
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
153 views45 pages

Sampling Fundamentals Modified

1. Sampling is a method of obtaining information about a population by examining only a subset of its members. It allows estimates of population parameters to be made with less cost and effort than a full census. 2. There are two main types of sampling: probability sampling, where every member of the population has a known, non-zero chance of being selected; and non-probability sampling, where selection is non-random. 3. Common probability sampling designs include simple random sampling, stratified sampling, systematic sampling and cluster sampling. Non-probability methods include convenience sampling and snowball sampling.

Uploaded by

Arjun Khosla
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 45

Sampling fundamentals and Estimation

1
2
Introduction
The need for adequate and reliable data is ever increasing
for taking wise decisions in different fields of human
activity
and business. There are two ways in which the required
information may be obtained:
1. Complete enumeration survey or Census method.
2. Sampling method.
In the first case, data are collected for each and every unit.
i.e Universe/ population (complete set of items).

3
What is Population?
In any field of inquiry, all the items under consideration
constitute ‘population’ or ‘universe’.

A complete enumeration of all the items of ‘population’ is


known as a census inquiry. In such an inquiry it is assumed
that highest accuracy is obtained.
But this type of inquiry involves a great deal of time, money
and energy. Not only this, census inquiry is not possible in
practice under many circumstances.

4
Sample

Hence, quite often we select only a few items from the


universe for our study purposes. The items so selected is
technically called sample.

5
What is Sampling Process?
Sampling may be defined as the selection of some part
of an aggregate or totality using a specific method. on
the basis of which a judgment or inference about the
population (aggregate or totality) is made.

In other words, it is the process of obtaining


information about an entire population by examining
only a part of it.

6
Need/Reason for sampling

1. Sampling can save time and money.


2. Sampling may produce more accurate information if it is
conducted by trained and experienced investigator.
3. Sampling becomes the only option when the population
size is infinite.

7
Sampling Method/Sampling Technique
1. Probability Sampling

2. Non-probability sampling

8
Probability Sampling

 Probability sampling is also known as ‘random sampling’


or ‘chance sampling’.
Samples selected according to some chance are known as
random or probability samples i.e. every item in the
population has known chance of being included in the
sample.

9
Non-probability sampling

On the other hand, non-random or non-probability samples


are those where the selection of sample unit is based on
the judgment of the researcher than randomness.

10
Important Sampling Designs
Probability Sampling:
i. Simple Random Sampling

ii. Systematic Sampling

iii. Stratified Sampling

iv. Cluster Sampling

11
Major non-probability sampling are:
i. Deliberate Sampling/ Purposive sampling/ Judgment
sampling

ii. Quota Sampling

12
Simple Random Sampling Method
Under this sampling design , every item of the universe has
an equal chance of inclusion in the sample.

For example, if we have to select a sample of 300 items


from a universe of 15,000 items, then we can put the names
or numbers of all the 15,000 items on slips of paper and
conduct a lottery.

13
Systematic Sampling

In some instances, the most practical way of sampling is to select


every i th item from the universe where ‘i’ refers to the
sampling interval.

The sampling interval can be determined by dividing the size of


the population by the size of the sample to be chosen.
For example, if we wish to draw 32 names out of the list of 320
names, the sampling interval will be 10. It means every 100th
name will be selected. In this process a random start is always
Preferable, i. e. a start is determined by chance. If the starting no
is say 6, then the sample is composed of the number 6, 16, 26,
36,…..

14
Merits and demerits
Merits :
a. It is a simple method
b. It can be taken as an improvement over a simple random
sample as it spread more evenly over the population.
Demerits:
c. It is not truly random in the strict sense. This is because all
items selected for the sample ( except the first term) are pre-
determined by the constant interval.
d. There are certain dangers too in using this type of sampling. If
there is a hidden periodicity in the population, systematic
sampling will prove to be inefficient method of sampling.
Example , quality checking of 4% sample.
15
Stratified Sampling

If a population from which a sample is to be drawn does not


constitute a homogeneous group,( highly heterogeneous)
stratified sampling technique is generally applied in order to
obtain a representative sample. Under stratified sampling the
population is divided into several sub-populations that are
individually more homogeneous than the total population. The
different sub-populations are called ‘strata’ . Then we select
items from each ‘strata’ to constitute a sample. Since each
‘strata is more homogeneous than the population, we get a
better estimate of the whole.

16
The following three questions are highly
relevant in the context of stratified sampling
a) How to form strata?

b) How should items be selected from each stratum?

c) How many items be selected from each stratum or how


to allocate the sample size of each stratum?

17
Regarding the first question, we can say that the items
which are homogeneous ( i.e. of common characteristics)
should be put in the same group or strata. In other words
strata be formed in such a way that elements are most
homogeneous within the strata and most heterogeneous
between the different strata.

18
In respect to 2nd question, we can say that to choose the
items from each strata we normally adopt simple random
sampling.

19
To answer the 3rd question we have to
understand the following concepts:
Stratified sampling can be of two types: proportionate and
disproportionate.

In proportionate stratified sampling the number of sample


units in various strata are in the same proportion as found
in the population. Thus, larger the particular stratum, the
more weight it receives in the analysis.
Eg: If a bank wants to conduct a survey to understand the
problems that is consumers are facing .

20
Disproportionate Stratified sampling

Here the strata are represented in the total sample in a


proportion other than the one with which they are found in
the population.

21
Number of Employees in Sample

Stata Job Level No. of Proportionate Disproportionate Sample


Employees Sample

1 Top Mgt 15 (15/750)/100 = 2 3

2 Middle Level Mgt 30 (30/750)/100 = 4 10

3 Lower level Mgt 55 (55/750)/100 = 7 15

4 Supervisors 105 (105/750)/100 = 14 25

5 Clerks 510 (510/750)/100 = 68 37

6 Secretaries 35 (35/750)/100 = 5 10

Total 750 100 100

22
Cluster Sampling

In Cluster sampling first we divide the population into


groups called ‘clusters’ and then select some units from the
groups or the clusters for sample. Cluster sampling is
totally
opposite to stratified sampling in the sense that ,
a. The units within each cluster should be as heterogeneous
as possible.

b. There should be small difference between the clusters.

23
Ex. Suppose a departmental store in a town wants to study
only those customers ( i.e the frequency or amount of
transaction made by using credit cards)who purchase goods
from this store by using HDFC bank credit card . To know
that a survey was conducted on the behalf of the store
which reveals that the total no of customers holding cards
is 15,000. sample size is 450. For cluster sampling the list of
15000 customers could form 100 clusters of 150 members
each. From these 100 clusters we have to choose randomly
450 units.

24
Non-probability sampling
Judgment sampling: selection made by choice not by
Chance.
Example: A company wanting to launch a new product may use
judgmental sampling for selecting experts who have prior
knowledge or experience of similar products.
The most common application is B to B marketing.
Convenience sampling :
The only criterion for selecting sampling units is the
convenience of the researcher.
•People interviewed in a shopping mall
•Interviews conducted by a TV channel of people coming out of
a cinema hall, to seek their opinion
25
Snowball Sampling: Snowball sampling is generally used when
it is difficult to identify the members of the desired population.

Quota Sampling: In stratified sampling, to choose sample unit


from each strata this technique is often being used . Here the
interviewer got some quota ( a minimum number) to be filled from
each specified subgroup in the population where actual selection of
units totally depends on the interviewers judgment.

26
Sampling Vs Non-Sampling Error
There are two types of error that may occur while we are
trying to estimate the population parameters from
the sample.

1. Sampling error

2. Non-Sampling error

27
Statistic and Parameter

 A statistic is a characteristics of a sample , whereas a


parameter is a characteristic of a population.

28
The central limit theorem

Take a large (30 or more) random sample of size n from


any population with mean μ and standard deviation σ. The
sample mean, X is approaches the normal distribution with
mean μ and standard deviation . 
n
  
X ~  N   , 
 n

29
Sample size for estimating population mean

X = Sample mean  The value of standard



μ = Population mean error is
n
N= Population Size ( when samples are drawn
from an infinite
n = Sample size population)
σ = Population standard
deviation

30
Confidence Interval: The interval within which the population

parameter is expected to lie.

Confidence Interval = Point Estimate ± Margin of Error

Margin of Error: The value added or subtracted from a point

estimate in order to develop an interval estimate of a population

parameter.

Margin of Error = Z x Standard Error of a Particular Statistic

Zc = Value of Standard Normal Variable


31
32
Q1.The average monthly electricity consumption for a
sample of 100 families is 1250 units. Assuming the
standard deviation of electricity consumption of all
families is 150 units, construct a 99% confidence interval
estimate of the actual average monthly electricity
consumption.

33
Confidence Interval = 1250 ± 2.58 x 150/√100

= 1250 ± 38.70 per month

34
Q2. The average monthly electricity consumption for a

sample of 100 families out of 1000 families is 1250 units.

Assuming the standard deviation of electricity consumption

of all families is 150 units, construct a 99% confidence

interval estimate of the actual average monthly electricity

consumption.

35
Q3.The HR department of an organization would like to estimate the

family medical expenses of its employees to determine the feasibility

of providing a medical insurance plan. A random sample of 10

employees reveals the family medical expenses (INR thousands) in

the last year as under:

11, 37, 25, 62, 51, 21, 18, 43, 32, 20

Set up a 99% confidence interval of the average family medical

expenses for the employees of this organization.

36
Sample Size Determination

An economist is interested in estimating the average


monthly household expenditure on food items by the
households of a town. Based on past data, it is estimated
that he standard deviation of the population on the
monthly expenditure on food item is 30 rupees. With
allowable error set at 7 rupees , estimate the sample size
required at a 90 percent confidence.

37
n= (z 2 . σ 2)/e 2

n= 50 ( approx)

38
It is desired to estimate the mean life of a certain kind
of vacuum cleaner. Given that the population standard
deviation 320 days, how large a sample is needed to be
able to assert with a confidence level of 96 per cent
that the mean of the sample will differ from the
population mean by less than 45 days ?

39
 At 96 per cent confidence Z= 2.055

n= 214 ( approx)

40
Estimating the Proportion

A manager of a company wants to estimate the proportion


of the company’s workers whose primary reason for staying
on their job . He undertook a survey of 894 respondents
with salaries below 10 lacks per year. He asked them “ What
is the Primary Reason for Staying on Your Job?” For this
sample of 894 respondents 367 indicated that the primary
reason for staying on their job was interesting job
Responsibilities.
Construct a 95 per cent confidence interval for the proportion
of the workers whose primary reason for staying on their job
was interesting job responsibilities.
41
Determination of sample size for
estimating the population proportion
Q.A consumer electronics company wants to determine the
job satisfaction levels of its employees. For this they ask a
simple question , ‘Are you satisfied with your job?’.It is
estimated that no more than 30 percent of the employees
would answer yes. What should be sample size for this
company to estimate the population proportion to ensure
95 per cent confidence in result and to be within 0.04 of
the true population parameter.

42
n= (z 2 . pq)/e 2

= 505 ( approx)

43
Q. A researcher wants to have 90 % confidence on his
estimation. He wants to estimate the proportion of office
workers who respond to office email within an hour. The
error should be within ±0.05. Since no one has previously
undertaken such a study, there is no information available Wi
t
h
i

from past data. Determine the sample size needed.


n

44
Ans: 271 office workers

45

You might also like