Sampling and Sampling Distribution 39
Chapter 3 Sampling and Sampling Distribution
Lesson 3.1: Sampling and Sampling Distribution
Sampling
Oftentimes, in our research or even in daily activities, we are concerned
with a large group of people or objects. It is of course very difficult, or sometimes
impossible, to deal with every member of this large group, known as the population.
In times like this, we have a remedy, which is selecting a portion of the population
known as a sample. This process is called sampling. One of the best methods of
sampling, which is usually used in research, is called random sampling.
Definition
The totality of subjects (people, animals or objects) under consideration is
called population. The portion chosen from a population is called Sample and
the process of taking samples is called sampling.
Random Sampling refers to the sampling technique in which each member of
the population is given equal chance to be chosen as part of the sample. The
lottery method, drawing lots, or the use of random numbers can be used to
accomplish random sampling.
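The lottery method described above can be sketched in a few lines of Python. This is only an illustration: the population of 25 numbered members, the sample size of 5, and the seed are assumptions chosen for the example, not values from the lesson.

```python
import random

# Hypothetical population: 25 numbered members (labels are illustrative).
population = list(range(1, 26))

# A fixed seed is used only so the illustration is repeatable;
# leave it out for a real draw.
rng = random.Random(2015)

# random.sample draws without replacement, with every member equally
# likely to be chosen, which mirrors drawing lots.
sample = rng.sample(population, k=5)
print(sample)
```

Each run with a different seed gives a different sample, which is exactly why quantities computed from a sample vary from one sample to the next.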
Parameter and Statistic
If a population we are concerned with is finite or small in number, say the
25 captive-bred Philippine Eagles successfully produced by the Philippine Eagle
Foundation (PEF) as of October 15, 2015, then we can easily describe it. Every
measurement or quantity that represents the general characteristics of this
population, say the average height of these 25 captive-bred raptors is 3.5 meters,
is called a parameter.
On the other hand, if we are dealing with a very large population and we
have resorted to sampling, then, every measurement or quantity that describes the
characteristics of the sample is called sample statistic or simply statistic.
Definition
The measurement or quantity that describes the population is called parameter,
while the measurement or quantity that describes the sample is called statistic.
Sampling Distribution of the Sample Mean
Suppose a jar contains numbers 1, 3 and 5. If we take two numbers in
succession with replacement, then, the possible 2-number samples are: (1,1), (3,
3), (5, 5), (1,3), (3,1), (1,5), (5, 1), (3, 5) and (5, 3). The average or mean of each
pair, in that order are 1, 3, 5, 2, 2, 3, 3, 4 and 4. If we denote the means as random
variable X, then:
X = {1, 2, 3, 4, 5}
As we can see, P(1) = 1/9, P(2) = 2/9, P(3) = 3/9 or 1/3, P(4) = 2/9, and P(5) = 1/9.
Therefore, the probability distribution of X is:
x       1     2     3     4     5
P(x)    1/9   2/9   1/3   2/9   1/9
The probability distribution above represents the means of the samples,
that’s why the distribution is now called Sampling Distribution of the Sample
Means.
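The jar example can be checked by listing every ordered sample and counting how often each mean occurs. The sketch below is one way to do it; the function name `sampling_distribution` is made up for this illustration, and it assumes draws with replacement in which every ordered sample is equally likely.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def sampling_distribution(population, n):
    """Return {sample mean: probability} for ordered samples of size n
    drawn with replacement, every sample being equally likely."""
    samples = list(product(population, repeat=n))
    counts = Counter(Fraction(sum(s), n) for s in samples)
    return {mean: Fraction(c, len(samples)) for mean, c in sorted(counts.items())}


dist = sampling_distribution([1, 3, 5], n=2)
for mean, prob in dist.items():
    print(mean, prob)
```

Exact fractions are used instead of decimals so that the probabilities can be compared directly with 1/9, 2/9, and 1/3.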
Example 1:
In order to test the effect of a new drug on humans, 20 patients were given
the dose. After a minute, it was found that the body temperature, on average,
decreased by 2°C. Answer the following:
a) Are the 20 patients mentioned above a population or a sample?
b) Is the 2°C decrease in the body temperature considered a parameter or a
statistic?
Answer:
a. The 20 patients taken are considered a sample.
b. Since the measurement 2°C refers to the average decrease of the 20 patients
(sample), it is therefore considered a statistic.
Example 2:
Construct the sampling distribution of the sample means when two dice
are rolled.
Answer:
If we construct a table for the mean of the results, it would be:
Die 1 \ Die 2     1     2     3     4     5     6
1                 1     1.5   2     2.5   3     3.5
2                 1.5   2     2.5   3     3.5   4
3                 2     2.5   3     3.5   4     4.5
4                 2.5   3     3.5   4     4.5   5
5                 3     3.5   4     4.5   5     5.5
6                 3.5   4     4.5   5     5.5   6
The sampling distribution of the sample means is:
Mean          1     1.5   2     2.5   3     3.5   4     4.5   5     5.5   6
Probability   1/36  2/36  3/36  4/36  5/36  6/36  5/36  4/36  3/36  2/36  1/36
Practice Exercise 3.1
A. Determine if the given subject is a population or a sample, then describe the
given quantity as a parameter or a statistic:
1. The average grade of the whole class under study is 82.15.
Whole class:
Average grade (82.15):
2. 50 out of the 200 animals in the zoo were taken and checked on their
weight. The variance of their weight is 12.5 kg².
50 animals:
Variance (12.5 kg²):
3. The standard deviation of the life span of a species endemic in the
Philippines is 2.3 years.
A species endemic in the Philippines:
Standard deviation (2.3 years):
4. Based on a survey conducted among 1,200 respondents, 1 out of 3 Filipinos
can’t live without a cell phone.
1,200 respondents:
1 out of 3 Filipinos can’t live without a cell phone:
5. Based on the US National Hospital Discharge Record in 2010, the
average length of stay of patients in US hospitals is 4.8 days.
Patients:
Average stay (4.8 days):
B. Construct the sampling distribution of the sample means and answer the
questions that follow:
B.1. A jar contains the numbers 1, 2, 3 and 4. Construct the sampling distribution
of the sample means when two numbers are taken from the jar with replacement.
1. What is the probability that the mean of the numbers is 2.5?
2. What is the probability that the mean of the numbers is less than 2?
3. What is the probability that the mean of the numbers is greater than 1.5?
4. What is the probability that the mean of the numbers is between 1.5 and 5?
5. Construct the histogram of the sampling distribution.
B.2. Adrian Cedrick receives 82 or 83 as his grade in each of his three major subjects.
Construct the sampling distribution of his mean grade.
6. What is the probability that his mean grade is lower than 83?
7. What is the probability that his mean grade is greater than 82.33?
8. What is the probability that his mean grade is 82.67?
9. What is the probability that his mean grade is between 82.33 and 83?
10. Construct the histogram of the sampling distribution of the mean grade.
B.3. Three containers each contain the numbers 0, 1, and 2. Construct the sampling
distribution of the sample mean when a number is taken from each container.
11. What is the probability that the sample mean is less than 1?
12. What is the probability that the sample mean is greater than 0.67?
13. What is the probability that the sample mean is 1?
14. What is the probability that the mean is between 1 and 2?
15. Construct the histogram of the sampling distribution of the sample means.
The concept of sampling distribution lies at the foundation of
statistical inference. In this chapter, you will be introduced to the
key concepts of sampling and sampling distributions of the sample
mean. You will also learn to apply these concepts in solving real-life
problems.
- Discuss the probability distribution of the sample mean
- Determine the mean, variance, and standard deviation of the
sample mean
- Distinguish the sampling distribution of $\bar{X}$ when the sample
size is large
- Distinguish the sampling distribution of $\bar{X}$ when the population
is normal
Basic Concepts on Sampling Distributions
Suppose you want to know some characteristics of the population.
It is, however, difficult to collect data from the entire population. If
you are trying to describe a population, what you can do instead is
to randomly collect data from a sample. Consider this example: a
player was given four pool balls having the numbers 5, 6, 9, and 10.
Since the player has a total of four balls, these four balls represent
the population of interest. The average of the numbers on all the
balls is the parameter. Suppose two balls are selected randomly
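(without replacement) and the average of their numbers is computed.
All possible outcomes are shown in Table 4.1.
Table 4.1 All possible outcomes when two balls are selected
randomly without replacement
Outcome   First ball   Second ball   Mean
1         5            6             5.5
2         5            9             7.0
3         5            10            7.5
4         6            5             5.5
5         6            9             7.5
6         6            10            8.0
7         9            5             7.0
8         9            6             7.5
9         9            10            9.5
10        10           5             7.5
11        10           6             8.0
12        10           9             9.5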
The sample we get from the first outcome is different from the
second outcome. In repeated sampling, the value of the sample mean
would vary from sample to sample. A sample mean is the average of
a small sample taken from the population while a population mean
is the mean of the whole population of the data set. Notice that the
means are 5.5, 7.0, 7.5, 8.0, or 9.5. The frequencies of these means are
shown in the table below. The relative frequencies are equal to the
frequencies divided by the total outcomes.
Table 4.2 Frequencies of Means for n = 2
Sample mean   Frequency   Relative frequency
5.5           2           0.1667
7.0           2           0.1667
7.5           4           0.3333
8.0           2           0.1667
9.5           2           0.1667
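The frequencies of the means can be tallied by enumerating the outcomes in code. The sketch below assumes ordered draws of two different balls (no ball is drawn twice), since that is what produces the 12 outcomes implied by the relative frequency 0.1667 for a frequency of 2; the variable names are illustrative.

```python
from collections import Counter
from itertools import permutations

balls = [5, 6, 9, 10]

# Assumption: ordered draws of two different balls (no replacement),
# which gives the 12 outcomes implied by the 0.1667 entries of Table 4.2.
outcomes = list(permutations(balls, 2))
frequencies = Counter(sum(pair) / 2 for pair in outcomes)

# Print each mean with its frequency and relative frequency.
for mean in sorted(frequencies):
    count = frequencies[mean]
    print(f"{mean:4.1f}  {count}  {count / len(outcomes):.4f}")
```

Replacing `permutations(balls, 2)` with `itertools.product(balls, repeat=2)` would instead list the 16 outcomes of drawing with replacement, which adds the means 5, 6, 9, and 10.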
Presenting the data in a histogram, we have
Figure 4.1 Distribution of Means for n = 2
Figure 4.1 shows a relative frequency distribution of the means
based on the values of Table 4.2. After randomly selecting two
balls out of four balls repeatedly, the means of those samples would
cluster around a certain value. In our example, the values clustered
around the mean value 7.5. This suggests that the population mean is 7.5
or close to it; here it is exactly (5 + 6 + 9 + 10)/4 = 7.5.
A statistic is a number computed from a sample.
The distribution of a statistic over repeatedly drawn samples from
the population is called its sampling distribution. It is the probability
distribution of that statistic. It is also the information that allows
us to go from descriptive statistics to inferential statistics. In many
situations, the sample mean has a distribution that is approximately
normal. Typically, we draw one sample from a population. But the
concept of a sampling distribution is very important. The value of
a statistic that we see in our sample will be a random draw from
the statistic’s sampling distribution. In mathematical arguments, the
statistic’s sampling distribution is used to make statements about
population parameters.
Mean and Standard Deviation of the Sample Mean
We take samples from the population because it is difficult to
measure the whole population. If we take the sample mean of the
samples from the population, do we expect that the value is exactly
equal to the population mean $\mu$? The answer is no. If we kept drawing
samples from the population, all of the same size $n$, and computed
the sample mean of each one, we would likely get a different value
each time. We can consider the sample mean as a random variable. We will write
$\bar{X}$ when the sample mean is considered as a random variable and $\bar{x}$
for each value that it takes.
For example, given the following normal distribution of the
population with mean $\mu$, we repeatedly take samples then compute
the mean of each. The sample mean may not be equal to the
population mean $\mu$ but the values may get closer to it. Each $\bar{x}$
represents the sample mean of each sample taken.
[Figure: distribution of the population, with the sample means $\bar{x}$ marked]
We now have the new distribution of the sample means $\bar{x}_1, \bar{x}_2, \bar{x}_3, \ldots$
[Figure: distribution of the sample mean $\bar{X}$]
If all possible random samples, each of size $n$, are taken from any
population with mean $\mu$ and standard deviation $\sigma$, then the sampling
distribution of the sample mean will have the following properties:
* The mean of the sample means will be equal to the population
mean:
$$\mu_{\bar{X}} = \mu$$
* The standard deviation of the sample means will be equal
to the standard deviation of the population $\sigma$ divided by the
square root of the sample size $n$:
$$\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}}$$
The most commonly used measure of dispersion is the
standard deviation, which is simply the square root of the
variance. If the standard deviation of the sample mean is
$\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}}$, then the variance of the sample mean is
$$\mathrm{Var}(\bar{X}) = \sigma_{\bar{X}}^2 = \frac{\sigma^2}{n}$$
As the sample size $n$ increases, the variance of the
sample mean decreases. It means that if larger samples are
taken from the population, the variance will be smaller, and the
sample mean will tend to be closer to the population mean.
If the population is normally distributed, then the sampling
distribution will also be normally distributed for all sample
sizes.
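The two properties of the sample mean can be verified exactly on a small population by listing every possible sample. The sketch below is an illustration under a stated assumption: the draws are made with replacement, so they are independent, which is the setting in which the variance of the sample mean equals the population variance divided by $n$. The population values and the helper names `mean` and `variance` are chosen only for this example.

```python
from fractions import Fraction
from itertools import product


def mean(values):
    values = list(values)
    return sum(values, Fraction(0)) / len(values)


def variance(values):
    """Population variance: average squared deviation from the mean."""
    values = list(values)
    m = mean(values)
    return mean((v - m) ** 2 for v in values)


population = [5, 6, 9, 10]  # illustrative values
n = 2

# Every ordered sample of size n drawn with replacement is equally likely.
sample_means = [mean(s) for s in product(population, repeat=n)]

print(mean(sample_means), mean(population))              # both 15/2
print(variance(sample_means), variance(population) / n)  # both 17/8
```

Exact fractions avoid rounding, so the two sides of each property can be compared for equality rather than approximate agreement.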
Example 4.1
All vehicles for registration in the Philippines are required to
pay the tax value. The value depends on the model of the car.
The mean and standard deviation of the tax value of Dan’s car are
$\mu$ = P541,000 and $\sigma$ = P167,200 respectively. Suppose random
samples of size 100,000 are drawn from the population of vehicles.
What are the mean, variance, and standard deviation of the sample
mean?
Solution
Since $n = 100{,}000$,
$$\mu_{\bar{X}} = \mu = 541{,}000$$
$$\mathrm{Var}(\bar{X}) = \frac{\sigma^2}{n} = \frac{(167{,}200)^2}{100{,}000} = 279{,}558.4$$
$$\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} = \frac{167{,}200}{\sqrt{100{,}000}} \approx 528.73$$
The sample mean has a mean of 541,000; variance of 279,558.4;
and a standard deviation of 528.73.
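The arithmetic in Example 4.1 can be reproduced with a short helper. The function name `sample_mean_summary` is made up for this sketch, and it simply applies the formulas for the mean, variance, and standard deviation of the sample mean, assuming independent draws.

```python
import math


def sample_mean_summary(mu, sigma, n):
    """Mean, variance and standard deviation of the sample mean
    for random samples of size n (independent draws assumed)."""
    return mu, sigma**2 / n, sigma / math.sqrt(n)


mean, var, sd = sample_mean_summary(mu=541_000, sigma=167_200, n=100_000)
print(mean, round(var, 1), round(sd, 2))
```

The same helper can be reused for any other values of the population mean, population standard deviation, and sample size.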
Example 4.2
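Given that the distribution of the heights of young women is N($\mu$, $\sigma$) = N
(64.5 inches, 2.5 inches), what will be the mean, variance, and
the standard deviation of the sample mean if you randomly select 10 women as your
sample?
Solution
$$\mu_{\bar{X}} = \mu = 64.5 \text{ inches}$$
$$\mathrm{Var}(\bar{X}) = \frac{\sigma^2}{n} = \frac{(2.5)^2}{10} = 0.625$$
$$\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} = \frac{2.5}{\sqrt{10}} \approx 0.79 \text{ inch}$$
The sample mean has a mean of 64.5 inches; variance of 0.625;
and standard deviation of 0.79 inch.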
Example 4.3
Using the same distribution in Example 4.2, what will be the
mean, variance, and standard deviation of the sample mean for a sample size of 100?
Solution
$$\mu_{\bar{X}} = \mu = 64.5 \text{ inches}$$
$$\mathrm{Var}(\bar{X}) = \frac{\sigma^2}{n} = \frac{(2.5)^2}{100} = 0.0625$$
$$\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} = \frac{2.5}{\sqrt{100}} = 0.25 \text{ inch}$$
The sample mean has a mean of 64.5 inches; variance of 0.0625;
and standard deviation of 0.25 inch.
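The results of Examples 4.2 and 4.3 can also be checked by simulation: draw many samples from the normal population, take the mean of each, and look at how spread out those means are. The sketch below is only an illustration; the seed, the number of repetitions, and the function name `simulated_sd_of_mean` are assumptions, and the simulated values will be close to, not exactly equal to, the theoretical ones.

```python
import random
import statistics

rng = random.Random(42)  # fixed seed so the run is repeatable
mu, sigma = 64.5, 2.5    # population values from Example 4.2


def simulated_sd_of_mean(n, repetitions=5_000):
    """Standard deviation of many simulated sample means of size n."""
    means = [
        statistics.fmean(rng.gauss(mu, sigma) for _ in range(n))
        for _ in range(repetitions)
    ]
    return statistics.pstdev(means)


# Compare the simulated spread with the theoretical sigma / sqrt(n).
results = {n: simulated_sd_of_mean(n) for n in (10, 100)}
for n, sd in results.items():
    print(n, round(sd, 3), round(sigma / n**0.5, 3))
```

Increasing the sample size from 10 to 100 should shrink the spread of the sample means by a factor of about $\sqrt{10}$, in line with the two examples.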