G11 Modules
G11 Modules
OUTCOME-BASED EDUCATION
QUARTER 3
LEARNING
MODULE WEEK 8/8A
1
MODULE IN
STATISTICS & PROBABILITY
QUARTER 3
WEEK 8/8A
2
What I Need to Know
In the previous lessons, you have learned the probability distributions and sampling
distributions. You also learned that sample statistics could be generated from sample surveys.
These sample statistics are descriptive summary measures that serve as estimates of the
corresponding population characteristics or parameters.
In this module, you will study situations when sample values are not large enough for the
Central Limit Theorem to be applied, so you should have a better understanding of the t-
distribution with simple examples. Specifically, the t-distribution is highlighted in this lesson
and its importance when making inferences about the population parameter.
There are two major ways of making inferences about the population: one is estimation,
and the other is hypothesis testing. The concepts and foundations of estimation will be
introduced and illustrated. Moreover, finding the length of the interval and sample
determination are also highlighted in this lesson, and the sample size is an important aspect
in estimation.
Objectives:
3
What I Know
Direction: Read each item carefully. Select the correct answer among the choices given and
write the letter of your choice.
1. Which of the following is another term used to describe t-distribution?
A. binomial t-distribution B. z-distribution
C. geometric t-distribution D. student’s t-distribution
2. Which of the following is NOT a characteristic of t-distribution?
A. It is a leptokurtic distribution. B. It is a mesokurtic distribution.
C. It is symmetrical about the mean. D. The mean of the distribution is not equal to zero.
3. Which of the following is NOT TRUE about z- and t-distribution?
A. Both distributions have bell-shaped graphs.
B. Both assume that the population variance is given.
C. Both can transform individual scores to standardized scores.
D. Both can be used for the hypothesis testing of the population mean.
4. Which of the following t-values describes a two-tailed test with a 95% confidence level and
degrees of freedom of 20?
A. 1.729 B. 2.086 C. 2.093 D. 2.861
5. In a t-distribution, the critical values are based on ____.
A. n B. t C. z D. df
6. When n < 30 and the population standard deviation is not known, what is the appropriate
distribution to use?
A. p B. r C. t D. z
𝑥̅ −𝜇 𝑠
7. In the t-distribution formula, 𝑡 = 𝑠 , what do you call the number of values?
√𝑛
√𝑛
A. p B. r C. t D. z
8. Which of the following describes the absolute difference between the upper confidence limit
and the lower confidence limit?
A. Margin of Error B. Confidence Limits
C. Confidence Level D. Length of the Confidence Interval
9. In a research study about students' learning style, 1 200 students were asked if they can
assimilate concepts while watching television. There were 586 students who said Yes.
Using a 95% confidence level, what is the confidence interval estimate for the population
proportion p?
A. 44% to 54% B. 45% to 53% C. 46% to 52% D. 48% to 50%
10. What is the length of the confidence interval in item #9?
A. 0.02 B. 0.06 C. 0.08 D. 0.10
11. What is the length of the confidence interval given the following data: 𝑝̂ = 0.25,
𝑛 = 400, 𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑙𝑒𝑣𝑒𝑙 = 95%?
A. 0.072 B. 0.078 C. 0.080 D. 0.085
12. What is the sample size required to estimate a population proportion given the following
data: 𝑝̂ = 0.48, 𝐸 = 0.042, 𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑙𝑒𝑣𝑒𝑙 = 90%?
A. 300 B. 338 C. 382 D. 383
4
Lesson
1 The t-Distribution
What’s In
What’s New
Direction: Given next are the z- and t-distribution curves and their properties. Using a Venn
diagram, classify each property if it belongs to the z-distribution or t-distribution, or
both. Write only the number that corresponds to each property in the Venn
Diagram that follows. Write your answer on a separate sheet of paper.
5
The z- and t- distribution Properties of z- and t-distribution
6
Using the table above, find the confidence coefficients or the t-value for each of the following
given the following information. (An example is done for you).
What is It
The t-Distribution
t – Distribution – a family of distributions that look almost identical to the normal distribution
curve, only a bit shorter and flatter. Just like the normal curve, a bell-shaped and unimodal. It
is formulated in 1908 by an Irish brewing employee named William S. Gosset. It is sometimes
called the student’s t-distribution.
Statistics analysis on some studies which cannot be done using the normal distribution can be
done using the t-distribution. The t-distribution is used with small samples taken from an approximately
normal population. While z-statistic is learned in the previous lessons, which uses population variance,
the t-statistic uses the sample deviation, especially when the population variance is unknown. The z-
statistic is used when 𝑛 ≥ 30, while the t-statistic is used when 𝑛 < 30.
The t-values, found in the Table of t Critical Value or the student’s t-Distribution, are
proportions of the areas of the t-curve. They are also called Confidence Coefficients.
𝑥̅ −𝜇
𝑥̅ = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑚𝑒𝑎𝑛
The t-Distribution Formula is: 𝑡𝛼,𝑛−1 = 𝑠 where 𝜇 = 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑚𝑒𝑎𝑛
√𝑛
𝑠 = 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 𝑜𝑓 𝑡ℎ𝑒 𝑠𝑎𝑚𝑝𝑙𝑒
𝑛 = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑖𝑧𝑒
7
Observe that on the t-table:
The value of t depends on the degrees of freedom (d.f.) and the level of significance
𝛼. The level of significance for directional tests (1-tailed) is different from that of non-
directional (2-tailed) tests.
Degrees of Freedom, denoted by d.f., is the number of values that are free to vary
after the sample statistic has been computed. These values tell us the specific curve to use
when a distribution consists of a family of curves.
Example: If the mean of 5 values is 10, then 4 of the 5 are free to vary. But once the
4 values are selected, the 5th value must be a specific number to get a sum of 50. Thus, if n
=5, df = n - 1 = 4.
𝒕𝜶 −𝒕𝜶
When to use the t-distribution instead of z-distribution? When n < 30 and the sample
standard deviation is known.
8
Remark: Take note that this concept is very important when dealing with hypothesis testing in the next chapter.
SOLUTION:
Example 2: Find the critical value t for constructing a 95% confidence interval for a mean of sample
size n = 18 observations.
SOLUTION:
Example 3. Find the 90th percentile of the t-distribution with 12 degrees of freedom
SOLUTION:
The percentile of the t-distribution is the value of the variable
that has an area to the left of the given percentage.
The value of the 90th percentile of the distribution is 1.356 90th percentile
9
Study the following examples:
10
the degrees of freedom in the 𝛼 = 0.05
column, and the value of the 𝛼 𝛼
= 0.025
in the row. 2
Example 4 SOLUTION
The graph of distribution with the given df = 15 is a) Given that df = 15 and 𝛼 = 0.05
shown below. (1 − 𝛼) 100% = 𝜌
(1 − 0.05)100% = 𝜌
Answer each of the following: 0.95 (100%) = 𝜌
a) If the shaded area on the right is 0.05, 𝜌 = 95%
what is the area to the left of 𝑡1 ?
𝜌 = 0.95 area to the left of 𝑡1 .
b) What does 𝑡1 represent?
c) Find the value of 𝑡1 .
b) Hence, 𝑡1 𝑟𝑒𝑝𝑟𝑒𝑠𝑒𝑛𝑡𝑠 the 95th percentile
𝑡0.95 .
What’s More
Activity 4: Solve Me!
Direction: Given each graph of a t-distribution, answer the questions that follow. Write
your answer on a separate sheet of paper.
PROBLEM/SITUATION SOLUTION
11
Find the value of 𝑡1 such that the:
a) shaded area on the left is equal to 0.10
b) shaded area on the right is 0.10
c) total shaded area is 0.02
d) area to the left of 𝑡1 = 0.95
________ 6. df = 6
________ 7. df = 25
________ 8. df = 40
Direction: Find the critical values of t when the area of the left-hand tail of the t-distribution
is 0.10 given the degrees of freedom df.
________ 9. df = 7
________ 10. df = 24
12
What I Have Learned
C. t-values, found in the t-Table – are proportions of the areas in two-tails of the t-curve.
They are called critical values of t. They are also called confidence coefficients.
What I Can Do
Activity 6: Let Me Confirm!
Direction: Complete the solution to the given situation below. Write your answer on a
separate sheet of paper.
Situation: A student suspects that the data she collected for the research study does not
represent the target population. Here are the data she collected.
16 20 22 25 26
27 29 30 30 32
34 30 37 42 35
The population mean is 27. Assuming normality in the target population, is the student’s
suspicion correct? Use a 90% confidence level.
13
Lesson LENGTH OF THE
2 INTERVAL ESTIMATE AND
SAMPLE SIZE
What’s In
Direction: Choose the letter of the answer that correctly describes each statement.
1. In statistics, what is the process by which one makes inferences about a population based
on information obtained from a sample?
A. Estimator B. Estimate
C. Point Estimation D. Estimation
2. What do you call a single value that may refer to a parameter value?
A. z-statistic B. Point Estimate
C. t-statistic D. Interval Estimate
5. Which of the following best illustrate the statement, as the sample size n increases, the
shape of the distribution of the sample means taken with replacement from a population
with mean 𝜇 and standard deviation 𝜎 will approach a normal distribution?
A. The z-distribution B. The t-distribution
C. The Bayes’ Theorem D. The Central Limit Theorem
What’s New
Point estimation is an estimation that results in a single value of the statistic, while
interval Estimation is an estimation that results in a range of values of the statistic.
14
What is It
Point Estimation
Among all parameters, the population mean is the most commonly estimated and tested. This
is because of the characteristic and usefulness of this measure to various fields and settings.
The sample mean 𝑥̅ is the best point estimate for the population mean 𝝁.
Example 2: The following monthly salaries (In Philippine Peso) were gathered from a
sample of randomly selected ten teachers in the public school.
The sample proportion 𝑝̂ is the best point estimate of the population proportion p.
Take note that in getting the population proportion, we use the formula:
𝑥
𝑝̂ = , where 𝑝̂ (read as “p hat”) is the point estimator
𝑛
x is the number of successes
n is the sample size
It follows that 𝑞̂ = 1 − 𝑝̂ , where 𝑞̂ (𝑟𝑒𝑎𝑑 𝑎𝑠 "q hat") is the sample proportion of failures in the
sample size n.
Example 3: In a recent survey of 850 Grade 11 students, it was found out that 476 of them
have laptops. What is the sample proportion 𝑝̂ ?
Solution: Given: x = 476; n = 850; 𝑝̂ = ?
Find 𝑝̂ .
𝑥 476
𝑝̂ = = = 0.56 𝑜𝑟 56%
𝑛 850
Example 4: In a random sample of 120 Grade 11 students were asked if they like modular
learning as their learning modality preference that they could use in this time of
new normal due to the covid-19 pandemic crisis, 48 of them said YES. What is
the sample proportion 𝑝̂ ?
15
Margin of Error
Confidence Level in an interval estimate of a parameter is the probability that the interval
estimate contains the parameter. It describes what percentage of the interval from many
different samples contain the unknown population parameter. There are three commonly used
confidence levels: 90%, 95%, and 99%.
Example 5: Calculate the margin of error using the formula given above with given the
confidence level, standard deviation, and sample size. Find the value of 𝑧𝛼.
2
Complete the table below using the z-table.
1. 80% 2.5 30
2. 90% 5 50
3. 95% 12 100
4. 98% 15 75
5. 99% 10 50
Example 6: Calculate the margin of error E using the formula given above with given the
confidence level, standard deviation, and sample size. Find the value of 𝑡𝛼.
2
16
Margin of Error for Population Proportion p
The margin of error E in the case of population proportion p is similar to the case of
𝑝𝑞
the population mean except for the fact that the standard error 𝑆𝐸𝑝 = 𝑧𝛼 √ . Take note that
𝑛
2
𝑥
𝑝̂ , n𝑞̂ > 5, recall that 𝑝̂ = , where x = number of successes and n = sample size.
𝑛
Example 7: Calculate the margin of error using the formula given above with given the
confidence level, standard deviation and sample size. Find the value of 𝑧𝛼.
2
Complete the table below.
𝑧𝛼 Number of
Confidence Sample 𝑥 Margin of
2 Successes 𝑝̂ = 𝑞̂ = 1 - 𝑝̂
Level Size n 𝑛 Error E
x
1. 80% 30 20
2. 90% 50 30
3. 95% 100 80
4. 98% 75 35
5. 99% 50 20
*Use z-table
Interval Estimation
Example 8: The sample mean weight of 50 Grade 11 students is between 57.5 kgs and
62.5 kgs, with a 95% level of confidence.
Estimator: sample mean
Estimate: 57.5 – 62.5 kgs.
The range of values is the interval estimate or confidence interval where the
confidence level, the margin of error, and the range of values of the computed statistic are
presented. The confidence level of the confidence coefficient is (1 – 𝛼)100%. The common
values of confidence level are their corresponding z-score which will help you in finding the
confidence interval are presented in the table that follows:
Confidence Level 𝜶 z-score (two-tailed)
90% 0.10 ±1.645
95% 0.05 ±1.96
98% 0.02 ±2.33
99% 0.01 ±2.575
17
𝒙̅ = mean of the random sample of size n or the point estimate
𝜶
𝒛𝜶 = called the critical value or z-value or confidence coefficient at (1- ) confidence
𝟐 𝟐
level
𝝈
= called the standard Error of the mean
√𝒏
n = sample size
𝜎 = population standard deviation = √𝜎 2
For a specific value, 𝛼 = 0.05, the distribution of 𝒙̅ ‘s is:
Example 9: The mean score of a random sample of 49 Grade 11 students who took the
First Statistics Quiz is 78. The population variance is known to be 0.16. At 95%
confidence level, find the interval estimate of all Grade 11 students' mean
scores.
SOLUTION:
Step 1. Find the value of 𝒛𝜶 .
𝟐
Given: ̅ = 78;
𝒙 𝜎 𝟐 = 0.16; 𝜎 = √0.16 = 0.4; 𝑛 = 49
Solving for 𝛼:
Confidence level = 95%
(1 – 𝛼)100% = 95%
(1 – 𝛼)1 = 0.95
1 – 𝛼 = 0.95
𝛼 = 1 − 0.95
𝛼 = 0.05 𝛼 𝑖𝑠 𝑑𝑖𝑣𝑖𝑑𝑒𝑑 𝑏𝑦 2 𝑠𝑖𝑛𝑐𝑒 𝑡ℎ𝑒 𝑖𝑛𝑡𝑒𝑟𝑒𝑠𝑡 𝑖𝑠 𝑜𝑛
𝜶
Solving for ,
𝟐
α 0.05 𝑒𝑖𝑡ℎ𝑒𝑟 𝑙𝑒𝑓𝑡 𝑜𝑟 𝑟𝑖𝑔ℎ𝑡 𝑜𝑓 𝑡ℎ𝑒 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑚𝑒𝑎𝑛
= = 0.025
2 2
18
Step 3. Substitute the values of 𝒙̅ and E in the confidence interval
σ σ
x̅ − zα ( ) < μ < x̅ + zα ( )
2 √n 2 √n
x̅ − E < μ < x̅ + E
78 − 0.112 < μ < 78 + 0.112
77.888 < μ < 78.112
77.89 < μ < 78.11
So, 77.89 = Lower Confidence Limit (LCL)
78.11 = Upper Confidence Limit (UCL)
Interpretation:
Thus, with 95% confidence, the sample mean 𝑥̅ = 78 differs from the population mean 𝜇 by no
more than 0.112 or 0.11, and the true population mean 𝜇 is between 77.89 and 78.11.
B. Interval Estimation for Population Mean 𝝁 with Unknown Variance, but the Sample
Size is Large
This is when the population standard deviation is not given, but the sample size is
sufficiently large (n ≥ 30). If the confidence level of (1 – 𝛼)100%, then the confidence
interval can be calculated using the estimator below:
𝑠 𝑠
𝑥̅ − 𝑧𝛼 ( ) < 𝜇 < 𝑥̅ + 𝑧𝛼 ( ) where,
2 √𝑛 2 √𝑛
𝑠 𝜎
𝐸 = 𝑧𝛼 ( ) = margin of error; = standard error of the mean
2 √ 𝑛 √𝑛
𝛼
𝑧𝛼 = critical value or z-value or confidence coefficient at (1- 2 ) confidence level
2
𝑥̅ = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑚𝑒𝑎𝑛; 𝑠 = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛; n = sample size
Example 10: In a sample size of 40 with a mean of 186 and standard deviation of 6, interpret
the confidence interval for the population at a 95% confidence level.
SOLUTION:
Step 1. Find the value of 𝒛𝜶 .
𝟐
Given: ̅ = 186;
𝒙 𝒔 = 6; 𝑛 = 40
Solving for 𝛼:
Confidence level = 95%
(1 – 𝛼)100% = 95%
(1 – 𝛼)1 = 0.95
1 – 𝛼 = 0.95
𝛼 = 1 − 0.95
𝛼 = 0.05 𝛼 𝑖𝑠 𝑑𝑖𝑣𝑖𝑑𝑒𝑑 𝑏𝑦 2 𝑠𝑖𝑛𝑐𝑒 𝑡ℎ𝑒 𝑖𝑛𝑡𝑒𝑟𝑒𝑠𝑡 𝑖𝑠 𝑜𝑛
𝜶
Solving for ,
𝟐
𝜶 𝟎.𝟎𝟓 𝑒𝑖𝑡ℎ𝑒𝑟 𝑙𝑒𝑓𝑡 𝑜𝑟 𝑟𝑖𝑔ℎ𝑡 𝑜𝑓 𝑡ℎ𝑒 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑚𝑒𝑎𝑛
= = 𝟎. 𝟎𝟐𝟓
𝟐 𝟐
19
E = 1.86
Step 3. Substitute the values of 𝒙̅ and E in the confidence interval
s s
x̅ − zα ( ) < μ < x̅ + zα ( )
2 √n 2 √n
x̅ − E < μ < x̅ + E
186 − 1.86 < μ < 186 + 1.86
184.14 < μ < 187.86
So, 184.14 = Lower Confidence Limit
187.86 = Upper Confidence Limit
Interpretation:
With 95% confidence, the sample mean 𝑥̅ = 186 differs from the population mean 𝜇 by no
more than 1.86, and also, the true population mean 𝜇 is between 184.14 and 187.86.
C. Interval Estimation for Population Mean 𝝁 with Unknown Variance but the Sample
Size is Small
The process and formula in calculating a confidence interval for the population mean
𝜇 using the t-distribution is when the sample size is small (n < 30) and the population
variance is unknown.
If the population standard deviation is unknown with a (1 – 𝛼)100% confidence level,
then the confidence interval can be calculated using the estimator below:
𝑠 𝑠
𝑥̅ − 𝑡𝛼 ( ) < 𝜇 < 𝑥̅ + 𝑡𝛼 ( ) where,
2 √𝑛 2 √𝑛
𝑠
𝑥̅ − 𝑡𝛼 ( ) = called the Lower Confidence Boundary or Limit (LCL)
2 √𝑛
𝑠
𝑥̅ + 𝑡𝛼, ( ) = called the Upper Confidence Boundary or Limit (UCL)
2 √ 𝑛
𝑠
𝑡𝛼 ( ) = called the margin of error
2 √𝑛
𝑡 = called the critical value or t-value or confidence coefficient
𝛼
2
𝑛 − 1 = 𝑑𝑒𝑔𝑟𝑒𝑒𝑠 𝑜𝑓 𝑓𝑟𝑒𝑒𝑑𝑜𝑚, n = sample size
𝑠
= standard error of the mean, s=sample standard deviation
√𝑛
Example 11: Mathematics teachers at the high school level are under stress if students’
achievement in their subject is low. A medical student researcher wanted to
know if stress contributes to the high hemoglobin level of these teachers. He
sampled 20 mathematics teachers and found out that the teachers’ average
hemoglobin reading was 16 grams per 100 milliliters with a sample standard
deviation of 2 grams. Find the 95% confidence interval of the true mean.
SOLUTION:
̅ = 16; 𝒔 = 2; 𝑛 = 20; 𝑑. 𝑓. = 20 − 1 = 19
Step 1. Find the value of 𝒕𝜶 . Given: 𝒙
𝟐
Solving for 𝛼:
Confidence level = 95%
(1 – 𝛼)100% = 95%
(1 – 𝛼)1 = 0.95
1 – 𝛼 = 0.95
𝛼 = 1 − 0.95
𝛼 = 0.05
Finding for the value of 𝒕𝜶 ,
𝟐
Use the t-table, find 19 in the left column and 0.05 (two-tails) in the row labeled significance
level 𝛼. The intersection where the two meets gives the value for 𝒕𝜶,𝒏−𝟏 = 2.093.
𝟐
Step 2. Find the Margin of Error E.
𝒔
𝑬 = 𝒕𝜶,𝒏−𝟏 ( )
𝟐 √𝒏
𝟐
𝑬= 𝟐. 𝟎𝟗𝟑 ( )
√𝟐𝟎
20
𝑬 = 0.94
Step 3. Substitute the values of 𝒙̅ and E in the confidence interval
𝒔 𝒔
𝒙̅ − 𝒛𝜶 ( ) < 𝝁 < 𝒙̅ + 𝒛𝜶 ( )
𝟐 √𝒏 𝟐 √𝒏
𝒙̅ − 𝑬 < 𝝁 < 𝒙̅ + 𝑬
𝟏𝟔 − 𝟎. 𝟗𝟒 < 𝝁 < 𝟏𝟔 + 𝟎. 𝟗𝟒
𝟏𝟓. 𝟎𝟔 < 𝝁 < 𝟏𝟔. 𝟗𝟒
So, 15.06 = Lower Confidence Limit (LCL)
16.94 = Upper Confidence Limit (UCL)
Interpretation:
With 95% confidence, the sample mean 𝑥̅ =16 differs from the population mean 𝜇 by no more
than 0.94, and the true population mean 𝜇 is between 15.06 and 16.94.
𝑝̂𝑞̂
𝐸 = 𝑍𝛼 (√ ) = margin of error; n = sample size
2 𝑛
𝑝̂ = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛 𝑜𝑟 𝑡ℎ𝑒 𝑝𝑜𝑖𝑛𝑡 𝑒𝑠𝑡𝑖𝑚𝑎𝑡𝑒 𝑜𝑓 𝑡ℎ𝑒 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛 p
𝑞̂ = 1 − 𝑝̂
𝛼
𝑧𝛼 = critical value or z-value or confidence coefficient at (1- ) confidence level
2 2
𝑝̂𝑞̂
𝜎𝑝̂ = √ = standard deviation of the sampling distribution of 𝑝̂
𝑛
̂‘s is:
For a specific value, 𝛼 = 0.05, the distribution of 𝒑
SOLUTION:
Step 1. Find the value of 𝒑̂ and 𝒒
̂.
Given: 𝒙 = 715; 𝑛 = 1,200
̂ 𝑎𝑛𝑑 𝒒
Solving for 𝒑 ̂:
𝑥
𝑝̂ =
𝑛
715
𝑝̂ =
1,200
𝑝̂ = 0.60
21
𝑞̂ = 1 − 0.60
𝑞̂ = 0.40
(0.60)(0.40)
𝐸 = 1.96 (√ )
1,200
𝐸 = 0.03
Step 3. Substitute the values of 𝑥̅ and E in the confidence interval
𝑝̂𝑞̂ 𝑝̂𝑞̂
𝑝̂ − 𝑍𝛼 (√ ) < 𝑝 < 𝑝̂ + 𝑍𝛼 (√ )
2 𝑛 2 𝑛
𝑝̂ − 𝐸 < 𝑝 < 𝑝̂ + 𝐸
0.60 − 0.03 < 𝑝 < 0.60 + 0.03
0.57 < 𝑝 < 0.63
57% < 𝑝 < 63%
So, 57% = Lower Confidence Limit (LCL)
63% = Upper Confidence Limit (UCL)
Interpretation:
Thus, we can be 95% confident that the true population proportion p of all citizens who favors
the implementation of the Anti-Terror Law is between 57% to 63% based on a random sample
of 1 200 citizens.
The length of a confidence interval (LCI) is the absolute difference between the upper
confidence limit (UCL) and the lower confidence limit (LCL). In symbols,
LCI = Length of the Confidence Interval
LCI = | 𝑈𝐶𝐿 − 𝐿𝐶𝐿 | = | 𝐿𝐶𝐿 − 𝑈𝐶𝐿 | UCL = Upper Confidence Level
or LCL = Lower Confidence Level
LCI = UCL – LCL where
𝑛 = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑖𝑧𝑒
22
A. Length of a Confidence Interval for population mean 𝝁, n ≥ 𝟑𝟎
Deriving the formula for interval estimate of a population mean 𝜇 with known variance:
𝝈 𝝈
𝒙̅ − 𝒛𝜶 ( ) < 𝝁 < 𝒙̅ + 𝒛𝜶 ( )
𝟐 √𝒏 𝟐 √𝒏
To find the length of the confidence interval, subtract the lower confidence limit from the
upper confidence limit:
𝝈 𝝈
LCI = [𝒙̅ + 𝒛𝜶 ( )] - [𝒙̅ − 𝒛𝜶 ( )]
√𝒏
𝟐 √𝒏 𝟐
𝝈 𝝈
LCI = 𝒙̅ + 𝒛𝜶 ( ) − 𝒙̅ + 𝒛𝜶 ( )
𝟐 √𝒏 𝟐 √𝒏
𝝈 𝝈
LCI = 𝒙̅ + 𝒛𝜶 ( ) − 𝒙̅ + 𝒛𝜶 ( )
𝟐 √𝒏 𝟐 √𝒏
𝝈
LCI = 2𝒛𝜶 ( )
𝟐 √𝒏
𝒔 𝒔
𝒙̅ − 𝒕𝜶 ( ) < 𝝁 < 𝒙̅ + 𝒕𝜶 ( )
𝟐 √𝒏 𝟐 √𝒏
To find the length of the confidence interval, subtract the lower confidence limit from the
upper confidence limit:
𝒔 𝒔
LCI = [𝒙̅ + 𝒕𝜶 ( )] - [𝒙̅ − 𝒕𝜶 ( )]
√𝒏
𝟐 √𝒏 𝟐
𝒔 𝒔
LCI = 𝒙̅ + 𝒕𝜶 ( ) − 𝒙̅ + 𝒕𝜶 ( )
𝟐 √𝒏 𝟐 √𝒏
𝒔 𝒔
LCI = 𝒙̅ + 𝒕𝜶 ( ) − 𝒙̅ + 𝒕𝜶 ( )
𝟐 √𝒏 𝟐 √𝒏
𝒔
LCI = 2𝒕𝜶 ( )
𝟐 √ 𝒏
Example 1: Find the length of the confidence interval given the following data:
𝑠 = 6.17; 𝑛 = 12; 𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑙𝑒𝑣𝑒𝑙 = 99%
SOLUTION:
23
𝛼 = 0.01
Hence, using the t-table 𝑡𝛼 = 3.106
2
Example 2: Find the length of the confidence interval given the following data:
𝜎 = 0.5; 𝑛 = 36; 𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑙𝑒𝑣𝑒𝑙 = 95%
SOLUTION:
̂𝒒
𝒑 ̂ ̂𝒒
𝒑 ̂
̂ − 𝒁𝜶 ( √ ) < 𝒑 < 𝒑
𝒑 ̂ + 𝒁𝜶 ( √ )
𝟐 𝒏 𝟐 𝒏
To find the length of the confidence interval, subtract the lower confidence limit from the
upper confidence limit:
̂𝒒
𝒑 ̂ ̂𝒒
𝒑 ̂
̂ + 𝒁𝜶 (√ )] - [𝒑
LCI = [𝒑 ̂ − 𝒁𝜶 (√ )]
𝟐 𝒏 𝟐 𝒏
̂𝒒
𝒑 ̂ ̂𝒒
𝒑 ̂
̂ + 𝒁𝜶 ( √ ) − 𝒑
LCI = 𝒑 ̂ + 𝒁𝜶 ( √ )
𝟐 𝒏 𝟐 𝒏
̂𝒒
𝒑 ̂
LCI = 𝟐 𝒁𝜶 (√ )
𝟐 𝒏
Example 1: Find the length of the given confidence interval: 0.57 < 𝑝 < 0.63
SOLUTION:
Given: UCL = 0.63; LCL = 0.57
LCI = UCL – LCL
LCI = 0.63 − 0.57
24
LCI = 0.06 or 6%
Example 2: Find the length of the confidence interval given the following data:
𝑝̂ = 0.25; 𝑛 = 300; 𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑙𝑒𝑣𝑒𝑙 = 95%
SOLUTION:
Step 1. Find 𝛼 in (1 – 𝛼)100% confidence level then find 𝑍𝛼.
2
(1 – 𝛼)100% = 95%
(1 – 𝛼)1 = 0.95
1 - 𝛼 = 0.95
𝛼 = 0.05
𝜶
Solving for :
𝟐
𝜶 𝟎. 𝟎𝟓
= = 𝟎. 𝟎𝟐𝟓
𝟐 𝟐
Finding for the value of 𝒛𝜶 , subtract 0.025 from 0.5:
𝟐
0.5 – 0.025 = 0.4750
Hence, using the z-table 𝑧𝛼 = 𝟏. 𝟗𝟔.
2
(𝟎.𝟐𝟓)(𝟎.𝟕𝟓)
LCI = 2(1.96) (√ )
𝟑𝟎𝟎
LCI = 0.098 or 9.8%
Example: Sarah, a researcher, would like to study the average life-span of a high-end mobile
gadget produced by a certain company. The population mean life-span of the said
mobile is eight years, and the standard deviation is four years. How many sampling
units must be included in the investigation for her to be sure with 99% accuracy
that the sample mean is within 1.5 years of the population mean?
25
𝑧𝛼 • 𝜎 2
2
𝑛=( )
𝐸
(2.575)(4) 2
𝑛= [ ]
1.5
𝑛 = 47.15
𝑛 = 48
Note: the resulting value of n is rounded up to the nearest whole number.
How to Determine the Sample Size for Estimating 𝒑?
Derivation for sample size n needed in estimating a population proportion p:
To do this, modify the equation of margin of Error E by applying the properties of equality:
2
𝑝̂ 𝑞̂ 𝑛 • 𝐸2 = 𝑝̂ 𝑞̂ • (𝑧𝛼 )
𝐸 = 𝑧𝛼 √ 2
2 𝑛 2
𝑝̂𝑞̂•(𝑍𝛼 )
𝐸 𝑝̂𝑞̂ 2
=√ 𝑛= or
𝑧𝛼 𝑛 𝐸2
2 𝑍𝛼 2
2
𝐸 𝑝̂𝑞̂ 𝑛 = 𝑝̂ 𝑞̂ ( 𝐸2 ) for 𝑝̂ 𝑖𝑠 𝑘𝑛𝑜𝑤𝑛
(𝑧 ) =
𝛼 𝑛 2
2
𝐸2 𝑝̂𝑞̂ 0.25 (𝑍𝛼 )
2 = 𝑛= 2
for 𝑝̂ 𝑖𝑠 𝑢𝑛𝑘𝑛𝑜𝑤𝑛
𝑛
(𝑧𝛼 ) 𝐸2
2
Example: Andrei, a Senior High School student, would like to estimate the proportion of those
who support the bill, which lowers the criminal liability of minors from 15 years old
to 9 years old within five percentage points with a 90% confidence level. A survey
conducted a week ago showed that 39% are in favor of this bill. How many
respondents should he sample to estimate the proportion of those in favor of the
bill?
Slovin’s Formula
There are situations when we do not know the behavior of the population. One formula
usually used by researchers when there was limited information about the population is
Slovin’s formula.
Slovin’s formula is a random sampling formula used to estimate sample size, making
use of the total population and margin of error.
In symbols,
𝑁
𝑛= , where n = sample size; N = population size
1+𝑁𝑒 2
e = margin of error = 0.05 or 5% which is constant
Example: A researcher wants to study social media's effects on Grade 11 students in Ilocos
Norte College of Arts and Trades. If there are 1 000 Grade 11 students in the school, how
many students should there be in the sample?
26
𝑁
𝑛=
1+𝑁𝑒 2
1,000
𝑛=
1+(1,000)(0.05)2
𝑛 = 285.7
𝑛 = 286
Therefore, the minimum sample size should be 286.
What’s More
Direction: Solve each of the following with the given condition:
Problem Answer
VI. Solve for the sample size for each given the
following information: a) ____________________________
a) E = 0.5; 𝜎 = 2.5; 90% 𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑙𝑒𝑣𝑒𝑙 b) ___________________________
b) E = 0.3; 𝜎 = 4.1; 95% 𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑙𝑒𝑣𝑒𝑙
VII. Solve for the sample size for each given the
following information: a) ____________________________
a) E = 0.08; 𝑝̂ = 0.65; 99% 𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑙𝑒𝑣𝑒𝑙 b) ____________________________
b) E = 0.05; 𝑝̂ = 𝑢𝑛𝑘𝑜𝑤𝑛; 90% 𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑙𝑒𝑣𝑒𝑙 c) ____________________________
c) E = 0.05; 𝑁 = 4830
27
What I Have Learned
A. The length of a confidence interval (LCI) is the absolute difference between the
upper confidence limit (UCL) and the lower confidence limit (LCL).
In symbols,
LCI = | 𝑈𝐶𝐿 − 𝐿𝐶𝐿 | = | 𝐿𝐶𝐿 − 𝑈𝐶𝐿 | LCI = Length of the Confidence Interval
or UCL = Upper Confidence Level
LCI = UCL – LCL where LCL = Lower Confidence Level
What I Can Do
28
SITUATION: One of the few negative effects of quitting smoking is weight gain. The data
below shows the weight gain (in pounds) of 18 respondents 12 months after they quit. If
the mean gain follows a normal distribution with a standard deviation of 9 pounds, estimate
the mean weight gain of the 18 respondents at a 95% confidence level.
14 13 20 22 12 12 17 10 12 9 11 17 15 24 29 13 13 19
GOAL: Your goal is to assist the Chairperson of the Smoking Cessation Program of the
Department of Health by providing a statistical report that involves estimation.
ROLE: You are a statistician who will estimate the mean weight gain of 18 smokers who
gained weight after they quit smoking.
PRODUCT: You are expected to prepare a statistical report which includes the result of
the estimation. Moreover, the interpretation of the analysis should be provided.
AUDIENCE: This proposal shall be presented and evaluated by the Chairperson of the
Smoking Cessation Program of the Department of Health.
STANDARDS: Your output will be evaluated based on the following: accuracy of the
estimation, comprehensiveness, and acceptability of interpretation of the results, and
content of the written presentation.
Accuracy of The report The report shows The report The report No attempt
Estimation shows accurate accurate and shows minimal needs made
and detailed convincing errors that improvement as
results of the results of the affect the it contains
estimation estimation accuracy of the numerous
procedure. procedure. estimation. significant errors
which affect the
accuracy of the
estimation.
Comprehensiven The The interpretation The The No attempt
ess and interpretation is is correct, and interpretation interpretation is made
acceptability of complete, claims are contains some completely
the interpretation correct, and mathematically errors that are incorrect.
of the results supported with acceptable not
details that all mathematically
claims are acceptable
mathematically
acceptable.
Content of the Content and Content and Content and Content and No attempt
Written illustrations are illustrations are illustrations are illustrations are made
Presentation clear, clear and somewhat not clear.
impressive, and impressive clear.
convincing.
29
Assessment
Direction: Read each item carefully. Select the correct answer among the choices given and
write the letter of your choice. Write your answer on a separate sheet of paper.
1. Which type of distribution whose graph is bell-shaped, symmetric about t = 0, and its
variance is greater than 1?
A. binomial t-distribution B. z-distribution
C. geometric t-distribution D. student’s t-distribution
2. What happens to the shape of the t-distribution as n becomes larger?
A. The shape skewed to the left
B. The shape skewed to the right
C. The shape becomes platykurtic
D. The t-distribution gets closer to the standard normal distribution
3. What statistical analysis is appropriate to use with small samples taken from an
approximately normal population?
A. binomial t-distribution B. t-distribution
C. geometric t-distribution D. z-distribution
4. When n ≥ 30 and the population standard deviation is known, what is the appropriate
distribution?
A. binomial t-distribution B. t-distribution
C. geometric t-distribution D. z-distribution
5. What is the critical value of t if the sample size n is 8, two-tailed at 99% confidence level?
A. 3.169 B. 3.250 C. 3.355 D. 3.500
30
31
Publishing House, Inc.
Mercado, J.P. 2016. Next Century Mathematics: Statistics and Probability. Phoenix
Publishing, Inc.:Educating minds, Shaping the future.
Gabuyo, Y.A. and M.C Cardenas. 2016. Statistics and Probability. Philippines: The Inteligente
Belecina, R.E. and E. Mateo. 2016. Statistics and Probability. Philippines: Rex Bookstore, Inc.
Publishing, Inc.
Barcelona, A.B. 2020. Statistics and Probability for Senior High School: Golden Cronica
References
Pretest
1. D 2. C 3. B 4. B 5. D
6. C 7. C 8. D 9. B 10. C
11. D 12. D What’s More
Activity 4: Solve Me!
Lesson 1 1. a) -t = 1.341
b) t = 1.341
What’s In c) t = ±2.602
Activity 1: Activate My Prior Knowledge! d) t = 1.753
1. vertical, horizontal
2. T 2. a) 97.5%
3. T b) 𝑡1 represents the 97.5th percentile 𝑡97.5
4. T c) ) 𝑡1 = 2.093
5. T
6. Boolean, Gaussian
7. T Lesson 2.
8. 𝜇 = 1, 𝜎 = 0, 𝜇 = 0, 𝜎 = 1 What’s In Activity 1
9. T 1. D 2. B 3. A 4. D 5. D
10. T
What’s More Activity 2
What’s New I. a) 5.25 b) 0.502
Activity 2: Compare Me! II. a) 19.6 b) 28.20
III. a) 0.42 b) 0.20
IV.a) 4.97 b) 0.4.79
V. a) 0.08 b) 0.10
VI.a) 68 b) 718
VII. a) 36 b) 271 c) 370
Assessment: Posttest
Activity 3: Use Me!
1. D 2. D. 3. B 4. D 5. D
1. 1.476 or -1.476 6. C 7. A 8. A 9. A 10. D
2. ±2.447
3. . ±1.363
4. 2.500 or -2.500
. ±2.575
Answer Key
For inquiries or feedback, please write or call: