Statprb Quarter 4 Module 2 Final 1
OUTCOME-BASED EDUCATION LEARNING MODULE
MODULE IN STATISTICS AND PROBABILITY
FOURTH QUARTER, WEEK 2
Development Team
Writers: Michael G. Calipjo; Jerick S. Paltong; Ma. Teresa R. Pascual; Vanessa A. Miguel
Editors/Reviewers: Gerson Jeremy C. Antonio; Myla Fei Martinez; Gregorio P. Agatep, Jr.
Illustrator: Vanessa A. Miguel
Lay-out Artist: Vanessa A. Miguel
Management Team: Vilma D. Eda; Arnel S. Bandiola; Lourdes B. Arucan; Juanito V. Labao; Marlyn S. Ventura
What I Need to Know
In this module, you will learn the different types of parametric tests and the assumptions
that must be observed from the data gathered for each type. You will also identify the
appropriate test statistic that must be utilized in a particular situation. In addition, you will
formulate hypotheses that will be subjected to testing and learn the succeeding steps in
hypothesis testing.
OBJECTIVES:
At the end of the module, the student must be able to:
1. formulate the appropriate null and alternative hypotheses on a population mean;
2. identify the appropriate form of statistic to use; and
3. apply the procedures for a test of hypotheses concerning a single population
mean.
What I Know
Directions: Read the questions carefully and choose the correct answer.
The mean marrying age in the Municipality of Battuc is 25 years with a standard deviation
of 6 years. A group of researchers would like to know if several teenage marriages occur in
the municipality to help them plan for conducting seminars for the youth. A random sample of
50 marriage records was studied, and it was found that the mean marrying age in the
municipality is 20 years. Test the hypothesis that the mean marrying age is lower than 25 years
old using a 0.05 level of significance.
3. How many samples were randomly taken for the new study?
A. 6 B. 20
C. 25 D. 50
6. This question is related to your answer in question 4. What is its critical value?
A. −1.96 B. −1.645
C. +1.645 D. +1.96
7. What is the value of the test statistic?
A. −5.89 B. −4.51
C. 5.04 D. 7.15
8. In conducting a significance test on a single mean, what is the appropriate test statistic
to be utilized when the sample size is less than 30?
A. t-test B. z-test
C. two-tailed test D. both t- and z-test
10. Which of the following best describes the Central Limit Theorem?
A. It is a basis for sampling in Statistics. It emphasizes that the sampling distribution
of the mean of sample data will be normal or nearly normal if the sample size is
large or adequate.
B. It states that if you have a population mean and a standard deviation and you take
sufficiently large random samples from the population, the distribution of the
sample means will be approximately normally distributed.
C. It states that having a sufficiently large sample size from a population with finite
variance, the mean of all the samples from the same population will be
approximately equal to the mean of the population. Moreover, the mean of the
sample data will be closer to the mean of the overall population.
D. All of the above
LESSON 1
HYPOTHESIS TESTING ABOUT POPULATION MEAN 𝝁 USING THE CRITICAL VALUE APPROACH
What’s In
Activity 1. The Missing Piece
Directions: In each puzzle piece, write word(s) or phrase(s) related to hypothesis testing.
What’s New
In hypothesis testing, it is customary to make and follow a decision model. On our
end, we will use the following steps in hypothesis testing as our decision model.
In this lesson, we will focus on number 2: determining the test statistic to use. We will
answer the essential question, “How can we identify the appropriate form of the test statistic
to be used?”
What Is It
The two most common tests of hypothesis in testing significant differences are the z-test
and the t-test. Both are parametric tests, which rely on certain assumptions: the samples
are chosen randomly from normal populations (with known variance in the case of the z-test),
and the data used are measured on either an interval or a ratio scale.
z-test
The z-test can be used when the following assumptions are observed in the
data gathered:
The z-test includes two cases. The first is a one-sample test that compares the population
mean 𝜇 and the sample mean 𝑥̅. The second is a two-sample test that examines the
differences between two groups of samples (of sizes 𝑛₁ and 𝑛₂). The two-sample test is used
to determine if two population means are different when the variances are known.
t-test
The t-test can be used when the following assumptions are observed in the
data:
The t-test is ideal for smaller samples. Just like the z-test, the t-test also has two cases:
one sample and two samples.
Central Limit Theorem
The Central Limit Theorem (CLT) states that, when sufficiently large random samples are taken
from a population with finite variance, the mean of all the sample means from the same
population will be approximately equal to the mean of the population. Moreover, with a sample
size of at least 30, the sample mean tends to be closer to the mean of the population.
Furthermore, the sample means will then follow an approximately normal distribution.
Hence, the CLT is a basis for sampling in Statistics. It emphasizes that the sampling
distribution of the mean of sample data will be normal or nearly normal if the sample size is
large or adequate. Having large enough samples can be a basis for more conclusive results
in a study.
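The behavior described by the CLT can be checked with a small simulation. The sketch below is only an illustration under stated assumptions: the population (an exponential distribution with mean 1), the number of repeated samples (2000), the seed, and the function name simulate_sample_means are all hypothetical choices, not part of the module.

```python
import random
import statistics

def simulate_sample_means(sample_size, num_samples, seed=0):
    """Draw repeated samples from a skewed population and return their means."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    # Assumed population: exponential with mean 1 and standard deviation 1.
    return [
        statistics.fmean(rng.expovariate(1.0) for _ in range(sample_size))
        for _ in range(num_samples)
    ]

sample_means = simulate_sample_means(sample_size=30, num_samples=2000)
mean_of_means = statistics.fmean(sample_means)      # should be close to 1
spread_of_means = statistics.stdev(sample_means)    # should be close to 1 / sqrt(30)
print(round(mean_of_means, 2), round(spread_of_means, 2))
```

Even though the assumed population is strongly skewed, the mean of the sample means lands near the population mean, and their spread is near the population standard deviation divided by √𝑛.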
Let us consider the following examples. For each item, we will answer the question,
“What is the appropriate form of test statistic to be used?”.
Example 1.
Previous research studies show that the average life span of Filipinos has been 71
years with a standard deviation of 5 years. Suppose new research is conducted about the
average life span of Filipinos. The researcher has computed an average life span of 60 years
from a random sample of 55 individuals, based on their recorded age at death. What is
the appropriate test of hypothesis? Is there a significant difference between the population
mean life span and the sample mean of the new study? The
researcher wishes to use a 95% confidence level.
Answer: The test that is appropriate to be used is the z-test. The sample size of 55 is greater
than 30, and the population standard deviation is assumed to be known. The problem
involves z-test using one sample which is comparing the population mean 𝜇 and the
sample mean 𝑥̅ .
Example 2.
A researcher from ST University would like to know if the mean IQ of incoming first-
year students who will take accountancy as a program is still equal to 108 based on previous
records of the admission office. Suppose the researcher took 25 sample records of first-year
students currently enrolled, and the mean IQ is known to be 106 with a standard deviation
of 20. Is it sufficient to say that the mean IQ of first-year students taking up accountancy is
now lower than 108?
Answer: The t-test is appropriate because the sample size is 25, which is less than 30, and
only the sample standard deviation is known. The problem illustrates the first case of the
t-test, which compares the population mean 𝜇 and the sample mean 𝑥̅ of one
sample.
Example 3.
An investor is interested in determining the overall profit from a stock index composed
of 2 000 stocks. He can take a random sample of at least 40 stocks to be analyzed. The
mean profit from this random sample can be used to estimate that of the whole stock index,
and the sample means are assumed to be normally distributed.
Answer: The CLT is to be used. A random sample of at least 40 stocks is considered
sufficient for the CLT to hold. A sufficiently large sample can describe the
characteristics of a population reasonably well. In addition, it was mentioned in the problem
that the mean profit from the random sample can estimate that of the whole stock index
and that the sample means are assumed to be normally distributed.
Example 4.
ImmUNO capsule was advertised from previous studies as a good source of zinc and
Vitamin C. The average amount of zinc and Vitamin C in the capsule is known to be 100mg.
However, a group of consumers claimed that the average amount of zinc and Vitamin C in the
capsule is less than 100mg. A researcher from DOH has taken 20 random samples of the
capsule and found out that the average amount of zinc and Vitamin C in the capsule is 98mg
with a standard deviation of 2mg. Is there enough evidence to believe the claim? Test the
hypothesis that the average amount of zinc and Vitamin C in the capsule is less than 100mg
at a 0.05 level of significance.
Answer: The appropriate test for the problem is the t-test because the sample size of 20 is
less than 30. Moreover, the sample standard deviation was given. Thus, the
conditions for the t-test are satisfied.
Example 5.
Answer: The sample size of 40 is large enough to utilize the CLT. Since the sample is
sufficiently large, the sample standard deviation can be used as a substitute for the
unknown population standard deviation. Hence, the study will give a conclusive result.
Example 6.
The average IQ of first-year students admitted to a certain college was 100 for the
previous years, with a standard deviation of 8. Fifty freshmen took the IQ test, and the
results showed that the average IQ is 95. Test the hypothesis that the average IQ of the
freshmen is no longer 100. Use a 0.05 level of significance.
Answer: The appropriate test is the z-test because the sample size of 50 is more than 30.
Moreover, the population standard deviation is known.
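The reasoning used in Examples 1 to 6 can be summarized as a small decision helper. This is a sketch of the pattern seen in the examples only, not an official rule from the module. The function name choose_test and its parameters are hypothetical, and the case of a small sample with a known population standard deviation is deliberately left unanswered because none of the examples covers it.

```python
def choose_test(n, population_sd_known):
    """Pick the test used in the module's examples for a single population mean.

    n is the sample size; population_sd_known says whether the population
    standard deviation is given in the problem.
    """
    if n >= 30:
        # Large sample: z-test; if the population SD is unknown, the sample SD
        # is used as a substitute (as in Example 5).
        return "z-test"
    if not population_sd_known:
        # Small sample with only the sample SD available (Examples 2 and 4).
        return "t-test"
    # Assumption: this situation does not appear in the examples above.
    return "not covered in this module"

print(choose_test(55, True))   # Example 1
print(choose_test(25, False))  # Example 2
print(choose_test(40, False))  # Example 5
```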
CONDUCTING HYPOTHESIS TESTING ON A POPULATION MEAN 𝝁 USING THE
CRITICAL VALUE APPROACH
Before we proceed to hypothesis testing, let us have a short lesson on small-sample
and large-sample tests on population means. Here are the things that we need to
remember.
z-test (population standard deviation 𝜎 known):
$$z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}}$$
t-test (small sample, population standard deviation unknown):
$$t = \frac{\bar{x} - \mu}{s / \sqrt{n}}$$
z-test (large sample, sample standard deviation 𝑠 used in place of the unknown 𝜎):
$$z = \frac{\bar{x} - \mu}{s / \sqrt{n}}$$
where
𝑥̅ is the sample mean,
𝜇 is the population mean,
𝜎 is the population standard deviation,
𝑠 is the sample standard deviation, and
𝑛 is the sample size.
The value of 𝑥̅ is usually the result of new studies or research. Meanwhile, 𝜇 refers
to the population mean, which is a result of studies done previously with either a known or
an unknown variance. On the other hand, 𝑛 refers to the sample size or the number of
respondents in the study.
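All three formulas share the same shape: the difference between the sample mean and the population mean, divided by the standard deviation over √𝑛. A minimal sketch in Python follows; the function name compute_statistic is a hypothetical helper, and the numbers in the usage line are taken from Example 6 of the previous section (𝑥̅ = 95, 𝜇 = 100, 𝜎 = 8, 𝑛 = 50).

```python
import math

def compute_statistic(sample_mean, population_mean, std_dev, n):
    """Return (sample_mean - population_mean) / (std_dev / sqrt(n)).

    Pass the population SD for a z-test with known sigma, or the sample SD
    for a t-test or a large-sample z-test.
    """
    standard_error = std_dev / math.sqrt(n)
    return (sample_mean - population_mean) / standard_error

z = compute_statistic(sample_mean=95, population_mean=100, std_dev=8, n=50)
print(round(z, 2))  # -4.42
```

Whether the result is read as a z-value or a t-value depends only on which standard deviation is supplied and on the sample size, as described above.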
• The normal curve and table of values (see Appendix A and B on pages 22-23)
are the bases in deciding whether to reject or fail to reject the null hypothesis.
If the alternative hypothesis uses “less than” (<), the test is a negative one-tailed test on
the normal curve, where the critical value is negative (−).
* The −𝑧𝛼 is the critical value taken from the z-table used in z-test.
* The −𝑡𝛼 is the critical value taken from the t-table used in t-test.
* The 𝑧𝛼 is the critical value taken from the z-table used in z-test.
* The 𝑡𝛼 is the critical value taken from the t-table used in t-test.
If the obtained computed value is within the nonrejection region (NRR), the decision is
to fail to reject the null hypothesis. If the computed value falls in the rejection region (RR), then
the decision is to reject the null hypothesis.
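The decision rule above can be sketched as a short function. This is an illustration under stated assumptions: the function name decide and the tail labels "left", "right", and "two" are hypothetical, the numbers in the usage line are made up, and a computed value exactly equal to the critical value is treated here as falling in the nonrejection region, which is a convention the module does not specify.

```python
def decide(computed, critical, tail):
    """Apply the critical value approach.

    critical is the table value (its sign is ignored); tail is 'left',
    'right', or 'two'.
    """
    critical = abs(critical)
    if tail == "left":
        in_rejection_region = computed < -critical
    elif tail == "right":
        in_rejection_region = computed > critical
    elif tail == "two":
        in_rejection_region = abs(computed) > critical
    else:
        raise ValueError("tail must be 'left', 'right', or 'two'")
    return "reject H0" if in_rejection_region else "fail to reject H0"

# Hypothetical left-tailed test: computed value -2.10, critical value -1.645.
print(decide(-2.10, 1.645, "left"))  # reject H0
```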
Remarks: This lesson will be further discussed in your next module.
Now, we are ready to proceed and follow the steps in hypothesis testing. Let us
consider the following scenarios.
Example 1:
Cole created a webpage to advertise her products online. Based on previous records,
it was noticed that the average number of clicks on the webpage is 1000, with a variance of
100. She decided to redesign the webpage to help increase her sales. Sixty days after the
new webpage was launched, Cole noticed that the average number of clicks was 1050. Test
the hypothesis that the average number of clicks is more than 1000 at a 0.05 level of
significance. Did the redesigned webpage attract more customers to visit the page?
Step 2: Determine the test statistic to use.
We use the z-test since 𝑛 = 60 (greater than 30), and the population variance (100) is
known. This means we can determine the population standard deviation, which is equal to 10.
Step 3: Determine the level of significance, critical value, and the decision rule.
𝛼 = 0.05, one-tailed
z-critical value = 1.645 (See Appendix A on page 22)
Step 4: Compute the value of the test statistic.
$$z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}} = \frac{1050 - 1000}{10 / \sqrt{60}} = 38.73$$
Step 5: Since the computed value 38.73 is greater than the critical value of 1.645 and falls in
the rejection region, the decision is to reject the null hypothesis.
Step 6: It is sufficient to support the claim that the average number of clicks on the webpage
is more than 1000 at a 0.05 level of significance. Hence, the redesigned webpage
helped attract more clicks and views from customers.
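The computation in this example can be reproduced in a few lines of Python. The sketch below assumes the standard library class statistics.NormalDist is used to obtain the critical value instead of reading it from the table in Appendix A; the variable names are illustrative only.

```python
import math
from statistics import NormalDist

sample_mean, population_mean = 1050, 1000
population_sd = math.sqrt(100)  # a variance of 100 gives a standard deviation of 10
n = 60
alpha = 0.05

# Test statistic for a one-sample z-test.
z = (sample_mean - population_mean) / (population_sd / math.sqrt(n))

# Right-tailed critical value: the point with area alpha to its right.
z_critical = NormalDist().inv_cdf(1 - alpha)

reject_null = z > z_critical
print(round(z, 2), round(z_critical, 3), reject_null)  # 38.73 1.645 True
```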
Example 2.
Records from the Registrar Office of St. Paul College show that the mean IQ of the
first-year students taking up the accountancy program is 108 with a standard deviation of
20. A researcher took 33 sample records of the first-year students enrolled in the accountancy
program and found out that the mean IQ of the students is 102. Test the hypothesis that the
mean IQ of the first-year students enrolled in the accountancy program is no longer 108. Use
0.05 level of significance to test the hypothesis.
Step 2: Determine the test statistic to use.
We use the z-test since 𝑛 = 33 (greater than 30) and the known population standard
deviation is 20.
Step 3: Determine the level of significance, critical value, and the decision rule.
𝛼 = 0.05, two-tailed (𝛼/2 = 0.05/2 = 0.025 in each tail)
z-critical value = ±1.96 (See Appendix A on page 22)
Step 5: Since the computed value −1.72 is greater than the critical value −1.96 and falls in the
nonrejection region, then the decision is to fail to reject the null hypothesis.
Step 6: The result of the test suggests that the null hypothesis should not be rejected. There
is not enough evidence to conclude that the mean IQ of the freshmen accounting
students is no longer 108.
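As a check on the computed value, the test statistic for this example can be worked out in Python. This is a sketch, not the module's own solution: the critical value is computed from the standard normal distribution with statistics.NormalDist, placing 𝛼/2 = 0.025 in each tail, and the variable names are illustrative.

```python
import math
from statistics import NormalDist

sample_mean, population_mean = 102, 108
population_sd = 20
n = 33
alpha = 0.05

# Test statistic for a one-sample z-test.
z = (sample_mean - population_mean) / (population_sd / math.sqrt(n))

# Two-tailed test: half of alpha goes in each tail.
z_critical = NormalDist().inv_cdf(1 - alpha / 2)

reject_null = abs(z) > z_critical
print(round(z, 2), round(z_critical, 2), reject_null)  # -1.72 1.96 False
```

Because the computed value lies between the two critical values, it falls in the nonrejection region.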
Example 3.
Step 1: State the hypotheses.
Step 2: Determine the test statistic to use.
We use the t-test since 𝑛 = 20 (less than 30) and the population standard deviation is
unknown but the sample standard deviation is known to be 2.
Step 3: Determine the level of significance, degrees of freedom, critical value, and the decision
rule.
𝛼 = 0.05, one-tailed
𝑑𝑓 = 𝑛 − 1 = 20 − 1 = 19
t-critical value = −1.729 (See Appendix B on page 23)
Step 5: Since the computed value −4.47 is less than the critical value −1.729 and falls in the
rejection region, then the decision is to reject the null hypothesis.
Step 6: The result of the test suggests that the null hypothesis should be rejected. There is
sufficient evidence to conclude that the average amount of Vitamin C and zinc in the
capsule is less than 100mg.
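The computed value −4.47 can be reproduced from the data of the capsule problem (𝑥̅ = 98, 𝜇 = 100, 𝑠 = 2, 𝑛 = 20). In the sketch below, the critical value −1.729 is simply copied from the t-table value quoted in Step 3, since the Python standard library has no t-distribution; the variable names are illustrative.

```python
import math

sample_mean, population_mean = 98, 100
sample_sd = 2
n = 20

degrees_of_freedom = n - 1  # 19

# Test statistic for a one-sample t-test.
t = (sample_mean - population_mean) / (sample_sd / math.sqrt(n))

# Left-tailed critical value for alpha = 0.05 and df = 19, taken from the table.
t_critical = -1.729

reject_null = t < t_critical
print(degrees_of_freedom, round(t, 2), reject_null)  # 19 -4.47 True
```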
Example 4.
Step 2: Determine the test statistic to use.
We use the z-test since 𝑛 = 40 (greater than 30). The population standard
deviation is unknown, but we can use the sample standard deviation as an
approximate substitute for it since the sample is sufficiently large.
Step 3: Determine the level of significance, critical value, and the decision rule.
𝛼 = 0.01, one-tailed
z-critical value = 2.33 (See Appendix A on page 22)
Step 5: Since the computed value 2.53 is greater than the critical value 2.33 and falls in the
rejection region, then the decision is to reject the null hypothesis.
Step 6: The result of the test suggests that the null hypothesis should be rejected. There is
sufficient evidence to support the alternative hypothesis at a 0.01 level of
significance.
*Remarks: Steps 4, 5 and 6 will be further discussed in the next chapter
What’s More
ACTIVITY 2. Is It Appropriate?
A. Identify the appropriate test to be used and then explain your answer in no more than
three sentences.
2. It has been claimed from previous studies that the mean Mathematics grade of
students taking up the Engineering program is 92. Records from the Dean’s Office yield
a mean Math grade of 90 with a standard deviation of 8 from a random sample of 15
Engineering students. Test the hypothesis that the mean Math grade of students taking
up Engineering is no longer 92 at 0.05 level of significance.
____________________________________________________________________
__________________________________________________________________
B. Formulate the appropriate null and alternative hypotheses (written in both words and
symbols) of the items in A.
1.
In words In symbols
𝐻0 :
𝐻1 :
2.
In words In symbols
𝐻0 :
𝐻1 :
C. Using Appendix A on page 22, fill in the missing values in the table.
What I Have Learned
The two most common tests of hypothesis in testing significant differences are the z-
test and the t-test. Certain assumptions must be observed from the data gathered for each
type of test.
The z-test can be used when the following assumptions are observed in the data
gathered:
The t-test can be used when the following assumptions are observed in the data:
The Central Limit Theorem (CLT) states that, when sufficiently large random samples are taken
from a population with finite variance, the mean of all the sample means from the same
population will be approximately equal to the mean of the population and that the sample means
will then follow an approximately normal distribution. It emphasizes that having a large enough
sample can be a basis for more conclusive results in a study.
What I Can Do
1. The records of SCA Registrar show that the average final grade in Mathematics for
STEM students is 91 with a standard deviation of 10. A group of student-researchers
found out that the average final grade of 37 randomly selected STEM students in
Mathematics is no longer 91. Use 0.05 level of significance to test the hypothesis.
2. The average zone of inhibition (in mm) for mouthwash H as tested by medical technology
students has been known to be 9 mm. A random sample of size 10 of mouthwash H was tested,
and the test yielded an average zone of inhibition of 7.5 mm with a variance of 25 mm². Is
there enough reason to believe that the antibacterial property of the mouthwash has
decreased? Test the hypothesis that the average zone of inhibition of the mouthwash is
no less than 9 mm using a 0.05 level of significance.
Assessment
Directions: Read the questions carefully, then choose the best answer from the given choices.
1. What is the critical value of a two-tailed z-test with a significance level of 0.10?
A. ±1.645 B. ±1.70
C. ±1.96 D. ±2.575
4. How many samples were randomly selected for the new study?
A. 8 B. 10
C. 235 D. 250
5. With respect to the alternative hypothesis, what is the appropriate test statistic that must
be administered?
A. t-test B. z-test
C. one-tailed-test D. two-tailed test
The mean height of the toddlers attending the Sama-Summer Dance Class is
93 cm with a standard deviation of 13 cm. One dance trainer is a Math enthusiast and
wishes to conduct research on the mean height of the toddlers attending the dance class
this year. She chose a random sample of 52 toddlers and found out that the mean height
is 88 cm. Test the hypothesis that the mean height of the toddlers is no longer 93 cm using
a 0.10 level of significance.
6. What is the appropriate alternative hypothesis for the problem?
A. 𝜇 = 93 B. 𝜇 < 93
C. 𝜇 ≠ 93 D. 𝜇 > 93
Answer Key
Appendix A
Table for Commonly Used Levels of Significance and Their Critical Values for 𝒛
To find the critical value for z, you have to identify the intersection of the level of
significance 𝛼 (in the rows) and the appropriate type of test (in the columns). For example,
we are doing a two-tailed test using 0.02 level of significance.
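The critical values in such a table can also be computed directly from the standard normal distribution. The sketch below uses statistics.NormalDist from the Python standard library; the helper name z_critical and the list of significance levels in the loop are assumptions made for illustration, and may not match the rows of the table exactly.

```python
from statistics import NormalDist

def z_critical(alpha, two_tailed):
    """Return the positive z critical value for a given level of significance."""
    # A two-tailed test splits alpha equally between the two tails.
    tail_area = alpha / 2 if two_tailed else alpha
    return NormalDist().inv_cdf(1 - tail_area)

# Assumed significance levels; print one-tailed and two-tailed values side by side.
for alpha in (0.10, 0.05, 0.025, 0.01):
    one_tailed = round(z_critical(alpha, two_tailed=False), 3)
    two_tailed = round(z_critical(alpha, two_tailed=True), 3)
    print(alpha, one_tailed, two_tailed)
```

For a one-tailed test on the left, the same value is used with a negative sign; for a two-tailed test, both the positive and the negative values mark the rejection regions.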
Appendix B
Table for 𝒕-critical values
𝒅𝒇 | One-Tailed: 𝜶 = 0.10, 𝜶 = 0.05, 𝜶 = 0.025, 𝜶 = 0.01 | Two-Tailed: 𝜶 = 0.10, 𝜶 = 0.05, 𝜶 = 0.01
To find the critical value for 𝑡, you have to identify the intersection of the degrees
of freedom 𝑑𝑓 (in the rows) and the level of significance 𝛼 with the appropriate type of test
(in the columns). For example, we are doing a two-tailed test using 0.01 level of significance
with 13 degrees of freedom. The critical value for 𝑡 is 3.012.