Weeks 5 10
Weeks 5 10
Learning Competency
Solves problems involving test of hypothesis on the population mean.
(Quarter 4, Week 5): LC Code: M11/12SP-IIIe-1
Formulates the appropriate null and alternative hypotheses on a population proportion.
(Quarter 4, Week 5): LC Code: M11/12SP-IIIe-2
Identifies the appropriate form of the test-statistic when the Central Limit Theorem is to be used.
(Quarter 4, Week 5): LC Code: M11/12SP-IIIe-3
In the previous lesson, we have learned the sampling distribution of the sample mean using
the Central Limit Theorem. Remember that the Central Limit Theorem allows us to use the standard
normal distribution of sample means provided that n ≥ 30 or the sample size is large. In testing the
hypothesis when the population proportion is given, we need to consider another test statistic for us to
formulate the decision, whether to reject or not to reject the null hypothesis as basis for the formulation of
conclusion. For the last lesson this week, we will identify the appropriate form of the test-statistic in
population proportion when the Central Limit Theorem is to be used.
3
Example 1
It has been claimed that 30% of students in a certain school who have difficulty of waking up early due to
playing online games. A researcher would like to verify the claim by getting 700 sample students for
survey. Out of 700 students, 240 students said that they had difficulty of waking up early due to playing
online games.
a. What test statistic should be applied?
b. What are the null and alternative hypotheses?
c. What are the corresponding values of the variables in the z-test formula?
d. What is the computed test value?
Solution:
a. z-test for one-sample proportion
b. Ho: The proportion of students in a certain school who have difficulty of waking up early due to
playing online games is 30%. (po = 0.30)
Ha: The proportion of students in a certain school who have difficulty of waking up early due to
playing online games is not equal to 30%. (po ≠ 0.30)
c. po = 0.30 (convert 30% to decimal) n = 700
x = 240
p̂ = 240 ÷ 700 = 0.34
d. z = 2.31
Example 2
Don Fast Food Restaurant believes that more than 90% of their customers are satisfied with the quality of
service that they offer. 150 customers were surveyed and it was found out that only 130 customers were
satisfied.
a. What test statistic should be applied?
b. What are the null and alternative hypotheses?
c. What are the corresponding values of the variables in the z-test formula?
d. What is the computed test value?
Solution:
a. z-test for one-sample proportion
b. Ho: The proportion of customers who are satisfied with the quality of service that Don Fast Food
Restaurant offers is 90%. (po = 0.90)
Ha: The proportion of customers who are satisfied with the quality of service that Don Fast Food
Restaurant offers is more than 90%. (po > 0.90)
c. po = 0.90 (convert 90% to decimal) n = 150
x = 130
p̂ = 130 ÷ 150 = 0.87
d.
z = -1.22
Activity
1. A certain school claims that less than 20% of their students prefer online learning in the new normal
education. After conducting a survey on 500 randomly chosen students, they found out that 87 of them
preferred online learning.
2. It has been claimed that 30% of students in a certain school dislike Mathematics. A researcher
conducted a survey and it showed that 153 out of 600 students dislike Mathematics.
4
Reflection:_____________________________________________________________________________
_____________________________________________________________________________________
hypothesis.
Illustrative Example:
Example 1
𝑝̂ = 𝑋
𝒑̂ = 0.81
𝑛
z = -2.24
5
The alternative hypothesis is directional. Hence, one-tailed test
shall be used.
Using the Areas Under the Normal Curve Table, the critical value is
-2.326 at α = 0.01 level. There is a negative sign in the value due to the direction of the
alternative hypothesis.
d. DECISION: Since the computed test statistic (zcom) z = -2.24 does not fall
in the rejection region, fail to reject the null hypothesis (Ho).
P-VALUE APPROACH
What is P-value?
In critical value approach, a test statistic is compared with a critical value.
However, in p-value approach (short for probability value), probabilities or areas
are compared. P-value measures the consistency of the sample statistics with the
null hypothesis. High P-values mean that sample results are consistent with a true
null hypothesis while low P-values are not consistent. If the P value is small
enough, we can conclude that the sample is so incompatible with the null
hypothesis. Therefore, we can reject the null hypothesis for the entire population.
Solution:
Using the formula:
P=P(Z≥5.49)=0.0000⋯≈0
6
CONCLUSION: Because the p-value is smaller than the significance level α=0.05, we can reject
the null hypothesis. Again, we would say that there is sufficient/enough evidence to
conclude that boys are more common than girls in the entire population at α=0.05 level.
As should always be the case, the two approaches (critical value approach and p-value
approach) lead to the same conclusion.
NOTE:
Conclusions are answers in sentence form which include: 1) whether there is enough
evidence or not (based on the decision); 2) the level of significance; and 3) whether the original
claim is supported or rejected. Conclusions are based on the original claim which may be the
null or alternative hypothesis. The decisions are always based on the null hypothesis.
NOTE:
If the null hypothesis isn’t rejected, this doesn’t necessarily mean that it’s true. It simply
means that there is not enough evidence to justify rejecting it.
The hypothesis-testing procedure leads to the acceptance of H0 when H0 is true and the
rejection of H0 when H0 is false. Unfortunately, since hypothesis tests are based on sample
information, the possibility of errors must be considered. A Type I error corresponds to
rejecting H0 when H0 is actually true, while a Type II error corresponds to accepting H0 when H0 is
false.
Activity : Fill It Up!
Directions: Compute the test statistic. Fill in the blank with the word REJECT if the decision is
to reject the null hypothesis. Otherwise, write FAIL TO REJECT. Then, draw your own
conclusions by completing the statement.
1. In a public senior high school, a survey conducted last year by a Health Officer showed that 12% of
the students drink alcohol. This year, a new survey was conducted randomly on 500 students from
the same school. It was found that 97 of them drink alcohol. Test if the claim was higher at α =0.01
level.
a. Ho :p = 0.12 or p > 0.12 Ha : p < 0.12
b. Level of Significance: α = 0.01
c. Computed Test Statistic: zcom =
d. Critical Value: 2.326
e. DECISION: Since the computed test statistic zcom = falls in the
rejection region, the null hypothesis (Ho).
7
.
Reflection:_____________________________________________________________________________
_____________________________________________________________________________________
Just like in puzzles, you need to think of different ways on how you will be
able to solve it. Same with solving problems involving test of hypotheses on
population proportions, you need to follow important steps in order to arrive at the
correct answer.
Here are the five (5) steps in solving problems for a test of hypothesis on
the population proportion.
STEP 1. HYPOTHESES: State the null and alternative hypotheses (either in
sentence/statement form or in symbols).
8
p0 = population proportion q0 = 1 – p
Illustrative Example
Every year, the assigned teachers determine the Body Mass Index
(BMI) of students. In a certain public junior high school, a study finds that
10% of Grade 7 students observed are underweight. A sample of 780
Grade 7 students were randomly chosen and it was found out that 125 of
them are underweight. Is this claim different for their grade level age? Use
0.05 level of significance.
SOLUTION:
STEP 1: State the null and alternative hypotheses.
Ho ; p = 0.10 Ha : p ≠ 0.10
STEP 2: Choose a level of significance. α = 0.05
STEP 3: Compute the test statistic.
Given: x= 125 p0 = 0.10 n = 780
𝑝̂ =
𝑋 𝑛
zc = 5.61
NOTE: Using the Areas Under the Normal Curve Table, critical
10
World Health Organization (WHO) number of COVID-19
summarized the number of COVID- 19 recoveries around the recoveries
world.
Data that involve two variables are called bivariate data. The
statistical procedure used to determine and describe the relationship
between two variables is called correlation analysis.
Examples Variables involved
In Tayabas City public market, a consumer observed that the supply and price of vegetable
fewer is the supply of vegetables, the
higher the price gets.
The Quezon provincial government gave emphasis that number of household members
limiting the number of household members going outside to and rate of COVID-19
purchase essential goods will help decrease the rate of infection
COVID -19 infection in the province.
Here is the other concept!
Scatter plot, scatter graph, scatter diagram, or scatter gram is a
graphical representation that shows the relationship or the correlation of two
variables of bivariate data.
Scatter plot shows how points collected from a set of bivariate data are
scattered on a Cartesian plane. It gives a good visual picture of how two variables
are related or associated with one another in terms of form, trend, and variation of
correlation. The form of points in the scatter plot determines the shape of the
correlation of the variables. The trend determines the direction of the points, either
the variables have positive, negative, or no correlation. The variation or strength of
correlation is based on the closeness of the points on a trend line and it determines
whether the variables have no, weak, moderate, strong, or perfect correlation.
In constructing a scatter plot, you should know how to plot points in a
Cartesian plane. The independent variable will assume the values of x or abscissa
while the dependent variable will assume the values of y or ordinate.
Example:
The given numbers are the age of a person in years and his/her
corresponding weight.
Age of a 11 12 13 14 15 16 17 18 19 20
person (x)
Weight (y) 40 42 38 35 45 51 48 48 50 47
11
Here is the other concept!
The correlation of the variables can be described in terms of form (shape),
trend (direction), and variation (strength) of scatter plot. The form of correlation
can be determined by the shape of points on a scatter plot categorized as linear or
curvilinear. The form of correlation is linear if the points on scatter plot follow a
trend of straight line. The form of scatter plot is non-linear if the points follow a
trend of curve line. Sample scatter plots showing curvilinear form of correlation
are given below.
The correlation of variables can also be described in terms of its trend or direction. The
trend of correlation can be positive, negative, or zero/negligible depending on the direction of the
points. The trend of correlation is summarized in the table that follows
The closeness of the points around the trend line determines the variation or
strength of the correlation between the variables involved. The closer the points to
the trend line, the stronger the correlation of the variables is. The strength of
correlation between two variables can be perfect, strong, weak, or no/negligible
correlation. To summarize the strength of correlation, refer to the table below.
12
Reflection: Give two(2) best experiences in your life wherein you think you made the right
decisions. Share some things, ideas, or techniques that you considered before finally deciding.
LESSON: CORRELATION
The Pearson’s sample correlation coefficient (also known as Pearson r ),
denoted by r, is a test statistic that measures the strength of the linear relationship
between two variables. To find r, the following formula is used:
13
Here is the other concept!
Correlation is used to determine the existence, strength, and direction of
relationship between two variables. Correlation coefficient r is a number between -
1and 1 that describes both the strength and the direction of correlation. In
symbol, we write -1 ≤ r ≤ 1.
In the first activity in What’s In, you solved for the value of r and identified its
trend. In What’s New activity, you identified the trends, estimated the values of r,
and based on the values, you chose the correct descriptions of the strength of the
correlation. So now, we will interpret r value by looking at the scale that gives both
strength and direction of correlation.
Using the correlation scale, we can determine the strength of the correlation
coefficient r. For example, you have r = 0.63 which means that there is a “strong
positive correlation” between the two variables. To interpret, we can simply state it
this way: “As x values increase, y values also increase and vice versa.”
In interpreting the linear relationship of two bivariate data, refer to the value
of r and the scale presented above. We can state our interpretation in different
14
ways. In order for you to solve problems involving correlation analysis, you must
know how to calculate the value of r and interpret this value using the scale. Since
computing for r value is a necessary skill, you may go back to the previous lesson if
you feel that you haven’t mastered it yet. Otherwise, proceed to the following
examples of solving for r.
Scenario: Filipino employees are known for being persistent and hardworking. That is
why they truly value every single cent of their salary. Here are some situations
showing the relationship between the salary and spending of a Filipino employee.
Situation 1: There is a survey wherein the correlation coefficient r between salary and
spending of employee was found to be 0.97.
Interpretation: There is a “strong positive correlation” between salary and spending
of employees.
Situation 2: In another survey, the correlation coefficient r between salary and spending of
employee was found to be 0.38.
Interpretation: There is a “weak positive correlation” between salary and spending
of employees.
Reflection:_____________________________________________________________________________
_____________________________________________________________________________________
15
STATISTICS AND PROBABILITY
Name: ______________________ Grade: ___________
Section: ____________________ Date : ___________
Learning Competency
Identifies the independent and dependent variables.
(Quarter 4, Week 9): LC Code: M11/12SP-IVi-1
Calculates the slope and y-intercept of the regression line.
(Quarter 4, Week 8): LC Code: M11/12SP-IVi-3
Interprets the calculated slope and y-intercept of the regression line..
(Quarter 4, Week 9): LC Code: M11/12SP-IVi-4
Predicts the value of the dependent variable given the value of the independent variable.
(Quarter 4, Week 10): LC Code: M11/12SP-IVj-1
Solves problems involving regression analysis.
(Quarter 4, Week 10): LC Code: M11/12SP-IVj-2
In the previous activity, you became familiar with bivariate data. Bivariate
data always involve two variables. One of these variables is the dependent
variable and the other one is the independent variable.
16
What is the difference between the two variables?
Dependent variable depends on other variables or factors. It is something
that is influenced and affected. It is also associated with the word effect or
outcome.
Independent variable affects the dependent variable. It is something you
have control over, one which you can choose and manipulate. However, in some
cases, you may not be able to manipulate the independent variable. It is commonly
known as the cause or the reason behind changes.
Example 1: The researcher wants to determine the effects of use of social
media in the academic performance of students in Mathematics.
The bivariate data in the study are use of social media and academic
performance. The academic performance depends on the use of social media, or
we can say that academic performance is affected by the use of social media.
Therefore, independent variable here is the use of social media and the
dependent variable is the academic performance.
Example 2: Research Title: Math App (Math Tricks) for Improving Fundamental
Basic Skills and Retention of Grade 11 Students of Bagabag National High School
Increasing or improving your fundamental basic skills and retention
depends on how you frequently practice or use the Math app called Math Tricks.
Your learnings in calculating the slope and y-intercept of a regression line will help you to
get the regression equation in the form y′ = bx + a, where b0 is the slope of the regression line, b1is
the y-intercept of the regression line, x is the value of the independent variable and y ′is the predicted
value.
For example, if the computed value for the slope and y-intercept of a regression line is
24.31 and 3.25 respectively then, the regression equation will be y′ = 24.31x + 3.25.
Regression Analysis- the process of predicting the value of a variable in terms of the others
variable.
Regression equation- it is the algebraic expression of the regression line.
17
We can predict the value of one variable in terms of the other variable as long as the correlation of
two variables are statistically significant.
Example: The data below shows the number of absences and number of missed quizzes of 6 students. If there
is a significant relationship between the two variable, predict the number of missed quizzes by a student
who was absent for 7 days.
Step 1.
Identify the dependent and independent variables.
- In the given data, the independent variable is the number of absences while the dependent
variable is the number of missed quizzes.
Step 2.
Compute the correlation coefficient r using the formula:
6(42) − (13)(16)
𝑟=
√[(6)(35) − (13)2][(6)(48) − (16)2]
𝑟 = 0.9183
-The computed r is 0.9183 indicating a very high positive correlation.
Step 3. Compute the values of b and b in the regression equation y′ = bx + a using the following
0 1
formulas.
6(42)−(13) ( 16 )
= 6 (35 )−(13) 2 = 1.0732
Step 4.
Form the regression equation.
- Substitute the computed values of slope and y-intercept in the regression equation.
y′ = bx + a
y = 1.0732x + 0.3414
′
18
Step Predict
5. the number of missed quizzes by a student who was absent for 7 days.
- Find the value of y when x is 7 in the regression equation.
y′ = 1.0732x + 0.3414
y′ = 1.0732(7)+ 0.3414
y′ = 7.8538
Therefore, the predicted number of missed quizzes of a student who was absent for 7
days is approximately 8 quizzes. Remember that this is just a predicted value based on the
given data.
Reflection:_____________________________________________________________________________
_____________________________________________________________________________________
19