WEEK-4-only-Statistics-and-Probability (1)
WEEK-4-only-Statistics-and-Probability (1)
X F
138 1
139 2
140 3
141 4
142 3
143 2
144 1
B. The following items show four pictures captured by the author during his
travel in the different places and constructed using GeoGebra
application which represents a common object. Analyze the group of
pictures carefully then identify the common object they
represent. Write your answer by filling in the empty boxes below each
item. A hint is also provided for you.
1. 2.
3.
4. 5.
Can you define or describe each word that you have obtained in this activity?
1.
2.
3.
4.
5.
What is it
How do you find the activity? How can you describe the graph you have made in
Activity A? Are the terms or words you obtained in Activity B will describe the graphs
in Activity A?
In the activity above, the distribution of the height of the teachers is one of the
examples of the distribution of a normal random variable because the set of
numerical values is normally distributed. Height is also described as a continuous
random variable. You have also noticed that the mean of the distribution can be
found at the center which is 141 cm and the distribution from the center to the right
is a mirror of the distribution to the left as shown in the figure and table below. See
Figure 1.
Normal Probability Distribution is a probability distribution of
continuous random variables. It shows graphical representations of random variables
obtained through measurement like the height and weight of the students, the percentile
ranks of the A&E (Accreditation and Equivalency) Test result of the Alternative Learning
System students, or any data with infinite values. It is used to describe the characteristics of
populations and help us visualize the inferences we make about the population. It also
used to determine the probabilities and percentile of the continuous random variables in
the distribution. For example, your grades in Mathematics is one of the scores in the
distribution, you can predict the location of that score in the distribution and interpret it
with regards to the mean and standard deviation.
Properties of Normal Curve
The graphical representation of the normal distribution is popularly known as
a normal curve. The normal curve is described clearly by the following
characteristics
1. The normal curve is bell-shaped.
2. The curve is symmetrical about its center. This means that, if we draw a segment
from the peak of the curve down to the horizontal axis, the segment divides the
normal curve into two equal parts or areas.
3. The mean, median, and mode coincide at the center. This also means that in a
normal distribution, or a distribution described by a normal curve, the mean,
median, and mode are equal.
4. The width of the curve is determined by the standard deviation of the
distribution.
5. The tails of the curve are plotted in both directions and flatten out indefinitely
along the horizontal axis. The tails are thus asymptotic to the baseline. A
portion of the graph that is asymptotic to a reference axis or another graph is
called an asymptote, always approaching another but never touching it.
6. The total area under a normal curve is 1. This means that the normal curve
represents the probability, or the proportion, or the percentage associated
with specific sets of measurement values.
The shape of a normal curve is based on the two given parameters, the mean
and the standard deviation of the distribution. When comparing two distributions
each described by the normal curve, the following are the three situations based on
the said parameters
a. When the means are not equal, but the standard deviations are equal. (µ1
≠ µ2 ; ơ1 =ơ2 ), the curves have a similar shape but centered at different
points, as shown in figure 4.
c. When the means are different and the standard deviations are also different
(µ1≠ µ2 ; ơ1 ≠ ơ2 ), the curves are centered at different points and vary in
shapes, as shown in figure 6.
Fig. 6: The Means are Different and the Standard Deviations
are also Different
EMPIRICAL
RULE
You can easily understand the functions of normal probability distribution in our
real-life world using the empirical rule. Because this rule is used to roughly test the
distribution’s normality, if many data of a random variable fall outside the lower and
upper limits of the three-standard deviation, this means that the distribution is not
normal.
The empirical rule is better known as 68% - 95% - 99.70% rule. This rule
states that the data in the distribution lies within one (1), two (2), and three (3) of the
standard deviation from the mean are approximately 68%, 95%, and 99.70%,
respectively. Since the area of a normal curve is equal to 1 or 100% as stated on its
characteristics, there are only a few data which is 0.30% falls outside the 3-standard
deviation from the mean. For instance, the distribution of the grades of the Senior
High School students in Statistics and Probability for the Third Quarter is shown
below in Figure 7.
Using the empirical rule, you can say that the distribution of the grades of the
Senior High School in Statistics and Probability are distributed normally because the
shape of the graph is bell-shaped and symmetric about its mean, which is 87. This
also indicates that most of the students’ grades fall close to the mean and fewer of
them are far below and/or above the mean
It can summarize the distribution in the following percentage:
68% of data lies within 1 standard deviation from the mean have a grade
of 83 to 91
95% of data lies within 2 standard deviations from the mean have a grade
of 79 to 95
99.70% of data lies within 3 standard deviations from the mean have a
grade of 83 to 9
Illustrative Example 1:
The scores of the Senior High School students in their Statistics and Probability
quarterly examination are normally distributed with a mean of 35 and a standard
deviation of 5.
Answer the following questions:
a. What percent of the scores are between 30 to 40?
b. What scores fall within 95% of the distribution?
Solution:
Draw a standard normal curve and plot the mean at the center. Then, add the
standard deviation to the mean once and mark it to the right of the mean. Add twice
the standard deviation to the mean and put it to the right of the first sum. Then, add
thrice the standard deviation to the mean and mark it to the right of the second sum.
Do the same to the left. This time, instead of adding, subtract the standard deviation
from the mean. An illustration is provided below in figure 7:
Solution: The mean of this problem is 160, it is subtracted by 7 to the left (e.g. 160
-7 = 153; 153-7 = 146; 146 – 7 = 139) and add 7 to the right. (e.g. 160 + 7 = 167; 167
+ 7 = 174 + 7 = 181).
ANSWER: 81.5% of the male students have a height between 153 cm to 174 cm.
What’s More
Activity 1.
A. Use Empirical rule to complete the following table. Write on the respective column
the range or interval of the scores based on the given parameters.
Additional Activities
Read the following problems carefully. Apply the empirical rule and illustrate your
answer through a diagram.
1. IQ scores of the ALS students in the Division of Quezon are normally
distributed with a mean of 110 and a standard deviation of 10.
a. What percent of the distribution falls within the IQ scores of 100 to 130?
b. What percent of the distribution falls within the IQ scores of 90 to 140?
2. A normal distribution of data has a mean of 78 and a standard deviation of 9.
a. What scores fall within 68% of the distribution?
b. What scores fall within 95% of the distribution?
Lesson 2 Identifying Regions under Normal
Curve Corresponds to Different
Standard Normal Values
In your previous lessons in Mathematics, you learned how to solve for the area of
certain geometric figures or shapes. This time, you are going to identify the areas or
regions of a curve without using a formula or even a density function of standard
normal random variables because the z – table or Table of Areas under the Normal
Curve is already given on the last page of this module.
What is It
In the previous module, we discussed the properties of the normal probability
distribution, and one of the things that you learned is that the total area under the curve
is equal to one. Since the standard normal distribution is defined as normally distributed
random variable having a mean of zero ( µ= 0 ) and standard deviation of one (ơ = 1),
the fifty (50) percent of the region under the curve is below the mean and the other 50%
is above the mean. Figure 1, as shown below illustrate the standard normal curve.
Step 3. Use the table of the area under the normal curve to find the
corresponding area
Step 1:
Step 2:
Step 3.
Step 4. Since the shaded region is to the right of z-value, we will subtract 0.0885
from 1. The difference is 0.9915
Step 5
What’s More
Activity 1.
Identify the regions under the normal curve of the following standard normal values
using the z table (refer to the last page of this module).
1. z = - 3.25 _
2. z = 1.25
3. z = 0.56
4. z = 3 _
5. z = 1.96 _
Additional Activities
Do this additional activity in your paper.
Find the area that corresponds to each of the following z values. Sketch and label
each curve.
1. to the right of z = 1.28
2. below z= -0.20
3. above z= 1.57
4. at least z =-127
5. at most z = 1.39
6. between z = -1.29 and z= - 0.88
Lesson 3 Converting a Normal Random
Variable to a Standard Normal
Variable and Vice-versa
What is It
Was the activity fun and interesting? How did you manage to derive the
formula? Are you successful in completing the table?
The activity allows you to manipulate the given data and the missing value
through derivation. In this case, you convert the raw score (x) into the standard score
(z) using the formula z=
𝜎
x−μ . You substitute the given values and uses properties
σ
of equality and algebraic rules to obtain the needed data
In solving real-life problems involving the normal curve, it is particularly
important to convert a random variable x to a standard normal variable or z-score. This
procedure is known as “standardizing” or “standardization” of a random variable,
where a standardized value is called a z-score. A z-score is a measure of the number of
standard deviations (σ) a particular data value is away from the mean (μ).
Suppose your score on a test in Probability and Statistics is 39 and the scores
are normally distributed with a mean of 33 and standard deviation of 3, then your
score is exactly 2 standard deviations above the mean. If you scored 30, then it is
exactly 1 standard deviation below the mean. All values that are above the mean
have positive z-scores and all values that are below the mean have negative z-scores.
If you obtained a z-score of -3, this means that your score is 3 standard deviations
below the mean, that is, 33 – 3(3) = 33 – 9 = 24.
Given any value x from a normal distribution with mean μ and standard
deviation σ, to convert x to a z-score (standard normal score), you need to;
(a) Subtract the mean μ from x.
(b) Divide this quantity, x – μ , by the standard deviation σ.
The formula used in converting a random variable x to a standard normal
variable z is:
x−μ
z= σ
Therefore; 𝑥 = 𝑧𝜎 + μ
deviation σ, then add the mean μ.
Example 4.
The time it takes for a cell to divide is normally distributed with an average of 60 minutes and
standard deviation of 5 minutes. How long will it take for a given cell to divide if its “mitosis”
has a z-score of -1.35?
𝑥 = 𝑧𝜎 + μ
Solution:
𝒙 = 𝟓𝟑.
z1 =𝟐𝟓
-2 𝒎𝒊𝒏𝒖𝒕𝒆𝒔
x1 = 40 x2 = 70
z2 = 1
x = z1𝜎 + μ x = z2 𝜎 + μ
Use these given and substitute to the formula (12-2), we have
40 = -2𝜎 + μ 70 = 1𝜎 + μ
Note: In this example, we do not have 𝜎 1 and 𝜎 2, and μ 1 and μ 2 since we are
describing a single population. Thus, 𝜎 1 = 𝜎 2 = and μ 1 = μ 2 = μ.
40 = -2𝜎 + μ
Furthermore, we need to review our algebra specifically on systems of linear equation. So
70 = 1𝜎 + μ
equation 1
equation 2
40 = -2𝜎 + μ
Subtract equation 1 from equation 2 , we have
70 = 1𝜎 + μ
equation 1
30= 1 𝜎 – (-2 𝜎)
equation 2
30= 3 𝜎
=
30 3 σ
10= 𝜎
3 3
also be thought of as a probability. That is, if it's possible to select any 𝑧 value at random,
normally distributed variables. The area under the standard normal distribution curve can
the probability of choosing one, say, below 1.45 would be the same as the area under the
randomly selecting a 𝑧 value below of 1.45 is 0.9265 or 92.65%. The problems involving
curve at the left of 1.45. In this case, the area is 0.9265. Therefore, the probability of
probabilities and percentiles are solved in the same manner as finding the areas under a
normal curve.
In finding probabilities, the following notations will be used:
(a) P(Z < z) – probability at the left of z
(b) P(Z > z) = 1 – P(P < z) – probability at the right of z
(c) P(a < z < b) – the probability of z that is in between two other z values, say
a and b
(d) P(z < a ) ∪ P(z > b) – the probability of z is in the opposite direction of two
values, say a and b
(e) P(X < x) – probability at the left of a normal random variable x
(f) P(X > x) – probability at the right of a normal random variable x
(g) P(a < X < b) – the probability of a normal random variable X that is in
between two other normal random variables, say a and b.
(h) P(X < a ) ∪ P(X > b ) – the probability of x is in the opposite direction of
two values, say a and b
Example 1.
Find the probabilities for each of the following.
(a) P (Z < 1.32)
(b) P (Z < −1.05)
(c) P (−0.75 < Z < 1.56)
(d) P (Z > −0.88)
Solution:
(a) The probability P (Z < 1.32) means to find the area under the normal curve
at the left of z = 1.32. From the table, find the intersection on row 1.3 and
column 0.02. Thus, the probability is 0.9066 or 90.66%.
(b) The probability P(Z < −1.05) is the same as the area under the normal curve
at the left of z = −1.05. From the table, the area is 0.1469. Thus, the
probability is 0.1469 or 14.69%.
(c) The probability P(−0.75 < Z < 1.56) is the same as the area under the
normal curve between z values -0.75 and 1.56. To get the area, subtract the
area of
z = −0.75 from the area of z = 1.56. Thus, P (−0.75 < Z < 1.56) =
P(Z < 1.56)– P(Z < −0.75) = 0.9406 – 0.2266 = 0.7140 or 71.40%.
(d) The probability P(Z > −0.88) is the same as the area under the normal curve
at the right of z = −0.88. To get the area, subtract the area of z =
−0.88 from 1. Thus, P(Z > −0.88) = 1 – P(Z < −0.88) = 1 – 0.1894 =
0.8106 or 81. 06%.
Example 2.
following:
(a) P (X < 19)
(b) P (10 < X < 19)
Solution:
(a) To find the probability, convert the normal random variable 19 to a z-score.
Thus, P (X < 19) = x−µ 19−15 4
P (Z < ) = P (Z < ) = P (Z < ) = P (Z <
σ 3 3
From the table, the probability at the left of z = 1.33 is equal to 0.9082 or
90.82%.
(b) To find the probability, convert the normal random variables 10 and 19 to
z values.
10−15 19−15
Thus, (10 < X < 19) =P( <Z< ) = P (-1.67 < Z
3 3
P(Z < 1.33) - P(Z < - 1.67). From the table, we have 0.9082 –
0.0475= 0.8607
86.07 % . These probabilities can also be applied to real-life problems.
Example 4.
Each month, a Filipino household generates an average of 28 pounds of
newspaper for garbage or recycling. Assume that the distribution is normal with a
standard deviation of 2 pounds. If a household is selected at random, find the probability
of its generating
(b) The probability of generating more than 30.2 pounds per month is equivalent
to P(X > 30.2) which is equal to 1 – P(X < 30.2 ) since it is right tailed. 1- P ( X <
30.2) =
30.2−28
1 – P(Z < ) = 1 – P( Z < 1.10) = 1 – 0.8643 – 0.1357. Thus,
2
the probability of generating more than 30.2 pounds of newspaper per
month is 0.1357 or 13.57 %.
Example 5.
The Mauban Automobile Association reports that the average time it takes to
respond to an emergency call is 35 minutes. Assume the variable is approximately
normal and the standard deviation is 5.5 minutes. If 200 calls are randomly selected,
approximately how many were responded in less than 25 minutes?
Solution:
The number of calls responded in less than 25 minutes is equal to the
percentage of 200 calls which were responded in less than 25 minutes. First, find
25−35
P(X < 25). P(X < 25) = P (Z < ) = P(Z < -1.82). From the table , this is
5.5
equal to 0.0344. Thus, there are ( 0.0344) ( 200) = 6.88 or approximately 7 calls
were responded in less than 25 minutes.
Example 6.
To help students improve their reading, Mauban south district decides to
implement a reading program. It is to be administered to the bottom 5% of the
students in the district, based on the scores of a reading achievement test which are
normally distributed. If the average score for the students in the district is 122.6 and
the standard deviation is 18, find the cut-off score that will make a student eligible
for the program.
Solution:
The problem is asking for a
normal random variable that
corresponds to a z value and
occupying an area of 5% or 0.05
from the left under the normal
curve. To find this normal random
variable which is the cut-off score
that will make a student eligible for
the reading program, look for the z
value on the table corresponding to
an area of 0.05
and that is equal to −1.645. Then, convert this z value to a normal random variable
x, that is, x = z σ + µ = (−1.645)(18) + 122.6 = −29.61 + 122.6 = 92.99 ≈ 93.
Therefore, a student will be eligible in the reading program if he scored 93 and
below.
Example 7.
To pass in the accreditation and
equivalency (A&E) test, ALS students must
score in the top 15% in general ability tests.
The test has a mean of 200 and a standard
deviation of 20. Find the lowest possible score
to pass the test assuming the test scores are
normally distributed.
Solution:
The lowest possible score is the normal
random variable corresponding to a
z value occupying an area of 0.15 from the right
of the normal curve. To solve for the required
variable, given that P(Z > z) = 0.15 NOTE:
Remember P(Z < z) + P(Z > z) = 1
P(Z > z) = 0.15
1 – P(Z > z) = 1 – 0.15
1 – P(Z > z) = 1 – 0.15 since it is right tailed
= 0.85
from the table, look for the z value corresponding to this area.
Thus, z = 1.03 then convert this to a normal random variable x
x = z σ + µ = (1.03)(20) + 200 = 220.6 ≈ 221
Therefore, the lowest possible score to pass the test is 221.
What’s More
Assessment
Solve Me !
Solve the following problems by supplying the needed information.
The average number of calories in a 1.8-ounce chocolate bar is 230. Suppose
that the distribution of calories is approximately normal with a standard deviation of 10.
1. Find the probability that a randomly selected chocolate bar will have less
than 200 calories.
2. Find the probability that a randomly selected chocolate bar will have greater
than 195 calories.
3. What is the percentage that a chocolate bar is randomly selected between
200 calories and 250 calories?