STATISTICS Module 1
STATISTICS Module 1
MODULE 1:
Random Variables and
Probability Distribution
Content Course Competencies
At the end of this module, you are expected to:
1. Illustrate a random variable (discrete and continuous).
2. Illustrate a probability distribution for a discrete random variable and its
properties.
3. Compute probabilities corresponding to a given random variable.
4. Illustrate the mean and variance of a discrete random variable.
5. Calculate the mean and the variance of a discrete random variable.
6. Interpret the mean and the variance of a discrete random variable.
7. Solve problems involving mean and variance of probability distributions.
1
STATISTICS AND PROBABILITY
Introduction:
Before we discuss random variables and probability distribution, let us look at the meaning
and applications of inferential statistics.
In life, there is full of uncertainties. For example, no one knows you get a high score in your
examination, or we can never be sure if a sports team will win their next game. However,
we can plan our daily activities by making inferences about the future.
These inferences are usually made based on gathered data about the situation. For
example, we cannot know if it will rain an hour from now, but we can decide to bring an
umbrella if it is cloudy or there is a looming thunderstorm report.
Case Study:
You found out by gathering data that six out of 10 senior high school students in a city
prefer online classes than face to face classes. How do you use this data in determining
future outcomes?
➢ In this case study, we can use inferential statistics to determine future outcomes.
➢ All the situations above are what we deal with in inferential statistics.
Inferential Statistics:
➢ Refers to the techniques of interpreting values resulting from obtained sample data
to draw conclusions or make inferences about the population.
➢ In inferential statistics, we analyze what we know about the object of our study in
determining the likelihood of an event happening in the future.
2
STATISTICS AND PROBABILITY
Definition of Terms
Population
- Refers to the group being studied.
- Refers to the collection of people, objects, individuals, or scores that can be
described as having a unique combination of qualities. More precisely, a population
consists of numerical values associated with objects or individuals.
Parameter
- A summary measure calculated on an entire population data; it quantifies the
characteristic of the population under investigation.
- Data collected out of a population is called parametric data.
Sample
- A part of a population;
- A collection of some elements in a population and it is representative of the entire
population
Statistic
- A summary measure or value calculated on sample data; it quantifies the
characteristic of the sample, which represents the population.
- Data gathered from a sample is called statistical data or statistic.
Variable
- Is any property or characteristic of interest about each individual unit of a
population or of a sample- it is an important aspect of a hypothesis. Some examples
of variables are: age, sex, civil status, work experience and so on.
Data
- Could be referred to as the raw material of statistics. It is a set of values collected for
the variable from each of the elements of the sample. It could also be a set of
obtained measurements from the subjects of an experiment.
3
STATISTICS AND PROBABILITY
Unit 1:
Exploring Random Variables and
Constructing Probability
Distributions
Lesson Objectives: At the end of this unit, you are expected to:
4
STATISTICS AND PROBABILITY
This unit will explain the concept of random variables and probability distribution.
Together, we will learn how to construct the probability mass function of a discrete
probability distribution. Moreover, we will describe its properties and characteristics by
computing its mean and variance.
Random Variable
- is a function that associates a real number to each element in the sample
space. It is a variable whose values are determined by chance.
5
STATISTICS AND PROBABILITY
Exercise 1: Identify which of the following represent continuous variables and which
represent discrete variables:
1. The number of times your mother visited her dentist last year.
2. Reaction time of the subjects in an experiment.
3. Age of the subjects in an experiment.
4. Height of the grade 11 students in your school.
5. Life span of a sample of batteries.
6. Time of the day.
7. Speed of a sports car in a race.
8. Number of words you can encode in 2 minutes.
9. How many kinds of candies have you eaten last week?
10. Number of female students in your statistics class.
Reflect on this:
How do you know whether a random variable is continuous or discrete?
6
STATISTICS AND PROBABILITY
Definition of Terms:
Sample Space:
- the set of all possible outcomes of an experiment and represented by the symbol S
- also refers as the population
Sample Point:
- each of the possible outcomes of the sample space
Variable:
- A characteristic or attribute that can assume different values. We use capital letters
to denote or represent a variable.
Exercise 2:
1. List the sample space of the experiment below:
3. List the elements of the sets corresponding to the following events given the
following sets: A= {0, 2, 4, 6, 8} B= {2, 3, 4, 5}
a. 𝐴 ∩ 𝐵 (∩ means intersection)
b. 𝐴 ∪ 𝐵 (∪ means union)
7
STATISTICS AND PROBABILITY
4. How many sample points are in the sample space when a pair of dice is thrown
once?
Solution: 1st die→𝑛1 = 6 2nd die→𝑛2 = 6 Therefore (𝑛1 )(𝑛2 ) =?
5. List the sample space of the following experiments and the number of elements in
the sample space.
a. Drawing a card greater that 7 from a deck of card
b. Drawing a spade from a deck of cards
c. Rolling a die and tossing a coin simultaneously
d. Tossing three coins
Probability Distribution
Refers to the arrangement of the values of a random variable and the
probabilities assigned to them.
8
STATISTICS AND PROBABILITY
Step 2
There are four possible values of the random variable H representing the number of heads.
These are 0, 1, 2, and 3. Assign probability values P(H) to each value of the random
variable.
❖ As you notice, the probability of 0, 1, 2, and 3 is less than 1 but greater than 0. The sum
of the probabilities is 1. It does satisfy the properties of probability distribution.
❖ Remember, a probability can never be less than zero/ negative and can never be more
than 1.
9
STATISTICS AND PROBABILITY
2. A cargo of five 55” televisions contains two that are slightly defective. If a store
receives three of these televisions at random, list the elements of the sample space S
using the letters D and N for defective and non-defective television, respectively. To
each sample point, assign a value of z of the random variable Z representing the
number of televisions bought by the store which are slightly defective.
10
STATISTICS AND PROBABILITY
Number of Defective TV 0 1 2
1 3 3
Probability P(Z)
7 7 7
Questions:
1. What’s the probability of
getting all non-defective?
1
Answer: 7
2. What is the probability that
the store will get a defective
TV?
6
Answer: 7
3. A sari-sari store that sells bubble gums recorded how many bags of gum it sold per
customer. This record was used to create a probability distribution of the number of
bags of gum sold per customer.
Number of bags
sold per customer, 0 1 2 3 4
X
Probability, P(X) 0.1 0.4 0.2 0.2 0.1
11
STATISTICS AND PROBABILITY
In the example above, the random variable is the number of bags of gum the store
sold per customer. The sample space contains outcomes 0, 1, 2, 3, and 4. This means
that 𝑃(0) = 0.1, 𝑃(1) = 0.4, 𝑃(2) = 0.2, 𝑃(3) = 0.2, 𝑃(4) = 0.1. You can also notice
that the sum of all the probabilities is equal to 1, and none of the probabilities is less
than 0 or greater than 1. Therefore, it satisfies the properties of a probability
distribution.
More examples of a random variable and probability distribution are given below.
1. A raffle with 100 entries is drawn. Of the 100 entries, 20 are assigned to Robert, 28 to
David, and 52 are assigned to Peter. If a numerical value is assigned to each person so
that “0” is Robert, “1” is David, and “2” is Peter, create a probability distribution to
determine the probabilities assigned to 0, 1, and 2.
𝒔
First, solve for the probability assigned to each value by using 𝑷(𝑿) = 𝒏 .
20 𝟏
Solving for 𝑃(0) 𝑃(0) = 100 = 𝟓
28 𝟕
Solving for 𝑃(1) 𝑃(1) = 100 = 𝟐𝟓
52 𝟏𝟑
Solving for 𝑃(2) 𝑃(2) = 100 = 𝟐𝟓
12
STATISTICS AND PROBABILITY
𝑋 0 1 2
𝑃(𝑋) 1 7 13
5 25 25
2. Blood groups are determined for a sample of people, and the results are given in the
accompanying table. If one person from this sample group is randomly chosen, find
the probability that the person had group AB blood.
Blood Group O A B AB Total
Frequency 90 75 35 18 218
𝑠
Solution: 𝑃(𝑋) = 𝑛
18 𝟗
𝑃(𝐴𝐵) = 218 = 𝟏𝟎𝟗
𝟗
So, the probability of choosing a person with AB blood is 𝟏𝟎𝟗.
Exercise 3:
A. Determine whether the distribution represents a probability distribution.
1.
𝑿 1 2 3 4 5
𝑷(𝑿) 1 1 1 1 1
4 4 4 4 4
Answer:
It doesn’t represent a probability distribution because the sum of all the probabilities
is more than one. [Or in symbol, No because ∑ 𝑃(𝑋) > 1)]. And it doesn’t satisfy one
of the properties of probability distribution.
2.
𝑿 0 2 4 6 8
𝑷(𝑿) 1 3 1 1 1
8 8 4 8 8
Answer:
It does represent a probability distribution because the sum of all the probabilities is
equal to one. [Or in symbol, Yes because ∑ 𝑃(𝑋) = 1)]. And it does satisfy all the
properties of probability distribution.
3.
𝑿 1 3 5 7
𝑷(𝑿) 1 3 7 1
5 5 5 5
Answer:
It doesn’t represent a probability distribution because the sum of all the probabilities
is more than one. [Or in symbol, No because ∑ 𝑃(𝑋) > 1)]. In addition, the
probability of 5 is more than 1 (𝑃(5) > 1. And it doesn’t satisfy all the properties of
a probability distribution.
13
STATISTICS AND PROBABILITY
B. Determine whether the given values can serve as the value of probability distribution
of the random variable X that can take on only the values 1, 2, and 3.
C. For each of the following, determine whether it can serve as the probability
distribution of a random variable X.
1
1. 𝑃(𝑋) = 6 for 𝑥 = 1, 2, 3, … ,8
1 1 1 4
Answer: 𝑃(1) = 6 , 𝑃(2) = 6 , …, 𝑃(8) = 6, if we get the sum, then we’ll get 3 which
is more than 1. So, the answer is no because ∑ 𝑃(𝑋) > 1 .
12
2. 𝑃(𝑋) = 25𝑥 for 𝑥 = 1, 2, 3, 4
12
Answer: Plug in the values of x to 𝑃(𝑋) = 25𝑥 and get the sum.
12 12 12 12
➢ + 25(2) + 25(3) + 25(4) = 1, Yes, because it satisfies the properties of the
25(1)
probability of distribution.
D. Now let us see if you can still recall what you have learned from Junior High School.
Find the probability of the following events.
14
STATISTICS AND PROBABILITY
Now it is your turn to practice and apply what you have learned from all the previous
examples. You can check your answers on the given key answers at the end of Module 1.
B. The probability of a college student making two, three, four, five, and six mistakes in
a 500- word essay is 0.15, 0.20, 0.10, 0.30, and 0.25, respectively. Illustrate the
probability distribution and the probability histogram of the random variable.
C. Determine whether the given values can serve as the value of probability distribution
of the random variable X that can take on only the values 1, 2, 3, and 4. Explain your
answer.
1 10 6 7
1. 𝑃(1) = 19 , 𝑃(2) = 19 , 𝑃(3) = 19 , 𝑃(4) = 19
1 3 1 1
2. 𝑃(1) = 8 , 𝑃(2) = 8 , 𝑃(3) = 4 , 𝑃(4) = 4
3. 𝑃(1) = 0.15 , 𝑃(2) = 0.25 , 𝑃(3) = 0.45 , 𝑃(4) = −0.15
15
STATISTICS AND PROBABILITY
3 0.07
4 0.08
5 0.10
6 0.12
7 0.14
8 0.16
9 0.14
10 0.06
1. What is the probability that 4 or more copies will be demanded in a particular day?
2. What is the probability that the demand will be at least 3 but not more than 7?
Reflect:
1. Why should the sum of the probabilities in a probability distribution always
be equal to 1?
2. What is the shape of most probability distributions? Why do you think so?
16
STATISTICS AND PROBABILITY
Unit 2:
Computing the Mean and Variance of
a Discrete Probability Distribution
17
STATISTICS AND PROBABILITY
∑𝑛 𝑥
Population Mean: 𝜇 = 𝑖=1𝑁 1
o Population mean is based on the data taken from an entire population
Example:
Given the table below on the ages of males and females
Pair Males (x) Females (y)
1 18 17
2 18 19
3 20 18
4 19 20
5 17 16
18
STATISTICS AND PROBABILITY
2. ∑5𝑖=1 𝑦𝑖 = 17 + 19 + 18 + 20 + 16 = 𝟗𝟎
3. ∑5𝑖=1(𝑥𝑖 − 𝑦𝑖 )2 = (18 − 17)2 + (18 − 19)2 + (20 − 18)2 + (19 − 20)2 + (17 − 16)2
∑5𝑖=1(𝑥𝑖 − 𝑦𝑖 )2 = 𝟖
4. Average/ Mean age of the males
∑5𝑖=1 𝑥𝑖 92
= = 𝟏𝟖. 𝟒
𝑛 5
- variance is the square of standard deviation or standard deviation is the square root
of variance.
√10
𝑠2 = ( ) = 𝟐. 𝟓
4
19
STATISTICS AND PROBABILITY
Formula: 𝝁 = ∑ 𝑿 ∙ 𝑷(𝑿)
Where:
𝜇 = 𝑚𝑒𝑎𝑛 𝑜𝑓 𝑡ℎ𝑒 𝑑𝑖𝑠𝑐𝑟𝑒𝑡𝑒 𝑟𝑎𝑛𝑑𝑜𝑚 𝑣𝑎𝑟𝑖𝑎𝑏𝑙𝑒
𝑃(𝑋) = 𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑎𝑠𝑠𝑖𝑔𝑛𝑒𝑑 𝑡𝑜 𝑡ℎ𝑒 𝑟𝑎𝑛𝑑𝑜𝑚 𝑣𝑎𝑟𝑖𝑎𝑏𝑙𝑒
𝑋 = 𝑟𝑎𝑛𝑑𝑜𝑚 𝑣𝑎𝑟𝑖𝑎𝑏𝑙𝑒
20
STATISTICS AND PROBABILITY
3. A sari-sari store that sells bubble gums recorded how many bags of gum it sold per
customer. This record was used to create a probability distribution of the number of
bags of gum sold per customer. Using the data below, determine the mean.
Number of bags
sold per customer, 0 1 2 3 4
X
Probability, P(X) 0.1 0.4 0.2 0.2 0.1
21
STATISTICS AND PROBABILITY
Examples:
1. Suppose three coins are tossed. Let X be the random variable representing the
number of heads that occur. Find the probability of each of the values of the random
variable X and the variance.
𝟕𝟕𝟗 √𝟕𝟕𝟗
𝝈 = √𝟒𝟎𝟎 = 𝒐𝒓 𝟏. 𝟑𝟗𝟓𝟓
𝟐𝟎
22
STATISTICS AND PROBABILITY
3. A sari-sari store that sells bubble gums recorded how many bags of gum it sold per
customer. This record was used to create a probability distribution of the number of
bags of gum sold per customer. Using the data below, determine the mean and the
variance.
Number of bags sold per
0 1 2 3 4
customer, X
Probability, P(X) 0.1 0.4 0.2 0.2 0.1
Solution:
Mean: 𝜇 = ∑ 𝑋 ∙ 𝑃(𝑋) = (0 ∙ 0.1) + (1 ∙ 0.4) + (2 ∙ 0.2) + (3 ∙ 0.2) + (4 ∙ 0.1) = 𝟏. 𝟖
Alternative Formula
1. Suppose three coins are tossed. Let X be the random variable representing the
number of heads that occur. Find the probability of each of the values of the random
variable X and the variance.
Number of Heads Probability
𝑋 ∙ 𝑃(𝑋) 𝑋 2 ∙ 𝑃(𝑋)
X P(X)
1 1
0
8
0 02 ∙ =0
8
3 3 3 3
1
8 12 ∙ =
8 8 8
3 6 3 3
2
8 22 ∙ =
8 8 2
23
STATISTICS AND PROBABILITY
1 3 1 9
3 8 32 ∙=
8 8 8
𝜎 2 = ∑[𝑋 2 ∙ 𝑃(𝑋)] − 𝜇 2
𝟑
𝝁 = 𝟐 𝒐𝒓 𝟏. 𝟓 𝝈𝟐 = 𝟑 − 𝟏. 𝟓𝟐
𝟑
= 𝟒 𝒐𝒓 𝟎. 𝟕𝟓
√𝟑
𝝈 = √𝟎. 𝟕𝟓 = 𝒐𝒓 𝟎. 𝟖𝟔𝟔
𝟐
3. A sari-sari store that sells bubble gums recorded how many bags of gum it sold per
customer. This record was used to create a probability distribution of the number of
bags of gum sold per customer. Using the data below, determine the mean and the
variance.
Number of bags sold per
0 1 2 3 4
customer, X
Probability, P(X) 0.1 0.4 0.2 0.2 0.1
Solution:
Mean: 𝜇 = ∑ 𝑋 ∙ 𝑃(𝑋) = (0 ∙ 0.1) + (1 ∙ 0.4) + (2 ∙ 0.2) + (3 ∙ 0.2) + (4 ∙ 0.1) = 𝟏. 𝟖
24
STATISTICS AND PROBABILITY
2. In a store that sells milk tea, it was found that 30% of customers buy three milk
teas per purchase, 20% buy two milk teas per purchase, and 50% buy one milk
tea per purchase. What is the probability that in the first 20 customers in a day,
six of them buy three milk teas, four buy two milk teas, and 10 buy one milk tea?
Construct the Probability Mass Function, and then solve for the mean, variance,
and standard deviation.
Formative Assessment: Encircle the letter of your answer. Take note that this activity
will NOT be submitted. The answers are found at the end of Module 1.
3. Among certain stocks selected in one day, 442 declined and 458 roses. If one of these
stocks is selected at random, what is the probability that it rose today?
229
A. 0.442 B. 0.458 C. 450
25
STATISTICS AND PROBABILITY
6. What is the probability of selecting a black Jack of hearts in an ordinary deck of cards?
1 13
A. 52 B. 52 C. 0
7. Which of the following explains why the table below is not a probability distribution?
X 1 3 5 7 9
P(X) 0.09 0.15 0.18 0.12 0.13
9. If two coins are tossed, which is not a possible value of the random variable for the
number of tails?
A. 1 B. 2 C. 3
10. Find the mean, variance, and the standard deviation of the probability distribution
below.
X 2 4 6 8 10
P(X) 1 1 2 2 1
9 9 9 9 3
26
STATISTICS AND PROBABILITY
Exercise 1
1. Discrete 5. Continuous 9. Discrete
2. Continuous 6. Continuous 10. Discrete
3. Continuous 7. Continuous
4. Continuous 8. Discrete
Exercise 2
1. S= {BBB, BBG, BGB, BGG, GBB, GBG, GGB, GGG}
2. a. S= {8, 16, 24, 32, 40, 48} b. S= {1, -5} c. 𝑆 = {𝑥|1 < 𝑥 < 2}
3. a. {2, 4} b. {0, 2, 3, 4, 5, 6, 8}
4. 36
5. Number of elements in the sample space and the list of sample points in the sample
space.
a. There are 24 sample points in the sample space S= {6 Heart, 6 Spade, 6 Diamond,
and 6 Club}
b. There are 13 sample points in the sample space S={1 Spade, 2 Spade,…, King
Spade}
c. There are 12 sample points in the sample space
S= {H1, H2, H3, H4, H5, H6, T1, T2, T3, T4, T5, T6}
27
STATISTICS AND PROBABILITY
Exercise 3- D
1
1. The deck of cards has 52 5. 3
cards in it, and there are 4 1
suits; in each suit, there is 1 6. 2
5
Ace. So, the probability is 7.
4 1 36
𝑜𝑟 . 8.
1
52 13
1 4
2. 1
18 9.
1 6
3. 1
8 10.
1 26
4. 26
Elaborate
A. Determine whether the distribution represents a probability distribution.
1. Yes, because ∑ 𝑃(𝑋) = 1, it does satisfy the properties of probability distribution
2. No, because ∑ 𝑃(𝑋) > 1, it does not satisfy one of the properties of probability
distribution
3. No, because ∑ 𝑃(𝑋) < 1, it does not satisfy one of the properties of probability
distribution
28
STATISTICS AND PROBABILITY
C. Determine whether the given values can serve as the value of probability distribution
of the random variable X that can take on only the values 1, 2, 3, and 4. Explain your
answer.
1. No, because ∑ 𝑃(𝑋) > 1, it does not satisfy one of the properties of probability
distribution
2. Yes, because ∑ 𝑃(𝑋) = 1, it does satisfy the properties of probability distribution
3. No, because ∑ 𝑃(4) = −0.15, it does not satisfy one of the properties of
probability distribution
UNIT 2
Elaborate
Evaluate
Formative assessment
1. B
2. C
3. C
4. B
5. A
6. C
7. A & B
8. C
9. C
10. 𝜇 = 7.11 ; 𝜎 = 2.69 ; 𝜎 2 = 7.21
29
STATISTICS AND PROBABILITY
References:
Baccay, E. S., & Belecina, R. R. (2019). Statistics and Probability (2nd ed.). Rex Book Store
Inc.
30
STATISTICS AND PROBABILITY
Tizon, M. B., & Mesa, H. Y. (2016). Stat Speaks: Statistics and Probability for 21st Century
Learners. St. Bernadette Publishing House Corporation.
Abao, Z. R., Guhao, Jr., E. S., Marribay, J. B., Zorilla, R. S., Mendoza, V. C., Jalon, G. E., Santiago,
A. D., Buenaseda, F. P., Conde, B. T., Canlas, R. C., & Senajon, E. T. (2009). Basic
Statistics with Business and Nursing Applications. Books Atbp. Publishing Corp.
Tizon, M. B., & Mesa, H. Y. (2016). Stat Speaks: Statistics and Probability for 21st
Century Learners. St. Bernadette Publishing House Corporation.
Casio fx115ES Plus - Mean, Standard Deviation, Variance. (2019, April 12). [Video].
YouTube. https://www.youtube.com/watch?v=McCoCvogTQ4
31