Random Variable & Probability Distribution
Random Variable & Probability Distribution
Variance is often depicted by this symbol: σ2. The square root of the
variance is the standard deviation (σ), which helps determine the
consistency of an EVENT over a period of time.
Variance = ∑ P(x-x)²
Example: expected value, standard
deviation and CV (HW)
x 10 11 12 13 14
P(x) .4 .2 .2 .1 .1
X 1 2 3 4 5
P(x) .13 .29 .38 .13 .07
Z-Score
Video 2
Videohttps://www.youtube.com/watch?v=fnU42Ue9utk 3
Video 4
Let’s assume that the mean score for a class of 50 students is 60
and the standard deviation is 15 marks. A student named Emily
asked the teacher if by scoring 70 she has performed well or not.
Initially by looking at Emily’s score, it appears that she did well considering that 60 is
the mean class score.
However, this doesn’t reflect the variation among 50 students. Considering the
standard deviation of 15, it is very likely that there is a significant variation among
the scores. To answer the question how well Emily performed in the coursework
compared to other students in the class we can use the Z score.
For finding out the
number of students in the
class that scored higher
or lower than Emily, we
will look at the normal
distribution table. In this
case the Z-value comes
to 0.74857
we can clearly see that Emily performed better than 74.86% (37 students)
of the students in her class.
Also, it means that the probability of a score being higher than 0.67 is 1-
0.74857 = 0.25143 = 25.14% (13 students). It shows that approximately
25% of all the students scored higher than Emily.
A class of 50 students who have written the science test last week. Today
is the result day, and the class teacher told me that John scored 93 in the
test while the average score of the class was 68. Determine the z-score for
John’s test mark if the standard deviation is 13.
POINT INTERVAL
ESTIMATE ESTIMATE
S S
In case of
replacement: 𝜎x = 𝜎/ √N
𝜎2x = 𝜎2/N
In case of no
replacement:
𝜎2x = 𝜎2 (Np –N)
N (Np-1)
MATHEMATICAL PROBLEMS
Sampling Distribution of means
Prob. 1 :
A population consists of the five numbers 2,3,6,8 and 11.
all possibleConsider
samples of size 2 that can be drawn with and without
from this population . Find out
replacement
a)The mean of the population.
b) The standard deviation of the population .
c)The mean of the sampling distribution of
means. deviation of the sampling distribution of means (the standard
d)Standard
error of means ).
# Answer :
2+3+6+8+11
a) Mean of the population = = =
5
30 5 6 2
2 ⅀ 𝑥−𝜇
b)Standard deviation of population ,𝜎 =
𝑁
2
= (2−6)2 +
(3−6) +(6−6)52+(8−6)2+(11
16+9+0+4+25
2
= −6)
54 5 = =
5 10.8
∴ 𝜎 = 3.29
With replacement :
c)There are 5(5)= 25 samples of size 2 that can be drawn with replacement. These
are :
(2,2) (2,3) (2,6) (2,8) (2,11)
(3,2) (3,3) (3,6) (3,8) (3,11)
(6,2) (6,3) (6,6) (6,8) (6,11)
(8,2) (8,3) (8,6) (8,8) (8,11)
(11,2) (11,3) (11,6) (11,8) (11,11)
The corresponding sample means are :
5.5
𝑠𝑢𝑚 𝑜 𝑓 𝑎𝑙𝑙 𝑠𝑎𝑚𝑝𝑙𝑒 𝑚𝑒𝑎𝑛𝑠
7.0distribution of
And the mean of sampling
mean is , 25
8.0
1
= 5
= 6.0
9.5 25
0
6.5
Hence proved the fact that 𝜇𝑥 =
µ 7.0
8.5
9.5
11.0
d) Here, standard deviation of the sampling distribution of mean is,
2 2 2
𝜎2x = 2−6 ++(2.5−6) + ………+ (11−6) ( substracting the mean 6 from each numbers,
squaring the
result, adding all1325 numbers25thus obtained and dividing by 25 )
=5 =
2
5.40
σx = 55.40 =
2.32 𝜎2
This illustrates the fact that for finite populations involving sampling with replacement 𝜎 2x = -
𝑁
since
, the right hand side is 10.8/2 = 5.40 ; agreeing with the above value
.
Without Replacement:
c) There are 10 samples of size 2 that can be drawn without replacement from
the population NCn = 5!/[(5-2)! X 2! ]
(2,3) (2,6) (2,8) (2,11) (3,6) (3,8) (3,11) (6,8) (6,11) (8,11)
The corresponding sample means are :
2.5, 4.0 , 5 , 0 , 6.5 , 4.5 , 5.5 , 7.0 , 7.0 , 8.5 ,
9.5 .
The mean of sampling distribution of means is ,
2.5+4.0+
𝜇 𝑥 = …….…+9.5 =
6.0
10 ∴ 𝜇𝑥 = µ
(d) The variance of sampling distribution of mean is ,
(2.5−
6)2+ 4.0−6 2+ ……….+
= 4.05
𝜎2x = 10
(9.5−6) 2
And, 𝜎𝑥 = 2.01
𝜎 2 𝑁−
𝑝 𝑁
this illustrates, 𝜎2x = )
𝑁 𝑁𝑝− 1
(
10.8
= (
2
5−2
5−1)
= 4.05
As obtained above .
Q 2. The population is the weight of six pumpkins (in pounds) displayed in a carnival
"guess the weight" game booth. You are asked to guess the average weight of the six
pumpkins by taking a random sample without replacement from the population.
Pumpkin Weight (in pounds)
A 19
B 14
C 15
D 9
E 10
F 17
To demonstrate the sampling distribution, let’s start with obtaining all of the possible
samples of size n = 2 from the populations, sampling without replacement. Sample size
= 6!/ (4! x 2!) = 15.
To demonstrate the sampling distribution, let’s start with obtaining all of the possible
samples of size n = 5 from the populations, sampling without replacement. Sample
size = 6!/ (1! X 5!) = 6.
MATHEMATICAL PROBLEMS
Sampling Distribution of means (HW)
Prob. 3 :
A population consists of the THREE numbers 2,3, and 4.
all possibleConsider
samples of size 2 that can be drawn with and without
from this population .
replacement
a)The mean of the population.
b) The standard deviation of the population .
c)The mean of the sampling distribution of
means. deviation of the sampling distribution of means (the standard
d)Standard
error of means ).
Central limit theorem is a statistical theory which states that when the large
sample size is having a finite variance, the samples will be normally distributed
and the mean of samples will be approximately equal to the mean of the whole
population.
In other words, the central limit theorem states that for any population with mean
and standard deviation, the distribution of the sample mean for sample size N
has mean μ and standard deviation σ / √n .
As the sample size gets bigger and bigger, the mean of the sample will get closer
to the actual population mean. If the sample size is small, the actual distribution
of the data may or may not be normal, but as the sample size gets bigger, it can
be approximated by a normal distribution. This statistical theory is useful in
simplifying analysis while dealing with stock index and many more.
The record of weights of the male population follows the normal distribution. Its
mean and standard deviations are 70 kg and 15 kg respectively. If a researcher
considers the records of 50 males, then what would be the mean and standard
deviation of the chosen sample?
= 15/√50
= 15/7.071
= 2.122= 2.1 kg
Example:
The record of weights of female population follows normal distribution. Its
mean and standard deviation are 65 kg and 14 kg respectively. If a researcher
considers the records of 50 females, then what would be the standard deviation
of the chosen sample?
Solution:
Mean of the population μ = 65 kg
Standard deviation of the population = 14
kg
sample size n = 50
Standard deviation is given by = σ / √n
= 14/√50
= 14/7.071
= 1.97
The average weight of a water bottle is 30 kg with a standard deviation of 1.5 kg. If
a sample of 45 water bottles is selected at random from a consignment and their
weights are measured, find the probability that the mean weight of the sample and
standard deviation.
= 1.5/√45
= 1.5/6.7082
= 0.2236
Hypothesis Testing