Chapter 2B QM (PC)
Chapter 2B QM (PC)
Measures of Dispersion
➢ Measures of dispersion help us to understand the spread or
variability of a set of data. It gives additional information to judge
the reliability of the measure of central tendency and helps in
comparing dispersion that is present in various samples.
➢ Two data sets can have the same mean, the same median, or the
mode and yet they are very different in other respects.
➢ Example: consider the heights (cm) of five employees from each
of the sales and production departments as shown:
Sales department: 183 185 193 193 198
Production department: 170 183 193 193 213
The two groups have the same mean heights, 190.4cm, the same median heights,
193cm, and the same modal heights, 193cm.
Nonetheless, it is clear that the two data sets differ. To describe this difference
quantitatively, we use a measure of dispersion.
1
Eg 1: Find the range for the following data.
10 15 17 20 25 29 30 35 38 40 45
Solution: Range = 45 – 10 = 35
2
Quartile Deviation (semi-interquartile range – ½ IQR)
𝑄3 −𝑄1
QD = where 𝑄1 = lower quartile or first quartile
2
53 58 68 73 75 76 79 80 85 88 91 99
12+1
a) Q1 = ( ) 𝑡ℎ = 3.25th = 68 + 0.25(73 – 68) = 69.25
4
3(12+1)
Q3 = ( ) 𝑡ℎ = 9.75th = 85 + 0.75(88 – 85) = 87.25
4
3
Eg 4: The following are the ages of nine employees of an insurance
company.
47 28 39 51 33 37 59 24 33
Find the quartile deviation.
24 28 33 33 37 39 47 51 59
9+1
Q1 = ( ) 𝑡ℎ = 2.5th =28 + 0.5(33 – 28) = 30.5
4
3(9+1)
Q3 = ( ) 𝑡ℎ =7.5th = 47 + 0.5(51 – 47) = 49
4
49−30.5
QD = = 9.25
2
Value
5
62
(a) Q1 = value of the th = value of the 15.5 th item
4
LQ1=17.5
Q1 class: 17.5 – 22.5
CQ=
5
Q1 = 17.5 + 22 [15.5 − 2] = 20.57 fQ1=22
3(62) fQ1-1=2
Q3 = value of the th = value of the 46.5 th item
4
5 CQ=
Q3 = 27.5 + 14 [46.5 − 34] = 31.96
fQ3=14
31.96−20.57 fQ3-1=34
Quartile deviation = = 5.695
2
(b)
Q1 = 20.5
25% of the days are having production less than or equal to 20.5
units and the other 75% of the days are more than or equal to 20.5
units.
6
Q3 = 32
75% of the days are having production less than or equal to 32
units and the other 25% of the days are more than or equal to 32
units.
32−20.5
Quartile deviation = = 5.75
2
Percentiles are the summary measures that divide a ranked data set
into 100 equal parts. There are 99 percentiles in a ranked data set.
Consider n items arranged in ascending order. Then,
𝑘
The kth percentile, Pk = 100 (𝑛 + 1)th value
P25 = Q1; P50 = Q2; P75 = Q3
7
a) Raw data
Example: Find the P10 and P90 for the following raw data.
63, 105, 30, 43, 53, 73, 65, 77,89, 70, 68, 47, 38, 34, 41, 80, 60, 54, 59
30, 34, 38, 41, 43, 47, 53, 54, 59, 60, 63, 65, 68, 70, 73, 77, 80, 89, 105
10
P10 = 100 (19 + 1)𝑡ℎ = 2th = 34
90
P90 = 100 (19 + 1)𝑡ℎ = 18th = 89
Example: Find the P10 and P90 for the following distribution.
Marks 10 20 30 40 50 60
Number of 3 9 20 8 5 4
students
Cumulative 3 12 32 40 45 49
frequency
Soln:
10
P10 = 100 (49)𝑡ℎ = 4.9th = 20
90
P90 = 100 (49)𝑡ℎ = 44.1th = 50
8
c) Grouped frequency distribution
𝐶𝑃𝑘 𝑛𝑘
𝑃𝑘 = 𝐿𝑃𝑘 + 𝑓 ( − ∑ 𝑓𝑃𝑘−1 )
𝑃𝑘 100
9
Soln:
10(62)
n = 62 P10 = value of th item = value of 6.2th item
100
90(62)
P90 = value of th item = value of 55.8th item
100
L10=17.5
10
From the ogive, P10 = 18.5 units and P90 = 43.5 units
(∑ 𝑥)2
∑ 𝑥2 −
Sample standard deviation : 𝑠 = √𝑠 2 =√ 𝑛
𝑛−1
11
(∑ 𝑥)2 (24)2
∑ 𝑥2 − 138−
𝑛 5
s2 = = = 5.7
𝑛−1 5−1
s = √5.7 = 2.39
∑ 𝑥2 ∑𝑥 2 5920.465 176.8 2
=
2
−(𝑁) = −( ) = 118.460
𝑁 6 6
= √118.460 = 10.88
2) For grouped data
2
(∑ 𝑓𝑥)
∑ 𝑓𝑥 2 ∑ 𝑓𝑥 2 ∑ 𝑓𝑥 2 −
𝜎2 = ∑𝑓
− (∑ ) and 𝑠2 = 𝑛
𝑓 𝑛−1
12
2
(∑ 𝑓𝑥)
∑ 𝑓𝑥 2 −
Sample standard deviation : 𝑠 = √𝑠 2 = √ 𝑛
𝑛−1
Solution:
x f fx fx2
1 3 3 3
2 4 8 16
3 8 24 72
4 2 8 32
5 3 15 75
f= 20 fx= 58 fx2= 198
13
∑ 𝑓𝑥 2 ∑ 𝑓𝑥 2 198 58 2
St. dev = √ ∑𝑓
− ( ∑ 𝑓 ) = √ 20 − (20) = 1.22
14
∑ 𝑓𝑥 2 ∑ 𝑓𝑥 2 14825 535 2
Variance = ∑𝑓
− ( ∑𝑓 ) = − ( 25 ) = 135.04
25
No. of orders f
10 – 12 4
13 – 15 12
16 – 18 20
19 – 21 14
x f fx fx2
11 4 44 484
14 12 168 2352
17 20 340 5780
20 14 280 5600
f = 50 fx = 832 fx2 = 14216
15
2 2
(∑ 𝑓𝑥) (832)
∑ 𝑓𝑥 2 − 14216−
𝑛 50
Variance = = = 7.582
𝑛−1 50−1
Coefficient of Variation
➢ Useful for relative comparison.
𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛
CV = × 100%
𝑚𝑒𝑎𝑛
23.0
CV of Machine B = 281.3 × 100% = 8.18%
16
Coefficient of Skewness
➢ The term skewness is used to describe the shape of a frequency
distribution.
➢ If the histogram of a frequency distribution is drawn, the
distribution is said to be skewed if the peak of the histogram lies
to either side of the centre of the distribution. The terms positive
and negative skewness are used to describe the direction of the
skewness.
➢ If the mean = mode = median, the distribution of data is said to
be symmetrical else asymmetrical or skewed.
➢ 2 types of asymmetrical frequency distribution:
i) Positive skewed distribution
Mean > Median > Mode
Tail stretches out to the right
17
➢ Degree of skewness, Pearson’s coefficient of skewness
3(𝑀𝑒𝑎𝑛−𝑀𝑒𝑑𝑖𝑎𝑛)
𝑆𝑘 = 𝑆𝐷
Range of Sk is [-3,3]
For symmetrical distribution, Sk = 0
Soln:
3(𝑥̅ −𝑥̅̃) 3(30.9−28.8)
𝑆𝑘 = = = 0.476
𝑠 13.23
18
To get some basic statistics from the data, follow the following
procedure:
19
20