Lecture 4 Measures of Dispersion
Lecture 4 Measures of Dispersion
DISPERSIONS
MEASURES OF DISPERSIONS
• A quantity that measures the variability
among the data, or how the data one
dispersed about the average, known as
Measures of dispersion, scatter, or
variations.
2. Common Measures of
Dispersion
• The main measures of dispersion
1. Range
2. Mean deviation or the average deviation
3. The variance & the standard deviation
1. RANGE
• It is the difference between the largest and the smallest
observation in a set of data.
• Range = xm – xo
• Its relative measure known as coefficient of dispersion.
x m xo
• Coefficient of dispersion =
x m xo
• It is used in daily temperature recording stick prices rate
• It ignores all the information available in middle of data.
• It might give a misleading picture of the spread of data.
1. RANGE
• Example:
1. Find the range in the following data.
31,26,15,43,19,10,12,37
Range = xm – xo 33 = 43 – 10
2. Find the range in the following F.D. (Ungrouped)
X 3 4 5 6 7 8
f5 5 = 8 –8 3 12 10 4 2
Range 5=8–3
3. Find the range in the following data.
X 10 - 20 20 - 30 30- 40 40 – 50 50 - 60
f 5 8 12 10 4
Range = 60 – 10 = 50
MEAN (OR AVERAGE) DEVIATION
• x x
or x mean
for
M .D. N
n
ungroup.
f x x
f x mean
f f
• or for grouped.
MEAN (OR AVERAGE) DEVIATION
• Example:
1. Calculate mean deviation
ẋ=92/14=6.57
MEAN (OR AVERAGE) DEVIATION
• It is an absolute measure.
• Its relative measure is coefficient of M.D.
M .D. M .D.
• Coefficient of M.D. = or
mean median
( x x ) 2
•
2
= n for ungrouped
THE VARIANCE AND
STANDARD DEVIATION
• 2
= f ( x x)
for grouped
2
f
• It is an absolute measure;
• It is relative measure is coefficient of
variation.
S .D.
• C .V
100 C .
V .
x
100
• Shortcut method
2
2
x2 x
N N
2
2
f .x 2
fx
f f
VARIANCE AND STANDARD
• Example:
DEVIATION
1. Calculate Variance and SD from the FD (Ungrouped Data).
2
f .x 2
fx
N N
VARIANCE AND STANDARD
DEVIATION
• Exp: Calculate Variance and Standard deviation from the FD (Grouped Data).
06:48 PM 14
Relative Measures of Variation
X Largest X Smallest
Coefficient of Range
X Largest X Smallest
Q3 Q1
Coefficient of Quartile Deviation
Q3 Q1
MD
Coefficient of Mean Deviation
Mean
06:48 PM 15
Coefficient of Variation (CV)
S
CV 100%
X
Stock B:
Average price last year = $100
Standard deviation = $5 but stock B is
less variable
relative to its
S $5 price
CVB 100% 100% 5%
X $100
06:48 PM 17
Five Number Summary
The five number summary of a data set consists of the
minimum value, the first quartile, the second quartile, the
third quartile and the maximum value written in that order:
Min, Q1, Q2, Q3, Max.
06:48 PM 18
Five Number Summary
The weekly TV viewing times (in hours).
25 41 27 32 43 66 35 31 15 5
34 26 32 38 16 30 38 30 20 21
5 15 16 20 21 25 26 27 30 30
31 32 32 34 35 37 38 41 43 66
06:48 PM 19
Five Number Summary
1(20 1)
LOCATION of Q1 ; th obs. in the data 5.25th obs.
4
VALUE of Q1 ; 5th obs. 0.25{6th obs. - 5th obs.} 21 0.25{25 - 21} 22.0 Hrs
2(20 1)
LOCATION of Q 2 ; th obs. in the data 10.50th obs.
4
VALUE of Q2 ;10th obs. 0.50{11th obs. - 10th obs.} 30 0.50{31- 30} 30.5 Hrs
3(20 1)
LOCATION of Q 3 ; th obs. in the data 15.75th obs.
4
VALUE of Q 3 ; 15th obs 0.75 {16th obs - 15th obs} 35 0.75{37 - 35} 36.5 Hrs
06:48 PM 21
Construction of Box-Plot
Max
Value
Q3
1. Start the box from Q1 and
end at Q3
Q2
2. Within the box draw a line
to represent Q2
3. Draw lower whisker to Min.
Value up to Q1 Q1
06:48 PM 22
70
Construction of Box-Plot 60
50
1. Q1=22.0 Q3=36.5
40
2. Q2=30.5
3. Minimum Value=5.0
4. Maximum Value=66.0 30
20
10
0
06:48 PM 23
70
Interpretation of Box-Plot 60
06:48 PM 25
Inner and Outer Fences
If Q1=22.0 Q2=30.5 Q3=36.5
Lower Inner Fence Q1 1.5IQR 0.25
Inner Fences :
Upper Inner Fence Q 3 1.5IQR 58.25
06:48 PM 26
80
Identification of the Outliers
70
outliers outlier
3. The values that lie outside outer 30
Female
06:48 PM Male 28
Measures of Skewness
A distribution in which the values equidistant from
the centre have equal frequencies is defined to be
symmetrical and any departure from symmetry is
called skewness.
06:48 PM 29
Measures of Skewness
A distribution is positively skewed, if the observations
tend to concentrate more at the lower end of the
possible values of the variable than the upper end. A
positively skewed frequency curve has a longer tail on
the right hand side
06:48 PM 32
Measures of Kurtosis
06:48 PM 33
Measures of Kurtosis
n X-X
4
Coefficient of Kurtosis=
2 2
X-X
06:48 PM 34