Chapter 4
Chapter 4
Measures of Dispersion
Measures of Dispersion
By comparing two different data sets (for now, measured
in the same units, e.g. kg, km etc). By chance, it may
happened that the two data sets have the same means,
medians or modes.
Does it mean that the two data sets are the same or they
have the same features.
No.
here we need some extra insight into the data; as a first step,
we need to measure their respective dispersions or
variabilities about the center and then compare them.
What are Absolute Dispersion and Relative
Dispersion
1. Range,
2. Mid-range
3. Inter-quartile Range (also called the fourth-spread),
4. Semi-inter-quartile Range( or Quartile deviation)
5. Mean Deviation
6. Variance
7. Standard Deviation
1: Range
max−𝑣𝑎𝑙𝑢𝑒 + min−𝑣𝑎𝑙𝑢𝑒
mid-range =
2
𝑋𝑚 + 𝑋0
mid-range =
2
Inter-quartile Range
It is also called Semi-Inter-quartile range (SIQR) because it is just the half of IQR.
Co-efficient of Quartile Deviation
= 59
𝑥𝑖 −𝑥
M.D. (median)= 𝑛
= 16.8
Example
Find the MD and MedD for the following grouped data.
x : 14 35 45 55 56 65 87 89 92
f: 4 7 11 13 18 13 8 6 3
Solution: Again, first we need the mean and the median to calculate the necessary
columns. The mean is
Solution
𝑥 =58.7
𝑥 =56
𝑓𝑖 𝑥𝑖 − 𝑥 1182
M.D.= = = 14.2
𝑛 83
𝑓𝑖 𝑥𝑖 − 𝑥 1120
M.D. (median)= = = 13.5
𝑛 83
Variance
Variance is defined as:
The mean of the squared deviations of all the observations from the mean.
𝑥𝑖 −𝑥 2
S2 = 𝑛−1
for ungrouped data
𝑓𝑖 𝑥𝑖 −𝑥 2
S2 = 𝑛−1
for grouped data
𝑥𝑖 −𝑥 2
S2 = for ungrouped data
𝑛
𝑓𝑖 𝑥𝑖 −𝑥 2
S2 = for grouped data
𝑛
Standard deviation (SD)
𝑓𝑖 𝑥𝑖 −𝑥 2
S= for grouped data
𝑛−1
𝑥𝑖 −𝑥 2
S= 𝑛
for ungrouped data
𝑓𝑖 𝑥𝑖 −𝑥 2
S=
𝑛
Formulas for SD
xi 2 𝑥𝑖 2
S= 𝑛
-- 𝑛
for ungrouped data
fixi 2 𝑓𝑖𝑥𝑖 2
S= 𝑛
- 𝑛
for grouped data
A few important properties of
1. summation Σ
2. , mean ¯ x
3. variance S2.
Properties of summation
- sum of the deviation from mean will be equal to zero of the same data size
Properties of mean
for which deviation was taken.
- sum of square of deviation from mean is always minimum. (for variance
check also if divided by divisor 'n' or 'n-1'
-combine mean calculation. denoted 'Xc'
Properties of variance
Box plot
Stem-and-leaf displays and histograms convey rather general impressions
about a data set,
whereas a single summary such as the mean or standard deviation
focuses on just one aspect of the data.
In recent years, a pictorial summary called a boxplot has been used
successfully to describe several of a data set’s most prominent following
features,
1. center,
2. spread,
3. the extent and nature of any departure from symmetry
4. identification of “outliers,”
outliers are observations that lie unusually far from the main body of the data.
A Box Plot is the visual representation of the statistical five number summary of a
given data
set.
A Five Number Summary includes:
1. Minimum value
2. First Quartile
3. Median (Second Quartile)
4. Third Quartile
5. Maximum value
Box plot
Example
Co-efficient of variation(cv)
noise to ratio. where is it
applied and used in aerospace
Moments
QUESTION
Find the first four central moments for the following data.
14 35 45 55 55 56 56 65 87 89 92
Raw Moments
In some texts moments about arbitrary values are also calculated, and the
central moments are then calculated using some relationships. You can
also calculate them by first calculating moments about zero, called raw
moments, and then calculate central moments using those equations.
Those equations are presented as.
Skewness graph of skewness
positive skewness
negative skewness
lapto kurtic
meso kurtic
platy kurtic
Practice exercises:
Then solve questions 41-46, 48, 49 in exercise 1.4. Also, calculate the first for raw and
central moments for the above questions.