Lecture_2 Measures of Central Tendency and Variation
Lecture_2 Measures of Central Tendency and Variation
Dr Emmanuel
Measures of Dispersion
The median is suitable for data with extreme values and it can
used to give the general overview for a huge mass of data
whereby the computation of the mean might be tedious.
Given a set of n observations X1 , X2 , . . . , Xn , then
n
1X
X̄ = Xi
n
i=1
The median is suitable for data with extreme values and it can
used to give the general overview for a huge mass of data
whereby the computation of the mean might be tedious.
where
L is the Lower class of the model class,
fm is the frequency of the model class,
fb is the frequency of the class before the model class,
fa is the frequency of the class after the model class and
h is the class size.
(c) Mode: The computation of mode for a grouped data is done
using the formula
fm − fb
Mode = L + h
2fm − fb − fa
where
L is the Lower class of the model class,
fm is the frequency of the model class,
fb is the frequency of the class before the model class,
fa is the frequency of the class after the model class and
h is the class size.
Example:
The height (in inches) of 100 students at ARU were recorded as
follows:
Height 60 - 62 63 - 65 66 - 68 69 - 71 72 - 74
Frequency 5 18 42 27 8
Ungrouped Data
Example
Grouped Data
Example
Z-scores
The mean and standard deviation of a data set can be used to
calculate a z-score, which measures the distance between a
particular observation x and the mean x̄, measured in units of
standard deviation (s).
x − x̄
Z-score =
s
Skewness and Kurtosis
That is, data sets with high kurtosis tend to have heavy tails, or
outliers. Data sets with low kurtosis tend to have light tails, or
lack of outliers.