Measures of Central Tendency
Measures of Central Tendency
2
Mean (Average)
Value about which observation tends to cluster
3
Example
Let us compute the mean (or average) of this
sample:
4
Example (cont.)
110 118 110 122 110 150 120
= n 6
5
Example (cont.)
110 118 110 122 110 150 120
= n 6
6
Example (cont.)
110 118 110 122 110 150 120
= n 6
7
Example 2.1 (cont.)
110 118 110 122 110 150 120
= n 6
8
Median
Median is the middle value of the measurement when they
are arranged in ascending or descending order.
Eg. Age of 5 students.
Unarranged – 1. 5. 3. 2. 4
Ascending order – 1, 2, 3, 4, 5
Median in this case is 3
if the middle observations are 2, then median is average of
central 2 values.
E.g..1, 2, 3, 4, 5, 6
Median = 3+4 / 2 = 3.5 years
9
Mode
observation occurring most frequently.
30, 32, 35, 37, 41, 45, 41
mode = 41 years
a data may have more then one mode
30, 32, 35, 37, 41, 45, 41, 32
mode = 32 & 41
10
Advantages and Disadvantages of each
measurement
Advantages Disadvantages
Mode 1. not affected by extreme 1. there may be more then one mode
values 2. does not represent all data
2. histograms can be obtained 3. not used in further statistical work.
3. easy to find & understand
11
Is arithmetic mean alone enough to interpret the
data??
NO, mean alone is misleading
What about the following 2 data sets
12
Measurement of Dispersion
Range
Difference in value b/w maximum & minimum value.
Range = X max – X min
Quick to compute
Not very useful. Does not take into consideration bulk
of values, only extreme values
13
Measurement of Variation
1. The Standard Deviation is a measure, which
describes how much individual measurements differ,
on the average, from the mean.
A large standard deviation shows that there is a wide
scatter of measured values around the mean
Small standard deviation shows that the individual
values are concentrated around the mean with little
variation among them.
Standard deviation: it is the square root of the variance
14
STANDARD DEVIATION (calculation)
( Xi X )
2
SD
n 1
Data: 1,2,3,4,5
15
Frequency table for calculating S.D.
5 5-3 (2) 2 =4
15 15/5=3 0 10 S2 = 10/4=2.5
Important:
All sample variances are computed this way!
We always take the mean;
subtract each score from the mean;
square the result;
sum the squares;
and divide by the (sample size-1)
17
Mean and S.D. from group data
Formula for Mean
=S f Xi / n
18
Example 1
1. Calculate the mean and standard deviation of the following
data set by making frequency table. (number of children in
different families)
2,2,5,3,0,1,3,2,3,4,0,3,4,5,7,3,2,4,1,0,5,8,6,5,4,2,4,4,7,6.
1. Also calculate X – S and X + S
2. Also calculate X – 2 S and X + 2 S
3. Also calculate X – 3 S and X + 3 S
What percentage of measurements fall within each interval?
19
0,0,0,1,1,2,2,2,2,2,3,3,3,3,3,4,4,4,4,4,4,5,5,5,5,6,6,7,7,8
X = # of f = frequency (f * x) (X – )2 f (X – )2
children
- 0.72
0 3 0 (0 – 3.5)2= 12.25 12.25 * 3 = 36.75
1 2 2 (1 – 3.5)2 = 6.25 6.25 * 2 = 12.5
2 1.39 5 10 (2 – 3.5)2 = 2.25 2.25 * 5 = 11.25
6
5.61 2 12 (6 – 3.5)2 = 6.25 6.25 * 2 = 12.5
7 2 14 (7 – 3.5)2 = 12.25 12.25 * 2 = 24.5
8 7.72 1 8 (8 – 3.5)2 = 20.25 20.25 * 1 = 20.25
Total = 30 105 129.5
20
Mean = 105/30 = 3.5
Mean and S.D. from group data (discreet data)
22
To calculate X – 2 S and X + 2 S
23
Outliers
In a normal distribution any measurement with a
Z-score of greater than 3 or less than -3 is called
an outlier.
24
Tutorial
Find the mean, median, mode, range, variance, and
standard deviation for the data? 8, 5, 3, 5, 2, 1.
Calculate the mean and standard deviation of the
following data set by making frequency table.
2,2,5,3,0,1,3,2,3,4,1,3,4,5,3,3,2,4,1,0,5,4,5,4,2,4,4,3,6.
Also calculate X – S and X + S
Also calculate X – 2 S and X + 2 S
25
Home work 3 Number of babies
born per day
(days)
30 days 73
MEAN S f X / n
Mean = 73/30 = 2.43 89.32 28
Mean and S.D. from group data
Formula for Mean
=S f Xi / n
= 73 / 30 = 2.43
Formula for variance
S2 = S f (Xi - )2 / n-1
= 89.32 / 30 – 1
= 89.32 / 29 = 3.08
Take square root of 3.08 to get S.D.
S.D. = 1.75
29