Measures of Central Tendency and Spread: Chapter 1, Section 2
Measures of Central Tendency and Spread: Chapter 1, Section 2
and Spread
Chapter 1, Section 2
Measures of Central Tendency
90
80
70
60
50
40
30
20
10
0
3 1 3 4 9 6 4 1 2 9 7 4
071 875 037 199 384 763 142 521 290 266 047 426
7 1 3 4 6 4 3 1 4 6 1 9
-2
.6 .96 .25 .54 164 873 582 291 987 . 69 405 113
-1 -1 -0 0. 0. 1. 2. 2. 3 4. 5.
or blue. 50
thousands
• Yellow is the 40
mode. 30
20
• The mode is the 10
120
100
80
60
40
20
0
very bad bad neutral good very good
No Mode
Category Frequency
1 51 70
2 51 60
3 66 50
4 62 40
5 65 30
6 57 20
10
7 47
0
8 43 1 2 3 4 5 6 7 8 9
9 64
• Although the third category is the
largest, it is not sufficiently
different to be called the mode.
Midrange
• The midrange is the average of the lowest
and highest value in the data set.
• This measure is not often used since it is
based strictly on the two extreme values in
the data.
Midrange Example
X
min 14
17
28
31
42 14 + 78
midrange = = 46
43 2
47
51
51
62
66
67
70
70
max 78
0
20
40
60
80
100
120
140
160
180
200
-6.33939635
-5.447617432
-4.555838513
-3.664059595
-2.772280676
-1.880501757
-0.988722839
-0.09694392
0.794834998
1.686613917
2.578392835
3.470171754
4.361950672
5.253729591
Same mean, but y varies more than x.
6.145508509
Measures of Variation
7.037287428
y
x
Three Measures of Variation
• While there are other measures, we will look at
only three:
– Variance
– Standard deviation
– Coefficient of variation
• Population mean and sample mean use an
identical formula for calculation.
• There is a minor difference in the formulas for
variation.
Population Variance
• The population variance, σ2, is
found using either of the
formulas to the right.
• The differences are squared to 2
(x ) 2
2
μ is the population mean. N
• Note that variance is always
positive if x can take on more
than one value.
Population Standard Deviation
• The standard deviation can be thought of as
the average amount we could expect the x’s
in the population to differ from the mean
value of the population.
• To get the standard deviation, simply take
the square root of the variance.
Sample Variance
• The sample variance, s2, is
found using either of the
formulas to the right.
• The differences are squared to s 2 ( x x ) 2
100s 100
CV (sample) or CV (population)
x
Coefficient of Variation Cont.
• Looking at the two
examples. We see that in A B
both cases the standard
deviation for B is twice CV 75% 120%
that of A. Example 1
• In the first example we
have almost twice the
relative variation in B.
CV 75% 4.6%
• In the second example, we Example 2
have a little over 16 times
as much variation in A.
Measures of Position
The dot on the left is at about -1, the dot on the right is at
approximately 0.8. But where are they relative to the rest
of the values in this distribution.
Quartiles, Percentiles and Other
Fractiles
• We will only consider the quartile, but the same
concept is often extended to percentages or other
fractions.
• The median is a good starting point for finding the
quartiles.
• Recall that to find the median, we wanted to locate
a point so that half of the data was smaller, and the
other half larger than that point.
Quartile
• For quartiles, we want to divide our data
into 4 equal pieces.
2 3 7 8 8 8 9 13 17 20 21 21
The red lines are 1.5 times the IQR. Starting from Q1 going
left, and starting from Q3 going right 1.5(IQR) we establish
limits. All numbers smaller on the left, and larger on the right
are outliers.
Example
Linear Transformations
• When changing units, e.g., feet to meters,
degrees F to degrees C, we employ a linear
transformation.
– New = a + b Old
• Measures of both center and spread will be
multiplied by “b”.
• Only measures of location are affected by
“a”.