Variance and Standard Deviation
Variance and Standard Deviation
Deviation
Variance: a measure of how data
points differ from the mean
• Data Set 1: 3, 5, 7, 10, 10
Data Set 2: 7, 7, 7, 7, 7
But we know that the two data sets are not identical! The
variance shows how they are different.
( x X )
N
• Although this might seem reasonable, this expression
always equals 0, because the negative deviations about the
mean always cancel out the positive deviations about the
mean.
• We could just drop the negative signs, which is the same
mathematically as taking the absolute value, which is known
as the mean deviations.
• The concept of absolute value does not lend itself to the kind
of advanced mathematical manipulation necessary for the
development of inferential statistical formulas.
• The average of the squared deviations about the mean is
called the variance.
x X
2
x X
2
For sample variance
s
2
n 1
Score (
X X)
2
X X
X
1
3
2
5
3
7
4
10
5
10
Totals
35
1
3 3-7=-4
2
5 5-7=-2
3
7 7-7=0
4
10 10-7=3
5
10 10-7=3
Totals
35
Score (
X X)
2
X X
X
1
3 3-7=-4 16
2
5 5-7=-2 4
3
7 7-7=0 0
4
10 10-7=3 9
5
10 10-7=3 9
Totals
35 38
Score (
X X)
2
X X
X
1
3 3-7=-4 16
2
5 5-7=-2 4
3
7 7-7=0 0
4
10 10-7=3 9
5
10 10-7=3 9
Totals
35 38
Example 2
mean 23 23
median 22 27
range 10 22
1 28 5 25
2 22 -1 1
3 21 -2 4
4 26 3 9
5 18 -5 25
Totals 115 0 64
x X
2
N
Another formula
• Definitional formula for variance for data in a
frequency distribution
S 2
(X X ) 2
f
f
• Definitional formula for standard deviation for
data in a frequency distribution
S
( X X ) 2
f
f
The mean is 23
Myrna’s Score X f
X X ( X X)2 ( X X )2 x f
28 1
27 3
6 1
115 5
Myrna’s Score X f
X X ( X X)2 ( X X )2 x f
28 1 5
27 3 4
6 1 -17
115 5
Myrna’s Score X f
X X ( X X)2 ( X X )2 x f
28 1 5 25
27 3 4 16
6 1 -17 289
115 5
round-off rule – carry
one more decimal
Myrna’s Score X f ( X X)2 ( X X )2 x f
place than was
X X
present in the
original data
28 1 5 25 25
27 3 4 16 48
115 5 362