0% found this document useful (0 votes)

24 views

Measures of Central Tendency and Spread: Chapter 1, Section 2

Uploaded by

Khaleel Ur Rehman M

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views

Measures of Central Tendency and Spread: Chapter 1, Section 2

Uploaded by

Khaleel Ur Rehman M

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 36

Measures of Central Tendency

and Spread
Chapter 1, Section 2
Measures of Central Tendency
90

0
3 1 3 4 9 6 4 1 2 9 7 4
071 875 037 199 384 763 142 521 290 266 047 426
7 1 3 4 6 4 3 1 4 6 1 9
-2
.6 .96 .25 .54 164 873 582 291 987 . 69 405 113
-1 -1 -0 0. 0. 1. 2. 2. 3 4. 5.

Identical shapes, the black is centered to the right of the red.

The Motivation
• Measure of central tendency are used to
describe the typical member of a
population.
• Depending on the type of data, typical
could have a variety of “best” meanings.
• We will discuss four of these possible
choices.
4 Measures of Central Tendency
• Mean – the arithmetic average. This is used for continuous
data.
• Median – a value that splits the data into two halves, that
is, one half of the data is smaller than that number, the
other half larger. May be used for continuous or ordinal
data.
• Mode – this is the category that has the most data. As the
description implies it is used for categorical data.
• Midrange – not used as often as the other three, it is found
by taking the average of the lowest and highest number in
the data set. Also primarily used for continuous data.
Mean
• To find the mean, add all
of the values, then divide
by the number of values. 

x
Population
• The lower case, Greek N
letter mu is used for
population mean. 
x
x
Sample
• An “x” with a bar over n
it, read x-bar, is used for
sample mean.
Mean Example
listing X
1 14
2 17
3 31 x-bar
4 28 737/15 = 49.13333
5 42
6 43
7 51
8 51
9 66
10 70
11 67
12 70
13 78
14 62
n = 15 47
total 737
Median
• The median is a number chosen so that half of the
values in the data set are smaller than that number,
and the other half are larger.
• To find the median
– List the numbers in ascending order
– If there is a number in the middle (odd number of
values) that is the median
– If there is not a middle number (even number of values)
take the two in the middle, their average is the median
Median Example
listing X listing X
1 14 1 14
2 17 2 17
3 28 3 28
4 31 4 31
5 42 5 42
6 43 6 43
7 47 7 47
8 51 8 51 51+53
= 52
9 51 9 53 2
10 62 10 57
11 66 11 62
12 67 12 66
13 70 13 67
14 70 14 70
15 78 15 70
16 78
Mode
• The mode is simply the category or value which
occurs the most in a data set.
• If a category has radically more than the others, it
is a mode.
• Generally speaking we do not consider more than
two modes in a data set.
• No clear guideline exists for deciding how many
more entries a category must have than the others
to constitute a mode.
Obvious Example
Beach Ball Production
• There is
80
obviously more 70

yellow than red 60

or blue. 50

thousands
• Yellow is the 40

mode. 30

20
• The mode is the 10

class, not the 0

frequency. blue red yellow

Bimodal
Geometry Scores For TASP

120

100

0
very bad bad neutral good very good
No Mode
Category Frequency
1 51 70
2 51 60

3 66 50

4 62 40

5 65 30

6 57 20

10
7 47
0
8 43 1 2 3 4 5 6 7 8 9

9 64
• Although the third category is the
largest, it is not sufficiently
different to be called the mode.
Midrange
• The midrange is the average of the lowest
and highest value in the data set.
• This measure is not often used since it is
based strictly on the two extreme values in
the data.
Midrange Example
X
min 14
17
28
31
42 14 + 78
midrange = = 46
43 2
47
51
51
62
66
67
70
70
max 78
0
20
40
60
80
100
120
140
160
180
200
-6.33939635
-5.447617432
-4.555838513
-3.664059595
-2.772280676
-1.880501757
-0.988722839
-0.09694392
0.794834998
1.686613917
2.578392835
3.470171754
4.361950672
5.253729591
Same mean, but y varies more than x.
6.145508509
Measures of Variation

7.037287428
y
x
Three Measures of Variation
• While there are other measures, we will look at
only three:
– Variance
– Standard deviation
– Coefficient of variation
• Population mean and sample mean use an
identical formula for calculation.
• There is a minor difference in the formulas for
variation.
Population Variance
• The population variance, σ2, is
found using either of the
formulas to the right.
• The differences are squared to  2

 (x  ) 2

prevent the sum from being N

zero for all cases.
• N is the size of the population, 2   x 2

 2
μ is the population mean. N
• Note that variance is always
positive if x can take on more
than one value.
Population Standard Deviation
• The standard deviation can be thought of as
the average amount we could expect the x’s
in the population to differ from the mean
value of the population.
• To get the standard deviation, simply take
the square root of the variance.
Sample Variance
• The sample variance, s2, is
found using either of the
formulas to the right.
• The differences are squared to s  2  ( x  x ) 2

prevent the sum from being n 1

x   x
zero for all cases. 2
• The sample size is n, x-bar is
s 2

 
2

the sample mean. n 1 n(n  1)

• Note that n-1 is used rather than
n. This adjustment prevents bias
in the estimate.
Sample Standard Deviation
• Just like the standard deviation of a
population, to find the standard deviation of
a sample, take the square root of the sample
variance.
Coefficient of Variation
• The measures discussed so far are primarily
useful when comparing members from the
same population, or comparing similar
populations.
• When looking at two or more dissimilar
populations, it doesn’t make any more sense
to compare standard deviations than it does
to compare means.
Coefficient of Variation Cont.
• Example 1: Weight loss
programs A and B. A B
• Two different programs Mean 20 25
with the same goal and
(weight
target population.
loss per
• While program B averages
more weight loss, it also
month)
has less consistent results. Standard 15 30
deviation
Coefficient of Variation Cont.
• Example 2: Weight loss
program A and tax refund B. A B
• Two different programs with Mean 20 650
different goals and different
target populations.
• We know that average Standard 15 30
weight loss and average tax deviation
refund are not comparable.
Are the standard deviations
comparable?
Coefficient of Variation Cont.
• In the last example we can see an argument that
standard deviation does not give the complete picture.
• The coefficient of variation addresses this issue by
establishing a ratio of the standard deviation to the
mean. This ratio is expressed as a percentage.

100s 100
CV  (sample) or CV  (population)
x 
Coefficient of Variation Cont.
• Looking at the two
examples. We see that in A B
both cases the standard
deviation for B is twice CV 75% 120%
that of A. Example 1
• In the first example we
have almost twice the
relative variation in B.
CV 75% 4.6%
• In the second example, we Example 2
have a little over 16 times
as much variation in A.
Measures of Position

The dot on the left is at about -1, the dot on the right is at
approximately 0.8. But where are they relative to the rest
of the values in this distribution.
Quartiles, Percentiles and Other
Fractiles
• We will only consider the quartile, but the same
concept is often extended to percentages or other
fractions.
• The median is a good starting point for finding the
quartiles.
• Recall that to find the median, we wanted to locate
a point so that half of the data was smaller, and the
other half larger than that point.
Quartile
• For quartiles, we want to divide our data
into 4 equal pieces.

Suppose we had the following data set (already in order)

2 3 7 8 8 8 9 13 17 20 21 21

Choosing the numbers 7.5, 8.5, and 18.5 as markers would

Divide the data into 4 groups, each with three elements.
These numbers would be the three quartiles for this data set.
Quartiles Continued
• Conceptually, this is easy, simply find the median, then
treat the left hand side as if it were a data set, and find its
median; then do the same to the right hand side.
• This is not always simple. Consider the following data set.
• 3333356888889
• The first difficulty is that the data set does not divide
nicely.
• Using the rules for finding a median, we would get
quartiles of 3, 6 and 8.
• The second difficulty is how many of the 3’s are in the
first quartile, and how many in the second?
Quartiles Continued
• For this course, let’s pretend that this is not
an issue.
• I will give you the quartiles.
• I will not ask how many are in a quartile.
5 Number Summary
• The five number summary is the minimum value, the three quartiles and
the maximum value.
• This may be represented graphically with a box and whisker plot.
Outliers
• Outliers are values in the data set which are either
suspiciously large or small.
• Such values may be the result of an error, the
researcher measures incorrectly or maybe the
results are typed incorrectly.
• Outliers may be good data. There is always the
chance that you have one basketball player in a set
of ordinary people.
• The seven foot height is not an error, but it is still
unusually large.
Interquartile Range
• One method for identifying these outliers,
involves the use of quartiles.
• The interquartile range (IQR) is Q3 – Q1.
• All numbers less than Q1 – 1.5(IQR) are
probably too small.
• All numbers greater than Q3 + 1.5(IQR) are
probably too large.
Using IQR to Find Outliers

The red lines are 1.5 times the IQR. Starting from Q1 going
left, and starting from Q3 going right 1.5(IQR) we establish
limits. All numbers smaller on the left, and larger on the right
are outliers.
Example
Linear Transformations
• When changing units, e.g., feet to meters,
degrees F to degrees C, we employ a linear
transformation.
– New = a + b Old
• Measures of both center and spread will be
multiplied by “b”.
• Only measures of location are affected by
“a”.

Measures of Location and VARIATION For 1 Variable
No ratings yet
Measures of Location and VARIATION For 1 Variable
44 pages
Ch 2 Lecture Notes
No ratings yet
Ch 2 Lecture Notes
12 pages
Stat 1101 4 7
No ratings yet
Stat 1101 4 7
18 pages
Click To Add Text Dr. Cemre Erciyes
No ratings yet
Click To Add Text Dr. Cemre Erciyes
69 pages
2) SummarizationOfData Mean Median Mod SD CV
No ratings yet
2) SummarizationOfData Mean Median Mod SD CV
24 pages
2.data Description
No ratings yet
2.data Description
57 pages
2 Measures of Location - Dispersion
No ratings yet
2 Measures of Location - Dispersion
61 pages
03 Numerical Description
No ratings yet
03 Numerical Description
52 pages
Statistical Data
No ratings yet
Statistical Data
41 pages
Descriptive Statistics 1
No ratings yet
Descriptive Statistics 1
63 pages
Chapt3 Overheads
No ratings yet
Chapt3 Overheads
8 pages
UKP6053 L3 Descriptive Statsitcs
100% (1)
UKP6053 L3 Descriptive Statsitcs
92 pages
Lecture 1, BAS115
No ratings yet
Lecture 1, BAS115
57 pages
Numerical Descriptive Measures 1
No ratings yet
Numerical Descriptive Measures 1
39 pages
Bus. Statt. Chapter-Lecture 2+3
No ratings yet
Bus. Statt. Chapter-Lecture 2+3
43 pages
Analysis of Statistcal Data
No ratings yet
Analysis of Statistcal Data
46 pages
الفصل الثالث مقدمة في الاحصاء.pdf
No ratings yet
الفصل الثالث مقدمة في الاحصاء.pdf
69 pages
Lecture 3 Summarizing Data Measures of Central Location and Sampling
No ratings yet
Lecture 3 Summarizing Data Measures of Central Location and Sampling
53 pages
Unit - 2 Biostatistics
No ratings yet
Unit - 2 Biostatistics
9 pages
Lesson 1
No ratings yet
Lesson 1
37 pages
Lecture_04
No ratings yet
Lecture_04
88 pages
Lecture 2-3 Data Analysis Location & Dispression
No ratings yet
Lecture 2-3 Data Analysis Location & Dispression
43 pages
Week 6+7+8
No ratings yet
Week 6+7+8
37 pages
Lec006 - Measures of Dispersion
No ratings yet
Lec006 - Measures of Dispersion
42 pages
Measusres of Locations
No ratings yet
Measusres of Locations
52 pages
FDSA unit 2
No ratings yet
FDSA unit 2
44 pages
Week 11 Measure of Center and Variability
No ratings yet
Week 11 Measure of Center and Variability
35 pages
MATH& 146 Lesson 8: Averages and Variation
No ratings yet
MATH& 146 Lesson 8: Averages and Variation
30 pages
Averages and Variation Eda
No ratings yet
Averages and Variation Eda
29 pages
St130: Basic Statistics Week 3: Lecture: School of Computing Information and Mathematical Sciences
No ratings yet
St130: Basic Statistics Week 3: Lecture: School of Computing Information and Mathematical Sciences
62 pages
2Descriptives
No ratings yet
2Descriptives
43 pages
Statistics I Chapter 2: Univariate Data Analysis
No ratings yet
Statistics I Chapter 2: Univariate Data Analysis
27 pages
Lec1 Statistics
No ratings yet
Lec1 Statistics
30 pages
CH 3 - Luc
No ratings yet
CH 3 - Luc
76 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
59 pages
Lecture 3 Numerical Measures of Data
No ratings yet
Lecture 3 Numerical Measures of Data
36 pages
3jane - Data Description Finala4
No ratings yet
3jane - Data Description Finala4
14 pages
Chapter 3 - Data Presentation
No ratings yet
Chapter 3 - Data Presentation
40 pages
Numerical Measures: Bf1206-Business Mathematics SEMESTER 2 - 2016/2017
No ratings yet
Numerical Measures: Bf1206-Business Mathematics SEMESTER 2 - 2016/2017
25 pages
Ken Black QA ch03
0% (1)
Ken Black QA ch03
61 pages
Descriptive Statistics II
No ratings yet
Descriptive Statistics II
24 pages
Introduction To Statistics PDF
No ratings yet
Introduction To Statistics PDF
32 pages
Lecture 3 - Stat HO
No ratings yet
Lecture 3 - Stat HO
21 pages
Introductory of Statistics - Chapter 3
No ratings yet
Introductory of Statistics - Chapter 3
7 pages
Topic: Measures of Central Tendency and Measures of Dispersion
No ratings yet
Topic: Measures of Central Tendency and Measures of Dispersion
45 pages
Measure of Central Tendency and Variability
No ratings yet
Measure of Central Tendency and Variability
73 pages
Statistics Unit1 Notes.docx
No ratings yet
Statistics Unit1 Notes.docx
11 pages
chapter 4
No ratings yet
chapter 4
11 pages
Topic 1 Describing Data II
No ratings yet
Topic 1 Describing Data II
68 pages
المحاضرة رقم 3
No ratings yet
المحاضرة رقم 3
44 pages
Chapter 3
No ratings yet
Chapter 3
28 pages
Measures of Central Tendency
100% (15)
Measures of Central Tendency
15 pages
Ch3 Numerically Summarizing Data
No ratings yet
Ch3 Numerically Summarizing Data
35 pages
Math in The Modern World Stat Lecture
No ratings yet
Math in The Modern World Stat Lecture
3 pages
Discriptive Statistics
No ratings yet
Discriptive Statistics
50 pages
Lecture Slides - Capítulo 02
No ratings yet
Lecture Slides - Capítulo 02
21 pages
2a. Describing Variables with Numbers
No ratings yet
2a. Describing Variables with Numbers
30 pages
Chapter Four: Numerical Descriptive Techniques
No ratings yet
Chapter Four: Numerical Descriptive Techniques
65 pages
More Minute Math Drills, Grades 3 - 6: Multiplication and Division
From Everand
More Minute Math Drills, Grades 3 - 6: Multiplication and Division
Carson Dellosa Education
5/5 (1)
Summer Bridge Math, Grades 1 - 2
From Everand
Summer Bridge Math, Grades 1 - 2
Summer Bridge Activities
No ratings yet
Mathematical Physics Unit - 6 Laplace Transform and Application
No ratings yet
Mathematical Physics Unit - 6 Laplace Transform and Application
55 pages
Chee 331 Notes On Fluid Ization
No ratings yet
Chee 331 Notes On Fluid Ization
21 pages
Intro to FEA Notes Zurich
No ratings yet
Intro to FEA Notes Zurich
208 pages
Discrete Mathematics - Recurrence Relation
No ratings yet
Discrete Mathematics - Recurrence Relation
10 pages
CNC & Machining Centers: Modul 12 MK. CAD/CAM
No ratings yet
CNC & Machining Centers: Modul 12 MK. CAD/CAM
103 pages
Sample Exam Questions Stats1a
No ratings yet
Sample Exam Questions Stats1a
14 pages
Grade 7 Science Review Quiz
No ratings yet
Grade 7 Science Review Quiz
3 pages
Computer Class VII- Ch.1
No ratings yet
Computer Class VII- Ch.1
6 pages
Class 9 B
No ratings yet
Class 9 B
2 pages
CB Timing TEST: Test Close Trip Coil - 1 Trip Coil - 2 Close-Open-1 (C-O1) Close-Open-2 (C-O2)
No ratings yet
CB Timing TEST: Test Close Trip Coil - 1 Trip Coil - 2 Close-Open-1 (C-O1) Close-Open-2 (C-O2)
2 pages
Or Simplex
No ratings yet
Or Simplex
13 pages
MC1648 DataSheet
100% (1)
MC1648 DataSheet
11 pages
FloppyDriveInfo 1349101164
No ratings yet
FloppyDriveInfo 1349101164
27 pages
2 Comparative Study of DFIG Power Control Using Stator
No ratings yet
2 Comparative Study of DFIG Power Control Using Stator
8 pages
Material Data:: Foundation For Pipe Support
100% (1)
Material Data:: Foundation For Pipe Support
8 pages
Crows Foot Notation
100% (1)
Crows Foot Notation
6 pages
Web Server COnfiguration
No ratings yet
Web Server COnfiguration
3 pages
Hyper Terminal
No ratings yet
Hyper Terminal
22 pages
PDF Multimedia environmental models the fugacity approach Third Edition Mackay download
100% (3)
PDF Multimedia environmental models the fugacity approach Third Edition Mackay download
55 pages
Xna Multi Threading
No ratings yet
Xna Multi Threading
36 pages
Rheem Ra2060ajvcb Article 1405358021530 en Ss
No ratings yet
Rheem Ra2060ajvcb Article 1405358021530 en Ss
28 pages
Superabsorbent Polymers and Superabsorbent Polymer Composites
No ratings yet
Superabsorbent Polymers and Superabsorbent Polymer Composites
5 pages
Catia Multicax Installation Guide
No ratings yet
Catia Multicax Installation Guide
42 pages
Moloboco RSH 632 Infographic 2
No ratings yet
Moloboco RSH 632 Infographic 2
1 page
076bct041 AI Lab5
No ratings yet
076bct041 AI Lab5
5 pages
Chapter 2-DATABASE SYSTEM Architecture
No ratings yet
Chapter 2-DATABASE SYSTEM Architecture
52 pages
DUO-6046K Fisa Tehnica EN
No ratings yet
DUO-6046K Fisa Tehnica EN
1 page
Arnaud Du Chene: Mastercard Paypass Vendor Product - Letter of Approval
No ratings yet
Arnaud Du Chene: Mastercard Paypass Vendor Product - Letter of Approval
2 pages

Measures of Central Tendency and Spread: Chapter 1, Section 2

Uploaded by

Measures of Central Tendency and Spread: Chapter 1, Section 2

Uploaded by

Measures of Central Tendency

Identical shapes, the black is centered to the right of the red.

yellow than red 60

class, not the 0

frequency. blue red yellow

prevent the sum from being N

prevent the sum from being n 1

the sample mean. n 1 n(n  1)

Suppose we had the following data set (already in order)

Choosing the numbers 7.5, 8.5, and 18.5 as markers would

You might also like