Chapter 4

1. This document discusses various measures of dispersion used to quantify how spread out or clustered data values are around a central point. 2. Absolute measures of dispersion like range, interquartile range, and standard deviation have units, while relative measures like coefficient of variation are unit-less. 3. Common measures include range, interquartile range, mean deviation, variance, and standard deviation. Boxplots provide a visual representation of a data set's spread and outliers.

Uploaded by

Javeria Naseem

Chapter 4

Measures of Dispersion
Suppose we compare two different data sets (for now, measured
in the same units, e.g. kg or km). By chance, it may
happen that the two data sets have the same mean,
median or mode.
Does this mean that the two data sets are the same, or that they
have the same features?
No.
Here we need some extra insight into the data: as a first step,
we need to measure their respective dispersions (or
variabilities) about the center and then compare them.
What are Absolute Dispersion and Relative Dispersion?

• Absolute and relative dispersion are two different ways to measure
the spread of a data set. They are used extensively in biological
statistics, as biological phenomena almost always show some
variation and spread.
• The easiest way to tell relative dispersion from absolute
dispersion is to check whether the statistic involves units. Absolute
measures always have units, while relative measures do not.
Most commonly used absolute measures of dispersion

1. Range
2. Mid-range
3. Inter-quartile Range (also called the fourth spread)
4. Semi-inter-quartile Range (or Quartile Deviation)
5. Mean Deviation
6. Variance
7. Standard Deviation
1: Range

• The range R is defined as the difference between the largest and
smallest observations in a set of data.
Symbolically, it is given by the relation
$$R = X_m - X_0$$
where
$X_m$ stands for the largest observation and
$X_0$ stands for the smallest observation.
Mid-range

• It is just the average of the two extreme values, i.e.
$$\text{mid-range} = \frac{\text{max value} + \text{min value}}{2} = \frac{X_m + X_0}{2}$$
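The two definitions above can be checked with a few lines of Python. This is only a minimal sketch; the variable names are illustrative, and the data values are the eleven observations used in the mean deviation example of this chapter.

```python
# Range and mid-range of a small data set.
data = [65, 55, 89, 56, 35, 14, 56, 55, 87, 45, 92]

x_max = max(data)  # X_m, the largest observation
x_min = min(data)  # X_0, the smallest observation

data_range = x_max - x_min       # R = X_m - X_0
mid_range = (x_max + x_min) / 2  # (X_m + X_0) / 2

print(f"range = {data_range}, mid-range = {mid_range}")
```

Both quantities depend only on the two extreme observations, so a single unusual value changes them completely.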
Inter-quartile Range

• The interquartile range is a measure of spread, defined as
the difference between the third and first quartiles.
• It is denoted by IQR, and symbolically
$$\text{IQR} = Q_3 - Q_1$$
Quartile Deviation

• The quartile deviation is a measure of spread. It is denoted by Q.D.,
and symbolically
$$\text{Q.D.} = \frac{Q_3 - Q_1}{2}$$
It is also called the semi-inter-quartile range (SIQR) because it is just half of the IQR.
Co-efficient of Quartile Deviation

• The pure measure (free of units of measurement) is the
co-efficient of quartile deviation.
• It is defined as
$$\text{Co-efficient of Quartile Deviation} = \frac{Q_3 - Q_1}{Q_3 + Q_1}$$
• This measure is free of measurement units and can be used
to compare two or more data sets with different units of
measurement.
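As a small illustration of the three quartile-based measures, the sketch below computes them from given quartiles. The function name `quartile_measures` and the input values Q1 = 72.5 and Q3 = 96.5 are assumptions chosen for the example; how the quartiles themselves are obtained depends on the convention in use.

```python
def quartile_measures(q1, q3):
    """Return (IQR, Q.D., co-efficient of Q.D.) for given quartiles.

    The co-efficient is undefined when q3 + q1 == 0.
    """
    iqr = q3 - q1            # IQR = Q3 - Q1
    qd = iqr / 2             # Q.D. = (Q3 - Q1) / 2
    coeff = iqr / (q3 + q1)  # unit-free: (Q3 - Q1) / (Q3 + Q1)
    return iqr, qd, coeff


iqr, qd, coeff = quartile_measures(72.5, 96.5)
print(f"IQR = {iqr}, Q.D. = {qd}, co-efficient = {coeff:.3f}")
```

The first two results carry the units of the data, while the co-efficient is a pure number.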
Mean Deviation:

• Mean (or median) deviation (MD), or mean absolute deviation
(MAD), is also a measure of dispersion, defined as
the average of the absolute differences (deviations) between the
data values and the data center (usually the mean or the median).
• Mathematically, using the mean $\bar{x}$ as the data center:

Mean deviation from mean

For ungrouped data:
$$\text{M.D.} = \frac{\sum |x_i - \bar{x}|}{n}$$

For grouped data:
$$\text{M.D.} = \frac{\sum f_i |x_i - \bar{x}|}{n}, \qquad n = \sum f_i$$

Mean deviation from median (with $\tilde{x}$ the median):

For ungrouped data:
$$\text{M.D. (median)} = \frac{\sum |x_i - \tilde{x}|}{n}$$

For grouped data:
$$\text{M.D. (median)} = \frac{\sum f_i |x_i - \tilde{x}|}{n}$$
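Example
• Find the MD and MedD (mean deviation from the mean and from the median) for the following simple data.
  65 55 89 56 35 14 56 55 87 45 92
Solution:

• Let us denote the data by X.
• What we need first are the mean and the median. The mean is
$$\bar{x} = \frac{\sum x_i}{n} = \frac{65 + 55 + \dots + 92}{11} = \frac{649}{11} = 59$$
• Since n is odd, the median is just the middle observation of the
ordered data,
  14 35 45 55 55 56 56 65 87 89 92
hence the median is $\tilde{x} = 56$.
• $$\text{M.D.} = \frac{\sum |x_i - \bar{x}|}{n} = \frac{194}{11} \approx 17.6$$
• $$\text{M.D. (median)} = \frac{\sum |x_i - \tilde{x}|}{n} = \frac{185}{11} \approx 16.8$$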
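The arithmetic of this example can be reproduced with a short script. This is a sketch using Python's standard `statistics` module; the helper name `mean_deviation` is an assumption introduced here, not something taken from the slides.

```python
from statistics import mean, median

data = [65, 55, 89, 56, 35, 14, 56, 55, 87, 45, 92]


def mean_deviation(values, center):
    """Average absolute deviation of the values from the given center."""
    return sum(abs(x - center) for x in values) / len(values)


x_bar = mean(data)    # data center 1: the mean
x_med = median(data)  # data center 2: the median

md_mean = mean_deviation(data, x_bar)
md_median = mean_deviation(data, x_med)

print(f"mean = {x_bar}, median = {x_med}")
print(f"M.D. = {md_mean:.1f}, M.D. (median) = {md_median:.1f}")
```

The deviation taken from the median comes out smaller than the one taken from the mean, which is the general pattern: the median minimizes the sum of absolute deviations.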
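Example
• Find the MD and MedD for the following grouped data.

x: 14 35 45 55 56 65 87 89 92
f:  4  7 11 13 18 13  8  6  3

Solution
• Again, first we need the mean and the median to calculate the necessary
columns. With $n = \sum f_i = 83$, they are
$$\bar{x} = \frac{\sum f_i x_i}{n} = \frac{4870}{83} \approx 58.7, \qquad \tilde{x} = 56$$
• $$\text{M.D.} = \frac{\sum f_i |x_i - \bar{x}|}{n} = \frac{1182}{83} \approx 14.2$$
• $$\text{M.D. (median)} = \frac{\sum f_i |x_i - \tilde{x}|}{n} = \frac{1120}{83} \approx 13.5$$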
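The grouped calculation can be sketched the same way. The code below is an illustration under one simplifying assumption: the median is found by expanding the frequency table back into individual observations, which is convenient for a small discrete table but not how one would treat class intervals. The helper name `grouped_md` is hypothetical.

```python
from statistics import median

x = [14, 35, 45, 55, 56, 65, 87, 89, 92]  # distinct values
f = [4, 7, 11, 13, 18, 13, 8, 6, 3]       # their frequencies

n = sum(f)                                          # total number of observations
x_bar = sum(fi * xi for xi, fi in zip(x, f)) / n    # weighted mean

# Expand the table into raw observations to locate the median.
expanded = [xi for xi, fi in zip(x, f) for _ in range(fi)]
x_med = median(expanded)


def grouped_md(center):
    """Frequency-weighted average absolute deviation from the center."""
    return sum(fi * abs(xi - center) for xi, fi in zip(x, f)) / n


md_mean = grouped_md(x_bar)
md_median = grouped_md(x_med)

print(f"n = {n}, mean = {x_bar:.1f}, median = {x_med}")
print(f"M.D. = {md_mean:.1f}, M.D. (median) = {md_median:.1f}")
```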
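Variance
• Variance is defined as
the mean of the squared deviations of all the observations from the mean.

The population variance is denoted by $\sigma^2$;
the sample variance is denoted by $S^2$ or $\hat{\sigma}^2$.

Note: dividing by $n - 1$ makes $S^2$ an unbiased estimator of $\sigma^2$.
If $S^2$ is calculated for every possible sample, the mean of those values
equals $\sigma^2$, i.e. $E(S^2) = \sigma^2$.

Mathematically,
• For small samples (n ≤ 30)
$$S^2 = \frac{\sum (x_i - \bar{x})^2}{n - 1} \quad \text{for ungrouped data}$$
$$S^2 = \frac{\sum f_i (x_i - \bar{x})^2}{n - 1} \quad \text{for grouped data}$$
• For large samples (n > 30)
$$S^2 = \frac{\sum (x_i - \bar{x})^2}{n} \quad \text{for ungrouped data}$$
$$S^2 = \frac{\sum f_i (x_i - \bar{x})^2}{n} \quad \text{for grouped data}$$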
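To make the two divisors concrete, the sketch below computes the variance of the eleven observations from the mean deviation example with both `n - 1` and `n`, and cross-checks the results against `statistics.variance` and `statistics.pvariance` from the Python standard library. Reusing that data set here is a choice made for illustration.

```python
from statistics import pvariance, variance

data = [65, 55, 89, 56, 35, 14, 56, 55, 87, 45, 92]

n = len(data)
x_bar = sum(data) / n
sum_sq_dev = sum((x - x_bar) ** 2 for x in data)  # sum of squared deviations

s2_n_minus_1 = sum_sq_dev / (n - 1)  # divisor n - 1 (small-sample formula)
s2_n = sum_sq_dev / n                # divisor n (large-sample formula)

print(f"divisor n-1: {s2_n_minus_1:.2f}  (library: {variance(data):.2f})")
print(f"divisor n:   {s2_n:.2f}  (library: {pvariance(data):.2f})")
```

The `n - 1` version is always the larger of the two, and the gap shrinks as `n` grows.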
Standard deviation (SD)

• It is a widely used measure of variability or diversity, used in statistics
and probability theory.
• It shows how much variation or "dispersion" exists from the average
(mean, or expected value).
• A low standard deviation indicates that the data points tend to be
very close to the mean,
• whereas a high standard deviation indicates that the data points are
spread out over a large range of values.
Formulas for SD

Note: if the observations are in cm, the calculated variance is in cm^2
because of the square in the variance formula. Taking the square root
brings cm^2 back to cm, so the S.D. is on the same scale as the data.

• For small samples (n ≤ 30)
$$S = \sqrt{\frac{\sum (x_i - \bar{x})^2}{n - 1}} \quad \text{for ungrouped data}$$
$$S = \sqrt{\frac{\sum f_i (x_i - \bar{x})^2}{n - 1}} \quad \text{for grouped data}$$
• For large samples (n > 30)
$$S = \sqrt{\frac{\sum (x_i - \bar{x})^2}{n}} \quad \text{for ungrouped data}$$
$$S = \sqrt{\frac{\sum f_i (x_i - \bar{x})^2}{n}} \quad \text{for grouped data}$$
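Formulas for SD

$$S = \sqrt{\frac{\sum x_i^2}{n} - \left(\frac{\sum x_i}{n}\right)^2} \quad \text{for ungrouped data}$$
$$S = \sqrt{\frac{\sum f_i x_i^2}{n} - \left(\frac{\sum f_i x_i}{n}\right)^2} \quad \text{for grouped data}$$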
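The shortcut form (mean of squares minus square of the mean) should agree with the definitional form that uses divisor n. The following sketch checks this numerically on the eleven observations from the mean deviation example; the variable names are illustrative only.

```python
import math

data = [65, 55, 89, 56, 35, 14, 56, 55, 87, 45, 92]
n = len(data)
x_bar = sum(data) / n

# Definitional form: root of the mean squared deviation (divisor n).
sd_definition = math.sqrt(sum((x - x_bar) ** 2 for x in data) / n)

# Shortcut form: root of (mean of squares minus square of the mean).
sd_shortcut = math.sqrt(sum(x * x for x in data) / n - (sum(data) / n) ** 2)

print(f"definition: {sd_definition:.4f}, shortcut: {sd_shortcut:.4f}")
```

In floating-point arithmetic the shortcut form can lose accuracy when the mean is large relative to the spread, so the definitional form is usually the safer one to code.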
A few important properties of

1. summation $\Sigma$
2. mean $\bar{x}$
3. variance $S^2$
Properties of summation

Properties of mean
- The sum of the deviations from the mean is equal to zero, for the same data set for which the deviations were taken.
- The sum of the squared deviations from the mean is always a minimum. (For the variance, also check whether it is divided by the divisor 'n' or 'n-1'.)
- Combined mean calculation, denoted $\bar{X}_c$.
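The properties of the mean listed above can be checked numerically. In this sketch the data are the eleven observations from the mean deviation example; the second group used for the combined mean (mean 50, size 9) is a made-up assumption, and the combined mean is taken to be the usual size-weighted average of the group means.

```python
data = [65, 55, 89, 56, 35, 14, 56, 55, 87, 45, 92]
n = len(data)
x_bar = sum(data) / n

# Property 1: deviations from the mean sum to zero.
sum_dev = sum(x - x_bar for x in data)


# Property 2: the sum of squared deviations is smallest about the mean.
def sum_sq_dev(a):
    """Sum of squared deviations of the data about an arbitrary point a."""
    return sum((x - a) ** 2 for x in data)


# Property 3: combined mean of several groups (weighted by group size).
def combined_mean(means, sizes):
    return sum(m * k for m, k in zip(means, sizes)) / sum(sizes)


x_c = combined_mean([x_bar, 50.0], [n, 9])  # second group is hypothetical

print(f"sum of deviations = {sum_dev}")
print(f"about mean: {sum_sq_dev(x_bar)}, about 56: {sum_sq_dev(56)}")
print(f"combined mean = {x_c}")
```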
Properties of variance
Box plot
• Stem-and-leaf displays and histograms convey rather general impressions
about a data set,
• whereas a single summary such as the mean or standard deviation
focuses on just one aspect of the data.
In recent years, a pictorial summary called a boxplot has been used
successfully to describe several of a data set's most prominent
features:
1. center,
2. spread,
3. the extent and nature of any departure from symmetry,
4. identification of "outliers".
Outliers are observations that lie unusually far from the main body of the data.
A box plot is the visual representation of the statistical five-number summary of a
given data set.
A five-number summary includes:
1. Minimum value
2. First Quartile
3. Median (Second Quartile)
4. Third Quartile
5. Maximum value
Box plot
Example

• Ultrasound was used to gather the accompanying corrosion data on
the thickness of the floor plate of an aboveground tank used to store
crude oil; each observation is the largest pit depth in the plate,
expressed in milli-in.
  40 52 55 60 70 75 85 85 90 90 92 94 94 95 98 100 115 125 125
• The five-number summary is as follows:
  smallest $x_i$ = 40
  lower fourth = 72.5
  median = 90
  upper fourth = 96.5
  largest $x_i$ = 125
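The five-number summary above can be reproduced with a short function. This sketch assumes the "fourths" convention in which, for an odd number of observations, the median is included in both halves of the ordered data; other quartile conventions can give slightly different values. The function name `five_number_summary` is illustrative.

```python
from statistics import median


def five_number_summary(values):
    """Return (min, lower fourth, median, upper fourth, max).

    For odd n the median is counted in both halves (fourths convention).
    """
    ordered = sorted(values)
    n = len(ordered)
    half = (n + 1) // 2          # size of each half
    lower = ordered[:half]       # smallest half
    upper = ordered[n - half:]   # largest half
    return (ordered[0], median(lower), median(ordered),
            median(upper), ordered[-1])


pit_depths = [40, 52, 55, 60, 70, 75, 85, 85, 90, 90,
              92, 94, 94, 95, 98, 100, 115, 125, 125]

summary = five_number_summary(pit_depths)
print(summary)
```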
Boxplots That Show Outliers
DEFINITION
• Any observation farther than $1.5 f_s$ from the closest fourth is an outlier,
where $f_s$ is the fourth spread (upper fourth minus lower fourth). An
outlier is extreme if it is more than $3 f_s$ from the nearest fourth, and it is mild
otherwise.
• Many inferential procedures are based on the assumption that the population
distribution is normal. Even a single extreme outlier in the sample warns the
investigator that such procedures may be unreliable, and the presence of several
mild outliers conveys the same message.
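The outlier rule can be sketched in code as follows. The helper names `fourths` and `classify_outliers` are assumptions for this illustration, the fourths are computed with the median counted in both halves when n is odd, and the second data set is invented purely to show one mild and one extreme outlier.

```python
from statistics import median


def fourths(values):
    """Lower and upper fourths (median counted in both halves for odd n)."""
    ordered = sorted(values)
    n = len(ordered)
    half = (n + 1) // 2
    return median(ordered[:half]), median(ordered[n - half:])


def classify_outliers(values):
    """Return (mild, extreme) outliers using the 1.5 fs and 3 fs rules."""
    lower_fourth, upper_fourth = fourths(values)
    fs = upper_fourth - lower_fourth  # fourth spread
    mild, extreme = [], []
    for x in values:
        # Distance from the closest fourth; zero inside the box.
        distance = max(lower_fourth - x, x - upper_fourth, 0)
        if distance > 3 * fs:
            extreme.append(x)
        elif distance > 1.5 * fs:
            mild.append(x)
    return mild, extreme


pit_depths = [40, 52, 55, 60, 70, 75, 85, 85, 90, 90,
              92, 94, 94, 95, 98, 100, 115, 125, 125]
made_up = [1, 2, 3, 4, 5, 6, 7, 8, 9, 20, 100]  # hypothetical data

print(classify_outliers(pit_depths))
print(classify_outliers(made_up))
```

Under this convention the corrosion data contain no outliers, since every observation lies within 1.5 fourth spreads of the box.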
Box plot
Comparative Boxplots

A comparative or side-by-side boxplot is a very
effective way of revealing similarities and
differences between two or more data sets
consisting of observations on the same variable.
Co-efficient of variation (CV)

$$\text{CV} = \frac{\text{SD}}{\text{mean}}$$

Note: the CV can be read as a noise-to-signal ratio. It is applied and used
in, for example, aerospace.
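Because the CV is a relative measure, it should not change when the unit of measurement changes, whereas the SD does. The sketch below illustrates this with made-up weights recorded in kilograms and converted to grams; the data and the helper name `coefficient_of_variation` are assumptions, and the sample standard deviation (`statistics.stdev`, divisor n - 1) is used.

```python
from statistics import mean, stdev

weights_kg = [52.0, 61.5, 58.2, 70.1, 66.4]  # hypothetical measurements
weights_g = [w * 1000 for w in weights_kg]   # same data in grams


def coefficient_of_variation(values):
    """CV = SD / mean; unit-free, defined only for a non-zero mean."""
    return stdev(values) / mean(values)


print(f"SD: {stdev(weights_kg):.3f} kg vs {stdev(weights_g):.1f} g")
print(f"CV: {coefficient_of_variation(weights_kg):.4f} "
      f"vs {coefficient_of_variation(weights_g):.4f}")
```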
Moments
QUESTION

• Find the first four central moments for the following data.
  14 35 45 55 55 56 56 65 87 89 92
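One way to approach this question is to compute the moments directly from their definition. The sketch below assumes the r-th central moment is the average of the r-th powers of the deviations from the mean, with divisor n; the function name `central_moment` is illustrative.

```python
data = [14, 35, 45, 55, 55, 56, 56, 65, 87, 89, 92]


def central_moment(values, r):
    """r-th central moment: mean of (x - x_bar)**r, with divisor n."""
    n = len(values)
    x_bar = sum(values) / n
    return sum((x - x_bar) ** r for x in values) / n


central = [central_moment(data, r) for r in range(1, 5)]

for r, m in enumerate(central, start=1):
    print(f"central moment {r}: {m:.4f}")
```

The first central moment is always zero, and the second is the variance with divisor n.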
Raw Moments

• In some texts, moments about arbitrary values are also calculated, and the
central moments are then obtained from them using certain relationships. You can
also calculate them by first calculating the moments about zero, called raw
moments, and then calculating the central moments using those equations.
Those equations are presented as.
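The equations from the slide are not reproduced in this text. As a hedged illustration, the sketch below uses the standard textbook relations between raw moments (about zero) and central moments, stated in the comments, and checks them against a direct calculation. The function names are assumptions, and whether these are exactly the relations the slide showed cannot be confirmed from the text.

```python
data = [14, 35, 45, 55, 55, 56, 56, 65, 87, 89, 92]


def raw_moment(values, r):
    """r-th raw moment: mean of x**r (moment about zero)."""
    return sum(x ** r for x in values) / len(values)


def central_moment(values, r):
    """r-th central moment computed directly, for comparison."""
    x_bar = sum(values) / len(values)
    return sum((x - x_bar) ** r for x in values) / len(values)


m1, m2, m3, m4 = (raw_moment(data, r) for r in range(1, 5))

# Standard relations (m_r = raw moment, mu_r = central moment):
#   mu_2 = m2 - m1^2
#   mu_3 = m3 - 3*m2*m1 + 2*m1^3
#   mu_4 = m4 - 4*m3*m1 + 6*m2*m1^2 - 3*m1^4
mu2 = m2 - m1 ** 2
mu3 = m3 - 3 * m2 * m1 + 2 * m1 ** 3
mu4 = m4 - 4 * m3 * m1 + 6 * m2 * m1 ** 2 - 3 * m1 ** 4

print(f"mu2 = {mu2:.4f}, mu3 = {mu3:.4f}, mu4 = {mu4:.4f}")
```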
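Skewness
[Figure: graphs of positive skewness and negative skewness]

If the skewness coefficient is near -3, the distribution is highly
negatively skewed; if it is near 3, it is highly positively skewed.
Skewness and kurtosis
[Figure: the three kurtosis shapes]

leptokurtic

mesokurtic

platykurtic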
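The notes do not say which coefficients are intended, so the following is only a sketch under stated assumptions: it uses the moment coefficients, skewness = m3 / m2^(3/2) and kurtosis = m4 / m2^2, and labels the kurtosis by comparing it with 3, the value for a normal curve. The -3 to 3 scale mentioned above is usually associated with Pearson's coefficient of skewness, which is a different statistic. The function names are illustrative.

```python
data = [14, 35, 45, 55, 55, 56, 56, 65, 87, 89, 92]


def central_moment(values, r):
    """r-th central moment with divisor n."""
    x_bar = sum(values) / len(values)
    return sum((x - x_bar) ** r for x in values) / len(values)


def kurtosis_label(b2):
    """Classify by comparing the moment kurtosis with 3 (normal curve)."""
    if b2 > 3:
        return "leptokurtic"
    if b2 < 3:
        return "platykurtic"
    return "mesokurtic"


m2 = central_moment(data, 2)
m3 = central_moment(data, 3)
m4 = central_moment(data, 4)

skewness = m3 / m2 ** 1.5  # negative: longer left tail
kurtosis = m4 / m2 ** 2    # compared with 3

print(f"skewness = {skewness:.4f}")
print(f"kurtosis = {kurtosis:.4f} ({kurtosis_label(kurtosis)})")
```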
Practice exercises:

• Read Section 1.4 in Chapter one of J L Devore's Modern Mathematical Statistics with
Applications.

Then solve questions 41-46, 48, 49 in exercise 1.4. Also, calculate the first four raw and
central moments for the above questions.
