Summarizing Data-Measures of Dispersion

The document discusses various measures of dispersion used to numerically summarize data, including range, interquartile range, variance, standard deviation, and coefficient of variation. It also covers measures of distribution shape such as skewness, which indicates the symmetry of a distribution, and z-scores for determining outliers. The measures of dispersion and distribution shape help analyze the variability and properties of a data set beyond just measures of central tendency.

Uploaded by

Rakesh Choudhary

Summarizing Data: Measures of Dispersion

Prepared by:
Dr Vijendra Singh
Department of Informatics
School of Computer Science, UPES
Numerically Summarizing Data

Dispersion
Measures of Variability

• It is often desirable to consider measures of variability (dispersion), as well as measures of location.
• For example, in choosing between supplier A and supplier B, we might consider not only the average delivery time for each, but also the variability in delivery time for each.
Measures of Variability
• Range
• Interquartile Range
• Variance
• Standard Deviation
• Coefficient of Variation
Range

• The range of a data set is the difference between the largest and smallest data values.
• It is the simplest measure of variability.
• It is very sensitive to the smallest and largest data values.
Range

Range = largest value - smallest value

Range = 615 - 425 = 190

Interquartile Range

• The interquartile range of a data set is the difference between the third quartile and the first quartile.
• It is the range for the middle 50% of the data.
• It overcomes the sensitivity to extreme data values.
Interquartile Range
3rd Quartile (Q3) = 525
1st Quartile (Q1) = 445
Interquartile Range = Q3 - Q1 = 525 - 445 = 80
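
The two calculations above can be reproduced in a few lines of Python. This is a minimal sketch that assumes the data are the 70 apartment rents listed in the skewness example of these slides. The variable names are illustrative, and the "exclusive" quartile method of `statistics.quantiles` is only one of several quartile conventions, so other tools may report slightly different quartiles.

```python
import statistics

# Apartment rent sample (70 values) from the skewness example in these slides.
rents = [
    425, 430, 430, 435, 435, 435, 435, 435, 440, 440,
    440, 440, 440, 445, 445, 445, 445, 445, 450, 450,
    450, 450, 450, 450, 450, 460, 460, 460, 465, 465,
    465, 470, 470, 472, 475, 475, 475, 480, 480, 480,
    480, 485, 490, 490, 490, 500, 500, 500, 500, 510,
    510, 515, 525, 525, 525, 535, 549, 550, 570, 570,
    575, 575, 580, 590, 600, 600, 600, 600, 615, 615,
]

# Range: largest value minus smallest value.
data_range = max(rents) - min(rents)

# quantiles(..., n=4) returns the three cut points [Q1, Q2, Q3].
# The default method is "exclusive"; this choice is an assumption.
q1, q2, q3 = statistics.quantiles(rents, n=4)
iqr = q3 - q1

print(data_range)   # 190
print(q1, q3, iqr)  # 445.0 525.0 80.0
```

With a different interpolation rule (for example, the default of many array libraries) Q1 and Q3 can shift by a few units, so it is worth stating the convention when reporting an interquartile range.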

The variance is a measure of variability that utilizes all the data.

It is based on the difference between the value of each observation ($x_i$) and the mean ($\bar{x}$ for a sample, $\mu$ for a population).
Variance

The variance is the average of the squared differences between each data value and the mean.

The variance is computed as follows:

$$s^2 = \frac{\sum (x_i - \bar{x})^2}{n - 1} \quad \text{(for a sample)} \qquad \sigma^2 = \frac{\sum (x_i - \mu)^2}{N} \quad \text{(for a population)}$$
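
As a rough illustration of the two divisors, the sketch below computes the variance of the apartment rent sample from these slides by hand. The variable names are illustrative, and treating the same 70 values as a full population in the second calculation is purely an assumption made to show the difference between dividing by n - 1 and by N.

```python
# Apartment rent sample (70 values) from the skewness example in these slides.
rents = [
    425, 430, 430, 435, 435, 435, 435, 435, 440, 440,
    440, 440, 440, 445, 445, 445, 445, 445, 450, 450,
    450, 450, 450, 450, 450, 460, 460, 460, 465, 465,
    465, 470, 470, 472, 475, 475, 475, 480, 480, 480,
    480, 485, 490, 490, 490, 500, 500, 500, 500, 510,
    510, 515, 525, 525, 525, 535, 549, 550, 570, 570,
    575, 575, 580, 590, 600, 600, 600, 600, 615, 615,
]

n = len(rents)
mean = sum(rents) / n

# Squared difference between each observation and the mean.
squared_deviations = [(x - mean) ** 2 for x in rents]

# Sample variance: divide by n - 1.
sample_variance = sum(squared_deviations) / (n - 1)

# Population variance: divide by N (assumes these 70 values are the whole population).
population_variance = sum(squared_deviations) / n

print(round(mean, 2))             # 490.8
print(round(sample_variance, 2))  # 2996.16
```

The sample form is always slightly larger than the population form for the same data, because it divides the same sum by a smaller number.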
Standard Deviation

The standard deviation of a data set is the positive square root of the variance.

It is measured in the same units as the data, making it more easily interpreted than the variance.
Standard Deviation

The standard deviation is computed as follows:

$$s = \sqrt{s^2} \quad \text{(for a sample)} \qquad \sigma = \sqrt{\sigma^2} \quad \text{(for a population)}$$
Coefficient of Variation

The coefficient of variation indicates how large the standard deviation is in relation to the mean.

The coefficient of variation is computed as follows:

$$\left( \frac{s}{\bar{x}} \times 100 \right)\% \quad \text{(for a sample)} \qquad \left( \frac{\sigma}{\mu} \times 100 \right)\% \quad \text{(for a population)}$$
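
A short sketch of the sample standard deviation and coefficient of variation, again assuming the apartment rent sample from these slides. It relies on `statistics.mean` and `statistics.stdev` from the Python standard library; the variable names are illustrative.

```python
import statistics

# Apartment rent sample (70 values) from the skewness example in these slides.
rents = [
    425, 430, 430, 435, 435, 435, 435, 435, 440, 440,
    440, 440, 440, 445, 445, 445, 445, 445, 450, 450,
    450, 450, 450, 450, 450, 460, 460, 460, 465, 465,
    465, 470, 470, 472, 475, 475, 475, 480, 480, 480,
    480, 485, 490, 490, 490, 500, 500, 500, 500, 510,
    510, 515, 525, 525, 525, 535, 549, 550, 570, 570,
    575, 575, 580, 590, 600, 600, 600, 600, 615, 615,
]

mean = statistics.mean(rents)
s = statistics.stdev(rents)  # sample standard deviation (n - 1 divisor)

# Coefficient of variation: standard deviation as a percentage of the mean.
cv = s / mean * 100

print(round(s, 2))   # 54.74
print(round(cv, 2))  # 11.15
```

Here the standard deviation is about 11% of the mean, a unit-free figure that can be compared across data sets measured on different scales.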
Descriptive Statistics:
Numerical Measures
• Measures of Distribution Shape, Relative Location, and Detecting Outliers
Measures of Distribution Shape and Relative Location

• Distribution Shape
• z-Scores
• Detecting Outliers
Distribution Shape: Skewness
• An important measure of the shape of a distribution is called skewness.
• The formula for computing skewness for a data set is somewhat complex.
• Skewness can be easily computed using statistical software.
Skewness: a bit of history

Relationship between location measures (an empirical approximation for moderately skewed distributions):

mean – mode ≈ 3(mean – median)

Coefficient of skewness (independent of measurement units):

$$sk = \frac{\bar{x} - x_M}{\sigma}$$

Combining both (this is the form we will be using):

$$sk = \frac{3(\bar{x} - m)}{\sigma}$$

where $x_M$ is the mode (a value that occurs most frequently in the sample or population), $m$ is the median, and $\sigma$ is the standard deviation.

Karl Pearson (1857-1938)
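
To make the median-based coefficient concrete, here is a minimal sketch applied to the apartment rent sample from these slides. Using the sample standard deviation in the denominator is an assumption; the population form gives nearly the same value for 70 observations.

```python
import statistics

# Apartment rent sample (70 values) from the skewness example in these slides.
rents = [
    425, 430, 430, 435, 435, 435, 435, 435, 440, 440,
    440, 440, 440, 445, 445, 445, 445, 445, 450, 450,
    450, 450, 450, 450, 450, 460, 460, 460, 465, 465,
    465, 470, 470, 472, 475, 475, 475, 480, 480, 480,
    480, 485, 490, 490, 490, 500, 500, 500, 500, 510,
    510, 515, 525, 525, 525, 535, 549, 550, 570, 570,
    575, 575, 580, 590, 600, 600, 600, 600, 615, 615,
]

mean = statistics.mean(rents)
median = statistics.median(rents)
s = statistics.stdev(rents)  # assumption: sample standard deviation

# Pearson's median-based coefficient of skewness: 3 (mean - median) / sd.
pearson_sk = 3 * (mean - median) / s

print(median)                # 475.0
print(round(pearson_sk, 2))  # 0.87
```

The positive value reflects that the mean (490.8) lies above the median (475), which is what one expects for data with a longer right tail.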
Skewness formulas

Skewness:

$$sk = \frac{\frac{1}{T} \sum_{i=1}^{T} (x_i - \bar{x})^3}{\sigma^3}$$

That is, the average cubed deviation from the mean divided by the cubed standard deviation, where $\bar{x}$ is the mean, $\sigma$ is the standard deviation, and $T$ is the number of data points.
Skewness formulas

Adjusted Fisher-Pearson standardised moment coefficient:

$$sk = \frac{T}{(T - 1)(T - 2)} \sum_{i=1}^{T} \frac{(x_i - \bar{x})^3}{\sigma^3}$$

where $\bar{x}$ is the mean, $\sigma$ is the standard deviation (computed here with the $T - 1$ divisor), and $T$ is the number of data points.
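
Both moment-based coefficients can be sketched directly from their definitions. The code below assumes the apartment rent sample from these slides, uses the population standard deviation for the plain moment coefficient and the sample standard deviation (T - 1 divisor) for the adjusted one, and uses illustrative variable names throughout.

```python
import math

# Apartment rent sample (70 values) from the skewness example in these slides.
rents = [
    425, 430, 430, 435, 435, 435, 435, 435, 440, 440,
    440, 440, 440, 445, 445, 445, 445, 445, 450, 450,
    450, 450, 450, 450, 450, 460, 460, 460, 465, 465,
    465, 470, 470, 472, 475, 475, 475, 480, 480, 480,
    480, 485, 490, 490, 490, 500, 500, 500, 500, 510,
    510, 515, 525, 525, 525, 535, 549, 550, 570, 570,
    575, 575, 580, 590, 600, 600, 600, 600, 615, 615,
]

T = len(rents)
mean = sum(rents) / T
sum_sq = sum((x - mean) ** 2 for x in rents)
sum_cubed = sum((x - mean) ** 3 for x in rents)

# Moment coefficient: average cubed deviation over the cubed
# population standard deviation (divisor T).
sigma = math.sqrt(sum_sq / T)
moment_skewness = (sum_cubed / T) / sigma ** 3

# Adjusted Fisher-Pearson coefficient: uses the sample standard
# deviation (divisor T - 1) and the factor T / ((T - 1)(T - 2)).
s = math.sqrt(sum_sq / (T - 1))
adjusted_skewness = T / ((T - 1) * (T - 2)) * sum_cubed / s ** 3

print(round(adjusted_skewness, 2))  # 0.92
```

The adjusted coefficient is slightly larger than the plain moment coefficient for the same data, and the gap shrinks as the number of data points grows.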
Galton skewness (also known as Bowley's skewness) is defined as

Galton skewness=(Q1 + Q3 −2Q2 )/(Q3 − Q1)

where Q1 is the lower quartile, Q3 is the upper quartile, and Q2 is the median.
Skewness Example
3rd Quartile (Q3) = 525
1st Quartile (Q1) = 445
2nd Quartile (Q2) = 475
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615

Galton skewness=(Q1 + Q3 −2Q2 )/(Q3 − Q1)

=(445+525-2*475)/(525-445)
=20/80
=0.25
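
The quartile-based calculation above is simple enough to wrap in a small helper. This is only a sketch; the function name `galton_skewness` is illustrative and not taken from any library.

```python
def galton_skewness(q1, q2, q3):
    """Galton (Bowley) skewness from the three quartiles."""
    return (q1 + q3 - 2 * q2) / (q3 - q1)


# Quartiles of the apartment rent sample, as given on the slide.
print(galton_skewness(445, 475, 525))  # 0.25
```

Because it uses only quartiles, this measure always lies between -1 and 1 and is not affected by the most extreme values in the data.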
Distribution Shape: Skewness

• Symmetric (not skewed)
  • Skewness is zero.
  • Mean and median are equal.

[Figure: relative frequency histogram of a symmetric distribution, Skewness = 0]
Distribution Shape: Skewness

• Moderately Skewed Left
  • Skewness is negative.
  • Mean will usually be less than the median.

[Figure: relative frequency histogram of a moderately left-skewed distribution, Skewness = -.31]
Distribution Shape: Skewness

• Moderately Skewed Right
  • Skewness is positive.
  • Mean will usually be more than the median.

[Figure: relative frequency histogram of a moderately right-skewed distribution, Skewness = .31]
Distribution Shape: Skewness

• Highly Skewed Right
  • Skewness is positive (often above 1.0).
  • Mean will usually be more than the median.

[Figure: relative frequency histogram of a highly right-skewed distribution, Skewness = 1.25]
Distribution Shape: Skewness

• Example: Apartment Rents

Seventy efficiency apartments were randomly sampled in a small college town. The monthly rent prices for these apartments are listed in ascending order on the next slide.
Distribution Shape: Skewness

[Figure: relative frequency histogram of the apartment rents, Skewness = .92]
z-Scores

The z-score is often called the standardized value.

It denotes the number of standard deviations a data value $x_i$ is from the mean.

Sample z-score:

$$z_i = \frac{x_i - \bar{x}}{s}$$

Population z-score:

$$z_i = \frac{x_i - \mu}{\sigma}$$
z-Scores

• An observation’s z-score is a measure of the relative location of the observation in a data set.
• A data value less than the sample mean will have a z-score less than zero.
• A data value greater than the sample mean will have a z-score greater than zero.
• A data value equal to the sample mean will have a z-score of zero.
z-Scores
• z-Score of Smallest Value (425)

$$z = \frac{x_i - \bar{x}}{s} = \frac{425 - 490.80}{54.74} = -1.20$$

Standardized Values for Apartment Rents

-1.20 -1.11 -1.11 -1.02 -1.02 -1.02 -1.02 -1.02 -0.93 -0.93
-0.93 -0.93 -0.93 -0.84 -0.84 -0.84 -0.84 -0.84 -0.75 -0.75
-0.75 -0.75 -0.75 -0.75 -0.75 -0.56 -0.56 -0.56 -0.47 -0.47
-0.47 -0.38 -0.38 -0.34 -0.29 -0.29 -0.29 -0.20 -0.20 -0.20
-0.20 -0.11 -0.01 -0.01 -0.01 0.17 0.17 0.17 0.17 0.35
0.35 0.44 0.62 0.62 0.62 0.81 1.06 1.08 1.45 1.45
1.54 1.54 1.63 1.81 1.99 1.99 1.99 1.99 2.27 2.27
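
The table of standardized values can be regenerated with a short script. This sketch assumes the apartment rent sample from these slides and the sample standard deviation; the variable names are illustrative.

```python
import statistics

# Apartment rent sample (70 values) from the skewness example in these slides.
rents = [
    425, 430, 430, 435, 435, 435, 435, 435, 440, 440,
    440, 440, 440, 445, 445, 445, 445, 445, 450, 450,
    450, 450, 450, 450, 450, 460, 460, 460, 465, 465,
    465, 470, 470, 472, 475, 475, 475, 480, 480, 480,
    480, 485, 490, 490, 490, 500, 500, 500, 500, 510,
    510, 515, 525, 525, 525, 535, 549, 550, 570, 570,
    575, 575, 580, 590, 600, 600, 600, 600, 615, 615,
]

mean = statistics.mean(rents)
s = statistics.stdev(rents)  # sample standard deviation

# One z-score per observation: distance from the mean in standard deviations.
z_scores = [(x - mean) / s for x in rents]

print(round(z_scores[0], 2))   # -1.2  (smallest rent, 425)
print(round(z_scores[-1], 2))  # 2.27  (largest rent, 615)
```

Standardized values always have mean 0 and, with the sample standard deviation used here, sample standard deviation 1, which is what makes them comparable across data sets.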
EXAMPLE Using Z-Scores

The mean height of males 20 years or older is 69.1 inches with a standard deviation of 2.8 inches. The mean height of females 20 years or older is 63.7 inches with a standard deviation of 2.7 inches. (Data based on information obtained from the National Health and Examination Survey.)

Who is relatively taller: Shaquille O’Neal, whose height is 85 inches, or Lisa Leslie, whose height is 77 inches?
Answer:

• Shaquille O’Neal’s z-score: (85 - 69.1)/2.8 = 5.67857143
• Lisa Leslie’s z-score: (77 - 63.7)/2.7 = 4.92592593

Because O’Neal’s z-score is greater than Leslie’s, O’Neal is relatively taller: his height lies further above the mean of his group than Leslie’s height does above the mean of hers.
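
The comparison can be expressed as a tiny helper that standardizes a value against its own group. The function name `z_score` is illustrative; the means and standard deviations are the population figures quoted in the example.

```python
def z_score(value, mean, std_dev):
    """Number of standard deviations `value` lies from `mean`."""
    return (value - mean) / std_dev


# Heights in inches, each standardized against its own group.
oneal_z = z_score(85, mean=69.1, std_dev=2.8)   # males 20 years or older
leslie_z = z_score(77, mean=63.7, std_dev=2.7)  # females 20 years or older

print(round(oneal_z, 2))   # 5.68
print(round(leslie_z, 2))  # 4.93
print(oneal_z > leslie_z)  # True
```

Comparing raw heights would only say that O’Neal is taller in absolute terms; standardizing first is what allows a comparison relative to each group.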
Empirical Rule
For data having a bell-shaped distribution:

About 68.27% of the values of a normal random variable are within +/- 1 standard deviation of its mean.

About 95.45% of the values of a normal random variable are within +/- 2 standard deviations of its mean.

About 99.73% of the values of a normal random variable are within +/- 3 standard deviations of its mean.
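
These percentages follow from the normal distribution itself: the share of values within k standard deviations of the mean equals erf(k/√2). The sketch below checks this with `math.erf` from the Python standard library; the helper name `share_within` is illustrative. Figures read from a four-decimal normal table can differ from these in the last digit.

```python
import math


def share_within(k):
    """Share of a normal distribution within k standard deviations of the mean."""
    return math.erf(k / math.sqrt(2))


for k in (1, 2, 3):
    print(k, round(100 * share_within(k), 2))
# 1 68.27
# 2 95.45
# 3 99.73
```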
EXAMPLE Using the Empirical Rule

The following data represent the serum HDL cholesterol of the 54 female patients of a family doctor.
41 48 43 38 35 37 44 44 44
62 75 77 58 82 39 85 55 54
67 69 69 70 65 72 74 74 74
60 60 60 61 62 63 64 64 64
54 54 55 56 56 56 57 58 59
45 47 47 48 48 50 52 52 53

(a) Compute the population mean and standard deviation.
(b) Draw a histogram to verify the data is bell-shaped.
(c) Determine the percentage of patients that have serum HDL within 3 standard deviations of the mean according to the Empirical Rule.
(d) Determine the percentage of patients that have serum HDL between 34 and 80.8 according to the Empirical Rule.
(e) Determine the actual percentage of patients that have serum HDL between 34 and 80.8.
(a) Using a TI-83 Plus graphing calculator, we find $\mu = 57.4$ and $\sigma = 11.7$.

(b) [Figure: histogram of the serum HDL data]

(c) According to the Empirical Rule, approximately 99.7% of the patients will have serum HDL cholesterol levels within 3 standard deviations of the mean. That is, approximately 99.7% of the patients will have serum HDL cholesterol levels greater than or equal to 57.4 - 3(11.7) = 22.3 and less than or equal to 57.4 + 3(11.7) = 92.5.

(d) Because 34 is 2 standard deviations below the mean (57.4 - 2(11.7) = 34) and 80.8 is 2 standard deviations above the mean (57.4 + 2(11.7) = 80.8), the Empirical Rule states that approximately 95% of the data will lie between 34 and 80.8.
(e) There are no observations below 34. There are 2 observations greater than 80.8. Therefore, 52/54 = 96.3% of the data lie between 34 and 80.8.
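
Parts (a), (d) and (e) can be checked with a short script. This is a sketch that treats the 54 values as the whole population, as the example does, and rounds the mean and standard deviation to one decimal place to match the slide; the variable names are illustrative.

```python
import statistics

# Serum HDL cholesterol of the 54 patients, as listed in the example.
hdl = [
    41, 48, 43, 38, 35, 37, 44, 44, 44,
    62, 75, 77, 58, 82, 39, 85, 55, 54,
    67, 69, 69, 70, 65, 72, 74, 74, 74,
    60, 60, 60, 61, 62, 63, 64, 64, 64,
    54, 54, 55, 56, 56, 56, 57, 58, 59,
    45, 47, 47, 48, 48, 50, 52, 52, 53,
]

# (a) Population mean and standard deviation, rounded as on the slide.
mu = round(statistics.mean(hdl), 1)
sigma = round(statistics.pstdev(hdl), 1)

# (d) Interval of 2 standard deviations around the mean.
low, high = mu - 2 * sigma, mu + 2 * sigma

# (e) Actual share of observations inside that interval.
inside = [x for x in hdl if low <= x <= high]
actual_pct = 100 * len(inside) / len(hdl)

print(mu, sigma)             # 57.4 11.7
print(len(inside))           # 52
print(round(actual_pct, 1))  # 96.3
```

The observed 96.3% is close to the roughly 95% the Empirical Rule predicts, which is consistent with the data being approximately bell-shaped.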
Detecting Outliers

• An outlier is an unusually small or unusually large value in a data set.
• A data value with a z-score less than -3 or greater than +3 might be considered an outlier.
• It might be:
  • an incorrectly recorded data value
  • a data value that was incorrectly included in the data set
  • a correctly recorded data value that belongs in the data set
Detecting Outliers

• The most extreme z-scores are -1.20 and 2.27.
• Using |z| > 3 as the criterion for an outlier, there are no outliers in this data set.
Standardized Values for Apartment Rents
-1.20 -1.11 -1.11 -1.02 -1.02 -1.02 -1.02 -1.02 -0.93 -0.93
-0.93 -0.93 -0.93 -0.84 -0.84 -0.84 -0.84 -0.84 -0.75 -0.75
-0.75 -0.75 -0.75 -0.75 -0.75 -0.56 -0.56 -0.56 -0.47 -0.47
-0.47 -0.38 -0.38 -0.34 -0.29 -0.29 -0.29 -0.20 -0.20 -0.20
-0.20 -0.11 -0.01 -0.01 -0.01 0.17 0.17 0.17 0.17 0.35
0.35 0.44 0.62 0.62 0.62 0.81 1.06 1.08 1.45 1.45
1.54 1.54 1.63 1.81 1.99 1.99 1.99 1.99 2.27 2.27
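
The |z| > 3 screening rule can be sketched as a small function. The name `find_outliers` and the extra rent of 1200 are hypothetical and not part of the slides; the extra value is added only to show what a flagged observation looks like.

```python
import statistics

# Apartment rent sample (70 values) from the skewness example in these slides.
rents = [
    425, 430, 430, 435, 435, 435, 435, 435, 440, 440,
    440, 440, 440, 445, 445, 445, 445, 445, 450, 450,
    450, 450, 450, 450, 450, 460, 460, 460, 465, 465,
    465, 470, 470, 472, 475, 475, 475, 480, 480, 480,
    480, 485, 490, 490, 490, 500, 500, 500, 500, 510,
    510, 515, 525, 525, 525, 535, 549, 550, 570, 570,
    575, 575, 580, 590, 600, 600, 600, 600, 615, 615,
]


def find_outliers(data, threshold=3.0):
    """Return the values whose |z-score| exceeds the threshold."""
    mean = statistics.mean(data)
    s = statistics.stdev(data)  # sample standard deviation
    return [x for x in data if abs((x - mean) / s) > threshold]


print(find_outliers(rents))           # []
# Hypothetical extra observation (not in the slides) to show a flagged value.
print(find_outliers(rents + [1200]))  # [1200]
```

A flagged value is only a candidate for review: as noted above, it may be a recording error, a value that does not belong in the data set, or a legitimate observation.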
