0% found this document useful (0 votes)
39 views

1 - Descriptive Statistics Part 3

The document provides information about measures of variability and position in descriptive statistics. It defines concepts like range, interquartile range, standard deviation, coefficient of variation, z-scores, percentiles, quartiles, and deciles. Examples are given to demonstrate how to calculate and interpret these measures using data sets. Procedures are outlined for finding values corresponding to specific percentiles and quartiles. A video link is provided for additional examples.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
39 views

1 - Descriptive Statistics Part 3

The document provides information about measures of variability and position in descriptive statistics. It defines concepts like range, interquartile range, standard deviation, coefficient of variation, z-scores, percentiles, quartiles, and deciles. Examples are given to demonstrate how to calculate and interpret these measures using data sets. Procedures are outlined for finding values corresponding to specific percentiles and quartiles. A video link is provided for additional examples.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 41

EM 1203

UNIT 1 – Descriptive Statistics


Part 3
Teacher: Engr. Stephanie Cañete
MEASURES OF
VARIABILITY
Measures of Variability

1. Range and Interquartile Range:


R = largest value – smallest value
IQR : measures the spread of the middle 50% of an
ordered data set.
Example

The following data points represent the number of animal crackers in


each kid’s lunch box.
4 4 10 11 15 7 14 12 6

Sort the data from the least to greatest.


Find the interquartile range (IQR) of the data set.
Solution:
4 4 6 7 10 11 12 14 15
(Please see the rest of the solution in the given video)
https://youtu.be/qLYYHWYr8xI
Example

Find the interquartile range (IQR) of the data in the dot plot
below.

(Please see the solution in the same given video)


https://youtu.be/qLYYHWYr8xI
DESCRIPTIVE
STATISTICS

• A statistic that
allows you to compare
standard deviations
when the units are
different is called the
coefficient of
variation.
Descriptive Statistics

The coefficient of variation, denoted by CVar, is the standard


deviation divided by the mean. The result is expressed as a
percentage.

sample standard deviation s


For samples, CVar = =
mean x

population standard deviation s


For populations, CVar = =
mean µ
Descriptive Statistics
Sample 1
The mean for the number of pages of a sample of a magazines is
132, with a variance of 23
The mean for the number of advertisements of a magazine is 182,
with a variance of 62. Compare the variations.
Note: Please see the example also given in the video:
https://youtu.be/4BCVcIyEY5k
CHEBYSHEV’S THEOREM
EMPIRICAL (NORMAL) RULE
Significance of Standard Deviation

Chebyshev’s Theorem

Use Chebyshev’s Theorem for any data set, regardless of the distribution shape.
FOR EXAMPLE:

• This theorem says that within two standard deviations of


the mean, you will find out at least

1 1 3 1
1- 2 = 1- 2 = 1- =
k 2 4 4
= 0.75 or at least 75%
SPECIFIC EXAMPLE:

• Suppose that the average score on a math test is an 84 with


a standard deviation of 4 points.
• According to Chebyshev’s theorem, at least what percent of
the tests have a grade of at least 72 and at most 96?
Descriptive Statistics
k 1 - (1/k^2) k 1 - (1/k^2)
1 0.0 16 99.6
2 75.0 17 99.7
3 88.9 18 99.7
4 93.8 19 99.7
5 96.0 20 99.8
6 97.2 21 99.8
7 98.0 22 99.8
8 98.4 23 99.8
9 98.8 24 99.8
10 99.0 25 99.8 x ± ks
11 99.2 26 99.9
where,
12 99.3 27 99.9
x = mean/average
13 99.4 28 99.9
k = constant
14 99.5 29 99.9
s = standard deviation
15 99.6 30 99.9
Descriptive Statistics

Step 1: Find the mean of the


data set

Step 2: Find the standard


deviation of the data set

Step 3: Solve for “k”

Step 4: Find the range of


values
Descriptive Statistics

Example:
The mean price of houses in a certain neighbourhood
is $50,000, and the standard deviation is $10,000. Find
the price range for which at least 75% of the houses
will sell.
The Empirical (Normal) Rule

Approximately 68% , 95% , and “almost all” or 99.7% of the


measurements are within one , two, and three standard deviations of
the mean respectively.

The Empirical Rule can be used only for relatively mound-shaped data sets.
EXAMPLE:
• Assume that the mean weight of 1 year-old girls in the US is
normally distributed with a mean of about 9.5 kg with a standard
deviation of approximately 1.1 kg. Without using a calculator,
estimate the percentage of 1 year-old girls in the US that meet
the following conditions. Draw a sketch and shade the the proper
region for each problem.
(a) Less than 8.4 kg
(b) Between 7.3 kg and 11.7 kg
(c) More than 12.8 kg
MEASURES OF
POSITION
Descriptive Statistics

A standard score or z score tells how many standard deviations a data


value is above or below the mean for a specific distribution of values.

If a standard score is zero, then the


data value is the same as the mean.

The z score represents the number


of standard deviations that a data
value falls above or below the
mean.
Descriptive Statistics

A z score or standard score for a value is obtained by subtracting the


mean from the value and dividing the result by the standard deviation.
The symbol for a standard score is z.

value - mean
z=
standard deviation

For sample data, For population data,

x-x x-µ
z= z=
s s
Ø z-scores between -2 and +2 are highly likely.

Ø z-scores exceeding 3 in absolute value are very unlikely


and can be considered outliers (unusually large or small
observations).

PTan ChE dept, USC


Topic 2 – Data description
Example
A student scored 65 on a
calculus test that had a mean
of 50 and a standard
deviation of 10; he scored 30
on a history test with a mean
of 25 and a standard
deviation of 5. Compare his
relative positions on the two
tests.
Descriptive Statistics

Percentiles are position


measures used in
educational and health-
related fields to indicate
the position of an
individual in a group.

Percentiles divide the data


set into 100 equal groups.
Example:

Suppose in one major exam, you obtained a score of


150 (200 being the perfect score) and this placed you at the
60th percentile in the distribution of scores among the
examinees. Where does your score of 150 stand in relation
to the scores of others who took the examination?
Solution
Scoring at the 60th percentile means that 60% of all examination
scores were lower than yours and 40% were higher.
Descriptive Statistics

Percentile Corresponding to a Given Data Value

The percentile corresponding to a given value X is computed by


using the following formula:

(number of values below X) + 0.5


Percentile = (100%)
total number of values
Descriptive Statistics

Example
A teacher gives a 20-point test to 10 students. The scores are
the following
18, 15, 12, 6, 8, 2, 3, 5, 20, 10

a.) Find the percentile rank of a score of 12.

b.) Find the value corresponding to the 25th percentile

c.) Find the value corresponding to the 60th percentile


The steps for finding a value corresponding to a given percentile are summarized in
this Procedure Table.

Procedure
PROCEDURE Table
TABLE:

Finding a Data Value Corresponding to a Given Percentile


Step 1 Arrange the data in order from lowest to highest.
Step 2 Substitute into the formula
n•p
c"
100
where
n " total number of values
p " percentile
Step 3A If c is not a whole number, round up to the next whole number. Starting at the
lowest value, count over to the number that corresponds to the rounded-up value.
Step 3B If c is a whole number, use the value halfway between the cth and (c ! 1)st values
when counting up from the lowest value.

Quartiles and Deciles


Descriptive Statistics
Finding a Data Value Corresponding to a Given Percentile
Step 1
Arrange the data in order from lowest to highest.
Step 2
Substitute into the formula

c=
( n )( p )
where 100
c = rank of every data (1st , 2nd, 3rd ……)
n = total number of values
p = percentile
Descriptive Statistics

Step 3A
If c is not a whole number, round up to the next whole number.
Starting at the lowest value, count over to the number that
corresponds to the rounded-up value.
Step 3B
If c is a whole number, use the value halfway between the cth and
(c + 1)st values when counting up from the lowest value.
Descriptive Statistics

Quartiles divide the distribution into four groups, separated by Q1,


Q2, Q3

Note that Q1 is the same as the 25th percentile; Q2 is the same as the
50th percentile, or the median; Q3 corresponds to the 75th percentile,
as shown:

The interquartile range (IQR) is defined as the difference between Q1


and Q3 and is the range of the middle 50% of the data.
Descriptive Statistics
Finding Data Values Corresponding to Q1, Q2, and Q3
Step 1
Arrange the data in order from lowest to highest.
Step 2
Find the median of the data values. This is the value for
Q2.
Step 3
Find the median of the data values that fall below Q2.
This is the value for Q1.
Step 4
Find the median of the data values that fall above Q2.
This is the value for Q3.
Topic 2 – Data description

Example

Find Q1, Q2, and Q3 for the data set


15, 13, 6, 5, 12, 50, 22, 18.
Descriptive Statistics

Deciles divide the distribution into 10 groups, as shown.


They are denoted by D1, D2, etc.

You might also like