Statistics For Business and Economics (13e) : John Loucks
Statistics For Business and Economics (13e) : John Loucks
Statistics for
Slides by
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
1
Statistics for Business and Economics (13e)
Chapter 3, Part B
Descriptive Statistics: Numerical Measures
• Measures of Distribution Shape, Relative Location, and Detecting Outliers
• Five-Number Summaries and Box Plots
• Measures of Association Between Two Variables
• Data Dashboards: Adding Numerical Measures to Improve Effectiveness
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
2
Statistics for Business and Economics (13e)
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
3
Statistics for Business and Economics (13e)
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
4
Statistics for Business and Economics (13e)
.35
Skewness = 0
Relative Frequency
.30
.25
.20
.15
.10
.05
0
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
5
Statistics for Business and Economics (13e)
.25
.20
.15
.10
.05
0
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
6
Statistics for Business and Economics (13e)
.25
.20
.15
.10
.05
0
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
7
Statistics for Business and Economics (13e)
.25
.20
.15
.10
.05
0
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
8
Statistics for Business and Economics (13e)
525 530 530 535 535 535 535 535 540 540
540 540 540 545 545 545 545 545 550 550
550 550 550 550 550 560 560 560 565 565
565 570 570 572 575 575 575 580 580 580
580 585 590 590 590 600 600 600 600 610
610 615 625 625 625 635 649 650 670 670
675 675 680 690 700 700 700 700 715 715
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
9
Statistics for Business and Economics (13e)
.25
.20
.15
.10
.05
0
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
10
Statistics for Business and Economics (13e)
z-Scores
• The z-score is often called the standardized value.
• It denotes the number of standard deviations a data value xi is from the mean.
=
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
11
Statistics for Business and Economics (13e)
z-Scores
• An observation’s z-score is a measure of the relative location of the observation
in a data set.
• A data value less than the sample mean will have a z-score less than zero.
• A data value greater than the sample mean will have a z-score greater than
zero.
• A data value equal to the sample mean will have a z-score of zero.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
12
Statistics for Business and Economics (13e)
z-Scores
• Example: Apartment Rents
• z-Score of Smallest Value (525)
= = -1.20
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
13
Statistics for Business and Economics (13e)
Chebyshev’s Theorem
• At least (1 - 1/z2) of the data values must be within z standard deviations of
the mean, where z is any value greater than 1.
• Chebyshev’s theorem requires z > 1; but z need not be an integer.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
14
Statistics for Business and Economics (13e)
Chebyshev’s Theorem
• At least 75% of the data values must be within z = 2 standard
deviations of the mean.
• At least 89% of the data values must be within z = 3 standard
deviations of the mean.
• At least 94% of the data values must be within z = 4 standard
deviations of the mean.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
15
Statistics for Business and Economics (13e)
Chebyshev’s Theorem
• Example: Apartment Rents
Let z = 1.5 with = 590.80 and s = 54.74
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
16
Statistics for Business and Economics (13e)
Empirical Rule
• When the data are believed to approximate a bell-shaped distribution:
• The empirical rule can be used to determine the percentage of data
values that must be within a specified number of standard deviations
of the mean.
• The empirical rule is based on the normal distribution, which is
covered in Chapter 6.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
17
Statistics for Business and Economics (13e)
Empirical Rule
For data having a bell-shaped distribution:
• Approximately 68% of the data values will be within one standard
deviation of the mean.
• Approximately 95% of the data values will be within two standard
deviations of the mean.
• Almost all of the data values will be within three standard deviations of
the mean.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
18
Statistics for Business and Economics (13e)
Empirical Rule
99.72%
95.44%
68.26%
z
m
m – 3s m – 1s m + 1s m + 3s
m – 2s m + 2s
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
19
Statistics for Business and Economics (13e)
Detecting Outliers
• An outlier is an unusually small or unusually large value in a data set.
• A data value with a z-score less than -3 or greater than +3 might be considered
an outlier.
• It might be:
• an incorrectly recorded data value
• a data value that was incorrectly included in the data set
• a correctly recorded data value that belongs in the data set
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
20
Statistics for Business and Economics (13e)
Empirical Rule
• Example: Apartment Rents
• The most extreme z-scores are -1.20 and 2.27.
• Using |z| > 3 as the criterion for an outlier, there are no outliers in this data
set.
Standardized Values for Apartment Rents
-1.20 -1.11 -1.11 -1.02 -1.02 -1.02 -1.02 -1.02 -0.93 -0.93
-0.93 -0.93 -0.93 -0.84 -0.84 -0.84 -0.84 -0.84 -0.75 -0.75
-0.75 -0.75 -0.75 -0.75 -0.75 -0.56 -0.56 -0.56 -0.47 -0.47
-0.47 -0.38 -0.38 -0.34 -0.29 -0.29 -0.29 -0.20 -0.20 -0.20
-0.20 -0.11 -0.01 -0.01 -0.01 0.17 0.17 0.17 0.17 0.35
0.35 0.44 0.62 0.62 0.62 0.81 1.06 1.08 1.45 1.45
1.54 1.54 1.63 1.81 1.99 1.99 1.99 1.99 2.27 2.27
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
21
Statistics for Business and Economics (13e)
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
22
Statistics for Business and Economics (13e)
Five-Number Summary
1. Smallest Value
2. First Quartile
3. Median
4. Third Quartile
5. Largest Value
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
23
Statistics for Business and Economics (13e)
Five-Number Summary
• Example: Apartment Rents
Lowest Value = 525 First Quartile = 545
Median = 575
Third Quartile = 625 Largest Value = 715
525 530 530 535 535 535 535 535 540 540
540 540 540 545 545 545 545 545 550 550
550 550 550 550 550 560 560 560 565 565
565 570 570 572 575 575 575 580 580 580
580 585 590 590 590 600 600 600 600 610
610 615 625 625 625 635 649 650 670 670
675 675 680 690 700 700 700 700 715 715
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
24
Statistics for Business and Economics (13e)
Box Plot
• A box plot is a graphical display of data that is based on a five-number
summary.
• A key to the development of a box plot is the computation of the median and
the quartiles Q1 and Q3.
• Box plots provide another way to identify outliers.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
25
Statistics for Business and Economics (13e)
Box Plot
• Example: Apartment Rents
• A box is drawn with its ends located at the first and third quartiles.
• A vertical line is drawn in the box at the location of the median (second
quartile).
500 525 550 575 600 625 650 675 700 725
Q1 = 545 Q3 = 625
Q2 = 575
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
26
Statistics for Business and Economics (13e)
Box Plot
• Limits are located (not drawn) using the interquartile range (IQR).
• Data outside these limits are considered outliers.
• The location of each outlier is shown with the symbol * .
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
27
Statistics for Business and Economics (13e)
Box Plot
• Example: Apartment Rents
• The lower limit is located 1.5(IQR) below Q1.
Lower Limit: Q1 - 1.5(IQR) = 545 - 1.5(80) = 425
• The upper limit is located 1.5(IQR) above Q3.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
28
Statistics for Business and Economics (13e)
Box Plot
• Example: Apartment Rents
• Whiskers (dashed lines) are drawn from the ends of the box to the smallest
and largest data values inside the limits.
500 525 550 575 600 625 650 675 700 725
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
29
Statistics for Business and Economics (13e)
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
30
Statistics for Business and Economics (13e)
Covariance
• The covariance is a measure of the linear association between two variables.
• Positive values indicate a positive relationship.
• Negative values indicate a negative relationship.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
31
Statistics for Business and Economics (13e)
Covariance
• The covariance is computed as follows:
For samples: =
For populations:
=
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
32
Statistics for Business and Economics (13e)
Correlation Coefficient
• Correlation is a measure of linear association and not necessarily causation.
• Just because two variables are highly correlated, it does not mean that one
variable is the cause of the other.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
33
Statistics for Business and Economics (13e)
Correlation Coefficient
• The correlation coefficient is computed as follows:
For samples: =
For populations: =
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
34
Statistics for Business and Economics (13e)
Correlation Coefficient
• The coefficient can take on values between -1 and +1.
• Values near -1 indicate a strong negative linear relationship.
• Values near +1 indicate a strong positive linear relationship.
• The closer the correlation is to zero, the weaker the relationship.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
35
Statistics for Business and Economics (13e)
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
36
Statistics for Business and Economics (13e)
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
37
Statistics for Business and Economics (13e)
= = = -7.08
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
38
Statistics for Business and Economics (13e)
Data Dashboards:
Adding Numerical Measures to Improve Effectiveness
• Data dashboards are not limited to graphical displays.
• The addition of numerical measures, such as the mean and standard deviation
of KPIs, to a data dashboard is often critical.
• Dashboards are often interactive.
• Drilling down refers to functionality in interactive dashboards that allows the
user to access information and analyses at an increasingly detailed level.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
39
Statistics for Business and Economics (13e)
Data Dashboards:
Adding Numerical Measures to Improve Effectiveness
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
40
Statistics for Business and Economics (13e)
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
41