DataViz - 1e - Ch01 - PowerPoint 2
DataViz - 1e - Ch01 - PowerPoint 2
1e
Chapter 1: Introduction
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
Chapter Objectives
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
Data Visualization
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.1 Analytics
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.1 Developments in Analytics
Three developments have spurred the explosive growth in the use of analytics
for decision making:
• Technological advances – incredible amounts of data from scanner technology, e-
commerce, social networks, sensors, and personal electronic devices such as cell
phones.
• Methodological developments – faster algorithms can handle and explore
massive amounts of data for data visualization, machine learning, optimization, and
simulation.
• Explosion in computing power and storage capability – better computational
hardware, parallel computing, and cloud computing, enable businesses to solve
larger problems faster and with greater accuracy.
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.1 A Categorization of Analytical Methods
[Author Name], [Book Title], [#] Edition. © [Insert Year] Cengage. All Rights Reserved. May not be scanned,
copied or duplicated, or posted to a publicly accessible website, in whole or in part.
1.2 Why Visualize Data?
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.2 Data Visualization for Exploration: Patterns
25,000
Column Chart of Zoo Attendance by Month
20,000
15,000
10,000
5,000
0
Jan Feb Mar Apr May Jun July Aug Sept Oct Nov Dec
Month
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.2 Data Visualization for Exploration: Relationships
Data visualization is a powerful tool to better understand the relationship
between variables.
Y Linear Regression of Anscombe's Data Set Y Linear Regression of Anscombe's Data Set
12 1 10 2
f(x) = 0.5 x + 3.00090909090909
9 R² = 0.666242033727484
10
f(x) = 0.500090909090909 x + 3.00009090909091 8
R² = 0.666542459508775 7
8
6
6 5
4
4
3
2
2
1
0 0
2 4 6 8 10 12 14 16 2 4 6 8 10 12 14 16
X X
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.2 Data Visualization for Explanation
Data visualization is also important for explaining relationships found in
data and for explaining the results of predictive and prescriptive models.
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.3 Quantitative vs Categorical Data
Quantitative data
Data for which numerical values indicate
magnitude. Arithmetic operations, such as
addition, subtraction, multiplication, and division,
can be performed on quantitative data.
Examples: Share Price ($), and Volume.
Categorical data
Data for which labels or names identify
categories of like items. Arithmetic operations
cannot be performed on categorical data.
Examples: Company, Symbol, and Industry.
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.3 Cross-Sectional vs Time Series Data
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.3 Big Data
[Author Name], [Book Title], [#] Edition. © [Insert Year] Cengage. All Rights Reserved. May not be scanned,
copied or duplicated, or posted to a publicly accessible website, in whole or in part.
1.3 Word Cloud
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.4 Data Visualization for Accounting
A clustered column chart showing Benford’s Law versus Tucker Software’s Accounts
Payable Entries
Benford’s Law, (the First-Digit Law),
states that the proportion of
observations in which the first digit is 1
through 9, respectively, follows given
probabilities.
Benford's Law may help detect fraud. If
the first digits of numbers in a data set
do not conform to Benford's Law, fraud
investigation may be warranted.
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.4 Data Visualization for Finance
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.4 Data Visualization for Human Resource
Management
A stacked column chart of employee turnover by month
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.4 Data Visualization for Marketing
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.4 Data Visualization for Operations
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.4 Data Visualization for Engineering
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.4 Data Visualization for Science
Source: clickorlando.com
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
1.4 Data Visualization for Sports
Source: basketball-reference.com
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
Discussion Activity 1
• Consider the two scatter charts and related trendline and regression
statistics shown for Anscombe’s data sets in slide 9. The estimated
regression equations and related R-squares for both data sets are identical.
• Does fitting a line to the data appear to be a wise choice for both data
sets? Explain your answer.
• What would be a more appropriate regression equation to fit Anscombe’s
Data Set 2?
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
Discussion Activity 2
• Consider the first digit of Tucker software’s accounts payable entries in the
clustered column chart in slide 15.
• Does it appear that the data follow Benford’s Law? Explain your answer.
• Which first digits from Tucker's accounts payable entries stand out as
underrepresented in terms of absolute and relative proportional
difference for the corresponding expected probabilities as dictated by
Benford's Law?
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
Check Your Knowledge
1. Map visualizations are often used in the natural sciences industry because the data are
often _________.
a. geographic
b. quantitative
c. ordinal
d. weather-related
2. Which chart displays a variable of interest plotted over time relative to lower and upper
control limits?
a. High-low-close stock chart
b. Funnel chart
c. Clustered bar chart
d. Control chart
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
Summary
Camm, Cochran, Fry & Ohlmann, Data Visualization - Exploring and Explaining with Data, 1st Edition. © 2021
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible
website, in whole or in part.