Topic 3_ 2023_Data Visualization_std_1
2. Compute L85 = (p/100)(n + 1) = (85/100)(12 + 1) = 11.05.
3. The interpretation
The difference between the third and first quartiles is often referred to as
the interquartile range, or IQR.
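The location formula and the IQR can be sketched in Python. This is a minimal illustration, assuming linear interpolation between the two neighboring sorted values when the location is not a whole number. The function name `percentile` and the 12-value sample are hypothetical and are not the data used on the slides.

```python
def percentile(values, p):
    """Return the p-th percentile using the location L_p = (p / 100) * (n + 1)."""
    data = sorted(values)
    n = len(data)
    location = (p / 100) * (n + 1)
    # Keep the location inside the valid 1-based positions 1..n.
    location = max(1.0, min(float(n), location))
    lower = int(location)        # 1-based position at or just below the location
    fraction = location - lower  # how far to move toward the next sorted value
    if lower >= n:
        return float(data[-1])
    return data[lower - 1] + fraction * (data[lower] - data[lower - 1])


# Hypothetical sample of n = 12 observations (not the slide data).
sample = [10, 12, 15, 18, 20, 22, 25, 28, 30, 35, 40, 50]

p85 = percentile(sample, 85)                            # location 11.05
iqr = percentile(sample, 75) - percentile(sample, 25)   # Q3 - Q1
print(p85, iqr)
```

With n = 12 the 85th percentile sits at location 11.05, so the result lies 5% of the way from the 11th to the 12th sorted value.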
6. Analyzing Distributions
Figure: Calculating z-Scores for the Home Sales Data in Excel
z-Scores:
The z-score measures the relative location of a value in the data set.
Helps to determine how far a particular value is from the mean relative to
the data set’s standard deviation.
Often called the standardized value.
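If x1, x2, ..., xn is a sample of n observations, the z-score of xi is zi = (xi − x̄) / s, where x̄ is the sample mean and s is the sample standard deviation.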
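A short Python sketch of the z-score calculation, assuming the sample standard deviation (n − 1 denominator) is the intended measure of spread. The helper name `z_scores` and the sample values are hypothetical; the Home Sales data are not reproduced here.

```python
from statistics import mean, stdev


def z_scores(values):
    """Standardize each value: (x - sample mean) / sample standard deviation."""
    x_bar = mean(values)
    s = stdev(values)  # sample standard deviation, n - 1 in the denominator
    return [(x - x_bar) / s for x in values]


# Hypothetical sample; the mean is 5, so the values equal to 5 have z = 0.
sample = [2, 4, 4, 4, 5, 5, 7, 9]
z = z_scores(sample)
for x, zi in zip(sample, z):
    print(f"x = {x:>2}  z = {zi:+.3f}")
```

A positive z-score means the value lies above the mean, a negative one below it, and the magnitude counts standard deviations.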
◦ Outliers: Extreme values in a data set.
◦ They can be identified using standardized values (z-scores).
◦ Any data value with a z-score less than –3 or greater than +3 is an outlier.
◦ Such data values can then be reviewed to determine their accuracy and
whether they belong in the data set.
A box plot is a graphical summary of the distribution of data.
Developed from the quartiles for a data set.
Figure: Box Plot for the Home Sales Data
Figure: Box Plot Created in Excel for Home Sales Data
Figure: Box Plots for Multiple Variables Created in Excel
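The z-score rule for outliers described above can be sketched as follows. The function name `find_outliers` and the price list are hypothetical, and the sketch assumes the sample standard deviation is used when standardizing.

```python
from statistics import mean, stdev


def find_outliers(values, threshold=3.0):
    """Return the values whose z-score is below -threshold or above +threshold."""
    x_bar = mean(values)
    s = stdev(values)  # sample standard deviation
    return [x for x in values if abs((x - x_bar) / s) > threshold]


# Hypothetical data: twenty values close to 100 and one extreme value.
prices = [98, 99, 100, 101, 102] * 4 + [200]
outliers = find_outliers(prices)
print(outliers)
```

Flagged values are candidates for review, not automatic deletions: they may be data entry errors or legitimate extreme observations.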
2. Tables
Figure: Combined Line Chart and Table for Monthly Costs and Revenues at Gossamer Industries
◦ Table Design Principles:
◦ Avoid using vertical lines in a table unless they are necessary
for clarity.
◦ Horizontal lines are generally necessary only for separating
column titles from data values or when indicating that a
calculation has taken place.
◦ Crosstabulation: A useful type of table for describing data of two variables.
◦ PivotTable: A crosstabulation in Microsoft Excel.
Crosstabulation of Quality Rating and Meal Price for 300 Los Angeles Restaurants
                Meal Price
Quality Rating  $10–19  $20–29  $30–39  $40–49  Total
Good                42      40       2       0     84
Very Good           34      64      46       6    150
Excellent            2      14      28      22     66
Total               78     118      76      28    300
The greatest number of restaurants in the sample (64) have a very good
rating and a meal price in the $20–29 range.
Only two restaurants have an excellent rating and a meal price in the $10–19
range.
The right and bottom margins of the crosstabulation give the frequencies of
quality rating and meal price separately.
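As a check on how the margins relate to the cells, the sketch below rebuilds the row and column totals from the twelve cell counts in the restaurant crosstabulation. The variable names are illustrative only, and the price labels use plain hyphens.

```python
# Cell counts from the crosstabulation: one row per quality rating,
# one column per meal price range.
meal_prices = ["$10-19", "$20-29", "$30-39", "$40-49"]
counts = {
    "Good":      [42, 40, 2, 0],
    "Very Good": [34, 64, 46, 6],
    "Excellent": [2, 14, 28, 22],
}

# Right margin: frequency of each quality rating.
row_totals = {rating: sum(row) for rating, row in counts.items()}
# Bottom margin: frequency of each meal price range.
column_totals = [sum(column) for column in zip(*counts.values())]
grand_total = sum(row_totals.values())

# Cell with the greatest number of restaurants.
largest = max(
    ((rating, price, count)
     for rating, row in counts.items()
     for price, count in zip(meal_prices, row)),
    key=lambda cell: cell[2],
)

print(row_totals, column_totals, grand_total, largest)
```

The computed margins match the Total row and Total column of the table, and the largest cell is the Very Good rating in the $20–29 range.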
3. Charts
◦ Scatter chart:
Sample Data for the San Francisco Electronics Store
      No. of Commercials  Sales ($100s)
Week  x                   y
1     2                   50
2     5                   57
3     1                   41
4     3                   54
5     4                   54
6     1                   38
7     5                   63
8     3                   48
9     4                   59
10    2                   46
• Copy the data in the file Electronics to a new Excel worksheet in columns A through C and rows 1 through 11.
1: Select cells B2:C11
2: Click the Insert tab in the Ribbon
3: Click the Insert Scatter (X,Y) or Bubble Chart button in the Charts group
4: When the list of scatter chart subtypes appears, click the Scatter button
5: Click the Design tab under the Chart Tools Ribbon
6: Click Add Chart Element in the Chart Layouts group
   Select Chart Title, and click Above Chart
   Click on the text box above the chart, and replace the text with Scatter Chart for the San Francisco Electronics Store
7: Click Add Chart Element in the Chart Layouts group
   Select Axis Title, and click Primary Horizontal
   Click on the text box under the horizontal axis, and replace “Axis Title” with Number of Commercials
8: Click Add Chart Element in the Chart Layouts group
   Select Axis Title, and click Primary Vertical
   Click on the text box next to the vertical axis, and replace “Axis Title” with Sales ($100s)
9: Right-click on one of the horizontal grid lines in the body of the chart, and click Delete
10: Right-click on one of the vertical grid lines in the body of the chart, and click Delete
Figure: Scatter Chart for the San Francisco Electronics Store
Trendline: provides an approximation of the relationship between the variables
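To show what a trendline summarizes, the sketch below fits a straight line to the ten weeks of sample data by ordinary least squares. It assumes a linear trendline; the slides do not say which trendline type is used, and the function name `fit_line` is hypothetical.

```python
# Sample data for the San Francisco Electronics Store.
commercials = [2, 5, 1, 3, 4, 1, 5, 3, 4, 2]       # x: number of commercials
sales = [50, 57, 41, 54, 54, 38, 63, 48, 59, 46]   # y: sales in $100s


def fit_line(x, y):
    """Return (slope, intercept) of the least-squares line y = intercept + slope * x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    slope = s_xy / s_xx
    intercept = y_bar - slope * x_bar
    return slope, intercept


slope, intercept = fit_line(commercials, sales)
print(f"sales = {intercept:.2f} + {slope:.2f} * commercials")
```

Under that assumption the fitted line is roughly sales = 36.15 + 4.95 × commercials, so each additional commercial is associated with about $495 more in weekly sales.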
◦ Line chart:
Figure: Scatter Chart and Line Chart for Monthly Sales Data
◦ Sparkline: Special type of line chart:
- Minimalist type of line chart that can be placed directly into a cell in
Excel.
- Contains no axes; displays only the line for the data.
- Takes up very little space and can be effectively used to provide
information on overall trends for time series data.
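The idea of a sparkline can be imitated in plain text. The sketch below is not Excel's sparkline feature: it maps each value of a hypothetical monthly series onto one of eight Unicode block characters, so the overall trend fits in the space of a single cell. The function name `sparkline` and the data are assumptions made for illustration.

```python
BARS = "▁▂▃▄▅▆▇█"  # eight block characters, lowest to highest


def sparkline(values):
    """Return a one-line text sparkline with one character per value."""
    low, high = min(values), max(values)
    span = (high - low) or 1  # avoid dividing by zero for a flat series
    return "".join(
        BARS[int((v - low) / span * (len(BARS) - 1))] for v in values
    )


# Hypothetical monthly sales figures for one product.
monthly_sales = [12, 14, 13, 17, 21, 20, 24, 27, 26, 30, 33, 35]
print(sparkline(monthly_sales))
```

Like a sparkline in a worksheet cell, the output has no axes or labels and conveys only the shape of the series.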
Bar Charts: Use horizontal bars to display the magnitude of the
quantitative variable.
Column Charts: Use vertical bars to display the magnitude of the
quantitative variable.
Bar and column charts are very helpful in making comparisons
between categorical variables.
PivotCharts in Excel:
PivotChart: To summarize and analyze data with both a
crosstabulation and charting, Excel pairs PivotCharts with
PivotTables.