Business Analytics Chapter02

The document discusses descriptive statistics and provides examples of key concepts in data analysis including raw data, proper data sets, variables, populations and samples, categorical and quantitative data, and cross-sectional and time series data. It explains how data should be organized and structured to allow for effective analysis using features in Excel like sorting, filtering, and pivot tables. Various statistical terms are also defined to lay the groundwork for descriptive analysis techniques.

Uploaded by

ann camile maupay

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views

Business Analytics Chapter02

Uploaded by

ann camile maupay

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 22

Basic Business Analytics

using Excel
Chapter 02
Descriptive Statistics

1
Raw Data: Data stored in its smallest size
No: Yes:

Addresses Address City State Zip

313 173rd Blvd, Kent, WA 981215 313 173rd Blvd Kent WA 981215
316 66th Blvd, Kent, WA 981244 316 66th Blvd Kent WA 981244
4358 23rd St, Kent, WA 981225 4358 23rd St Kent WA 981225
965 151st St, Kent, WA 981162 965 151st St Kent WA 981162
7900 173rd Lane, Kent, WA 981266 7900 173rd Lane Kent WA 981266
4047 15th Ave, Kent, WA 981228 4047 15th Ave Kent WA 981228
4907 13th Ave, Kent, WA 981232 4907 13th Ave Kent WA 981232
3789 4th Blvd, Seattle, WA 981152 3789 4th Blvd Seattle WA 981152
2977 66th Lane, Seattle, WA 981171 2977 66th Lane Seattle WA 981171
3392 23rd St, Seattle, WA 981131 3392 23rd St Seattle WA 981131

Why?
Because it is easier to analyze data when it is stored in its smallest parts 2
Data:
• Textbook: Facts or figures collected, analyzed and summarized
for presentation and interpretation
• Data = all the unorganized raw data in a Proper Data Set

Transaction
Number Date Sales SalesRep
12568 12/1/2014 $19,161 Jo
12569 12/1/2014 $15,027 Gigi
12570 12/2/2014 $12,953 Chin
12571 12/2/2014 $12,670 Jo
12572 12/2/2014 $8,893 Gigi
12573 12/3/2014 $4,667 Chin
12574 12/3/2014 $20,272 Jo
12575 12/3/2014 $20,204 Gigi
12576 12/3/2014 $17,223 Chin
3
Data Types & Default Alignment in Excel

• Empty Cells  Not really a Data Type, but it is a "thing" in Excel that can sometimes cause
problems.
• **Refer to Empty Cells as "Empty Cells", not blanks.
• Why Default Alignment? Because Left means Excel thinks it is Text and Right means Excel thinks it is
a Number. This is important when dealing with data because some systems will mistakenly import
numbers as text. Numbers as text do not always behave like you expect (like not being added by
4
the SUM function. The Default Alignment is a visual cue that informs us about how Excel “sees” the
data.
Proper Data Set: Proper Table of Data
• A structure for your data set
necessary so that Excel Data
Transaction
Analysis features like Sort, Filter Number Date Sales SalesRep
and PivotTables will work 12568 12/1/2014 $19,161 Jo
correctly: 12569 12/1/2014 $15,027 Gigi
12570 12/2/2014 $12,953 Chin
1. Fields in first row (no empty 12571 12/2/2014 $12,670 Jo
cells) 12572 12/2/2014 $8,893 Gigi
12573 12/3/2014 $4,667 Chin
2. Records or Observations in rows 12574 12/3/2014 $20,272 Jo
3. Empty cells or Excel 12575 12/3/2014 $20,204 Gigi
12576 12/3/2014 $17,223 Chin
Row/Column Headers all the
way around Data Set
4. Try not to have empty cells in
data set 5
Terms for Proper Data Set
Primary Key / Variables
List of Unique Elements

Element = Entities on
which data are collected.
We are collecting data for
each Transaction Number.
Transaction Number is the
Element.

Each row is
a Record /
Observation 6

All 4 are called Fields (Column Headers)

Variable, Element, Observation
• Variable
• A characteristic or quantity of interest that can take on different values
• A Variable is also known as a “Field” or “Column Header” in Database terminology
• Example: Street address, City, State, Zip for a customer
• Element
• Entities on which data are collected
• Like collecting data for an Employee or Invoice Number
• Primary Key
• When the first column in a Proper Data Set contains a “Unique List” of Elements, it is called a
“Primary Key”.
• “Primary Key”, “Unique List of Elements”, “List of Unique Identifiers”, “Distinct List” are all synonyms
• The “Primary Key” assure that data collected for a give element is stored in one and only one
place.
7
• Observation or Record
• A set of values corresponding to a set of Variables (Fields) for a set of Elements
Proper Data Set with a Primary Key / List
of Unique Elements:
Proper Data Set:

8
Proper Data Set with NO Primary Key /
List of Unique Elements:
Proper Data Set: Using the PivotTable feature we can create a
Proper Data Set with a Primary Key (Unique
List of Products or Elements):

9
Variables
• Variable (from previous slide)
• A characteristic or quantity of interest that can take on different values
• Decision Variables
• Variables under the direct control of decision makers
• Example
• The “Quantity” Variable for a manufacturer. Managers can decide how many to make
each day.
• Random (uncertain variables) Variables:
• In general, variables that are outside of the decision makers control
• A quantity whose value is not known with certainty
• Example:
• Stock Price of Yahoo 10
• Number of units sold of a particular product
Variables and Variation If you own Yahoo Stock, you would be
interested in the Variation in the Variable
• Variation “Price (Adj Close)”.

• The difference in a variable measured over

observations
• Differences over time
• Differences between customers or products
• **We will have a numerical measure for
variation later…
• Roll of Descriptive Statistics:
• Collect “Past Observed Values for Variables”
or “Realizations of Variables” or “Raw Data”
or “Data”
• Analyze Data to gain a better understanding
of the variation and its impact on the
11
business setting/situation
Population and Sample
• Population
• All elements of interest
• Sample
• Subset of the population
• Random sampling
• A sampling method to gather a representative sample of the
population data.
• Each element comes from the same population (Target Population)
• Each element is selected independently (without bias)
12
Categorical and Quantitative Data
• Quantitative Data
• “Number Data” on which numeric and arithmetic operations, such as
addition, subtraction, multiplication, and division, can be performed.
• Discrete Quantitative Data: There are gaps between numbers, like
counting: 1, 2, 3…
• Continuous Quantitative Data: There are no gaps between numbers,
like weight, time, money. The number depends on the measurement
instrument.
• Categorical Data
• “Not Number Data”, like Product Names or “Yes” “No” Data on which arithmetic 13
operations cannot be performed.
Data Terminology
Cross-sectional Data Time Series Data
• Cross-sectional Data • Data collected over several time periods
• Data collected from several (Year, Month, Day, Hour…).
elements/entities at the same, or • Charts of time series data are common
approximately the same, point in time. in business and economics.
• Help analysts understand what
Sep 22, 2015 happened in the past, identify trends
Market Cap:
GOOG
426.88B
YHOO
28.62B
FB Industry
261.91B 277.63M
over time, and project future levels for
Employees: 57148 12500 10955 355 the time series.
Qtrly Rev Growth (yoy): 0.11 0.15 0.39 0.15
Revenue (ttm): 69.61B 4.87B 14.64B 132.20M
Gross Margin (ttm): 0.62 0.67 0.83 0.58
EBITDA (ttm): 22.62B 541.75M 6.38B 3.47M
Operating Margin (ttm): 0.26 0.02 0.32 0.01
Net Income (ttm): 14.39B 6.94B 2.72B N/A
EPS (ttm): 21.22 7.2 0.98 0
P/E (ttm): 29.34 4.22 94.47 33.33
PEG (5 yr expected): 1.22 -2.38 1.59 1.07
14
P/S (ttm): 6.26 6.02 18.39 3.74
Sources of Data
• Experimental study
• A variable of interest is first identified.
• Then one or more other variables are identified and controlled or manipulated so that data can
be obtained about how they influence the variable of interest.
• Nonexperimental study or observational study - Make no attempt to control the variables
of interest.
• A survey is perhaps the most common type of observational study.
• Existing Data Sets:
• Customer Lists
• Sales or Expense Lists
• Census Data
• Weather Data
• Government sources (data.gov) 15
• Purchase data from companies such as: Bloomberg, Dow Jones
Sort & Filter to Organize Data
Sort Filter
• Organize the Raw Data by sorting • Must have a Proper Data Set
• Example: Sort Sales biggest to • Filter Button in Data Ribbon
smallest • Great for querying a data set
• Sort Buttons in Data Ribbon (Extracting Observations / Records
• Sort columns one by one, with the from a Proper Data Set) to get a
“Major Sort” last. sub-set of data based on a set of
• Sort Dialog Box conditions or criteria
• Make sure that “Major Sort” on
top.
• Keyboard for Sort: Alt, D, S 16
Conditional Formatting to Visualizing Data
• Each cell in the highlighted range must get a logical test
that comes out TRUE (apply formatting) or FALSE (do NOT
apply formatting)
• Logical test can be created with built-in features or Logical
Formulas
• Great for visualizing data based on a set of conditions or
criteria
17
Frequency Distributions and
Column/Bar Charts for Categorical Data
• Frequency Distribution for Categorical Data is a tabular summary which:
1. Shows the number of observations (count or frequency) in each of a set
categories (unique list from data set)
2. Categories must be Collectively Exhaustive Categories (enough categories so
nothing is left out) and Mutually Exclusive Categories (no item can fit into more
than one category)
3. Goal is to is to provide information about frequencies (count)
• Relative Frequency Distribution
• Shows decimal value that represents "parts compared to the whole" (used in
chapter 4 for assigning probabilities)
• Percent Frequency Distribution
18
• Formats Relative Frequencies with Percent Number Format
Frequency Distributions and
Column/Bar Charts for Categorical Data
• Column/Bar Chart:
• Used to show Frequency Distribution or Relative/Percent Frequency
Distribution for Categorical Data
• Counts across categories. Height of columns convey count. Order of
categories conveys no info
• There are "gaps" between columns to indicate that the data is
categorical or a discrete quantitative variable (not a continuous
quantitative variable). Columns do not touch
19
Frequency Distributions and
Column/Bar Charts for Categorical Data
PivotTable: COUNTIFS function:

Web Site Frequency % Frequency Web Site Frequency % Frequency

amazon.com 11436 43.12% amazon.com 11436 43.12%
coloradoboomerangs.com 6380 24.05% coloradoboomerangs.com 6380 24.05%
ebay.com 5810 21.90% ebay.com 5810 21.90%
gel-boomerang.com 2898 10.93% gel-boomerang.com 2898 10.93%
Grand Total 26524 100.00% Total 26524 100.00%

Car Chart (Column on its side):

Boomerang Inc. 2015 Sales Frequency by Web Site

gel-boomerang.com 2898

ebay.com 5810

coloradoboomerangs.com 6380

20
amazon.com 11436
Histograms for Quantitative Data
• Histograms
• Used to show frequency distribution of continuous quantitative data
over a set of class intervals (lower and upper limit for each category)
• Column or Bar Charts where columns are touching to indicate that the
variable is continuous
• Columns touch to indicate that no numbers can fit between classes.
"No numbers can fit between columns - no gaps"
• Height of columns convey count
• Order of classes is important to help reveal shape of data, or
distribution of data. 21
Mean, Median, Mode
• Mean
• Arithmetic Mean: Add them up and divide by the count
• Good for quantitative data when there are not extreme values - extreme values can make the mean look too
big or too small (Median more representative of a typical value in that case)
• Use AVERAGE function
• Median
• Sort, then take the one in the middle. If count odd, take one in middle, if even, average middle two.
• Marks the point in the sorted list (an actual number) where 50% of the numbers are above and 50% of the
numbers are below
• Good for quantitative data when there are extreme values (like house prices and salaries)
• Use MEDIAN function
• Mode
• One that occurs most frequently (can be bimodal, multimodal)
• Good for Categorical Data (Nominal and Ordinal)
• Use MODE.SNGL for quantitative data and COUNTIF or PivotTable for Categorical or quantitative data. 22
MODE.SNGL will only show 1 mode if the data set is bi-modal or multi-modal. MODE.MULT can be used for
multiple modes.

The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
4/5 (6431)
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
4/5 (640)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
4/5 (1173)
Never Split the Difference: Negotiating As If Your Life Depended On It
From Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
4.5/5 (992)
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
4.5/5 (1853)
Grit: The Power of Passion and Perseverance
From Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
4/5 (650)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
4/5 (1267)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
4.5/5 (4102)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
4/5 (903)
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
4.5/5 (628)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
From Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
4/5 (1016)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
From Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
4.5/5 (361)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
From Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
4.5/5 (581)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
4.5/5 (297)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
4.5/5 (1138)
A Man Called Ove: A Novel
From Everand
A Man Called Ove: A Novel
Fredrik Backman
4.5/5 (5143)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
4.5/5 (943)
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
4/5 (100)
The Little Book of Hygge: Danish Secrets to Happy Living
From Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
3.5/5 (460)
Brooklyn: A Novel
From Everand
Brooklyn: A Novel
Colm Tóibín
3.5/5 (2126)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
From Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
4.5/5 (279)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
3.5/5 (2289)
The Art of Racing in the Rain: A Novel
From Everand
The Art of Racing in the Rain: A Novel
Garth Stein
4/5 (4360)
Yes Please
From Everand
Yes Please
Amy Poehler
4/5 (2010)
Bad Feminist: Essays
From Everand
Bad Feminist: Essays
Roxane Gay
4/5 (1090)
The Woman in Cabin 10
From Everand
The Woman in Cabin 10
Ruth Ware
3.5/5 (2787)
A Tree Grows in Brooklyn
From Everand
A Tree Grows in Brooklyn
Betty Smith
4.5/5 (2033)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
4/5 (2876)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
3.5/5 (233)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
From Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
4.5/5 (141)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
4.5/5 (244)
Wolf Hall: A Novel
From Everand
Wolf Hall: A Novel
Hilary Mantel
4/5 (4087)
On Fire: The (Burning) Case for a Green New Deal
From Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
4/5 (78)
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
3.5/5 (835)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
3.5/5 (918)
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
3.5/5 (144)
John Adams
From Everand
John Adams
David McCullough
4.5/5 (2546)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
4.5/5 (815)
Networking Basics
100% (1)
Networking Basics
53 pages
How Much Is The Principal If Monthly Payment Is $ 1,200.00
No ratings yet
How Much Is The Principal If Monthly Payment Is $ 1,200.00
5 pages
Excel Skills - Exercises - Loan Repayment Calculations & What-If Tools
No ratings yet
Excel Skills - Exercises - Loan Repayment Calculations & What-If Tools
2 pages
College of Computer Studies: Vision: Mission: Program Objectives
No ratings yet
College of Computer Studies: Vision: Mission: Program Objectives
6 pages
Basic Business Analytics Using Excel, Chapter 01
No ratings yet
Basic Business Analytics Using Excel, Chapter 01
21 pages
College of Computer Studies: 454 Rizal Ave Ext Cor 9th Ave, Grace Park, Caloocan
No ratings yet
College of Computer Studies: 454 Rizal Ave Ext Cor 9th Ave, Grace Park, Caloocan
7 pages
Big Data: Introduction To Terms, Concepts and Tools
No ratings yet
Big Data: Introduction To Terms, Concepts and Tools
23 pages
Highline Class, BI 348: Basic Business Analytics Using Excel Chapter 11: Monte Carlo Simulation
No ratings yet
Highline Class, BI 348: Basic Business Analytics Using Excel Chapter 11: Monte Carlo Simulation
35 pages
College of Computer Studies: 454 Rizal Ave Ext Cor 9th Ave, Grace Park, Caloocan
No ratings yet
College of Computer Studies: 454 Rizal Ave Ext Cor 9th Ave, Grace Park, Caloocan
9 pages
College of Computer Studies: 1.1 What Is HTML?
No ratings yet
College of Computer Studies: 1.1 What Is HTML?
4 pages
College of Computer Studies: 454 Rizal Ave Ext Cor 9th Ave, Grace Park, Caloocan
No ratings yet
College of Computer Studies: 454 Rizal Ave Ext Cor 9th Ave, Grace Park, Caloocan
5 pages
Understand THE Internet: Lesson 1
No ratings yet
Understand THE Internet: Lesson 1
8 pages
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
4/5 (45)
Little Women
From Everand
Little Women
Louisa May Alcott
4.5/5 (2369)
The Constant Gardener: A Novel
From Everand
The Constant Gardener: A Novel
John le Carré
4/5 (278)
Developing Trading Systems in RightEdge
No ratings yet
Developing Trading Systems in RightEdge
17 pages
Iste Stds Self Assessment-Talvy
No ratings yet
Iste Stds Self Assessment-Talvy
5 pages
Schneider LV5 16303
No ratings yet
Schneider LV5 16303
2 pages
FortiSwitch Compatible Transceivers
No ratings yet
FortiSwitch Compatible Transceivers
6 pages
FP4035
No ratings yet
FP4035
7 pages
KVS Librarian 2021 Exam Pattern and Syllabus
No ratings yet
KVS Librarian 2021 Exam Pattern and Syllabus
2 pages
DSA Assignment 4
No ratings yet
DSA Assignment 4
6 pages
Quality Control of Nuclear Medicine Instrumentation and Protocol
No ratings yet
Quality Control of Nuclear Medicine Instrumentation and Protocol
168 pages
leopard-datasheet(11)
No ratings yet
leopard-datasheet(11)
5 pages
SVM 1
No ratings yet
SVM 1
17 pages
Electrical Design & Implementation Engineer CV Word
No ratings yet
Electrical Design & Implementation Engineer CV Word
6 pages
Employee Information Form (Updated 5.2.2018
No ratings yet
Employee Information Form (Updated 5.2.2018
4 pages
Internet and Emails
No ratings yet
Internet and Emails
67 pages
ABR PPT Presentation
No ratings yet
ABR PPT Presentation
12 pages
Formulir Tanpa Judul (Jawaban)
50% (2)
Formulir Tanpa Judul (Jawaban)
6 pages
Basic Principles of Programming Languages
No ratings yet
Basic Principles of Programming Languages
40 pages
Community Detection in Social Media: Performance and Application Considerations
No ratings yet
Community Detection in Social Media: Performance and Application Considerations
40 pages
Babu M J - Resume
No ratings yet
Babu M J - Resume
1 page
Informatica Performance Optimization Techniques
No ratings yet
Informatica Performance Optimization Techniques
21 pages
100_Computer_Full_Forms
No ratings yet
100_Computer_Full_Forms
4 pages
Materi Informatika Dan Keterampilan Generik Kelas X
No ratings yet
Materi Informatika Dan Keterampilan Generik Kelas X
12 pages
Abb Ag: Application
No ratings yet
Abb Ag: Application
4 pages
Arena® 15.1: Arena Version 15.1 Provides Enhanced Capabilities For Your Business
No ratings yet
Arena® 15.1: Arena Version 15.1 Provides Enhanced Capabilities For Your Business
8 pages
Installing Jenkins On Windows
No ratings yet
Installing Jenkins On Windows
8 pages
Vs 2019 Community Workload
No ratings yet
Vs 2019 Community Workload
4 pages
ZQ200 USER MANUAL V2.2
No ratings yet
ZQ200 USER MANUAL V2.2
20 pages
Assignment No 3 Part B
No ratings yet
Assignment No 3 Part B
2 pages
UD36994B - Hik-Partner Pro OpenAPI - Developer Guide - V2.0 - 20240315
No ratings yet
UD36994B - Hik-Partner Pro OpenAPI - Developer Guide - V2.0 - 20240315
235 pages
Unit Test Paper of Digital Electronics
No ratings yet
Unit Test Paper of Digital Electronics
1 page

Business Analytics Chapter02

Uploaded by

Business Analytics Chapter02

Uploaded by

Basic Business Analytics

Addresses Address City State Zip

All 4 are called Fields (Column Headers)

• The difference in a variable measured over

Web Site Frequency % Frequency Web Site Frequency % Frequency

Car Chart (Column on its side):

Boomerang Inc. 2015 Sales Frequency by Web Site

You might also like