Unit 3 Descriptive Statistics

Uploaded by

Sridevi R Batch 3

WHAT IS DESCRIPTIVE ANALYTICS?

Descriptive analytics is the process of using current and historical data to identify trends and relationships.

Descriptive analytics is relatively accessible and likely something your organization uses daily. Basic statistical software, such as Microsoft Excel, or data visualization tools, such as Google Charts and Tableau, can help parse data, identify trends and relationships between variables, and display that information visually.
Data analytics can be broken into four key types:
● Descriptive, which answers the question, “What happened?”
● Diagnostic, which answers the question, “Why did this happen?”
● Predictive, which answers the question, “What might happen in the future?”
● Prescriptive, which answers the question, “What should we do next?”
What Are the Advantages of Descriptive Analytics?
Now, let’s look at the stand-out benefits of descriptive analytics.
● It’s easy to do: Descriptive analysis doesn’t require great expertise or
experience in statistical methods or analytics.
● There are a lot of tools available: There is a cornucopia of analytics tools
to choose from, products that do most of the heavy lifting. That also helps
explain why descriptive analytics is easy to perform.
● It answers the most common business performance questions: Most
stakeholders and salespeople want to know things like "How are we doing?" or
"What should we be doing differently?" Descriptive analytics provides the data
needed to answer those questions efficiently, no matter when or how often
they're asked.
But, like any other tool, descriptive analysis isn’t perfect. Here are the two
chief drawbacks:
● It’s limited to simple analysis: Descriptive analysis examines the relationship
between a handful of variables, and that’s all.
● It tells you what, but not why: Descriptive analysis reports events as they
happened, not why they happened or what could possibly happen next.
Descriptive vs. Predictive vs. Prescriptive Analytics

Summary
– Descriptive analysis: What happened?
– Predictive analysis: What’s going to happen?
– Prescriptive analysis: What should happen?
Function
– Descriptive analysis: It uses data mining and data aggregation to discover historical data.
– Predictive analysis: It looks at historical data and analyzes past data trends to predict what could happen.
– Prescriptive analysis: It takes the conclusions gleaned from descriptive and predictive analysis and recommends the best future course of action.
Pros
– Descriptive analysis: It’s easy to employ in daily operations. Little experience is needed.
– Predictive analysis: It’s a valuable forecasting tool.
– Prescriptive analysis: It offers critical insights into making the best, most informed decisions.
Cons
– Descriptive analysis: It offers a limited view, and doesn’t go beyond the data’s surface.
– Predictive analysis: It needs lots of historical data to work. It will never be 100% accurate.
– Prescriptive analysis: It requires a lot of past data and often cannot account for all possible variables.
Shape – Center – Spread

• When we gather data, we want to uncover the “information” in it. One easy way to do that is to think of “Shape – Center – Spread”.
• Shape – What is the shape of the histogram?
• Center – What is the mean or median?
• Spread – What is the range or standard deviation?
Chapter 3 - Key Terms
• Measures of Central Tendency, The Center
– Mean (µ, population; x̄, sample)
– Weighted Mean
– Median
– Mode
(Note comparison of mean, median, and mode)
Chapter 3 - Key Terms
• Measures of Dispersion, The Spread
– Range
– Variance (Note the computational difference between σ² and s².)
– Standard deviation
– Interquartile range
Chapter 3 - Key Terms
• Measures of Association
– Coefficient of correlation, r
  Direction of the relationship: direct (r > 0) or inverse (r < 0)
  Strength of the relationship:
  When r is close to 1 or –1, the linear relationship between x and y is strong.
  When r is close to 0, the linear relationship between x and y is weak.
  When r = 0, there is no linear relationship between x and y.
– Coefficient of determination, r²
  The percent of total variation in y that is explained by variation in x.
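To make r and r² concrete, here is a minimal sketch in Python. The x and y values are hypothetical and chosen only for illustration, and the helper name correlation is an assumption, not something from the slides.

```python
import math

def correlation(xs, ys):
    """Pearson correlation coefficient r for paired values."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical paired data, for illustration only.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

r = correlation(x, y)
print(round(r, 4))       # 0.7746 (r > 0, a direct relationship)
print(round(r ** 2, 2))  # 0.6 (60% of the variation in y is explained by x)
```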
The Center: Mean
• Mean
– Arithmetic average = (sum of all values)/(number of values)
» Population: µ = (Σxi)/N
» Sample: x̄ = (Σxi)/n
Be sure you know how to get the value easily from
your calculator and computer software.
Problem: Calculate the average number of truck shipments from the
United States to five Canadian cities for the following data given in
thousands of bags:
Montreal, 64.0; Ottawa, 15.0; Toronto, 285.0;
Vancouver, 228.0; Winnipeg, 45.0 (Ans: 127.4)
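The problem above can be checked with a few lines of Python. This is only a sketch; the variable names are illustrative.

```python
# Shipments in thousands of bags, taken from the problem above.
shipments = {
    "Montreal": 64.0,
    "Ottawa": 15.0,
    "Toronto": 285.0,
    "Vancouver": 228.0,
    "Winnipeg": 45.0,
}

# Arithmetic mean = (sum of all values) / (number of values).
mean_shipments = sum(shipments.values()) / len(shipments)
print(round(mean_shipments, 1))  # 127.4
```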
The Center: Weighted Mean
• When what you have is grouped data, compute
the mean using µ = (Σwixi)/Σwi
Problem: Calculate the average profit from truck shipments, United
States to Canada, for the following data given in thousands of bags
and profits per thousand bags:
Montreal: 64.0 thousand bags at $15.00
Ottawa: 15.0 thousand bags at $13.50
Toronto: 285.0 thousand bags at $15.50
Vancouver: 228.0 thousand bags at $12.00
Winnipeg: 45.0 thousand bags at $14.00

(Ans: $14.04 per thousand bags)
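A short Python sketch of the weighted mean for this problem, using the shipment volumes as the weights wi and the profits as the values xi. Variable names are illustrative.

```python
# Weights: shipments in thousands of bags (Montreal, Ottawa, Toronto,
# Vancouver, Winnipeg). Values: profit in dollars per thousand bags.
bags = [64.0, 15.0, 285.0, 228.0, 45.0]
profit = [15.00, 13.50, 15.50, 12.00, 14.00]

# Weighted mean = (sum of w_i * x_i) / (sum of w_i).
weighted_mean = sum(w * x for w, x in zip(bags, profit)) / sum(bags)
print(round(weighted_mean, 2))  # 14.04
```

For comparison, the unweighted average of the five profit figures is $14.00; the weighted result is slightly higher because Toronto, the largest shipment, also has the highest profit per thousand bags.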

The Center: Median
• To find the median:
1. Put the data in an array.
2A. If the data set has an ODD number of numbers, the median is the
middle value.
2B. If the data set has an EVEN number of numbers, the median is
the AVERAGE of the middle two values.
(Note that the median of an even set of data values is not
necessarily a member of the set of values.)

• The median is particularly useful if there are outliers in the data set, which otherwise tend to sway the value of an arithmetic mean.
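The median steps above can be sketched in Python as follows. The function name median is illustrative; the five-value data set is the shipment data from the mean problem, and the four-value set simply drops Winnipeg to show the even case.

```python
def median(values):
    """Median following the steps above: sort, then take the middle."""
    ordered = sorted(values)  # Step 1: put the data in an array.
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Step 2A: odd count, the median is the middle value.
        return ordered[mid]
    # Step 2B: even count, average the two middle values.
    return (ordered[mid - 1] + ordered[mid]) / 2

odd_set = [64.0, 15.0, 285.0, 228.0, 45.0]
even_set = [64.0, 15.0, 285.0, 228.0]

print(median(odd_set))   # 64.0
print(median(even_set))  # 146.0 (not a member of the data set)
```

For the five shipment values the median is 64.0 while the mean is 127.4, which shows how the two large values (Toronto and Vancouver) pull the mean upward.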
The Center: Mode

• The mode is the most frequent value.

• While there is just one value for the mean
and one value for the median, there may be
more than one value for the mode of a data
set.
• The mode tends to be less frequently used
than the mean or the median.
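Because a data set can have more than one mode, a sketch using Python's standard statistics.multimode (available from Python 3.8) returns every most-frequent value. The data values are hypothetical.

```python
import statistics

# Hypothetical data: 2 and 3 each appear twice, so there are two modes.
values = [1, 2, 2, 3, 3, 4]

modes = statistics.multimode(values)
print(modes)  # [2, 3]
```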
Shape: The “shape” of the data is called its “distribution”
• If mean = median = mode, the shape of the distribution is
symmetric.
• If mode < median < mean, the shape of the distribution
trails to the right and is positively skewed.
• If mean < median < mode, the shape of the distribution
trails to the left and is negatively skewed.
• Distributions of various “shapes” have different
properties and names such as the “normal” distribution,
which is also known as the “bell curve” (among
mathematicians it is called the Gaussian Distribution).
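The three comparison rules above can be sketched as a small Python helper. The function name describe_shape and the data sets are hypothetical, and the comparison is only a rule of thumb: it returns "inconclusive" when none of the three orderings holds.

```python
import statistics

def describe_shape(values):
    """Classify shape by comparing mean, median, and mode."""
    mean = statistics.mean(values)
    median = statistics.median(values)
    mode = statistics.mode(values)
    if mean == median == mode:
        return "symmetric"
    if mode < median < mean:
        return "positively skewed"
    if mean < median < mode:
        return "negatively skewed"
    return "inconclusive"

# Hypothetical data sets, for illustration only.
print(describe_shape([1, 2, 3, 3, 3, 4, 5]))       # symmetric
print(describe_shape([1, 2, 2, 3, 4, 5, 12]))      # positively skewed
print(describe_shape([1, 8, 9, 10, 11, 11, 12]))   # negatively skewed
```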
Normal Distribution
So, if Average = 3500, Raw score = 4500, and SD = 2000, therefore Z = (4500 – 3500)/2000 = +0.5.
[Figure: normal curve; labels shown: “Platykurtic”, “68.26%”]
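The z-score in this example is the raw score's distance from the average, measured in standard deviations. A minimal Python sketch follows; the function name z_score is illustrative. The second calculation assumes that the 68.26% label refers to the share of a normal distribution lying within one standard deviation of the mean, which is about 68.27% when computed directly.

```python
import math

def z_score(raw, mean, sd):
    """Number of standard deviations the raw score lies from the mean."""
    return (raw - mean) / sd

z = z_score(4500, 3500, 2000)
print(z)  # 0.5

# Share of a normal distribution within one standard deviation of the mean.
within_one_sd = math.erf(1 / math.sqrt(2))
print(round(within_one_sd, 4))  # 0.6827
```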
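Non-Normal Distribution: Negative Skew
[Figure: negatively skewed curve with the mode, median, and mean marked]
Non-Normal Distribution: Positive Skew
[Figure: positively skewed curve with the mode, median, and mean marked]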
The Spread: Range
• The range is the distance between the smallest
and the largest data value in the set.
• Range = largest value – smallest value
• Sometimes range is reported as an interval,
anchored between the smallest and largest data
value, rather than the actual width of that
interval.
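Both ways of reporting the range can be sketched in Python, reusing the shipment data from the mean problem (thousands of bags).

```python
shipments = [64.0, 15.0, 285.0, 228.0, 45.0]

smallest, largest = min(shipments), max(shipments)

# Range as a width: largest value minus smallest value.
data_range = largest - smallest
print(data_range)           # 270.0

# Range as an interval anchored at the smallest and largest values.
print((smallest, largest))  # (15.0, 285.0)
```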
The Spread: Variance
• Variance is one of the most frequently used
measures of spread,
– for population, σ² = Σ(xi – µ)²/N = [Σxi² – (Σxi)²/N]/N
– for sample, s² = Σ(xi – x̄)²/(n – 1) = [Σxi² – (Σxi)²/n]/(n – 1)
• The right side of each equation is often used as a computational shortcut.
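As a sketch, the Python below computes variance two ways, from the squared deviations and from a common shortcut based on the sum of squares, and checks that they agree. It reuses the shipment data from the mean problem; treating those five values first as a population and then as a sample is only for illustration.

```python
import statistics

data = [64.0, 15.0, 285.0, 228.0, 45.0]
n = len(data)
mean = sum(data) / n

# Definitional form: sum of squared deviations from the mean.
ss_dev = sum((x - mean) ** 2 for x in data)
pop_var = ss_dev / n           # population: divide by N
sample_var = ss_dev / (n - 1)  # sample: divide by n - 1

# Shortcut form: sum of squares minus (sum)^2 / n.
ss_short = sum(x * x for x in data) - sum(data) ** 2 / n
pop_var_short = ss_short / n
sample_var_short = ss_short / (n - 1)

print(round(pop_var, 2), round(pop_var_short, 2))
print(round(sample_var, 2), round(sample_var_short, 2))

# The standard library gives the same results.
print(round(statistics.pvariance(data), 2), round(statistics.variance(data), 2))
```

The only computational difference between σ² and s² is the divisor: N for a population, n – 1 for a sample.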
The Spread: Standard Deviation
• Since variance is given in squared units, we
often find uses for the standard deviation,
which is the square root of variance:
– for a population, σ = √σ²
– for a sample, s = √s²
Be sure you know how to get the values easily from
your calculator and computer software.
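One way to get these values from software is Python's standard statistics module, sketched below with the shipment data from the mean problem. pstdev treats the data as a population and stdev treats it as a sample.

```python
import math
import statistics

data = [64.0, 15.0, 285.0, 228.0, 45.0]

pop_sd = statistics.pstdev(data)    # square root of the population variance
sample_sd = statistics.stdev(data)  # square root of the sample variance

print(round(pop_sd, 2), round(sample_sd, 2))

# Each standard deviation is the square root of the matching variance.
print(math.isclose(pop_sd, math.sqrt(statistics.pvariance(data))))    # True
print(math.isclose(sample_sd, math.sqrt(statistics.variance(data))))  # True
```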
Relative Position - Quartiles
• One of the most frequently used quantiles is the quartile.
• Quartiles divide the values of a data set into four subsets
of equal size, each comprising 25% of the observations.
• To find the first, second, and third quartiles:
– 1. Arrange the N data values into an array.
– 2. First quartile, Q1 = data value at position (N + 1)/4
– 3. Second quartile, Q2 = data value at position 2(N + 1)/4
– 4. Third quartile, Q3 = data value at position 3(N + 1)/4
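The position rule above can be sketched in Python. The slides do not say what to do when the position is not a whole number, so this sketch assumes linear interpolation between the two neighboring values; the function name quartile is illustrative, and the data is the shipment data from the mean problem.

```python
import statistics

def quartile(values, k):
    """k-th quartile (k = 1, 2, 3) at 1-based position k(N + 1)/4."""
    ordered = sorted(values)       # Step 1: arrange the data in an array.
    n = len(ordered)
    pos = k * (n + 1) / 4          # 1-based position in the array
    lower = int(pos)               # whole part of the position
    frac = pos - lower             # fractional part
    if lower < 1:
        return ordered[0]
    if lower >= n:
        return ordered[-1]
    # Assumption: interpolate linearly between neighboring values.
    return ordered[lower - 1] + frac * (ordered[lower] - ordered[lower - 1])

data = [64.0, 15.0, 285.0, 228.0, 45.0]
q1, q2, q3 = (quartile(data, k) for k in (1, 2, 3))
print(q1, q2, q3)  # 30.0 64.0 256.5
print(q3 - q1)     # 226.5 (the interquartile range)

# The standard library's "exclusive" method uses the same (N + 1) positions.
print(statistics.quantiles(data, n=4, method="exclusive"))  # [30.0, 64.0, 256.5]
```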
