dataanexp-2

The document outlines the principles and stages of descriptive statistics using Python, emphasizing its importance in analyzing historical data to identify business performance and inefficiencies. It details various types of descriptive analysis, including measurements of frequency, central tendency, and dispersion, along with steps for conducting such analyses. Additionally, it provides definitions and calculations for mean, median, mode, and measures of dispersion, highlighting their significance in data interpretation.

Uploaded by

aniketagrawal810

EXPERIMENT - 2

Aim: - To understand the basics of descriptive statistics using Python.

Theory: -
1. Descriptive Analysis
Descriptive analysis refers to the interpretation of historical data to better understand changes
that occur in a business. It describes the use of a range of historic data to draw comparisons
with other reporting periods for the same company (i.e. quarterly or annually) or with others
within the same industry. Most commonly reported financial metrics are a product of
descriptive analytics, such as year-over-year (YOY) pricing changes, month-over-month sales
growth, the number of users, or the total revenue per subscriber. These measures all describe
what has occurred in a business during a set period.
Descriptive analysis is an important tool that can be used in different parts of any business
because it allows companies to understand how well they are performing and where there
may be inefficiencies. As such, corporate management can identify areas for improvement and
use the results to motivate different teams to implement changes for continued success.
It is the technique of identifying patterns and links by utilizing recent and historical data.
Because it identifies patterns and associations without going any further, it is frequently
referred to as the most basic data analysis.
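The period-over-period metrics mentioned above (for example month-over-month sales growth) can be sketched in a few lines of Python. This is only an illustration: the sales figures and the function name `period_over_period_growth` are assumptions made for the example, not data or code from the experiment.

```python
# Hypothetical monthly sales figures (illustrative only, not from the source).
monthly_sales = {"Jan": 200.0, "Feb": 220.0, "Mar": 209.0}


def period_over_period_growth(values):
    """Return the percentage change between each consecutive pair of values."""
    return [
        (current - previous) / previous * 100
        for previous, current in zip(values, values[1:])
    ]


growth = period_over_period_growth(list(monthly_sales.values()))

# Pair each growth figure with the month it refers to (the second of each pair).
for month, pct in zip(list(monthly_sales)[1:], growth):
    print(f"{month}: {pct:+.1f}%")
```

The same function works for quarterly or yearly series, since it only compares each value with the one before it.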

Stages in Descriptive Analysis

There are a few stages that companies follow in order to successfully implement descriptive
analysis into their business strategy. The following list highlights these stages along with a
description of each.

1. Identifying which metrics to analyse - Before beginning, it's important to decide which
metrics companies want to produce and the time frame for each, such as quarterly
revenue or annual operating profit.
2. Identifying and locating the data - This step requires locating all of the data required to
produce the result. This means going through all internal and external sources,
including databases.
3. Compiling the data - Once all the data is identified and located, the next step is to
prepare and compile it together. Part of the process here is to ensure that it's accurate
and to format everything into a single format.
4. Data analysis - Analysing datasets and figures means using different tools

Once all these steps are completed, it's important to present the results to the appropriate
stakeholders. Using appropriate visual aids, such as charts, graphics, videos, and other tools,
can be a great way to provide analysts, investors, management, and others with the insight they
need about the direction of the company.

2. Types of Descriptive Analysis
I. Measurements of Frequency:
Understanding how often a specific event or reaction occurs is essential for descriptive
analysis, providing quantitative insights through counts or percentages to reveal
patterns within the dataset.

II. Measures of Central Tendency:

In descriptive analysis, determining the central tendency is crucial, employing mean,
median, and mode to quantify the typical value and gain insights into the overall trend
or behaviour of observed variables.

III. Measures of Dispersion:

In certain scenarios, understanding how data is spread across a range is vital; measures
like range or standard deviation in descriptive analysis offer valuable information about
distribution patterns and variability within the dataset.

IV. Measures of Position:

An integral part of descriptive analysis involves determining a value's position relative
to others, employing metrics like quartiles and percentiles to offer nuanced insights into
the dataset's structure and identify trends or outliers.
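As a rough illustration of the first and last types above, the sketch below counts frequencies with `collections.Counter` and computes quartiles with `statistics.quantiles` (available from Python 3.8). The ratings list and the helper `percentile_rank` are assumptions made for the example. Note that `statistics.quantiles` uses its "exclusive" method by default, so other tools may report slightly different quartiles.

```python
from collections import Counter
from statistics import quantiles

ratings = [3, 5, 4, 3, 5, 5, 2, 4]  # illustrative survey ratings (assumed data)

# Measurements of frequency: counts and percentages for each distinct value.
counts = Counter(ratings)
percentages = {value: count / len(ratings) * 100 for value, count in counts.items()}

# Measures of position: the three quartile cut points.
q1, q2, q3 = quantiles(ratings, n=4)


def percentile_rank(values, x):
    """Percentage of values that are less than or equal to x."""
    return sum(v <= x for v in values) / len(values) * 100


for value in sorted(counts):
    print(f"rating {value}: {counts[value]} times ({percentages[value]:.1f}%)")
print("quartiles:", q1, q2, q3)
print("percentile rank of 4:", percentile_rank(ratings, 4))
```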

3. Steps to conduct Descriptive Analysis
Descriptive analysis is an important phase in data exploration that involves summarizing and
describing the primary properties of a dataset. It provides vital insights into the data’s frequency
distribution, central tendency, dispersion, and position. It assists researchers and
analysts in better understanding their data.
Conducting a descriptive analysis entails several critical phases, which include:
a) Data Collection
Before conducting any analysis, you must first collect relevant data. This process involves
identifying data sources, selecting appropriate data-collecting methods, and verifying that the
data acquired accurately represents the population or topic of interest. We can collect data
through surveys, experiments, observations, existing databases, or other methods.
b) Data Preparation
Data preparation is crucial for ensuring the dataset is clean, consistent, and ready for analysis.
This step covers the following tasks:
i. Data Cleaning: Handle missing values, exceptions, and errors in the dataset. Impute
missing values or apply appropriate statistical techniques for dealing with them.
ii. Data Transformation: Convert data into an appropriate format. Examples of this are
changing data types, encoding categorical variables, or scaling numerical variables.
iii. Data Reduction: For large datasets, try reducing their size by sampling or aggregation
to make the analysis more manageable.
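A minimal sketch of the cleaning and transformation tasks, using only the standard library. The `raw_ages` list is assumed data, and median imputation and min-max scaling are just one possible choice for each task, not a method prescribed by the text.

```python
from statistics import median

raw_ages = [23, None, 31, 27, None, 45]  # None marks a missing value (assumed data)

# Data cleaning: impute missing entries with the median of the observed values.
observed = [age for age in raw_ages if age is not None]
fill_value = median(observed)
cleaned = [fill_value if age is None else age for age in raw_ages]

# Data transformation: min-max scale the cleaned values to the interval [0, 1].
lowest, highest = min(cleaned), max(cleaned)
scaled = [(age - lowest) / (highest - lowest) for age in cleaned]

print("imputed with:", fill_value)
print("cleaned:", cleaned)
print("scaled:", [round(value, 3) for value in scaled])
```

The median is used for imputation here because it is less sensitive to extreme values than the mean; scaling assumes the cleaned values are not all identical, since that would make the denominator zero.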
c) Apply Methods
In this step, you will analyse and describe the data using a variety of methodologies and
procedures. The following are some common descriptive analysis methods:
i. Frequency Distribution Analysis: Create frequency tables or bar charts to show the
number or proportion of occurrences for each category for categorical variables.
ii. Measures of Central Tendency: Calculate numerical variables’ mean, median, and
mode to determine the centre or usual value.
iii. Measures of Dispersion: Calculate the range, variance, and standard deviation to
examine the dispersion or variability of the data.
iv. Measures of Position: Identify the position of a single value relative to the others.
d) Summary Statistics and Visualization
Descriptive statistics refers to a set of methods for summarizing and describing the main
characteristics of a dataset. Summarize the data through statistics and visualization. This step
involves the following tasks:
i. Summary Statistics: Summarize your findings clearly and concisely.
ii. Data Visualization: Use various charts and plots to visualize the data. Create
histograms, box plots, scatter plots, or line charts for numerical data. Use bar charts, pie
charts, or stacked bar charts for categorical data.
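The summary step can be sketched as follows. The function names `summarize` and `text_bar_chart` and both sample datasets are assumptions for illustration. The text bar chart is a stand-in for a real plotting library, which the source does not name.

```python
from collections import Counter
from statistics import mean, median, pstdev


def summarize(values):
    """Return a small dictionary of summary statistics for numeric data."""
    return {
        "count": len(values),
        "mean": mean(values),
        "median": median(values),
        "min": min(values),
        "max": max(values),
        "std_dev": pstdev(values),  # population standard deviation
    }


def text_bar_chart(categories):
    """Return one line per category, with a bar of '#' as long as its count."""
    counts = Counter(categories)
    return [
        f"{label:<8} {'#' * count} ({count})"
        for label, count in counts.most_common()
    ]


scores = [1, 3, 5, 6, 7]  # illustrative numeric data
regions = ["north", "south", "north", "east", "north", "south"]  # illustrative categories

for name, value in summarize(scores).items():
    print(f"{name:>8}: {value}")
print("\n".join(text_bar_chart(regions)))
```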

4. Central Tendency
Central tendencies in statistics are the numerical values used to represent the middle
or central value of a large collection of numerical data. These numerical values are called
central or average values in statistics.
Measures of Central Tendency: -
Mean:
Mean in general terms is used for the arithmetic mean of the data, but other than the arithmetic
mean there are geometric mean and harmonic mean as well that are calculated using different
formulas.
i. The most common measure of central tendency is the mean.
ii. Mean is also known as the simple average.
iii. It is denoted by the Greek letter μ for a population and by x̄ for a sample.
iv. We can find mean of a number of elements by adding all the elements in a dataset and
then dividing by the number of elements in the dataset.
v. It is the most common measure of central tendency but it has a drawback.
vi. The mean is affected by the presence of outliers.
vii. So, mean alone is not enough for making business decisions.
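The sensitivity of the mean to outliers can be seen in a small example. The salary figures below are assumed values chosen only to make the effect visible.

```python
from statistics import mean, median

salaries = [30, 32, 35, 38, 40]   # illustrative values without an outlier
with_outlier = salaries + [400]   # the same data plus one extreme value

print("mean without outlier:  ", mean(salaries))
print("mean with outlier:     ", round(mean(with_outlier), 2))
print("median without outlier:", median(salaries))
print("median with outlier:   ", median(with_outlier))
```

A single extreme value pulls the mean far away from the bulk of the data, while the median barely moves. This is why the mean is usually reported together with other measures.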

Types of Mean:
Mean can be classified into three types:
• Arithmetic Mean
• Geometric Mean
• Harmonic Mean
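The three types of mean can be compared with Python's `statistics` module (`geometric_mean` requires Python 3.8 or later). The data values are assumed for illustration.

```python
from statistics import mean, geometric_mean, harmonic_mean

values = [2, 4, 8]  # illustrative positive values

arithmetic = mean(values)            # sum of values divided by their count
geometric = geometric_mean(values)   # n-th root of the product of the values
harmonic = harmonic_mean(values)     # count divided by the sum of reciprocals

print(f"arithmetic mean: {arithmetic:.4f}")
print(f"geometric mean:  {geometric:.4f}")
print(f"harmonic mean:   {harmonic:.4f}")
```

For positive data the harmonic mean is never larger than the geometric mean, which in turn is never larger than the arithmetic mean.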

Median
The Median of any distribution is that value that divides the distribution into two equal parts
such that the number of observations above it is equal to the number of observations below it.
Thus, the median is called the central value of any given data either grouped or ungrouped.
• Median is the number which divides the dataset into two equal halves.
• To calculate the median, we have to arrange our dataset of n numbers in ascending
order. If n is odd, the median is the middle value; if n is even, it is the average of the
two middle values.
• Median is robust to outliers.
• So, for skewed distributions or when there is concern about outliers, the median may be
preferred.
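A hand-written version of the median calculation, shown as a sketch. The function name `median_of` is an assumption; in practice `statistics.median` does the same job.

```python
def median_of(values):
    """Return the median: sort, then take the middle value (or the average
    of the two middle values when the number of observations is even)."""
    if not values:
        raise ValueError("median_of() requires at least one value")
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


print(median_of([7, 1, 5]))     # odd number of observations
print(median_of([7, 1, 5, 3]))  # even number of observations
```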

Mode
The Mode is the value of the observation that has the maximum frequency. In other
words, it is the observation that occurs the maximum number of times in a dataset.
i. Mode of a dataset is the value that occurs most often in the dataset.
ii. Mode is the value that has the highest frequency of occurrence in the dataset.
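The mode can be found with `statistics.mode`, and `statistics.multimode` (Python 3.8 or later) returns every value tied for the highest frequency. The data lists are assumed for illustration.

```python
from statistics import mode, multimode

single_peak = [1, 2, 2, 3, 3, 3, 4]  # illustrative data with one mode
two_peaks = [1, 1, 2, 2, 3]          # illustrative data with two modes

most_common_value = mode(single_peak)
all_modes = multimode(two_peaks)

print("mode:", most_common_value)
print("modes when there is a tie:", all_modes)
```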

5. Dispersion in Statistics
Dispersion is the state of getting dispersed or spread. Statistical dispersion means the extent to
which numerical data is likely to vary about an average value. In other words, dispersion helps
to understand the distribution of the data.

Measures of Dispersion
In statistics, the measures of dispersion help to interpret the variability of data, i.e. to know how
homogeneous or heterogeneous the data is. In simple terms, they show how squeezed or
scattered the variable is.

Types of Measures of Dispersion

There are two main types of dispersion methods in statistics which are:

• Absolute Measure of Dispersion

• Relative Measure of Dispersion

Absolute Measure of Dispersion

An absolute measure of dispersion has the same unit as the original data set. The absolute
dispersion method expresses the variations in terms of the average of deviations of observations,
such as standard or mean deviations. It includes range, standard deviation, quartile deviation, etc.

The types of absolute measures of dispersion are:

i. Range: It is simply the difference between the maximum value and the minimum value
given in a data set. Example: 1, 3, 5, 6, 7 => Range = 7 - 1 = 6
ii. Variance: Subtract the mean from each value in the set, square each difference, add the
squares, and finally divide the sum by the total number of values in the data set to get the
variance. Variance: $\sigma^2 = \frac{\sum (X - \mu)^2}{N}$
iii. Standard Deviation: The square root of the variance is known as the standard
deviation, i.e. S.D. $= \sigma = \sqrt{\sigma^2}$.
iv. Quartiles and Quartile Deviation: The quartiles are values that divide a list of
numbers into quarters. The quartile deviation is half of the distance between the third
and the first quartile.
v. Mean and Mean Deviation: The average of numbers is known as the mean and the
arithmetic mean of the absolute deviations of the observations from a measure of central
tendency is known as the mean deviation (also called mean absolute deviation).
vi. Co-efficient of Dispersion: The coefficients of dispersion are calculated (along with
the measure of dispersion) when two series are compared, that differ widely in their
averages. The dispersion coefficient is also used when two series with different
measurement units are compared. It is denoted as C.D.
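The absolute measures above can be computed for the range example from the text (1, 3, 5, 6, 7). This is a sketch under two assumptions: the population formulas are used (division by N, as in the variance formula), and the mean deviation is taken about the mean. The quartiles come from `statistics.quantiles` with its default "exclusive" method, so other quartile conventions may give slightly different values.

```python
from statistics import mean, pvariance, pstdev, quantiles

data = [1, 3, 5, 6, 7]  # the range example from the text

data_range = max(data) - min(data)   # maximum minus minimum
mu = mean(data)                      # arithmetic mean
variance = pvariance(data)           # sum((x - mu)**2) / N
std_dev = pstdev(data)               # square root of the variance
q1, q2, q3 = quantiles(data, n=4)    # quartile cut points
quartile_dev = (q3 - q1) / 2         # half the interquartile distance
mean_dev = sum(abs(x - mu) for x in data) / len(data)  # mean absolute deviation

print("range:              ", data_range)
print("variance:           ", round(variance, 4))
print("standard deviation: ", round(std_dev, 4))
print("quartile deviation: ", quartile_dev)
print("mean deviation:     ", round(mean_dev, 4))
```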

Relative Measure of Dispersion
The relative measures of dispersion are used to compare the distribution of two or more data
sets. This measure compares values without units. Common relative dispersion methods
include:

i. Co-efficient of Range
ii. Co-efficient of Variation
iii. Co-efficient of Standard Deviation
iv. Co-efficient of Quartile Deviation
v. Co-efficient of Mean Deviation
The common coefficients of dispersion are:

C.D. in terms of             Coefficient of dispersion
Range                        C.D. = (Xmax – Xmin) ⁄ (Xmax + Xmin)
Quartile Deviation           C.D. = (Q3 – Q1) ⁄ (Q3 + Q1)
Standard Deviation (S.D.)    C.D. = S.D. ⁄ Mean
Mean Deviation               C.D. = Mean deviation ⁄ Average
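The four coefficients can be sketched as a single function. The name `coefficients_of_dispersion` is an assumption, the "Average" in the mean deviation coefficient is taken to be the arithmetic mean, and the population standard deviation is used. The formulas assume positive data, so that none of the denominators is zero.

```python
from statistics import mean, pstdev, quantiles


def coefficients_of_dispersion(values):
    """Return the unit-free coefficients of dispersion listed in the table."""
    mu = mean(values)
    q1, _, q3 = quantiles(values, n=4)
    mean_dev = sum(abs(x - mu) for x in values) / len(values)
    return {
        "range": (max(values) - min(values)) / (max(values) + min(values)),
        "quartile_deviation": (q3 - q1) / (q3 + q1),
        "standard_deviation": pstdev(values) / mu,  # coefficient of variation as a ratio
        "mean_deviation": mean_dev / mu,
    }


cd = coefficients_of_dispersion([1, 3, 5, 6, 7])
for name, value in cd.items():
    print(f"C.D. ({name}): {value:.4f}")
```

Because each coefficient is a ratio of two quantities in the same unit, the results can be compared across series measured in different units.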

6. Code Snippet:

Output:
