0% found this document useful (0 votes)

5 views34 pages

Data Analytics

The document introduces data analytics, explaining its importance in decision-making through various types of analysis: descriptive, diagnostic, predictive, and prescriptive. It illustrates these concepts with a case study of BeanBrew Café, demonstrating how data-driven insights can improve sales and customer satisfaction. Additionally, it highlights Python's role in data analytics, showcasing libraries like NumPy, Pandas, Matplotlib, Seaborn, and SciPy for data manipulation and visualization.

Uploaded by

meejanani

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views34 pages

Data Analytics

Uploaded by

meejanani

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 34

Data

Analytic
s
From Data to
Decisions
Introduction
to Data
Analytics
• What is data ? Data refers to raw facts and
figures collected from various sources.
• What is data analytics ? Data analytics is
the process of looking at data to find useful
information, patterns, or trends that help us
make better decisions.
• It involves:
1. Collecting the right data
2. Processing and organizing it
3. Analyzing it using statistical or
computational techniques
4. Visualizing results using charts, graphs,
dashboards, etc.
Types of Data
Analytics
The Tale of
BeanBrew
Café
How data saved the coffee
empire
Once upon a time in a bustling city, a cozy little coffee shop called
BeanBrew Café was famous for its rich espresso and warm
pastries. For years, it was the go-to spot for students, professionals,
and tourists alike. But in early 2024, the owner, Maya, noticed
something troubling—sales were slipping.

Instead of panicking, Maya turned to data.

Descriptive Analysis:
“What’s happening?”
Maya’s first step was to look at the numbers. She
pulled sales data from the past year and used
descriptive analysis:
• Sales had dropped 20% over 3 months.
• Afternoon sales were down the most.
• The number of returning customers was
decreasing.
Maya visualized this with charts showing declining
foot traffic and monthly revenue dips. Something
was definitely off.
Diagnostic Analysis:
“Why is it happening?”
Now that she knew what was happening, Maya
asked: Why ? She conducted a diagnostic analysis:
• She compared weather patterns (rainy
afternoons!)
• She looked at online reviews — people
complained about long wait times in the
afternoon.
• She found a correlation: wait times ⬆️=
customer satisfaction ⬇️.
She confirmed a strong link between staff shortage
and lost sales.
Predictive Analysis:
“What could happen
next?”
Worried the trend might continue, Maya used
predictive analysis .
She fed her sales and staffing data into a machine
learning model, which forecasted:
• If no changes were made, she'd lose 30% more
sales over the next quarter.
• However, adding one extra barista in the
afternoon could reverse the trend.
She also predicted hot drink sales would spike in the
upcoming rainy season
Prescriptive Analysis:
“What should I do
about it?”
Maya wanted a strategy, not just insights . Using
prescriptive analysis, she:
• Simulated different staffing schedules.
• Modeled pricing strategies (e.g., happy hour
deals at 3–5 p.m.).
• Found the optimal plan: hire a part-time barista,
offer a rainy-day discount, and promote mobile
orders.
The model showed this would increase profits by
15% and reduce customer complaints.
70-
80%
of people use Python for Data
Analysis
Python is the top choice for data analytics because it is popular, easy to learn,
and supported by a large community. Its rich ecosystem of powerful libraries
like pandas, numpy, matplotlib, seaborn and scikit-learn makes data handling,
visualization, and machine learning simple. Python is versatile, free, and open-
source, used across many industries. Plus, Python skills are highly in demand in
the job market, making it a valuable tool for data professionals.
Libraries used in
Python for Data
Analytics
NumPy
• It stands for Numerical Python.
• NumPy is a powerful library in Python used
for:
1. Working with arrays (especially multi-
dimensional arrays)
2. Performing mathematical and logical
operations on data efficiently
3. Serving as the base for libraries like
Pandas, Scikit-learn, etc.
Creating arrays
import numpy as np

# 1D array
array_1d = np.array([10, 20, 30])
print("1D Array:", array_1d)

# 2D array
array_2d = np.array([[1, 2, 3], [4, 5, 6]])
print("2D Array:", array_2d)

print("Dimensions:", array_2d.ndim)
print("Shape:", array_2d.shape)
print("Data Type:", array_2d.dtype)
Indexing and Slicing arrays
# Create a 1D array
arr = np.array([10, 20, 30, 40, 50])
print(arr[0])
print(arr[0:3])

# Create a 2D array
arr2d = np.array([
[1, 2, 3],
[4, 5, 6],
[7, 8, 9]
])
print(arr2d[0, 1])
print(arr2d[0])
print(arr2d[:, 0])
print(arr2d[1:, 1:])
Flatten, Reshape and
Transpose
arr = np.array([
[1, 2, 3],
[4, 5, 6]
])

print("Original array: ", arr)

print("Flattened:", arr.flatten())

print("Reshaped :", arr.reshape(3, 2))

print("Transposed:", arr.transpose())
Pandas
• Pandas is an open-source Python library that
provides powerful and easy-to-use data
structures for data analysis and
manipulation.
• At the core of Pandas are two primary data
structures:
1. Series: A one-dimensional labeled array.
2. DataFrame: A two-dimensional, tabular
data structure with labeled rows and
columns
Creating Series and
DataFrames
import pandas as pd
#Series
data = [10, 20, 30, 40, 50]

series = pd.Series(data)
print(series)

#DataFrame
data = {
'Name': ['A', 'B’, C'],
'Age': [25, 30, 35],
'City': ['New York', 'Los Angeles', 'Chicago']
}

df = pd.DataFrame(data)
print(df)
Creating Series and
DataFrames
import pandas as pd
#Series
data = [10, 20, 30, 40, 50]

series = pd.Series(data)
print(series)

#DataFrame
data = {
'Name': ['A', 'B’, C'],
'Age': [25, 30, 35],
'City': ['New York', 'Los Angeles', 'Chicago']
}

df = pd.DataFrame(data)
print(df)
Summary methods
data = {
'Name': ['A', 'B’, C', 'D’, E’, F'],
'Age': [25, 30, 35, 40, 22, 28],
'Salary': [50000, 60000, 70000, 80000, 45000, 52000]
}

df = pd.DataFrame(data)

print(df.head())

print(df.tail())

print(df.info())

print(df.describe())
loc[ ] and iloc[ ]
data = {
'Name': ['John', 'Emma', 'Liam'],
'Age': [28, 24, 31]
}
df = pd.DataFrame(data)

print("Using loc: ")

print(df.loc[0, 'Name’])
print(df.loc[:, 'Age’])

print("Using iloc: ")

print(df.iloc[0, 0])
print(df.iloc[0:2, 0])
Handling missing data
data = {
'Name': ['John', 'Emma', None, 'Liam'],
'Age': [28, None, 22, 31],
'City': ['New York', 'Los Angeles', 'Chicago', None]
}
df = pd.DataFrame(data)

print(df.isnull())
print(df.dropna())

df['Name'] = df['Name'].fillna('Unknown')
mean_age = df['Age'].mean()

df['Age'] = df['Age'].fillna(mean_age)
Matplotlib
• Matplotlib is an open-source Python library
used for creating a variety of charts and
graphs.
• At the core of Matplotlib is:
1. Figure: The overall window or page that
holds the plot(s).
2. Axes: The individual plot or graph within
the figure where data is visualized.
• Matplotlib makes it easy to generate common
plots such as line graphs, scatter plots,
Creating a plot
import numpy as np
import matplotlib.pyplot as plt

x = np.array([1, 2, 3, 4])
y = np.array([10, 20, 25, 30])

plt.plot(x, y)

plt.title("Simple Line Plot with NumPy")

plt.xlabel("X-axis")
plt.ylabel("Y-axis")

plt.show()
Subplots
x = np.array([1, 2, 3, 4])
y1 = np.array([10, 20, 30, 40])
y2 = np.array([40, 30, 20, 10])
y3 = np.array([5, 15, 10, 25])

# Create 3 subplots (3 rows, 1 column)

fig, axs = plt.subplots(3, 1, figsize=(6, 9))

# First subplot
axs[0].plot(x, y1)

# Second subplot
axs[1].plot(x, y2)

plt.show()
Seaborn
• Seaborn is an open-source Python library
built on top of Matplotlib.
• It is designed for creating attractive and
informative statistical graphics with ease.
• It provides a high-level interface for drawing
visually appealing and complex plots using
fewer lines of code.
• Seaborn makes it easy to visualize
relationships in data, explore patterns, and
enhance plots created with Matplotlib.
Creating a simple plot
import seaborn as sns
import matplotlib.pyplot as plt

tips = sns.load_dataset("tips")

sns.scatterplot(data=tips, x="total_bill", y="tip", hue="time")

plt.title("Total Bill vs Tip")

plt.xlabel("Total Bill ($)")
plt.ylabel("Tip ($)")

plt.show()
SciPy
• SciPy stands for Scientific Python. It's an
open-source Python library used for scientific
and technical computing.
• It's built on top of NumPy and provides
additional functionality.
• Think of SciPy as a powerful extension of
NumPy — NumPy gives you arrays and
basic math, SciPy gives you advanced tools
to analyze and solve complex problems.
Simple hypothesis testing
from scipy.stats import ttest_rel

before = [2.9, 3.0, 2.5, 2.6, 3.2]

after = [3.1, 3.2, 2.7, 2.8, 3.4]

t_stat, p_value = ttest_rel(before, after)

print("t-statistic:", t_stat)
print("p-value:", p_value)
Conclusion
• Python transforms data analytics from a
complex challenge into a powerful
opportunity.
• With its intuitive libraries and dynamic
tools, it not only simplifies data
processing and visualization but also
unlocks deeper insights.
• This fuels smarter decisions and spark
innovation—making it the ultimate
catalyst for success across
industries.
Thank
you very
much!
Presented by Srivarshan

Data Analysis With Python
No ratings yet
Data Analysis With Python
29 pages
Datascience With Answers
100% (1)
Datascience With Answers
36 pages
data science
No ratings yet
data science
42 pages
Experiment No: 1 Introduction To Data Analytics and Python Fundamentals Page-1/11
No ratings yet
Experiment No: 1 Introduction To Data Analytics and Python Fundamentals Page-1/11
8 pages
datascience
No ratings yet
datascience
26 pages
NumPy and Pandas Tutorial
No ratings yet
NumPy and Pandas Tutorial
8 pages
Business Analytics
No ratings yet
Business Analytics
33 pages
Data Analysis Lab - Final - 23-24
No ratings yet
Data Analysis Lab - Final - 23-24
11 pages
Chapter 2. Data Analysis and Processing - Full
No ratings yet
Chapter 2. Data Analysis and Processing - Full
49 pages
unit-3(FODS)
No ratings yet
unit-3(FODS)
34 pages
De&v Lab Manual
No ratings yet
De&v Lab Manual
91 pages
prac2
No ratings yet
prac2
11 pages
NumPy and Pandas (1)
No ratings yet
NumPy and Pandas (1)
12 pages
Usage of NumPy for Numerical Data in Detail
No ratings yet
Usage of NumPy for Numerical Data in Detail
52 pages
Python Ca22
No ratings yet
Python Ca22
14 pages
DOC-20250315-WA0005.
No ratings yet
DOC-20250315-WA0005.
29 pages
Data Analyst Course
No ratings yet
Data Analyst Course
8 pages
AA MDM MST
No ratings yet
AA MDM MST
8 pages
PythonDASE_2025 Version1 (1)
No ratings yet
PythonDASE_2025 Version1 (1)
44 pages
Report
No ratings yet
Report
18 pages
Python Data Analyst Handbook Guide_byom_cybertechie
No ratings yet
Python Data Analyst Handbook Guide_byom_cybertechie
57 pages
prac2
No ratings yet
prac2
11 pages
Python for Data Analysis
No ratings yet
Python for Data Analysis
84 pages
NumPy, Pandas, MatplotLib,Seaborn, ScikitLearn (SkLearn)
No ratings yet
NumPy, Pandas, MatplotLib,Seaborn, ScikitLearn (SkLearn)
14 pages
DevOps Session 3 Pandas.pptx
No ratings yet
DevOps Session 3 Pandas.pptx
33 pages
Exploratory Data Analysis-1
No ratings yet
Exploratory Data Analysis-1
10 pages
final dev record
No ratings yet
final dev record
49 pages
EDA Document
No ratings yet
EDA Document
13 pages
Data Analytics
No ratings yet
Data Analytics
36 pages
Pierian Data - Python For Finance & Algorithmic Trading Course Notes
No ratings yet
Pierian Data - Python For Finance & Algorithmic Trading Course Notes
11 pages
DAP_3_module
No ratings yet
DAP_3_module
62 pages
Data Minds - Data Science Curriculum 2023 V2
No ratings yet
Data Minds - Data Science Curriculum 2023 V2
15 pages
Financial Analytics With Python
100% (1)
Financial Analytics With Python
40 pages
EXP1-siddhant gupta (23_SE_148)
No ratings yet
EXP1-siddhant gupta (23_SE_148)
17 pages
Data Analytics With Python Lecture 1
No ratings yet
Data Analytics With Python Lecture 1
23 pages
L6 and 7-Data Preprocessing-coding
No ratings yet
L6 and 7-Data Preprocessing-coding
34 pages
Universal Data Analytics Algorithm
No ratings yet
Universal Data Analytics Algorithm
51 pages
Python Ds
No ratings yet
Python Ds
22 pages
Learneverythingai
No ratings yet
Learneverythingai
9 pages
Pandas: A Foundational Python Library For Data Analysis and Statistics
100% (3)
Pandas: A Foundational Python Library For Data Analysis and Statistics
9 pages
Part A
No ratings yet
Part A
24 pages
Python Notes by Prof T
No ratings yet
Python Notes by Prof T
10 pages
dav 2 unit
No ratings yet
dav 2 unit
55 pages
Data Analytics and Reporting - Notes Unit 1 and 2
No ratings yet
Data Analytics and Reporting - Notes Unit 1 and 2
11 pages
Data Science I: Charles C.N. Wang
No ratings yet
Data Science I: Charles C.N. Wang
68 pages
Mdad - Numpy ML
No ratings yet
Mdad - Numpy ML
85 pages
Enache 1
No ratings yet
Enache 1
6 pages
FDS RECORD-1-4
No ratings yet
FDS RECORD-1-4
18 pages
Numpy Basics Introduction To
No ratings yet
Numpy Basics Introduction To
35 pages
Comprehensive Guide Data Exploration Sas Using Python Numpy Scipy Matplotlib Pandas
100% (1)
Comprehensive Guide Data Exploration Sas Using Python Numpy Scipy Matplotlib Pandas
12 pages
ML File Updated
No ratings yet
ML File Updated
60 pages
DEV Lab Record
No ratings yet
DEV Lab Record
46 pages
Vibhin Pro
No ratings yet
Vibhin Pro
36 pages
Practical_1
No ratings yet
Practical_1
5 pages
Data Analysis Using Python Day_1 to Day_4
No ratings yet
Data Analysis Using Python Day_1 to Day_4
30 pages
3 - Pandas
No ratings yet
3 - Pandas
87 pages
2.1 - Introduction To Data Analytics
No ratings yet
2.1 - Introduction To Data Analytics
32 pages
Python CA2
No ratings yet
Python CA2
11 pages
NAC.pdf (1)
No ratings yet
NAC.pdf (1)
23 pages
Illuminating Data: A hands on guide to data visualization in R
From Everand
Illuminating Data: A hands on guide to data visualization in R
Eman Ahmad
No ratings yet
Data Science with R: Beginner to Expert
From Everand
Data Science with R: Beginner to Expert
Narayana Nemani
No ratings yet
LINUX
No ratings yet
LINUX
12 pages
misbelief
No ratings yet
misbelief
31 pages
Chapter 29 The Stream API _ Java_ The Complete Reference, Eleventh Edition, 11th Edition
No ratings yet
Chapter 29 The Stream API _ Java_ The Complete Reference, Eleventh Edition, 11th Edition
37 pages
important_question[1]
No ratings yet
important_question[1]
2 pages
Flask Cheatsheet _ CodeWithHarry
No ratings yet
Flask Cheatsheet _ CodeWithHarry
4 pages
Contact Book Project
No ratings yet
Contact Book Project
19 pages
1308061019061511
No ratings yet
1308061019061511
28 pages
DSP Ict - 1
No ratings yet
DSP Ict - 1
1 page
Top 24 C Interview Questions PDF
No ratings yet
Top 24 C Interview Questions PDF
6 pages
Lecture 09
100% (1)
Lecture 09
35 pages
MATH1048 Linear Algebra 1 Exam 2014
No ratings yet
MATH1048 Linear Algebra 1 Exam 2014
9 pages
Prefunctional Checklists
100% (1)
Prefunctional Checklists
37 pages
ZINSER CNC5010 Brochure English
No ratings yet
ZINSER CNC5010 Brochure English
2 pages
M150 - Installation Confirmation
No ratings yet
M150 - Installation Confirmation
7 pages
Source IP Continus ICMP
No ratings yet
Source IP Continus ICMP
3 pages
Windows Event Log Cheat Sheet
No ratings yet
Windows Event Log Cheat Sheet
6 pages
HPXN200-1-Jul-Dec2024-FA1-CZ-V2-15042024 (2)
No ratings yet
HPXN200-1-Jul-Dec2024-FA1-CZ-V2-15042024 (2)
4 pages
MPPSC Prelims: Micro Syllabus Important Topics
No ratings yet
MPPSC Prelims: Micro Syllabus Important Topics
8 pages
En
No ratings yet
En
87 pages
Requirment Enginering
No ratings yet
Requirment Enginering
22 pages
Algorithms Worksheet 1 Algorithms and flowcharts (4)
No ratings yet
Algorithms Worksheet 1 Algorithms and flowcharts (4)
3 pages
Machine Learning Application Image Processing
No ratings yet
Machine Learning Application Image Processing
3 pages
You Might Have Heard Thread by Aartthetrader Jul 4, 23 From Rattibha
No ratings yet
You Might Have Heard Thread by Aartthetrader Jul 4, 23 From Rattibha
13 pages
Bluetooth® Audio System: Operating Instructions Manual de Instrucciones
No ratings yet
Bluetooth® Audio System: Operating Instructions Manual de Instrucciones
72 pages
DI02011011
No ratings yet
DI02011011
6 pages
Tyler Schwantes: Bachelor of Management Information Systems, December, 2012
No ratings yet
Tyler Schwantes: Bachelor of Management Information Systems, December, 2012
4 pages
The Effect of Using Smart Elearning App On The Academic Achievement of Eighthgrade Students - 2023 - Modestum LTD
No ratings yet
The Effect of Using Smart Elearning App On The Academic Achievement of Eighthgrade Students - 2023 - Modestum LTD
11 pages
Lecture 4
No ratings yet
Lecture 4
16 pages
Jobs (2013) : Plot Summary
No ratings yet
Jobs (2013) : Plot Summary
4 pages
Flatpack2 Wallbox
No ratings yet
Flatpack2 Wallbox
4 pages
SDS5032E (V) : User Manual
No ratings yet
SDS5032E (V) : User Manual
90 pages
Difference Between Switchings and TDM-FDM
No ratings yet
Difference Between Switchings and TDM-FDM
4 pages
Website Design of Job Description Based On Isco-08 and Calculation of Employee Total Needs Based On Work Load
No ratings yet
Website Design of Job Description Based On Isco-08 and Calculation of Employee Total Needs Based On Work Load
10 pages
Seiwa Explorer3 User Manual English
No ratings yet
Seiwa Explorer3 User Manual English
118 pages

Data Analytics

Uploaded by

Data Analytics

Uploaded by

Data

Instead of panicking, Maya turned to data.

print("Original array: ", arr)

print("Reshaped :", arr.reshape(3, 2))

print("Using loc: ")

print("Using iloc: ")

plt.title("Simple Line Plot with NumPy")

# Create 3 subplots (3 rows, 1 column)

sns.scatterplot(data=tips, x="total_bill", y="tip", hue="time")

plt.title("Total Bill vs Tip")

before = [2.9, 3.0, 2.5, 2.6, 3.2]

t_stat, p_value = ttest_rel(before, after)

You might also like