Case Study-1-Pattern Discovery in Supermarket Sales Transactions Using EDA

DataSets

Uploaded by

Harshitha Bandhakavi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

51 views3 pages

Case Study-1-Pattern Discovery in Supermarket Sales Transactions Using EDA

DataSets

Uploaded by

Harshitha Bandhakavi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Project Title: Pattern Discovery in Supermarket Sales Transaction using EDA

Technology Platform: ML with Azure Studio

Team Size: 4
Technical Domain: Exploratory Data Analysis
Business Domain: Retail Industry

Project Overview
Supermarkets are big business and they use data on a big scale. Originating in the US in the
1930s, supermarkets have since gradually taken over a bigger and bigger share of the retail
and grocery market. Giants like Wal-Mart, Aldi and Carrefour are among the largest retailers
in the world with revenues approaching the hundreds of billions. As such many have invested
heavily in big data, with analytics and data science forming a core part of their decision
making.
The growth of supermarkets in most populated cities are increasing and market competitions
are also high. Every product purchased, along with its price, is recorded in gargantuan
databases, with tables exceeding hundreds of billions of rows. Loyalty schemes, where
customers accumulate points by scanning their loyalty card at each purchase, allow the
company to stitch together a customer’s entire history of transactions, gaining more valuable
insights.

Dataset
The dataset is one of the historical sales of Supermarket Company which has recorded in 3
different branches for 3 months data. Predictive data analytics methods are easy to apply with
this dataset. The following table details the attribute information:

Page | 1
The dataset consists of data from 3 cities or 3 branches in Myanmar as given below-

a) Branch A (Yangoon)
b) Branch B (Mandalay)
c) Branch C (Naypyitaw)

Objective
Project teams need to explore & visualize the data to generate insights about supermarket
sales transactions of customers and also obtain inference about customer ratings.

Methodology
The methodology should include the following operations of Exploratory Data Analysis &
Clustering Analysis:
a. Import the dataset
b. Perform Univariate analysis to address the following queries:
 Question 1: What does the customer rating look like and is it skewed?
(Use normal distribution plot)
 Question 2: Is there any difference in aggregate sales across
branches?(Use bar graph)
 Question 3: Which is the most popular payment method used by
customers?(Use bar graph)

c. Perform Bi-variate analysis to address the following queries:

 Question 4: Does gross income affect the ratings that the customers
provide?(Use scatterplot)
 Question 5: Which branch is the most profitable?(Use Boxplot)
 Question 6: Is there any relationship between Gender and Gross
income?(Use Boxplot)
 Question 7: Is there any time trend in gross income? (Use line graph)
 Question 8: Which product line generates most income?(Use Bar plot)

d. Prepare pairwise plot (scatterplot matrix) to visualize all the bi-variate relationships
in the data.
e. Perform correlation analysis using heatmap.
f. Perform additional analysis to address the following queries:
 Question 9: What is the spending pattern of females and males and in
which category do they spend a lot?(Use countplot in Seaborn Python
package)
 Question 10: How many products are bought by customers?(Use
distribution plot)
 Question 11: Which day of the week has maximum sales?(Use countplot)
 Question 12: Which hour of the day is the busiest?(Use line plot)
 Question 13: Which product line should the supermarket focus on?(Use
bar plot)
 Question 14: Which city should be chosen for expansion and which
products should it focus on?(Use bar plot)

Use the dataset supermarket_sales.csv available under ‘Files’ section for the Project.

Page | 2
Project Outcome(s)
Project Teams need to explore the dataset and visualize the hidden data patterns and produce
valuable insights to highlight the key findings using Exploratory Data Analysis(EDA).

Page | 3

Nicholson Solution For Linear Algebra 7th Edition.
60% (5)
Nicholson Solution For Linear Algebra 7th Edition.
194 pages
Data-Driven Marketing: The 15 Metrics Everyone in Marketing Should Know
From Everand
Data-Driven Marketing: The 15 Metrics Everyone in Marketing Should Know
Mark Jeffery
3.5/5 (19)
Business Report Project SMDM Sonali Pradhan
100% (1)
Business Report Project SMDM Sonali Pradhan
56 pages
Data Analysis On BigMart Sales
67% (3)
Data Analysis On BigMart Sales
17 pages
ULC College Algebra and Problem Solving Course Syllabus Course Overview
No ratings yet
ULC College Algebra and Problem Solving Course Syllabus Course Overview
5 pages
Market Basket Analysis
No ratings yet
Market Basket Analysis
86 pages
i Ct 762 Group Report
No ratings yet
i Ct 762 Group Report
19 pages
Data Analysis
No ratings yet
Data Analysis
10 pages
497-Article Text-2287-1-10-20210802
No ratings yet
497-Article Text-2287-1-10-20210802
17 pages
Gaurav Upadhyay ML Project
No ratings yet
Gaurav Upadhyay ML Project
8 pages
National Institute of Technology Durgapur
No ratings yet
National Institute of Technology Durgapur
11 pages
Imbuido James MA5821 Ax2
No ratings yet
Imbuido James MA5821 Ax2
20 pages
Analysis of Superstore Database
No ratings yet
Analysis of Superstore Database
23 pages
Data Analytics Project Sem4
No ratings yet
Data Analytics Project Sem4
6 pages
Piyush Kumar Singh - Project Submission - Data Analytics
No ratings yet
Piyush Kumar Singh - Project Submission - Data Analytics
23 pages
SMDM Project Report Dipti
No ratings yet
SMDM Project Report Dipti
14 pages
Amazon Data Analysis with SQL (1)
No ratings yet
Amazon Data Analysis with SQL (1)
4 pages
Research Paper On Retail Data Analytics
No ratings yet
Research Paper On Retail Data Analytics
6 pages
Retail Sales Analytics Project
No ratings yet
Retail Sales Analytics Project
3 pages
Aaabgh Project
No ratings yet
Aaabgh Project
28 pages
Supermarket Sales Analysis 1
No ratings yet
Supermarket Sales Analysis 1
13 pages
sql capstone project
No ratings yet
sql capstone project
4 pages
FILE_2620
No ratings yet
FILE_2620
24 pages
DSML - Project Report - Group 3
No ratings yet
DSML - Project Report - Group 3
17 pages
A Product Network Analysis Using A Priori Algorithm For Extending The Market Basket in Retail
No ratings yet
A Product Network Analysis Using A Priori Algorithm For Extending The Market Basket in Retail
12 pages
Solution
No ratings yet
Solution
4 pages
Intro To BA
No ratings yet
Intro To BA
7 pages
Pranita Dane - IBM - Internship Project Submission - Data Analytics
No ratings yet
Pranita Dane - IBM - Internship Project Submission - Data Analytics
28 pages
Chapter 1: Introduction: 1.1 Background Theory
No ratings yet
Chapter 1: Introduction: 1.1 Background Theory
36 pages
final project ppt
No ratings yet
final project ppt
15 pages
REPORT
No ratings yet
REPORT
33 pages
ISE302 - IT Project Management
No ratings yet
ISE302 - IT Project Management
25 pages
CUSTOMER ANALYSIS_Report
No ratings yet
CUSTOMER ANALYSIS_Report
10 pages
Economic Data Analysis (Finance Analyst)
No ratings yet
Economic Data Analysis (Finance Analyst)
38 pages
Market And Retail Analysis Presentation-compressed-compressed
No ratings yet
Market And Retail Analysis Presentation-compressed-compressed
23 pages
rithika.ppt
No ratings yet
rithika.ppt
16 pages
Case Study Module 1
No ratings yet
Case Study Module 1
4 pages
AMAZON SALES ANALYSIS
No ratings yet
AMAZON SALES ANALYSIS
51 pages
Analysis Report-5 - 105839
No ratings yet
Analysis Report-5 - 105839
20 pages
Lo1 - 3
No ratings yet
Lo1 - 3
9 pages
Unit 5
No ratings yet
Unit 5
16 pages
Day 1
No ratings yet
Day 1
3 pages
Olist Kasyapa
No ratings yet
Olist Kasyapa
22 pages
DABI - Final Assignment - Arif - Shayekh
No ratings yet
DABI - Final Assignment - Arif - Shayekh
12 pages
Ali Shafi BSBA 2-A 6522 Sales Market Data
No ratings yet
Ali Shafi BSBA 2-A 6522 Sales Market Data
40 pages
IP Project Final
No ratings yet
IP Project Final
9 pages
Pankaj Soni Gamma 199 BA Assignment
No ratings yet
Pankaj Soni Gamma 199 BA Assignment
20 pages
Market Basket Analysis: Interim Progress Report (IPR)
No ratings yet
Market Basket Analysis: Interim Progress Report (IPR)
12 pages
br17 Final Project Report
No ratings yet
br17 Final Project Report
7 pages
Intern 23
No ratings yet
Intern 23
21 pages
Market Basket Analysis for a Supermarket
No ratings yet
Market Basket Analysis for a Supermarket
9 pages
Business Report Project - SMDM Group 10 16-March-2020
No ratings yet
Business Report Project - SMDM Group 10 16-March-2020
12 pages
SMDM-Project Report (Madhur Dhananiwala)
100% (2)
SMDM-Project Report (Madhur Dhananiwala)
43 pages
Integrating Data Mining and Predictive M
No ratings yet
Integrating Data Mining and Predictive M
5 pages
Market Basket Analysis For A Supermarket
No ratings yet
Market Basket Analysis For A Supermarket
9 pages
excel project
No ratings yet
excel project
2 pages
Analytics Roadmap
No ratings yet
Analytics Roadmap
30 pages
Enterprise Final Demo
No ratings yet
Enterprise Final Demo
8 pages
Projects PDF
No ratings yet
Projects PDF
12 pages
Market Basket Analysis
No ratings yet
Market Basket Analysis
12 pages
Group 9 Paper Presentation
No ratings yet
Group 9 Paper Presentation
24 pages
Data Science Project Ideas for Thesis, Term Paper, and Portfolio
From Everand
Data Science Project Ideas for Thesis, Term Paper, and Portfolio
Zemelak Goraga
No ratings yet
Atomic and Nuclear Physics Review Material
No ratings yet
Atomic and Nuclear Physics Review Material
172 pages
Chapter 2 - Manual Transmission P1
No ratings yet
Chapter 2 - Manual Transmission P1
29 pages
PGIS Practical
No ratings yet
PGIS Practical
19 pages
CLG 200-3 Page 16-29
No ratings yet
CLG 200-3 Page 16-29
15 pages
Voltage Dips Test System According To IEC 61000-4-11: Article
No ratings yet
Voltage Dips Test System According To IEC 61000-4-11: Article
6 pages
Akhil Hacked Files Bitcoin
No ratings yet
Akhil Hacked Files Bitcoin
2 pages
Computational & Functional Analysis of Special Functions with Arbitrary Parameters
No ratings yet
Computational & Functional Analysis of Special Functions with Arbitrary Parameters
22 pages
MS6001FA
No ratings yet
MS6001FA
14 pages
Earth's Rotation and Revolution
No ratings yet
Earth's Rotation and Revolution
48 pages
Vectors Vedantu
No ratings yet
Vectors Vedantu
11 pages
Flare System
No ratings yet
Flare System
68 pages
(123doc) Ophthalmic Microsurgical Suturing Techniques Part 2
No ratings yet
(123doc) Ophthalmic Microsurgical Suturing Techniques Part 2
15 pages
Lecture 01
No ratings yet
Lecture 01
35 pages
5 Light T
0% (1)
5 Light T
44 pages
Llambiasetal 2003 JSAES
No ratings yet
Llambiasetal 2003 JSAES
16 pages
Principles of Chemical Kinetics: James E. House
No ratings yet
Principles of Chemical Kinetics: James E. House
5 pages
Humidity
No ratings yet
Humidity
18 pages
Differential Pressure Calibrator With Master Gauge New
No ratings yet
Differential Pressure Calibrator With Master Gauge New
1 page
IEEE Paper Review On Wireless Security and Loopholes
No ratings yet
IEEE Paper Review On Wireless Security and Loopholes
4 pages
Course 2:programming With Perl: By: Jayesh H. Munjani (159250)
No ratings yet
Course 2:programming With Perl: By: Jayesh H. Munjani (159250)
64 pages
Share DC Circuit Lecture
No ratings yet
Share DC Circuit Lecture
7 pages
Core 321/4541: EN 1.4541, ASTM TYPE 321 / UNS S32100
No ratings yet
Core 321/4541: EN 1.4541, ASTM TYPE 321 / UNS S32100
8 pages
Industrial Training Institute List of Lesson Semester - 2: SR. No. Weekn O Lesso N No. Description Time Remark S
No ratings yet
Industrial Training Institute List of Lesson Semester - 2: SR. No. Weekn O Lesso N No. Description Time Remark S
4 pages
Jurnal Kombis
No ratings yet
Jurnal Kombis
14 pages
Indifference Curves
No ratings yet
Indifference Curves
18 pages
EE210 Hoja Datos Sensovant
No ratings yet
EE210 Hoja Datos Sensovant
5 pages
M Stage 5 p840 02 Afp - Jason
No ratings yet
M Stage 5 p840 02 Afp - Jason
16 pages
Applied Thermodynamics
No ratings yet
Applied Thermodynamics
10 pages