Case Study 1 Data Mart
INTRODUCTION:
Data Mart is my latest venture and I want your help to analyze its sales and
performance. In June 2020, large-scale supply changes were made at Data Mart.
All Data Mart products now use sustainable packaging methods in every single
step from the farm all the way to the customer.
I need your help to quantify the impact of this change on the sales performance for
Data Mart and its separate business areas.
SCHEMA USED: WEEKLY_SALES TABLE
segment | age_band |
1 | Young Adults |
2 | Middle Aged |
3 or 4 | Retirees |
5. Add a new demographic column using the following mapping for the first
letter in the segment values:
segment | demographic |
C | Couples |
F | Families |
6. Replace all null string values with an "unknown" string value in the
original segment column as well as the
new age_band and demographic columns
7. Generate a new avg_transaction column as the sales value divided
by transactions rounded to 2 decimal places for each record
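The mappings and steps 5 to 7 above can be sketched as a single query. This is a minimal illustration, not a reference solution: it runs SQL through Python's built-in sqlite3 module, the sample rows are made up, and it assumes segment values look like "C1" or "F3" (letter first, digit second) and that missing values are stored as the literal string "null". Only the columns named in the steps are included.

```python
import sqlite3

# Hypothetical miniature of weekly_sales: made-up rows, only the columns
# that steps 5-7 mention. Segment format ("C1", "F3", "null") is assumed.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE weekly_sales (segment TEXT, transactions INTEGER, sales INTEGER)"
)
conn.executemany(
    "INSERT INTO weekly_sales VALUES (?, ?, ?)",
    [("C1", 10, 255), ("F3", 4, 100), ("null", 3, 10), ("F4", 8, 36)],
)

CLEAN_SQL = """
SELECT
    -- Step 6: replace null strings (and real NULLs) with 'unknown'
    CASE WHEN segment IS NULL OR segment = 'null'
         THEN 'unknown' ELSE segment END AS segment,
    -- age_band from the digit (assumed to be the second character)
    CASE substr(segment, 2, 1)
        WHEN '1' THEN 'Young Adults'
        WHEN '2' THEN 'Middle Aged'
        WHEN '3' THEN 'Retirees'
        WHEN '4' THEN 'Retirees'
        ELSE 'unknown'
    END AS age_band,
    -- Step 5: demographic from the first letter
    CASE substr(segment, 1, 1)
        WHEN 'C' THEN 'Couples'
        WHEN 'F' THEN 'Families'
        ELSE 'unknown'
    END AS demographic,
    -- Step 7: multiply by 1.0 to avoid integer division before rounding
    ROUND(sales * 1.0 / transactions, 2) AS avg_transaction
FROM weekly_sales
ORDER BY rowid
"""

cleaned = conn.execute(CLEAN_SQL).fetchall()
for row in cleaned:
    print(row)
```

The `ELSE 'unknown'` branches cover both the "null" string and any unexpected segment value, so all three text columns are handled in one pass. Other SQL dialects may need `SUBSTRING` or `LEFT` in place of `substr`.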
B. Data Exploration
1. Which week numbers are missing from the dataset?
2. How many total transactions were there for each year in the dataset?
3. What are the total sales for each region for each month?
4. What is the total count of transactions for each platform?
5. What is the percentage of sales for Retail vs Shopify for each month?
6. What is the percentage of sales by demographic for each year in the dataset?
7. Which age_band and demographic values contribute the most to Retail
sales?
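As one possible starting point for the percentage questions, question 5 can be approached with conditional aggregation. The sketch below again runs SQL through Python's sqlite3 module on made-up rows. The table name clean_weekly_sales and the columns calendar_year and month_number are assumptions for illustration; they do not appear in the text above.

```python
import sqlite3

# Hypothetical cleaned table with made-up rows; calendar_year and
# month_number are assumed column names, not taken from the brief.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE clean_weekly_sales ("
    "calendar_year INTEGER, month_number INTEGER, platform TEXT, sales INTEGER)"
)
conn.executemany(
    "INSERT INTO clean_weekly_sales VALUES (?, ?, ?, ?)",
    [
        (2020, 6, "Retail", 900),
        (2020, 6, "Shopify", 100),
        (2020, 7, "Retail", 600),
        (2020, 7, "Retail", 150),
        (2020, 7, "Shopify", 250),
    ],
)

PLATFORM_SHARE_SQL = """
SELECT
    calendar_year,
    month_number,
    -- Share of each platform in the month's total sales, as a percentage
    ROUND(100.0 * SUM(CASE WHEN platform = 'Retail' THEN sales ELSE 0 END)
          / SUM(sales), 2) AS retail_pct,
    ROUND(100.0 * SUM(CASE WHEN platform = 'Shopify' THEN sales ELSE 0 END)
          / SUM(sales), 2) AS shopify_pct
FROM clean_weekly_sales
GROUP BY calendar_year, month_number
ORDER BY calendar_year, month_number
"""

platform_share = conn.execute(PLATFORM_SHARE_SQL).fetchall()
for row in platform_share:
    print(row)
```

The same pattern, with the CASE conditions switched to demographic values and the grouping reduced to the year, would apply to question 6.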