TSK 1
TSK 1
The insights from your analysis will feed into the supermarket’s
strategic plan for the chip category in the next half year.
You have received the following email from your manager, Zilinka.
'Hi,
Welcome again to the team, we love having new graduates join us!
I just wanted to send a quick follow up from our conversation earlier with a few pointers around the key ar
of this task to make sure we set you up for success.
Below I have outlined your main tasks along with what we should be looking for in the data for each.
Examine transaction data – look for inconsistencies, missing data across the data set, outliers, correctly
identified category items, numeric data across all tables. If you determine any anomalies make the necessa
changes in the dataset and save it. Having clean data will help when it comes to your analysis.
Examine customer data – check for similar issues in the customer data, look for nulls and when you are ha
merge the transaction and customer data together so it’s ready for the analysis ensuring you save your file
along the way.
Data analysis and customer segments – in your analysis make sure you define the metrics – look at total sa
drivers of sales, where the highest sales are coming from etc. Explore the data, create charts and graphs a
well as noting any interesting trends and/or insights you find. These will all form part of our report to Juli
Deep dive into customer segments – define your recommendation from your insights, determine which
segments we should be targeting, if packet sizes are relative and form an overall conclusion based on your
analysis.
Make sure you save your analysis in the CSV files and your visualisations – we will need them for our repo
you could work on this analysis and send me your initial findings by end of next week that would be great.
Thanks,
Zilinka'
Here is your task
We need to present a strategic recommendation to Julia that is
supported by data which she can then use for the upcoming category
review. However, to do so, we need to analyse the data to understand
the current purchasing trends and behaviours. The client is
particularly interested in customer segments and their chip
purchasing behaviour. Consider what metrics would help describe the
customers’ purchasing behaviour.
We have chosen to complete this task in R, however you will also find
Python to be a useful tool in this piece of analytics. If you aren’t
familiar with R or Python we would recommend searching a few online
courses to help get you started. We have also provided an R solution
template if you want some assistance in getting through this
Task. Whilst its possible to complete the task in Excel you may find
the size of the data and the nature of the tasks is such that it is more
difficult to complete in Excel.
You will also want to derive extra features such as pack size and
brand name from the data and define metrics of interest to enable
you to draw insights on who spends on chips and what drives spends
for each customer segment. Remember, our end goal is to form a
strategy based on the findings to provide a clear recommendation
to Julia the Category Manager so make sure your insights can have
a commercial application.
Pro analytics Tip: While the data set would not normally be
considered large some operations may still take some time to run.