Marketing and Retail Analysis
Marketing and Retail Analysis
Yaresh Vijayasundaram
Post Graduate Program in Data Science and
Business Analytics
BATCH: PGPDSBA.O.SEP22.A
CONTENT
Objective: The objective of this analysis was to identify the underlying buying
patterns of customers, provide insightful information about customer behaviour, and
recommend customized marketing strategies for different customer segments.
Inferences :
Inferences :
Inferences :
Inferences :
Inference:
Inference:
Inference:
Inference:
Inference:
Inference:
• Deal Size Analysis: The majority of the deals fall into the small and medium categories, with counts of
1,246 and 1,349, respectively. This indicates that the company primarily engages in transactions of
moderate to smaller deal sizes.
• Order Status Analysis: The significant count of 2,541 successfully shipped orders reflects the company's
outstanding order fulfilment capabilities and commitment to delivering orders promptly.
• Quality Ordered Analysis: The bins with quality orders of 21, 28, 35, and 42 have the highest counts,
indicating their popularity among customers. With counts ranging from 576 to 631, these quantities meet
the demands of a significant portion of customers.
• Sales Analysis: Sales showed a positive trend and grew significantly from 2018 (3,353,014) to 2019
(4,669,925), suggesting that the business performed well and experienced an increase in customer
demand. However, since we only have sales data until May for 2020 (1,737,283), it is difficult to make
conclusive statements about the entire year.
03. Customer Segmentation using RFM analysis
(4 segments)
• What is RFM?
• What all parameters used, and assumptions made?
• Showcase the KNIME workflow image.
• What results are there in the output table head?
RFM
RFM analysis is a customer
segmentation technique commonly
used in marketing and retail to analyse
and categorize customers based on
their purchasing behaviour. RFM
stands for Recency, Frequency, and
Monetary Value, which are three key
metrics used to evaluate customer
engagement and profitability.
PARAMETERS
• Monetary: Calculated by multiplying the price each with the quantity ordered. It represents the monetary
value or revenue generated by each customer.
• Recency: Determined by subtracting the order date from a fixed reference date, such as 01-06-2020. It
represents how recently a customer made a purchase.
• Frequency: Calculated as the count of customer names. It represents how often a customer has made
purchases.
• Auto-binning: Customers are segmented into four categories based on their RFM scores - High, Moderate,
Low, and Very Low. This segmentation helps in categorizing customers based on their value and behaviour.
These parameters are utilized in RFM analysis to evaluate customer behaviour, identify customer segments,
and make data-driven marketing and sales decisions.
ASSUMPTION:
• Higher Monetary value indicates a higher spending customer: The assumption is that customers who
generate higher monetary value through their purchases are likely to be more valuable and potentially more
profitable for the business.
• Recent purchases are more indicative of customer engagement: The assumption is that customers who have
made purchases more recently are likely to be more engaged with the company and its offerings. They may
have higher potential for repeat purchases or upselling/cross-selling opportunities.
• Higher Frequency of purchases reflects customer loyalty: The assumption is that customers who make
purchases more frequently are more loyal to the company. They may have a stronger connection to the
brand, higher customer satisfaction, and a higher likelihood of recommending the company to others.
KNIME
WORKFLOW
Tool Used
Output table head
04. Inferences from RFM Analysis and identified segments:
The top 5 best customers are determined by calculating the RFM score, which is the sum of
scores for Monetary, Recency, and Frequency. Higher RFM scores indicate more valuable
customers.
Customers at Risk of Churning: Identifying Potential Churners
The customers with very low to low recency, high to moderate monetary value, and high to
moderate purchase frequency are at an increased risk of churning.
Top 5 Lost customer
Loyal customer refers to customers who are consistent and repeat purchases over
time. They may not necessarily have the highest monetary value, but they
demonstrate a strong commitment to the brand by repeatedly choosing to do
business with the company.
05. RECOMMENDATIONS:
Customer Retention Strategies: Engage and retain customers who are at risk of
churning. Offer personalized incentives, loyalty programs, or exclusive promotions
to encourage their continued loyalty