Major Project Report
Major Project Report
Abstract i
List of Tables ii
1 Introduction 1
1.1 Problem Statement . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Existing System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 Proposed System . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2 Review of Literature 5
3 Methodology Planning 7
3.1 Hardware Software requirement . . . . . . . . . . . . . . . . . . . . 7
3.2 Algorithm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
3.3 Code Implementation . . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.4 Work Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
4 Conclusion 18
5 Future scope 19
References 20
i
List of Tables
ii
List of Figures
iii
Chapter 1
Introduction
1
amount. Promo codes also provide customers with free shipping or gift-wrapping.
This marketing strategy essentially gives customers yet another reason to buy your
products.
Hence along with a successfull marketing strategy of generating coupons for
customer this system generates specialised custom coupons for customers. It anal-
yses the database containing customer’s purchase record and ans according to the
database after performing RFM analyses special coupons are generated for specific
group of customer.
2
customer engagement is lost re-engagement must be initiated. Further more the
current strategy of sending cold coupons, promo-code and offers to the customer
is not strategic.
3
database after performing RFM analyses special coupons are generated for specific
group of customer.
4
Chapter 2
Review of Literature
According to Blattberg, R.C., Kim, BD., Neslin, S.A. (2008). RFM Analysis.
In: Database Marketing. International Series in Quantitative Marketing, vol 18.
Springer, New York, NY. https://doi.org/10.1007/978-0-387-72579-61 2
Abstract Recency (R), Frequency (F), and Monetary Value (M) are the most
popular database marketing metrics used to quantify customer trans- action his-
tory. Recency is how recently the customer has purchases; frequency is how often
the customer purchases, and monetary value is the dollar value of the purchases.
RFM analysis classifies customers into groups according to their RFM measures,
and relates these classifications to behaviors such as the likelihood of responding to
a catalog or other offer. RFM analysis was prob- ably the first “predictive model”
used in database marketing.
How do you select customers for target mailing? Or whom should you send
your catalogs or direct mail offers to? The need to mail smarter is always among
the top concerns of direct marketers. The direct mail promotion that results in
sales to 2 Percent of the mailed universe is considered a success. Identifying and
targeting the customers who are most likely to respond are therefore of prime
concern.
Because of the nature of their businesses, direct marketers including cat- alogers
have been collecting customer data, analyzing them, and developing models for
several decades to improve their business performance. One pop- ular approach
used to improve mailing efficiency is the RFM – Recency, Fre- quency, Monetary
5
amount – model. The primitive form of the RFM model was used about 50 years
ago by catalogers of general merchandise. For exam- ple, as early as 1961, George
Cullinan promoted the use and understanding of RFM customer data analysis.
Recognizing his contribution in advancing the direct marketing industry, the DMA
(Direct Marketing Association) inducted him into the DMA Hall of Fame in 1989.
The core concept of the RFM model is based on the empirical evidence. Direct
marketers have found that the response to a mailing offer is hetero- geneous across
customers. And they also found that customers who haveresponded well in the
past are likely to respond in the future. More specif- ically, direct marketers have
found that customers’ purchase response can be predicted using their previous
purchase histories. The three most impor- tant variables to summarize customers’
purchase histories are recency (R), frequency (F), and monetary amount (M). That
is, using RFM measures for each customer, one can predict his or her propensity
to respond. Once identifying who is going to respond, the direct marketer sends
catalogs to customers with high propensity.
It encounters several scoring challenges in the measure of frequency and is rela-
tively sensitive, which leads to pulling apart customers who have identical behavior
at the lower quintiles, but group customers together whose buying behaviors have
significant differences (Alam and Khalifa, 2009).
There is another scoring method (behavior quintile scoring method) developed
by John Wirth (The founder of Woodworker’s Supply of New Mexico), which
sorts customers based on their behavior and thus may have different number of
customers in each quintile. The scoring scheme of frequency covers five intervals,
including 0 to 3 months, 4 to 6 months, 7 to 12 months, 13 to 24 months and
25+ months, which are coded as 5, 4, 3, 2 and 1, respectively. This method is
known as hard coding (McCarty and Hastak, 2007). For frequency, although this
scoring method appears to solve the sensitivity problems, it still encounters similar
problems as customer quintile method in the frequency measure.
6
Chapter 3
Methodology Planning
• RAM: 2 GB or higher
Software
• Python
7
3.2 Algorithm
Step 1
We will start with a dataset containing information about individual orders
performed on a platform. The data is represented in a transaction level i.e. each
row of the dataset contains features associated with a single transaction, such as
date, time, payment method, client id of the user that made the purchase, etc.
Step 2
The next step in building an RFM model is to assign Recency, Frequency and
Monetary values to each customer
• Recency is simply the amount of time since the customer’s most recent trans-
action (most businesses use days, though for others it might make sense to
use months, weeks or even hours instead).
• Monetary is the total amount that the customer has spent across all trans-
actions (during a defined period).
Step 3
The third step is to divide the customer list into tiered groups for each of the
three dimensions (R, F and M).
This results in 64 distinct customer a (4x4x4), into which customers will be
segmented. Three tiers can also be used (resulting in 27 segments); using more
8
Recency Frequency Monetary
R-Tier-1 (most recent) F-Tier-1 (most frequent) M-Tier-1 (highest spend)
R-Tier-2 F-Tier-2 M-Tier-2
R-Tier-3 F-Tier-3 M-Tier-3
R-Tier-4 (least recent) F-Tier-4 (only one transaction) M-Tier-4 (lowest spend)
than four, however, is not recommended (because the difficulty in use outweighs
the small benefit gain from the extra granularity).
Step 4
The fourth step is to select groups of customers to whom specific types of
communications will be sent, based on the RFM segments in which they appear.
It is helpful to assign names to segments of interest. Here are just a few
examples to illustrate:
• Best Customers – This group consists of those customers who are found in
R-Tier-1, F-Tier-1 and M-Tier-1, meaning that they transacted recently, do
so often and spend more than other customers. A shortened notation for
this segment is 1-1-1; we’ll use this notation going forward.
Marketers should assemble groups of customers most relevant for their partic-
ular business objectives and retention goals.
9
Step 5
The fifth step actually goes beyond the RFM segmentation itself: crafting
specific messaging that is tailored for each customer group. By focusing on the
behavioral patterns of particular groups, RFM marketing allows marketers to com-
municate with customers in a much more effective manner.
Again, here are just some examples for illustration, using the groups we named
above:
10
Marketing Strategies: These customers have demonstrated a high willingness
to pay. Consider premium offers, subscription tiers, luxury products, or value add
cross/up-sells to increase AOV. Don’t waste margin on discounts.
11
3.3 Code Implementation
12
13
Mail System
14
Email Received
Coupon on website
15
3.4 Work Flow
16
Figure 3.1: Project Roadmap
17
Chapter 4
Conclusion
The primary goal of this project is to develop a system that categorises the cus-
tomers on the basis of the recency - that is how recent did they purchase an item,
frequency - that is how frequent do they purchase and monetary - that is how much
money are they willing to spend. This categorisation allows the business owners
to develop unique and personalized marketing scheme for each of the cluster of
customer segments. This project makes use of RFM analyses as well as k means
clustering.
Furthermore this categorisation is used for generating coupons. These coupons
will be generated specifically with respect to customers purchase history and RFM
results leading to custom coupons for every group of customers.
18
Chapter 5
Future scope
19
References
20
Acknowledgement
I have a great pleasure to express my gratitude to all those who have I would
like to express my deepest appreciation to all those who provided me the pos-
sibility to complete this report. A special gratitude I give to our project guid,
Ms Ankita Nagmote, whose contribution in stimulating suggestions and encour-
agement, helped to coordinate project especially in writing this report and has
invested her full effort in guiding the team in achieving the goal..
A special thanks goes to the team, Pranjal Ambulkar, Naziya Inamdar, Khush-
numa Shaikh , who helped to assemble the project and gave suggestions to improve
the project. Last but not least, I have to appreciate the guidance given by other
supervisors as well as the panels especially in our project presentation that has
improved our presentation skills thanks to their comment and advice.
Finally, we are very thankful to Usha Mittal institute of technology, for giving
us a chance to complete our thesis project within the scheduled time.
Date:
Pranjal Ambulkar - 01
Naziya Inamdar - 24
Khushnuma Shaikh - 56
21