Michael Lewis - Influence of Loyalty Programs 2009
Michael Lewis - Influence of Loyalty Programs 2009
Loyalty programs have long been an important element Loyalty programs that base rewards on cumulative pur-
of customer relationship management for firms in travel- chasing are an explicit attempt to enhance retention. Such
related industries such as airlines, hotels, and rental cars. programs encourage repeat buying and thereby improve
Information technology that enables firms to practice retention rates by providing incentives for customers to pur-
individual-level marketing has facilitated the spread of loy- chase more frequently and in larger volumes. However,
alty programs into such diverse industries as gaming, finan- dynamically oriented promotions, such as loyalty programs,
cial services, and retailing (Deighton 2000). Accordingly, represent just one possible technique for increasing cus-
academic researchers have begun to study loyalty programs. tomer retention. Repeat buying may also be encouraged
Behaviorally oriented researchers, such as Soman (1998) through various means such as short-term discounts on mer-
and Kivetz and Simonson (2002), have studied the effect of chandise or reduced shipping charges. Therefore, it is
delayed incentives on consumer decisions. Zhang, Krishna, important to develop models that can simultaneously esti-
and Dhar (1999), Kim, Shi, and Srinivasan (2001), and mate the influence of dynamic and current factors on long-
Kopalle and Neslin (2003) have proposed analytical models term customer behavior. In this article, I report on a
to study the impact of loyalty programs in categories with methodology for assessing the relative impact of loyalty-
different structures. This study contributes to the literature based promotions, short-term promotions, and individual-
that is focused on empirically measuring response to loyalty level factors on customer purchasing over time.
programs (Drèze and Hoch 1998; Sharp and Sharp 1997). There is only limited and contradictory published empir-
ical work on the value of loyalty programs. A relevant study
by Sharp and Sharp (1997) analyzes individual-level data
*Michael Lewis is Assistant Professor of Marketing, Warrington College by using a one-period switching model to measure the abil-
of Business, University of Florida (e-mail: [email protected]). The ity of a loyalty program to alter normal repeat-purchase
author thanks the three anonymous JMR reviewers for their helpful rates. Unfortunately, the study’s results are inconclusive. In
comments.
contrast, Drèze and Hoch (1998) report on a category-
specific loyalty program that results in increases for both ment strategy in two respects. First, it is a means for assess-
the specific category and total store traffic. The contrasting ing the influence of dynamic promotions such as loyalty
findings are consistent with the lack of consensus on the programs that operate over an extended period. Second, by
ability of loyalty programs to increase customer retention controlling for the dynamic elements of a customer’s deci-
(see Dowling and Uncles 1997). It should be emphasized sion problem, the model removes a possible source of bias
that studies that question the value of loyalty programs and may provide better estimates of the effects of short-
(e.g., Dowling and Uncles 1997; Sharp and Sharp 1997) are term marketing tactics.
largely based on research that uses single-period switching The analysis employs a research methodology known as
models. Additional research with models that more fully discrete-choice dynamic programming (Eckstein and
replicate the dynamics of consumer response is needed to Wolpin 1989). The approach is based on the idea that a cus-
judge the effectiveness of dynamically oriented loyalty tomer’s observed sequence of decisions may be interpreted
programs. as the solution to a dynamic optimization problem. These
For a frequency program to be effective in increasing loy- methods are established in economics (Eckstein and Wolpin
alty, it must have a structure that motivates customers to 1989; Rust 1994) and are becoming more common in mar-
view purchases as a sequence of related decisions rather keting (Erdem, Imai, and Keane 2003; Erdem and Keane
than as independent transactions. That is, the structure must 1996; Gönül and Shi 1998; Gönül and Srinivasan 1996;
give customers an incentive to adopt a dynamic perspective. Sun, Neslin, and Srinivasan 2004). Dynamic programming
O’Brien and Jones (1995) suggest that the major factors methods are ideal for analyzing individual choices that are
that customers consider when evaluating programs are the based on both current and future expected benefits. A loy-
relative value of awards and the likelihood of achieving a alty program that bases awards on the level of purchasing
reward. Furthermore, the likelihood of achieving a reward is over a specified period is a prime example of such a deci-
a function of cumulative buying thresholds and time con- sion problem. A further benefit of dynamic programming
straints. These design elements (e.g., thresholds, rewards, methods is that the estimated coefficients can be used to
time constraints) combine with individual-level require- conduct simulations that replicate the consumer’s dynamic
ments and preferences to determine the customer’s expected decision process.
benefits of participating in a loyalty program. The primary contribution of this research is a framework
A special characteristic of loyalty programs is that their for modeling the influence of a reward program and other
attractiveness may change dynamically with a customer’s marketing instruments on customer retention. Firms have
decisions. As purchases are made, both the customer’s multiple options for their promotional budgets, so models
investment in the program and the customer’s likelihood of that can quantify the long-term effects of loyalty programs
earning a reward increase. Conversely, when a customer and other options (e.g., pricing, coupons, shipping fees) can
decides not to purchase in a given period, the likelihood of help the firm justify its choices. Although most database
earning a reward decreases, because the customer moves no marketing applications focus on tasks such as customer
closer to the reward threshold, and the time left to earn scoring that are designed to maximize the profitability of
rewards shrinks. The assessment of a program’s attractive- single-period mailing efforts (Bult and Wansbeek 1995), the
ness is further complicated because customers usually have current research focuses on customers’ response to a range
imperfect knowledge of their future requirements and of the of marketing instruments over an extended period. The
marketing policies of the firm. These dynamic factors are a model provides the means to support multicampaign direct
challenge in the modeling of customer response to loyalty marketing in environments in which customers have a
programs. dynamic orientation.
This study empirically estimates the impact of a reward In terms of substantive findings, the results suggest that
program and other elements of the marketing mix on cus- the loyalty program under examination is successful in
tomer buying behavior over time by developing a model changing customer behavior and in motivating customers to
that replicates dynamic consumer response to a loyalty pro- increase purchasing. In addition to a statistically significant
gram. In contrast to previous models, the current model estimate for the loyalty reward parameters, formulations
considers the impact of previous purchasing activity and that assume that customers are dynamically oriented fit bet-
customer expectations. The underlying behavioral assump- ter than do models that do not include a dynamic structure.
tion is that a reward program can motivate customers to The simulation experiments also provide a means for esti-
base their purchasing decisions both on the current environ- mating the magnitude of response to the program.
ment and on a long-term goal of achieving a frequent buyer The remainder of the article is organized as follows: In
reward. In other words, an effective reward program can the next section, I develop a dynamic programming model
encourage customers to make decisions that maximize of customer response to a loyalty program. Next, I discuss
expected utility over an extended time horizon rather than at model estimation issues and present empirical results. I then
each purchase occasion. This assumption is consistent with report the results of policy experiments that are designed to
previous findings in the literature that expectations of the assess the impact of the loyalty program and other promo-
future can affect consumers’ current-period decisions (e.g., tions on customer retention. I subsequently address limita-
Boulding et al. 1993; Lemon, White, and Winer 2002). tions and future research opportunities and conclude with a
The empirical section of this article uses individual-level discussion of key findings and managerial applicability.
customer data from an Internet grocer to develop a dynamic
model of customer retention. The model identifies the key MODELING THE DYNAMICALLY ORIENTED
factors that influence customers to make repeat purchases CUSTOMER
over time. A model based on an appropriate dynamic behav- The model is intended to replicate the decision-making
ioral specification can assist an overall customer manage- process of a customer when dynamic incentives exist. As
Loyalty Programs and Customer Retention 283
such, the model needs to include the factors that affect the written in terms of alternative specific expected value
attractiveness of a customer’s options at a given time and a functions as V[S(t)] = maxj ∈ JVj[S(t)], where the alterna-
structure that captures dynamic considerations. I begin with tive specific value function Vj[S(t)] is the expected value of
a customer who makes repeated choices from among J the customer selecting option j at time t and then selecting
mutually exclusive alternatives in each of T discrete peri- optimal actions subsequently. Furthermore, the alternative
ods, where the choice set is defined in terms of purchase specific value functions satisfy the Bellman (1957)
quantity (I dropped customer-specific indexes for clarity). equation:
The choice set also includes a no-purchase option. This
(3) Vj[S(t)] = E[Rj(t)|S(t)] + αE{V[S(t + 1)]|S(t), dj(t) = 1},
model allows a customer to choose the extent to which he
or she participates in a program over a sequence of when t = 1, ..., T – 1,
occasions.
The quantity decision is likely to be based on marketing and
factors such as prices and on individual-level factors such Vj[S(T)] = E[Rj(T)|S(t)], at t = T.
as inventories. A myopic decision maker would select from
the options 1, ..., J and consider the relative benefits asso- The form of the alternative specific value functions
ciated with each option, denoted as Rj. The standard implies that decisions are based on both the immediate
assumption is that consumers select the option that yields utility provided by an option and the expected total future
the optimal benefits for the purchase occasion. However, a utility from subsequent purchases and loyalty rewards. A
loyalty program may provide an incentive for customers to key detail is that the expected future benefits from period
view weekly transactions as a sequence of related deci- t + 1 forward may depend on the option selected. In many
sions. If rewards are earned over time, a multiple-period loyalty programs, an important element of the state space,
objective function is appropriate for the customer’s deci- which evolves according to customer actions, is the level
sion problem. The objective function of a dynamically ori- of cumulative spending. In general, the evolution of the
ented customer1 is state space is a function of both the current environment,
S(t), and the choices made by the customer, dj(t), over
T J time.
(1) max E
∑ αt −1 ∑ R (t)d (t)|S(t) ,
j j The discount factor, α, defines the degree to which con-
t = 1 j = 1 sumers are forward-looking or dynamically oriented. A pos-
itive α (0 < α < 1) implies that customers consider future
where Rj(t) is the single-period reward associated with benefits when making current decisions. If α is equal to
option j at time t, dj(t) is an indicator variable that is set zero, customers completely discount future benefits. In the
equal to 1 if option j is chosen at time t and is set equal to context of a loyalty program, a positive α means that cus-
zero otherwise, and α is a one-period discount factor. An tomers partially base current purchasing decisions on the
important element of this formulation is that the reward expectation of earning a reward. When α is equal to zero,
functions Rj(t) are conditional on the state of the environ- customers do not consider future benefits.
ment, S(t). The state space, S(t), is a vector of information Customer Choices and State Description
about the environment that is relevant to the customer’s
forward-looking optimization problem. The state space may As I previously stated, the setting for the empirical analy-
consist of marketing-mix elements, such as the pricing envi- sis is an online retailer that specializes in nonperishable
ronment in a given week, and customer-specific informa- grocery and drugstore items. I model customer choices in
tion, such as cumulative purchases. terms of four purchase levels during weekly periods: no
The customer’s decision problem involves selecting the order, small order, medium order, and large order. Small
level of buying in each period to maximize the expected orders are ones in which merchandise costs less than $50.
discounted utility for the remainder of the time horizon.2 Medium orders are ones in which merchandise costs at least
The value function, V, is the maximum value of discounted $50 but less than $75. Large orders cost at least $75. The
expected utility over the horizon. options available in each week are indexed by j, where
T J
j = no: no purchase in a given week,
(2) V[S( t )] = max E
d j (t)
∑ ατ − t ∑ R j (τ)d j (τ)|S(τ) .
j = sm: purchase of a small basket of goods,
j = med: purchase of a medium basket of goods, and
τ = t j = 1 j = lrg: purchase of a large basket of goods.
The value function is the utility that can be achieved over The order options are defined to be mutually exclusive so
the time horizon if the customer selects the optimal that Σdj(t) = 1. This discretization is not arbitrary but is
sequence of quantities. The value function may also be based on the shipping fees charged by the firm.
I use weekly purchasing decisions to update a state vari-
able that tracks cumulative expenditures Cumit for individ-
1The model is developed for a representative customer. Given that ual i at time t. This variable is maintained by rounding the
response to this type of program is likely to vary across the population, I actual cumulative expenditures to the nearest $33 incre-
estimate a finite mixture model, which is detailed in the “Model Estima-
tion” section.
ment.3 For purposes of the customer’s expectations of the
2The loyalty program under study includes a time constraint. Specifi- future state, the variable is updated according to Equation
cally, the program is based on customers’ annual purchasing, and points
are not carried over into future annual cycles. This is an important element
of the program’s structure because it dictates the formulation of the 3These increments roughly reflect the sizes of the average orders
dynamic program used to model the customer’s optimization problem. observed in each category.
284 JOURNAL OF MARKETING RESEARCH, AUGUST 2004
4.4 For example, if a customer has $500 in cumulative within the previous two weeks, k = 2 indicates between
expenditures at a given time t and makes a small purchase, three and five weeks since the previous purchase, k = 3 indi-
the customer is assumed to behave as if the future value of cates between six and eight weeks since the previous pur-
Cumit + 1 is $533. chase, and k = 4 indicates no purchase in the previous nine
weeks. The level of duration for a representative consumer
4
at time t is denoted by a series of indicator variables,
(4) Cum it + 1 = Cum it ∑ (100/3)( j − 1)d (t).
j Rcn(k)i,t, that are defined as mutually exclusive, such that
j = 1 Σ4k = 1Rcn(k)i,t = 1.5
The Rcn and FM variables are consistent with the
I use the cumulative buying variables to determine when recency, frequency, and monetary (RFM) value variables
a loyalty-based reward is earned. A reward variable, Lit, is that are popular in direct-marketing response models. The
set to one if a purchase causes a reward threshold to be RFM variables reflect important individual-level differ-
crossed and is set to zero otherwise. This occurs when a ences and have been used in previous dynamic program-
customer whose previous level of cumulative purchases is ming models of consumer behavior (Gönül 1999; Gönül
less than a reward threshold makes a purchase that brings and Shi 1998).
cumulative purchasing over the next reward threshold. For An important marketing-mix element for direct retailers
the program under study, the reward thresholds are set at is shipping and handling fees. A unique element of the data
$1,000, $1,500, and $2,000 worth of annual purchases. If a is that the online retailer in this study experimented with
customer with cumulative purchases of $925 makes a pur- various shipping fee structures during data collection. The
chase worth $75 or more, Lit equals one, but Lit equals zero shipping fee structure in place at a given time is important,
if only a $50 purchase is made. Rewards are paid in blocks because it provides incentives and penalties for various
of 500 frequent-flyer miles for the $1,000 and $1,500 order sizes and thus may affect both order incidence and
thresholds and 1000 miles for the $2,000 level. Therefore, distribution of order sizes. Specifically, graduated shipping
when the $2,000 threshold is crossed, the Lit variable equals fee structures impose an element of nonlinear pricing on
two, to reflect that the reward earned is double the value of buying decisions. For example, a shipping fee structure used
the previous rewards. in the data is a schedule that charges $6.97 in fees for orders
I do not treat the cumulative expenditure variables as a less than $75 and provides free shipping for orders greater
direct component of customers’ reward functions. Rather, I than $75. This type of policy provides an incentive for cus-
define variables that account for the potential benefits of tomers to increase order sizes. Customers may find them-
being an experienced customer. My strategy is to use indi- selves in situations in which the total order cost (i.e., the
cator variables to define broad categories of previous activ- sum of merchandise and shipping fees) is less for a basket
ity. I define two indicator variables (FM1 and FM2) that with a higher merchandise total, because the marginal cost
define classes of previous purchasing activity for a given of an incremental item (that causes the merchandise order to
customer as follows: FM1 = 1 if Cumit ≥ $300, and FM1 = 0 cross a threshold) can be negative. Similarly, a schedule that
otherwise, and FM2 = 1 if Cumit ≥ $500, and FM2 = 0 involves fees that increase as order size increases provides
otherwise. The variables are intended to capture dynamic an incentive for customers to purchase smaller baskets of
effects beyond the loyalty program. For example, as cumu- goods. The cost of shipping each order size under each ship-
lative purchasing increases in terms of number and size of ping fee schedule is provided in Table 1.
transactions, it is likely that customers gain a measure of Shipping variables, SH(h)t, indicate the shipping fee
expertise that effectively reduces transaction costs. structure used at a given time, where h = 1 indicates a
Another element of the state space is the amount of time mixed/decreasing fee schedule, h = 2 indicates increasing
since a customer’s previous order. This measure is intended fee schedule (high), h = 3 indicates increasing fee schedule
to control for possible inventory or customer attrition (low), and h = 4 indicates free shipping for all order sizes.
effects and is adjusted each period as follows: Durit = weeks For example, SH(4)t = 1 indicates that the firm used a free
since a purchase was made; if j ∈ {small, medium, large} shipping policy at time t. The shipping and handling fee
(i.e., a purchase is made), then Durit + 1 = 1; and if j = no, structures are mutually exclusive, such that ΣSH(h)t = 1.
then Durit + 1 = Durit + 1. I subsequently define the actual The online retailer under study carries in excess of
implementation of the duration variable, which is capped at 10,000 items. Therefore, rather than use specific prices in
nine weeks through a series of categorical variables. the model, I use a summary measure that describes the pric-
Before the empirical analysis, the role of the duration ing environment. This price variable is defined as the aver-
variable is not well defined. It may capture inventory- age price of the 50 top-selling items during the data collec-
related decision making if the natural buying cycle is tion period in a given week and is calculated as Pt =
greater than the weekly cycle used in the model. However, (Σ50
item = 1Pitem,t)/50. The price variable accounts for the rela-
if the probability of observing a purchase decreases as the tive attractiveness or competitiveness of the firm’s pricing
time since the previous purchase increases, the duration in a given week. An additional price-related factor is e-mail
variable will capture customer attrition. The duration cate-
gories are indexed by k, where k = 1 indicates a purchase
5Recency also could be constructed as a continuous variable. The use of
a series of categorical variables is a simplifying assumption that helps
4For purposes of the loyalty rewards, this variable is updated exactly on maintain computational tractability. In addition, I based the selection of the
the basis of previous levels of buying and the quantity purchased. I use the break points on patterns of incidence observed in the data. For example,
discretization to approximate the customer’s expectations of the next purchase incidence drops from 9.5% when duration is eight weeks to 5%
period state. when it is nine weeks; after nine weeks, the rate stabilizes near 3%.
Loyalty Programs and Customer Retention 285
Table 1
SHIPPING AND HANDLING FEE STRUCTURES
Table 2
STATE AND CONTROL VARIABLES
Control Variable
dj(t) Purchase quantity decision 4 levels (no buy, small, medium, and large)
286 JOURNAL OF MARKETING RESEARCH, AUGUST 2004
2
In this expression, vj represents the deterministic portion of
+ ∑β f , med FM(f ) it + β L L it + ε med , it , the alternative specific value function. Therefore, model
f = 1 estimation requires the repeated solution of a dynamic pro-
and gramming problem to calculate the value functions. Under
the assumption of a homogeneous population, the log-
H
∑β
likelihood function to be maximized consists of the sum of
(11) R lrg, i ( t ) = β lrg + β P , lrg Pit + β C, lrg Cit + h , lrg SH (h) it the logarithms of the choice probabilities defined in Equa-
h = 1 tion 12.
K 2 The estimation procedure involves nesting an algorithm
+ ∑ β k Rcn( k )it + ∑β f , lrg FM(f ) it + β L L it to solve the dynamic programming problem within a
maximum-likelihood routine. At each iteration, the dynamic
k = 1 f = 1
programming problem is solved by the use of the current
+ ε lrg, it .
estimate of the parameter vector associated with the reward
functions. The alternative specific value functions are used
Equation 8 represents the reward associated with the no- to compute the log-likelihood for the current iteration. The
purchase option, and Equations 9–11 represent the rewards maximum-likelihood algorithm then updates the parameter
associated with small, medium, and large orders. For exam- vector, and the process is repeated until convergence.
ple, in Equation 9, the utility of a small order is a function Because the loyalty program under study has a finite
of βsm, which is the constant associated with a small order, horizon, I solve the dynamic programming model used to
the price index, and whether an e-mail coupon was avail- generate the choice probabilities with backward recursion.
able at time t. The shipping charges enter through the sum- Given the annual time horizon of the program and the
mation of βh,sm and SH(h)it. For example, β1,sm is the coef- assumption of weekly decisions, the decision horizon
ficient for a small order (option j = 2) when the shipping involves 52 periods. The recursion procedure is executed as
schedule is of Type 1 (h = 1). The second summation term, follows: For a set of parameter values, I compute the alter-
which involves βk and the duration indicator variables native specific value functions for the final period (T = 52)
Rcn(k)it, captures the effects of time since the previous pur- for all combinations of the state space S:
chase. The third summation term involves the cumulative
purchasing variables, FM(f)it. In the term for the loyalty (13) Vj[S(52)] = E[Rj(52)|S(52)].
indicator, βL is the coefficient that describes the benefit of a
loyalty reward. The εjit terms correspond to the random I then use the values for Vj(52) to calculate the value func-
component of utility. In addition, for identification pur- tions for t = 51. Specifically, the alternate specific value
poses, the constant of the no-purchase option is normalized function for an arbitrary state at t = 51 is the expected
to zero (βno = 0). reward during that period plus the discounted expected
reward of an optimal action in the next period:
MODEL ESTIMATION Spot
is that the loyalty program alters the behavior of only some Table 3
fraction of the customer population. To extend the model to SUMMARY OF MODEL FITS
account for variability in customer preferences, I use a
latent class approach (Kamakura and Russell 1989). This Model Number of Parameters BIC
approach assumes that the population consists of M types,
Single Segment
where πm is the proportion of type m in the population. For Myopic 28 35,419.5
the finite-mixture approach, the sample likelihood is Dynamic 31 34,996.8
N M Two Segments
(15) ∏∑
i = 1m = 1
Ti |type = m )π m ,
Pr(d1mi , d 2mi , ..., d m Two myopic
Two dynamic
60
66
34,388.0
33,996.7
One dynamic and one myopic 63 34,230.5
where Three Segments
Three dynamic 95 34,003.9
T
Pr[d m
i (1), dm
i (2), ..., dm
n (T)] = ∏ Pr[d(t)|S(t), type = m].
t = 1 models. The addition of a third segment marginally
improves fit but yields several nonintuitive parameters.
The use of a finite-mixture model to account for customer Estimation results provided in Table 4 indicate two fairly
heterogeneity significantly increases the computational bur- different segments. In broad terms, Segment 1 purchases
den, because the dynamic optimization problem must be less frequently and favors larger order sizes, whereas Seg-
repeatedly solved for each type in the population. ment 2 purchases more frequently and favors smaller
Because the direct estimation of the discount factor α orders. To generate descriptions of the two segments, Table
tends to be difficult, this parameter is fixed before the esti- 5 reports results of simulation studies based on each seg-
mation of the other model parameters. The value of α used ment’s parameter values and a marketing policy similar to
in the estimation procedure plays a key role in the analysis. that employed by the firm. Segment 1 places an average of
By setting α equal to zero, the model is estimated under an 8.7 orders per year and spends more than $670 annually.
assumption that customers are myopic and do not consider Segment 2 spends an average of $586 dollars per year and
the value of future rewards in current decisions. If α is set to places approximately 14 orders. In terms of the loyalty
a high value (i.e., close to one), the model assumes that cus- rewards, approximately 15% of the Segment 1 population
tomers value potential future benefits. The empirical strat- earns at least one reward, whereas only approximately 1%
egy is to estimate both types of models. A comparison of of Segment 2 earns a loyalty reward. Table 5 also shows
model fits then provides evidence of whether customers how the two segments combine to an overall population and
have a dynamic response to the loyalty-based rewards. how this simulated population compares with the actual
sample.7
EMPIRICAL IMPLEMENTATION
An important difference in the segments is the levels of
The complete data set from the Internet retailer includes dispersion. Although the two segments have fairly similar
transaction histories for more than 30,000 customers and mean levels of annual spending, there is a significant differ-
spans a 13-month period from 1997 to 1998. For estimation ence in terms of variability. Segment 1 tends to be more
purposes, I selected a small subsample of the population. volatile, which is why it contains the majority of award
Because the primary focus is the effectiveness of the reward recipients. Figure 1 shows the distribution of annual pur-
program, the empirical analysis focuses on customers with chasing for both segments and illustrates the differences in
meaningful transaction histories. The selection criterion is the dispersion of the two groups. Although the BIC meas-
that customers make at least two purchases during their first ures help justify the incorporation of dynamic factors and
5 weeks in the database and that there are at least 30 weeks heterogeneity controls, measures that show how these
of observations. The final sample consists of 1058 cus- model enhancements lead to improved managerial forecasts
tomers who made an average of 10.45 purchases per year. are also important. Because I am interested in predicting
I estimated models for various single- and multiple- long-term behavior, I define measures that describe how
segment formulations and for different assumptions regard- well each specification can replicate the actual distribution
ing the discount rate. Fit statistics and assumptions regard- of cumulative purchasing. To construct the measures, I use
ing segment discount rates for single-, two-, and each model to simulate the distribution of annual spending
three-segment models are provided in Table 3. Table 4 pres- for 100,000 customers. I then compare the simulated distri-
ents the estimated coefficients for a two-segment mixture butions with the actual percentages of the population in
model in which both segments possess a dynamic orienta- terms of absolute deviations (Table 6). This provides an
tion.6 The two-segment dynamic model is the best fitting of indication of how well the specifications capture the vari-
all the two-segment models, according to the Bayesian ability in the population. To show how well the specifica-
information criterion (BIC) fit measure, and is the basis for tions capture the influence of the loyalty program, I also
subsequent analysis. In addition to providing a better fit include the percentage of customers who are predicted to
than the competing two-segment models, the two-segment earn a loyalty reward. In general, the dynamic specifications
dynamic model is superior to the homogeneous population help better account for the spikes caused by the reward pro-
6To improve readability, Table 4 does not report coefficients that are not 7I use simulation studies rather than posterior probabilities to describe
significant. However, when relevant, I mention such coefficients in the the segments in order to be consistent with the simulation studies described
text. Full results are available on request. in the “Policy Experiments” section.
288 JOURNAL OF MARKETING RESEARCH, AUGUST 2004
Table 4
TWO-SEGMENT MODEL RESULTS
Segment 1 Segment 2
Coefficient Standard Error Coefficient Standard Error
Small-Order Parameters
Intercept: βsm N.S. N.S.
Shipping policy 1: βh = 1,sm –1.02** .15 –.32** .07
Shipping policy 2: βh = 2,sm –.61* .26 –.28* .12
Shipping policy 4: βh = 4,sm N.S. N.S.
Price: βP,sm –1.90* .89 –1.38* .74
E-coupon: βC,sm N.S. .36** .11
FM1: βf = 1,sm N.S. .31* .13
FM2: βf = 2,sm N.S. .48** .08
Medium-Order Parameters
Intercept: βmed N.S. N.S.
Shipping policy 1: βh = 1,med –.95** .17 –.63** .24
Shipping policy 2: βh = 2,med –.62* .33 –.45* .24
Shipping policy 4: βh = 4,med N.S. N.S.
Price: βP,med –2.13* 1.14 –2.20* 1.18
E-coupon: βC,med N.S. .19* .11
FM1: βf = 1,med .94** .18 1.10** .34
FM2: βf = 2,med .91** .13 1.12** .18
Large-Order Parameters
Intercept: βlrg N.S. N.S.
Shipping policy 1: βh = 1,lrg .40** .05 1.02** .26
Shipping policy 2: βh = 2,lrg N.S. N.S.
Shipping policy 4: βh = 4,lrg .19* .09 N.S.
Price: βP,lrg –2.49** .76 –2.88** 1.15
E-coupon: βC,lrg .33** .12 .35* .17
FM1: βf = 1,lrg 1.59** .23 1.29* .55
FM2: βf = 2,lrg 1.90** .15 1.42** .29
Common Parameters
Reward: βL .75** .27 N.S.
Recency level 2: βk = 2 N.S. –.46** .07
Recency level 3: βk = 3 –.73** .08 –1.01** .11
Recency level 4: βk = 4 –1.73** .19 –1.84** .15
Price Expectation Model
Intercept 1: θLow N.S. N.S.
Intercept 2: θHigh N.S. N.S.
Price effect: θP –.65* .35 N.S.
Segment size: θseg1 .89** .11
*p < .10.
**p < .01.
Notes: I estimated the segment size parameter using a logit formulation such that π1 = exp(θseg1)/[1 + exp(θseg1)]. N.S. = not significant.
Table 5
SEGMENT DESCRIPTIONS
gram, and the heterogeneity controls help better reflect the and myopic homogeneous models are also notable.8 In the
variability in the population. dynamic model, the reward coefficient is positive and sig-
The segments are substantially different in terms of the nificant (.77), whereas the static model yields a smaller pos-
loyalty reward effects. Segment 1 yields a parameter of .75, itive estimate that is only marginally significant (.56). The
whereas the parameter for Segment 2 is just .22. In addition, static model yields a lower estimate for the reward parame-
the t-statistic for Segment 2’s response to the loyalty pro- ter because the specification does not consider the ability of
gram is just 1.3. To some extent, this is not surprising,
because the rewards are rarely relevant to members of Seg-
ment 2. The estimated reward parameters from the dynamic 8Full estimation results for the models are available on request.
Loyalty Programs and Customer Retention 289
Figure 1 category, –2.12 for the medium category, and –2.48 for the
COMPARISON OF SIMULATED POPULATIONS large category. In contrast, Segment 2’s coefficients are
–1.38, –2.20, and –2.88, respectively. These patterns are as I
expected, because higher prices have a larger absolute
25% impact on larger orders. In terms of segment differences,
the results suggest that higher prices tend to shift Segment 2
Percentage of Population
specification herein, the shipping fee schedules are repre- Drèze, Xavier and Stephen J. Hoch (1998), “Exploiting the
sented as four distinct states, and the use of dollar values Installed Base Using Cross-Merchandising and Category Desti-
would require three distinct variables with three levels. nation Programs,” International Journal of Research in Market-
Another area for further research pertains to the effective- ing, 15 (5), 459–71.
Eckstein, Zvi and Kenneth Wolpin (1989), “The Specification and
ness of alternative reward types. A limitation of the current
Estimation of Dynamic Stochastic Discrete Choice Models,”
data set is that rewards are limited to a single type. With Journal of Human Resources, 24 (4), 562–98.
appropriate data, it may be possible to measure the relative Erdem, Tülin, Susumu Imai, and Michael Keane (2003), “Brand
effectiveness of different rewards or the benefits of offering and Quantity Choice Dynamics Under Price Uncertainty,”
reward choices. Quantitative Marketing and Economics, 1 (1), 5–64.
——— and Michael Keane (1996), “Decision-Making Under
CONCLUSION Uncertainty: Capturing Dynamic Brand Choice Processes in
This article presents a dynamic programming model of Turbulent Consumer Markets,” Marketing Science, 15 (1),
1–20.
customer response to a loyalty program and other marketing Gönül, Füsun (1999), “Estimating Price Expectations in the OTC
tactics. The model measures the influence of rewards by Medicine Market: An Application of Dynamic Stochastic Dis-
considering customers’ sequences of purchases as a solution crete Choice Models to Scanner Panel Data,” Journal of Econo-
to a dynamic optimization problem. A primary strength of metrics, 89 (1–2), 41–56.
the approach is that the dynamic programming framework ——— and Mengze Shi (1998), “Optimal Mailing of Catalogs: A
provides a rich structure for modeling customer behavior. New Methodology Using Estimable Structural Dynamic Pro-
The model includes both the influence of previous behavior gramming,” Management Science, 44 (9), 1249–62.
in terms of cumulative purchases and forward-looking fac- ——— and Kannan Srinivasan (1996), “Estimating the Impact of
tors, such as expectations of future prices and loyalty Consumer Expectations of Coupons on Purchase Behavior: A
rewards. The inclusion of forward-looking behaviors is Dynamic Structural Model,” Marketing Science, 15 (3),
262–79.
computationally expensive but intuitively appealing for Kalwani, Manohar U., Chi Kin Yim, Heikki J. Rinne, and Yoshi
modeling customer response to a loyalty program. Given Sugita (1990), “A Price Expectations Model of Consumer
the prevalence of loyalty programs, this is a salient topic for Brand Choice,” Journal of Marketing Research, 27 (August),
researchers and practitioners. 251–62.
The model estimates the effects of e-mail coupons, pric- Kamakura, Wagner and Gary J. Russell (1989), “A Probalistic
ing changes, shipping fees, and the loyalty program. The Choice Model for Market Segmentation and Elasticity Struc-
sign and statistical significance of the parameter associated ture,” Journal of Marketing Research, 26 (November),
with the reward in the dynamic model is evidence that the 379–90.
loyalty program effectively increases repeat-purchase rates. Kim, Byung-Do, Mengze Shi, and Kannan Srinivasan (2001),
Further evidence of the program’s effectiveness is the rela- “Reward Programs and Tacit Collusion,” Marketing Science, 20
(2), 99–120.
tive fit of the dynamic model compared with a static formu- Kivetz, Ran and Itamar Simonson (2002), “Earning the Right to
lation in which customers do not consider future benefits. Indulge: Effort as a Determinant of Customer Preferences
The results should provide solace to advocates of loyalty Toward Frequency Program Rewards,” Journal of Marketing
programs. Research, 39 (May), 155–70.
The model also provides a platform for conducting what- Kopalle, Praveen and Scott A. Neslin (2003), “The Economic Via-
if analyses of customer retention. Simulation studies can be bility of Frequency Reward Programs in a Strategic Competitive
used to study the relative power of each marketing instru- Environment,” Review of Marketing Science, (accessed April
ment to increase long- and short-term purchasing. Such 16, 2004), [available at http://www.bepress.com/romsjournal/
studies are useful because the estimation results indicate vol1/iss1/art1].
that multiple instruments can stimulate repeat buying. Fur- Lemon, Katherine, Tiffany Barnett White, and Russell Winer
(2002), “Dynamic Customer Relationship Management,” Jour-
thermore, given appropriate cost information, it is a simple nal of Marketing, 66 (January), 1–14.
extension to use these experiments to estimate the prof- Narasimhan, C. (1989), “Incorporating Consumer Price Expecta-
itability of alternative policies for customer relationship tions into Diffusion Models,” Marketing Science, 8 (4), 343–57.
management. O’Brien, Louise and Charles Jones (1995), “Do Rewards Really
Create Loyalty?” Harvard Business Review, 73 (May–June),
REFERENCES 75–82.
Bellman, Richard (1957), Dynamic Programming. Princeton, NJ: Rust, John (1994), “Structural Estimation of Markov Decision
Princeton University Press. Processes,” in Handbook of Econometrics, Vol. 4, D. McFadden
Bertsekas, Dimitri (1996), Dynamic Programming and Optimal and R. Engle, eds. Amsterdam: North-Holland, 3081–3143.
Control. Belmont, MA: Athena Scientific. Sharp, Byron and Anne Sharp (1997), “Loyalty Programs and
Boulding, William, Ajay Kalra, Richard Staelin, and Valarie Zei- Their Impact on Repeat-Purchase Loyalty Patterns,” Interna-
thaml (1993), “A Dynamic Process Model of Service Quality: tional Journal of Research in Marketing, 14 (5), 473–86.
From Expectations to Behavioral Intentions,” Journal of Mar- Soman, Dilip (1998), “The Illusion of Delayed Incentives: Evalu-
keting Research, 30 (February), 7–27. ating Future Effort-Money Transactions,” Journal of Marketing
Bult, Jan Roelf and Tom Wansbeek (1995), “Optimal Selection for Research, 35 (November), 427–37.
Direct Mail,” Marketing Science, 14 (4), 378–95. Sun, Baohong, Scott A. Neslin, and Kannan Srinivasan (2004),
Deighton, John (2000), “Frequency Programs in Service Industries,” “Measuring the Impact of Promotions on Brand Switching
in Handbook of Services Marketing and Management, T. Swartz When Consumers Are Forward Looking,” Journal of Marketing
and D. Iacobucci, eds. London: Sage Publications, 401–408. Research, 40 (November), 389–405.
Dowling, Grahame R. and Mark Uncles (1997), “Do Customer Zhang, Z. John, Aradhna Krishna, and Sanjay K. Dhar (2000),
Loyalty Programs Really Work?” Sloan Management Review, “The Optimal Choice of Promotional Vehicles,” Management
38 (Summer), 71–82. Science, 46 (3), 348–62.