Chrun of Telecom Subscribers (1)

Uploaded by

Rydham Goyal

Term Project: Prediction of Churn of Telecommunications Customers

As you have learnt by now, predictive modeling – the process of “scoring” and targeting customers for a
marketing campaign – is a significant database marketing tool and an important component of a firm’s
customer relationship management (CRM) effort. The promise of predictive modeling is the ability to
predict what actions customers will take, thereby allowing firms to target their marketing efforts more
effectively. One area of particular importance is customer “churn,” in this case, customer voluntary churn,
when current customers decide to take their business elsewhere or voluntarily terminate their service.
Annual churn rates have been reported to be in the 20% - 40% range for telecommunication and other
technology industries. This puts a premium on developing models that accurately predict which
customers are most likely to churn, so proactive steps (e.g. appropriate communication and treatment
programs) can be taken to prevent customers from churning. The purpose of this assignment is to figure
out which method(s) work best for predicting churn, thereby enhancing our overall understanding of
predictive modeling.
The Teradata Center for Customer Relationship Management at Duke University (the Center) has shared
a dataset regarding the churn of telecommunications customers. The data consist of calibration and
validation samples of customers from a major wireless telecommunications company. The calibration
sample includes observed churn and a set of potential predictor variables. The two validation samples
include the same predictor variables, but no churn variable. Your group, which has been hired as a
consultant to the telecommunications company, is required to submit predictions of each customer’s
likelihood of churning.
The Wireless Industry: Over the years, the wireless sector has been one of the fastest-growing businesses
in the economy. With a unique value proposition – freedom and connectivity – the number of subscribers
doubled every two years during the 90’s. Wireless stocks grew as fast as those of many dot-coms, start-
ups emerged everywhere, and IPO’s raised record amounts of money.
These events shaped the new telecommunications landscape as we know it today.
Industry Turmoil: Despite the vertiginous levels of growth and promise, serious challenges to industry
profitability have recently emerged: (a) Consolidation: Of the nearly 60 cellular companies in the US,
virtually all are now bankrupt, bought out, or struggling with heavy debts. Only six big players
now account for 80% of the wireless pie. India has seen similar consolidation, leaving four major
players. (b) Growth: With 1.2 billion subscribers, India is currently the second-largest telecom
market in the world and has expanded rapidly in recent years. The industry has grown primarily
because of favourable regulatory conditions, low prices, increased accessibility, and the
introduction of Mobile Number Portability (MNP). The telecom sector is set to grow at a Compound
Annual Growth Rate (CAGR) of 9.4% from 2020 to 2025, while the smartphone industry in India, with a
CAGR of 15.9% over the same forecast period, will grow fastest. As growth has increased, however, so
has competition. (c) Competition: As an obvious result (and to the consumer’s delight), firms engaged
in a devastating price war that not only eroded revenue growth but also endangered their ability to
meet their titanic debts. (d) Customer Strategy: The industry paradigm has arguably changed from one
of “make big networks, get customers” to “make new services, please customers.” In short, the
industry has moved from an acquisition orientation to a retention orientation.
The Elusive Customer: Until now, firms have been able to acquire customers without much effort.
Demand for wireless services has been such that if a customer decided to drop his service and switch to
another carrier, another new customer was right behind him. The priority was to keep the customer
acquisition rate high, often at the expense of customer retention. But this situation has changed. As the
well of wireless subscribers has begun to run dry, churn (the customer’s decision to end the relationship
and switch to another company) has become a major concern. Last year the industry average churn rate
was 20% - 25% annually, which translates to approximately 2% churn per month. This means that
companies lose about 2% of their customers every month. Statistics for the third quarter of 2001 (the
data are a bit old, but the pattern is still similar) show annual churn rates in an even higher range,
28% - 46%.
Churn rates for major carriers - Q3 2001

Carrier             Monthly churn   Annual churn
Nextel              2.10%           28%
VoiceStream         5%              46%
Sprint PCS          3%              31%
AT&T Wireless       3.10%           37%
Cingular            3.20%           34%
Verizon Wireless    2.20%           31%

Source: Telephony Online, 2002
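
The link between the monthly and annual churn figures quoted above can be checked with a little arithmetic. The sketch below (in Python, for illustration only; the function name is hypothetical) compares a simple "times twelve" conversion with a compounded one, assuming a constant monthly churn rate applied to the customers who remain.

```python
def annual_from_monthly(monthly_rate, compound=True):
    """Convert a monthly churn rate (as a fraction) into an annual one."""
    if compound:
        # Each month, (1 - monthly_rate) of the remaining customers stay.
        return 1.0 - (1.0 - monthly_rate) ** 12
    # Simple approximation that ignores the shrinking customer base.
    return 12.0 * monthly_rate


simple = annual_from_monthly(0.02, compound=False)
compounded = annual_from_monthly(0.02)
print(f"simple: {simple:.1%}, compounded: {compounded:.1%}")
# prints: simple: 24.0%, compounded: 21.5%
```

Both conversions of a 2% monthly rate fall inside the 20% - 25% annual range mentioned in the text.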

The reasons for the high level of churn are: (a) the number of companies, (b) the similarity of their
offerings, and (c) the cheap prices/perceived quality of service. Moreover, the biggest barrier to churn,
the lack of phone number portability, has been removed, and people can now churn without changing their
phone numbers. Companies are now beginning to realize just how important customer retention is. In fact,
one study finds that “the top six US wireless carriers would have saved $207 million if they had retained
an additional 5% of customers open to incentives but who switched plans in the past year” (Reuters 2002).
Over the next few years, the industry’s biggest marketing challenge will be to control churn rates by
identifying those customers who are most likely to leave and taking appropriate steps to retain them. The
first step therefore is predicting churn likelihood at the customer level.

Data Description

The data have generously been provided to the Center by a major wireless carrier. The data are
organized into three data files: Calibration, Current Score Data, and Future Score Data.

                           Calibration              Current Score Data       Future Score Data
Sample Size                100,000                  51,306                   100,462
# of Predictor Variables   171                      171                      171
Churn Indicator            Yes                      No                       No
Customer ID                1,000,001 to 1,100,000   2,000,001 to 2,051,306   3,000,001 to 3,100,462

The Calibration Data contain the “dependent variable” – churn – as well as several potential predictors.
The Current and Future Score Data contain the predictors but not churn. You are expected to develop
your models on the calibration data and use these models to predict for the Current and Future Score
Data, once you have evaluated their performance parameters.
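
Because the score files carry no churn indicator, any performance evaluation has to happen on the calibration data, for example by holding part of it out. The following is a minimal sketch of such a hold-out split (in Python, for illustration; the assignment itself asks for R). The function name, the 30% test fraction, and the seed are assumptions, not part of the assignment.

```python
import random


def holdout_split(rows, test_fraction=0.3, seed=42):
    """Shuffle rows and split them into (train, test) lists."""
    rng = random.Random(seed)          # fixed seed keeps the split reproducible
    shuffled = list(rows)
    rng.shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


# Calibration customer IDs run from 1,000,001 to 1,100,000 (see the table above).
customer_ids = range(1_000_001, 1_100_001)
train_ids, test_ids = holdout_split(customer_ids)
print(len(train_ids), len(test_ids))
# prints: 70000 30000
```

Models would then be fitted on the training rows and compared on the held-out rows before scoring the Current and Future Score Data.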
The “Data Documentation” spreadsheet provides detailed descriptions of all the variables. The
predictors include three types of variables: behavioral data such as minutes of use, revenue, handset
equipment; company interaction data such as customer calls into the customer service center, and
customer household demographics.
Customers were selected as follows: mature customers, defined as those who had been with the company
for at least six months, were sampled during July, September, November, and December of 2001. For each
customer, predictor variables were calculated based on the previous four months. Churn was then
calculated based on whether the customer left the company during the period 31-60 days after the
customer was originally sampled. The one-month treatment lag between sampling and observed
churn reflects the practical concern that in any application, a few weeks would be needed to score the
customer and implement any proactive actions.
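
To make the timing concrete, the sketch below (in Python, for illustration) derives the treatment lag and the churn observation window from a sampling date. The specific day, 1 July 2001, is a hypothetical choice: the description above gives only the sampling months.

```python
from datetime import date, timedelta


def observation_windows(sample_date):
    """Return (lag_end, churn_start, churn_end) for a sampling date.

    Churn is observed 31-60 days after sampling, so days 1-30 form the
    treatment lag during which no outcome is recorded.
    """
    lag_end = sample_date + timedelta(days=30)
    churn_start = sample_date + timedelta(days=31)
    churn_end = sample_date + timedelta(days=60)
    return lag_end, churn_start, churn_end


# Hypothetical sampling day within July 2001.
lag_end, churn_start, churn_end = observation_windows(date(2001, 7, 1))
print(lag_end, churn_start, churn_end)
# prints: 2001-07-31 2001-08-01 2001-08-30
```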
The actual percentage of customers who churn in a given month is approximately 1.8%. However,
churners were oversampled when creating the Calibration sample to produce a roughly 50-50 split
between churners and non-churners (the exact numbers are 49,562 churners and 50,438 non-churners).
Oversampling was not undertaken in creating the Current Score and Future Score validation samples.
This is to provide a more realistic predictive test. The Current Score data contain a different set of
customers from the Calibration data, but selected at the same point in time. The Future Score data
contain a different set of customers selected at a future point in time.
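
One consequence of the oversampling is that probabilities from a model fitted on the calibration sample are scaled to a roughly 50% churn rate, not to the 1.8% found in the population. The sketch below (in Python, for illustration) applies a commonly used prior correction to rescale them. This technique is not prescribed by the assignment, and the function name is hypothetical. The two rates come from the figures above: 49,562 / 100,000 in the sample and about 1.8% in the population.

```python
def adjust_for_oversampling(p, sample_rate=0.49562, true_rate=0.018):
    """Rescale a churn probability estimated on an oversampled data set.

    p           : probability predicted by a model fitted on the calibration sample
    sample_rate : churner share in the calibration sample (49,562 / 100,000)
    true_rate   : approximate monthly churn rate in the population (1.8%)
    """
    churn_weight = true_rate / sample_rate
    stay_weight = (1.0 - true_rate) / (1.0 - sample_rate)
    return p * churn_weight / (p * churn_weight + (1.0 - p) * stay_weight)


for p in (0.2, 0.5, 0.8):
    print(p, round(adjust_for_oversampling(p), 4))
```

The correction is monotonically increasing in p, so it leaves the ranking of customers unchanged. Rank-based measures such as Top Decile Lift and the Gini Coefficient are therefore unaffected by it.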
In addition to the regular measures, you will have to calculate two additional measures of predictive
accuracy for each submitted data file: Top Decile Lift and the Gini Coefficient. Top Decile Lift measures
whether the 10% of customers predicted most likely to churn actually churn. The Gini Coefficient measures
predictive accuracy across the entire set of customers, not just the top 10%. The Gini Coefficient is
used in economics to measure phenomena such as income inequality (Wolff 2002; Sydsaeter and Hammond
1995). In database marketing, the Gini Coefficient works off the “Cumulative Lift Curves,” shown in the
accompanying figure. A Cumulative Lift Curve plots the top x% predicted customers versus the percentage
of churners accounted for by these customers. For example, in the figure, the 10% of customers predicted
most likely to churn by Method B account for 31.3% of all churners. The top 20% predicted customers
account for 62.5% of all churners. That is better than random prediction (shown by the Cum Random line),
where the top 10% would account for 10% of churners, and the top 20% would account for 20% of churners.
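
The cumulative lift curve and Top Decile Lift can be sketched as follows (in Python, for illustration; the assignment itself asks for R). The function names and the toy data are hypothetical. Top Decile Lift is computed here as the share of churners captured in the top 10% divided by 10%, a common convention that the text above does not spell out, so treat that ratio as an assumption.

```python
def cumulative_capture(scores, churned, top_fraction):
    """Share of all churners found among the top `top_fraction` of customers by score."""
    ranked = sorted(zip(scores, churned), key=lambda pair: pair[0], reverse=True)
    n_top = max(1, int(round(len(ranked) * top_fraction)))
    churners_in_top = sum(flag for _, flag in ranked[:n_top])
    return churners_in_top / sum(churned)


def top_decile_lift(scores, churned):
    """Capture rate in the top 10% relative to the 10% expected at random."""
    return cumulative_capture(scores, churned, 0.10) / 0.10


# Toy data: 20 customers with scores 1.00, 0.95, ..., 0.05 and 4 churners.
scores = [i / 20 for i in range(20, 0, -1)]
churned = [1, 0, 1, 0, 0, 1, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0]

print(f"top 10% capture: {cumulative_capture(scores, churned, 0.10):.2f}")
print(f"top 20% capture: {cumulative_capture(scores, churned, 0.20):.2f}")
print(f"top decile lift: {top_decile_lift(scores, churned):.2f}")
```

Under this ratio definition, Method B's capture of 31.3% of churners in the top 10% would correspond to a lift of about 3.13.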
The Gini Coefficient is the area between a method’s cumulative lift curve and the random lift curve.
Technically, it should be calculated as an integral (Sydsaeter and Hammond 1995) but we will
approximate it by a numerical measure (Alker 1965; Statistics.Com, 2002) since we have a finite number
of customers and no closed form formula for the cumulative lift curve for a given method.
The formula we use is:

$$\mathrm{Gini} = \frac{2}{n} \sum_{i=1}^{n} \left( v_i - \hat{v}_i \right)$$

where:
n = number of customers,
$v_i$ = proportion of churners who have a predicted probability of churn equal to or higher than that of customer i,
$\hat{v}_i$ = proportion of customers who have a predicted probability of churn equal to or higher than that of customer i.
$v_i$ is the height of the method’s cumulative lift curve at the ith most likely predicted-to-churn
customer, and $\hat{v}_i$ is the height of the random cumulative lift curve. The difference provides the
“length” for calculating the area between the random and method prediction curves. The term 1/n
approximates the “width” on the x-axis. The Gini Coefficient sums these lengths-times-widths across
customers, providing an approximation to the area between the method’s lift curve and the random lift
curve. The calculation is multiplied by “2” to ensure that the maximum possible Gini Coefficient is 1.
The Gini Coefficient for Method A in the above figure is 0.84; the Gini for Method B is 0.69. Random
prediction will achieve a Gini of 0 (as seen in the formula above, since for random prediction
$\hat{v}_i = v_i$), and a higher Gini will correspond to more separation between the method’s lift curve
and random, which means better prediction.
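
A direct translation of the Gini formula into code might look like the sketch below (in Python, for illustration; the assignment itself asks for R). The function name and toy data are hypothetical. The sketch assumes that the two proportions are expressed as fractions between 0 and 1 and that ties in predicted probability are broken by sort order.

```python
def gini_coefficient(scores, churned):
    """Approximate Gini: (2 / n) * sum over customers of (v_i - v_hat_i)."""
    n = len(scores)
    total_churners = sum(churned)
    # Rank customers from most to least likely to churn.
    ranked = sorted(zip(scores, churned), key=lambda pair: pair[0], reverse=True)

    churners_so_far = 0
    area = 0.0
    for rank, (_, flag) in enumerate(ranked, start=1):
        churners_so_far += flag
        v = churners_so_far / total_churners   # height of the method's lift curve
        v_hat = rank / n                       # height of the random lift curve
        area += v - v_hat
    return 2.0 * area / n


# Toy data: 10 customers, 2 churners, both ranked at the very top.
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
perfect = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
print(f"{gini_coefficient(scores, perfect):.2f}")
# prints: 0.80
```

In this discrete form a perfect ranking yields 1 minus the churn rate (0.80 in the toy data, where 20% of customers churn), which approaches 1 when churners are rare, as in the 1.8% validation samples.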
Deliverables:
You, as a group, are required to work on the assignment and prepare a report detailing your
articulation of the problem, including an outline of how you plan to address the issue; the specific
tasks carried out for the purpose (data wrangling, iterative model building, etc.); the usefulness of
the models developed, explained with the help of relevant outputs; and the insights gained from those
that helped you arrive at the final recommendation. This report needs to be uploaded in Word/PDF format.
Note that the report should include the results that you obtained and the plots that you developed, with
due interpretation. In addition, you need to upload an R script file that runs all the code you used to
develop the model. If you use any other tools such as Tableau or Excel, you are required to upload those
files too. You are also required to upload a video presentation detailing the process that you followed
and its outcome, including the interpretation of the various results you obtained and your final
recommendation of the model. This video should be no longer than 15 minutes.
The evaluation will be based on the detailed explanation and interpretation incorporated in the report,
and the video presentation. In addition, you will be given due credit for concepts beyond what was
discussed in the classes as incorporated by you in the task. Though the data is a bit old, the learnings
would surely be as relevant as ever.
Your submissions will be checked for plagiarism, and if found to be plagiarized, will be heavily penalized.

All the best
