SlideShare a Scribd company logo
International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011
DOI: 10.5121/ijdms.2011.3401 1
CUSTOMER DATA CLUSTERING USING DATA
MINING TECHNIQUE
Dr. Sankar Rajagopal
Enterprise DW/BI Consultant
Tata Consultancy Services, Newark, DE, USA
e-mail:rajagopal.sankar@tcs.com
ABSTRACT:
Classification and patterns extraction from customer data is very important for business support and
decision making. Timely identification of newly emerging trends is very important in business process.
Large companies are having huge volume of data but starving for knowledge. To overcome the
organization current issue, the new breed of technique is required that has intelligence and capability to
solve the knowledge scarcity and the technique is called Data mining. The objectives of this paper are to
identify the high-profit, high-value and low-risk customers by one of the data mining technique - customer
clustering. In the first phase, cleansing the data and developed the patterns via demographic clustering
algorithm using IBM I-Miner. In the second phase, profiling the data, develop the clusters and identify the
high-value low-risk customers. This cluster typically represents the 10-20 percent of customers which
yields 80% of the revenue.
KEYWORDS:
Data mining, customer clustering and I-Miner
1. INTRODUCTION
For a successful business, identification of high-profit, low-risk customers, retaining those
customers and bring the next level customers to above cluster is a key tasks for business owners
and marketers. Traditionally, marketers must first identify customer cluster using a mathematical
mode and then implement an efficient campaign plan to target profitable customers (1-4). This
process confronts considerable problems. Most previous studies used various mathematical
models to segment customers without considering the correlation between customer cluster and a
campaign/loyalty programs. Moreover, due to advances in computing and information storage
areas large companies are piling up vast volume of data and the traditional mathematical models
are difficult predicts the segmentations and patterns. As a result, useful information is often
overlooked, and the potential benefits of increased computational and data gathering capabilities
are only partially realized. Only through data mining techniques, it is possible to extract useful
pattern and association from the customer data (5). Data mining techniques like clustering and
associations can be used to find meaningful patterns for future predictions (6,7).Customer
International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011
2
clustering and segmentation are two of the most important techniques used in marketing and
customer-relationship management. They use customer-purchase transaction data to track buying
behavior and create strategic business initiatives. Businesses can use this data to divide customers
into segments based on such "shareholder value" variables as current customer profitability, some
measure of risk, a measure of the lifetime value of a customer, and retention probability. Creating
customer segments based on such variables highlights obvious marketing opportunities.
The paper is organized as follows. Section 2 discuss about Background research. Section 3 briefly
reviews data mining and different clustering techniques. The proposed architecture, experiments
and results are discussed in the section 4. Section 5 concludes the paper and gives suggestions for
future work.
2. RESEARCH BAGROUND
In traditional markets, customer clustering / segmentation is one of the most significant methods
used in studies of marketing. This study classifies existing customer cluster/segmentation
methods into methodology-oriented and application-oriented approaches. Most methodology-
driven studies used mathematical methodologies; e.g statistics, neural net, generic algorithm (GA)
and Fuzzy set to identify the optimized segmented homogenous group (8-15).
In recent years, it has been recognized that the partitioned clustering technique is well suited for
clustering a large dataset due to their relatively low computational requirements. Behavioral
clustering and segmentation help derive strategic marketing initiatives by using the variables that
determine customer shareholder value. By conducting demographic clustering and segmentation
within the behavioral segments, we can define tactical marketing campaigns and select the
appropriate marketing channel and advertising for the tactical campaign. It is then possible to
target those customers most likely to exhibit the desired behavior by creating predictive models.
In this work demographic clustering algorithm is used to identify the customer clustering. In
phase 1, the customer data is cleansed and developed patterns using various parameters and
subsequently, in phase 2 profiled the data, developed the clusters and identified the high-value
low risk customers. From the experimental results it showed that the proposed approach would
generate more useful pattern from large data.
3. DATA MINING AND CLUSTERING METHODS
Data mining - also known as knowledge-discovery in databases (KDD) is process of extracting
potentially useful information from raw data. A software engine can scan large amounts of data
and automatically report interesting patterns without requiring human intervention. Other
knowledge discovery technologies are Statistical Analysis, OLAP, Data Visualization, and Ad
hoc queries. Unlike these technologies, data mining does not require a human to ask specific
questions.
In general, Data mining has four major relationships. They are:
(i) Classes
(ii) Clusters
International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011
3
(iii)Associations
(iv) Sequential patterns.
(i) Classes: Stored data is used to locate data in predetermined groups. For example, a
restaurant chain could mine customer purchase data to determine when customers
visit and what they typically order. This information could be used to increase traffic
by having daily specials.
(ii) Clusters: Data items are grouped according to logical relationships or consumer
preferences. For example, data can be mined to identify market segments or
consumer affinities.
(iii) Associations: Data can be mined to identify associations. The beer-diaper example is
an example of associative mining.
(iv) Sequential patterns: Data is mined to anticipate behavior patterns and trends. For
example, an outdoor equipment retailer could predict the likelihood of a backpack
being purchased based on a consumer's purchase of sleeping bags and hiking shoes.
3.1 Clustering Methods:
Clustering is a typical unsupervised learning technique for grouping similar data points. A
clustering algorithm assigns a large number of data points to a smaller number of groups such that
data points in the same group share the same properties while, in different groups, they are
dissimilar. Clustering has many applications, including part family formation for group
technology, image segmentation, information retrieval, web pages grouping, market
segmentation, and scientific and engineering analysis (16).
Many clustering methods have been proposed and they can be broadly classified into four
categories (17-24): partitioning methods, hierarchical methods, density-based methods and grid-
based methods. Other clustering techniques that do not fir in these categories have been
developed. They are fuzzy clustering, artificial neural networks and generic algorithms.
The following section deals about detailed study of the customer clustering. The data is the
production information of our organization smart retail store.
3.2 Customer Clustering:
Customer clustering is the most important data mining methodologies used in marketing and
customer relationship management (CRM). Customer clustering would use customer-purchase
transaction data to track buying behavior and create strategic business initiatives.
Companies want to keep high-profit, high-value, and low-risk customers. This cluster typically
represents the 10 to 20 percent of customers who create 50 to 80 percent of a company's profits.
A company would not want to lose these customers, and the strategic initiative for the segment is
obviously retention. A low-profit, high-value, and low-risk customer segment is also an attractive
one, and the obvious goal here would be to increase profitability for this segment. Cross-selling
(selling new products) and up-selling (selling more of what customers currently buy) to this
segment are the marketing initiatives of choice.
International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011
4
3.3 Proposed Architecture:
The proposed approach is a two phased model. In first phase, collect the data from our
organization retail smart store and then do the data cleansing. It involves removing the noise first,
so the incomplete, missing and irrelevant data are removed and formatted according to the
required format. In second phase, generate the clusters and profile the clusters to identify by best
clusters. Fig.1 illustrates the whole process.
Fig: 1 Clustering Process
4. EXPERIMENTS AND RESULTS:
For this study, the transaction of data of our organization retail smart store has been taken. Using
these data, customers have been clustered using IBM Intelligent Miner tool. The first steps in the
clustering process involve selecting the data set and the algorithm. There are two types of
algorithms available in I-Miner process.
i. Demographic clustering process
ii. Neural clustering process.
International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011
5
In this exercise, the Demographic clustering process has been chosen, since it works best for the
continuous data type. The data set has all the data types are continuous.
The next step in the process is to choose the basic run parameters for the process. The basic
parameters available for demographic clustering include are:
• Maximum number of clusters
• Maximum number of passes through the data
• Accuracy
• Similarity threshold
For this assignment, the maximum number of clusters is 4, maximum number of passes through
the data is 3 and the accuracy is 0.5 have been chosen.
The input parameters for the customer’s clustering are:
1. Recency
2. Total customer profit
3. Total customer revenue
4. Top revenue Department
The top revenue department variable has been chosen as supplementary variable and rest of the
variables have been chosen as active variables. The supplementary variables are used to profile
the clusters and not to define them. The ability to add supplementary variables at the outset of
clustering is a very useful feature of Intelligent Miner that allows quick and easy interpretation of
clusters using data other than the input variables. The input and output field’s width are defined
and The input data used in mining is the production data of our organization retail smart store.
(Synthesized based on the experience).
The data is first extracted from the oracle databases and flat files and converted into flat files.
Subsequently, the IMiner process picks up the file and processed. The following figures show the
snapshot of some of the IMiner process output screens. The entire output data set would have
customer information appended to the end of the each record.
The below figures 2 show the results of the clustering run in Intelligent Miner's cluster visualizer,
which was used by both demographic and neural clustering (clustering mode). Each of the
horizontal strips represents a cluster (with its ID number on the left). The clusters are ordered
from top to bottom in order of size, with the number to the penultimate left indicate ng the size of
a cluster as a percentage of the universe.
International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011
6
Fig. 2: I-Miner Results
International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011
7
Variables are ordered from left to right in order of importance to the cluster, based on a chi-
square test between variable and cluster ID. This is the default metric. Other metrics include
entropy, Condorcet criterion, and database order.
The following figure shows the characteristics of the clusters along with cluster id (Name) and
size of the each cluster. The variables used to define clusters are without brackets, while the
supplementary variables appear within brackets. Numeric (integer), discrete numeric (small
integer), binary, and continuous variables have their frequency distribution or histogram shown as
a bar graph. The red bars in the foreground indicate the distribution of the variable within the
current cluster. The gray solid bars in the background indicate the distribution of the variable in
the whole universe. The more different the cluster distribution is from the average, the more
interesting or distinct the cluster.
The output of the I-miner clustering would be generated as a fixed length flat file which contains
the output fields along with cluster ids, condorcet values and the confidence values.
Subsequently, the flat file would be loaded into ORACLE table for profiling using SQL loader.
4.1 Cluster Profiling
The next step in the clustering process is to profile the clusters by executing SQL queries. The
purpose of profiling is to assess the potential business value of each cluster quantitatively by
profiling the aggregate values of the shareholder value variables by cluster.
Below table 1 provides values of a profile of revenue, profit and total revenue in percentage. The
table shows that cluster 1 is the most profitable cluster, representing about 35 percent of the
revenue yet only 6 percent of the customers.
International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011
8
Table 1: Clustering results with total revenue, customer count, total profit etc.
CLUSTER
RANK
CLUSTER ID CUSTOMER
COUNT
TOTAL
PROFIT
TOTAL
REVENUE
CUSTOMER
COUNT(%)
TOTAL
REV.(%)
1 3 486 117256.59 563643.71 6.16 35.47
2 1 5977 85812.08 382009.43 75.72 24.04
3 2 529 82169.06 379789.03 6.70 23.90
4 0 902 51561.46 263656 11.43 16.59
From this result, it is possible to derive some high-level business strategies. It is obvious that the
best customers (considering only the data contained in the table) are falls in cluster 3. These
customers have higher revenue per person than the customers of other clusters, as indicated by the
total revenue column. The cluster 3 has high value, low cost and is classified as High value
cluster. The cluster 1 has high revenue, high cost and is classified as Medium value. The cluster 2
has low revenue, low cost and is classified as Low value. The cluster 0 has low revenue, high cost
and is classified as Negative value.
Some possible strategies include:
• A retention strategy for best customers (those in cluster 3)
• Cross sell strategy can be implanted between cluster 3 & 2 because they are close in
value.
• The strategy for cluster 1 would be to wait and see. It appears to be a group of new
customers for which we have not yet collected sufficient data to determine what
behaviors they may exhibit.
• Cluster 0 appears to be the worst cluster, with a very low revenue percentage.
International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011
9
5. CONCLUSION AND FUTURE SCOPE
Identification of high-profit, high-value and low-risk customers via the data mining technique -
customer clustering has been studied using IBM Intelligent Miner. Our organization retail smart
store data is used for this study. This study uses demographic clustering technique for customer
clustering. The final results demonstrate that the proposed approach revealed the high-value
customers. From the results, cluster 3 has high revenue customers and represents about 35 percent
of the revenue yet only by 6 percent of the customers. The cluster 3 has high value, low cost and
is classified as High value cluster. The cluster 1 has high revenue, high cost and is classified as
Medium value. The cluster 2 has low revenue, low cost and is classified as low value. Also,
suggested possible strategies to retain and move the customers from lower band to upper band.
5.1 Future Scope:
Based on the results our organization retail smart store decided to undertake lot of loyalty
programs for smart store customers. The further work on segmentation (clustering) using more
detailed behavioral data and opportunity identification using association algorithms within the
segments discovered. Other possible future works are association of products and customer
segmentations for cross-selling (selling new products) and up-selling (selling more of what
customers currently buy).
ACKNOWLEDGEMENTS
Author would like to thanks to his colleague Mr. Nagarajan Karuppiah, Tata Consultancy
Services for his valuable support and help.
REFERENCES
[1] Lefait, G. and Kechadi, T, (2010) “Customer Segmentation Architecture Based on Clustering
Techniques” Digital Society, ICDS’10, Fourth International Conference, 10-02-2010.
[2] Fraley, Andrew, and Thearting, Kurt (1999). Increasing customer value by integrating data mining
and campaign management software. DataManagement, 49–53.
[3] P. Bhargavi and S.Jyothi, (2009) “Applying Naïve Bayes Data Mining Technique for Classification of
Agricultural land Soils” IJCSNS International Journal of computer Science and Network Security,
VOL. 9 No.8, August 117-122.
[4] I.Krishna Murthy, “Data Mining- Statistics Applications: A Key to Managerial Decision Making”,
Article/Report indiastat.com, April-May 2010.
[5] Association Analysis of Customer Services from the Enterprise Customer Management System-
ICDM-2006.
[6] Terry Harris, (2008)“Optimization creates lean green supply chains”, Data Mining Book
[7] Matt Hartely (2005)“Using Data Mining to predict inventory levels” Data Mining Book
[8] Hu, Tung-Lai, & Sheub, Jiuh-Biing (2003). “A fuzzy-based customer classification method for
demand-responsive logistical distribution operations”, Fuzzy Sets and Systems, 139, 431–450.
International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011
10
[9] Hwang, Hyunseok, Jung, Taesoo, & Suh, Euiho (2004). “An LTV model and customer segmentation
based on customer value: A case study on the wireless telecommunication industry”, Expert Systems
with Applications, 26, 181–188.
[10] Jiao, Jianxin, & Zhang, Yiyang (2005). “Product portfolio identification based on association rule
mining” Computer-Aided Design, 37, 149–172.
[11] Jonker, Jedid-Jah, Piersma, Nanda, & Poel, Dirk Van den (2004). “Joint optimization of customer
segmentation and marketing policy to maximize long-term profitability”. Expert Systems with
Applications, 27, 159–168.
[12] Kim, Su-Yeon, Jung, Tae-Soo, Suh, Eui-Ho, & Hwang, Hyun-Seok (2006). “Customer segmentation
and strategy development based on customer life time value”: A case study. Expert Systems with
Applications, 31, 101–107.
[13] Kim, Yong Seog, & Street, W. Nick (2004). “An intelligent system for customer targeting: A data
mining approach”. Decision Support Systems, 37, 215–228.
[14] Kim, Yong Seog, Street, W. Nick, Russell, Gary J., & Menczer, Filippo (2005). “Customer Targeting:
A Neural Network Approach Guided by Genetic Algorithms.” Management Science, 51(2), 264– 276.
[15] Kuo, R. J., An, Y. L., Wang, H. S., & Chung, W. J. (2006). “Integration of self-organizing feature
maps neural net work and genetic K-means algorithm for market segmentation”. Expert Systems with
Applications, 30, 313–324.[
[16] Pham, D.T. and Afify, A.A. (2006) “Clustering techniques and their applications in engineering”.
Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering
Science,
[17] A.K. Jain, M.N. Murty, and P. J. Flynn,(1999) “Data clustering: a review”, ACM Computing Surveys
(CSUR), Vol.31, Issue 3,, 1999.
[18] Grabmeier, J. and Rudolph, A. (2002) “Techniques of cluster algorithms in data mining” Data Mining
and Knowledge Discovery, 6, 303-360.
[19] Han, J. and Kamber, M. (2001) “Data Mining: Concepts and Techniques”, (Academic Press, San
Diego, California, USA).
[20] Jiyuan An , Jeffrey Xu Yu , Chotirat Ann Ratanamahatana , Yi-Ping Phoebe Chen,(2007) “A
dimensionality reduction algorithm and its application for interactive visualization”, Journal of Visual
Languages and Computing, v.18 n.1, , February p.48-70.
[21] Nargess Memarsadeghi , Dianne P. O'Leary,(2003) “Classified Information: The Data Clustering
Problem”, Computing in Science and Engineering, v.5 n.5, September p.54-60,
[22] Yifan Li , Jiawei Han , Jiong Yang, (2004) “Clustering moving objects”, Proceedings of the tenth
ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25,
Seattle, WA, USA
[23] Sherin M. Youssef , Mohamed Rizk , Mohamed El-Sherif, (2008) “Enhanced swarm-like agents for
dynamically adaptive data clustering”, Proceedings of the 2nd WSEAS International Conference on
Computer Engineering and Applications, p.213-219, January 25-27, Acapulco, Mexico
International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011
11
[24] Marcel Brun , Chao Sima , Jianping Hua , James Lowey , Brent Carroll , Edward Suh , Edward R.
Dougherty,(2007) Model-based evaluation of clustering validation measures, Pattern Recognition,
v.40 n.3, March p.807-824.
Author:
Sankar Rajagopal received M.Sc., degree in Electronics, M.E degree in Materials
Science and Ph.D in Metallurgical Engineering from Bharathidasan University in
1990, 1992 and Indian Institute of Technology-Madras, Chennai 1999 respectively.
After completion of his doctoral degree, he joined in TATA Consultancy Services as a
Software Consultant. Then he has elevated to position Enterprise DW/BI Architect.
His areas of research interests include Metal Matrix Composites, Mechanical alloying,
Material Informatics, Mechanical Behaviors of Materials, Nanotechnology and Data
Mining and Knowledge Discovery. He has published about 15 contributed peer
reviewed papers at National / International Journals and Conferences. He received
best oral presentation awards in conferences.
Ad

More Related Content

What's hot (20)

BI-Full Document
BI-Full DocumentBI-Full Document
BI-Full Document
Ekemini Anthony Essiet
 
Data Mining in Life Insurance Business
Data Mining in Life Insurance BusinessData Mining in Life Insurance Business
Data Mining in Life Insurance Business
Ankur Khanna
 
USE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTORUSE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTOR
arpit bhadoriya
 
Data mining
Data miningData mining
Data mining
RajThakuri
 
Using Data Mining Techniques in Customer Segmentation
Using Data Mining Techniques in Customer SegmentationUsing Data Mining Techniques in Customer Segmentation
Using Data Mining Techniques in Customer Segmentation
IJERA Editor
 
INTRODUCTION TO BUSINESS INTELLIGENCE and DATA MINING
INTRODUCTION TO BUSINESS INTELLIGENCE and DATA MININGINTRODUCTION TO BUSINESS INTELLIGENCE and DATA MINING
INTRODUCTION TO BUSINESS INTELLIGENCE and DATA MINING
Rajesh Math
 
A Study On Red Box Data Mining Approach
A Study On Red Box Data Mining ApproachA Study On Red Box Data Mining Approach
A Study On Red Box Data Mining Approach
IJCSIS Research Publications
 
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
IJTET Journal
 
U25107111
U25107111U25107111
U25107111
IJERA Editor
 
Data mining in retail industry
Data mining in retail industryData mining in retail industry
Data mining in retail industry
MonicaRaveshanker
 
Data mining PPT
Data mining PPTData mining PPT
Data mining PPT
Kapil Rode
 
Big data overview
Big data overviewBig data overview
Big data overview
Shyam Sunder Budhwar
 
Role of business intelligence in knowledge management
Role of business intelligence in knowledge managementRole of business intelligence in knowledge management
Role of business intelligence in knowledge management
Shakthi Fernando
 
Data Mining based on Hashing Technique
Data Mining based on Hashing TechniqueData Mining based on Hashing Technique
Data Mining based on Hashing Technique
ijtsrd
 
THE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKING
THE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKINGTHE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKING
THE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKING
csijjournal
 
km ppt neew one
km ppt neew onekm ppt neew one
km ppt neew one
Sahil Jain
 
Discovering diamonds under coal piles: Revealing exclusive business intellige...
Discovering diamonds under coal piles: Revealing exclusive business intellige...Discovering diamonds under coal piles: Revealing exclusive business intellige...
Discovering diamonds under coal piles: Revealing exclusive business intellige...
IJERA Editor
 
Data mining
Data miningData mining
Data mining
jadhav_priti
 
Datamining
DataminingDatamining
Datamining
Yaman Çakmaklar
 
Sai kiran goud sem.ppt
Sai kiran goud sem.pptSai kiran goud sem.ppt
Sai kiran goud sem.ppt
Bathini Saikirangoud
 
Data Mining in Life Insurance Business
Data Mining in Life Insurance BusinessData Mining in Life Insurance Business
Data Mining in Life Insurance Business
Ankur Khanna
 
USE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTORUSE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTOR
arpit bhadoriya
 
Using Data Mining Techniques in Customer Segmentation
Using Data Mining Techniques in Customer SegmentationUsing Data Mining Techniques in Customer Segmentation
Using Data Mining Techniques in Customer Segmentation
IJERA Editor
 
INTRODUCTION TO BUSINESS INTELLIGENCE and DATA MINING
INTRODUCTION TO BUSINESS INTELLIGENCE and DATA MININGINTRODUCTION TO BUSINESS INTELLIGENCE and DATA MINING
INTRODUCTION TO BUSINESS INTELLIGENCE and DATA MINING
Rajesh Math
 
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
IJTET Journal
 
Data mining in retail industry
Data mining in retail industryData mining in retail industry
Data mining in retail industry
MonicaRaveshanker
 
Data mining PPT
Data mining PPTData mining PPT
Data mining PPT
Kapil Rode
 
Role of business intelligence in knowledge management
Role of business intelligence in knowledge managementRole of business intelligence in knowledge management
Role of business intelligence in knowledge management
Shakthi Fernando
 
Data Mining based on Hashing Technique
Data Mining based on Hashing TechniqueData Mining based on Hashing Technique
Data Mining based on Hashing Technique
ijtsrd
 
THE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKING
THE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKINGTHE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKING
THE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKING
csijjournal
 
km ppt neew one
km ppt neew onekm ppt neew one
km ppt neew one
Sahil Jain
 
Discovering diamonds under coal piles: Revealing exclusive business intellige...
Discovering diamonds under coal piles: Revealing exclusive business intellige...Discovering diamonds under coal piles: Revealing exclusive business intellige...
Discovering diamonds under coal piles: Revealing exclusive business intellige...
IJERA Editor
 

Viewers also liked (20)

2010-11-08 NSA Technical Symposium
2010-11-08 NSA Technical Symposium2010-11-08 NSA Technical Symposium
2010-11-08 NSA Technical Symposium
Shawn Wells
 
Lacydes moreno blanco
Lacydes moreno blancoLacydes moreno blanco
Lacydes moreno blanco
mateoarevalo123
 
Kempenfelt Gallery of Garage Doors 2
Kempenfelt Gallery of Garage Doors 2Kempenfelt Gallery of Garage Doors 2
Kempenfelt Gallery of Garage Doors 2
Robert Maslen
 
English project
English projectEnglish project
English project
mateoarevalo123
 
Duvi honig (1)
Duvi honig (1)Duvi honig (1)
Duvi honig (1)
Duvi Honig
 
Ministerio de Salud como autoridad Sanitaria
Ministerio de Salud como autoridad SanitariaMinisterio de Salud como autoridad Sanitaria
Ministerio de Salud como autoridad Sanitaria
Universidad del Istmo Chiriqui Panamá
 
Intoxicaciones generacion 28 2010
Intoxicaciones generacion 28 2010Intoxicaciones generacion 28 2010
Intoxicaciones generacion 28 2010
Rocio del Carmen Ruiz Ceceña
 
El reinado de Alfonso XIII. 4º ESO
El reinado de Alfonso XIII. 4º ESOEl reinado de Alfonso XIII. 4º ESO
El reinado de Alfonso XIII. 4º ESO
Mencar Car
 
マイクロフレームワークEnkan(とKotowari)ではじめるREPL駆動開発
マイクロフレームワークEnkan(とKotowari)ではじめるREPL駆動開発マイクロフレームワークEnkan(とKotowari)ではじめるREPL駆動開発
マイクロフレームワークEnkan(とKotowari)ではじめるREPL駆動開発
Yoshitaka Kawashima
 
Seasar conference 2015 sa-compojure
Seasar conference 2015 sa-compojureSeasar conference 2015 sa-compojure
Seasar conference 2015 sa-compojure
Yoshitaka Kawashima
 
Semiologia Geral
Semiologia GeralSemiologia Geral
Semiologia Geral
Ricardo Duarte
 
โภชนาการสุภาพร
โภชนาการสุภาพรโภชนาการสุภาพร
โภชนาการสุภาพร
tassanee chaicharoen
 
Portfolio_and_Risk_Analytics_
Portfolio_and_Risk_Analytics_Portfolio_and_Risk_Analytics_
Portfolio_and_Risk_Analytics_
Ignacio Reyes-Miranda
 
1e4dfe 8b41b82e4c2f36ac1b9e5834ef2cd6d6 (1)
1e4dfe 8b41b82e4c2f36ac1b9e5834ef2cd6d6 (1)1e4dfe 8b41b82e4c2f36ac1b9e5834ef2cd6d6 (1)
1e4dfe 8b41b82e4c2f36ac1b9e5834ef2cd6d6 (1)
arizabaletavale
 
PRIMERA PARTE ENTREGABLE PRODUCCIÓN DE DOCUMENTOS
PRIMERA PARTE ENTREGABLE PRODUCCIÓN DE DOCUMENTOSPRIMERA PARTE ENTREGABLE PRODUCCIÓN DE DOCUMENTOS
PRIMERA PARTE ENTREGABLE PRODUCCIÓN DE DOCUMENTOS
Laura Mejia
 
CoOpPRPlan
CoOpPRPlanCoOpPRPlan
CoOpPRPlan
Amy Rauch
 
信益餅店口述歷史
信益餅店口述歷史信益餅店口述歷史
信益餅店口述歷史
令惠 王
 
Proposiciones
ProposicionesProposiciones
Proposiciones
Luis Plaez
 
Clase 1 egb
Clase 1  egbClase 1  egb
Clase 1 egb
Yazmín Barrera
 
Dreamweaver
DreamweaverDreamweaver
Dreamweaver
Aldair De la hoz
 
2010-11-08 NSA Technical Symposium
2010-11-08 NSA Technical Symposium2010-11-08 NSA Technical Symposium
2010-11-08 NSA Technical Symposium
Shawn Wells
 
Kempenfelt Gallery of Garage Doors 2
Kempenfelt Gallery of Garage Doors 2Kempenfelt Gallery of Garage Doors 2
Kempenfelt Gallery of Garage Doors 2
Robert Maslen
 
Duvi honig (1)
Duvi honig (1)Duvi honig (1)
Duvi honig (1)
Duvi Honig
 
El reinado de Alfonso XIII. 4º ESO
El reinado de Alfonso XIII. 4º ESOEl reinado de Alfonso XIII. 4º ESO
El reinado de Alfonso XIII. 4º ESO
Mencar Car
 
マイクロフレームワークEnkan(とKotowari)ではじめるREPL駆動開発
マイクロフレームワークEnkan(とKotowari)ではじめるREPL駆動開発マイクロフレームワークEnkan(とKotowari)ではじめるREPL駆動開発
マイクロフレームワークEnkan(とKotowari)ではじめるREPL駆動開発
Yoshitaka Kawashima
 
Seasar conference 2015 sa-compojure
Seasar conference 2015 sa-compojureSeasar conference 2015 sa-compojure
Seasar conference 2015 sa-compojure
Yoshitaka Kawashima
 
โภชนาการสุภาพร
โภชนาการสุภาพรโภชนาการสุภาพร
โภชนาการสุภาพร
tassanee chaicharoen
 
1e4dfe 8b41b82e4c2f36ac1b9e5834ef2cd6d6 (1)
1e4dfe 8b41b82e4c2f36ac1b9e5834ef2cd6d6 (1)1e4dfe 8b41b82e4c2f36ac1b9e5834ef2cd6d6 (1)
1e4dfe 8b41b82e4c2f36ac1b9e5834ef2cd6d6 (1)
arizabaletavale
 
PRIMERA PARTE ENTREGABLE PRODUCCIÓN DE DOCUMENTOS
PRIMERA PARTE ENTREGABLE PRODUCCIÓN DE DOCUMENTOSPRIMERA PARTE ENTREGABLE PRODUCCIÓN DE DOCUMENTOS
PRIMERA PARTE ENTREGABLE PRODUCCIÓN DE DOCUMENTOS
Laura Mejia
 
信益餅店口述歷史
信益餅店口述歷史信益餅店口述歷史
信益餅店口述歷史
令惠 王
 
Ad

Similar to Clustering customer data dr sankar rajagopal (20)

AHP Based Data Mining for Customer Segmentation Based on Customer Lifetime Value
AHP Based Data Mining for Customer Segmentation Based on Customer Lifetime ValueAHP Based Data Mining for Customer Segmentation Based on Customer Lifetime Value
AHP Based Data Mining for Customer Segmentation Based on Customer Lifetime Value
IIRindia
 
DEMOGRAPHIC DIVISION OF A MART BY APPLYING CLUSTERING TECHNIQUES
DEMOGRAPHIC DIVISION OF A MART BY APPLYING CLUSTERING TECHNIQUESDEMOGRAPHIC DIVISION OF A MART BY APPLYING CLUSTERING TECHNIQUES
DEMOGRAPHIC DIVISION OF A MART BY APPLYING CLUSTERING TECHNIQUES
IRJET Journal
 
AI - Powered Customer Segmentation and Targeting: Predicting Customer Behavio...
AI - Powered Customer Segmentation and Targeting: Predicting Customer Behavio...AI - Powered Customer Segmentation and Targeting: Predicting Customer Behavio...
AI - Powered Customer Segmentation and Targeting: Predicting Customer Behavio...
IJDKP
 
Market Segmentation Customer Maximum Profit
Market Segmentation Customer Maximum ProfitMarket Segmentation Customer Maximum Profit
Market Segmentation Customer Maximum Profit
vivatechijri
 
All That Glitters Is Not Gold Digging Beneath The Surface Of Data Mining
All That Glitters Is Not Gold  Digging Beneath The Surface Of Data MiningAll That Glitters Is Not Gold  Digging Beneath The Surface Of Data Mining
All That Glitters Is Not Gold Digging Beneath The Surface Of Data Mining
Jim Webb
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
Seerat Malik
 
Data Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship ManagementData Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship Management
IJERA Editor
 
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A SurveyIRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET Journal
 
Data mining-basic
Data mining-basicData mining-basic
Data mining-basic
gufranresearcher
 
Cis 500 assignment 4
Cis 500 assignment 4Cis 500 assignment 4
Cis 500 assignment 4
Brownsugar7578
 
69.pdf
69.pdf69.pdf
69.pdf
tarannumtasneem
 
Big Data Analytics for Predicting Consumer Behaviour
Big Data Analytics for Predicting Consumer BehaviourBig Data Analytics for Predicting Consumer Behaviour
Big Data Analytics for Predicting Consumer Behaviour
IRJET Journal
 
Data mining
Data miningData mining
Data mining
hardavishah56
 
Pcc mktg 6 chapter 3
Pcc mktg 6 chapter 3Pcc mktg 6 chapter 3
Pcc mktg 6 chapter 3
Mark Jason Garingarao
 
Customer Clustering Based on Customer Purchasing Sequence Data
Customer Clustering Based on Customer Purchasing Sequence DataCustomer Clustering Based on Customer Purchasing Sequence Data
Customer Clustering Based on Customer Purchasing Sequence Data
IJERA Editor
 
Data Mining
Data MiningData Mining
Data Mining
AbinayaJawahar
 
What is Data Mining? Key Concepts Explained
What is Data Mining? Key Concepts ExplainedWhat is Data Mining? Key Concepts Explained
What is Data Mining? Key Concepts Explained
Julie Bowie
 
Data Mining
Data MiningData Mining
Data Mining
SOMASUNDARAM T
 
CSHURI – Modified HURI algorithm for Customer Segmentation and Transaction Pr...
CSHURI – Modified HURI algorithm for Customer Segmentation and Transaction Pr...CSHURI – Modified HURI algorithm for Customer Segmentation and Transaction Pr...
CSHURI – Modified HURI algorithm for Customer Segmentation and Transaction Pr...
IJCSEIT Journal
 
Customer Churn Prediction using Association Rule Mining
Customer Churn Prediction using Association Rule MiningCustomer Churn Prediction using Association Rule Mining
Customer Churn Prediction using Association Rule Mining
ijtsrd
 
AHP Based Data Mining for Customer Segmentation Based on Customer Lifetime Value
AHP Based Data Mining for Customer Segmentation Based on Customer Lifetime ValueAHP Based Data Mining for Customer Segmentation Based on Customer Lifetime Value
AHP Based Data Mining for Customer Segmentation Based on Customer Lifetime Value
IIRindia
 
DEMOGRAPHIC DIVISION OF A MART BY APPLYING CLUSTERING TECHNIQUES
DEMOGRAPHIC DIVISION OF A MART BY APPLYING CLUSTERING TECHNIQUESDEMOGRAPHIC DIVISION OF A MART BY APPLYING CLUSTERING TECHNIQUES
DEMOGRAPHIC DIVISION OF A MART BY APPLYING CLUSTERING TECHNIQUES
IRJET Journal
 
AI - Powered Customer Segmentation and Targeting: Predicting Customer Behavio...
AI - Powered Customer Segmentation and Targeting: Predicting Customer Behavio...AI - Powered Customer Segmentation and Targeting: Predicting Customer Behavio...
AI - Powered Customer Segmentation and Targeting: Predicting Customer Behavio...
IJDKP
 
Market Segmentation Customer Maximum Profit
Market Segmentation Customer Maximum ProfitMarket Segmentation Customer Maximum Profit
Market Segmentation Customer Maximum Profit
vivatechijri
 
All That Glitters Is Not Gold Digging Beneath The Surface Of Data Mining
All That Glitters Is Not Gold  Digging Beneath The Surface Of Data MiningAll That Glitters Is Not Gold  Digging Beneath The Surface Of Data Mining
All That Glitters Is Not Gold Digging Beneath The Surface Of Data Mining
Jim Webb
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
Seerat Malik
 
Data Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship ManagementData Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship Management
IJERA Editor
 
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A SurveyIRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET Journal
 
Big Data Analytics for Predicting Consumer Behaviour
Big Data Analytics for Predicting Consumer BehaviourBig Data Analytics for Predicting Consumer Behaviour
Big Data Analytics for Predicting Consumer Behaviour
IRJET Journal
 
Customer Clustering Based on Customer Purchasing Sequence Data
Customer Clustering Based on Customer Purchasing Sequence DataCustomer Clustering Based on Customer Purchasing Sequence Data
Customer Clustering Based on Customer Purchasing Sequence Data
IJERA Editor
 
What is Data Mining? Key Concepts Explained
What is Data Mining? Key Concepts ExplainedWhat is Data Mining? Key Concepts Explained
What is Data Mining? Key Concepts Explained
Julie Bowie
 
CSHURI – Modified HURI algorithm for Customer Segmentation and Transaction Pr...
CSHURI – Modified HURI algorithm for Customer Segmentation and Transaction Pr...CSHURI – Modified HURI algorithm for Customer Segmentation and Transaction Pr...
CSHURI – Modified HURI algorithm for Customer Segmentation and Transaction Pr...
IJCSEIT Journal
 
Customer Churn Prediction using Association Rule Mining
Customer Churn Prediction using Association Rule MiningCustomer Churn Prediction using Association Rule Mining
Customer Churn Prediction using Association Rule Mining
ijtsrd
 
Ad

Recently uploaded (20)

How iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost FundsHow iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost Funds
ireneschmid345
 
C++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptxC++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptx
aquibnoor22079
 
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
James Francis Paradigm Asset Management
 
How to join illuminati Agent in uganda call+256776963507/0741506136
How to join illuminati Agent in uganda call+256776963507/0741506136How to join illuminati Agent in uganda call+256776963507/0741506136
How to join illuminati Agent in uganda call+256776963507/0741506136
illuminati Agent uganda call+256776963507/0741506136
 
Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...
Pixellion
 
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjksPpt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
panchariyasahil
 
Deloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit contextDeloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit context
Process mining Evangelist
 
DPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdfDPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdf
inmishra17121973
 
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
Molecular methods diagnostic and monitoring of infection  -  Repaired.pptxMolecular methods diagnostic and monitoring of infection  -  Repaired.pptx
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
7tzn7x5kky
 
Defense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptxDefense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptx
Greg Makowski
 
GenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.aiGenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.ai
Inspirient
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
computer organization and assembly language.docx
computer organization and assembly language.docxcomputer organization and assembly language.docx
computer organization and assembly language.docx
alisoftwareengineer1
 
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.pptJust-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
ssuser5f8f49
 
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
gmuir1066
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptxPerencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
PareaRusan
 
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTemplate_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
cegiver630
 
04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story
ccctableauusergroup
 
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
ThanushsaranS
 
How iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost FundsHow iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost Funds
ireneschmid345
 
C++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptxC++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptx
aquibnoor22079
 
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
James Francis Paradigm Asset Management
 
Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...
Pixellion
 
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjksPpt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
panchariyasahil
 
Deloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit contextDeloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit context
Process mining Evangelist
 
DPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdfDPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdf
inmishra17121973
 
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
Molecular methods diagnostic and monitoring of infection  -  Repaired.pptxMolecular methods diagnostic and monitoring of infection  -  Repaired.pptx
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
7tzn7x5kky
 
Defense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptxDefense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptx
Greg Makowski
 
GenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.aiGenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.ai
Inspirient
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
computer organization and assembly language.docx
computer organization and assembly language.docxcomputer organization and assembly language.docx
computer organization and assembly language.docx
alisoftwareengineer1
 
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.pptJust-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
ssuser5f8f49
 
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
gmuir1066
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptxPerencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
PareaRusan
 
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTemplate_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
cegiver630
 
04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story
ccctableauusergroup
 
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
ThanushsaranS
 

Clustering customer data dr sankar rajagopal

  • 1. International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011 DOI: 10.5121/ijdms.2011.3401 1 CUSTOMER DATA CLUSTERING USING DATA MINING TECHNIQUE Dr. Sankar Rajagopal Enterprise DW/BI Consultant Tata Consultancy Services, Newark, DE, USA e-mail:[email protected] ABSTRACT: Classification and patterns extraction from customer data is very important for business support and decision making. Timely identification of newly emerging trends is very important in business process. Large companies are having huge volume of data but starving for knowledge. To overcome the organization current issue, the new breed of technique is required that has intelligence and capability to solve the knowledge scarcity and the technique is called Data mining. The objectives of this paper are to identify the high-profit, high-value and low-risk customers by one of the data mining technique - customer clustering. In the first phase, cleansing the data and developed the patterns via demographic clustering algorithm using IBM I-Miner. In the second phase, profiling the data, develop the clusters and identify the high-value low-risk customers. This cluster typically represents the 10-20 percent of customers which yields 80% of the revenue. KEYWORDS: Data mining, customer clustering and I-Miner 1. INTRODUCTION For a successful business, identification of high-profit, low-risk customers, retaining those customers and bring the next level customers to above cluster is a key tasks for business owners and marketers. Traditionally, marketers must first identify customer cluster using a mathematical mode and then implement an efficient campaign plan to target profitable customers (1-4). This process confronts considerable problems. Most previous studies used various mathematical models to segment customers without considering the correlation between customer cluster and a campaign/loyalty programs. Moreover, due to advances in computing and information storage areas large companies are piling up vast volume of data and the traditional mathematical models are difficult predicts the segmentations and patterns. As a result, useful information is often overlooked, and the potential benefits of increased computational and data gathering capabilities are only partially realized. Only through data mining techniques, it is possible to extract useful pattern and association from the customer data (5). Data mining techniques like clustering and associations can be used to find meaningful patterns for future predictions (6,7).Customer
  • 2. International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011 2 clustering and segmentation are two of the most important techniques used in marketing and customer-relationship management. They use customer-purchase transaction data to track buying behavior and create strategic business initiatives. Businesses can use this data to divide customers into segments based on such "shareholder value" variables as current customer profitability, some measure of risk, a measure of the lifetime value of a customer, and retention probability. Creating customer segments based on such variables highlights obvious marketing opportunities. The paper is organized as follows. Section 2 discuss about Background research. Section 3 briefly reviews data mining and different clustering techniques. The proposed architecture, experiments and results are discussed in the section 4. Section 5 concludes the paper and gives suggestions for future work. 2. RESEARCH BAGROUND In traditional markets, customer clustering / segmentation is one of the most significant methods used in studies of marketing. This study classifies existing customer cluster/segmentation methods into methodology-oriented and application-oriented approaches. Most methodology- driven studies used mathematical methodologies; e.g statistics, neural net, generic algorithm (GA) and Fuzzy set to identify the optimized segmented homogenous group (8-15). In recent years, it has been recognized that the partitioned clustering technique is well suited for clustering a large dataset due to their relatively low computational requirements. Behavioral clustering and segmentation help derive strategic marketing initiatives by using the variables that determine customer shareholder value. By conducting demographic clustering and segmentation within the behavioral segments, we can define tactical marketing campaigns and select the appropriate marketing channel and advertising for the tactical campaign. It is then possible to target those customers most likely to exhibit the desired behavior by creating predictive models. In this work demographic clustering algorithm is used to identify the customer clustering. In phase 1, the customer data is cleansed and developed patterns using various parameters and subsequently, in phase 2 profiled the data, developed the clusters and identified the high-value low risk customers. From the experimental results it showed that the proposed approach would generate more useful pattern from large data. 3. DATA MINING AND CLUSTERING METHODS Data mining - also known as knowledge-discovery in databases (KDD) is process of extracting potentially useful information from raw data. A software engine can scan large amounts of data and automatically report interesting patterns without requiring human intervention. Other knowledge discovery technologies are Statistical Analysis, OLAP, Data Visualization, and Ad hoc queries. Unlike these technologies, data mining does not require a human to ask specific questions. In general, Data mining has four major relationships. They are: (i) Classes (ii) Clusters
  • 3. International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011 3 (iii)Associations (iv) Sequential patterns. (i) Classes: Stored data is used to locate data in predetermined groups. For example, a restaurant chain could mine customer purchase data to determine when customers visit and what they typically order. This information could be used to increase traffic by having daily specials. (ii) Clusters: Data items are grouped according to logical relationships or consumer preferences. For example, data can be mined to identify market segments or consumer affinities. (iii) Associations: Data can be mined to identify associations. The beer-diaper example is an example of associative mining. (iv) Sequential patterns: Data is mined to anticipate behavior patterns and trends. For example, an outdoor equipment retailer could predict the likelihood of a backpack being purchased based on a consumer's purchase of sleeping bags and hiking shoes. 3.1 Clustering Methods: Clustering is a typical unsupervised learning technique for grouping similar data points. A clustering algorithm assigns a large number of data points to a smaller number of groups such that data points in the same group share the same properties while, in different groups, they are dissimilar. Clustering has many applications, including part family formation for group technology, image segmentation, information retrieval, web pages grouping, market segmentation, and scientific and engineering analysis (16). Many clustering methods have been proposed and they can be broadly classified into four categories (17-24): partitioning methods, hierarchical methods, density-based methods and grid- based methods. Other clustering techniques that do not fir in these categories have been developed. They are fuzzy clustering, artificial neural networks and generic algorithms. The following section deals about detailed study of the customer clustering. The data is the production information of our organization smart retail store. 3.2 Customer Clustering: Customer clustering is the most important data mining methodologies used in marketing and customer relationship management (CRM). Customer clustering would use customer-purchase transaction data to track buying behavior and create strategic business initiatives. Companies want to keep high-profit, high-value, and low-risk customers. This cluster typically represents the 10 to 20 percent of customers who create 50 to 80 percent of a company's profits. A company would not want to lose these customers, and the strategic initiative for the segment is obviously retention. A low-profit, high-value, and low-risk customer segment is also an attractive one, and the obvious goal here would be to increase profitability for this segment. Cross-selling (selling new products) and up-selling (selling more of what customers currently buy) to this segment are the marketing initiatives of choice.
  • 4. International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011 4 3.3 Proposed Architecture: The proposed approach is a two phased model. In first phase, collect the data from our organization retail smart store and then do the data cleansing. It involves removing the noise first, so the incomplete, missing and irrelevant data are removed and formatted according to the required format. In second phase, generate the clusters and profile the clusters to identify by best clusters. Fig.1 illustrates the whole process. Fig: 1 Clustering Process 4. EXPERIMENTS AND RESULTS: For this study, the transaction of data of our organization retail smart store has been taken. Using these data, customers have been clustered using IBM Intelligent Miner tool. The first steps in the clustering process involve selecting the data set and the algorithm. There are two types of algorithms available in I-Miner process. i. Demographic clustering process ii. Neural clustering process.
  • 5. International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011 5 In this exercise, the Demographic clustering process has been chosen, since it works best for the continuous data type. The data set has all the data types are continuous. The next step in the process is to choose the basic run parameters for the process. The basic parameters available for demographic clustering include are: • Maximum number of clusters • Maximum number of passes through the data • Accuracy • Similarity threshold For this assignment, the maximum number of clusters is 4, maximum number of passes through the data is 3 and the accuracy is 0.5 have been chosen. The input parameters for the customer’s clustering are: 1. Recency 2. Total customer profit 3. Total customer revenue 4. Top revenue Department The top revenue department variable has been chosen as supplementary variable and rest of the variables have been chosen as active variables. The supplementary variables are used to profile the clusters and not to define them. The ability to add supplementary variables at the outset of clustering is a very useful feature of Intelligent Miner that allows quick and easy interpretation of clusters using data other than the input variables. The input and output field’s width are defined and The input data used in mining is the production data of our organization retail smart store. (Synthesized based on the experience). The data is first extracted from the oracle databases and flat files and converted into flat files. Subsequently, the IMiner process picks up the file and processed. The following figures show the snapshot of some of the IMiner process output screens. The entire output data set would have customer information appended to the end of the each record. The below figures 2 show the results of the clustering run in Intelligent Miner's cluster visualizer, which was used by both demographic and neural clustering (clustering mode). Each of the horizontal strips represents a cluster (with its ID number on the left). The clusters are ordered from top to bottom in order of size, with the number to the penultimate left indicate ng the size of a cluster as a percentage of the universe.
  • 6. International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011 6 Fig. 2: I-Miner Results
  • 7. International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011 7 Variables are ordered from left to right in order of importance to the cluster, based on a chi- square test between variable and cluster ID. This is the default metric. Other metrics include entropy, Condorcet criterion, and database order. The following figure shows the characteristics of the clusters along with cluster id (Name) and size of the each cluster. The variables used to define clusters are without brackets, while the supplementary variables appear within brackets. Numeric (integer), discrete numeric (small integer), binary, and continuous variables have their frequency distribution or histogram shown as a bar graph. The red bars in the foreground indicate the distribution of the variable within the current cluster. The gray solid bars in the background indicate the distribution of the variable in the whole universe. The more different the cluster distribution is from the average, the more interesting or distinct the cluster. The output of the I-miner clustering would be generated as a fixed length flat file which contains the output fields along with cluster ids, condorcet values and the confidence values. Subsequently, the flat file would be loaded into ORACLE table for profiling using SQL loader. 4.1 Cluster Profiling The next step in the clustering process is to profile the clusters by executing SQL queries. The purpose of profiling is to assess the potential business value of each cluster quantitatively by profiling the aggregate values of the shareholder value variables by cluster. Below table 1 provides values of a profile of revenue, profit and total revenue in percentage. The table shows that cluster 1 is the most profitable cluster, representing about 35 percent of the revenue yet only 6 percent of the customers.
  • 8. International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011 8 Table 1: Clustering results with total revenue, customer count, total profit etc. CLUSTER RANK CLUSTER ID CUSTOMER COUNT TOTAL PROFIT TOTAL REVENUE CUSTOMER COUNT(%) TOTAL REV.(%) 1 3 486 117256.59 563643.71 6.16 35.47 2 1 5977 85812.08 382009.43 75.72 24.04 3 2 529 82169.06 379789.03 6.70 23.90 4 0 902 51561.46 263656 11.43 16.59 From this result, it is possible to derive some high-level business strategies. It is obvious that the best customers (considering only the data contained in the table) are falls in cluster 3. These customers have higher revenue per person than the customers of other clusters, as indicated by the total revenue column. The cluster 3 has high value, low cost and is classified as High value cluster. The cluster 1 has high revenue, high cost and is classified as Medium value. The cluster 2 has low revenue, low cost and is classified as Low value. The cluster 0 has low revenue, high cost and is classified as Negative value. Some possible strategies include: • A retention strategy for best customers (those in cluster 3) • Cross sell strategy can be implanted between cluster 3 & 2 because they are close in value. • The strategy for cluster 1 would be to wait and see. It appears to be a group of new customers for which we have not yet collected sufficient data to determine what behaviors they may exhibit. • Cluster 0 appears to be the worst cluster, with a very low revenue percentage.
  • 9. International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011 9 5. CONCLUSION AND FUTURE SCOPE Identification of high-profit, high-value and low-risk customers via the data mining technique - customer clustering has been studied using IBM Intelligent Miner. Our organization retail smart store data is used for this study. This study uses demographic clustering technique for customer clustering. The final results demonstrate that the proposed approach revealed the high-value customers. From the results, cluster 3 has high revenue customers and represents about 35 percent of the revenue yet only by 6 percent of the customers. The cluster 3 has high value, low cost and is classified as High value cluster. The cluster 1 has high revenue, high cost and is classified as Medium value. The cluster 2 has low revenue, low cost and is classified as low value. Also, suggested possible strategies to retain and move the customers from lower band to upper band. 5.1 Future Scope: Based on the results our organization retail smart store decided to undertake lot of loyalty programs for smart store customers. The further work on segmentation (clustering) using more detailed behavioral data and opportunity identification using association algorithms within the segments discovered. Other possible future works are association of products and customer segmentations for cross-selling (selling new products) and up-selling (selling more of what customers currently buy). ACKNOWLEDGEMENTS Author would like to thanks to his colleague Mr. Nagarajan Karuppiah, Tata Consultancy Services for his valuable support and help. REFERENCES [1] Lefait, G. and Kechadi, T, (2010) “Customer Segmentation Architecture Based on Clustering Techniques” Digital Society, ICDS’10, Fourth International Conference, 10-02-2010. [2] Fraley, Andrew, and Thearting, Kurt (1999). Increasing customer value by integrating data mining and campaign management software. DataManagement, 49–53. [3] P. Bhargavi and S.Jyothi, (2009) “Applying Naïve Bayes Data Mining Technique for Classification of Agricultural land Soils” IJCSNS International Journal of computer Science and Network Security, VOL. 9 No.8, August 117-122. [4] I.Krishna Murthy, “Data Mining- Statistics Applications: A Key to Managerial Decision Making”, Article/Report indiastat.com, April-May 2010. [5] Association Analysis of Customer Services from the Enterprise Customer Management System- ICDM-2006. [6] Terry Harris, (2008)“Optimization creates lean green supply chains”, Data Mining Book [7] Matt Hartely (2005)“Using Data Mining to predict inventory levels” Data Mining Book [8] Hu, Tung-Lai, & Sheub, Jiuh-Biing (2003). “A fuzzy-based customer classification method for demand-responsive logistical distribution operations”, Fuzzy Sets and Systems, 139, 431–450.
  • 10. International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011 10 [9] Hwang, Hyunseok, Jung, Taesoo, & Suh, Euiho (2004). “An LTV model and customer segmentation based on customer value: A case study on the wireless telecommunication industry”, Expert Systems with Applications, 26, 181–188. [10] Jiao, Jianxin, & Zhang, Yiyang (2005). “Product portfolio identification based on association rule mining” Computer-Aided Design, 37, 149–172. [11] Jonker, Jedid-Jah, Piersma, Nanda, & Poel, Dirk Van den (2004). “Joint optimization of customer segmentation and marketing policy to maximize long-term profitability”. Expert Systems with Applications, 27, 159–168. [12] Kim, Su-Yeon, Jung, Tae-Soo, Suh, Eui-Ho, & Hwang, Hyun-Seok (2006). “Customer segmentation and strategy development based on customer life time value”: A case study. Expert Systems with Applications, 31, 101–107. [13] Kim, Yong Seog, & Street, W. Nick (2004). “An intelligent system for customer targeting: A data mining approach”. Decision Support Systems, 37, 215–228. [14] Kim, Yong Seog, Street, W. Nick, Russell, Gary J., & Menczer, Filippo (2005). “Customer Targeting: A Neural Network Approach Guided by Genetic Algorithms.” Management Science, 51(2), 264– 276. [15] Kuo, R. J., An, Y. L., Wang, H. S., & Chung, W. J. (2006). “Integration of self-organizing feature maps neural net work and genetic K-means algorithm for market segmentation”. Expert Systems with Applications, 30, 313–324.[ [16] Pham, D.T. and Afify, A.A. (2006) “Clustering techniques and their applications in engineering”. Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science, [17] A.K. Jain, M.N. Murty, and P. J. Flynn,(1999) “Data clustering: a review”, ACM Computing Surveys (CSUR), Vol.31, Issue 3,, 1999. [18] Grabmeier, J. and Rudolph, A. (2002) “Techniques of cluster algorithms in data mining” Data Mining and Knowledge Discovery, 6, 303-360. [19] Han, J. and Kamber, M. (2001) “Data Mining: Concepts and Techniques”, (Academic Press, San Diego, California, USA). [20] Jiyuan An , Jeffrey Xu Yu , Chotirat Ann Ratanamahatana , Yi-Ping Phoebe Chen,(2007) “A dimensionality reduction algorithm and its application for interactive visualization”, Journal of Visual Languages and Computing, v.18 n.1, , February p.48-70. [21] Nargess Memarsadeghi , Dianne P. O'Leary,(2003) “Classified Information: The Data Clustering Problem”, Computing in Science and Engineering, v.5 n.5, September p.54-60, [22] Yifan Li , Jiawei Han , Jiong Yang, (2004) “Clustering moving objects”, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25, Seattle, WA, USA [23] Sherin M. Youssef , Mohamed Rizk , Mohamed El-Sherif, (2008) “Enhanced swarm-like agents for dynamically adaptive data clustering”, Proceedings of the 2nd WSEAS International Conference on Computer Engineering and Applications, p.213-219, January 25-27, Acapulco, Mexico
  • 11. International Journal of Database Management Systems ( IJDMS ) Vol.3, No.4, November 2011 11 [24] Marcel Brun , Chao Sima , Jianping Hua , James Lowey , Brent Carroll , Edward Suh , Edward R. Dougherty,(2007) Model-based evaluation of clustering validation measures, Pattern Recognition, v.40 n.3, March p.807-824. Author: Sankar Rajagopal received M.Sc., degree in Electronics, M.E degree in Materials Science and Ph.D in Metallurgical Engineering from Bharathidasan University in 1990, 1992 and Indian Institute of Technology-Madras, Chennai 1999 respectively. After completion of his doctoral degree, he joined in TATA Consultancy Services as a Software Consultant. Then he has elevated to position Enterprise DW/BI Architect. His areas of research interests include Metal Matrix Composites, Mechanical alloying, Material Informatics, Mechanical Behaviors of Materials, Nanotechnology and Data Mining and Knowledge Discovery. He has published about 15 contributed peer reviewed papers at National / International Journals and Conferences. He received best oral presentation awards in conferences.