SlideShare a Scribd company logo
Data Mining : Concepts
WHAT IS DATA MINING 
ELEMENTS 
TECHNIQUES 
APPLICATIONS
DATA 
MINING 
ELEMENTS TECHNIQUES APPLICATIONS 
Data mining (knowledge discovery from data) 
Extraction of interesting (non-trivial, implicit, previously unknown 
and potentially useful) patterns or knowledge from huge amount of 
data 
Alternative names 
Knowledge discovery (mining) in databases (KDD), knowledge 
extraction, data/pattern analysis, data archeology, data dredging, 
information harvesting, business intelligence, etc.
DATA 
MINING 
KDD 
ELEMENTS TECHNIQUES APPLICATIONS 
Data Mining – Core 
of Knowledge 
Discovery Process 
(KDD)
DATA 
MINING 
ELEMENTS 
TECHNIQUES/ 
ALGORITHMS 
APPLICATIONS 
Data Relationships 
 Sequential 
Patterns 
 Clusters 
Data Mining Techniques 
 Decision Trees 
 Neural Networks 
 Regression 
 Association Rules 
 Nearest Neighbor Method 
 Genetic Algorithm 
 Artificial Intelligence
DATA 
MINING 
ELEMENTS 
TECHNIQUES/ 
ALGORITHMS 
APPLICATIONS 
DATA RELATIONSHIPS 
Sequential Patterns 
 Finding statistically relevant patterns between data 
examples where discrete values delivered in sequence. 
 Problems addressed 
Building efficient databases, indexes for sequence 
information, extracting frequently occurring patterns, 
comparing sequences, recovering missing sequence 
members. 
 Application: 
{Retail Environment} 
Anticipating customer behavior for prediction of future 
customer purchasing habits. 
Increase profit, Decrease cost : Proper management of 
shelf space allocation & products display.
DATA 
MINING 
ELEMENTS 
TECHNIQUES/ 
ALGORITHMS 
APPLICATIONS 
DATA RELATIONSHIPS 
Sequential Patterns {Eg.:}
DATA 
MINING 
ELEMENTS 
TECHNIQUES/ 
ALGORITHMS 
APPLICATIONS 
DATA RELATIONSHIPS 
Clusters 
Placing data elements into 
related groups without advance 
knowledge of the group definitions. 
Popular clustering 
techniques: K-means, 
Expectation Maximization (EM) 
Problems addressed 
 Find natural groupings 
 Preprocess data to 
identify homogeneous groups 
on which to build supervised 
models. 
 Anomaly detection
DATA 
MINING 
ELEMENTS 
TECHNIQUES/ 
ALGORITHMS 
APPLICATIONS 
DATA RELATIONSHIPS 
Clusters 
Application: 
 Plant and animal ecology 
Make spatial and temporal comparisons of 
communities of organisms in heterogeneous environments 
 Medical imaging 
differentiate between different types 
of tissue and blood in a three-dimensional image 
 Business and marketing 
Partition the general population of consumers for use 
in market segmentation, product positioning, new product 
development and Selecting test markets.
DATA 
MINING 
Decision Trees 
ELEMENTS 
 In decision tree technique, the root of the decision tree is a simple 
question or condition that has multiple answers. 
 Each answer then leads to a set of questions or conditions that help 
us determine the data so that we can make the final decision based 
on it. 
 For example, we use the following decision tree to determine 
whether or not to play tennis 
TECHNIQUES/ 
ALGORITHMS 
APPLICATIONS 
 Starting at root node, if the outlook 
is overcast then we should 
definitely play tennis. 
 If it is rainy, we should only play 
tennis if the wind is week. 
 If it is sunny then we should play 
tennis in case the humidity is 
normal
DATA 
MINING 
ELEMENTS 
Neural Networks 
TECHNIQUES/ 
ALGORITHMS 
 Set of connected input/output units and each connection has a 
weight present with it. During the learning phase, network learns by 
adjusting weights so as to be able to predict the correct class labels 
of the input tuples. 
 Well suited for continuous valued inputs andoutputs 
 Used to extract patterns and detect trends that are too complex to be 
noticed by. 
 Neural networks are best at identifying patterns or trends in data and 
well suited for prediction of forecasting needs. 
APPLICATIONS 
Example : 
Handwritten character reorganization, for training a computer to 
pronounce English text and many real world business problems and 
have already been successfully applied in many industries.
DATA 
MINING 
Regression 
ELEMENTS 
 Regression technique can be adapted for predication 
 Regression analysis can be used to model the relationship between 
one or more independent variables and dependent variables. In data 
mining independent variables are attributes already known and 
response variables are what we want to predict. 
 However, it cannot be used for areas involving complex variables like 
in sales volumes, stock prices and product failure rates. 
Types of regression methods 
 Linear Regression 
 Multivariate Linear Regression 
 Nonlinear Regression 
 Multivariate Nonlinear Regress 
TECHNIQUES/ 
ALGORITHMS 
APPLICATIONS
DATA 
MINING 
ELEMENTS 
Association Rules 
TECHNIQUES/ 
ALGORITHMS 
APPLICATIONS 
 In association, a pattern is discovered based on a relationship 
between items in the same transaction. 
 E.g. Rule Form : “Body  Head [support, confidence]” 
Application 
Retailers are using association technique to research 
customer’s buying habits. Based on historical sale data, retailers might 
find out that customers always buy crisps when they buy beers, and 
therefore they can put beers and crisps next to each other to save time 
for customer and increase sales. 
Types 
 Multilevel association rule 
 Multidimensional association 
rule 
 Quantitative association rule
DATA 
MINING 
ELEMENTS TECHNIQUES APPLICATIONS 
 Study of frequent flyer data from an Indian Airline 
 Data selected, prepared : 3 most common sectors flown & points 
redeemed for. 
(Note :Incomplete/Inaccurate Data supplied by airlines) 
 Data Mining results: 
 Patterns about customers flying between metropolitan cities 
 Customers that flew between Mumbai-Delhi also flew to other cities 
like Mumbai-Chennai, Mumbai-Kolkata & Mumbai Bangalore. 
 Customers flying Bangalore-Hyderabad also flew Delhi-Bangalore 
 Those who flew Bagdogra - Guwahati did not fly back; instead flew to 
Delhi
DATA 
MINING 
ELEMENTS TECHNIQUES APPLICATIONS 
 Banking information systems contains huge volumes of data both 
operational and historical. 
 Data mining can assist critical decision making processes in a bank. 
 Areas of application: 
 Marketing 
 Risk management and 
default detection 
 Fraud detection 
 Customer relationship 
management 
 Money laundering detection
DATA 
MINING 
 Wikipedia 
ELEMENTS TECHNIQUES APPLICATIONS 
 http://en.wikipedia.org/wiki/Sequential_Pattern_Mini 
ng 
 http://searchbusinessintelligence.techtarget.in/ 
 Indian Journal of Computer Science & Engg 
 Introduction to Data Mining with Case Studies by 
G.K. Gupta
Ad

More Related Content

What's hot (20)

Data mining concepts and work
Data mining concepts and workData mining concepts and work
Data mining concepts and work
Amr Abd El Latief
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
DataminingTools Inc
 
Introduction to Data mining
Introduction to Data miningIntroduction to Data mining
Introduction to Data mining
Hadi Fadlallah
 
Data mining
Data miningData mining
Data mining
Kinza Razzaq
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
Seerat Malik
 
Introduction to Data Mining
Introduction to Data Mining Introduction to Data Mining
Introduction to Data Mining
Sushil Kulkarni
 
Introduction
IntroductionIntroduction
Introduction
neelamoberoi1030
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining
Phi Jack
 
Data mining presentation.ppt
Data mining presentation.pptData mining presentation.ppt
Data mining presentation.ppt
neelamoberoi1030
 
Data Mining
Data MiningData Mining
Data Mining
ksanthosh
 
Overview of Big data(ppt)
Overview of Big data(ppt)Overview of Big data(ppt)
Overview of Big data(ppt)
Shatavisha Roy Chowdhury
 
Data cubes
Data cubesData cubes
Data cubes
Mohammed
 
Introduction to data mining technique
Introduction to data mining techniqueIntroduction to data mining technique
Introduction to data mining technique
Pawneshwar Datt Rai
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
itnewsafrica
 
Data Warehouse
Data Warehouse Data Warehouse
Data Warehouse
MadhuriNigam1
 
Data analytics
Data analyticsData analytics
Data analytics
BindhuBhargaviTalasi
 
Data warehouse and data mining
Data warehouse and data miningData warehouse and data mining
Data warehouse and data mining
Pradnya Saval
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data Science
Jason Geng
 
Data mining
Data miningData mining
Data mining
Birju Tank
 
What is Data mining? Data mining Presentation
What is Data mining? Data mining Presentation What is Data mining? Data mining Presentation
What is Data mining? Data mining Presentation
Pralhad Rijal
 
Data mining concepts and work
Data mining concepts and workData mining concepts and work
Data mining concepts and work
Amr Abd El Latief
 
Introduction to Data mining
Introduction to Data miningIntroduction to Data mining
Introduction to Data mining
Hadi Fadlallah
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
Seerat Malik
 
Introduction to Data Mining
Introduction to Data Mining Introduction to Data Mining
Introduction to Data Mining
Sushil Kulkarni
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining
Phi Jack
 
Data mining presentation.ppt
Data mining presentation.pptData mining presentation.ppt
Data mining presentation.ppt
neelamoberoi1030
 
Introduction to data mining technique
Introduction to data mining techniqueIntroduction to data mining technique
Introduction to data mining technique
Pawneshwar Datt Rai
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
itnewsafrica
 
Data warehouse and data mining
Data warehouse and data miningData warehouse and data mining
Data warehouse and data mining
Pradnya Saval
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data Science
Jason Geng
 
What is Data mining? Data mining Presentation
What is Data mining? Data mining Presentation What is Data mining? Data mining Presentation
What is Data mining? Data mining Presentation
Pralhad Rijal
 

Viewers also liked (19)

Data mining
Data miningData mining
Data mining
Akannsha Totewar
 
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision MakingFast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Codemotion
 
A review on data mining
A  review on data miningA  review on data mining
A review on data mining
Er. Nancy
 
Ad-hoc Testing – Non-methodical yet Significant
Ad-hoc Testing – Non-methodical yet SignificantAd-hoc Testing – Non-methodical yet Significant
Ad-hoc Testing – Non-methodical yet Significant
Software Testing Solution
 
Data mining applications
Data mining applicationsData mining applications
Data mining applications
Dr. C.V. Suresh Babu
 
EXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONS
EXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONSEXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONS
EXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONS
editorijettcs
 
An introduction to data mining and its techniques
An introduction to data mining and its techniquesAn introduction to data mining and its techniques
An introduction to data mining and its techniques
Sandhya Tarwani
 
Data Mining Techniques
Data Mining TechniquesData Mining Techniques
Data Mining Techniques
Houw Liong The
 
Data Mining
Data MiningData Mining
Data Mining
R A Akerkar
 
1.3 applications, issues
1.3 applications, issues1.3 applications, issues
1.3 applications, issues
Krish_ver2
 
Data Mining: an Introduction
Data Mining: an IntroductionData Mining: an Introduction
Data Mining: an Introduction
Ali Abbasi
 
Lecture 01 Data Mining
Lecture 01 Data MiningLecture 01 Data Mining
Lecture 01 Data Mining
Pier Luca Lanzi
 
Weka presentation
Weka presentationWeka presentation
Weka presentation
Saeed Iqbal
 
Introduction to R for Data Mining (Feb 2013)
Introduction to R for Data Mining (Feb 2013)Introduction to R for Data Mining (Feb 2013)
Introduction to R for Data Mining (Feb 2013)
Revolution Analytics
 
Web mining slides
Web mining slidesWeb mining slides
Web mining slides
mahavir_a
 
The comparative study of apriori and FP-growth algorithm
The comparative study of apriori and FP-growth algorithmThe comparative study of apriori and FP-growth algorithm
The comparative study of apriori and FP-growth algorithm
deepti92pawar
 
Decision trees
Decision treesDecision trees
Decision trees
Jagjit Wilku
 
Decision tree example problem
Decision tree example problemDecision tree example problem
Decision tree example problem
SATYABRATA PRADHAN
 
Data mining
Data miningData mining
Data mining
Samir Sabry
 
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision MakingFast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Codemotion
 
A review on data mining
A  review on data miningA  review on data mining
A review on data mining
Er. Nancy
 
Ad-hoc Testing – Non-methodical yet Significant
Ad-hoc Testing – Non-methodical yet SignificantAd-hoc Testing – Non-methodical yet Significant
Ad-hoc Testing – Non-methodical yet Significant
Software Testing Solution
 
EXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONS
EXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONSEXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONS
EXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONS
editorijettcs
 
An introduction to data mining and its techniques
An introduction to data mining and its techniquesAn introduction to data mining and its techniques
An introduction to data mining and its techniques
Sandhya Tarwani
 
Data Mining Techniques
Data Mining TechniquesData Mining Techniques
Data Mining Techniques
Houw Liong The
 
1.3 applications, issues
1.3 applications, issues1.3 applications, issues
1.3 applications, issues
Krish_ver2
 
Data Mining: an Introduction
Data Mining: an IntroductionData Mining: an Introduction
Data Mining: an Introduction
Ali Abbasi
 
Weka presentation
Weka presentationWeka presentation
Weka presentation
Saeed Iqbal
 
Introduction to R for Data Mining (Feb 2013)
Introduction to R for Data Mining (Feb 2013)Introduction to R for Data Mining (Feb 2013)
Introduction to R for Data Mining (Feb 2013)
Revolution Analytics
 
Web mining slides
Web mining slidesWeb mining slides
Web mining slides
mahavir_a
 
The comparative study of apriori and FP-growth algorithm
The comparative study of apriori and FP-growth algorithmThe comparative study of apriori and FP-growth algorithm
The comparative study of apriori and FP-growth algorithm
deepti92pawar
 
Ad

Similar to Data Mining : Concepts (20)

Data mining
Data miningData mining
Data mining
Murniana Shazwen
 
Data mining
Data miningData mining
Data mining
Murniana Shazwen
 
Data mining-basic
Data mining-basicData mining-basic
Data mining-basic
gufranresearcher
 
Data Mining
Data MiningData Mining
Data Mining
Gary Stefan
 
DataMining Techniq
DataMining TechniqDataMining Techniq
DataMining Techniq
Respa Peter
 
Abstract
AbstractAbstract
Abstract
raghavansrini7
 
McKinsey Global Institute Big data The next frontier for innova.docx
McKinsey Global Institute Big data The next frontier for innova.docxMcKinsey Global Institute Big data The next frontier for innova.docx
McKinsey Global Institute Big data The next frontier for innova.docx
andreecapon
 
The Data Scientist’s Toolkit: Key Techniques for Extracting Value
The Data Scientist’s Toolkit: Key Techniques for Extracting ValueThe Data Scientist’s Toolkit: Key Techniques for Extracting Value
The Data Scientist’s Toolkit: Key Techniques for Extracting Value
pallavichauhan2525
 
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
Editor IJCATR
 
IRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET- Survey of Estimation of Crop Yield using Agriculture DataIRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET Journal
 
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
ijdpsjournal
 
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
ijdpsjournal
 
Seminar Presentation
Seminar PresentationSeminar Presentation
Seminar Presentation
Vaibhav Dhattarwal
 
Z36149154
Z36149154Z36149154
Z36149154
IJERA Editor
 
DATA MINING DC Presentation.pptx
DATA MINING DC Presentation.pptxDATA MINING DC Presentation.pptx
DATA MINING DC Presentation.pptx
SaravanaD2
 
Eckovation Machine Learning
Eckovation Machine LearningEckovation Machine Learning
Eckovation Machine Learning
Shikhar Srivastava
 
Machine Learning and Data Analytics in Semiconductor Yield Management.pptx
Machine Learning and Data Analytics in Semiconductor Yield Management.pptxMachine Learning and Data Analytics in Semiconductor Yield Management.pptx
Machine Learning and Data Analytics in Semiconductor Yield Management.pptx
yieldWerx Semiconductor
 
CUSTOMER CHURN PREDICTION
CUSTOMER CHURN PREDICTIONCUSTOMER CHURN PREDICTION
CUSTOMER CHURN PREDICTION
IRJET Journal
 
Unlock the power of information: Data Science Course In Kerala
Unlock the power of information: Data Science Course In KeralaUnlock the power of information: Data Science Course In Kerala
Unlock the power of information: Data Science Course In Kerala
paulwalkerpw334
 
Study of Data Mining Methods and its Applications
Study of  Data Mining Methods and its ApplicationsStudy of  Data Mining Methods and its Applications
Study of Data Mining Methods and its Applications
IRJET Journal
 
DataMining Techniq
DataMining TechniqDataMining Techniq
DataMining Techniq
Respa Peter
 
McKinsey Global Institute Big data The next frontier for innova.docx
McKinsey Global Institute Big data The next frontier for innova.docxMcKinsey Global Institute Big data The next frontier for innova.docx
McKinsey Global Institute Big data The next frontier for innova.docx
andreecapon
 
The Data Scientist’s Toolkit: Key Techniques for Extracting Value
The Data Scientist’s Toolkit: Key Techniques for Extracting ValueThe Data Scientist’s Toolkit: Key Techniques for Extracting Value
The Data Scientist’s Toolkit: Key Techniques for Extracting Value
pallavichauhan2525
 
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
Editor IJCATR
 
IRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET- Survey of Estimation of Crop Yield using Agriculture DataIRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET Journal
 
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
ijdpsjournal
 
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
ijdpsjournal
 
DATA MINING DC Presentation.pptx
DATA MINING DC Presentation.pptxDATA MINING DC Presentation.pptx
DATA MINING DC Presentation.pptx
SaravanaD2
 
Machine Learning and Data Analytics in Semiconductor Yield Management.pptx
Machine Learning and Data Analytics in Semiconductor Yield Management.pptxMachine Learning and Data Analytics in Semiconductor Yield Management.pptx
Machine Learning and Data Analytics in Semiconductor Yield Management.pptx
yieldWerx Semiconductor
 
CUSTOMER CHURN PREDICTION
CUSTOMER CHURN PREDICTIONCUSTOMER CHURN PREDICTION
CUSTOMER CHURN PREDICTION
IRJET Journal
 
Unlock the power of information: Data Science Course In Kerala
Unlock the power of information: Data Science Course In KeralaUnlock the power of information: Data Science Course In Kerala
Unlock the power of information: Data Science Course In Kerala
paulwalkerpw334
 
Study of Data Mining Methods and its Applications
Study of  Data Mining Methods and its ApplicationsStudy of  Data Mining Methods and its Applications
Study of Data Mining Methods and its Applications
IRJET Journal
 
Ad

Recently uploaded (20)

EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbbEDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
JessaMaeEvangelista2
 
GenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.aiGenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.ai
Inspirient
 
C++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptxC++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptx
aquibnoor22079
 
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
gmuir1066
 
Conic Sectionfaggavahabaayhahahahahs.pptx
Conic Sectionfaggavahabaayhahahahahs.pptxConic Sectionfaggavahabaayhahahahahs.pptx
Conic Sectionfaggavahabaayhahahahahs.pptx
taiwanesechetan
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag
fardin123rahman07
 
Cleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdfCleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdf
alcinialbob1234
 
How iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost FundsHow iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost Funds
ireneschmid345
 
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
James Francis Paradigm Asset Management
 
Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..
yuvarajreddy2002
 
Medical Dataset including visualizations
Medical Dataset including visualizationsMedical Dataset including visualizations
Medical Dataset including visualizations
vishrut8750588758
 
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptxPerencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
PareaRusan
 
Principles of information security Chapter 5.ppt
Principles of information security Chapter 5.pptPrinciples of information security Chapter 5.ppt
Principles of information security Chapter 5.ppt
EstherBaguma
 
Simple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptxSimple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptx
ssuser2aa19f
 
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
Molecular methods diagnostic and monitoring of infection  -  Repaired.pptxMolecular methods diagnostic and monitoring of infection  -  Repaired.pptx
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
7tzn7x5kky
 
Stack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptxStack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptx
binduraniha86
 
Developing Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response ApplicationsDeveloping Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response Applications
VICTOR MAESTRE RAMIREZ
 
DPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdfDPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdf
inmishra17121973
 
Minions Want to eat presentacion muy linda
Minions Want to eat presentacion muy lindaMinions Want to eat presentacion muy linda
Minions Want to eat presentacion muy linda
CarlaAndradesSoler1
 
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbbEDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
JessaMaeEvangelista2
 
GenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.aiGenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.ai
Inspirient
 
C++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptxC++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptx
aquibnoor22079
 
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
gmuir1066
 
Conic Sectionfaggavahabaayhahahahahs.pptx
Conic Sectionfaggavahabaayhahahahahs.pptxConic Sectionfaggavahabaayhahahahahs.pptx
Conic Sectionfaggavahabaayhahahahahs.pptx
taiwanesechetan
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag
fardin123rahman07
 
Cleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdfCleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdf
alcinialbob1234
 
How iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost FundsHow iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost Funds
ireneschmid345
 
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
James Francis Paradigm Asset Management
 
Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..
yuvarajreddy2002
 
Medical Dataset including visualizations
Medical Dataset including visualizationsMedical Dataset including visualizations
Medical Dataset including visualizations
vishrut8750588758
 
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptxPerencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
PareaRusan
 
Principles of information security Chapter 5.ppt
Principles of information security Chapter 5.pptPrinciples of information security Chapter 5.ppt
Principles of information security Chapter 5.ppt
EstherBaguma
 
Simple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptxSimple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptx
ssuser2aa19f
 
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
Molecular methods diagnostic and monitoring of infection  -  Repaired.pptxMolecular methods diagnostic and monitoring of infection  -  Repaired.pptx
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
7tzn7x5kky
 
Stack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptxStack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptx
binduraniha86
 
Developing Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response ApplicationsDeveloping Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response Applications
VICTOR MAESTRE RAMIREZ
 
DPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdfDPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdf
inmishra17121973
 
Minions Want to eat presentacion muy linda
Minions Want to eat presentacion muy lindaMinions Want to eat presentacion muy linda
Minions Want to eat presentacion muy linda
CarlaAndradesSoler1
 

Data Mining : Concepts

  • 2. WHAT IS DATA MINING ELEMENTS TECHNIQUES APPLICATIONS
  • 3. DATA MINING ELEMENTS TECHNIQUES APPLICATIONS Data mining (knowledge discovery from data) Extraction of interesting (non-trivial, implicit, previously unknown and potentially useful) patterns or knowledge from huge amount of data Alternative names Knowledge discovery (mining) in databases (KDD), knowledge extraction, data/pattern analysis, data archeology, data dredging, information harvesting, business intelligence, etc.
  • 4. DATA MINING KDD ELEMENTS TECHNIQUES APPLICATIONS Data Mining – Core of Knowledge Discovery Process (KDD)
  • 5. DATA MINING ELEMENTS TECHNIQUES/ ALGORITHMS APPLICATIONS Data Relationships  Sequential Patterns  Clusters Data Mining Techniques  Decision Trees  Neural Networks  Regression  Association Rules  Nearest Neighbor Method  Genetic Algorithm  Artificial Intelligence
  • 6. DATA MINING ELEMENTS TECHNIQUES/ ALGORITHMS APPLICATIONS DATA RELATIONSHIPS Sequential Patterns  Finding statistically relevant patterns between data examples where discrete values delivered in sequence.  Problems addressed Building efficient databases, indexes for sequence information, extracting frequently occurring patterns, comparing sequences, recovering missing sequence members.  Application: {Retail Environment} Anticipating customer behavior for prediction of future customer purchasing habits. Increase profit, Decrease cost : Proper management of shelf space allocation & products display.
  • 7. DATA MINING ELEMENTS TECHNIQUES/ ALGORITHMS APPLICATIONS DATA RELATIONSHIPS Sequential Patterns {Eg.:}
  • 8. DATA MINING ELEMENTS TECHNIQUES/ ALGORITHMS APPLICATIONS DATA RELATIONSHIPS Clusters Placing data elements into related groups without advance knowledge of the group definitions. Popular clustering techniques: K-means, Expectation Maximization (EM) Problems addressed  Find natural groupings  Preprocess data to identify homogeneous groups on which to build supervised models.  Anomaly detection
  • 9. DATA MINING ELEMENTS TECHNIQUES/ ALGORITHMS APPLICATIONS DATA RELATIONSHIPS Clusters Application:  Plant and animal ecology Make spatial and temporal comparisons of communities of organisms in heterogeneous environments  Medical imaging differentiate between different types of tissue and blood in a three-dimensional image  Business and marketing Partition the general population of consumers for use in market segmentation, product positioning, new product development and Selecting test markets.
  • 10. DATA MINING Decision Trees ELEMENTS  In decision tree technique, the root of the decision tree is a simple question or condition that has multiple answers.  Each answer then leads to a set of questions or conditions that help us determine the data so that we can make the final decision based on it.  For example, we use the following decision tree to determine whether or not to play tennis TECHNIQUES/ ALGORITHMS APPLICATIONS  Starting at root node, if the outlook is overcast then we should definitely play tennis.  If it is rainy, we should only play tennis if the wind is week.  If it is sunny then we should play tennis in case the humidity is normal
  • 11. DATA MINING ELEMENTS Neural Networks TECHNIQUES/ ALGORITHMS  Set of connected input/output units and each connection has a weight present with it. During the learning phase, network learns by adjusting weights so as to be able to predict the correct class labels of the input tuples.  Well suited for continuous valued inputs andoutputs  Used to extract patterns and detect trends that are too complex to be noticed by.  Neural networks are best at identifying patterns or trends in data and well suited for prediction of forecasting needs. APPLICATIONS Example : Handwritten character reorganization, for training a computer to pronounce English text and many real world business problems and have already been successfully applied in many industries.
  • 12. DATA MINING Regression ELEMENTS  Regression technique can be adapted for predication  Regression analysis can be used to model the relationship between one or more independent variables and dependent variables. In data mining independent variables are attributes already known and response variables are what we want to predict.  However, it cannot be used for areas involving complex variables like in sales volumes, stock prices and product failure rates. Types of regression methods  Linear Regression  Multivariate Linear Regression  Nonlinear Regression  Multivariate Nonlinear Regress TECHNIQUES/ ALGORITHMS APPLICATIONS
  • 13. DATA MINING ELEMENTS Association Rules TECHNIQUES/ ALGORITHMS APPLICATIONS  In association, a pattern is discovered based on a relationship between items in the same transaction.  E.g. Rule Form : “Body  Head [support, confidence]” Application Retailers are using association technique to research customer’s buying habits. Based on historical sale data, retailers might find out that customers always buy crisps when they buy beers, and therefore they can put beers and crisps next to each other to save time for customer and increase sales. Types  Multilevel association rule  Multidimensional association rule  Quantitative association rule
  • 14. DATA MINING ELEMENTS TECHNIQUES APPLICATIONS  Study of frequent flyer data from an Indian Airline  Data selected, prepared : 3 most common sectors flown & points redeemed for. (Note :Incomplete/Inaccurate Data supplied by airlines)  Data Mining results:  Patterns about customers flying between metropolitan cities  Customers that flew between Mumbai-Delhi also flew to other cities like Mumbai-Chennai, Mumbai-Kolkata & Mumbai Bangalore.  Customers flying Bangalore-Hyderabad also flew Delhi-Bangalore  Those who flew Bagdogra - Guwahati did not fly back; instead flew to Delhi
  • 15. DATA MINING ELEMENTS TECHNIQUES APPLICATIONS  Banking information systems contains huge volumes of data both operational and historical.  Data mining can assist critical decision making processes in a bank.  Areas of application:  Marketing  Risk management and default detection  Fraud detection  Customer relationship management  Money laundering detection
  • 16. DATA MINING  Wikipedia ELEMENTS TECHNIQUES APPLICATIONS  http://en.wikipedia.org/wiki/Sequential_Pattern_Mini ng  http://searchbusinessintelligence.techtarget.in/  Indian Journal of Computer Science & Engg  Introduction to Data Mining with Case Studies by G.K. Gupta