SlideShare a Scribd company logo
Group 7
What is Data
                  Mining ?



                                                Mining and discovery of new
                                                information in terms of
                                                patterns or
                                                rules from vast amounts of
                                                data.



The process of discovering meaningful new correlations, patterns and trends by sifting
through large amounts of data stored in repositoties, using pattern recognition
technologies as well as statical and methematics techniques.
Why we mine
  Data ?




  Commercial View Point :-
  Lots of data is being collected and warehoused .
  Computers have become cheaper and more powerful.
  Competitive Pressure is Strong .


  Scientific View Point :-
  Data collected and stored at enormous speeds (GB/hour).
  Traditional techniques infeasible for raw data.
  Data mining may help scientists.
On what kind of
   Data...?



          •   Relational databases
          •   Data warehouses
          •   Transactional databases
          •   Advanced database systems:
                   Object-relational
                   Spacial and Temporal
                   Time-series
                   Multimedia, text
                   WWW
What are the goals
 of Data mining?



    • Prediction  e.g. sales volume, earthquakes
    • Identification e.g. existence of genes, system
    intrusions
    • Classification of different categories e.g. discount
    seeking shoppers or loyal regular shoppers in a
    supermarket
    • Optimization of limited resources such as time,
    space, money or materials and maximization of
    outputs such as sales or profits
What are the
      applications of Data-
            Mining ?


● Marketing
                                     ● Finance
 Analysis of consumer behavior
                                      Creditworthiness of clients
 Advertising campaigns
                                      Performance analysis of finance
 Targeted mailings
                                        investments
 Segmentation of
                                      Fraud detection
  customers, stores, or products

● Manufacturing
                                     ● Health Care
 Optimization of resources
                                      Discovering patterns in X-ray
 Optimization of manufacturing
                                        images
  processes
                                      Analyzing side effects of drugs
 Product design based on customer
                                      Effectiveness of treatments
  requirements
What are the present
commercial tools for
   Data Mining ?




                     Data to knowledge
 SAS                                            Oracle data-miner




 Intelligent miner                 Clementine
How to build a data
  mining model?       An important concept is
                      that building a mining
                      model is part of a larger
                      process.
1. Defining
    the
 problem.     Clearly define the business
                       problem.
2. Preparing
    Data       consolidate and clean the data that
               was identified in the Defining the
               Problem step.
3.Exploring
   Data
              Explore the prepared data



       .
4.Building
 Models      Before you build a model, you must
             randomly separate the prepared data into
             separate training and testing datasets.
             You use the training dataset to build the
             model, and the testing dataset to test the
             accuracy of the model by creating
             prediction queries.
5. Exploring
and validating
models           Explore the models that you
                 have built and test their
                 effectiveness.
6. Deploying
and updating
               Deploy to a production
models         environment the models
               that performed the best.
What are the major
issues in Data-Mining
      concept ?

    Mining different kinds of knowledge in databases
    Interactive mining of knowledge at multiple levels of
     abstraction
    Incorporation of background knowledge
    Data mining query languages and ad-hoc data mining
    Expression and visualization of data mining results
    Handling noise and incomplete data
    Pattern evaluation: the interestingness problem
    Integration of the discovered knowledge with existing
     knowledge: A knowledge fusion problem
    Protection of data security, integrity, and privacy
How will be the future of
 Data-Mining concept?




      ● Active research is ongoing
       Neural Networks
       Regression Analysis
       Genetic Algorithms
      ● Data mining is used in many areas today. We
      cannot even begin to imagine what the future
      holds in its womb!
Thank You !
Ad

More Related Content

What's hot (20)

3.7 outlier analysis
3.7 outlier analysis3.7 outlier analysis
3.7 outlier analysis
Krish_ver2
 
ensemble learning
ensemble learningensemble learning
ensemble learning
butest
 
Text analytics
Text analyticsText analytics
Text analytics
Utkarsh Sharma
 
Machine learning Lecture 2
Machine learning Lecture 2Machine learning Lecture 2
Machine learning Lecture 2
Srinivasan R
 
Data Acquisition
Data AcquisitionData Acquisition
Data Acquisition
primeteacher32
 
Computer forensics ppt
Computer forensics pptComputer forensics ppt
Computer forensics ppt
Nikhil Mashruwala
 
Malware forensics
Malware forensicsMalware forensics
Malware forensics
Sameera Amjad
 
Information Security Lecture #1 ppt
Information Security Lecture #1 pptInformation Security Lecture #1 ppt
Information Security Lecture #1 ppt
vasanthimuniasamy
 
Rayleigh model
Rayleigh modelRayleigh model
Rayleigh model
Roy Antony Arnold G
 
Software security
Software securitySoftware security
Software security
Roman Oliynykov
 
Pressman ch-3-prescriptive-process-models
Pressman ch-3-prescriptive-process-modelsPressman ch-3-prescriptive-process-models
Pressman ch-3-prescriptive-process-models
zeal123123
 
PPT.pptx
PPT.pptxPPT.pptx
PPT.pptx
ssuseref08b9
 
User authentication
User authenticationUser authentication
User authentication
CAS
 
Windows forensic artifacts
Windows forensic artifactsWindows forensic artifacts
Windows forensic artifacts
n|u - The Open Security Community
 
Software Engineering unit 2
Software Engineering unit 2Software Engineering unit 2
Software Engineering unit 2
Abhimanyu Mishra
 
Data preprocessing in Machine learning
Data preprocessing in Machine learning Data preprocessing in Machine learning
Data preprocessing in Machine learning
pyingkodi maran
 
Malware
MalwareMalware
Malware
Tuhin_Das
 
Computer worms viruses and Prevention
Computer worms viruses and PreventionComputer worms viruses and Prevention
Computer worms viruses and Prevention
Pratimesh Pathak
 
Fraud detection with Machine Learning
Fraud detection with Machine LearningFraud detection with Machine Learning
Fraud detection with Machine Learning
Scaleway
 
Pressman ch-3-prescriptive-process-models
Pressman ch-3-prescriptive-process-modelsPressman ch-3-prescriptive-process-models
Pressman ch-3-prescriptive-process-models
saurabhshertukde
 
3.7 outlier analysis
3.7 outlier analysis3.7 outlier analysis
3.7 outlier analysis
Krish_ver2
 
ensemble learning
ensemble learningensemble learning
ensemble learning
butest
 
Machine learning Lecture 2
Machine learning Lecture 2Machine learning Lecture 2
Machine learning Lecture 2
Srinivasan R
 
Information Security Lecture #1 ppt
Information Security Lecture #1 pptInformation Security Lecture #1 ppt
Information Security Lecture #1 ppt
vasanthimuniasamy
 
Pressman ch-3-prescriptive-process-models
Pressman ch-3-prescriptive-process-modelsPressman ch-3-prescriptive-process-models
Pressman ch-3-prescriptive-process-models
zeal123123
 
User authentication
User authenticationUser authentication
User authentication
CAS
 
Software Engineering unit 2
Software Engineering unit 2Software Engineering unit 2
Software Engineering unit 2
Abhimanyu Mishra
 
Data preprocessing in Machine learning
Data preprocessing in Machine learning Data preprocessing in Machine learning
Data preprocessing in Machine learning
pyingkodi maran
 
Computer worms viruses and Prevention
Computer worms viruses and PreventionComputer worms viruses and Prevention
Computer worms viruses and Prevention
Pratimesh Pathak
 
Fraud detection with Machine Learning
Fraud detection with Machine LearningFraud detection with Machine Learning
Fraud detection with Machine Learning
Scaleway
 
Pressman ch-3-prescriptive-process-models
Pressman ch-3-prescriptive-process-modelsPressman ch-3-prescriptive-process-models
Pressman ch-3-prescriptive-process-models
saurabhshertukde
 

Viewers also liked (9)

Plán školení technik Haier
Plán školení   technik HaierPlán školení   technik Haier
Plán školení technik Haier
Michal Kadlec
 
The european union tr
The european union trThe european union tr
The european union tr
comeniusipb
 
Pp origens catalunya (anglès) comenius
Pp origens catalunya (anglès) comeniusPp origens catalunya (anglès) comenius
Pp origens catalunya (anglès) comenius
comeniusipb
 
AGEL - cesta k rovnováze
AGEL - cesta k rovnovázeAGEL - cesta k rovnováze
AGEL - cesta k rovnováze
Michal Kadlec
 
Gastronomy 2
Gastronomy 2Gastronomy 2
Gastronomy 2
comeniusipb
 
Re engineering process of sri lankan national transport service
Re engineering process of sri lankan national transport serviceRe engineering process of sri lankan national transport service
Re engineering process of sri lankan national transport service
Udara Seneviratne
 
Gun industry
Gun industryGun industry
Gun industry
Udara Seneviratne
 
Plán školení technik Haier
Plán školení   technik HaierPlán školení   technik Haier
Plán školení technik Haier
Michal Kadlec
 
The european union tr
The european union trThe european union tr
The european union tr
comeniusipb
 
Pp origens catalunya (anglès) comenius
Pp origens catalunya (anglès) comeniusPp origens catalunya (anglès) comenius
Pp origens catalunya (anglès) comenius
comeniusipb
 
AGEL - cesta k rovnováze
AGEL - cesta k rovnovázeAGEL - cesta k rovnováze
AGEL - cesta k rovnováze
Michal Kadlec
 
Re engineering process of sri lankan national transport service
Re engineering process of sri lankan national transport serviceRe engineering process of sri lankan national transport service
Re engineering process of sri lankan national transport service
Udara Seneviratne
 
Ad

Similar to Data mining concepts (20)

Exploratory data analysis for business MODULE 1.pptx
Exploratory data analysis for business MODULE 1.pptxExploratory data analysis for business MODULE 1.pptx
Exploratory data analysis for business MODULE 1.pptx
YashwanthKumar306128
 
What is data mining ?
What is data mining ?What is data mining ?
What is data mining ?
Johan Blomme
 
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdfChapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
hamsalubekana
 
Introduction To Data Mining and Data Mining Techniques.pptx
Introduction To Data Mining and Data Mining Techniques.pptxIntroduction To Data Mining and Data Mining Techniques.pptx
Introduction To Data Mining and Data Mining Techniques.pptx
KirtiSamrit3
 
Knowledge Discovery and Data Mining
Knowledge Discovery and Data MiningKnowledge Discovery and Data Mining
Knowledge Discovery and Data Mining
Amritanshu Mehra
 
data minig for eng with all topics and history
data minig for eng with all topics and historydata minig for eng with all topics and history
data minig for eng with all topics and history
nbaisane16
 
Seminar Report Vaibhav
Seminar Report VaibhavSeminar Report Vaibhav
Seminar Report Vaibhav
Vaibhav Dhattarwal
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining
Phi Jack
 
Introduction To Data Mining
Introduction To Data MiningIntroduction To Data Mining
Introduction To Data Mining
dataminers.ir
 
3 marketing research
3 marketing research3 marketing research
3 marketing research
saadii410
 
BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)
BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)
BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)
Mark Heid
 
Data mining
Data miningData mining
Data mining
Akannsha Totewar
 
Data-Mining-Specialist-Advanced-Techniques-for-Data-Analysisppt.pptx
Data-Mining-Specialist-Advanced-Techniques-for-Data-Analysisppt.pptxData-Mining-Specialist-Advanced-Techniques-for-Data-Analysisppt.pptx
Data-Mining-Specialist-Advanced-Techniques-for-Data-Analysisppt.pptx
attitudeacademy1
 
Data mining (prefinals)
Data mining (prefinals)Data mining (prefinals)
Data mining (prefinals)
sadam33146
 
Zakipoint Introduction
Zakipoint IntroductionZakipoint Introduction
Zakipoint Introduction
rameshkbudhani
 
Lecturedsfndskfjdsklfjldsdsfdsgmjdflgmdflmg.pptx
Lecturedsfndskfjdsklfjldsdsfdsgmjdflgmdflmg.pptxLecturedsfndskfjdsklfjldsdsfdsgmjdflgmdflmg.pptx
Lecturedsfndskfjdsklfjldsdsfdsgmjdflgmdflmg.pptx
JITENDER773791
 
Data mining
Data miningData mining
Data mining
Ujjwal Kumar
 
17 cs002
17 cs00217 cs002
17 cs002
TPLatchoumi
 
2011 Shopper Insights Brochure
2011 Shopper Insights Brochure2011 Shopper Insights Brochure
2011 Shopper Insights Brochure
fglick
 
Data Mining and Knowledge Discovery in Large Databases
Data Mining and Knowledge Discovery in Large DatabasesData Mining and Knowledge Discovery in Large Databases
Data Mining and Knowledge Discovery in Large Databases
SSA KPI
 
Exploratory data analysis for business MODULE 1.pptx
Exploratory data analysis for business MODULE 1.pptxExploratory data analysis for business MODULE 1.pptx
Exploratory data analysis for business MODULE 1.pptx
YashwanthKumar306128
 
What is data mining ?
What is data mining ?What is data mining ?
What is data mining ?
Johan Blomme
 
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdfChapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
hamsalubekana
 
Introduction To Data Mining and Data Mining Techniques.pptx
Introduction To Data Mining and Data Mining Techniques.pptxIntroduction To Data Mining and Data Mining Techniques.pptx
Introduction To Data Mining and Data Mining Techniques.pptx
KirtiSamrit3
 
Knowledge Discovery and Data Mining
Knowledge Discovery and Data MiningKnowledge Discovery and Data Mining
Knowledge Discovery and Data Mining
Amritanshu Mehra
 
data minig for eng with all topics and history
data minig for eng with all topics and historydata minig for eng with all topics and history
data minig for eng with all topics and history
nbaisane16
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining
Phi Jack
 
Introduction To Data Mining
Introduction To Data MiningIntroduction To Data Mining
Introduction To Data Mining
dataminers.ir
 
3 marketing research
3 marketing research3 marketing research
3 marketing research
saadii410
 
BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)
BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)
BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)
Mark Heid
 
Data-Mining-Specialist-Advanced-Techniques-for-Data-Analysisppt.pptx
Data-Mining-Specialist-Advanced-Techniques-for-Data-Analysisppt.pptxData-Mining-Specialist-Advanced-Techniques-for-Data-Analysisppt.pptx
Data-Mining-Specialist-Advanced-Techniques-for-Data-Analysisppt.pptx
attitudeacademy1
 
Data mining (prefinals)
Data mining (prefinals)Data mining (prefinals)
Data mining (prefinals)
sadam33146
 
Zakipoint Introduction
Zakipoint IntroductionZakipoint Introduction
Zakipoint Introduction
rameshkbudhani
 
Lecturedsfndskfjdsklfjldsdsfdsgmjdflgmdflmg.pptx
Lecturedsfndskfjdsklfjldsdsfdsgmjdflgmdflmg.pptxLecturedsfndskfjdsklfjldsdsfdsgmjdflgmdflmg.pptx
Lecturedsfndskfjdsklfjldsdsfdsgmjdflgmdflmg.pptx
JITENDER773791
 
2011 Shopper Insights Brochure
2011 Shopper Insights Brochure2011 Shopper Insights Brochure
2011 Shopper Insights Brochure
fglick
 
Data Mining and Knowledge Discovery in Large Databases
Data Mining and Knowledge Discovery in Large DatabasesData Mining and Knowledge Discovery in Large Databases
Data Mining and Knowledge Discovery in Large Databases
SSA KPI
 
Ad

More from Udara Seneviratne (19)

Industrial presentation
Industrial presentationIndustrial presentation
Industrial presentation
Udara Seneviratne
 
Expert Food Analysis System
Expert Food Analysis SystemExpert Food Analysis System
Expert Food Analysis System
Udara Seneviratne
 
Eye disease expert system
Eye disease expert systemEye disease expert system
Eye disease expert system
Udara Seneviratne
 
Ayurvedic diet management system
Ayurvedic diet management systemAyurvedic diet management system
Ayurvedic diet management system
Udara Seneviratne
 
Media streaming
Media streamingMedia streaming
Media streaming
Udara Seneviratne
 
Automated Traval Ticketing System
Automated Traval Ticketing SystemAutomated Traval Ticketing System
Automated Traval Ticketing System
Udara Seneviratne
 
Business Strategic Analysis of RyanAir
Business Strategic Analysis of RyanAirBusiness Strategic Analysis of RyanAir
Business Strategic Analysis of RyanAir
Udara Seneviratne
 
Pros and cons of facebook
Pros and cons of facebookPros and cons of facebook
Pros and cons of facebook
Udara Seneviratne
 
Did you know....
Did you know....Did you know....
Did you know....
Udara Seneviratne
 
Add ons
Add onsAdd ons
Add ons
Udara Seneviratne
 
Brain damaging habits
Brain damaging habitsBrain damaging habits
Brain damaging habits
Udara Seneviratne
 
Mobile computing
Mobile computingMobile computing
Mobile computing
Udara Seneviratne
 
Scedule feasibility
Scedule feasibilityScedule feasibility
Scedule feasibility
Udara Seneviratne
 
Environmental issues
Environmental issuesEnvironmental issues
Environmental issues
Udara Seneviratne
 
Survey report of life style of young people in badulla area
Survey report of life style of young people in badulla areaSurvey report of life style of young people in badulla area
Survey report of life style of young people in badulla area
Udara Seneviratne
 
How to succeed
How to succeedHow to succeed
How to succeed
Udara Seneviratne
 
Parents wish1
Parents wish1Parents wish1
Parents wish1
Udara Seneviratne
 
The poor man
The poor manThe poor man
The poor man
Udara Seneviratne
 
Identity styles of communication
Identity styles of communicationIdentity styles of communication
Identity styles of communication
Udara Seneviratne
 

Recently uploaded (20)

HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-UmgebungenHCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
panagenda
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
organizerofv
 
How analogue intelligence complements AI
How analogue intelligence complements AIHow analogue intelligence complements AI
How analogue intelligence complements AI
Paul Rowe
 
What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...
Vishnu Singh Chundawat
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
Cyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of securityCyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of security
riccardosl1
 
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
BookNet Canada
 
Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025
Splunk
 
Generative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in BusinessGenerative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in Business
Dr. Tathagat Varma
 
Procurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptxProcurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptx
Jon Hansen
 
Andrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell: Transforming Business Strategy Through Data-Driven InsightsAndrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell
 
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdfComplete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Software Company
 
Role of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered ManufacturingRole of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered Manufacturing
Andrew Leo
 
Linux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdfLinux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdf
RHCSA Guru
 
Cybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure ADCybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure AD
VICTOR MAESTRE RAMIREZ
 
Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)
Ortus Solutions, Corp
 
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Aqusag Technologies
 
Greenhouse_Monitoring_Presentation.pptx.
Greenhouse_Monitoring_Presentation.pptx.Greenhouse_Monitoring_Presentation.pptx.
Greenhouse_Monitoring_Presentation.pptx.
hpbmnnxrvb
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-UmgebungenHCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
panagenda
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
organizerofv
 
How analogue intelligence complements AI
How analogue intelligence complements AIHow analogue intelligence complements AI
How analogue intelligence complements AI
Paul Rowe
 
What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...
Vishnu Singh Chundawat
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
Cyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of securityCyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of security
riccardosl1
 
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
BookNet Canada
 
Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025
Splunk
 
Generative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in BusinessGenerative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in Business
Dr. Tathagat Varma
 
Procurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptxProcurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptx
Jon Hansen
 
Andrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell: Transforming Business Strategy Through Data-Driven InsightsAndrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell
 
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdfComplete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Software Company
 
Role of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered ManufacturingRole of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered Manufacturing
Andrew Leo
 
Linux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdfLinux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdf
RHCSA Guru
 
Cybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure ADCybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure AD
VICTOR MAESTRE RAMIREZ
 
Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)
Ortus Solutions, Corp
 
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Aqusag Technologies
 
Greenhouse_Monitoring_Presentation.pptx.
Greenhouse_Monitoring_Presentation.pptx.Greenhouse_Monitoring_Presentation.pptx.
Greenhouse_Monitoring_Presentation.pptx.
hpbmnnxrvb
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 

Data mining concepts

  • 2. What is Data Mining ? Mining and discovery of new information in terms of patterns or rules from vast amounts of data. The process of discovering meaningful new correlations, patterns and trends by sifting through large amounts of data stored in repositoties, using pattern recognition technologies as well as statical and methematics techniques.
  • 3. Why we mine Data ? Commercial View Point :- Lots of data is being collected and warehoused . Computers have become cheaper and more powerful. Competitive Pressure is Strong . Scientific View Point :- Data collected and stored at enormous speeds (GB/hour). Traditional techniques infeasible for raw data. Data mining may help scientists.
  • 4. On what kind of Data...? • Relational databases • Data warehouses • Transactional databases • Advanced database systems: Object-relational Spacial and Temporal Time-series Multimedia, text WWW
  • 5. What are the goals of Data mining? • Prediction e.g. sales volume, earthquakes • Identification e.g. existence of genes, system intrusions • Classification of different categories e.g. discount seeking shoppers or loyal regular shoppers in a supermarket • Optimization of limited resources such as time, space, money or materials and maximization of outputs such as sales or profits
  • 6. What are the applications of Data- Mining ? ● Marketing ● Finance  Analysis of consumer behavior  Creditworthiness of clients  Advertising campaigns  Performance analysis of finance  Targeted mailings investments  Segmentation of  Fraud detection customers, stores, or products ● Manufacturing ● Health Care  Optimization of resources  Discovering patterns in X-ray  Optimization of manufacturing images processes  Analyzing side effects of drugs  Product design based on customer  Effectiveness of treatments requirements
  • 7. What are the present commercial tools for Data Mining ? Data to knowledge SAS Oracle data-miner Intelligent miner Clementine
  • 8. How to build a data mining model? An important concept is that building a mining model is part of a larger process.
  • 9. 1. Defining the problem. Clearly define the business problem.
  • 10. 2. Preparing Data consolidate and clean the data that was identified in the Defining the Problem step.
  • 11. 3.Exploring Data Explore the prepared data .
  • 12. 4.Building Models Before you build a model, you must randomly separate the prepared data into separate training and testing datasets. You use the training dataset to build the model, and the testing dataset to test the accuracy of the model by creating prediction queries.
  • 13. 5. Exploring and validating models Explore the models that you have built and test their effectiveness.
  • 14. 6. Deploying and updating Deploy to a production models environment the models that performed the best.
  • 15. What are the major issues in Data-Mining concept ?  Mining different kinds of knowledge in databases  Interactive mining of knowledge at multiple levels of abstraction  Incorporation of background knowledge  Data mining query languages and ad-hoc data mining  Expression and visualization of data mining results  Handling noise and incomplete data  Pattern evaluation: the interestingness problem  Integration of the discovered knowledge with existing knowledge: A knowledge fusion problem  Protection of data security, integrity, and privacy
  • 16. How will be the future of Data-Mining concept? ● Active research is ongoing  Neural Networks  Regression Analysis  Genetic Algorithms ● Data mining is used in many areas today. We cannot even begin to imagine what the future holds in its womb!