KDD Process
By
Nithin N
KDD PROCESS
• KDD (Knowledge Discovery in Databases) is the process of
extracting useful, previously unknown, and potentially valuable
information from large datasets. The process is iterative: the
steps described in the following slides usually have to be
repeated several times before accurate knowledge is extracted
from the data.
November 12, 2024
Knowledge Discovery (KDD) Process
• Data mining: the core of the knowledge discovery process
[Figure: KDD process diagram. Labels: Databases, Data Cleaning, Data Integration, Data Warehouse, Selection, Task-relevant Data, Data Mining, Pattern Evaluation]
DATA CLEANING
• Remove noise and inconsistent data
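A minimal sketch of what cleaning can look like in plain Python. The record fields (`id`, `age`, `city`) and the valid age range are assumptions made for illustration; they do not come from the slides.

```python
def clean_records(records):
    """Drop duplicates, normalize city spelling, and discard noisy ages."""
    seen = set()
    cleaned = []
    for rec in records:
        # Fix inconsistent formatting (stray spaces, mixed case).
        city = rec["city"].strip().title()
        age = rec["age"]
        # Treat missing or implausible ages as noise (assumed valid range: 0-120).
        if age is None or not 0 <= age <= 120:
            continue
        key = (rec["id"], age, city)
        if key in seen:  # exact duplicate once normalized
            continue
        seen.add(key)
        cleaned.append({"id": rec["id"], "age": age, "city": city})
    return cleaned


raw = [
    {"id": 1, "age": 34, "city": " bangalore"},
    {"id": 1, "age": 34, "city": "Bangalore"},  # duplicate after normalization
    {"id": 2, "age": 430, "city": "Mysore"},    # noisy age
    {"id": 3, "age": 27, "city": "MYSORE "},
]
cleaned = clean_records(raw)
```

Here the noisy record is simply dropped. In practice a missing or suspicious value might instead be corrected or filled in, depending on the task.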
DATA INTEGRATION
• Multiple data sources may be combined
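As a rough illustration, combining two sources can be sketched as a join on a shared key. The two sources, the `customer_id` key, and the sample values are hypothetical.

```python
def integrate(customers, orders):
    """Combine two sources on a shared key, similar to a SQL inner join."""
    by_id = {c["customer_id"]: c for c in customers}
    merged = []
    for order in orders:
        customer = by_id.get(order["customer_id"])
        if customer is None:
            continue  # order refers to a customer missing from the other source
        merged.append({**customer, **order})
    return merged


customers = [
    {"customer_id": 1, "name": "Asha"},
    {"customer_id": 2, "name": "Ravi"},
]
orders = [
    {"customer_id": 1, "amount": 250},
    {"customer_id": 2, "amount": 120},
    {"customer_id": 9, "amount": 75},  # no matching customer
]
integrated = integrate(customers, orders)
```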
DATA SELECTION
• Data relevant to the analysis task are retrieved from the
database
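One way to picture selection is a query that keeps only the rows and attributes the task needs. The sketch below uses Python's built-in `sqlite3` module with an in-memory database; the `sales` table and its columns are assumed for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales (region TEXT, product TEXT, year INTEGER, amount REAL)"
)
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?, ?)",
    [
        ("South", "Laptop", 2023, 900.0),
        ("South", "Phone", 2024, 500.0),
        ("North", "Laptop", 2024, 950.0),
    ],
)
# Keep only what the (hypothetical) task needs: 2024 sales by region.
task_relevant = conn.execute(
    "SELECT region, amount FROM sales WHERE year = ? ORDER BY region", (2024,)
).fetchall()
conn.close()
```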
DATA TRANSFORMATION
• Data are transformed and consolidated into forms
appropriate for mining, for example by performing summary
or aggregation operations
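A small sketch of an aggregation operation: daily sales are summarized into monthly totals. The data layout (ISO date string, amount) is an assumption for illustration.

```python
from collections import defaultdict


def monthly_totals(daily_sales):
    """Aggregate (date, amount) pairs into totals per month."""
    totals = defaultdict(float)
    for date, amount in daily_sales:
        month = date[:7]  # "YYYY-MM-DD" -> "YYYY-MM"
        totals[month] += amount
    return dict(totals)


daily_sales = [
    ("2024-01-05", 120.0),
    ("2024-01-19", 80.0),
    ("2024-02-02", 50.0),
]
summary = monthly_totals(daily_sales)
```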
DATA MINING
• The essential step, in which intelligent methods are applied to
extract data patterns
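The slides do not name a specific mining method. As one possible example, the sketch below counts item pairs that occur together in at least a given fraction of transactions (frequent pairs). The function name, the sample baskets, and the threshold are all illustrative.

```python
from collections import Counter
from itertools import combinations


def frequent_pairs(transactions, min_support):
    """Return item pairs whose share of transactions is >= min_support."""
    counts = Counter()
    for items in transactions:
        # Count each unordered pair at most once per transaction.
        for pair in combinations(sorted(set(items)), 2):
            counts[pair] += 1
    n = len(transactions)
    return {pair: c / n for pair, c in counts.items() if c / n >= min_support}


transactions = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["butter", "milk"],
    ["bread", "butter"],
    ["bread", "milk"],
]
patterns = frequent_pairs(transactions, min_support=0.5)
```

With this sample data only the pair (bread, milk) reaches the threshold, since it appears in 3 of the 5 transactions.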
PATTERN EVALUATION
• Identify the truly interesting patterns that represent knowledge,
based on interestingness measures
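The slides do not say which interestingness measures are meant. Support, confidence, and lift are common choices for association rules, and the sketch below computes them for a rule "antecedent -> consequent" over hypothetical baskets.

```python
def rule_measures(transactions, antecedent, consequent):
    """Compute support, confidence, and lift for antecedent -> consequent."""
    n = len(transactions)
    a = sum(1 for t in transactions if antecedent in t)
    c = sum(1 for t in transactions if consequent in t)
    both = sum(1 for t in transactions if antecedent in t and consequent in t)
    support = both / n                          # P(A and B)
    confidence = both / a if a else 0.0         # P(B | A)
    lift = confidence / (c / n) if c else 0.0   # P(B | A) / P(B)
    return support, confidence, lift


transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"butter", "milk"},
    {"bread", "butter"},
]
support, confidence, lift = rule_measures(transactions, "bread", "milk")
```

A lift below 1, as in this sample, suggests that the rule is not interesting even though its support and confidence look reasonable, which is why a pattern is usually judged on more than one measure.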
KNOWLEDGE REPRESENTATION
• Visualization and knowledge representation techniques are
used to present the mined knowledge to users
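Real projects would normally use a charting or reporting tool here. As a very small stand-in, the sketch below renders rule confidences as a text bar chart; the rule labels and values are made up for the example.

```python
def text_bar_chart(measures, width=20):
    """Render label -> value (0..1) pairs as text bars, largest first."""
    lines = []
    ranked = sorted(measures.items(), key=lambda kv: kv[1], reverse=True)
    for label, value in ranked:
        bar = "#" * round(value * width)
        lines.append(f"{label:<16} {bar} {value:.2f}")
    return "\n".join(lines)


chart = text_bar_chart({"butter -> milk": 0.50, "bread -> milk": 0.67})
print(chart)
```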
• Note: KDD is an iterative process: evaluation measures can be
enhanced, mining can be refined, and new data can be
integrated and transformed to obtain different and more
appropriate results. Preprocessing of databases consists of
data cleaning and data integration.
WHAT KINDS OF DATA CAN BE MINED
• Database data
• Data warehouses
• Transactional data
• Other kinds of data
DATABASE DATA
• DBMS (database management system)
• Relational database examples
• Attributes (columns)
• Tuples (rows)
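To make the terms concrete, the sketch below builds a tiny relational table with Python's built-in `sqlite3` module and reads back its attributes and tuples. The `customer` table and its contents are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customer (cust_id INTEGER PRIMARY KEY, name TEXT, city TEXT)"
)
conn.executemany(
    "INSERT INTO customer VALUES (?, ?, ?)",
    [(1, "Asha", "Mysore"), (2, "Ravi", "Hubli")],
)
cursor = conn.execute("SELECT * FROM customer ORDER BY cust_id")
# Attributes are the column names; each fetched row is one tuple.
attributes = [col[0] for col in cursor.description]
tuples = cursor.fetchall()
conn.close()
```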
Data warehouses
Data Cube
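A data cube stores an aggregated measure along several dimensions. The rough sketch below sums a `sales` measure over whichever dimensions are requested, so that dropping a dimension corresponds to a roll-up. The dimension names and figures are assumptions for illustration.

```python
from collections import defaultdict


def roll_up(facts, dims):
    """Sum the sales measure over the given dimensions."""
    cube = defaultdict(float)
    for fact in facts:
        key = tuple(fact[d] for d in dims)
        cube[key] += fact["sales"]
    return dict(cube)


facts = [
    {"region": "South", "item": "Phone", "quarter": "Q1", "sales": 100.0},
    {"region": "South", "item": "Phone", "quarter": "Q2", "sales": 150.0},
    {"region": "North", "item": "Laptop", "quarter": "Q1", "sales": 200.0},
]
by_region_item = roll_up(facts, ["region", "item"])
by_region = roll_up(facts, ["region"])  # coarser view of the same data
```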
Transactional Data
• Each transaction must have a unique identifier (transaction ID)
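A short sketch of checking that requirement, assuming each transaction is stored as a record with a `trans_id` and a list of items (both field names are hypothetical).

```python
from collections import Counter


def duplicate_transaction_ids(transactions):
    """Return the transaction IDs that occur more than once."""
    counts = Counter(t["trans_id"] for t in transactions)
    return sorted(tid for tid, n in counts.items() if n > 1)


transactions = [
    {"trans_id": "T100", "items": ["I1", "I3", "I8"]},
    {"trans_id": "T200", "items": ["I2", "I8"]},
    {"trans_id": "T100", "items": ["I5"]},  # reuses an existing ID
]
duplicates = duplicate_transaction_ids(transactions)
```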
Other Kinds of Data
• Time-related data
• Sequence data (historical records, stock exchange)
• Data streams (video surveillance, sensor data)
• Spatial data (maps)
• Hypertext and multimedia data (text, video, audio)
• Graph and networked data
• Engineering design data (AutoCAD)
• The Web
Advantages of KDD Process
1. Improves decision-making: KDD provides valuable insights and
knowledge that can help organizations make better decisions.
2. Increased efficiency: KDD automates repetitive and time-consuming
tasks and makes the data ready for analysis, which saves time and
money.
3. Better customer service: KDD helps organizations gain a better
understanding of their customers’ needs and preferences, which can
help them provide better customer service.
4. Fraud detection: KDD can be used to detect fraudulent activities by
identifying patterns and anomalies in the data that may indicate fraud.
5. Predictive modeling: KDD can be used to build predictive models that
can forecast future trends and patterns.
Disadvantages of KDD Process
1. Privacy concerns: KDD can raise privacy concerns as it involves collecting
and analyzing large amounts of data, which can include sensitive
information about individuals.
2. Complexity: KDD can be a complex process that requires specialized skills
and knowledge to implement and interpret the results.
3. Unintended consequences: KDD can lead to unintended consequences,
such as bias or discrimination, if the data or models are not properly
understood or used.
4. Data quality: The KDD process depends heavily on the quality of the data; if the
data are not accurate or consistent, the results can be misleading.
5. High cost: KDD can be an expensive process, requiring significant
investments in hardware, software, and personnel.
6. Overfitting: The KDD process can lead to overfitting, a common problem
in machine learning in which a model learns the detail and noise in the
training data to the extent that it performs worse on new, unseen
data.
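The overfitting point can be illustrated with a toy comparison. In this sketch a model that memorizes the training points (1-nearest neighbour) is compared with a single threshold rule. The data are invented, two training labels are deliberately noisy, and the cutoff of 5 is hand-picked rather than learned.

```python
def one_nn_predict(train, x):
    """Memorizing model: copy the label of the closest training point."""
    nearest = min(train, key=lambda point: abs(point[0] - x))
    return nearest[1]


def threshold_predict(x, cutoff=5):
    """Simple model: one rule that ignores individual training points."""
    return 1 if x >= cutoff else 0


def accuracy(predict, data):
    """Fraction of (x, label) pairs that predict() gets right."""
    return sum(1 for x, y in data if predict(x) == y) / len(data)


# True pattern: label is 1 when x >= 5. Points x=3 and x=7 carry noisy labels.
train = [(1, 0), (2, 0), (3, 1), (4, 0), (6, 1), (7, 0), (8, 1), (9, 1)]
test = [(1.4, 0), (2.6, 0), (3.4, 0), (6.6, 1), (7.4, 1), (8.6, 1)]

nn_train = accuracy(lambda x: one_nn_predict(train, x), train)
nn_test = accuracy(lambda x: one_nn_predict(train, x), test)
rule_train = accuracy(threshold_predict, train)
rule_test = accuracy(threshold_predict, test)
```

The memorizing model is perfect on the training data because it reproduces the noisy labels, yet it does worse than the simple rule on the unseen points near that noise.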
Difference Between KDD and Data Mining
THANK YOU