Data Mining1 1

Uploaded by

Shruti Gupta

Data Mining

Data mining refers to extracting, or "mining", knowledge from large amounts of data, much as gold is mined from rocks and sand. A more appropriate name would therefore be "knowledge mining from data". Mining is a process that finds a small set of valuable data in a great deal of raw material. Other terms used for data mining include knowledge mining from databases, knowledge extraction, data/pattern analysis, and data dredging. It is also sometimes referred to as KDD (knowledge discovery in databases).

Viewed as a knowledge discovery process, data mining consists of the following steps:

1. Data cleaning - to remove noise and inconsistent data.

2. Data integration - multiple data sources may be combined.

3. Data selection - data relevant to the analysis task are retrieved from the database.

4. Data transformation - data are transformed or consolidated into forms appropriate for mining by performing summary or aggregation operations.

5. Data mining - an essential process in which intelligent methods are applied in order to extract data patterns.

6. Pattern evaluation - to identify the truly interesting patterns representing knowledge, based on some interestingness measures.

7. Knowledge presentation - visualization and knowledge representation techniques are used to present the mined knowledge.
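
The seven steps can be pictured as a chain of small functions. The following is only a minimal sketch in Python: the two "store" sources, the record layout, and the choice of counting item pairs as the mining step are all assumptions made for illustration and do not come from the notes above.

```python
from collections import Counter
from itertools import combinations

# Hypothetical raw data from two sources; None marks a noisy/missing record.
store_a = [{"id": 1, "items": ["milk", "bread"]},
           {"id": 2, "items": None}]
store_b = [{"id": 3, "items": ["milk", "bread", "eggs"]},
           {"id": 4, "items": ["bread", "eggs"]}]

def clean(records):
    """Data cleaning: drop records whose item list is missing or empty."""
    return [r for r in records if r["items"]]

def integrate(*sources):
    """Data integration: combine several sources into one collection."""
    return [r for source in sources for r in source]

def select(records):
    """Data selection: keep only the field relevant to the task."""
    return [r["items"] for r in records]

def transform(baskets):
    """Data transformation: turn each basket into a sorted tuple of unique items."""
    return [tuple(sorted(set(b))) for b in baskets]

def mine(baskets):
    """Data mining: count how often each pair of items occurs together."""
    counts = Counter()
    for basket in baskets:
        counts.update(combinations(basket, 2))
    return counts

def evaluate(pair_counts, min_count):
    """Pattern evaluation: keep pairs that reach a minimum co-occurrence count."""
    return {pair: n for pair, n in pair_counts.items() if n >= min_count}

def present(patterns):
    """Knowledge presentation: render the patterns as readable lines."""
    return [f"{a} + {b}: {n} baskets" for (a, b), n in sorted(patterns.items())]

records = integrate(clean(store_a), clean(store_b))
patterns = evaluate(mine(transform(select(records))), min_count=2)
report = present(patterns)
print("\n".join(report))
```

In a real system each step is far richer (for instance, cleaning would usually repair values rather than only drop records), but the order of the calls mirrors the order of the steps.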

Figure: Data mining as a step in the process of knowledge discovery

Components of a Data Mining System

1. Database, data warehouse, or other information repository: This is one or a set of databases, data warehouses, spreadsheets, or other kinds of information repositories. Data cleaning and data integration techniques may be performed on the data.

2. Database or data warehouse server: The database or data warehouse server is responsible for fetching the relevant data, based on the user's data mining request.

3. Knowledge base: This is the domain knowledge that is used to guide the search or to evaluate the interestingness of the resulting patterns.

4. Data mining engine: This is essential to the data mining system and ideally consists of a set of functional modules for tasks such as characterisation, classification, cluster analysis, and evolution and deviation analysis.

5. Pattern evaluation module: This component typically employs interestingness measures and interacts with the data mining modules so as to focus the search towards interesting patterns.

6. Graphical user interface: This module communicates between the user and the data mining system, allowing the user to interact with the system by specifying a data mining query or task, providing information to help focus the search, and performing exploratory data mining based on intermediate data mining results.

Figure: Architecture of a typical data mining system. Layers from top to bottom: graphical user interface; pattern evaluation, alongside the knowledge base; data mining engine; database or data warehouse server; data cleaning, data integration, and filtering; database and data warehouse.

Data Mining Functionalities

What kinds of patterns can be mined?

1. Concept/class description: characterisation and discrimination
2. Association analysis
3. Classification and prediction
4. Cluster analysis
5. Outlier analysis
6. Evolution analysis

Concept/class description (characterisation and discrimination): Data can be associated with classes or concepts. It can be useful to describe individual classes and concepts in summarised, concise, and yet precise terms. Such descriptions of a class or a concept are called class/concept descriptions. They can be derived via (1) data characterisation, by summarising the data of the class under study in general terms, or (2) data discrimination, by comparing the target class with one or a set of comparative classes.

Association analysis: This is the discovery of association rules showing attribute-value conditions that occur frequently together in a given set of data. Association analysis is widely used for transaction analysis.

Classification and prediction: Classification is the process of finding a set of models (or functions) that describe and distinguish data classes and concepts, so that the model can be used to predict the class of objects whose class label is unknown.

Cluster analysis: Cluster analysis analyzes data objects without consulting a known class label.

Outlier analysis: Outliers are data objects in a database that do not comply with the general behavior or model of the data. Outlier analysis has wide application. It can be used in fraud detection, for example, by detecting unusual usage of credit cards or telecommunication services.

Evolution analysis: Data evolution analysis describes and models regularities or trends for objects whose behavior changes over time.
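
As an illustration of outlier analysis, the sketch below flags values that lie far from the mean, measured in standard deviations (a z-score test). This is just one simple technique chosen for the example and is not prescribed by the notes. The `find_outliers` function, the threshold of 2.0, and the spending figures are all hypothetical.

```python
from statistics import mean, stdev

def find_outliers(values, threshold=2.0):
    """Return the values whose z-score magnitude exceeds the threshold."""
    mu = mean(values)
    sigma = stdev(values)  # sample standard deviation
    if sigma == 0:
        # All values are identical, so nothing deviates from the rest.
        return []
    return [v for v in values if abs(v - mu) / sigma > threshold]

# Hypothetical daily credit card spend; one day is far from the usual pattern.
daily_spend = [42, 38, 45, 40, 41, 39, 44, 43, 950]
print(find_outliers(daily_spend))
```

A z-score test is sensitive to the outliers themselves, because they inflate both the mean and the standard deviation. On small samples a single extreme value can only reach a limited z-score, so the threshold has to be chosen with the sample size in mind.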

Data mining primitives

Each end user will have a task in mind, that is, some form of data analysis that she would like to have performed. A data mining task can be specified in the form of a data mining query, which is input to the data mining system.

A data mining query is defined in terms of the following primitives:

Task-relevant data: This is the portion of the database to be investigated. For example, if a person is in charge of sales for a region, then he needs to study only the buying habits of the customers of that region rather than those of the entire country.

The kinds of knowledge to be mined: This specifies the data mining functions to be performed, such as characterization, discrimination, association, classification, clustering, or evolution analysis. For instance, if studying the buying habits of customers in Canada, you may choose to mine associations between customer profiles and the items that these customers like to buy.

Background knowledge: Users can specify background knowledge, or knowledge about the domain to be mined. This knowledge is useful for guiding the knowledge discovery process and for evaluating the patterns found. There are several kinds of background knowledge, for example concept hierarchies and user beliefs regarding relationships in the data.

Interestingness measures: These functions are used to separate uninteresting patterns from knowledge. They may be used to guide the mining process or, after discovery, to evaluate the discovered patterns. Different kinds of knowledge may have different interestingness measures. For example, interestingness measures for association rules include support (the percentage of task-relevant data tuples for which the rule pattern appears) and confidence (the strength of the implication of the rule). Rules whose support and confidence values are below user-specified thresholds are considered uninteresting.

Presentation and visualization of discovered patterns: This refers to the form in which discovered patterns are to be displayed. Users can choose from different forms for knowledge presentation, such as rules, tables, charts, graphs, decision trees, and cubes.
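
The support and confidence measures described above can be computed directly from a list of transactions. The sketch below is a minimal illustration: confidence is taken to be the usual estimate support(A and B) / support(A), and the function names, the sample transactions, and the threshold values are assumptions made for the example.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)

def confidence(transactions, antecedent, consequent):
    """Strength of the rule antecedent => consequent:
    support(antecedent and consequent) / support(antecedent)."""
    base = support(transactions, antecedent)
    if base == 0:
        return 0.0  # the antecedent never occurs, so the rule has no evidence
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / base

def is_interesting(transactions, antecedent, consequent,
                   min_support, min_confidence):
    """A rule is kept only if it meets both user-specified thresholds."""
    both = set(antecedent) | set(consequent)
    return (support(transactions, both) >= min_support
            and confidence(transactions, antecedent, consequent) >= min_confidence)

# Hypothetical task-relevant transactions.
transactions = [
    {"computer", "software"},
    {"computer", "software", "printer"},
    {"computer"},
    {"printer"},
]

rule = ({"computer"}, {"software"})
print(support(transactions, rule[0] | rule[1]))
print(confidence(transactions, *rule))
print(is_interesting(transactions, *rule, min_support=0.4, min_confidence=0.6))
```

Here the rule computer => software appears in 2 of 4 transactions (support 0.5), and 2 of the 3 transactions containing a computer also contain software (confidence about 0.67). The rule therefore passes thresholds of 0.4 and 0.6 but would be rejected if the confidence threshold were raised to 0.8.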
