May 14, 2015 Data Mining: Concepts and Techniques
Evolution of Database Technology
1970s:
1980s:
1990s-2000s:
Other Applications
Target marketing
Cross-market analysis
Customer profiling
Data mining can tell you what types of customers buy what
products (clustering or classification)
Resource planning:
Competition:
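The customer profiling item above can be illustrated with a very small sketch. It assumes customers already carry a segment label (in practice that label would come from a clustering or classification step, which is not shown here), and it only cross-tabulates segments against products. The records, segment names, and the function name profile_by_segment are all hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical purchase records: (customer segment, product category).
purchases = [
    ("student", "laptop"),
    ("student", "laptop"),
    ("student", "textbook"),
    ("retiree", "garden tools"),
    ("retiree", "garden tools"),
    ("retiree", "laptop"),
]


def profile_by_segment(records):
    """Count how often each customer segment buys each product category."""
    profile = defaultdict(Counter)
    for segment, product in records:
        profile[segment][product] += 1
    return profile


profile = profile_by_segment(purchases)
for segment, counts in sorted(profile.items()):
    # most_common(1) returns the single highest-count (product, count) pair.
    top_product, top_count = counts.most_common(1)[0]
    print(f"{segment}: most bought = {top_product} ({top_count} purchases)")
```

The resulting table answers "what types of customers buy what products" only for segments that are already known; discovering the segments themselves is the clustering or classification task the slide refers to.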
Applications
Approach
Examples
Retail
Other Applications
Sports
Astronomy
[Figure: process diagram. Recoverable labels: Databases, Data Cleaning, Data Integration, Selection]
[Figure: layered pyramid, top to bottom]
Making Decisions
Data Presentation: Visualization Techniques
Data Mining: Information Discovery
Data Exploration: Statistical Analysis, Querying and Reporting
Data Warehouses / Data Marts: OLAP, MDA
Data Sources: Paper, Files, Information Providers, Database Systems, OLTP
Roles labeled beside the pyramid: End User, Business Analyst, Data Analyst, DBA
Architecture of a Typical Data Mining System
[Figure: system components, top to bottom]
Graphical user interface
Pattern evaluation
Data mining engine
Database or data warehouse server
Data cleaning & data integration, Filtering
Databases, Data Warehouse
Knowledge base
Relational databases
Data warehouses
Transactional databases
Advanced DB and information repositories
Cluster analysis
Outlier analysis
Outlier: a data object that does not comply with the general
behavior of the data
Similarity-based analysis
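The outlier definition above ("a data object that does not comply with the general behavior of the data") can be made concrete with a minimal sketch. The slides do not name a detection method, so the interquartile-range (IQR) rule is used here purely as one common statistical choice; the sample values, the multiplier 1.5, and the function name find_outliers are assumptions for illustration.

```python
import statistics


def find_outliers(values, k=1.5):
    """Return values lying more than k * IQR outside the quartiles.

    This is the IQR rule of thumb, used only as an illustrative
    stand-in for "does not comply with the general behavior".
    """
    q1, _median, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


# Hypothetical measurements: most values cluster near 11, one does not.
data = [10, 12, 11, 13, 12, 11, 10, 95]
print(find_outliers(data))
```

Whether a flagged object is noise to discard or the interesting finding (for example, a fraudulent transaction) depends on the application.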
Approaches
First generate all the patterns and then filter out the
uninteresting ones.
Generate only the interesting patterns: mining query
optimization
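The two approaches above can be contrasted in a short sketch. It assumes that "interesting" means an item pair appears in at least a minimum number of transactions (a support threshold), which is only one possible interestingness measure. The second function pushes that constraint into generation with Apriori-style pruning, chosen here as an illustration rather than taken from the slides. The transactions, MIN_SUPPORT, and both function names are hypothetical.

```python
from collections import Counter
from itertools import combinations

# Hypothetical market-basket transactions.
transactions = [
    {"bread", "milk"},
    {"bread", "milk", "eggs"},
    {"bread", "butter"},
    {"milk", "eggs"},
    {"bread", "milk", "butter"},
]
MIN_SUPPORT = 3  # assumed threshold: minimum number of supporting transactions


def generate_then_filter(transactions, min_support):
    """Approach 1: count every item pair, then drop the uninteresting ones."""
    counts = Counter()
    for t in transactions:
        counts.update(combinations(sorted(t), 2))
    return {pair: n for pair, n in counts.items() if n >= min_support}


def generate_only_interesting(transactions, min_support):
    """Approach 2: prune during generation (Apriori-style).

    A pair can only be frequent if both of its items are frequent,
    so infrequent items are removed before any pair is counted.
    """
    item_counts = Counter(item for t in transactions for item in t)
    frequent_items = {i for i, n in item_counts.items() if n >= min_support}
    counts = Counter()
    for t in transactions:
        counts.update(combinations(sorted(t & frequent_items), 2))
    return {pair: n for pair, n in counts.items() if n >= min_support}


all_then_filtered = generate_then_filter(transactions, MIN_SUPPORT)
pruned = generate_only_interesting(transactions, MIN_SUPPORT)
print(all_then_filtered)
print(pruned)
```

Both functions return the same patterns; the difference is how many candidates are counted along the way, which is the motivation for treating the second approach as a query optimization problem.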
[Figure: Data Mining and related disciplines: Machine Learning, Information Science, Statistics, Visualization, Other Disciplines]
General functionality
A Multi-Dimensional View of Data Mining Classification
Databases to be mined
Relational, transactional, object-oriented, object-relational,
active, spatial, time-series, text, multi-media,
heterogeneous, legacy, WWW, etc.
Knowledge to be mined
Characterization, discrimination, association, classification,
clustering, trend, deviation and outlier analysis, etc.
Multiple/integrated functions and mining at multiple levels
Techniques utilized
Database-oriented, data warehouse (OLAP), machine
learning, statistics, visualization, neural network, etc.
Applications adapted
An OLAM Architecture
[Figure: layered architecture. Recoverable labels, top to bottom]
Layer4: User Interface (Mining query, Mining result)
Layer3: OLAP/OLAM (OLAM Engine, OLAP Engine)
MDDB, Meta Data (Filtering & Integration, Database API, Filtering)
Layer1: Data Repository (Databases, Data Warehouse; Data cleaning, Data integration)
Techniques
Summary