Topic10 - Data Mining
Topic10 - Data Mining
Data Mining
Data Mining
Task-relevant Data
Data Selection
Warehouse
Data Cleaning
Data Integration
Data Exploration
Statistical Analysis, Querying and Reporting
Data Warehouses / Data Marts
OLAP, MDA DBA
Data Sources
Paper, Files, InformationIntro
Providers,
to Data Mining Database Systems, OLTP 18
Architecture: Typical Data
Mining System
Pattern evaluation
Data
Databases Warehouse
Intro to Data Mining 19
Data Mining: On What Kinds of
Data?
Relational database
Data warehouse
Transactional database
Advanced database and information repository
Object-relational database
Time-series data
Stream data
Multimedia database
Database
Statistics
Systems
Machine
Learning
Data Mining Visualization
Algorithm Other
Disciplines
Intro to Data Mining 23
Major Issues in Data Mining
Mining methodology
Mining different kinds of knowledge from diverse data
types, e.g., bio, stream, Web
Performance: efficiency, effectiveness, and scalability
Pattern evaluation: the interestingness problem
Incorporation of background knowledge
Handling noise and incomplete data
Parallel, distributed and incremental mining methods
Integration of the discovered knowledge with existing
one: knowledge fusion
Intro to Data Mining 24
Major Issues in Data Mining
User interaction
Data mining query languages and ad-hoc mining
Expression and visualization of data mining results
Interactive mining of knowledge at multiple levels of
abstraction
Applications and social impacts
Domain-specific data mining & invisible data mining