Data Mining Assignment 1

This document outlines an assignment on data mining and data warehouses. It provides 7 questions to be answered before September 16th, 2021. The questions cover defining data mining and its knowledge discovery process, distinguishing data warehouses from databases, explaining different data mining functionalities with examples, comparing related data mining techniques, discussing challenges of data mining methodology and user interaction, challenges of mining huge datasets versus small datasets, and outlining research challenges of data mining in a specific application domain.

Uploaded by tempman tempman

Data Mining and Data Warehouse

Assignment 1
NOTE: To be completed before 16/09/2021

1. What is data mining? Describe the steps involved in data mining when viewed
as a process of knowledge discovery.
2. How is a data warehouse different from a database? How are they similar?
3. Define each of the following data mining functionalities: characterization,
discrimination, association and correlation analysis, classification, regression,
clustering, and outlier analysis. Give examples of each data mining functionality,
using a real-life database that you are familiar with.
4. Explain the difference and similarity between discrimination and classification,
between characterization and clustering, and between classification and regression.
5. Describe three challenges to data mining regarding data mining methodology
and user interaction issues.
6. What are the major challenges of mining a huge amount of data (e.g., billions of
tuples) in comparison with mining a small amount of data (e.g., a data set of a few
hundred tuples)?
7. Outline the major research challenges of data mining in one specific application
domain, such as stream/sensor data analysis, spatiotemporal data analysis, or
bioinformatics.
