DMDW

The document is a semester exam key for the Data Warehousing and Data Mining course at Saveetha College of Liberal Arts and Sciences. It includes various questions on topics such as OLAP, classification vs. clustering, data mining techniques, and algorithms like kMeans and kNN. Additionally, it covers applications of data mining, data visualization tools, and the influence of data mining on social platforms.

Uploaded by

sathieshkumars.sse


SAVEETHA INSTITUTE OF MEDICAL AND TECHNICAL SCIENCES
SAVEETHA COLLEGE OF LIBERAL ARTS AND SCIENCES

SEMESTER EXAM KEY

Sub.Code: CSA16 Sub. Name: Data Warehousing and Data Mining

Branch: BCA Year: 2025 Year: II

1. Define OLAP. (2 Marks)

OLAP (Online Analytical Processing) is a technology that enables multidimensional analysis
of data stored in data warehouses. It allows users to perform complex queries, generate
reports, and analyze data interactively from multiple perspectives (e.g., sales by region, time,
or product).
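
As a minimal sketch of what "multiple perspectives" means, the snippet below rolls up a small fact table along chosen dimensions. The table contents, the `DIMENSIONS` mapping, and the `roll_up` helper are illustrative assumptions, not part of any OLAP product.

```python
from collections import defaultdict

# Hypothetical fact table: (region, quarter, product, sales amount)
sales = [
    ("North", "Q1", "Laptop", 120),
    ("North", "Q2", "Laptop", 150),
    ("South", "Q1", "Laptop", 90),
    ("South", "Q1", "Phone", 60),
    ("North", "Q1", "Phone", 80),
]

# Column position of each dimension in a fact row (assumed layout)
DIMENSIONS = {"region": 0, "quarter": 1, "product": 2}

def roll_up(rows, *dims):
    """Sum the sales measure, grouped by the listed dimensions only."""
    totals = defaultdict(int)
    for row in rows:
        key = tuple(row[DIMENSIONS[d]] for d in dims)
        totals[key] += row[3]
    return dict(totals)

by_region = roll_up(sales, "region")
by_region_quarter = roll_up(sales, "region", "quarter")
print(by_region)
print(by_region_quarter)
```

Grouping by fewer dimensions corresponds to a roll-up; adding a dimension back corresponds to a drill-down.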

2. What is the difference between classification and clustering in data mining? (2 Marks)
Classification: A supervised learning technique that assigns predefined labels/classes to data
points based on training data (e.g., spam vs. not spam).
Clustering: An unsupervised learning technique that groups similar data points into clusters
without predefined labels (e.g., customer segmentation).

3. List down the different types of patterns. (2 Marks)

Frequent patterns (e.g., itemsets, subsequences)
Sequential patterns
Association patterns
Predictive patterns
Clustering patterns

4. Define data primitives. (2 Marks)

Data primitives are the basic attributes or elements of data used in data mining tasks, such as
data type (e.g., numeric, categorical), measurement scale (e.g., nominal, ordinal), and
specific values or ranges.

5. What are the three key components of an Association Rule? (2 Marks)

Antecedent (If): The condition or itemset that triggers the rule.
Consequent (Then): The result or itemset predicted by the rule.
Support/Confidence: Measures like support (frequency) and confidence (strength) that
validate the rule.

6. What is the Apriori Algorithm? (2 Marks)
The Apriori Algorithm is a data mining technique used to identify frequent itemsets in
transactional datasets and generate association rules. It works on the principle that all subsets
of a frequent itemset must also be frequent (Apriori property).
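
The level-wise search can be sketched as follows. This is a simplified illustration, assuming support is given as an absolute transaction count; the `apriori` function name and the sample baskets are hypothetical.

```python
from itertools import combinations

def apriori(transactions, min_support):
    """Return {itemset: count} for every itemset found in at least min_support transactions."""
    transactions = [frozenset(t) for t in transactions]
    # Level 1: count single items.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    frequent = dict(current)
    k = 2
    while current:
        # Join step: union pairs of frequent (k-1)-itemsets into k-itemsets.
        candidates = {a | b for a, b in combinations(list(current), 2) if len(a | b) == k}
        # Prune step (Apriori property): every (k-1)-subset must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in current for s in combinations(c, k - 1))
        }
        # Count how many transactions contain each surviving candidate.
        current = {}
        for c in candidates:
            count = sum(1 for t in transactions if c <= t)
            if count >= min_support:
                current[c] = count
        frequent.update(current)
        k += 1
    return frequent

# Hypothetical market baskets
baskets = [
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"bread", "jam"},
    {"butter", "milk"},
    {"bread", "butter", "jam"},
]
frequent_itemsets = apriori(baskets, min_support=2)
for itemset, count in sorted(frequent_itemsets.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), count)
```

The prune step is where the Apriori property saves work: a candidate is discarded without scanning the data if any of its subsets is already known to be infrequent.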

7. What is clustering in data mining? (2 Marks)

Clustering is the process of grouping similar objects into clusters based on their attributes,
without prior knowledge of labels. It is an unsupervised learning method used to discover
patterns or structures in data.

8. What is the objective of using the k-Means algorithm? (2 Marks)

The objective of the k-Means algorithm is to partition a dataset into *k* clusters, where each
data point belongs to the cluster with the nearest mean (centroid), minimizing the
within-cluster variance.
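
A minimal sketch of the usual iterative procedure (Lloyd's algorithm) is shown below for 2-D points. The starting centroids are passed in explicitly to keep the run deterministic; the `kmeans` function and the sample points are assumptions for illustration.

```python
def kmeans(points, centroids, max_iters=100):
    """Alternate assignment and update steps until the centroids stop moving."""
    centroids = [tuple(c) for c in centroids]
    for _ in range(max_iters):
        # Assignment step: put each point in the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            distances = [(p[0] - c[0]) ** 2 + (p[1] - c[1]) ** 2 for c in centroids]
            clusters[distances.index(min(distances))].append(p)
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for old, members in zip(centroids, clusters):
            if members:
                mean_x = sum(m[0] for m in members) / len(members)
                mean_y = sum(m[1] for m in members) / len(members)
                new_centroids.append((mean_x, mean_y))
            else:
                new_centroids.append(old)  # keep an empty cluster's centroid in place
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, clusters

# Hypothetical data: two visibly separate groups
points = [(1, 1), (1, 2), (2, 1), (8, 8), (9, 8), (8, 9)]
centroids, clusters = kmeans(points, centroids=[(1, 1), (8, 8)])
print(centroids)
print(clusters)
```

Each update step moves a centroid to the mean of its members, which is the position that minimizes the sum of squared distances within that cluster.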

9. What is the difference between classification and prediction? (2 Marks)

Classification: Assigns discrete labels to data points (e.g., yes/no, spam/not spam).
Prediction: Estimates continuous values or future outcomes (e.g., predicting sales revenue).

10. List down the applications of data mining. (2 Marks)

Market basket analysis
Fraud detection
Customer segmentation
Healthcare diagnostics
Predictive maintenance

11. Define Data mining. Describe the kinds of data used in data mining. (5 Marks)
Definition: Data mining is the process of discovering patterns, trends, and useful
information from large datasets using statistical, machine learning, and database techniques.
Kinds of Data:
1. Structured Data: Relational databases (tables with rows/columns).
2. Unstructured Data: Text, images, videos.
3. Semi-structured Data: XML, JSON.
4. Time-series Data: Stock prices, sensor data.
5. Spatial Data: Maps, geographic information.

12. Compare different data visualization tools and techniques in terms of their effectiveness
for different types of data. (5 Marks)
Bar Charts: Effective for categorical data (e.g., sales by region).
Line Graphs: Best for time-series data (e.g., stock trends).
Scatter Plots: Suitable for numerical data showing relationships (e.g., height vs. weight).
Heatmaps: Useful for large datasets with intensity variations (e.g., website clicks).
Pie Charts: Good for showing proportions (e.g., market share), but less effective for complex
data.

13. Discuss how the k-Nearest Neighbors (kNN) algorithm works in classification. (5 Marks)
kNN is a supervised learning algorithm that classifies a data point based on the majority
class of its *k* nearest neighbors.
Steps:
1. Calculate the distance (e.g., Euclidean) between the new data point and all training data
points.
2. Identify the *k* closest points (neighbors).
3. Assign the class with the most votes among the *k* neighbors.
kNN works well for small, simple datasets but is computationally expensive for large data,
because every prediction compares the new point against all training points.
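
The three steps can be sketched in a few lines. The `knn_classify` function, the feature tuples, and the labels "A" and "B" are made up for illustration.

```python
import math
from collections import Counter

def knn_classify(training, query, k=3):
    """training is a list of (features, label) pairs; return the majority label of the k nearest."""
    # Step 1: rank every training point by Euclidean distance to the query.
    by_distance = sorted(training, key=lambda item: math.dist(item[0], query))
    # Step 2: keep the k closest neighbors.
    nearest = by_distance[:k]
    # Step 3: majority vote over their labels.
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Hypothetical labeled points
training = [
    ((1.0, 1.0), "A"), ((1.5, 2.0), "A"), ((2.0, 1.0), "A"),
    ((6.0, 6.0), "B"), ((7.0, 7.5), "B"), ((6.5, 7.0), "B"),
]
label_near_a = knn_classify(training, (1.2, 1.4))
label_near_b = knn_classify(training, (6.4, 6.6))
print(label_near_a, label_near_b)
```

An odd *k* avoids tied votes in two-class problems.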

14. Explain the trend in data mining. (5 Marks)

Trends in data mining include:
Big Data Integration: Handling large-scale, unstructured data.
AI and Machine Learning: Using deep learning for complex patterns.
Real-time Mining: Processing streaming data (e.g., IoT).
Privacy-preserving Mining: Techniques like anonymization to protect data.
Cloud-based Mining: Leveraging cloud platforms for scalability.

15. Explain in detail about the functionalities of data mining. (12 Marks)
Definition Recap: Data mining extracts knowledge from data.
Functionalities:
1. Pattern Discovery: Identifies frequent itemsets, sequences, etc.
2. Classification: Assigns labels (e.g., Decision Trees, SVM).
3. Clustering: Groups similar objects (e.g., k-Means).
4. Association Rule Mining: Finds relationships (e.g., market basket analysis).
5. Prediction: Forecasts trends (e.g., regression).
6. Anomaly Detection: Identifies outliers (e.g., fraud detection).
7. Summarization: Provides concise data representations.
8. Visualization: Aids in interpreting results.
Examples: Fraud detection (anomaly), customer segmentation (clustering).
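
As one simple illustration of the anomaly detection functionality, the sketch below flags values whose z-score exceeds a threshold. The transaction amounts and the threshold of 2 are assumptions; practical fraud detection uses richer features and models.

```python
import statistics

def zscore_outliers(values, threshold=2.0):
    """Return the values lying more than `threshold` standard deviations from the mean."""
    mean = statistics.mean(values)
    std = statistics.pstdev(values)  # population standard deviation
    if std == 0:
        return []  # all values identical, nothing stands out
    return [v for v in values if abs(v - mean) / std > threshold]

# Hypothetical transaction amounts with one unusually large payment
amounts = [52, 48, 50, 51, 49, 47, 53, 50, 250]
outliers = zscore_outliers(amounts)
print(outliers)
```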

16. How would you apply different data mining techniques to a given dataset? Provide
examples for each type. (12 Marks)
Dataset Example: Retail sales data.
Techniques:
1. Classification: Use Decision Trees to classify customers as "loyal" or "not loyal" based
on purchase history.
2. Clustering: Apply k-Means to segment customers by buying patterns.
3. Association Rule Mining: Use Apriori to find rules like "If bread, then butter."
4. Prediction: Use regression to predict next month’s sales.
5. Anomaly Detection: Detect unusual transactions (e.g., fraud).
6. Time-series Analysis: Analyze sales trends over time.
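
For the prediction technique in the list above, a least-squares line fitted to past monthly totals gives a rough next-month forecast. The sales figures and the `fit_line` helper are hypothetical, and a straight-line trend is an assumption that real retail data may not satisfy.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical monthly sales totals for months 1 to 6
months = [1, 2, 3, 4, 5, 6]
monthly_sales = [100, 112, 119, 131, 139, 152]

slope, intercept = fit_line(months, monthly_sales)
forecast_month_7 = slope * 7 + intercept
print(round(forecast_month_7, 1))
```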

17. Explain the concept of Association Rule Mining. Discuss the different types of
association rules and their significance. (12 Marks)
Concept: Association Rule Mining identifies relationships between items in large datasets
(e.g., "If A, then B").
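Measures: Support (how often A and B occur together), Confidence (how often B occurs when A
occurs), Lift (confidence relative to how often B occurs overall; a value above 1 indicates a
positive correlation).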
Types:
1. Boolean Rules: Binary presence/absence (e.g., "bread → butter").
2. Quantitative Rules: Numeric attributes (e.g., "age > 30 → high income").
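3. Multilevel Rules: Items taken from different levels of a concept hierarchy (e.g., "milk →
bread" at a general level, "skim milk → wheat bread" at a more specific level).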
4. Multidimensional Rules: Multiple attributes (e.g., "age > 30 and male → luxury car").
Significance: Improves marketing, inventory management, and decision-making.
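
The three measures can be computed directly from transaction counts. The sketch below evaluates the Boolean rule "bread → butter" on a made-up set of baskets; the `rule_measures` helper is an assumption for illustration.

```python
def rule_measures(transactions, antecedent, consequent):
    """Return (support, confidence, lift) for the rule antecedent -> consequent."""
    n = len(transactions)
    antecedent, consequent = set(antecedent), set(consequent)
    count_a = sum(1 for t in transactions if antecedent <= t)
    count_c = sum(1 for t in transactions if consequent <= t)
    count_both = sum(1 for t in transactions if (antecedent | consequent) <= t)
    support = count_both / n               # share of baskets containing A and B
    confidence = count_both / count_a      # share of A-baskets that also contain B
    lift = confidence / (count_c / n)      # confidence relative to B's overall frequency
    return support, confidence, lift

# Hypothetical market baskets
baskets = [
    {"bread", "butter"},
    {"bread", "butter", "milk"},
    {"bread", "jam"},
    {"milk"},
    {"milk", "jam"},
]
support, confidence, lift = rule_measures(baskets, {"bread"}, {"butter"})
print(support, round(confidence, 3), round(lift, 3))
```

Here the lift is above 1, so in this toy data butter appears with bread more often than its overall frequency would suggest.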

18. Discuss the various types of classification algorithms with examples. (12 Marks)
Types:
1. Decision Trees: Splits data based on features (e.g., spam email detection).
2. k-Nearest Neighbors (kNN): Classifies based on proximity (e.g., image recognition).
3. Support Vector Machines (SVM): Finds optimal hyperplane (e.g., text classification).
4. Naive Bayes: Probabilistic classifier (e.g., sentiment analysis).
5. Neural Networks: Complex patterns (e.g., handwriting recognition).
Examples: Classifying customers (Decision Trees), disease diagnosis (SVM).

19. How does data mining influence social platforms and social behavior? (12 Marks)
Influence on Platforms:
1. Personalization: Recommends content (e.g., Netflix, YouTube).
2. Ad Targeting: Mines user data for ads (e.g., Facebook).
3. Trend Analysis: Identifies viral topics (e.g., Twitter hashtags).
4. Sentiment Analysis: Gauges public opinion (e.g., election predictions).
Influence on Behavior:
1. Shapes preferences through tailored content.
2. Encourages engagement via gamification (likes, shares).
3. Raises privacy concerns, altering trust in platforms.
Example: Mining X posts to predict user reactions to news.
