0% found this document useful (0 votes)
5 views

Copy of Qb Ml Sem7 Computer 2023 3024.Docx

The document is a question bank for the II Unit Test of the Machine Learning course (ML-CSC701) for the academic year 2023-2024. It includes various questions related to clustering algorithms, distance measures, dimensionality reduction, and classification techniques. Topics covered include DBSCAN, Xgboost, PCA, SVM, and the impact of outliers on clustering algorithms.

Uploaded by

Sankalp Rane
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views

Copy of Qb Ml Sem7 Computer 2023 3024.Docx

The document is a question bank for the II Unit Test of the Machine Learning course (ML-CSC701) for the academic year 2023-2024. It includes various questions related to clustering algorithms, distance measures, dimensionality reduction, and classification techniques. Topics covered include DBSCAN, Xgboost, PCA, SVM, and the impact of outliers on clustering algorithms.

Uploaded by

Sankalp Rane
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Machine Learning (ML-CSC701)

2023-2024
II UNIT TEST Question Bank
Sr. Questions
No

1 Apply the DBSCAN algorithm with similarity threshold of 0.85 (using the similarity matrix ) to the given
data points and MinPts>=2 (Minimum required points in a cluster) what are core, border and noise
(outliers) in the set of points given in table.

2 The cluster has the following data point. Calculate their intra cluster distance. Also calculate its

inertia .

3 Apply the Xgboost on the following dataset:-


Draw the tree for only one sequence. Fill the values in the following table.
4 Plot the hyperplane for the following points:-
(2,2),(3,2),(2,-2),(3,-2),(5,0),(6,2)(6,-2),(7,0). Also Classify the point(6,2) based on the
calculated hyperplane.

Note:- Ensure that you follow all the steps accurately, including the creation of graphs
when necessary. Display the classified point by substituting values into the hyperplane
equation and also depict it on the graph.

5 Calculate EigenValue and EigenVector for the following centered features

6 Construct the RBF pattern classifier for the following dataset:-

Calculate the PCA Values using the following Normalized Eigenvectors on non centered

features:-

8 Why do we need similarity or dissimilarity measure for clustering algorithms?

9 Clustering algorithms are sensitive to starting points. Justify the statement with example.
Also, explain impact of outliers, distance measure and noise on K-mean clustering
algorithm.

10 Why we need to use dimensionality reduction methods? PCA minimizes loss of


information. Justify.

11 What is bagging and boosting? How it is used to combine classifiers to improve results?

12 Explain soft margin and hard margin SVM with suitable example.

13 What is a kernel? How you will choose appropriate kernel for problem in hand?

14 Compute the principal component of following data

CLASS 1
X = 2, 3, 4
Y = 1, 5, 3

CLASS 2

X = 5, 6, 7
Y = 6, 7, 8

15 Find appropriate transformation to convert non-linearly separable data to linearly


separable data:

You might also like