UNIT – 01
Short Questions:
4. What is ridge regression, and how does it differ from ordinary least squares regression?
5. Describe the main difference between ridge regression and lasso regression.
8. How does Linear Discriminant Analysis (LDA) differ from logistic regression?
10. Define the Perceptron Learning Algorithm and state one limitation.
Long Questions:
1. Describe the process of linear regression using least squares. Explain how the model parameters
are estimated, and discuss how least squares minimizes the residuals. Include a brief discussion of
the assumptions of linear regression. (A reference sketch follows this list.)
2. Compare and contrast ridge regression and lasso regression. Explain how each method addresses
multicollinearity and overfitting, and describe scenarios in which one might be preferred over the
other. (A reference sketch contrasting the two penalties follows this list.)
3. Explain Logistic Regression as a classification method. Describe how logistic regression differs from
linear regression, the interpretation of coefficients, and the role of the sigmoid function in making
predictions.
4. Describe the process of subset selection in multiple regression. What are the advantages and
limitations of this approach? Discuss forward selection, backward elimination, and stepwise selection
methods.
5. Explain Linear Discriminant Analysis (LDA) and its assumptions. Describe the goal of LDA in
classification and outline the steps for applying it to a dataset. How does it differ from Quadratic
Discriminant Analysis (QDA)?
6. Discuss the Perceptron Learning Algorithm. Provide a detailed explanation of how it updates
weights and converges to a solution. Include limitations, especially with non-linearly separable data,
and mention how it can be adapted to solve classification problems. (A reference sketch of the
update rule follows this list.)
7. Explain multiple regression with multiple outputs. How does handling multiple output variables
differ from single-output regression, and what additional considerations are necessary for model
evaluation?
8. Describe the significance of regularization in regression models. Discuss how regularization helps
prevent overfitting, and compare the types of regularization penalties applied in ridge regression and
lasso regression.
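For long question 1 above, a minimal sketch of least-squares estimation via the normal equations, assuming NumPy; the data values are illustrative:

    # Solve beta_hat = (X^T X)^(-1) X^T y, which minimizes the sum of squared residuals.
    import numpy as np

    X = np.array([[1, 1.0], [1, 2.0], [1, 3.0], [1, 4.0]])   # first column of ones gives the intercept
    y = np.array([2.1, 3.9, 6.2, 8.1])                       # illustrative responses

    beta_hat = np.linalg.solve(X.T @ X, X.T @ y)             # normal equations
    residuals = y - X @ beta_hat                             # what least squares makes as small as possible
    print(beta_hat, residuals)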
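For long questions 2 and 8 above, a minimal sketch contrasting the L2 (ridge) and L1 (lasso) penalties, assuming scikit-learn; the data and alpha values are illustrative:

    import numpy as np
    from sklearn.linear_model import Ridge, Lasso

    rng = np.random.default_rng(0)
    X = rng.normal(size=(100, 5))
    X[:, 1] = X[:, 0] + 0.01 * rng.normal(size=100)   # two nearly collinear columns (multicollinearity)
    y = 3 * X[:, 0] - 2 * X[:, 2] + rng.normal(size=100)

    ridge = Ridge(alpha=1.0).fit(X, y)   # L2 penalty: shrinks coefficients but keeps all of them
    lasso = Lasso(alpha=0.1).fit(X, y)   # L1 penalty: can set some coefficients exactly to zero
    print("ridge:", ridge.coef_)
    print("lasso:", lasso.coef_)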
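For long question 6 above, a minimal sketch of the perceptron update rule on a small linearly separable (AND-style) dataset, assuming NumPy; the data and learning rate are illustrative:

    import numpy as np

    X = np.array([[0., 0.], [1., 0.], [0., 1.], [1., 1.]])
    y = np.array([-1, -1, -1, 1])        # linearly separable labels
    w, b, lr = np.zeros(2), 0.0, 1.0

    for _ in range(100):                 # repeat passes until no point is misclassified
        errors = 0
        for xi, yi in zip(X, y):
            if yi * (w @ xi + b) <= 0:   # misclassified (or on the boundary)
                w += lr * yi * xi        # move the hyperplane toward the point
                b += lr * yi
                errors += 1
        if errors == 0:
            break                        # convergence is guaranteed only for separable data
    print(w, b)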
UNIT – 02
Short Questions (2 Marks)
3. Describe one advantage and one disadvantage of using K-Nearest Neighbor (K-NN) for
classification.
5. How does K-Nearest Neighbor (K-NN) determine the class of a new data point?
Long Questions:
1. Explain the backpropagation algorithm in detail. Describe each step involved and discuss how it is
used to train neural networks. Additionally, explain why this algorithm is computationally intensive.
2. Discuss the challenges associated with training neural networks. Include issues such as vanishing
gradients, overfitting, and computational costs, and provide some methods used to overcome these
issues.
3. Explain Support Vector Machines (SVM) for classification. Describe the concept of margin
maximization, the importance of support vectors, and how the kernel trick enables SVMs to perform
classification in higher-dimensional feature spaces.
4. Describe the role of K-Nearest Neighbor (K-NN) in image scene classification. Explain how K-NN
can be applied in this domain, including the distance metric used and the limitations of K-NN for
image classification tasks.
5. Discuss the concept of SVM for regression. How does it differ from SVM for classification? Explain
the ε-insensitive loss function and how it helps in regression.
6. Compare and contrast Neural Networks, Support Vector Machines, and K-Nearest Neighbor.
Discuss scenarios where each method might be preferable over the others, considering factors such
as dataset size, dimensionality, and computational resources.
7. Assume that the neurons have sigmoid activation functions. Perform a forward pass and a
backward pass on the network, assuming that the actual output y = 0.5 and the learning rate = 1.
(A worked sketch for a single sigmoid unit follows this list.)
8. Using the SVM algorithm, find the hyperplane with maximum margin for the following data:
N = 3, X1 = (2, 2), X2 = (4, 5), X3 = (7, 4); y1 = -1, y2 = +1, y3 = +1.
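For long question 7 above, a minimal worked sketch of the forward and backward pass for a single sigmoid unit; the network from the question is not reproduced here, so the inputs and initial weights below are assumed, while the target y = 0.5 and the learning rate of 1 follow the question:

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    x = np.array([0.1, 0.3])     # assumed inputs
    w = np.array([0.4, 0.6])     # assumed initial weights
    b = 0.2                      # assumed initial bias
    y_target, lr = 0.5, 1.0      # from the question

    z = w @ x + b                # forward pass
    y_hat = sigmoid(z)

    # Backward pass for squared error E = 0.5 * (y_hat - y_target)^2.
    delta = (y_hat - y_target) * y_hat * (1 - y_hat)   # dE/dz uses the sigmoid derivative y_hat * (1 - y_hat)
    w -= lr * delta * x                                # gradient-descent updates
    b -= lr * delta
    print(y_hat, w, b)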
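For long question 8 above, a minimal sketch that fits a linear SVM to the three given points, assuming scikit-learn; a very large C is used to approximate the hard-margin case:

    import numpy as np
    from sklearn.svm import SVC

    X = np.array([[2., 2.], [4., 5.], [7., 4.]])
    y = np.array([-1, 1, 1])

    clf = SVC(kernel="linear", C=1e6).fit(X, y)
    print("w =", clf.coef_[0], "b =", clf.intercept_[0])   # maximum-margin hyperplane w.x + b = 0
    print("support vectors:", clf.support_vectors_)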
UNIT – 03
Short Questions:
1. What is unsupervised learning, and how does it differ from supervised learning?
4. What are principal components, and why are they important in data analysis?
Long Questions:
1. Describe the process of cluster analysis. What are some methods used for clustering, and
how can they be applied?
2. Explain the concept of association rules in detail. Describe support, confidence, and lift in the
context of association rule mining. (A reference sketch follows this list.)
3. Discuss the Principal Component Analysis (PCA) method. How does PCA help in
dimensionality reduction? (A reference sketch follows this list.)
4. Explain how a Random Forest algorithm works. What are the advantages and limitations of
using Random Forests for classification problems?
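For long question 2 above, a minimal sketch that computes support, confidence, and lift for a rule A -> B by direct counting; the transactions are illustrative:

    transactions = [{"bread", "milk"}, {"bread", "butter"}, {"bread", "milk", "butter"},
                    {"milk"}, {"bread", "milk"}]
    n = len(transactions)
    A, B = {"bread"}, {"milk"}

    support_A  = sum(A <= t for t in transactions) / n          # P(A)
    support_B  = sum(B <= t for t in transactions) / n          # P(B)
    support_AB = sum((A | B) <= t for t in transactions) / n    # P(A and B)

    confidence = support_AB / support_A    # P(B | A)
    lift = confidence / support_B          # > 1 means A and B co-occur more often than by chance
    print(support_AB, confidence, lift)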
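For long question 3 above, a minimal sketch of PCA for dimensionality reduction, assuming scikit-learn; the data are illustrative:

    import numpy as np
    from sklearn.decomposition import PCA

    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 5))
    X[:, 3] = X[:, 0] + 0.1 * rng.normal(size=200)   # a nearly redundant column

    pca = PCA(n_components=2).fit(X)
    print(pca.explained_variance_ratio_)   # share of variance captured by each principal component
    X_reduced = pca.transform(X)           # projection onto the top two components
    print(X_reduced.shape)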
Short Questions:
Long Questions:
1. Discuss how to assess the performance of a classification algorithm using t-tests and
McNemar's test. What are the conditions for using these tests?
2. Explain Analysis of Variance (ANOVA) and its applications. How can ANOVA be used to
analyze differences across multiple groups?
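For the two questions above, a minimal sketch of a one-way ANOVA across three groups and McNemar's test on paired classifier outcomes, assuming SciPy and statsmodels; all numbers are illustrative:

    import numpy as np
    from scipy.stats import f_oneway
    from statsmodels.stats.contingency_tables import mcnemar

    g1 = [23, 25, 21, 22, 24]
    g2 = [30, 28, 31, 29, 27]
    g3 = [24, 26, 25, 23, 27]
    print(f_oneway(g1, g2, g3))          # F statistic and p-value across the groups

    # 2x2 table of (classifier A right/wrong) x (classifier B right/wrong) on the same test set.
    table = np.array([[40, 6],
                      [2, 12]])
    print(mcnemar(table, exact=True))    # exact test is appropriate when off-diagonal counts are small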
Short Questions:
1. Define big data. What are the characteristics of big data (5 Vs)?
2. What are some of the main challenges associated with big data analytics?
Long Questions:
1. Describe the challenges of big data in terms of storage, processing, and analysis. How do
these challenges impact businesses and analytics processes?
2. Discuss various tools and technologies used for big data analytics. How do they help address
the volume, velocity, and variety of big data?
UNIT – 04
Short Questions (1-2 marks each)
a. Define simple linear regression and give an example of where it can be applied.
b. Explain the concept of the p-value in the context of a linear regression coefficient.
c. How does multiple linear regression differ from simple linear regression?
d. List two assumptions of multiple linear regression.
e. In logistic regression, why is the sigmoid function used?
f. Describe a scenario where logistic regression would be more appropriate than linear
regression.
g. Explain the goal of Linear Discriminant Analysis.
h. How does LDA handle dimensionality reduction?
i. Why might one choose ridge regression over simple linear regression?
j. What is the primary role of the regularization parameter in ridge regression?
k. Briefly describe the difference between cross-validation and bootstrapping.
l. Why is cross-validation important in model evaluation?
m. What is the main difference between a classification tree and a regression tree?
n. Describe the process of "pruning" in decision trees.
o. Name two distance metrics that can be used in K-NN classification.
p. What is the goal of Principal Component Analysis?
q. Explain the importance of eigenvalues in PCA.
r. What is the purpose of the “elbow method” in K-means clustering?
s. How does K-means clustering handle categorical data?
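For question (r) above, a minimal sketch of the elbow method: plot the within-cluster sum of squares (inertia) against k and look for the bend. Assumes scikit-learn; the data are illustrative:

    import numpy as np
    from sklearn.cluster import KMeans

    rng = np.random.default_rng(0)
    X = np.vstack([rng.normal(loc=c, scale=0.5, size=(50, 2)) for c in ([0, 0], [5, 5], [0, 5])])

    inertias = []
    for k in range(1, 7):
        km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
        inertias.append(km.inertia_)
    print(inertias)   # inertia drops sharply up to k = 3, then flattens: the "elbow"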