Quiz_2_2021_sol
No clarifications on any question will be provided. Please make any necessary assumptions and solve the quiz.
1. A machine learning model is being developed to perform regression analysis. The three models being
considered are Ridge Regression, Lasso Regression, and K-Nearest Neighbors regression. The training
is performed on a data set of size one thousand. Is it possible to choose hyperparameters for the three
models such that all three give exactly the same prediction? Briefly explain whether this is possible. [10 marks]
Solution:
Yes, it is possible to have the three models give exactly the same answer with a specific choice of the
hyperparameters.
The hyperparameter t for Ridge regression (with the constraint ||β||₂² ≤ t) and Lasso regression (with the
constraint ||β||₁ ≤ t) should be 0, and the k value in kNN regression should be 1000 (the size of the dataset).
With t = 0, all coefficients are forced to zero, leaving only the intercept; with k = 1000, each prediction
averages over the entire training set. This leads to all three regression results being equal to the mean of
the output variable in the data set.
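This can be checked numerically. The sketch below (a NumPy illustration; the toy data is an assumption, and Ridge is solved in penalty form with a very large alpha, which is equivalent to the budget t = 0) shows that the fully shrunk Ridge model and kNN with k = n both collapse to predicting the mean of y:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 1000
X = rng.normal(size=(n, 2))
y = 3.0 * X[:, 0] - X[:, 1] + rng.normal(size=n)

# Ridge in penalty form with a huge alpha (equivalent to budget t = 0):
# all coefficients shrink to ~0, leaving an intercept-only model.
Xc = X - X.mean(axis=0)
alpha = 1e12
beta = np.linalg.solve(Xc.T @ Xc + alpha * np.eye(2), Xc.T @ (y - y.mean()))
ridge_pred = y.mean() + Xc @ beta        # beta ~ 0, so this is ~mean(y)

# Lasso with t = 0 behaves identically: beta = 0, prediction = mean(y).
# kNN regression with k = n averages over the entire training set,
# so every query point also receives mean(y).
knn_pred = np.full(n, y.mean())

assert np.allclose(ridge_pred, knn_pred, atol=1e-6)
```

With a finite but very large alpha the Ridge coefficients are not exactly zero, hence the small tolerance in the comparison.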
2. A highly accurate (accuracy of 99%) classification algorithm is developed that can predict colour
blindness of a person. If colour blindness is found in around 1 in 12 men and 1 in 200 women, comment
on whether the above classification algorithm is good enough for classifying colour blindness in men
and women. Please provide a brief explanation for your answer, with possible remedial solutions. [10 marks]
Solution:
No, this is not a good enough classifier for the classification of colour blindness, especially for women.
Since 199 out of every 200 women are not colour blind, even a classifier that always predicts "not colour
blind" would be (199/200) x 100 = 99.5% accurate, which is higher than the reported 99%. This is because
the two possible outcomes (colour blind and not colour blind) are highly skewed towards one outcome
(not colour blind). To address this, we need to examine the confusion matrix and use metrics such as
precision and recall rather than accuracy alone.
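A quick simulation illustrates the point for women; the population size and the always-negative "classifier" are assumptions chosen for illustration:

```python
import numpy as np

# Hypothetical population of 200,000 women with the stated 1-in-200 rate.
n = 200_000
is_blind = np.zeros(n, dtype=bool)
is_blind[: n // 200] = True              # 1,000 colour-blind women

# A "classifier" that always predicts "not colour blind".
pred = np.zeros(n, dtype=bool)

accuracy = (pred == is_blind).mean()     # 0.995: beats the 99% model...
recall = (pred & is_blind).sum() / is_blind.sum()
print(accuracy, recall)                  # 0.995 0.0 - not one case detected
```

The confusion matrix makes the failure visible immediately: all 1,000 positives land in the false-negative cell, which plain accuracy hides.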
3. Which of the two, linear regression or K-Nearest Neighbors regression, would be a better model for
performing regression analysis on the following data: [10 marks]
a. A highly nonlinear output variable
b. Non-uniformly distributed data in the predictor space
Solution:
a. For this, K-Nearest Neighbors regression would be a better model, since the linear model will not be
able to capture the high nonlinearity.
b. For this, linear regression would be a better model, since the K-Nearest Neighbors model will
perform poorly in regions where the data is sparse.
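A small simulation illustrates part (a); the sine-wave target and k = 5 are arbitrary assumptions for the sketch:

```python
import numpy as np

rng = np.random.default_rng(1)
x = np.sort(rng.uniform(0, 2 * np.pi, 200))
y = np.sin(x) + 0.1 * rng.normal(size=200)     # highly nonlinear target

# Linear fit: the best straight line through a full sine wave.
a, b = np.polyfit(x, y, 1)
lin_mse = np.mean((y - (a * x + b)) ** 2)

# k-NN regression (k = 5), implemented directly.
k = 5
knn = np.array([y[np.argsort(np.abs(x - xi))[:k]].mean() for xi in x])
knn_mse = np.mean((y - knn) ** 2)

assert knn_mse < lin_mse   # kNN tracks the nonlinearity; the line cannot
```

For part (b) the situation reverses: in sparse regions the k nearest neighbours can lie far from the query point, while a (correctly specified) linear model borrows strength from the whole data set.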
4. a.) In a regression analysis, two loss functions, "Mean Square Error" and "Mean Absolute Error", were
used. Please provide one advantage and one disadvantage of each. [10+10 marks]
b.) What would be the trivial machine learning model if the "Mean Bias Error" loss function is used
for the regression analysis?
Solution:
a.)
Mean Square Error
  Advantage: It has mathematical properties that make it easy to calculate gradients.
  Disadvantage: Due to squaring, predictions that are far from the actual values are penalized heavily in
  comparison to less deviated predictions, so it is sensitive to outliers.
Mean Absolute Error
  Advantage: MAE is more robust to outliers since it does not square the errors.
  Disadvantage: MAE is not differentiable at zero, so it needs more complicated tools, such as linear
  programming, to optimize.
b.) The trivial model would be ŷ = ȳ (i.e., the mean value of y from the training dataset). Since the Mean
Bias Error is (1/n) Σ (yᵢ - ŷᵢ), this constant prediction gives exactly zero Mean Bias Error on the training data.
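The contrast between the two gradients, and the zero-MBE trivial model, can be sketched as follows (the toy data with one outlier is an assumption):

```python
import numpy as np

y = np.array([1.0, 2.0, 3.0, 10.0])          # 10.0 is an outlier
yhat = np.full_like(y, y.mean())             # constant prediction c = mean(y)

# MSE gradient w.r.t. each prediction is smooth: 2*(yhat - y)/n.
mse_grad = 2 * (yhat - y) / len(y)           # the outlier dominates

# MAE (sub)gradient is only sign(yhat - y)/n: robust, but not
# differentiable where yhat equals y.
mae_grad = np.sign(yhat - y) / len(y)        # every point weighs the same

# Mean Bias Error: MBE = mean(y - yhat). The constant model yhat = mean(y)
# drives it to exactly zero on the training set.
mbe = np.mean(y - yhat)
print(mse_grad, mae_grad, mbe)               # mbe is exactly 0.0
```

Note how the outlier's MSE gradient (magnitude 3.0) is twice as large as that of the nearest ordinary point, while under MAE every residual contributes the same magnitude 0.25.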
5. In Bagging classification trees with a hundred predictors, a very large majority of predictions were the
same. This made classification based on a majority vote very simple. However, it also means that all
the trees (in bagging classification) are highly correlated. How could this be addressed? Provide a brief
description. [10 marks]
Solution:
Suppose there is one very strong predictor in the data set. If it is also very strongly correlated with
several other predictors, then a large majority of the bagged classification trees will use it near the top
and produce the same classification. This can be addressed by Random Forests, which force each split to
choose from only a small set of randomly chosen predictors (typically about √p of the p predictors), so
that the strong predictor is unavailable at most splits and the trees become de-correlated.
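A minimal sketch of the mechanism (not a full forest implementation; the subset size m = √p follows the usual Random Forest default):

```python
import numpy as np

rng = np.random.default_rng(0)
p = 100                                  # a hundred predictors
n_splits = 500

# Bagging: every split sees all p predictors, so one very strong
# predictor (say index 0) gets chosen near the top of almost every tree.
bagging_candidates = np.arange(p)        # same candidate set at every split

# Random forest: each split draws only m ~ sqrt(p) candidate predictors.
# The strong predictor is then unavailable at most splits.
m = int(np.sqrt(p))                      # m = 10
frac = np.mean([0 in rng.choice(p, size=m, replace=False)
                for _ in range(n_splits)])
print(round(frac, 2))                    # close to m / p = 0.1
```

Because the strong predictor is eligible at only about 10% of splits, the other predictors get a chance to drive the trees, which de-correlates them and lowers the variance of the majority vote.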
6. A data set with moderate random noise (in the output variable) is used for regression analysis. How
would the variance and bias change for linear regression versus tenth-degree polynomial regression
when: [10 marks]
a. You have limited data
b. You have a very large data set
Solution:
                 Linear regression    Tenth-degree polynomial regression
a.) Variance     Low                  High
a.) Bias         Moderate             Low
b.) Variance     Low                  Low (because of the very large dataset)
b.) Bias         Moderate             Low
Thus, if you have very limited data, use a less flexible model; however, if you have a very large dataset,
you can use a more flexible model.
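This can be verified with a small resampling experiment; the sine-like true function and the noise level are assumptions for the sketch. The variance of a model is measured as the spread of its predictions across repeated training samples:

```python
import numpy as np

rng = np.random.default_rng(0)

def f(x):
    return np.sin(2 * x)                     # assumed true function

x_test = np.linspace(0.1, 2.9, 50)

def pred_variance(n, degree, trials=200):
    # Refit the model on `trials` fresh samples of size n and measure
    # how much the fitted curve varies at fixed test points.
    preds = []
    for _ in range(trials):
        x = rng.uniform(0, 3, n)
        y = f(x) + 0.3 * rng.normal(size=n)  # moderate noise
        preds.append(np.polyval(np.polyfit(x, y, degree), x_test))
    return np.mean(np.var(preds, axis=0))

small_lin, small_poly = pred_variance(20, 1), pred_variance(20, 10)
large_lin, large_poly = pred_variance(2000, 1), pred_variance(2000, 10)

assert small_poly > small_lin    # degree 10 is unstable on limited data
assert large_poly < small_poly   # its variance shrinks on a large dataset
```

The linear model's bias (it can never bend to follow the sine) stays regardless of sample size, which is why the flexible model wins once the data is plentiful enough to tame its variance.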
7. What is the key difference between the dimensionality reduction achieved by Lasso Regression and
principal component analysis? [10 marks]
Solution: Lasso Regression removes some variables based on their ability to predict the output variable;
however, principal component analysis reduces dimensionality only based on the variance of the input
(predictor) variables, without considering their ability to predict the output variable.
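A sketch of the contrast, using a hand-rolled coordinate-descent Lasso with a hypothetical penalty lam (both the data set and the penalty value are chosen purely for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 500
x1 = 10 * rng.normal(size=n)        # high variance, unrelated to y
x2 = rng.normal(size=n)             # low variance, drives y
y = 2 * x2 + 0.1 * rng.normal(size=n)
X = np.column_stack([x1, x2])

# PCA looks only at predictor variance: the first component is ~pure x1,
# even though x1 says nothing about y.
w, V = np.linalg.eigh(np.cov(X.T))
pc1 = V[:, np.argmax(w)]
print(np.abs(pc1))                  # ~[1, 0]: keeps the useless x1

# Lasso (coordinate descent with soft-thresholding) looks at y: it keeps
# x2 and shrinks the x1 coefficient to (essentially) zero.
lam = 100.0
beta = np.zeros(2)
for _ in range(100):
    for j in range(2):
        r = y - X @ beta + X[:, j] * beta[j]   # partial residual
        rho = X[:, j] @ r
        beta[j] = np.sign(rho) * max(abs(rho) - lam, 0) / (X[:, j] @ X[:, j])
print(beta)                         # beta[0] ~ 0, beta[1] ~ 1.8
```

The two methods therefore "reduce dimensionality" in opposite senses here: PCA would discard x2 as low-variance, while Lasso discards x1 as useless for predicting y.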
8. For a data set with two predictors (X1, X2), the covariance matrix is given by C = (1/3) [2 1; 1 1].
This corresponds to eigenvalues and eigenvectors as follows: [20 marks]
Eigenvalues: λ1 = 0.8727 and λ2 = 0.1273
Eigenvectors: a1 = [-0.85065, -0.52571]^T and a2 = [0.52571, -0.85065]^T
What would be the eigenvalues and eigenvectors for data sets which have covariance matrices:
a. C1 = (1/3) [2+0.6 1; 1 1+0.6]
b. C2 = (1/3) [2 1; 1 1] + (1/9) [5 3; 3 2]
Solution:
a. C1 = (1/3) [2+0.6 1; 1 1+0.6] = (1/3) [2 1; 1 1] + (0.6/3) I
Adding a multiple of the identity keeps the eigenvectors the same; the eigenvalues become
λ1 + (1/3)(0.6) = 1.0727 and λ2 + (1/3)(0.6) = 0.3273.
b. C2 = (1/3) [2 1; 1 1] + (1/9) [5 3; 3 2] = (1/3) [2 1; 1 1] + ((1/3) [2 1; 1 1])² = C + C²,
since [2 1; 1 1]² = [5 3; 3 2]. Therefore, the eigenvectors remain the same; the eigenvalues become
λ1 + λ1² = 1.63 and λ2 + λ2² = 0.144.
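Both results can be verified numerically:

```python
import numpy as np

M = np.array([[2.0, 1.0], [1.0, 1.0]])
C = M / 3
lam = np.linalg.eigvalsh(C)[::-1]        # descending: [0.8727, 0.1273]

# a) C1 = C + (0.6/3) I: same eigenvectors, every eigenvalue shifted by 0.2.
C1 = (M + 0.6 * np.eye(2)) / 3
lam1 = np.linalg.eigvalsh(C1)[::-1]
assert np.allclose(lam1, lam + 0.2)      # [1.0727, 0.3273]

# b) (1/9) [5 3; 3 2] equals C @ C, so C2 = C + C^2: same eigenvectors,
# eigenvalues lam + lam^2.
C2 = C + np.array([[5.0, 3.0], [3.0, 2.0]]) / 9
assert np.allclose(C2, C + C @ C)
lam2 = np.linalg.eigvalsh(C2)[::-1]
assert np.allclose(lam2, lam + lam**2)   # [1.6343, 0.1435]
```

The key fact in both parts is that any polynomial in C (including C + cI and C + C²) shares C's eigenvectors, with each eigenvalue mapped through the same polynomial.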