Midterm - APS1070 - 2019 - 09 Fall
Open book
Non-programmable & non-communicating calculators are allowed
Time allotted: 90 minutes
b) In the dataset above, what is the predicted class of a new data point at
x = 11, y = 7, using Manhattan distance, for k = 5? (“ ”, “ ” or “ ”) [2]
2. Here are four scatterplots, each expressing the relation between two variables:
3. Here are two vectors x1 and x2: $x_1 = \begin{bmatrix} 2 \\ 1 \end{bmatrix}$, $x_2 = \begin{bmatrix} 1 \\ -2 \end{bmatrix}$
c) Do x1 and x2 form an orthonormal basis for the vector space $\mathbb{R}^2$? Why? [2]
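One way to check an answer to part c) numerically is sketched below. It assumes the vectors read as x1 = (2, 1) and x2 = (1, -2); the helper name `is_orthonormal` and the tolerance `tol` are illustrative choices, not part of the exam.

```python
import math


def dot(u, v):
    """Dot product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))


def is_orthonormal(vectors, tol=1e-9):
    """True if the vectors are pairwise orthogonal and each has unit length."""
    for i, u in enumerate(vectors):
        for j, v in enumerate(vectors):
            expected = 1.0 if i == j else 0.0
            if abs(dot(u, v) - expected) > tol:
                return False
    return True


def normalize(v):
    """Scale a non-zero vector to unit Euclidean length."""
    length = math.sqrt(dot(v, v))
    return [a / length for a in v]


# Assumed reading of the vectors in question 3.
x1 = [2.0, 1.0]
x2 = [1.0, -2.0]

orthogonal = dot(x1, x2) == 0.0             # the vectors are perpendicular
unit_length = is_orthonormal([x1, x2])      # but neither has norm 1
after_scaling = is_orthonormal([normalize(x1), normalize(x2)])
```

Under this reading the two vectors are orthogonal but each has norm sqrt(5), so they form an orthogonal basis that becomes orthonormal only after scaling.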
$$A = \begin{bmatrix} 1 & 1 & 0 \\ -1 & 0 & 0 \\ 0 & 1 & 1 \end{bmatrix}$$
5. You build a classification model for cancer detection using an imbalanced training
dataset and achieve an accuracy of 97% when testing on new data. Explain how
this performance can be deceiving, and what performance metric(s) might be more
appropriate. [2]
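The effect described in question 5 can be illustrated with a small sketch. The confusion-matrix counts below are hypothetical (1000 test cases, 30 of them positive, and a model that predicts "no cancer" for every case); the function name `classification_metrics` is also an assumption made for this example.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, recall and F1 from confusion-matrix counts.

    Ratios with a zero denominator are reported as 0.0.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Hypothetical counts: every case is predicted negative,
# so all 30 positive cases are missed.
m = classification_metrics(tp=0, fp=0, fn=30, tn=970)
```

With these assumed counts the accuracy is 97% even though recall is zero, which is why precision, recall, and F1 tend to be more informative than accuracy on imbalanced data.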