Assignment 2 Solution
Assignment 2 Solution
A. Regression problems
B. Classification problems
Ans: C
2. Entropy value of ____ represents that the data sample is pure or homogenous: (1 Mark)
A. 1
B. 0
C. 0.5
Ans: B
3. Entropy value of _____ represents that the data sample has a 50-50 split belonging to two categories: (1
mark)
A. 1
B. 0
C. 0.5
Ans: A
A. the internal nodes in a branch are connected by AND and the branches by AND
C. the internal nodes in a branch are connected by AND and the branches by OR
D. the internal nodes in a branch are connected by OR and the branches by AND
Ans: C
Explanation: definition of decision tree.
if C2 then
if C1 then A3
else A2
endif
else A1, A3
endif
A.
B.
C.
D.
Ans: C
Low F F
Low T T
Medium F F
Medium T T
6. What is the entropy of the dataset? (1
Mark) High F T
A. 0.50 High T T
B. 0.92
C. 1
D. 0
Ans: B
7. Which attribute would information gain choose as the root of the tree? (2 Marks)
A. GPA
B. Studied
C. Passed
Ans: B
Explanation: From information gain criterion. The Studied has the highest information gain.
8. A chemical company has three options: (i) commercial production, (ii) pilot plant and (iii) no
production. The cost of constructing a pilot plant is Rs 3 lacs. If a pilot plant is built, chances of high
and low yield are 80% and 20% respectively. In the case of high yield from the pilot plant, there is
a 75% chance of high yield from the commercial plant. In the case of low yield from the pilot plant,
there is only a 10% chance of high yield from the commercial plant. If the company goes for
commercial plant directly without constructing a pilot plant, then there are 60% chance of high
yield. The company earns Rs 1,20,00,000 in high yield and loses Rs 12,00,000 in low yield. The
optimum decision for the company is: (2 marks)
A. Commercial Production.
B. Pilot plant
C. No Production
Ans: A
Explanation: The company should produce commercially. The final estimated cost is Rs 67,20,000