0% found this document useful (0 votes)
306 views

MLT MCQ

The document discusses various topics related to machine learning including: - Machine learning is a field of artificial intelligence that uses algorithms to find patterns in data and learn without being explicitly programmed. - A top-down parser hypothesizes a sentence and successively predicts lower level constituents until individual symbols are reached. - Different learning methods do not include introduction as a method. - A model of language consists of categories but does not include structural units.

Uploaded by

Sinduja Baskaran
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
306 views

MLT MCQ

The document discusses various topics related to machine learning including: - Machine learning is a field of artificial intelligence that uses algorithms to find patterns in data and learn without being explicitly programmed. - A top-down parser hypothesizes a sentence and successively predicts lower level constituents until individual symbols are reached. - Different learning methods do not include introduction as a method. - A model of language consists of categories but does not include structural units.

Uploaded by

Sinduja Baskaran
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 13

1. What is true about Machine Learning?

A. Machine Learning (ML) is that field of computer science


B. ML is a type of artificial intelligence that extract patterns out of raw data
by using an algorithm or method.
C. The main focus of ML is to allow computer systems learn from experience
without being explicitly programmed or human intervention.
D. All of the above
View Answer
Ans : D

Explanation: All statement are true about Machine Learning.

2. ML is a field of AI consisting of learning algorithms that?

A. Improve their performance


B. At executing some task
C. Over time with experience
D. All of the above
View Answer
Ans : D

Explanation: ML is a field of AI consisting of learning algorithms that :


Improve their performance (P), At executing some task (T), Over time with
experience (E).

3. p → 0q is not a?

A. hack clause
B. horn clause
C. structural clause
D. system clause
View Answer
Ans : B

Explanation: p → 0q is not a horn clause.

4. The action _______ of a robot arm specify to Place block A on block B.


A. STACK(A,B)
B. LIST(A,B)
C. QUEUE(A,B)
D. ARRAY(A,B)
View Answer
Ans : A

Explanation: The action 'STACK(A,B)' of a robot arm specify to Place block A


on block B.
5. A__________ begins by hypothesizing a sentence (the symbol S) and
successively predicting lower level constituents until individual preterminal
symbols are written.

A. bottow-up parser
B. top parser
C. top-down parser
D. bottom parser
View Answer
Ans : C

Explanation: A top-down parser begins by hypothesizing a sentence (the symbol


S) and successively predicting lower level constituents until individual
preterminal symbols are written.

6. A model of language consists of the categories which does not include


________.

A. System Unit
B. structural units.
C. data units
D. empirical units
View Answer
Ans : B

Explanation: A model of language consists of the categories which does not


include structural units.

7. Different learning methods does not include?

A. Introduction
B. Analogy
C. Deduction
D. Memorization
View Answer
Ans : A

Explanation: Different learning methods does not include the introduction.

8. The model will be trained with data in one single batch is known as ?

A. Batch learning
B. Offline learning
C. Both A and B
D. None of the above
View Answer
Ans : C

Explanation: we have end-to-end Machine Learning systems in which we need


to train the model in one go by using whole available training data. Such kind of
learning method or algorithm is called Batch or Offline learning.

9. Which of the following are ML methods?

A. based on human supervision


B. supervised Learning
C. semi-reinforcement Learning
D. All of the above
View Answer

10. In Model based learning methods, an iterative process takes place on the
ML models that are built based on various model parameters, called ?

A. mini-batches
B. optimizedparameters
C. hyperparameters
D. superparameters
View Answer
Ans : C
Explanation: In Model based learning methods, an iterative process takes place
on the ML models that are built based on various model parameters, called
hyperparameters.

11. Which of the following is a widely used and effective machine learning
algorithm based on the idea of bagging?

A. Decision Tree
B. Regression
C. Classification
D. Random Forest
View Answer
Ans : D

Explanation: The Radom Forest algorithm builds an ensemble of Decision


Trees, mostly trained with the bagging method.

12. To find the minimum or the maximum of a function, we set the gradient to
zero because:

A. The value of the gradient at extrema of a function is always zero


B. Depends on the type of problem
C. Both A and B
D. None of the above
View Answer
Ans : A

Explanation: The gradient of a multivariable function at a maximum point will


be the zero vector of the function, which is the single greatest value that the
function can achieve.

13. Which of the following is a disadvantage of decision trees?

A. Factor analysis
B. Decision trees are robust to outliers
C. Decision trees are prone to be overfit
D. None of the above
View Answer
Ans : C
Explanation: Allowing a decision tree to split to a granular degree makes
decision trees prone to learning every point extremely well to the point of
perfect classification that is overfitting.

14. How do you handle missing or corrupted data in a dataset?

A. Drop missing rows or columns


B. Replace missing values with mean/median/mode
C. Assign a unique category to missing values
D. All of the above
View Answer
Ans : D

Explanation: All of the above techniques are different ways of imputing the
missing values.

15. When performing regression or classification, which of the following is the


correct way to preprocess the data?

A. Normalize the data -> PCA -> training


B. PCA -> normalize PCA output -> training
C. Normalize the data -> PCA -> normalize PCA output -> training
D. None of the above
View Answer
Ans : A

Explanation: You need to always normalize the data first. If not, PCA or other
techniques that are used to reduce dimensions will give different results.

16. Which of the following statements about regularization is not correct?

A. Using too large a value of lambda can cause your hypothesis to underfit the
data.
B. Using too large a value of lambda can cause your hypothesis to overfit the
data
C. Using a very large value of lambda cannot hurt the performance of your
hypothesis.
D. None of the above
View Answer
Ans : D

Explanation: A large value results in a large regularization penalty and


therefore, a strong preference for simpler models, which can underfit the data.

17. Which of the following techniques can not be used for normalization in text
mining?

A. Stemming
B. Lemmatization
C. Stop Word Removal
D. None of the above
View Answer
Ans : C

Explanation: Lemmatization and stemming are the techniques of keyword


normalization.

18. In which of the following cases will K-means clustering fail to give good
results?
1) Data points with outliers
2) Data points with different densities
3) Data points with nonconvex shapes

A. 1 and 2
B. 2 and 3
C. 1 and 3
D. All of the above
View Answer
Ans : D

Explanation: K-means clustering algorithm fails to give good results when the
data contains outliers, the density spread of data points across the data space is
different, and the data points follow nonconvex shapes.

19. Which of the following is a reasonable way to select the number of principal
components "k"?

A. Choose k to be the smallest value so that at least 99% of the varinace is


retained.
B. Choose k to be 99% of m (k = 0.99*m, rounded to the nearest integer).
C. Choose k to be the largest value so that 99% of the variance is retained.
D. Use the elbow method.
View Answer
Ans : A

Explanation: This will maintain the structure of the data and also reduce its
dimension.

20. What is a sentence parser typically used for?

A. It is used to parse sentences to check if they are utf-8 compliant.


B. It is used to parse sentences to derive their most likely syntax tree
structures.
C. It is used to parse sentences to assign POS tags to all tokens.
D. It is used to check if sentences can be parsed into meaningful tokens.
View Answer
Ans : B

Explanation: Sentence parsers analyze a sentence and automatically build a


syntax tree.
1. What is Machine learning?
a) The autonomous acquisition of knowledge through the use of computer
programs
b) The autonomous acquisition of knowledge through the use of manual
programs
c) The selective acquisition of knowledge through the use of computer
programs
d) The selective acquisition of knowledge through the use of manual programs
View Answer
Answer: a
Explanation: Machine learning is the autonomous acquisition of knowledge
through the use of computer programs.
2. Which of the factors affect the performance of learner system does not
include?
a) Representation scheme used
b) Training scenario
c) Type of feedback
d) Good data structures
View Answer
Answer: d
Explanation: Factors that affect the performance of learner system does not
include good data structures.
3. Different learning methods does not include?
a) Memorization
b) Analogy
c) Deduction
d) Introduction
View Answer
Answer: d
Explanation: Different learning methods does not include the introduction.
4. In language understanding, the levels of knowledge that does not include?
a) Phonological
b) Syntactic
c) Empirical
d) Logical
View Answer
Answer: c
Explanation: In language understanding, the levels of knowledge that does not
include empirical knowledge.
5. A model of language consists of the categories which does not include?
a) Language units
b) Role structure of units
c) System constraints
d) Structural units
View Answer
Answer: d
Explanation: A model of language consists of the categories which does not
include structural units.
6. What is a top-down parser?
a) Begins by hypothesizing a sentence (the symbol S) and successively
predicting lower level constituents until individual preterminal symbols are
written
b) Begins by hypothesizing a sentence (the symbol S) and successively
predicting upper level constituents until individual preterminal symbols are
written
c) Begins by hypothesizing lower level constituents and successively predicting
a sentence (the symbol S)
d) Begins by hypothesizing upper level constituents and successively predicting
a sentence (the symbol S)
View Answer
Answer: a
Explanation: A top-down parser begins by hypothesizing a sentence (the symbol
S) and successively predicting lower level constituents until individual
preterminal symbols are written.
7. Among the following which is not a horn clause?
a) p
b) Øp V q
c) p → q
d) p → Øq
View Answer
Answer: d
Explanation: p → Øq is not a horn clause.
8. The action ‘STACK(A, B)’ of a robot arm specify to _______________
a) Place block B on Block A
b) Place blocks A, B on the table in that order
c) Place blocks B, A on the table in that order
d) Place block A on block B
View Answer
Answer: d
Explanation: The action ‘STACK(A,B)’ of a robot arm specify to Place block A
on block B.

1. Which of the following is a widely used and effective machine learning


algorithm based on the idea of bagging?
a. Decision Tree
b. Regression
c. Classification
d. Random Forest - answer
2. To find the minimum or the maximum of a function, we set the gradient to
zero because:
a. The value of the gradient at extrema of a function is always zero -
answer
b. Depends on the type of problem
c. Both A and B
d. None of the above
3. The most widely used metrics and tools to assess a classification model are:
a. Confusion matrix
b. Cost-sensitive accuracy
c. Area under the ROC curve
d. All of the above - answer
4. Which of the following is a good test dataset characteristic?
a. Large enough to yield meaningful results
b. Is representative of the dataset as a whole
c. Both A and B - answer
d. None of the above
5. Which of the following is a disadvantage of decision trees?
a. Factor analysis
b. Decision trees are robust to outliers
c. Decision trees are prone to be overfit - answer
d. None of the above
6. How do you handle missing or corrupted data in a dataset?
a. Drop missing rows or columns
b. Replace missing values with mean/median/mode
c. Assign a unique category to missing values
d. All of the above - answer
7. What is the purpose of performing cross-validation?
a. To assess the predictive performance of the models
b. To judge how the trained model performs outside the sample on test data
c. Both A and B - answer
8. Why is second order differencing in time series needed?
a. To remove stationarity
b. To find the maxima or minima at the local point
c. Both A and B - answer
d. None of the above
9. When performing regression or classification, which of the following is the
correct way to preprocess the data?
a. Normalize the data → PCA → training - answer
b. PCA → normalize PCA output → training
c. Normalize the data → PCA → normalize PCA output → training
d. None of the above
10.Which of the folllowing is an example of feature extraction?
a. Constructing bag of words vector from an email
b. Applying PCA projects to a large high-dimensional data
c. Removing stopwords in a sentence
d. All of the above - answer
11.What is pca.components_ in Sklearn?
a. Set of all eigen vectors for the projection space - answer
b. Matrix of principal components
c. Result of the multiplication matrix
d. None of the above options
12.Which of the following is true about Naive Bayes ?
a. Assumes that all the features in a dataset are equally important
b. Assumes that all the features in a dataset are independent
c. Both A and B - answer
d. None of the above options
13.Which of the following statements about regularization is not correct?
a. Using too large a value of lambda can cause your hypothesis to underfit the
data.
b. Using too large a value of lambda can cause your hypothesis to overfit the
data.
c. Using a very large value of lambda cannot hurt the performance of your
hypothesis.
d. None of the above - answer
14.How can you prevent a clustering algorithm from getting stuck in bad local
optima?
a. Set the same seed value for each run
b. Use multiple random initializations - answer
c. Both A and B
d. None of the above
15.Which of the following techniques can be used for normalization in text
mining?
a. Stemming
b. Lemmatization
c. Stop Word Removal
d. Both A and B - answer
16.In which of the following cases will K-means clustering fail to give good
results? 1) Data points with outliers 2) Data points with different densities 3)
Data points with nonconvex shapes
a. 1 and 2
b. 2 and 3
c. 1, 2, and 3 - answer
d. 1 and 3
17.Which of the following is a reasonable way to select the number of principal
components "k"?
a. Choose k to be the smallest value so that at least 99% of the varinace
is retained. - answer
b. Choose k to be 99% of m (k = 0.99*m, rounded to the nearest integer).
c. Choose k to be the largest value so that 99% of the variance is retained.
d. Use the elbow method
18.You run gradient descent for 15 iterations with a=0.3 and compute J(theta)
after each iteration. You find that the value of J(Theta) decreases quickly and
then levels off. Based on this, which of the following conclusions seems most
plausible?
a. Rather than using the current value of a, use a larger value of a (say a=1.0)
b. Rather than using the current value of a, use a smaller value of a (say
a=0.1)
c. a=0.3 is an effective choice of learning rate - answer
d. None of the above
19.What is a sentence parser typically used for?
a. It is used to parse sentences to check if they are utf-8 compliant.
b. It is used to parse sentences to derive their most likely syntax tree
structures. - answer
c. It is used to parse sentences to assign POS tags to all tokens.
d. It is used to check if sentences can be parsed into meaningful tokens.
20.Suppose you have trained a logistic regression classifier and it outputs a new
example x with a prediction ho(x) = 0.2. This means
a. Our estimate for P(y=1 | x)
b. Our estimate for P(y=0 | x) - answer
c. Our estimate for P(y=1 | x)
d. Our estimate for P(y=0 | x)

You might also like