0% found this document useful (0 votes)
4K views

02 - Data Types - MCQ

This document contains multiple choice questions testing knowledge of machine learning concepts like feature engineering, classification metrics, decision trees, and summary statistics. It also includes ratio, proportion, and data types questions. The final questions cover exploratory data analysis techniques and information systems terms.

Uploaded by

Apurba Barua
Copyright
© © All Rights Reserved
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4K views

02 - Data Types - MCQ

This document contains multiple choice questions testing knowledge of machine learning concepts like feature engineering, classification metrics, decision trees, and summary statistics. It also includes ratio, proportion, and data types questions. The final questions cover exploratory data analysis techniques and information systems terms.

Uploaded by

Apurba Barua
Copyright
© © All Rights Reserved
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
You are on page 1/ 4

Which of the following feature engineering techniques are used for categorical

variables?
A. Binarization
B. One-hot encoding
C. Quantization
D. Min-max scaling
ANSWER: B

In classification, a False Positive is when:


A. The model predicts TRUE and the real observation is TRUE
B. The model predicts TRUE and the real observation is FALSE
C. The model predicts FALSE and the real observation is TRUE
D. The model predicts FALSE and the real observation is FALSE
ANSWER: B

Decision Trees Algorithm can be used for


A. Regression only
B. Classification only
C. Both
ANSWER: C

What kind of table compares classifications predicted by the model with the actual
class labels?
A. Chaos table
B. Prediction plot
C. Confusion matrix
D. Residual plot
ANSWER: C

If there are 5 boys and 7 girls, write the ratio of girls to boys.
A. 5 to 7
B. 7 to 5
C. 5 to 12
D. 7 to 12
ANSWER: B

Which ratio is equivalent to 5 to 4


A. 25 to 20
B. 10 to 12
C. 4 to 5
D. 15 to 8
ANSWER: A

Find the MEAN and MODE of the following data set:(8, 12, 16, 8, 6, 7, 9)
A. Mean: 9 Mode: 8
B. Mean: 8 Mode: 9
C. Mean: 9.4 Mode:8
D. Mean: 7.6 Mode: 8
ANSWER: C

To find the _____, add up all the data and then divide by the number of data.
A. Mean
B. Median
C. Mode
D. Range
ANSWER: A

Find the Range: 2, 1, 3, 3, 2, 3, 2


A. 1
B. 2
C. 3
D. 16
ANSWER: B

The most widely used metrics and tools to assess a classification model are:
A. Confusion matrix
B. Cost-sensitive accuracy
C. Area under the ROC curve
D. All of the answer options are correct
ANSWER: D

In one day a movie store rented out 20 comedies.If the ratio of comedies rented to
action movies rented was 4:3,how many action movies were rented?
A. 5
B. 15
C. 3.3333
D. 41/3
ANSWER: B

Which one of the following is discrete data?


A. Sam is 160 cm tall
B. Sam weighs 60 lbs.
C. Sam ran 100 meters in 10.2 seconds
D. Sam has two brothers and one sister
ANSWER: D

Another way of saying 50%


A. Half
B. A quarter
C. 3 quarters
D. Just under
ANSWER: A

Another way of saying one quarter


A. 35%
B. 25%
C. 65%
D. 90%
ANSWER: B

Another way of saying 55%


A. Less than half
B. More than half
C. Two-fifths
D. A quarter
ANSWER: B

Is a histogram the same as a bar graph?


A. Yes, because they both use intervals.
B. No, because a histogram displays numerical data while a bar graph displays
categorical data.
C. Yes, because they both use bars to display data.
D. No, because a histogram displays categorical data while a bar graph displays
numerical data.
ANSWER: B

The vertical axis of a frequency polygon comes from this column:


A. Class width
B. Frequency
C. Class midpoint
D. Class boundaries
ANSWER: B

A survey was given to 100 teenagers to determine their favorite way to stay
entertained during quarantine.The results show that 49 students watch Netflix, 22
students listen to music, 14 students FaceTime their friends, and 15 students
workout. What is the relative frequency of students who like to watch Netflix?
A. 50%
B. 2.04%
C. 49%
D. 22%
ANSWER: C

A table showing how often observations fall within a particular category is also
known as a:
A. Contingency table
B. Frequency table
C. Cumulative frequency table
D. Expected values table
ANSWER: A

When conducting a multiple linear regression, we can visualise the data using:
A. Bar graphs
B. Plane of best fit
C. Line of best fit
D. Error bars
ANSWER: B

What is exploratory data analysis?


A. A rigid framework by which we analyze data
B. An initial way by which we can get a feel for data
C. A type of purely quantitative method of data analysis
D. A set of scientific principles for analyzing data in a categorical manner
ANSWER: B

A Histogram is____.
A. Is a bar chart.
B. Is a graphic representation of the frequency distribution of a qualitative or
categorical variable.
C. Is a graphic representation of the frequency distribution of a continuous
variable.
D. Is an alternative to a pie chart.
ANSWER: C

If I want to save a plot to a PDF file, which of the following is a correct way of
doing that?
A. Construct the plot on the PNG device with png(), then copy it to a PDF with
dev.copy2pdf().
B. Construct the plot on the screen device and then copy it to a PDF file with
dev.copy2pdf()
C. Open the screen device with quartz(), construct the plot, and then close the
device with dev.off().
D. Open the PostScript device with postscript(), construct the plot, then close the
device with dev.off().
ANSWER: B
Which of the following functions is typically used to add elements to a plot in the
base graphics system?
A. hist()
B. plot()
C. text()
D. boxplot()
ANSWER: C

Most often, EDA relies on _____.


A. visual techniques
B. assumptions
C. fixed models
D. testing for statistical significance
ANSWER: A

______________ a collection of hardware, software, data, people, and procedures


that work together to produce quality information.
A. Information Technology
B. Information System
C. Network
D. Database
ANSWER: B

____________is used to help managers make right decision in important areas of


business.
A. Decision Support System
B. Management Information System
C. Transaction Processing System
D. Executive Support Systems
ANSWER: A

Transaction Processing System ____________


A. maintains students’ database
B. provides computer-based training
C. handles distance learning program
D. keeps track of daily transaction in a database
ANSWER: D

MIN stands for


A. Mobile Identification Number
B. Mobile Internet
C. Mobility In Network
D. None of the above
ANSWER: A

Which of the numbers below is NOT part of a 5 number summary used to create a
boxplot?
A. Minimum
B. First quartile
C. Mean
D. Third quartile
E. Maximum
ANSWER: C

You might also like