Unit 3 & 4 Question Bank
Unit 3 & 4 Question Bank
be used as a classifier
9 If the probability of an event
occurring is 0.4, then
Create
a. What is the odds ratio? CO3
b. What is the log odds ratio?
10 Choose a topic of your interest, such
as a movie, a celebrity, or any buzz
word. Then collect 100 tweets related
to this topic. Hand-tag them as
positive, neutral, or negative. Next,
split them into 80 tweets as the
training set and the remaining 20 as Apply CO3
the testing set. Run one or more
classifiers over these tweets to
perform sentiment analysis. What are
the precision and recall of these
classifiers? Which classifier performs
better than the others?
11 Analyze the strengths and
weaknesses of using clustering for Analyze CO3
customer segmentation
12 Evaluate the effectiveness of TFIDF
Evaluate CO3
in representing textual data
13 Predict house prices using an
advanced regression model and Apply CO3
evaluate the performance
14 How are association rules used in
Understand CO3
market basket analysis?
15 What are the key differences between
hierarchical clustering and K-Means Understand CO3
clustering?
MIT ADT University, Pune
4
What types of loops exist in R, and
Remember CO4
what is the syntax of each type?
5
What types of data plots can be
Understand
created in R? CO4
6
What is the difference between the
Analyze CO4
subset() and sample() functions n R?
Evaluate CO4
8
The data analyst of Argon technology
Mr. John needs to enter the salaries of
10 employees in R. The salaries of
the employees are given in the
following table
9
i. Write the script to sort the values Evaluate CO4
contained in the following vector in
ascending order and descending
order: (23, 45, 10, 34, 89, 20, 67, 99).
Demonstrate the output.
10
Explain different applications of R? Analyze CO4
11
What is meant by the factor in R? Understand CO4
12
How to import .csv file,json file in R? Evaluate CO4
13
How to import web data in R? Remember CO4