0% found this document useful (0 votes)
10 views3 pages

Question Bank - Module 4

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
10 views3 pages

Question Bank - Module 4

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

Global Academy of Technology, Bengaluru

Department of Artificial Intelligence and Data Science


Question Bank
Module 4
Module 4: Feature Engineering & Learning
1. Explain the following with an example for each:
a. Imputation
b. Discretization
c. Categorical encoding
d. Feature splitting
e. Handling outliers
f. Variable transformations
g. Scaling & Feature creation
2. Define Feature Engineering. Explain in detail the Feature
Engineering process.
3. Explain various Forms of Learning.
4. Explain Supervised, Unsupervised, Semi-Supervised learning with
simple examples.
5. Explain Reinforcement Learning.
6. Using min-max normalization method, determine the normalized
values for the given data in the range [0,1].

v
1.9
2.1
6.0
6.1
6.5
6.6

7. Determine the min-max normalized values for the given data:

A
-8.6
-8.0
-5.8
-1.4
2.0
6.0
7.9

8. Determine the standardized values for the given data.


OR
Determine z- score values for the given data points.
v
1.9
2.1
6.0
6.1
6.5
6.6
9. Perform standardization on the given data points:

a
-8.6
-8.0
-5.8
-1.4
2.0
6.0
7.9
10. Apply equal frequency binning technique on the given data:
5, 10, 11, 13, 15, 35, 50, 55, 72, 92, 204, 215.

11. Given the data: 136, 24.9, 1.06, 10.94, 3.66, 4.21, 32.09,
30.07, 25.26, 16.24, 9.44, 4.64, 12.69, 16.42, 22.13. Apply binning
technique to have same number of data points in each bin. Assume
bins=5.

12. Consider the given data: 15, 10, 11, 13, 5, 35, 50, 55, 72, 92,
204, 215. Perform equal width binning with number of bins as 3.

13. Consider the given data: 5, 11, 10, 13, 15, 35, 50, 55, 72, 92,
204, 215. Perform equal width binning with number of bins as 5.

14. Perform data imputation on the given dataset:


15. Perform data imputation on the given dataset:

Row Id A B C
0 1 7 NaN
1 2 NaN 14
2 NaN 9 15
3 4 10 NaN
4 5 11 16
5 6 12 17

16. Perform data imputation on the given dataset:

No. Total Bill Gender Discount Day Time


1 16.99 F 2.9 SUN DINNER
2 10.34 M 0.23 SUN DINNER
3 NaN NaN 4 THURS LUNCH
4 22.76 M 2.4 THURS NaN
5 17.29 M NaN THURS LUNCH
6 NaN M 3.3 THURS LUNCH
7 16.66 M 4.2 NaN LUNCH
8 10.07 F 0.82 NaN LUNCH
9 21.01 M 2.2 SUN DINNER
10 20.65 M 1.85 SAT DINNER

You might also like