0% found this document useful (0 votes)
33 views

unit 6 questions and answers

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
33 views

unit 6 questions and answers

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

Unit -6

Short questions and Answers

Short answer type questions:

1. What is Machine learning? Name the three methods of machine learning.

Ans. Machine learning (ML) is a type of artificial intelligence (AI) focused on building
computer systems that learn from data. It uses algorithms that learn from data to
make predictions. The predictions can be generated through three methods known
as supervised learning, unsupervised learning and reinforcement learning. In
supervised learning algorithms learn patterns from existing data, in unsupervised
learning, they discover general patterns in data and reinforcement learning where
they learn through reward and punishment methods.

2. How are correlation measures used in AI applications?


Ans. Feature selection: Identify features highly correlated with the target variable,
potentially indicating relevance for prediction. Exploratory data analysis:
Understand relationships between variables and identify potential or anomalies.
Recommender systems: Recommend items based on past user behaviour and
correlations between items purchased together.

3. Name some examples of regression algorithms?

Ans. Examples of regression algorithms include Linear Regression, Polynomial


Regression, Ridge Regression, Lasso Regression, Elastic Net Regression, Support
Vector Regression (SVR), Decision Tree Regression, Random Forest Regression,
and Gradient Boosting Regression. These algorithms are used to predict
continuous numerical values are widely applied in various fields such as finance,
economics and engineering.
4. What are regression algorithms used for?

Ans. Regression algorithms are used for predicting continuous numerical values based
on input features. They are widely applied in various fields such as finance for stock
price forecasting, economics for predicting economic indicators, healthcare for
disease progression estimation, and engineering for predicting product
performance. Regression analysis helps uncover relationships between variables
and make informed predictions for future data points.
5. What is Linear regression? Give two applications of regression in machine

learning?
Ans. Linear Regression is a supervised learning algorithm. It makes use of one
independent variable X to predict the outcome of a second dependent variable Y.
In machine learning, regression is used to predict outputs and forecast trends.
6. How can outliers impact regression analysis?
Ans. An outlier is a data point that differs significantly from other observations. An outlier
may be due to a variability in the measurement, an indication of data which may be
collected or it may be the result of experimental error. They can significantly skew
the results of regression analysis by distorting the regression line and affecting the
accuracy of predictions.
7. What is the primary difference between classification and regression?
Ans. Classification predicts discrete values, while regression predicts continuous
values.
8. Provide examples of classification problems in real-life scenarios.
Ans. Examples of classification problems include email spam detection, handwritten
character recognition, and sentiment analysis in social media posts.
9. What are some common applications of clustering techniques?
Ans. Common applications of clustering techniques include market segmentation,
statistical data analysis, social network analysis, image segmentation, and anomaly
detection.
10. List the types of clustering methods.
Ans. Types of clustering methods include partitioning clustering, density-based
clustering, distribution model-based clustering, hierarchical clustering, and fuzzy
clustering.
E. Long Answer Questions:
3. Write any two advantages and disadvantages of linear regression.
Ans:
Advantages of Linear regression
• Simple technique and easy to implement
• Efficient to train the machine on this model
Disadvantages of Linear regression
• Sensitivity to outliers, which can significantly impact the analysis.
• Limited to linear relationships between variables.
4. What are the steps involved in k-NN algorithm?
Ans:
● Select the number K of the neighbors
● Calculate the Euclidean distance of K number of neighbors
● Take the K nearest neighbors as per the calculated Euclidean distance.
● Among these k neighbors, count the number of the data points in each
category.
● Assign the new data points to that category for which the number of the
neighbor is maximum.
● Our model is ready.
5. What are the steps involved in k-means clustering?
Ans:
● Select the number K to decide the number of clusters.
● Select random K points or centroids. (It can be other from the input dataset).
● Assign each data point to their closest centroid, which will form the predefined
K clusters.
● Calculate the variance and place a new centroid of each cluster.
● Repeat the third steps, which means reassign each datapoint to the new
closest centroid of each cluster.
● If any reassignment occurs, then go to step-4 else go to FINISH.
● The model is ready.

3. Observe the scatter plot showing the amount of sleep needed per day by age.
What type of correlation is shown here?

Ans. As age increases (moving along the x-axis toward greater numbers), the amount
of sleep needed decreases (y-values decreasing). This is a negative correlation.
This indicates that as individuals grow older, they generally require less sleep.
4. Ramesh is working on an assignment where he needs to categorize real-world applications of
Artificial Intelligence (AI) into two groups: Classification and Clustering. While his initial attempt
seems partially correct, his teacher identified a mistake.

Identify the mistake


Ans: E-mail Spam Detection is categorized under Clustering, but it should be under Classification.

You might also like