ML Unit 1

Uploaded by

Roshan Kumar yadav (RauSan)

1. Introduction to Machine Learning

• What is Machine Learning?
o Machine Learning (ML) is a subset of artificial intelligence (AI) that enables
systems to learn and improve from experience without being explicitly
programmed. It focuses on using data and algorithms to imitate the way humans
learn, gradually improving accuracy.
o Key Concept: ML is fundamentally about automating the process of learning
patterns and making predictions based on data.
• Well-Posed Learning Problems
o A learning problem is considered "well-posed" when it has:
1. Task (T): The objective or activity the system needs to perform (e.g.,
predicting, classifying).
2. Performance Measure (P): A metric used to evaluate the success of the
system in performing the task (e.g., accuracy, precision).
3. Experience (E): Historical data or knowledge from which the system
can learn.
o Example of a Well-Posed Learning Problem:
▪ Predicting house prices:
▪ Task (T): Predict the sale price of houses.
▪ Performance Measure (P): Mean Absolute Error (MAE) or
Mean Squared Error (MSE) in prediction.
▪ Experience (E): Historical data of past house sales, including
features like area, number of bedrooms, and location.
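The two performance measures named above can be computed by hand. The following is a minimal sketch in plain Python; the prices are made-up numbers chosen only for illustration, and the function names are not taken from any library.

```python
# Hypothetical sale prices (in thousands); the numbers are made up for illustration.
actual = [200.0, 350.0, 150.0, 500.0]
predicted = [210.0, 330.0, 160.0, 450.0]

def mean_absolute_error(y_true, y_pred):
    """Average of the absolute differences |y - y_hat|."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def mean_squared_error(y_true, y_pred):
    """Average of the squared differences (y - y_hat)^2."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

mae = mean_absolute_error(actual, predicted)
mse = mean_squared_error(actual, predicted)
print(mae, mse)
```

Because MSE squares each error, the single large miss (50) dominates it far more than it dominates MAE.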
• Designing a Learning System
o Step 1: Data Collection
▪ Gather relevant data that represents the problem domain. This could be
structured data (like databases) or unstructured data (like text or
images).
o Step 2: Feature Selection/Engineering
▪ Identifying the relevant attributes or features in the data that will help
the model make accurate predictions.
▪ Involves transforming raw data into inputs that the model can interpret
(e.g., converting dates to numeric values).
o Step 3: Choosing a Model
▪ Selecting an algorithm suited to the problem type (classification,
regression, clustering).
▪ Different models have strengths depending on data type and problem
requirements.
o Step 4: Training
▪ Feeding the data to the model to learn the relationship between inputs
and outputs.
o Step 5: Evaluation
▪ Testing the model’s performance using metrics (e.g., accuracy for
classification, RMSE for regression).
o Step 6: Deployment
▪ Integrating the model into an application where it can make predictions
on new data in real-time or batch processing.
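The six steps above can be sketched end to end on a toy problem. This is only an illustration under stated assumptions: the data is invented (price is exactly 3 times the area), the model is a one-feature straight line, and names such as fit_line and predict_price are hypothetical.

```python
import math

# Step 1: data collection. Hypothetical (area as text, price in thousands) records.
raw = [("50", 150.0), ("80", 240.0), ("100", 300.0),
       ("120", 360.0), ("65", 195.0), ("90", 270.0)]

# Step 2: feature engineering. Convert the raw text field into a numeric feature.
data = [(float(area), price) for area, price in raw]

# Hold out the last two rows for evaluation (used in Step 5).
train, test = data[:4], data[4:]

# Steps 3 and 4: choose a model (a straight line, price = w * area + b)
# and train it with the closed-form least-squares solution.
def fit_line(rows):
    n = len(rows)
    mean_x = sum(x for x, _ in rows) / n
    mean_y = sum(y for _, y in rows) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in rows)
    var = sum((x - mean_x) ** 2 for x, _ in rows)
    w = cov / var
    return w, mean_y - w * mean_x

w, b = fit_line(train)

# Step 5: evaluation with RMSE on the held-out rows.
def rmse(rows):
    return math.sqrt(sum((w * x + b - y) ** 2 for x, y in rows) / len(rows))

test_rmse = rmse(test)

# Step 6: deployment. Expose a function that scores new inputs.
def predict_price(area):
    return w * area + b
```

In a real system each step is far larger (data cleaning, model comparison, monitoring after deployment), but the order of the steps stays the same.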
• Learning vs. Designing
o Learning: Developing models based on data, letting the model find patterns
autonomously.
o Designing: In traditional programming, rules and logic are manually coded by
engineers rather than discovered by learning.
• Training vs. Testing
o Training: The model learns patterns from a set of examples known as the
training dataset (labeled examples, in the supervised case).
o Testing: The trained model is then evaluated on a separate dataset (testing
dataset) to check how well it generalizes to new, unseen data.
o Goal: Ensure that the model performs well on testing data, indicating it can
generalize to new data.
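One common way to obtain separate training and testing datasets is to shuffle the available examples and hold a fraction back. The sketch below assumes a 75/25 split and a fixed random seed; both values and the function name are illustrative choices, not part of the notes.

```python
import random

def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of the rows and cut it into disjoint train and test parts."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]

examples = list(range(20))  # stand-ins for 20 data points
train, test = train_test_split(examples)
```

The key property is that no example appears in both parts, so the test score reflects performance on data the model has not seen.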

2. Characteristics of Machine Learning Tasks

• Predictive Tasks
o These tasks focus on predicting a target variable using given input features.
o Examples:
▪ Classification: Assigning data points to predefined categories (e.g.,
spam vs. non-spam emails).
▪ Regression: Predicting a continuous output variable (e.g., predicting
house prices based on features).
• Descriptive Tasks
o These tasks aim to explore data and identify patterns without making explicit
predictions.
o Examples:
▪ Clustering: Grouping data points with similar characteristics (e.g.,
customer segmentation).
▪ Association Rule Mining: Finding relationships between variables in a
dataset (e.g., “market basket analysis” in retail, where buying one item
is linked to buying another).

3. Machine Learning Models

• Geometric Models
o These models interpret data points as vectors in a geometric space, and the
objective is to find boundaries that separate different classes or regions.
o Examples:
▪ Linear Regression: Uses a linear equation to fit data points to a straight
line.
▪ k-Nearest Neighbors (k-NN): Classifies data points based on the
closest labeled points in the feature space.
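To make the geometric view concrete, here is a minimal k-NN classifier in plain Python. It assumes Euclidean distance and majority voting with k = 3; the points and labels are hypothetical.

```python
import math
from collections import Counter

def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    by_distance = sorted(train, key=lambda item: math.dist(item[0], query))
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]

# Hypothetical 2-D points belonging to two classes.
train = [((1.0, 1.0), "A"), ((1.5, 2.0), "A"), ((2.0, 1.5), "A"),
         ((6.0, 6.0), "B"), ((6.5, 7.0), "B"), ((7.0, 6.5), "B")]

print(knn_predict(train, (1.2, 1.4)))
print(knn_predict(train, (6.4, 6.2)))
```

Note that k-NN has no separate training step: the "model" is the stored data plus the distance function.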
• Logical Models
o Logical models use rules or trees to make decisions based on data features.
o Examples:
▪ Decision Trees: Divide data by asking a series of feature tests (often
“yes/no” questions) until a stopping criterion is met; each leaf then
predicts a class or value.
▪ Rule-Based Systems: If-then rules derived from data to classify or
predict.
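The simplest logical model is a decision tree with a single split, often called a decision stump. The sketch below learns one threshold from data and then applies it as an if-then rule. The data (exclamation-mark counts per email) is hypothetical, and fit_stump is an illustrative name.

```python
def fit_stump(rows):
    """Find the threshold t on a single numeric feature that minimizes
    training errors for the rule: predict 1 if x > t, else 0."""
    values = sorted({x for x, _ in rows})
    # Candidate thresholds are midpoints between neighbouring distinct values.
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])]

    def errors(t):
        return sum((x > t) != bool(y) for x, y in rows)

    return min(candidates, key=errors)

# Hypothetical data: (number of exclamation marks, 1 = spam / 0 = not spam).
rows = [(0, 0), (1, 0), (2, 0), (5, 1), (7, 1), (9, 1)]
threshold = fit_stump(rows)

def predict(x):
    """The learned if-then rule."""
    return "spam" if x > threshold else "not spam"
```

A full decision tree repeats this search recursively on each side of the split, usually scoring splits with an impurity measure rather than a raw error count.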
• Probabilistic Models
o These models are based on probability theory and often work well with
uncertainty.
o Examples:
▪ Naïve Bayes: Uses Bayes’ theorem, together with the assumption that
features are conditionally independent given the class, to classify data.
▪ Gaussian Mixture Models (GMM): Models data as a mixture of
several Gaussian distributions, useful in clustering.
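As an illustration of a probabilistic model, the following is a small Naïve Bayes text classifier written from scratch. It assumes word-count features and add-one (Laplace) smoothing, which the notes do not specify; the four training messages are invented.

```python
import math
from collections import Counter, defaultdict

def train_nb(docs):
    """docs: list of (list_of_words, label). Returns the counts needed to predict."""
    class_counts = Counter(label for _, label in docs)
    word_counts = defaultdict(Counter)
    for words, label in docs:
        word_counts[label].update(words)
    vocab = {w for words, _ in docs for w in words}
    return class_counts, word_counts, vocab

def predict_nb(model, words):
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    scores = {}
    for label in class_counts:
        # log P(label) + sum of log P(word | label), with add-one smoothing.
        score = math.log(class_counts[label] / total_docs)
        denom = sum(word_counts[label].values()) + len(vocab)
        for w in words:
            score += math.log((word_counts[label][w] + 1) / denom)
        scores[label] = score
    return max(scores, key=scores.get)

docs = [("win money now".split(), "spam"),
        ("free money offer".split(), "spam"),
        ("meeting at noon".split(), "ham"),
        ("lunch meeting today".split(), "ham")]
model = train_nb(docs)
```

Working with log-probabilities avoids numerical underflow when many small probabilities are multiplied together.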
• Issues in Machine Learning
o Overfitting: Model learns details/noise in the training data and fails to
generalize to new data.
o Underfitting: Model is too simple to capture the underlying trend in the data.
o Data Quality and Quantity: The effectiveness of machine learning is highly
dependent on high-quality, representative data.
o Model Complexity: The more complex a model, the harder it is to interpret and
the more computing power it may require.
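Overfitting and underfitting can be seen by comparing training error with test error. The sketch below fits three models to invented data whose true relationship is y = 2x with noisy training targets: a constant (too simple), a straight line, and a "memorizer" that copies the nearest training target. All names and numbers are assumptions made for the illustration.

```python
# Toy data: the true relationship is y = 2x; the training targets carry noise.
train = [(0, 1.5), (1, 0.5), (2, 5.5), (3, 4.5), (4, 9.5), (5, 8.5)]
test = [(0.2, 0.4), (1.2, 2.4), (2.2, 4.4), (3.2, 6.4), (4.2, 8.4)]

def fit_mean(rows):
    """Underfits: ignores x and always predicts the average target."""
    mean_y = sum(y for _, y in rows) / len(rows)
    return lambda x: mean_y

def fit_line(rows):
    """Least-squares straight line, matching the true linear trend."""
    n = len(rows)
    mx = sum(x for x, _ in rows) / n
    my = sum(y for _, y in rows) / n
    w = (sum((x - mx) * (y - my) for x, y in rows)
         / sum((x - mx) ** 2 for x, _ in rows))
    return lambda x: w * x + (my - w * mx)

def fit_memorizer(rows):
    """Overfits: returns the target of the closest training point, noise included."""
    return lambda x: min(rows, key=lambda r: abs(r[0] - x))[1]

def mse(model, rows):
    return sum((model(x) - y) ** 2 for x, y in rows) / len(rows)

results = {}
for name, fit in [("mean", fit_mean), ("line", fit_line), ("memorizer", fit_memorizer)]:
    model = fit(train)
    results[name] = (mse(model, train), mse(model, test))  # (train MSE, test MSE)
    print(name, results[name])
```

On this toy data the memorizer reaches zero training error yet does worse on the test points than the line, while the constant model is poor on both sets.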

4. Types of Machine Learning

• Learning Associations
o Association learning focuses on discovering interesting relations between
variables in large datasets.
o Example: Market Basket Analysis (e.g., finding patterns like "people who buy
bread also buy butter").
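A rule such as "bread implies butter" is usually scored by its support and confidence. The sketch below computes both on five invented shopping baskets; the function names are illustrative.

```python
transactions = [
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"bread", "jam"},
    {"milk", "eggs"},
    {"bread", "butter", "eggs"},
]

def support(itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(antecedent, consequent):
    """Estimated P(consequent | antecedent) = support(both) / support(antecedent)."""
    return support(antecedent | consequent) / support(antecedent)

rule_support = support({"bread", "butter"})
rule_confidence = confidence({"bread"}, {"butter"})
```

Here 3 of the 5 baskets contain both items, and 3 of the 4 baskets with bread also contain butter.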
• Supervised Learning
o In supervised learning, the model learns from labeled data, where each data
point has an input and the correct output.
o Types of Supervised Learning:
▪ Classification: Predicts discrete values (e.g., spam vs. not spam).
▪ Regression: Predicts continuous values (e.g., predicting house prices).
o Objective: Make accurate predictions on new data based on learned
relationships.
o Examples of Algorithms: Linear Regression, Support Vector Machine,
Decision Trees, Random Forest, k-NN.
• Unsupervised Learning
o In unsupervised learning, the model learns patterns from data without any
labeled responses.
o Types of Unsupervised Learning:
▪ Clustering: Grouping similar data points together (e.g., customer
segmentation).
▪ Association Analysis: Finding rules that capture associations between
data items.
o Objective: Identify patterns, structure, or groupings within data.
o Examples of Algorithms: k-means, Hierarchical Clustering, Apriori, and PCA
(a dimensionality-reduction method).
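Clustering can be illustrated with a bare-bones k-means on one-dimensional data. This is a sketch, assuming hand-picked initial centers and a fixed number of iterations; the customer spend figures are hypothetical.

```python
def kmeans_1d(points, centers, iterations=10):
    """Lloyd's algorithm on 1-D data, starting from the given initial centers."""
    centers = list(centers)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its cluster
        # (an empty cluster keeps its previous center).
        centers = [sum(c) / len(c) if c else centers[i]
                   for i, c in enumerate(clusters)]
    return centers, clusters

# Hypothetical annual spend (in thousands) for eight customers.
spend = [1.0, 1.5, 2.0, 2.5, 9.0, 9.5, 10.0, 10.5]
centers, clusters = kmeans_1d(spend, centers=[0.0, 5.0])
```

No labels are used anywhere: the two customer segments emerge from the distances alone.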
• Reinforcement Learning
o A model learns by interacting with an environment and receives feedback in the
form of rewards or penalties.
o Key Concepts:
▪ Agent: Learner or decision maker.
▪ Environment: The world in which the agent operates.
▪ Actions: All possible moves the agent can make.
▪ Reward: Feedback from the environment based on the actions taken.
o Objective: Learn a strategy (policy) that maximizes cumulative reward over
time.
o Examples of Applications: Robotics, Game Playing, Autonomous Vehicles.
o Common Algorithms: Q-learning, Deep Q-Networks (DQN), Policy Gradient
Methods.
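The agent, environment, action, and reward concepts can be shown with tabular Q-learning on a tiny corridor. The sketch makes several assumptions not found in the notes: five cells with the reward at the right end, a learning rate of 0.5, a discount factor of 0.9, and a uniformly random exploration policy (acceptable here because Q-learning is off-policy).

```python
import random

N_STATES, GOAL = 5, 4          # corridor cells 0..4, reward at the right end
LEFT, RIGHT = 0, 1
ALPHA, GAMMA = 0.5, 0.9        # learning rate and discount factor (assumed values)

def step(state, action):
    """Environment: move one cell, stay inside the corridor, reward 1 at the goal."""
    nxt = max(0, state - 1) if action == LEFT else min(GOAL, state + 1)
    return nxt, (1.0 if nxt == GOAL else 0.0)

rng = random.Random(0)
q = [[0.0, 0.0] for _ in range(N_STATES)]    # q[state][action]

for _ in range(500):                          # episodes
    state = 0
    while state != GOAL:
        action = rng.choice([LEFT, RIGHT])    # explore uniformly at random
        nxt, reward = step(state, action)
        # Q-learning update: move q toward reward + discounted best next value.
        target = reward + GAMMA * max(q[nxt])
        q[state][action] += ALPHA * (target - q[state][action])
        state = nxt

# Greedy policy read off the learned table for the non-terminal cells.
policy = ["right" if q[s][RIGHT] > q[s][LEFT] else "left" for s in range(GOAL)]
print(policy)
```

The learned values shrink by the discount factor with each extra step from the goal, which is why the greedy policy prefers the shortest path.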

2-Marks Questions
1. What is machine learning?
Answer: Machine learning (ML) is a branch of artificial intelligence focused on
building systems that learn and improve from data without explicit programming. It
allows computers to identify patterns and make decisions.
2. Define a well-posed learning problem.
Answer: A well-posed learning problem has three components: a task (T) that specifies
what the model needs to do, a performance measure (P) that evaluates its success, and
experience (E) from data to learn from.
3. What is the difference between training and testing?
Answer: Training fits a model to the training dataset (labeled data in supervised
learning), while testing evaluates the model's ability to generalize to new, unseen data.
4. What is overfitting?
Answer: Overfitting happens when a model learns noise and specific details in the
training data, resulting in poor performance on new data.
5. Explain supervised learning.
Answer: In supervised learning, the model is trained on labeled data, where each input
has a corresponding output. The model learns to map inputs to correct outputs.
6. What is the goal of unsupervised learning?
Answer: The goal of unsupervised learning is to discover patterns or structure in data
without labeled outputs, such as grouping similar items through clustering.
7. What is reinforcement learning?
Answer: Reinforcement learning is a type of machine learning where an agent learns
by interacting with an environment, receiving rewards or penalties for actions, and
aiming to maximize cumulative rewards.
8. Define feature engineering.
Answer: Feature engineering is the process of selecting, transforming, or creating
features (variables) in a dataset to improve the performance of a machine learning
model.
9. What are predictive tasks?
Answer: Predictive tasks use historical data to predict future or unknown
outcomes; they commonly take the form of regression or classification.
10. Name two examples of logical models in machine learning.
Answer: Decision Trees and Rule-Based Systems are examples of logical models that
use a set of rules to make predictions.

4-Marks Questions
1. Describe the steps in designing a learning system.
Answer: The steps in designing a learning system include data collection, feature
selection/engineering, model selection, training, evaluation, and deployment. Each step
ensures that the model can learn patterns in data, make predictions, and operate
effectively in real-world scenarios.
2. Differentiate between classification and regression tasks.
Answer: Classification tasks aim to assign inputs to discrete categories (e.g., spam vs.
non-spam emails), while regression tasks predict continuous numerical values (e.g.,
house prices).
3. Explain the key differences between supervised and unsupervised learning.
Answer: In supervised learning, the model is trained on labeled data, meaning each
input has an associated output. In unsupervised learning, the model identifies patterns
in unlabeled data, grouping or organizing information without predefined labels.
4. What are geometric models in machine learning? Provide an example.
Answer: Geometric models interpret data points as vectors in a geometric space and
aim to find boundaries that separate classes. An example is k-Nearest Neighbors
(k-NN), which classifies points based on their proximity to labeled points in the
feature space.
5. List and describe two main characteristics of machine learning tasks.
Answer: Machine learning tasks are broadly classified as predictive tasks, which
focus on making predictions based on historical data, and descriptive tasks, which
focus on identifying patterns and insights, like clustering and association rule mining.
6. What is feature engineering and why is it important?
Answer: Feature engineering involves selecting, creating, or modifying features in the
data to improve model performance. It is crucial because the right features can enhance
a model's accuracy, reduce overfitting, and improve interpretability.
7. Define underfitting and describe one way to avoid it.
Answer: Underfitting occurs when a model is too simple to capture the underlying data
patterns. This can be avoided by using a more complex model or by adding relevant
features to improve representation.
8. Explain the concept of reinforcement learning with an example.
Answer: Reinforcement learning involves an agent interacting with an environment,
receiving rewards or penalties based on its actions, and learning to maximize
cumulative rewards. For example, a robot learning to navigate a maze can adjust its
actions to reach the goal faster by learning from rewards received.
9. Describe the difference between overfitting and underfitting.
Answer: Overfitting is when a model captures noise and specific details from training
data, reducing generalization ability. Underfitting occurs when the model is too
simplistic, failing to capture underlying trends, and thus performing poorly on both
training and test data.
10. List and briefly describe two probabilistic models in machine learning.
Answer:
o Naïve Bayes: A classification model based on Bayes’ theorem, assuming
independence among features.
o Gaussian Mixture Models (GMM): Models data as a mixture of several
Gaussian distributions, useful for clustering and density estimation.

6-Marks Questions
1. Explain three types of machine learning with examples.
Answer:
o Supervised Learning: Trains on labeled data to map inputs to outputs, such as
classification (e.g., spam detection) or regression (e.g., predicting prices).
o Unsupervised Learning: Learns patterns in unlabeled data, useful in clustering
(e.g., customer segmentation) and association (e.g., market basket analysis).
o Reinforcement Learning: An agent interacts with an environment, learning
through rewards and penalties to maximize cumulative rewards, as in robotics
or game-playing.
2. Discuss the main types of machine learning models and provide examples.
Answer:
o Geometric Models: Represent data points in space, such as k-Nearest
Neighbors.
o Logical Models: Use rules, like Decision Trees, which split data based on
feature values.
o Probabilistic Models: Use probability distributions to handle uncertainty, such
as Naïve Bayes.
3. How does overfitting occur, and what are some methods to prevent it?
Answer: Overfitting occurs when a model learns noise and specific details in the
training data, making it less effective on new data. To prevent overfitting, use
techniques like cross-validation, pruning (for decision trees), regularization, and
simplifying the model.
4. Explain the process of training and testing in machine learning and their
significance.
Answer: Training fits a model to the training data (labeled data in supervised
learning) so that it learns patterns, while testing evaluates its performance on
unseen data to check that it generalizes well. This process is vital for assessing
a model's predictive power and accuracy.
5. Describe the importance of feature selection and its impact on model performance.
Answer: Feature selection involves choosing the most relevant variables, which helps
improve accuracy, reduces computation, and can prevent overfitting. The right features
provide clearer signals, allowing the model to learn more effectively.
6. What are association tasks in machine learning? Provide an example.
Answer: Association tasks discover interesting relationships between variables in data,
such as finding patterns in transactional data. An example is market basket analysis,
where purchasing one item may suggest a likelihood of purchasing related items.
7. Compare geometric, logical, and probabilistic models.
Answer:
o Geometric Models (e.g., k-NN) focus on spatial relationships in data.
o Logical Models (e.g., Decision Trees) use rules for classification and decision-
making.
o Probabilistic Models (e.g., Naïve Bayes) rely on probabilities and handle
uncertainty in predictions.
8. Discuss the importance of evaluation metrics in machine learning. Give two
examples.
Answer: Evaluation metrics assess model performance, guiding improvement.
Examples include accuracy for classification tasks and mean squared error (MSE)
for regression. Metrics provide quantitative ways to compare models and select the
best-performing one.
9. Explain clustering and its applications in machine learning.
Answer: Clustering groups similar data points into clusters based on shared
characteristics without labels. Applications include customer segmentation in
marketing, grouping documents by topics, and image segmentation.
10. Describe a scenario where reinforcement learning is more suitable than supervised
or unsupervised learning.
Answer: Reinforcement learning is ideal when an agent must make a sequence of
decisions and feedback arrives as rewards, often delayed, rather than as labeled
examples, such as in game-playing or robotics. For instance, an AI agent in a chess
game learns strategies by maximizing rewards (winning) through trial and error.
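Cross-validation, mentioned in the overfitting answer above, can be sketched as follows. The function only produces index splits for k folds; fitting and scoring a model on each split is left out, and the name k_fold_indices is illustrative.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, validation_indices) for k roughly equal folds."""
    indices = list(range(n_samples))
    # Spread the remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        val = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, val
        start += size

folds = list(k_fold_indices(10, 3))
```

Each example is used for validation exactly once, and the k validation scores are typically averaged to estimate how well the model generalizes.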
