ml_unit2
Supervised learning:
Supervised learning is a process of providing input data as well as correct output data to the machine learning
model. The aim of a supervised learning algorithm is to find a mapping function that maps the input variable (x)
to the output variable (y). In the real world, supervised learning can be used for risk assessment, image
classification, fraud detection, and spam filtering.
We define a linear classifier as a two-class classifier that decides class membership by comparing a linear
combination of the features to a threshold.
Linear classification:
In linear classification, the decision boundary is a straight line (in 2D), a plane (in 3D), or a hyperplane (in
higher dimensions) that separates data points belonging to different classes. The goal is to find a linear
function that can accurately classify data points.
Linear classifiers are simple, fast, and computationally efficient, making them widely used in many real-world
applications. These models make predictions based on a linear combination of input features. Some common
linear classification algorithms include:
1. Perceptron:
A foundational binary linear classifier that updates its weights iteratively to minimize misclassifications.
It’s one of the earliest and simplest models in machine learning.
2. Linear Support Vector Machine (SVM):
A powerful classifier that finds the optimal hyperplane separating the classes by maximizing the margin
—the distance between the hyperplane and the nearest data points from each class.
3. Logistic Regression:
A probabilistic linear classifier commonly used for binary classification tasks. It models the probability
of class membership using the logistic (sigmoid) function and is particularly useful when interpretability
is important.
Linear classifiers work well when the data points are linearly separable, meaning they can be separated by a
straight line or a plane.
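To make this concrete, here is a minimal sketch, assuming scikit-learn is available, that trains the three linear classifiers listed above on a synthetic two-feature dataset; the dataset and all parameter choices are illustrative only.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import Perceptron, LogisticRegression
from sklearn.svm import LinearSVC
from sklearn.model_selection import train_test_split

# Synthetic two-class data with two informative features.
X, y = make_classification(n_samples=200, n_features=2, n_informative=2,
                           n_redundant=0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Each model learns a linear decision boundary of the form w·x + b = 0.
for model in (Perceptron(), LinearSVC(), LogisticRegression()):
    model.fit(X_train, y_train)
    print(type(model).__name__, "accuracy:", model.score(X_test, y_test))
```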
Non-linear classification:
Non-linear classification is employed when the data cannot be accurately separated by a straight line or a
hyperplane in the input feature space. In this case, more complex decision boundaries, such as curves or
surfaces, are used to separate the data into different classes. To achieve this, non-linear classifiers use
techniques like feature transformations or kernel methods to map the original data into a higher-dimensional
space where a linear boundary can be found. Some common non-linear classification algorithms include:
1. Support Vector Machine with non-linear kernels (e.g., Polynomial kernel, Radial Basis Function kernel).
2. Decision Trees: These recursively split the feature space into regions to form non-linear decision boundaries.
3. Random Forest: An ensemble method that combines multiple decision trees to improve performance.
4. Neural Networks: Deep learning models capable of learning complex non-linear relationships between
features.
Non-linear classifiers are more flexible and can handle complex data patterns. They are well-suited for tasks
where the decision boundary is intricate and not easily separable by a straight line or a plane. However, they
may require more computational resources and could be prone to overfitting if not properly regularized. In
summary, linear classifiers work best for linearly separable data, while non-linear classifiers are more
appropriate for complex and non-linear data patterns. The choice between the two depends on the nature of
the data and the task at hand. Sometimes, a combination of both techniques can be used to achieve better
classification performance.
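As an illustration of a non-linear classifier, here is a minimal sketch, assuming scikit-learn, that fits an SVM with an RBF kernel to the "two moons" dataset, which no single straight line can separate; the gamma value is an arbitrary illustrative choice.

```python
from sklearn.datasets import make_moons
from sklearn.svm import SVC
from sklearn.model_selection import train_test_split

# Two interleaving half-circles: not linearly separable in the input space.
X, y = make_moons(n_samples=300, noise=0.2, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# The RBF kernel implicitly maps the data into a higher-dimensional space
# where a linear separating hyperplane can be found.
clf = SVC(kernel="rbf", gamma=2.0).fit(X_train, y_train)
print("RBF-SVM test accuracy:", clf.score(X_test, y_test))
```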
Multi-class & Multi-label Classification:
Multi-class and multi-label classification are two types of classification problems in machine learning that
involve assigning objects to multiple classes or labels.
In multi-class classification, each data point belongs to one and only one class out of multiple mutually
exclusive classes. The goal is to predict the correct class label for each data point from a predefined set of
classes. Some common algorithms used for multi-class classification include:
1. Softmax Regression (Multinomial Logistic Regression): An extension of logistic regression that handles
multiple classes.
2. Support Vector Machine (SVM): Uses methods like one-vs-one or one-vs-all to extend the binary SVM to multiple classes.
3. Decision Trees and Random Forest: Decision trees can directly handle multi-class problems, and random
forests can be used for more robust multi-class classification.
On the other hand, in multi-label classification, each data point can belong to multiple classes or have multiple
labels simultaneously. The goal is to predict the presence or absence of multiple labels for each data point. This
type of classification is commonly used in tasks where an object can have more than one attribute or
characteristic. Some common algorithms used for multi-label classification include:
1. Binary Relevance: Treat each label as a separate binary classification problem and combine the results.
2. Label Powerset: Convert the multi-label problem into a multi-class problem with one class for each unique
combination of labels.
3. Classifier Chains: Create a chain of classifiers, where each classifier predicts one label and takes into account
the predictions of previous classifiers in the chain.
Some algorithms can be extended to handle both multi-class and multi-label tasks; the sketch below contrasts the two settings.
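The following minimal sketch, assuming scikit-learn, shows a multinomial logistic regression for multi-class data, and a one-vs-rest wrapper (scikit-learn's implementation of the Binary Relevance strategy) for multi-label data. The datasets are bundled or synthetic and purely illustrative.

```python
from sklearn.datasets import load_iris, make_multilabel_classification
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OneVsRestClassifier

# Multi-class: each sample belongs to exactly one of three iris species.
X, y = load_iris(return_X_y=True)
multiclass = LogisticRegression(max_iter=1000).fit(X, y)
print("multi-class prediction:", multiclass.predict(X[:1]))

# Multi-label: each sample may carry several labels simultaneously.
Xm, Ym = make_multilabel_classification(n_samples=100, n_labels=2, random_state=0)
multilabel = OneVsRestClassifier(LogisticRegression(max_iter=1000)).fit(Xm, Ym)
print("multi-label prediction:", multilabel.predict(Xm[:1]))  # a 0/1 entry per label
```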
Email Spam Filtering using Binary Classification:
Email spam filtering is a classic application of machine learning, and various techniques can be employed to
identify and filter out spam messages effectively. Spam is unsolicited email containing commercial or otherwise
unwanted content, in contrast to legitimate email (often called ham). A decision tree is one technique that can
solve the spam filtering problem by treating it as a binary classification task, as sketched below.
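A minimal sketch of this idea, assuming scikit-learn; the four-message corpus and its labels are made up purely for illustration.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.tree import DecisionTreeClassifier

emails = ["win a free prize now", "meeting agenda for monday",
          "free offer click now", "lunch with the project team"]
labels = [1, 0, 1, 0]  # 1 = spam, 0 = legitimate (ham)

# Turn raw text into word-count features the tree can split on.
vectorizer = CountVectorizer()
X = vectorizer.fit_transform(emails)

clf = DecisionTreeClassifier(random_state=0).fit(X, labels)
test = vectorizer.transform(["claim your free prize"])
print("spam" if clf.predict(test)[0] == 1 else "ham")
```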
Performance Metrics:
To evaluate the performance of a machine learning algorithm, one can use several different approaches. While
building any machine learning model, the first thing that comes to mind is how to build an accurate, 'good fit'
model, and what challenges will arise during the process.
Confusion Matrix in Machine Learning: A confusion matrix displays the performance of a model, i.e., how the
model has made its predictions. It helps us visualize the points where the model gets confused in discriminating
between two classes. It is easiest to understand as a 2×2 matrix, where the rows represent the actual (true)
labels and the columns represent the predicted labels.
Precision and recall are two of the most important, and most often confused, concepts in machine learning.
They are performance metrics used for pattern recognition and classification.
Precision: Precision is the number of true positive results divided by the number of all positive results,
including those not identified correctly. Precision is also known as positive predictive value. It attempts to
answer the following question: What proportion of positive identifications was actually correct?
Precision is defined as follows:
Precision = TP / (TP + FP)
where TP is the number of true positives and FP is the number of false positives. For example, a model that
makes 10 positive predictions of which 8 are correct (TP = 8, FP = 2) has a precision of 8 / 10 = 0.8.
Recall: The recall is the number of true positive results divided by the number of all samples that should have
been identified as positive.
Recall is also known as sensitivity in diagnostic binary classification. Recall attempts to answer the following
question: What proportion of actual positives was identified correctly? Mathematically, recall is defined as
follows:
Recall = TP / (TP + FN)
where FN is the number of false negatives. For example, a model that finds 8 of 12 actual positives (TP = 8,
FN = 4) has a recall of 8 / 12 ≈ 0.67.
Recall is also known as the true positive rate (TPR). To fully evaluate the effectiveness of a model, you must
examine both precision and recall. Unfortunately, precision and recall are often in tension: improving precision
typically reduces recall, and vice versa. A typical example is classifying emails as spam or not spam using a
confusion matrix, as in the sketch below.
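Here is a minimal sketch, assuming scikit-learn, that computes the confusion matrix, precision, and recall for a made-up spam-classification run; the label vectors are illustrative only.

```python
from sklearn.metrics import confusion_matrix, precision_score, recall_score

y_true = [1, 0, 1, 1, 0, 1, 0, 0, 1, 0]  # 1 = spam, 0 = not spam (made up)
y_pred = [1, 0, 1, 0, 0, 1, 1, 0, 1, 0]

# Rows are actual labels, columns are predicted labels.
print(confusion_matrix(y_true, y_pred))
print("precision = TP / (TP + FP) =", precision_score(y_true, y_pred))  # 0.8
print("recall    = TP / (TP + FN) =", recall_score(y_true, y_pred))     # 0.8
```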
Decision Trees:
A Decision tree is a popular and widely used supervised machine learning algorithm for both classification and
regression tasks, though it is mostly preferred for solving classification problems. It is a non-linear model that
can handle various types of data, including both numerical and categorical features. It is a tree-structured
classifier in which internal nodes represent the features of a dataset, branches represent the decision rules, and
each leaf node represents the outcome. A decision tree therefore contains two kinds of nodes: the decision
node and the leaf node.
The basic idea behind a decision tree is to recursively divide the dataset into subsets based on the values of
different features, making decisions at each step to create a tree-like structure. The leaves of the tree
represent the final decisions or predictions for the instances falling into those subsets.
Entropy: In machine learning, entropy is a measure of uncertainty or disorder in a dataset. It quantifies the
randomness, unpredictability, or impurity of the information being processed. It is commonly used in decision
trees as a criterion to evaluate the quality of a split: when building a decision tree, the algorithm aims to find
the feature to split on such that the resulting subsets have the highest possible information gain, i.e., the
greatest reduction in entropy. Entropy is calculated using the following formula:
Entropy(S) = − Σ (i = 1 to n) Pi log2(Pi)
Where:
S is the dataset for which we are calculating the entropy.
n is the number of classes in the dataset.
Pi is the proportion of instances belonging to class i in the dataset S
In decision trees, the goal is to minimize entropy by finding the feature that leads to the most homogeneous
subsets (i.e., subsets with low entropy) after the split. This process is iteratively applied to create a tree-like
structure, where the leaves represent the final decision or prediction for each instance based on the majority
class in the corresponding leaf node.
Information gain, which is used to decide which feature to split on, is simply the difference between the
entropy of the parent node before the split and the weighted average of the entropies of the child nodes after
the split:
Gain(S, A) = Entropy(S) − Σv (|Sv| / |S|) · Entropy(Sv)
where the sum runs over the values v of attribute A, and Sv is the subset of S for which attribute A has value v.
Decision trees use entropy (or alternative measures like Gini impurity) to make decisions about how to divide
the data at each node, which ultimately allows them to create a tree that can make predictions for new,
unseen instances based on the patterns learned from the training data.
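The entropy and information-gain formulas above can be implemented directly; the following sketch uses plain Python on a toy label column, with made-up labels for illustration.

```python
import math
from collections import Counter

def entropy(labels):
    """Entropy(S) = -sum(Pi * log2(Pi)) over the classes in S."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(parent, children):
    """Parent entropy minus the weighted average entropy of the child subsets."""
    n = len(parent)
    return entropy(parent) - sum(len(c) / n * entropy(c) for c in children)

labels = ["yes", "yes", "yes", "no", "no", "no"]
split = [["yes", "yes", "yes"], ["no", "no", "no"]]   # a perfect split
print(entropy(labels))                  # 1.0: maximum impurity for two classes
print(information_gain(labels, split))  # 1.0: the split removes all uncertainty
```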
ID3 (Iterative Dichotomiser 3) and CART (Classification and Regression Trees) are both decision tree algorithms
used in machine learning for classification and regression tasks.
CART's main advantages include its ability to handle non-linear relationships between features and the target
variable, as well as its ability to handle missing values. Additionally, the resulting decision tree can be visualized
and easily interpreted, making it a valuable tool for understanding the underlying patterns in the data.
ID3 Vs CART:
1. Algorithm type: ID3 (Iterative Dichotomiser 3) is primarily used for classification tasks; it constructs a decision tree from the training data and uses it to classify new instances. CART (Classification and Regression Tree) is versatile and constructs decision trees for both classification and regression purposes.
2. Splitting criterion: ID3 uses Information Gain. CART uses the Gini Index (classification) or variance (regression).
3. Splits: ID3 creates multi-way splits. CART creates binary splits only.
4. Feature types: ID3 works with categorical features only. CART works with both categorical and numerical features.
5. Task support: ID3 supports classification only. CART supports both classification and regression.
6. Pruning: ID3 does not support pruning. CART supports pruning (cost-complexity pruning).
7. Tree structure: ID3 can produce deep and complex trees, which may lead to overfitting on the training data. CART trees tend to be more balanced and less complex due to pruning, which often leads to improved performance on unseen data.
8. Missing values: ID3 handles missing values poorly. CART can handle missing values well.
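As a practical note, scikit-learn's DecisionTreeClassifier is a CART-style implementation (binary splits), but it exposes both split criteria from the table; this minimal sketch, assuming scikit-learn and the bundled iris dataset, contrasts them.

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Binary splits using the Gini index (CART's classification criterion)...
gini_tree = DecisionTreeClassifier(criterion="gini", random_state=0).fit(X, y)
# ...or using entropy, the information-gain criterion associated with ID3.
entropy_tree = DecisionTreeClassifier(criterion="entropy", random_state=0).fit(X, y)

print("gini tree depth:", gini_tree.get_depth())
print("entropy tree depth:", entropy_tree.get_depth())
```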
REGRESSION:
Regression is a type of supervised learning algorithm in machine learning that is used for predicting
continuous numerical values. Regression models are used to describe relationships between variables by fitting
a line to the observed data. Regression allows you to estimate how a dependent variable changes as the
independent variable(s) change. In regression tasks, the goal is to model the relationship between a set of
input features (independent variables) and a continuous target variable (dependent variable). The algorithm
learns from a labeled training dataset, where the target variable's actual values are provided.
The main objective in regression is to find a function or model that best fits the data, allowing us to make
accurate predictions on new, unseen data. The most common form of regression is linear regression, but other
types of regression models include:
1. Multiple Linear Regression
2. Logistic Regression
3. Polynomial Regression
4. Decision Tree Regression, and more.
Linear Regression:
Linear regression is a simple and widely used regression technique that models the relationship between the
input features and the target variable as a linear equation. The equation takes the form: y = ax + b.
The goal is to find the optimal values for the coefficients a and b that minimize the difference between the
predicted values and the actual target values in the training data. This is usually done by minimizing a cost or
loss function, such as the mean squared error.
Training the Model: During the training phase, the model is presented with the labeled training dataset. The
model adjusts the coefficients using an optimization algorithm (e.g., gradient descent) to find the best-fitting
line or hyperplane that minimizes the error between the predicted values and the actual target values.
Making Predictions: Once the model is trained, it can be used to make predictions on new, unseen data. Given
the input features of a new instance, the model calculates the predicted target value using the learned
coefficients and the linear equation.
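A minimal sketch of this train-then-predict workflow, assuming scikit-learn and NumPy; the data are generated around the made-up line y = 2x + 1.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Toy data scattered around the line y = 2x + 1.
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(50, 1))
y = 2 * X[:, 0] + 1 + rng.normal(0, 0.5, size=50)

model = LinearRegression().fit(X, y)   # minimizes mean squared error
print("a ≈", model.coef_[0], "b ≈", model.intercept_)
print("prediction at x = 4:", model.predict([[4.0]])[0])  # close to 9
```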
Regression is commonly used in various fields, such as finance (stock price prediction), economics, healthcare
(medical data analysis), and many other domains where predicting continuous numerical values is essential.
Regression is closely related to correlation and covariance; for example, the slope of a simple linear regression equals Cov(x, y) / Var(x).
Correlation: Correlation coefficients are used to measure how strong a relationship is between two variables.
There are several types of correlation coefficient, but the most popular, and the most common measure of
correlation in statistics, is Pearson's:
r = Σ(xi − x̄)(yi − ȳ) / √( Σ(xi − x̄)² · Σ(yi − ȳ)² )
The formula returns a value between -1 and 1, where:
• 1 indicates a strong positive relationship.
• -1 indicates a strong negative relationship.
• A result of zero indicates no relationship at all.
Covariance: In statistics, covariance is used to assess the relationship between two variables. It is a quantitative
measure of the degree to which the deviation of one variable (X) from its mean is related to the deviation of
another variable (Y) from its mean; in other words, covariance measures the joint variability of two random
variables. Its value can be any positive or negative number:
Cov(X, Y) = Σ(xi − x̄)(yi − ȳ) / n
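Both quantities are one-liners in NumPy; this sketch uses two made-up variables where y is roughly twice x.

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])  # roughly y = 2x

# np.corrcoef returns the correlation matrix; the off-diagonal entry is r.
print("Pearson r:", np.corrcoef(x, y)[0, 1])        # close to 1
# bias=True divides by n, matching the covariance formula above.
print("Cov(X, Y):", np.cov(x, y, bias=True)[0, 1])
```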
Multiple Linear Regression:
Multiple linear regression models the dependent variable as a linear function of several independent variables:
y = b0 + b1 x1 + b2 x2 + … + bn xn
here,
• y is the dependent variable.
• x1, x2, x3, … are independent variables.
• b0 = intercept of the line.
• b1, b2, … are coefficients.
The multiple regression of two variables x1 and x2 is:
y = f(x1, x2) = b0 + b1 x1 + b2 x2
In general, for 'n' independent variables, it is:
y = b0 + b1 x1 + b2 x2 + … + bn xn + E, where E is the error term.
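A minimal sketch of multiple linear regression with two independent variables, assuming scikit-learn and NumPy; the true coefficients below are made up so the fitted values can be checked.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Toy data following y = 3 + 1.5*x1 - 2.0*x2 plus a little noise.
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(100, 2))   # columns are x1 and x2
y = 3 + 1.5 * X[:, 0] - 2.0 * X[:, 1] + rng.normal(0, 0.3, size=100)

model = LinearRegression().fit(X, y)
print("intercept b0 ≈", model.intercept_)    # close to 3
print("coefficients b1, b2 ≈", model.coef_)  # close to [1.5, -2.0]
```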
Logistic Regression:
Logistic regression is one of the most popular Machine Learning algorithms, which comes under the
Supervised Learning technique. It is used for predicting the categorical dependent variable using a given set of
independent variables. It is a statistical method used for binary classification tasks, where the goal is to predict
the probability of an input belonging to one of two classes. Despite its name, logistic regression is a
classification algorithm rather than a regression algorithm. It models the probability of the dependent variable
belonging to a particular class using the logistic function (also known as the sigmoid function).
Logistic regression predicts the output of a categorical dependent variable. Therefore, the outcome must be a
categorical or discrete value: Yes or No, 0 or 1, True or False, etc. However, instead of giving the exact values
0 and 1, it gives probabilistic values that lie between 0 and 1.
Logistic regression is much like linear regression except in how it is used: linear regression is used for solving
regression problems, whereas logistic regression is used for solving classification problems.
In logistic regression, instead of fitting a regression line, we fit an "S"-shaped logistic function, which predicts
two maximum values (0 or 1). The curve from the logistic function indicates the likelihood of something, such
as whether cells are cancerous or not, or whether a mouse is obese or not based on its weight. The logistic
function is defined as follows:
σ(z) = 1 / (1 + e^(−z))
Where z is a linear combination of the input features and their corresponding weights: z = w0 + w1 x1 + w2 x2 +
… + wn xn. Here, w0, w1, w2, …, wn are the coefficients or weights associated with the input features x1, x2,
x3, …, xn.
The probability that the dependent variable is 1, given the independent variables, is:
P(y = 1 | x) = σ(z) = 1 / (1 + e^(−z))
The sigmoid function is a mathematical function used to map predicted values to probabilities. It maps any
real value to a value within the range 0 to 1. Since the output of logistic regression must lie between 0 and 1
and cannot go beyond this limit, the function forms a curve like the "S" shape; this S-shaped curve is called the
sigmoid function or the logistic function.
In logistic regression, we use the concept of a threshold value, which decides between the classes 0 and 1:
values above the threshold map to 1, and values below the threshold map to 0.
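The sigmoid and the threshold rule can be written out directly; this sketch uses plain Python with made-up weights for two features.

```python
import math

def sigmoid(z):
    """Logistic function: maps any real z into the interval (0, 1)."""
    return 1 / (1 + math.exp(-z))

w0, w1, w2 = -4.0, 1.0, 0.5   # illustrative weights
x1, x2 = 3.0, 4.0             # illustrative feature values

z = w0 + w1 * x1 + w2 * x2    # linear combination of the inputs
p = sigmoid(z)                # P(y = 1 | x)
print("probability:", p)                          # sigmoid(1.0) ≈ 0.73
print("predicted class:", 1 if p >= 0.5 else 0)   # threshold at 0.5
```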
On the basis of the categories, Logistic Regression can be classified into three types:
1. Binomial: In binomial logistic regression, there can be only two possible types of the dependent variable,
such as 0 or 1, Pass or Fail, etc.
2. Multinomial: In multinomial logistic regression, there can be 3 or more possible unordered types of the
dependent variable, such as "cat", "dog", or "sheep".
3. Ordinal: In ordinal logistic regression, there can be 3 or more possible ordered types of the dependent
variable, such as "low", "medium", or "high".
Linear Regression Vs Logistic Regression:
1. Hypothesis function: Linear regression uses a linear equation to model the relationship between input features and the target variable. Logistic regression uses the logistic (sigmoid) function to model the probability of a binary outcome.
2. Objective: Linear regression often uses Mean Squared Error (MSE) or Root Mean Squared Error (RMSE) as the loss function to be minimized. Logistic regression uses Maximum Likelihood Estimation (MLE) to maximize the likelihood of observing the given data under the model's parameters.
3. Cost function: Linear regression often uses Mean Squared Error (MSE) as the cost function to be minimized during training. Logistic regression uses the Cross-Entropy Loss (also known as Log Loss) as the cost function.
4. Optimization: In linear regression, Gradient Descent is commonly used to optimize the parameters. In logistic regression, Gradient Descent or other optimization algorithms are used to find the parameters that maximize the likelihood.
5. Regularization: Regularization techniques like L1 and L2 regularization can be applied to prevent overfitting in linear regression. Regularization can also be applied to prevent overfitting in logistic regression.
6. Evaluation: Linear regression is evaluated using metrics like Mean Absolute Error (MAE), Mean Squared Error (MSE), or R-squared. Logistic regression is evaluated using metrics like accuracy, precision, recall, F1-score, and ROC-AUC for binary classification.
7. Use case: Linear regression is used for predicting house prices, stock prices, sales data, etc. Logistic regression is used for binary classification tasks such as spam detection or disease diagnosis.