Session 5 ppt
Learning Algorithms
Pabitra Mitra
Indian Institute of Technology Kharagpur
[email protected]
Machine Learning
• Learning Algorithms/Systems: improve performance with experience and generalize to unseen inputs
• Examples:
• Face recognition
• Email spam detection
• Market segmentation
• Rainfall forecasting
Prediction Model: y = f(X, β) + ε
Linear Regression: f(X, β) = β0 + β1X. Find β that minimises the sum of squared errors
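As a quick illustration of the least-squares fit, here is a minimal NumPy sketch; the data are made up for illustration and are not from the slides.

```python
import numpy as np

# Toy data (invented): y ≈ 2 + 3x plus noise
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=50)
y = 2.0 + 3.0 * X + rng.normal(scale=1.0, size=50)

# Design matrix with an intercept column, so beta = [beta0, beta1]
A = np.column_stack([np.ones_like(X), X])

# Least-squares solution minimises the sum of squared errors ||y - A·beta||^2
beta, *_ = np.linalg.lstsq(A, y, rcond=None)
print("beta0 =", beta[0], "beta1 =", beta[1])
```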
Logistic Regression: Binary Classification
Predict whether a student will pass an exam based on how many hours she has studied
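A minimal sketch of the pass/fail example, assuming scikit-learn's LogisticRegression; the study-hours data are invented for illustration.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hours studied (made-up numbers) and whether the student passed (1) or failed (0)
hours = np.array([0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0]).reshape(-1, 1)
passed = np.array([0, 0, 0, 0, 0, 1, 1, 1, 1, 1])

clf = LogisticRegression()
clf.fit(hours, passed)

# Predicted probability of passing after 2.75 hours of study
print(clf.predict_proba([[2.75]])[0, 1])
print(clf.predict([[4.0]]))   # hard 0/1 decision
```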
The K-nearest neighbors of an input x are the K training data points with the
smallest distances to x
K-Nearest Neighbor Classifier
• Find the K-nearest neighbors of an input data point
• Count class membership of the neighbors and find the majority class
• The majority class is the predicted class for the input
Rule of thumb:
K = sqrt(N), where N is the number of training points (see the sketch below)
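A minimal sketch of the classifier described above; the 2-D training points and the helper name knn_predict are invented for illustration.

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_query, k):
    # Step 1: Euclidean distance from the query to every training point
    dists = np.linalg.norm(X_train - x_query, axis=1)
    # Step 2: indices of the K smallest distances
    nearest = np.argsort(dists)[:k]
    # Step 3: majority class among the neighbours is the prediction
    return Counter(y_train[nearest]).most_common(1)[0][0]

# Made-up 2-D training data: class 0 near the origin, class 1 near (5, 5)
X_train = np.array([[0, 0], [1, 0], [0, 1], [5, 5], [6, 5], [5, 6]])
y_train = np.array([0, 0, 0, 1, 1, 1])

k = int(np.sqrt(len(X_train)))                                   # rule of thumb: K = sqrt(N)
print(knn_predict(X_train, y_train, np.array([4.5, 5.2]), k))    # -> 1
```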
Distance Metrics
Distance Measure: Scale Effects
• Different features may have different measurement scales
• E.g., patient weight in kg (range [50,200]) vs. blood protein values in ng/L ([-3,3])
• Consequences
• Patient weight will have a greater influence on the distance between samples
• May bias the performance of the classifier
• Transform raw feature values into z-scores: z_ij = (x_ij − m_j) / s_j
• x_ij is the value of the jth feature for the ith sample
• m_j is the average of feature j over all input samples
• s_j is the standard deviation of feature j over all input samples
• Range and scale of z-scores should be similar (provided the distributions of raw feature values are alike)
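A small sketch of the z-score transform; the feature values (weight and a blood protein measurement) are invented for illustration.

```python
import numpy as np

# Made-up samples: column 0 is weight in kg, column 1 is a blood protein value
X = np.array([[ 60.0, -0.2],
              [ 95.0,  1.1],
              [150.0, -1.5],
              [ 80.0,  0.4]])

m = X.mean(axis=0)            # m_j: per-feature mean
s = X.std(axis=0)             # s_j: per-feature standard deviation
Z = (X - m) / s               # z_ij = (x_ij - m_j) / s_j

print(Z.std(axis=0))          # each column now has unit standard deviation
```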
Nearest Neighbor: Dimensionality
• Problem with Euclidean measure:
• High dimensional data
• curse of dimensionality
• Can produce counter-intuitive results
• Shrinking density – sparsification effect
111111111110 vs 011111111111: d = 1.4142
100000000000 vs 000000000001: d = 1.4142
(the first pair shares ten 1s, the second pair shares none, yet the Euclidean distances are identical)
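The two distances can be verified directly; a short sketch:

```python
import numpy as np

a = np.array([int(c) for c in "111111111110"])
b = np.array([int(c) for c in "011111111111"])
c = np.array([int(c) for c in "100000000000"])
d = np.array([int(c) for c in "000000000001"])

# Each pair differs in exactly two positions, so both distances are sqrt(2)
print(np.linalg.norm(a - b))   # 1.4142..., vectors share ten 1s
print(np.linalg.norm(c - d))   # 1.4142..., vectors share no 1s at all
```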
Nearest Neighbor: Computational Complexity
• Expensive
• To determine the nearest neighbor of a query point q, we must compute the distance to all N
training examples
+ Pre-sort training examples into fast data structures (kd-trees)
+ Compute only an approximate distance (LSH)
+ Remove redundant data (condensing)
• Storage Requirements
• Must store all training data
+ Remove redundant data (condensing)
- Pre-sorting often increases the storage requirements
• High Dimensional Data
• “Curse of Dimensionality”
• Required amount of training data increases exponentially with dimension
• Computational cost also increases dramatically
• Partitioning techniques degrade to linear search in high dimension
kd-tree: Data structure for fast range search
• Index data into a tree
• Search on the tree
• Tree construction: At each level we use a different dimension to split
[Figure: example kd-tree — the root splits on x = 5 (x < 5 vs. x >= 5), lower levels split on y = 3 and y = 6, then on x = 6, partitioning the points A, B, C, D, E]
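A minimal pure-Python sketch of the idea, cycling the split dimension with depth; the point set and helper names are invented for illustration, and in practice a library structure such as scipy.spatial.KDTree would be used.

```python
import numpy as np

def build_kdtree(points, depth=0):
    """Recursively index the points; the split dimension alternates with depth."""
    if len(points) == 0:
        return None
    axis = depth % len(points[0])              # cycle through dimensions level by level
    points = sorted(points, key=lambda p: p[axis])
    mid = len(points) // 2
    return {
        "point": points[mid],                  # median point splits the space
        "axis": axis,
        "left":  build_kdtree(points[:mid], depth + 1),
        "right": build_kdtree(points[mid + 1:], depth + 1),
    }

def nearest(node, query, best=None):
    """Exact nearest-neighbour search on the tree, pruning far branches."""
    if node is None:
        return best
    point, axis = node["point"], node["axis"]
    if best is None or np.linalg.norm(np.subtract(query, point)) < np.linalg.norm(np.subtract(query, best)):
        best = point
    diff = query[axis] - point[axis]
    near, far = (node["left"], node["right"]) if diff < 0 else (node["right"], node["left"])
    best = nearest(near, query, best)
    # Only search the far side if the splitting plane is closer than the best so far
    if abs(diff) < np.linalg.norm(np.subtract(query, best)):
        best = nearest(far, query, best)
    return best

pts = [(2, 3), (5, 4), (9, 6), (4, 7), (8, 1), (7, 2)]   # toy 2-D point set
tree = build_kdtree(pts)
print(nearest(tree, (9, 2)))   # -> (8, 1)
```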
Ensemble Classifier
[Figure: training data S is sampled into multiple data sets S1, S2, …, Sn; a classifier C1, C2, …, Cn is trained on each, and their outputs are combined into a single classifier H]
Bagging (Bootstrap Aggregating)
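A minimal bagging sketch matching the figure, assuming scikit-learn decision trees as the base classifiers C1…Cn and a made-up data set S; scikit-learn's BaggingClassifier packages the same idea.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

# Made-up data set S (not from the slides)
X, y = make_classification(n_samples=200, n_features=5, random_state=0)

rng = np.random.default_rng(0)
n_models = 10
classifiers = []

# Build S1..Sn by bootstrap sampling (with replacement) and fit C1..Cn
for _ in range(n_models):
    idx = rng.integers(0, len(X), size=len(X))      # bootstrap sample indices
    clf = DecisionTreeClassifier().fit(X[idx], y[idx])
    classifiers.append(clf)

# Combined classifier H: majority vote over the individual predictions
votes = np.stack([clf.predict(X) for clf in classifiers])
H = (votes.mean(axis=0) >= 0.5).astype(int)
print("training accuracy of the vote:", (H == y).mean())
```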
Leaves denote class decisions, other nodes denote attributes of data points
Decision Tree Construction
Repeat:
1. Select the “best” decision attribute (A) to split on at the next node
2. For each value of A, create a new descendant of the node
3. Sort training examples to the leaf nodes
4. If training examples are perfectly classified, STOP;
else iterate over the new leaf nodes
Grow the tree just deep enough for perfect classification
– if possible (or approximate at a chosen depth)
Which attribute is best? (Information Gain Maximization)
[Figure: the entire data set partitioned by candidate attribute values — sunny / overcast / rain, hot / mild / cool, high / normal, weak / strong — with the Class: Yes / Class: No composition of each partition]
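A small sketch of information-gain maximization, i.e. step 1 of the construction loop above; the tiny weather-style table below is invented for illustration and is not the data set in the figure.

```python
import math
from collections import Counter

def entropy(labels):
    """H(S) = -sum_c p_c * log2(p_c) over the class proportions."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(rows, attr, labels):
    """Gain(S, A) = H(S) - sum_v |S_v|/|S| * H(S_v)."""
    total = len(labels)
    gain = entropy(labels)
    for value in set(r[attr] for r in rows):
        subset = [lab for r, lab in zip(rows, labels) if r[attr] == value]
        gain -= len(subset) / total * entropy(subset)
    return gain

# Tiny made-up weather table (values invented for illustration)
rows = [
    {"outlook": "sunny",    "humidity": "high",   "wind": "weak"},
    {"outlook": "sunny",    "humidity": "high",   "wind": "strong"},
    {"outlook": "overcast", "humidity": "normal", "wind": "weak"},
    {"outlook": "rain",     "humidity": "normal", "wind": "weak"},
    {"outlook": "rain",     "humidity": "high",   "wind": "strong"},
    {"outlook": "overcast", "humidity": "high",   "wind": "strong"},
]
labels = ["No", "No", "Yes", "Yes", "No", "Yes"]

# Pick the attribute with the largest information gain as the next split
for attr in ["outlook", "humidity", "wind"]:
    print(attr, round(information_gain(rows, attr, labels), 3))
```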
Randomization on attributes + Randomization on training data points
Boosting
Data points are adaptively weighted. Misclassified points are emphasised so that the next classifier
compensates for the errors of earlier classifiers.
AdaBoost
• Data Point Weight Updates
[Figure: AdaBoost iterations — data point weights p1, p2, p3, p4 and the residual updated across rounds; panels plot y against x]
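A minimal sketch of the AdaBoost weight update, assuming decision stumps from scikit-learn as the weak classifiers and a made-up data set; scikit-learn's AdaBoostClassifier provides the same algorithm ready-made.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=200, n_features=5, random_state=1)
y_pm = np.where(y == 1, 1, -1)              # labels in {-1, +1} for the update rule

n_rounds = 10
w = np.full(len(X), 1.0 / len(X))           # start with uniform data point weights
stumps, alphas = [], []

for _ in range(n_rounds):
    stump = DecisionTreeClassifier(max_depth=1).fit(X, y_pm, sample_weight=w)
    pred = stump.predict(X)
    err = np.sum(w * (pred != y_pm)) / np.sum(w)      # weighted training error
    alpha = 0.5 * np.log((1 - err) / (err + 1e-10))   # classifier weight
    # Emphasise misclassified points so the next stump compensates for them
    w = w * np.exp(-alpha * y_pm * pred)
    w = w / w.sum()
    stumps.append(stump)
    alphas.append(alpha)

# Combined prediction: sign of the alpha-weighted vote of the stumps
F = sum(a * s.predict(X) for a, s in zip(alphas, stumps))
print("training accuracy:", (np.sign(F) == y_pm).mean())
```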