Lecture 2 - Principle of Machine Learning
Outline
1. The general model of learning from examples
2. Empirical risk minimization inductive principle
3. Probability Theory and Bayesian Classification
4. Generative and Discriminative Models
5. Naive Bayesian Classification
The General Model of Learning from Examples
• Suppose that there is a functional relationship between two sets of
objects X and Y:
f: X -> Y
• Given a finite set of examples:
D = {(xi, yi) | i = 1, 2, …, N}, where xi ∈ X and yi ∈ Y
Objective of Learning
• Learn to generalize from a finite set of examples
• The learnt function can then predict the output y for a new input x
Classification and Regression
• y = f(x)
• If y is a real value, i.e. Y = R, then we have a regression problem
• If y is a value in a given finite discrete set, then we have a
classification problem
Data Representation
• x is a vector of features
x = (x1, x2, ..., xd)
X = R^d
• y is a real number in the regression problem
• In the classification problem, y is discrete:
• binary classification: y ∈ {0, 1} or {-1, +1}
• multiple classes: y ∈ {1, 2, ..., k}, or a one-hot vector (0, ..., 0, 1, 0, ..., 0)
Loss function
• Suppose that (x, y) is an example. We want to measure the difference
between the ground-truth value y and the predicted value h(x) with a loss function L(y, h(x)); typical choices are shown below
• For regression:
• For classification:
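A typical choice (assumed here; this is the standard convention rather than something specific to these slides) is squared loss for regression and 0-1 loss for classification:
L(y, h(x)) = (y − h(x))²                      (regression)
L(y, h(x)) = 1 if y ≠ h(x), 0 otherwise       (classification)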
Expected Risk and Empirical Risk
• Expected risk/loss is the mean of L(y, h(x)) over the whole space X × Y
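Written out (a standard formulation, assumed here), with p(x, y) the underlying data distribution:
R(h) = E_{(x,y)~p(x,y)} [ L(y, h(x)) ] = ∫ L(y, h(x)) p(x, y) dx dy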
Empirical Risk
• For regression:
• For classification:
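Since p(x, y) is unknown, the expected risk is approximated by the average loss over the training set D; with the typical losses above (again an assumption) this gives:
Remp(h) = (1/N) Σ_{i=1..N} L(yi, h(xi))
Regression (squared loss):     Remp(h) = (1/N) Σ_i (yi − h(xi))²
Classification (0-1 loss):     Remp(h) = (1/N) Σ_i 1[yi ≠ h(xi)]   (the training error rate)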
Empirical Risk Minimization Inductive Principle
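Stated as a formula (standard form): choose the hypothesis in the class H that minimizes the empirical risk on the training set,
h* = argmin_{h ∈ H} Remp(h)
The hope is that a small empirical risk also yields a small expected risk; when it does not, the model overfits (next slide).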
Overfitting
Probability Theory for Statistical Machine Learning
• Probability theory is a mathematical framework for quantifying our uncertainty about the
world. It allows us to reason effectively in situations where being certain is impossible.
Probability theory is at the foundation of many machine learning algorithms.
• Probability theory simply tells us how likely an event is to occur, and its value
always lies between 0 and 1 (inclusive of 0 and 1).
Some basic probabilities
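As a reminder, the standard rules (listed here for reference) are:
0 ≤ P(A) ≤ 1,   P(Ω) = 1
Sum rule:       P(X) = Σ_Y P(X, Y)
Product rule:   P(X, Y) = P(Y | X) P(X)
Bayes' rule:    P(Y | X) = P(X | Y) P(Y) / P(X)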
Probability Theory for Statistical Machine Learning
• Discrete Probability Distribution: The mathematical definition of a discrete probability function, p(x), is a function that satisfies the following properties. This is referred to as a Probability Mass Function.
• Continuous Probability Distribution: The mathematical definition of a continuous probability function, f(x), is a function that satisfies the following properties. This is referred to as a Probability Density Function.
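The defining properties (standard definitions, stated here for reference):
PMF p(x):  p(x) ≥ 0 for all x,  Σ_x p(x) = 1,  and  P(X = x) = p(x)
PDF f(x):  f(x) ≥ 0 for all x,  ∫ f(x) dx = 1,  and  P(a ≤ X ≤ b) = ∫_a^b f(x) dx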
Discriminative and Generative Models
Let's say you have input data x and you want to classify the data into labels y.
A generative model learns the joint probability distribution p(x, y), while a
discriminative model learns the conditional probability distribution p(y|x).
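The two are linked by Bayes' rule, so a generative model can also be used for classification (standard relation):
p(y | x) = p(x, y) / p(x) = p(x | y) p(y) / p(x),   and the prediction is y* = argmax_y p(x, y)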
Discriminative and Generative Models
Some popular discriminative algorithms are:
• k-nearest neighbors (k-NN)
• Logistic regression
• Support Vector Machines
• Decision Trees
• Random Forest
• Artificial Neural Networks (ANNs)
Some popular generative algorithms are:
• Naive Bayes Classifier
• Generative Adversarial Networks
• Gaussian Mixture Model
• Hidden Markov Model
• Probabilistic context-free grammar
Bayesian Classification
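The core rule (standard Bayes classifier, stated here for reference): assign x to the class with the largest posterior probability,
P(cj | x) = P(x | cj) P(cj) / P(x)
h(x) = argmax_{cj} P(cj | x) = argmax_{cj} P(x | cj) P(cj)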
Bayesian Classification and Expected Risk
• Suppose h(x) = cj, then:
• It means:
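A standard way to spell this out (assumed here, using 0-1 loss): the conditional risk of predicting cj at x is
R(cj | x) = Σ_k L(ck, cj) P(ck | x) = Σ_{k≠j} P(ck | x) = 1 − P(cj | x)
so the expected risk is minimized by choosing h(x) = argmax_{cj} P(cj | x), i.e. the Bayes classifier above.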
Maximum Likelihood Estimation
• We are given a data set D = {x1,x2,...,xN}
• Suppose that the given examples are drawn independently from a probability
distribution with parameter θ
• We need to estimate the θ that maximizes p(D | θ):
θ = argmax p(x1, x2, …, xN | θ)
• p(D | θ) is the likelihood of D
• Since the examples are independent:
θ = argmax ∏ p(xi | θ)
Maximum Likelihood Estimation
θ = argmax ∏ p(xi|θ)
• To make the calculation more convenient, we can maximize the log-
likelihood instead (the logarithm is monotonic, so the maximizer does not change):
θ = argmax ∑ log(p(xi | θ))
Example
Suppose 5 students take a test, with scores of 3, 6, 5, 9, 8 respectively. To model
the scores of these students, we assume that the data points are independently
distributed according to a Gaussian distribution:
p(x | μ, σ²) = 1/√(2πσ²) · exp(−(x − μ)² / (2σ²))
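A minimal sketch of the computation in Python (illustrative code, not taken from the original slides):
import math

scores = [3, 6, 5, 9, 8]
N = len(scores)

# MLE of the Gaussian mean: mu = (1/N) * sum(x_i)
mu = sum(scores) / N

# MLE of the Gaussian variance: sigma2 = (1/N) * sum((x_i - mu)^2)
sigma2 = sum((x - mu) ** 2 for x in scores) / N

# Log-likelihood of the data under the fitted Gaussian
log_lik = sum(-0.5 * math.log(2 * math.pi * sigma2) - (x - mu) ** 2 / (2 * sigma2)
              for x in scores)

print(mu, sigma2, log_lik)  # mu = 6.2, sigma2 = 4.56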
Naive Bayesian Classification
1. Model
2. Parameter Estimation with Different Distributions of Data
NB Classification
• Bayesian classification
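The model (standard form, stated here for reference): Naive Bayes applies the Bayes classifier with the extra assumption that the features x = (x1, …, xd) are conditionally independent given the class:
P(x | cj) = ∏_{i=1..d} P(xi | cj)
h(x) = argmax_{cj} P(cj) ∏_{i=1..d} P(xi | cj)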
NB Classification
• How to estimate the model’s parameters:
The task now is to calculate/estimate the probabilities P(cj) and P(xi | cj),
where P(cj) is the prior probability of a class cj, and P(xi | cj) is the probability
of a value xi of the i-th feature conditioned on class cj.
These probabilities are estimated based on the assumed probability distribution
of the data.
NB Classification
• What are the parameters of the NB Model?
• We have:
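The parameters (standard parameterization, assumed here) are the class priors and the per-feature class-conditional distributions:
θ = { P(cj) for every class cj } ∪ { P(xi | cj) for every feature i and class cj }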
NB Classification
• Parameter Estimation
• Then:
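For discrete (categorical) features, the maximum likelihood estimates are relative counts over the training set; a Laplace-smoothed version (a standard choice, assumed here) avoids zero probabilities:
P(cj) = N(cj) / N
P(xi = v | cj) = N(xi = v, cj) / N(cj)
Smoothed:  P(xi = v | cj) = (N(xi = v, cj) + 1) / (N(cj) + Vi),  where Vi is the number of possible values of feature i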
Multinomial NB
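In the multinomial event model (standard form, as described in the scikit-learn reference at the end), a sample is a vector of feature counts x = (x1, …, xn), e.g. word counts of a document, and
P(x | cj) ∝ ∏_{i=1..n} θji^{xi},   with smoothed estimates   θji = (Nji + α) / (Nj + α·n)
where Nji is the total count of feature i in the training samples of class cj, Nj = Σ_i Nji, and α ≥ 0 (α = 1 is Laplace smoothing).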
Gaussian NB
When working with continuous data, an assumption often made is that the
continuous values associated with each class are distributed according to a normal
(or Gaussian) distribution. The likelihood of the features is then assumed to be
P(xi | cj) = 1/√(2π·σji²) · exp(−(xi − μji)² / (2·σji²)),
where μji and σji² are the mean and variance of feature i within class cj.
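A minimal sketch with scikit-learn's GaussianNB (illustrative toy data, not from the slides):
import numpy as np
from sklearn.naive_bayes import GaussianNB

# Two continuous features, two classes
X = np.array([[1.0, 2.1], [1.2, 1.9], [0.9, 2.2],
              [3.0, 0.5], [3.2, 0.7], [2.8, 0.4]])
y = np.array([0, 0, 0, 1, 1, 1])

model = GaussianNB()   # fits one Gaussian per feature and per class
model.fit(X, y)

print(model.predict([[1.1, 2.0]]))        # -> [0]
print(model.predict_proba([[3.1, 0.6]]))  # posterior class probabilities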
Other NB Classifiers
• Complement Naive Bayes
• Bernoulli Naive Bayes
• Categorical Naive Bayes
Reference:
• https://scikit-learn.org/stable/modules/naive_bayes.html
Practice
• https://www.kaggle.com/code/prashant111/naive-bayes-classifier-in-python