Introduction To Machine Learning

Machine learning gives computers the ability to learn without being explicitly programmed by analyzing large amounts of data. There are different types of machine learning including supervised learning, unsupervised learning, semi-supervised learning, and reinforcement learning. Supervised learning involves using labeled training data to learn a function that maps inputs to outputs, while unsupervised learning discovers hidden patterns in unlabeled data. The machine learning process involves choosing data, an algorithm, and a performance measure to build models that can make predictions or decisions.

Uploaded by

feathh1

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views

Introduction To Machine Learning

Uploaded by

feathh1

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 27

Introduction to Machine Learning

What is learning?
The ability to improve one’s behaviour with experience.
How machine learning is different from traditional programming?
Data Program Data Output

Computer Computer

Output Program
Traditional Programming Machine Learning
Definitions
Arthur Samuel (1959):

Machine Learning is a field of study that gives computers the ability to learn
without being explicitly programmed.

Tom Mitchell (1998):

Well-posed Learning Problem: A computer program is said to learn from

experience E with respect to some task T and some performance measure P, if its
performance on T, as measured by P, improves with experience E.
What ML does?
Machine Learning explores algorithms that learn from data or build models from
data that perform some tasks.

The tasks can be:

- Making predictions
- Classifications
- Clustering
- Decision Making
- Solving tasks/problems
Machine Learning Process
Machine Learning Process
1. Choose the training experience/data.
2. Choose the target function (that is to be learnt).
3. Choose the target class of the function.
4. Choose a learning algorithm to infer the target function.
Types of Machine Learning
Type of Machine Learning
● Supervised Learning
● Unsupervised Learning
● Semi-supervised Learning
● Reinforcement Learning
Supervised Learning
● Supervised learning involves learning from a training set of labelled data.
● Every point in the training is an input-output pair, where the input maps to an
output.
● The learning problem consists of inferring the function that maps between the
input and the output, such that the learned function can be used to predict the
output from future input.
● It is called “supervised” because of the presence of the outcome variable to
guide the learning process.
● Supervised learning problems are further categorised into Regression and
Classification problems.
Supervised Learning
Applications of Supervised Learning
● Prediction
○ Stock prices
○ House prices
○ Weather Forecasting
○ Sales Volume Forecasting
● Classification
○ Disease identification
○ Sentiment Analysis
○ Spam Mail Detection
○ Handwritten Digit Recognition
Supervised Learning Algorithms
● Linear Regression
● Logistic Regression
● Naive Bayes Classifier
● Decision Trees
● Neural Networks
● Support Vector Machines
● K-nearest Neighbour
Unsupervised Learning
● It is a type of machine learning in which the algorithm is not provided with any
pre-assigned labels or scores for the training data.
● Unsupervised learning algorithms must first self-discover any naturally
occurring patterns in that training data set.
● Common examples include clustering, and principal component analysis,
● We observe only the features and have no measurements of the outcome.
● The task of learner is to describe how the data are organized or clustered.
Advantages and Disadvantages of Unsupervised Learning
● A minimal workload to prepare and audit the training set.
● Greater freedom to identify and exploit previously undetected patterns that
may not have been noticed by the "experts".
● The cost of unsupervised techniques requiring a greater amount of training
data and converging more slowly to acceptable performance.
● Increased computational and storage requirements during the exploratory
process,
● Potentially greater susceptibility to artifacts or anomalies in the training data
that might be obviously irrelevant or recognized as erroneous by a human, but
are assigned undue importance by the unsupervised learning algorithm.
Applications of Unsupervised Learning
● Clustering
○ Customer Segmentation
○ Grouping products in a supermarket
● Visualization
● Dimensionality reduction
● Finding association rules
○ Customer that buy item X will buy item Y too.
● Anomaly detection
○ Fraudulent card transaction
○ Malware detection
○ Identification of human errors during data entry
Unsupervised Learning Algorithms
● K-Means Clustering
● Expectation Maximization
● Principal Component Analysis
● Hierarchical Clustering
Basic Terminology
● The inputs are often called the predictors or more classically the
independent variables.

In the pattern recognition literature the term features is preferred.

● The outputs are called the responses, or classically the dependent

variables.
● Such type of problems (learnings) is called inductive learning problems
because we identify a function by inducting on data.
Types of Variables
● Quantitative Variables
○ Variables whose values exist on a continuous scale
○ Examples: temperature, salary, pressure, sales, price etc.
● Qualitative Variables
○ Variables that have values from a discrete set of values.
○ Also referred as categorical or discrete variables.
○ Example: Spam/Not-spam, Malignant/Benign etc.
● Ordered Categorical Variables
○ There is an ordering between the values, but no metric notion is appropriate (the difference
between medium and small need not be the same as that between large and medium).
○ Example: Small, Medium and Large
Train/Test/Validation Set
Training Set: Set of examples/data that is used to train or build the model (find
parameters).

Testing Set: Set of examples/data that is used to estimate the model’s

performance i.e. how well the model fits the data

Validation Set: Set of data/examples used to tune the parameters of a classifier. It

is not required if no fine-tuning of hyperparameters is required.

Using validation and test sets will increase the generalizing capability of the model
on new unseen data.
Hypothesis Space and Inductive Bias
Hypothesis
● A hypothesis is a function that best describes the target in supervised
machine learning.
● Represented by h1, h2, h3 etc.

Machine learning
involves finding a
model (hypothesis)
that best explains
the training data.
Hypothesis Space
Hypothesis space is a set of valid hypothesis, i.e. all possible functions.

It is typically defined by a Hypothesis Language, possibly in conjunction with a

Language Bias.

Represented by symbol H.
Hypothesis Language

Decision Tree and Rule Set

Bayesian Network, Markov Network, Neural Network

Example: Hypothesis Space
Let’s take the features which are boolean i.e. X1, X2, X3, X4 are 4 features which
are boolean. Thus, X1 can take either 1(=T) or 0(=F). Similarly, X2,X3,X4 can take
either a 0 or 1 as shown below.

X1 X2 X3 X4 Output Class

1 0 1 0 POSITIVE

0 0 0 1 NEGATIVE

1 1 1 1 POSITIVE

Thus, there are 24 = 16 possible instances.

How Many Boolean Functions Are Possible?
The number of functions is the number of possible subsets of the 16 instances.
So, the possible number of subsets are (216).
This can be generalised to N boolean features. If there are N boolean features
then the number of possible instances is 2n and the number of possible functions
will be 2(2^N).
Thus, it can be inferred that the hypothesis space is gigantic as the number of
features increases and it is not possible to look at every hypothesis individually in
order to select the best hypothesis.
So, one puts restrictions in the Hypothesis Space to consider only specific
Hypothesis Space. These restrictions are also referred as Bias.
Inductive Bias
● The inductive bias (also known as learning bias) of a learning algorithm is the
set of assumptions that the learner uses to predict outputs.
● Bias is of two types, constraints and preferences.
○ Constraints or Restrictions limit the hypothesis space.
○ Preferences impose ordering on the hypothesis space.
● Examples:
1. Instead of considering all Boolean formulas, we are going to consider only
conjunctive Boolean formulas, this can be an example of Constraints.
2. Giving preference that out of all possible polynomials, I will prefer
polynomials of lower degree. This is an example of Preferences.

Glencoe McGraw-Hill-Introduction To Business, Student Edition-Glencoe - McGraw-Hill (2008) PDF
90% (29)
Glencoe McGraw-Hill-Introduction To Business, Student Edition-Glencoe - McGraw-Hill (2008) PDF
768 pages
k 科普类思维导图102张
No ratings yet
k 科普类思维导图102张
102 pages
Machine Learning Interview Questions
From Everand
Machine Learning Interview Questions
Tech Interviews
4.5/5 (2)
B29 MIHS Hematology Logbook
No ratings yet
B29 MIHS Hematology Logbook
14 pages
SCSA3015 Deep Learning Unit 1 Notes PDF
No ratings yet
SCSA3015 Deep Learning Unit 1 Notes PDF
30 pages
Basics of Project Planning and Appraisal
100% (4)
Basics of Project Planning and Appraisal
37 pages
ML_Theory
No ratings yet
ML_Theory
10 pages
Unit 4 Machine Learning Tools, Techniques and Applications
No ratings yet
Unit 4 Machine Learning Tools, Techniques and Applications
78 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
10 pages
ML Basics
No ratings yet
ML Basics
3 pages
Unit 4
No ratings yet
Unit 4
23 pages
Dsai
No ratings yet
Dsai
22 pages
Sec 1630
No ratings yet
Sec 1630
145 pages
Notes
No ratings yet
Notes
125 pages
AIML
No ratings yet
AIML
30 pages
Artificial Intelligence Chapter 18 (Updated)
No ratings yet
Artificial Intelligence Chapter 18 (Updated)
19 pages
Ann Unit 2
No ratings yet
Ann Unit 2
21 pages
Introduction To ML
100% (1)
Introduction To ML
39 pages
Notes on Machine_learning
No ratings yet
Notes on Machine_learning
88 pages
2 ML
No ratings yet
2 ML
9 pages
ML_Unit-2
No ratings yet
ML_Unit-2
23 pages
Machine Learning Notes
100% (1)
Machine Learning Notes
8 pages
AIML - Unit 4 Notes
No ratings yet
AIML - Unit 4 Notes
23 pages
ML Revision
No ratings yet
ML Revision
37 pages
Ai Unit-4 ML
No ratings yet
Ai Unit-4 ML
4 pages
Artificial Intelligence and Machine Learning
No ratings yet
Artificial Intelligence and Machine Learning
12 pages
Machine Learning
No ratings yet
Machine Learning
51 pages
ML Sit1305
No ratings yet
ML Sit1305
127 pages
Presentation on ML - Copy
No ratings yet
Presentation on ML - Copy
469 pages
BDAunit5
No ratings yet
BDAunit5
26 pages
THEORY FILE - Machine Learning (6th Sem)!!
No ratings yet
THEORY FILE - Machine Learning (6th Sem)!!
26 pages
Supervised Machine Learning
No ratings yet
Supervised Machine Learning
8 pages
5 - Model For Predictions - ML
No ratings yet
5 - Model For Predictions - ML
52 pages
Module 3 (1)
No ratings yet
Module 3 (1)
63 pages
ML_Unit_1 (1)
No ratings yet
ML_Unit_1 (1)
124 pages
ML QUESTION BANK ESE (1)
No ratings yet
ML QUESTION BANK ESE (1)
37 pages
BDA Unit-5
No ratings yet
BDA Unit-5
26 pages
Unit 2
No ratings yet
Unit 2
63 pages
Chapter III - Supervised and Unsupervised Algorithms
No ratings yet
Chapter III - Supervised and Unsupervised Algorithms
122 pages
ML notes
No ratings yet
ML notes
49 pages
Supervised learning
No ratings yet
Supervised learning
19 pages
CSL0777 L05dfd
No ratings yet
CSL0777 L05dfd
26 pages
M2_AI_Chap1_neural-network
No ratings yet
M2_AI_Chap1_neural-network
60 pages
05-1 Supervised Learning
No ratings yet
05-1 Supervised Learning
65 pages
Machine Learning L1
No ratings yet
Machine Learning L1
34 pages
Ai Unit5 Learning
No ratings yet
Ai Unit5 Learning
62 pages
Lecture 1
No ratings yet
Lecture 1
30 pages
Deep Learning Midsem Merged Previous Batch
No ratings yet
Deep Learning Midsem Merged Previous Batch
423 pages
ML Type
No ratings yet
ML Type
13 pages
(Pec Cs701e)
No ratings yet
(Pec Cs701e)
4 pages
Notes Artificial Intelligence Unit 5
No ratings yet
Notes Artificial Intelligence Unit 5
11 pages
ML Viva Q&A
No ratings yet
ML Viva Q&A
17 pages
1 ML M1503-Introduction - ABP
No ratings yet
1 ML M1503-Introduction - ABP
14 pages
What Are The Types of Machine Learning?
100% (1)
What Are The Types of Machine Learning?
24 pages
5 Le
No ratings yet
5 Le
36 pages
Module 1 PPT
No ratings yet
Module 1 PPT
122 pages
Week 02
No ratings yet
Week 02
9 pages
Practical Issues
No ratings yet
Practical Issues
30 pages
Task The Problems That Can Be Solved With Machine Learning
No ratings yet
Task The Problems That Can Be Solved With Machine Learning
9 pages
Unit1 Class4 ML Models
No ratings yet
Unit1 Class4 ML Models
50 pages
unit 4
No ratings yet
unit 4
20 pages
Unit 4
No ratings yet
Unit 4
72 pages
Machine Learning - A Comprehensive, Step-by-Step Guide to Learning and Applying Advanced Concepts and Techniques in Machine Learning: 3
From Everand
Machine Learning - A Comprehensive, Step-by-Step Guide to Learning and Applying Advanced Concepts and Techniques in Machine Learning: 3
Peter Bradley
No ratings yet
Next Level Deep Machine Learning: Complete Tips and Tricks to Deep Machine Learning
From Everand
Next Level Deep Machine Learning: Complete Tips and Tricks to Deep Machine Learning
Joe Grant
No ratings yet
Use of Pari Materia As An External Aids
No ratings yet
Use of Pari Materia As An External Aids
17 pages
Letter For Daigon 2.0
No ratings yet
Letter For Daigon 2.0
1 page
Synopsis For Architectural Thesis in Fashionarium: Bachelor of Architecture
No ratings yet
Synopsis For Architectural Thesis in Fashionarium: Bachelor of Architecture
7 pages
FIDE-TRG - Chess Steps A - Book
No ratings yet
FIDE-TRG - Chess Steps A - Book
144 pages
Generalized Association Rule Mining Algorithms Based On Data Cube
No ratings yet
Generalized Association Rule Mining Algorithms Based On Data Cube
6 pages
Sample Bai GTVH BPD 2
No ratings yet
Sample Bai GTVH BPD 2
15 pages
En Christmas Desserts Recipe Book
100% (1)
En Christmas Desserts Recipe Book
30 pages
8111 - Family Law 1 Test Questions
No ratings yet
8111 - Family Law 1 Test Questions
2 pages
Indo-Iran Relations PDF
No ratings yet
Indo-Iran Relations PDF
203 pages
9 Grammar Worksheet
No ratings yet
9 Grammar Worksheet
4 pages
General Conformable Fractional Derivative and Its Physical Interpretation
No ratings yet
General Conformable Fractional Derivative and Its Physical Interpretation
16 pages
J of Family Theo Revie - 2018 - Masten - Resilience Theory and Research On Children and Families Past Present and
No ratings yet
J of Family Theo Revie - 2018 - Masten - Resilience Theory and Research On Children and Families Past Present and
20 pages
History of Medicine Timeline
No ratings yet
History of Medicine Timeline
6 pages
Inflation and Deflation
No ratings yet
Inflation and Deflation
7 pages
MICROBIAL GENETICS Questions and Answers PDF
100% (1)
MICROBIAL GENETICS Questions and Answers PDF
4 pages
Contract. 2
No ratings yet
Contract. 2
59 pages
de Xuat Olympic 8
No ratings yet
de Xuat Olympic 8
9 pages
Text
No ratings yet
Text
6 pages
Soc 88 Comse
No ratings yet
Soc 88 Comse
3 pages
Immediate download eTextbook 978-0134527338 Network Security Essentials: Applications and Standards (6th Edition) ebooks 2024
No ratings yet
Immediate download eTextbook 978-0134527338 Network Security Essentials: Applications and Standards (6th Edition) ebooks 2024
41 pages
Lecture 6 Brand Management
No ratings yet
Lecture 6 Brand Management
14 pages
CRPC - Full - Material - by - Letslearnthelaw - and - Lawnotes - in Full PDF
No ratings yet
CRPC - Full - Material - by - Letslearnthelaw - and - Lawnotes - in Full PDF
252 pages
Unit Outline
No ratings yet
Unit Outline
8 pages
Torts and Damages - Negligence
No ratings yet
Torts and Damages - Negligence
2 pages
To Err Is Human 1999 Report Brief PDF
100% (3)
To Err Is Human 1999 Report Brief PDF
8 pages
Berpikir Positif Untuk Mengurangi Stress
No ratings yet
Berpikir Positif Untuk Mengurangi Stress
22 pages