Chapter 1
Similarly, the form of the output or response variable can in principle be anything, but most methods assume that yi is a categorical or nominal variable from some finite set, yi ∈ {1,...,C} (such as male or female), or that yi is a real-valued scalar (such as income level).
When yi is categorical, the problem is known as classification or pattern recognition, and
when yi is real-valued, the problem is known as regression. Another variant, known as
ordinal regression, occurs where the label space Y has some natural ordering, such as grades A–F.
The second main type of machine learning is the descriptive or unsupervised learning
approach. Here we are only given inputs, D = {xi}Ni=1, and the goal is to find “interesting
patterns” in the data. This is sometimes called knowledge discovery. This is a much less
well-defined problem, since we are not told what kinds of patterns to look for, and there is
no obvious error metric to use (unlike supervised learning, where we can compare our
prediction of y for a given x to the observed value).
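Clustering is a canonical example of such pattern discovery: the algorithm must invent the groups itself. As an illustrative sketch (the data, the choice of K = 2, and all names here are made up for the example), a minimal k-means loop in NumPy:

```python
import numpy as np

def kmeans(X, K, n_iters=20, seed=0):
    """Minimal k-means sketch: alternate between assigning each point
    to its nearest centroid and recomputing centroids as cluster means."""
    rng = np.random.default_rng(seed)
    # initialize centroids as K distinct randomly chosen data points
    centroids = X[rng.choice(len(X), K, replace=False)]
    for _ in range(n_iters):
        # distance from every point to every centroid (N x K)
        d = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = d.argmin(axis=1)
        for k in range(K):
            if np.any(labels == k):
                centroids[k] = X[labels == k].mean(axis=0)
    return labels, centroids

# two well-separated synthetic blobs, no labels given to the algorithm
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0, 0.3, (20, 2)), rng.normal(3, 0.3, (20, 2))])
labels, centroids = kmeans(X, K=2)
```

Note that the returned cluster indices are arbitrary: without labels there is no "right" numbering, which is exactly the sense in which the problem is less well defined.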
There is a third type of machine learning, known as reinforcement learning, which is
somewhat less commonly used. This is useful for learning how to act or behave when given
occasional reward or punishment signals. (For example, consider how a baby learns to
walk.)
A further variant, semi-supervised learning, lies between supervised and unsupervised learning: it uses a combination of labelled and unlabelled data during training, occupying the middle ground between supervised learning (with fully labelled training data) and unsupervised learning (with no labelled training data).
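One common semi-supervised strategy is self-training: fit a model on the labelled points, pseudo-label the unlabelled points, and refit. A toy sketch using a nearest-centroid classifier (the data and the convention that -1 marks an unlabelled point are invented for illustration):

```python
import numpy as np

def self_train(X, y, n_rounds=5):
    """Self-training sketch: y uses -1 for unlabelled points.
    Each round, fit class centroids on the currently labelled points,
    then pseudo-label unlabelled points with the nearest centroid's class."""
    y = y.copy()
    for _ in range(n_rounds):
        classes = np.unique(y[y >= 0])
        centroids = np.array([X[y == c].mean(axis=0) for c in classes])
        d = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        y[y < 0] = classes[d.argmin(axis=1)][y < 0]
    return y

# two labelled points per class, the last two points unlabelled (synthetic)
X = np.array([[0.0, 0.0], [0.2, 0.0], [3.0, 3.0], [3.2, 3.0],
              [0.1, 0.1], [2.9, 3.1]])
y = np.array([0, 0, 1, 1, -1, -1])
y_full = self_train(X, y)
```

Here the two unlabelled points inherit the labels of the clusters they fall in, so the labelled data defines the classes while the unlabelled data helps place them.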
Figure 1.1: Left: Some labeled training examples of colored shapes, along with 3 unlabeled test cases. Right: Representing the training data as an N × D design matrix. Row i represents the feature vector xi. The last column is the label, yi ∈ {0, 1}.
Classification
In this section, we discuss classification. Here the goal is to learn a mapping from inputs x to outputs y, where y ∈ {1,...,C}, with C being the number of classes. If C = 2, this is called binary classification (in which case we often assume y ∈ {0, 1}); if C > 2, this is called
multiclass classification. If the class labels are not mutually exclusive (e.g., somebody may
be classified as tall and strong), we call it multi-label classification, but this is best viewed as
predicting multiple related binary class labels (a so-called multiple output model). When we
use the term “classification”, we will mean multiclass classification with a single output,
unless we state otherwise.
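The distinction between these variants matters for how labels are encoded. A multiclass label is a single integer, while a multi-label target is a binary vector with one bit per non-exclusive attribute. A small sketch (the attribute names "tall" and "strong" echo the example above; the data itself is invented):

```python
import numpy as np

# multiclass: exactly one of C = 3 mutually exclusive classes per example
y_multiclass = np.array([0, 2, 1, 0])

# multi-label: each example may carry several of the labels
# ["tall", "strong"] at once, stored as one binary column per label
y_multilabel = np.array([[1, 0],   # tall, not strong
                         [1, 1],   # tall and strong
                         [0, 0]])  # neither

# a multi-label problem is just several binary problems sharing the inputs
n_binary_problems = y_multilabel.shape[1]
```

Treating each column as its own binary classifier is exactly the "multiple output model" view mentioned above.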
One way to formalize the problem is as function approximation. We assume y = f(x) for
some unknown function f, and the goal of learning is to estimate the function f given a
labeled training set, and then to make predictions using ŷ = f̂(x). (We use the hat symbol to
denote an estimate.) Our main goal is to make predictions on novel inputs, meaning ones
that we have not seen before (this is called generalization), since predicting the response on
the training set is easy (we can just look up the answer).
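The point about generalization can be made concrete: a 1-nearest-neighbour classifier predicts the training labels perfectly by construction (each training point is its own nearest neighbour, i.e., it "looks up the answer"), so only held-out accuracy is informative. A sketch on synthetic data:

```python
import numpy as np

def predict_1nn(X_train, y_train, X_query):
    """1-nearest-neighbour: copy the label of the closest training point."""
    d = np.linalg.norm(X_query[:, None, :] - X_train[None, :, :], axis=2)
    return y_train[d.argmin(axis=1)]

rng = np.random.default_rng(0)
# two well-separated Gaussian classes (synthetic data for illustration)
X = np.vstack([rng.normal(0, 1, (50, 2)), rng.normal(4, 1, (50, 2))])
y = np.array([0] * 50 + [1] * 50)

# hold out every 5th point as a test set of novel inputs
test = np.arange(100) % 5 == 0
X_tr, y_tr, X_te, y_te = X[~test], y[~test], X[test], y[test]

train_acc = (predict_1nn(X_tr, y_tr, X_tr) == y_tr).mean()  # trivially 1.0
test_acc = (predict_1nn(X_tr, y_tr, X_te) == y_te).mean()
```

Training accuracy is always 1.0 here regardless of how good the model is; test accuracy is the quantity that measures generalization.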
Example: As a simple toy example of classification, consider the problem illustrated in
Figure 1.1(a). We have two classes of object which correspond to labels 0 and 1. The inputs
are colored shapes. These have been described by a set of D features or attributes, which
are stored in an N × D design matrix X, shown in Figure 1.1(b). The input features x can be
discrete, continuous or a combination of the two. In addition to the inputs, we have a vector
of training labels y. In Figure 1.1, the test cases are a blue crescent, a yellow circle and a blue
arrow. None of these have been seen before. Thus we are required to generalize beyond
the training set. A reasonable guess is that the blue crescent should be y = 1, since all blue
shapes are labeled 1 in the training set. The yellow circle is harder to classify, since some
yellow things are labeled y = 1 and some are labeled y = 0, and some circles are labeled y = 1
and some y = 0. Consequently it is not clear what the right label should be in the case of the
yellow circle. Similarly, the correct label for the blue arrow is unclear.
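The reasoning above can be mirrored in code. The specific feature encoding below is invented for illustration (colour and shape mapped to small integers), but it shows how discrete attributes become rows of an N × D design matrix, and how a simple colour-majority rule reproduces the guess for the blue test case:

```python
import numpy as np

# invented encoding: column 0 is colour (0=blue, 1=yellow),
# column 1 is shape (0=circle, 1=square, 2=crescent)
X = np.array([[0, 0],   # blue circle
              [0, 1],   # blue square
              [1, 0],   # yellow circle
              [1, 1]])  # yellow square
y = np.array([1, 1, 0, 1])  # all blue shapes labelled 1; yellows mixed

# a test case never seen in training: a blue crescent
x_test = np.array([0, 2])

# predict the majority label among training points with the same colour
same_colour = X[:, 0] == x_test[0]
y_hat = int(np.round(y[same_colour].mean()))
```

Because every blue training shape is labelled 1, the colour-majority rule predicts y = 1 for the blue crescent, matching the informal argument in the text; for the yellow circle the same rule would be far less confident, since the yellow labels are mixed.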
Regression
Regression is just like classification, except the response variable is continuous. As a simple example, consider fitting two models to some 1d data: a straight line and a quadratic function.
Figure: (a) Linear regression on some 1d data. (b) Same data with polynomial regression (degree 2).
Here are some examples of real-world regression problems:
- Predict tomorrow’s stock market price given current market conditions and other possible side information.
- Predict the age of a viewer watching a given video on YouTube.
- Predict the location in 3d space of a robot arm end effector, given control signals (torques) sent to its various motors.
- Predict the temperature at any location inside a building using weather data, time, door sensors, etc.
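Fitting the two models mentioned above, a straight line and a quadratic, can be sketched with np.polyfit on synthetic 1d data (the data-generating function here is made up for the example):

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(-1, 1, 30)
y = 1.5 * x**2 + 0.1 * rng.normal(size=x.shape)  # quadratic signal + noise

# least-squares fits: degree-1 (straight line) and degree-2 (quadratic)
line = np.polyfit(x, y, deg=1)
quad = np.polyfit(x, y, deg=2)

# residual sum of squares for each model on the training data
rss_line = np.sum((np.polyval(line, x) - y) ** 2)
rss_quad = np.sum((np.polyval(quad, x) - y) ** 2)
```

On data with genuine curvature the quadratic leaves a much smaller residual than the line, which is the contrast the two-panel figure is illustrating.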