0% found this document useful (0 votes)
18 views70 pages

ML Unit-1

Uploaded by

arjunkarna2898
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
18 views70 pages

ML Unit-1

Uploaded by

arjunkarna2898
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 70

MACHINE LEARNING

Machine learning is a branch of artificial


intelligence(AI) that enables computers to “self
learn” from training data and improve overtime
without explicitly programmed.
Machine learning algorithms are able to detect
patterns in data and learn from them, in order to make
their own predictions.
In short, ML algorithms and models learn through
experience.
Supervised learning:
Supervised learning is a category of machine learning that uses labeled
datasets to train algorithms to predict outcomes and recognize patterns.
Unlike unsupervised learning, supervised learning algorithms are given
labeled training to learn the relationship between the input and the outputs.
Unsupervised learning:
Unsupervised learning in artificial intelligence is a type of machine
learning that learns from data without human supervision. Unlike supervised
learning, unsupervised machine learning models are given unlabeled data
and allowed to discover patterns and insights without any explicit guidance
or instruction.
Reinforcement learning:
Reinforcement Learning (RL) is the science of decision making. It is
about learning the optimal behavior in an environment to obtain maximum
reward.
It's about taking the best possible action or path to gain maximum
rewards and minimum punishment through observations in a specific
situation. It acts as a signal to positive and negative behaviors.

Applications of Machine Learning
Machine learning jobs and skills
Jobs:
Machine learning Engineer
Data Engineer
Machine learning scientist
Required skills
Python (or) R
Probability and Statistics
Maths
Companies using Machine Learning

Google, Salesforce, Amazon, IBM, Pinterest,


Netflix, face book
Features of ML:
ML uses data to detect various patterns in a given
dataset.
It can learn from past data and improve
automatically.
It’s a data driven technology.
Machine learning is much similar to data mining as
it also deals with the huge amount of the data.
Well posed learning Problem
Well Posed Learning Problem – A computer program is said to learn from
experience E in context to some task T and some performance measure P, if
its performance on T, as was measured by P, upgrades with experience E.
• Any problem can be segregated as well-posed learning problem if it has
three traits –
– Task
– Performance Measure
– Experience
• Certain examples that efficiently defines the well-posed learning
problem are –
1. To better filter emails as spam or not
T -> Identifying whether or not an email is spam.
P -> The percentage of emails correctly categorized as spam or nonspam.
E -> Observing how you categorize emails as spam or nonspam.
2. A checkers learning problem
T -> Playing checkers game
P -> Percent of games won against opponent
E -> Playing practice games against itself
3. Handwriting Recognition Problem
T -> Recognizing and classifying handwritten words from images.
P -> Percentage of correctly identified words.
E-> Set of handwritten words with their classifications in a database.
4. A Robot Driving Problem
T -> To drive on public four-lane highways using sight scanners.
P -> The average distance progressed before an error.
E -> The order of images and steering instructions noted down
while observing a human driver.
5. Face Recognition Problem
– Task – predicting different types of faces
– Performance Measure – able to predict maximum types of
faces
– Experience – training machine with maximum amount of
datasets of different face images
6. Automatic Translation of documents
– Task – translating one type of language used in a document
to other language
– Performance Measure – able to convert one language to
other efficiently
– Experience – training machine with a large dataset of
different types of languages
Designing a learning system
• According to Arthur Samuel
– “Machine Learning enables a Machine to Automatically learn from
Data, Improve performance from an Experience and predict things
without explicitly programmed.”
• In Simple Words,
– When we fed the Training Data to Machine Learning Algorithm, this
algorithm will produce a mathematical model and with the help of the
mathematical model, the machine will make a prediction and take a
decision without being explicitly programmed.
– Also, during training data, the more machine will work with it the
more it will get experience and the more efficient result is produced.
Why a designing a learning system is important?
For a perfect & efficient system, it is important to design the
learning system. If not expected output will not come & the system
without function properly.
Designing a learning system:
There are 5 basic steps to be followed while designing a
learning system.
1) Choosing the Training Experience
2) Choosing the Target Function
3) Choose a Representation for the Target Function
4) Choosing a Function Approximation Algorithm
5) The Final Design
Step1:Choosing a training experience
– The first important task is to choose the training
data or training experience which will be fed to the
machine learning algorithm.
– It will impact on the success or failure of the
model(machine)
– So training data(or) experience should be chosen
wisely.
– Below are the attributes which will impact on
success and failure of data.
• Type of feedback
• Degree
• Distribution of examples.
1) Type of feedback: the training experience will be able to
provide direct (or) indirect feedback regarding choices.
For example: while playing chess, the training data will
provide feedback to itself like “Instead of this move, if this is
chosen, the chances of success increases ”
2) Degree: It is the degree capability to which the learner will
control the sequence of training.

ex: driving a car

with trainer help with trainers help completely


100% dependence partially 50% on your own 100%
similarly, when training data is fed to the machine then at
that time accuracy is very less -but when it gains experience
while doing task again and again will get feedback and
control the task accordingly.
3) Distribution of examples to increase performance:
How well it represent the distribution of examples over
which the performance of final system is measured.
More possible combinations
More situations, more examples
Learning over distributed range of examples.
Ex: Learning driving
A machine learning algorithm will get experience which
going through a no.of different cases and different examples.
Thus, ML algorithm will get more and more experience
by passing through more and more examples and hence its
performance will increases.
Step:2 Choosing the target function:
It means according to what ever knowledge we fed to the
algorithm, the machine will choose next action/move function
which will describe what type of legal moves should be taken.
Ex: Checkers game
While moving -diagonally, set of all possible moves is called
legal moves.
- Travel only in forward direction
- Only one move per chance
- Only in diagonal direction
- Jump over opponent
Target function => v(b)
Board State => b
Legal moves set => B
b is final board state that is won, then v(b) = 100
b is final board state that is lost, then v(b) = -100
b is final board state that is draw, then v(b) = 0
If b is not final state then v(b)= v(b’)
b’ ->best final state
Step:3 Choosing a representation for target function:
For any board state, we calculate function ‘c’ as linear
combination of following board features i.e. c(b).
Features:
x1- No. of black pieces on board
x2 - No .of red pieces on board
x3- No .of black kings on board
x4- No. of red kings on board
x5- No. of black pieces threatened by red (blacks which can be
beaten by red)
x6- No. of red pieces threatened by black.
• V^(b)=w0+w1x1+ w2x2 + w3x3 + w4x4 + w5x5 + w6x6
(representation of c(b))
(w0-w6)= numerical coefficients (or) weights of each feature.
determine the importance / weightage of
features
w0 – additive constant.

Step:4 Choosing a Function Approximation Algorithm:


In this step, we choose a learning algorithm that can
approximate the target function chosen. This step further
consists of two sub-steps,
a. Estimating the training value,
b. Adjusting the weights.
Step:5 Final Design:
The final design of checkers learning system is described naturally by four
different program modules which indicate the central components in most
of the learning systems. The four modules are depicted as follows.
Perspectives and Issues in ML
PERSPECTIVES:
• Machine learning is a subfield of artificial intelligence and machine
learning algorithms are used in other related fields like natural
language processing and computer vision.
• In general, there are three types of learning and these are supervised
learning, unsupervised learning, and reinforcement learning.
• Their names tell the main idea behind them actually.
• In supervised learning, your system learns under the supervision of
the data outputs so supervised algorithms are preferred if your
dataset contains output information.
• Let me give you an example in there.
• Let’s assume you have a medical statistic company and you have a
dataset which contains patients’ features like blood pressure, sugar
rate in their blood, heart rate per minute, etc.
ISSUES:
LACK OF QUALITY DATA
 One of the main issues in Machine Learning is the absence of good data.
 While, algorithms tend to make developers exhaust most of their time on artificial
intelligence.
FAULT IN CREDIT CARD FRAUD DETECTION
 Although this AI-driven software helps to successfully detect credit card fraud,
there are issues in Machine Learning that make the process redundant.
GETTING BAD RECOMMENDATIONS
 Proposal engines are quite regular today.
 While some might be dependable, others may not appear to provide the necessary
results.
TALENT DEFICIT
 Albeit numerous individuals are pulled into the ML business, however, there are
still not many experts who can take complete control of this innovation.
MAKING THE WRONG ASSUMPTIONS
 ML models can’t manage datasets containing missing data points.
 Thus, highlights that contain a huge part of missing data should be erased.
supervised vs. unsupervised learning
What is Supervised Learning?
• Supervised Learning is the machine learning approach defined by its use of
labeled datasets to train algorithms to classify data and predict outcomes.
• The labeled dataset has output tagged corresponding to input data for the
machine to understand what to search for in the unseen data.
Supervised Machine Learning Methods
• There are two main areas where supervised machine learning comes
in handy: classification problems and regression problems.
Classification
• Classification refers to taking an input value and mapping it to a
discrete value. In classification problems, our output typically
consists of classes or categories. This could be things like trying to
predict what objects are present in an image (a cat/ a dog) or
whether it is going to rain today or not.
Regression
• Regression is related to continuous data (value functions). In
Regression, the predicted output values are real numbers. It deals
with problems such as predicting the price of a house or the trend in
the stock price at a given time, etc.
Some of the most common algorithms in Supervised Learning include
Support Vector Machines (SVM), Logistic Regression, Naive Bayes,
Neural Networks, K-nearest neighbor (KNN), and Random Forest.
Supervised Machine Learning Applications
Now, let's have a look at some of the popular applications of Supervised
Learning:
Predictive analytics (house prices, stock exchange prices, etc.)
Text recognition
Spam detection
Customer sentiment analysis
Object detection (e.g. face detection)
What is Unsupervised Learning?
• Unsupervised Learning is a type of machine learning in which the algorithms
are provided with data that does not contain any labels or explicit instructions
on what to do with it. The goal is for the learning algorithm to find structure in
the input data on its own.
• To put it simply—Unsupervised Learning is a kind of self-learning where the
algorithm can find previously hidden patterns in the unlabeled datasets and give
the required output without any interference.
• Identifying these hidden patterns helps in clustering, association, and detection
of anomalies and errors in data.
Unsupervised Machine Learning Methods
• Unsupervised Learning models can perform more complex tasks than
Supervised Learning models, but they are also more unpredictable. Here are the
main tasks that utilize this approach.
Clustering
• Clustering is the type of Unsupervised Learning where we find hidden patterns
in the data based on their similarities or differences. These patterns can relate to
the shape, size, or color and are used to group data items or create clusters.
• There are several types of clustering algorithms, such as exclusive,
overlapping, hierarchical, and probabilistic.
Association
• Association is the kind of Unsupervised Learning where we can find the
relationship of one data item to another data item. We can then use those
dependencies and map them in a way that benefits us—e.g., understanding
consumers' habits regarding our products can help us develop better cross-
selling strategies.
• The association rule is used to find the probability of co-occurrence of
items in a collection. These techniques are often utilized in customer
behavior analysis in e-commerce websites and OTT platforms.
Dimensionality reduction
• As the name suggests, the algorithm works to reduce the dimensions of the
data. It is used for feature extraction.
• Extracting the important features from the dataset is an essential aspect of
machine learning algorithms. This helps reduce the number of random
variables in the dataset by filtering irrelevant features.
• Finally, here's a nice visual recap of everything we've covered so far (plus
the Reinforcement Learning).

You might also like