Machine Learning Approachs (AI)
Machine Learning Approachs (AI)
Approachs
By Abhishek kumar
Introduction to Machine Learning
In supervised learning, models are trained using labelled dataset, where the model learns about each
type of data. Once the training process is completed, the model is tested on the basis of test data (a
subset of the training set), and then it predicts the output.
Suppose we have a dataset of different types of shapes which includes square,
rectangle, triangle, and Polygon. Now the first step is that we need to train the model
for each shape.
• If the given shape has four sides, and all the sides are equal, then it will be labelled as a
Square.
• If the given shape has three sides, then it will be labelled as a triangle.
• If the given shape has six equal sides, then it will be labelled as hexagon.
Now, after training, we test our model using the test set, and the task of the model is to identify
the shape. The machine is already trained on all types of shapes, and when it finds a new shape, it
classifies the shape on the bases of a number of sides, and predicts the output.
2. Unsupervised Learning:
This means that human labour is not required to make the dataset machine-readable,
allowing much larger datasets to be worked on by the program.
In supervised learning, the labels allow the algorithm to find the exact nature of the
relationship between any two data points. However, unsupervised learning does not
have labels to work off of, resulting in the creation of hidden structures.
Relationships between data points are perceived by the algorithm in an abstract manner,
with no input required from human beings.
The creation of these hidden structures is what makes unsupervised learning algorithms
versatile.
Instead of a defined and set problem statement, unsupervised learning algorithms can
adapt to the data by dynamically changing hidden structures
.
This offers more post-deployment development than supervised learning algorithms.
Here, we have taken an unlabelled input data, which means it is not categorized and
corresponding outputs are also not given. Now, this unlabelled input data is fed to the machine
learning model in order to train it. Firstly, it will interpret the raw data to find the hidden
patterns from the data and then will apply suitable algorithms such as k-means clustering,
Decision tree, etc. Once it applies the suitable algorithm, the algorithm divides the data objects
into groups according to the similarities and difference between the objects.
3. Reinforcement Learning
Reinforcement Learning directly takes inspiration from how human beings learn from
data in their lives. It features an algorithm that improves upon itself and learns from
new situations using a trial-and-error method. Favourable outputs are encouraged or
‘reinforced’, and non-favourable outputs are discouraged or ‘punished’.
In every iteration of the algorithm, the output result is given to the interpreter, which
decides whether the outcome is favourable or not.
In case of the program finding the correct solution, the interpreter reinforces the solution
by providing a reward to the algorithm. If the outcome is not favourable, the algorithm
is forced to reiterate until it finds a better result. In most cases, the reward system is
directly tied to the effectiveness of the result.
In typical reinforcement learning use-cases, such as finding the shortest route between
two points on a map, the solution is not an absolute value. Instead, it takes on a score
of effectiveness, expressed in a percentage value. The higher this percentage value is,
the more reward is given to the algorithm.
Thus, the program is trained to give the best possible solution for the best possible
reward.
Types of Reinforcement learning
Positive Reinforcement
The positive reinforcement learning means adding something to increase the
tendency that expected behaviour would occur again. It impacts positively on the behaviour of the agent
and increases the strength of the behaviour. This type of reinforcement can sustain the changes for a long
time, but too much positive reinforcement may lead to an overload of states that can reduce the
consequences.
Negative Reinforcement:
The negative reinforcement learning is opposite to the positive reinforcement as
it increases the tendency that the specific behaviour will occur again by avoiding the negative
condition. It can be more effective than the positive reinforcement depending on situation and
behaviour, but it provides reinforcement only to meet minimum behaviour.
Reinforcement Learning Applications
Thank you