Concepts of Machine Learning
Concepts of Machine Learning
Table of Contents
Table of Figures.............................................................................................................................................2
Introduction:................................................................................................................................................3
Types of Machine Learning:...................................................................................................................3
Supervised Machine Learning................................................................................................................4
Unsupervised Machine Learning:..........................................................................................................4
Semi-supervised Machine Learning.......................................................................................................4
Reinforcement Learning.........................................................................................................................4
Machine Learning Algorithms:..................................................................................................................5
Artificial Neural Network.......................................................................................................................5
Random Forest.........................................................................................................................................6
Regression Trees:.....................................................................................................................................6
Applications of Machine Learning:............................................................................................................7
Predictive analytics and intelligent decision making:..........................................................................8
Cyber security and threat intelligence...................................................................................................8
Internet of Things (IoT) and smart cities..............................................................................................8
E-mail Spam and Malware Filtering:....................................................................................................8
E-Commerce and product recommendations:......................................................................................8
Natural language processing (NLP) and sentiment analysis...............................................................9
Product Recommendation:...................................................................................................................9
List of Figures
Figure 1: Machine Learning Process.......................................................................................................3
Figure 2 : Machine learning algorithm.................................................................................................7
Tables
Table 1 : Types of Machine Learning.......................................................................................................5
Table 2 : Comparison between Collaborative and Content –Based Filtering......................................9
1
Introduction:
ML is a field that focuses on the learning aspect of AI by developing algorithms that best
represent a set of data. In contrast to classical programming, in which an algorithm can be
explicitly coded using known features, ML uses subsets of data to generate an algorithm that
may use novel or different combinations of features and weights than can be derived from
first principles. In ML, there are four commonly used learning methods, each useful for
solving different tasks: supervised, unsupervised, semi supervised, and reinforcement
learning. To better understand these methods, they will be defined via an example of a
hypothetical real estate company that specializes in predicting housing prices and features
associated with those houses [1]
2
learning. In the following, we briefly discuss each type of learning technique with the
scope of their applicability to solve real-world problems. [2]
Reinforcement Learning:
Reinforcement learning is a type of machine learning algorithm that enables software
agents and machines to automatically evaluate the optimal behavior in a particular context
or environment to improve its efficiency. This type of learning is based on reward or
3
penalty, and its ultimate goal is to use insights obtained from environmental activists to
take action to increase the reward or minimize the risk. It is a powerful tool for training
AI models that can help increase automation or optimize the operational efficiency of
sophisticated systems such as robotics, autonomous driving tasks, manufacturing and
supply chain logistics, however, not preferable to use it for solving the basic or
straightforward problems. [3]
4
The most common approach to develop nonparametric and nonlinear
classification/regression is based on ANNs. There are many different types of ANNs.
However, it is not the scope of this paper to describe the different types of networks,
which can be found at the bibliography. This section provides a brief description of one
of the most used ANNs: the feed-forward propagation neural network [4]
Random Forest:
RF is a regression technique that combines the performance of numerous DT algorithms
to classify or predict the value of a variable avoid the correlation of the different trees, RF
increases the diversity of the trees by making them grow from different training data
subsets created through a procedure called bagging. Bagging is a technique used for
training data creation by resampling randomly the original dataset [4]
Regression Trees:
DTs, along with neural networks, are the most widely used machine learning algorithms
in geosciences. The main benefit of using a hierarchical tree structure to perform
classification decisions is that the tree structure is transparent, which in comparison with
artificial neural networks (ANNs), is easier to interpret. In order to induce the DT from a
dataset, an evaluation measure of each of the evidential features is used to maximize the
inter node heterogeneity. A DT represents a set of restrictions or conditions which are
hierarchically organized, and which are successively applied from a root to a terminal
node or leaf of the tree. The main benefit of using a hierarchical tree structure to perform
classification decisions is that the tree structure is transparent, which in comparison with
artificial neural networks (ANNs), is easier to interpret [4]
5
Figure 2 : Machine learning algorithm.
6
analytics. The basis of predictive analytics is capturing and exploiting relationships
between explanatory variables and predicted variables from previous events to predict the
unknown outcome
Cyber security and threat intelligence: Cyber security is one of the most
essential areas of Industry which is typically the practice of protecting networks, systems,
hardware, and data from digital attacks. For instance, clustering techniques can be used to
identify cyber-anomalies, policy violations, etc.
Internet of Things (IoT) and smart cities: Internet of Things (IoT) is another
essential area of Industry which turns everyday objects into smart objects by allowing
them to transmit data and automate tasks without the need for human interaction. IoT is,
therefore, considered to be the big frontier that can enhance almost all activities in our
lives, such as smart governance, smart home and education.
Product recommendation is one of the most well-known and widely used applications of
machine learning, and it is one of the most prominent features of almost any e-commerce
website today. Machine learning technology can assist businesses in analyzing their
consumers’ purchasing histories and making customized product suggestions.
Natural language processing (NLP) involves the reading and understanding of spoken or
written language through the medium of a computer. Thus, NLP helps computers, for
7
instance, to read a text, hear speech, interpret it, analyze sentiment, and decide which
aspects are significant, where machine learning techniques can be used.
References
[4] I. H.Sarker, "SPRINGER," FRI MARCH 2021. [Online]. Available: Sarker, I.H. Machine
Learning: Algorithms, Real-World Applications and Research Directions. SN COMPUT.
SCI. 2, 160 (2021). https://doi.org/10.1007/s42979-021-00592-x. [Accessed FRI MARCH
8
2021].