Unsupervised learning is a machine learning paradigm in which algorithms learn patterns from unlabeled data. Methods such as clustering, dimensionality reduction, and neural networks are used to discover hidden structure in the data. Common tasks also include generating new examples that plausibly come from the same distribution as the training data.

Uploaded by

Quinone Quentin


Unsupervised learning

From Wikipedia, the free encyclopedia
Unsupervised learning is a paradigm in machine learning where, in contrast to
supervised learning and semi-supervised learning, algorithms learn patterns
exclusively from unlabeled data.

Neural networks
Tasks vs. methods

Figure: Tendency for a task to employ supervised vs. unsupervised methods. Task
names deliberately straddle circle boundaries, showing that the classical division
of imaginative tasks (left) employing unsupervised methods is blurred in today's
learning schemes.
Neural network tasks are often categorized as discriminative (recognition) or
generative (imagination). Often but not always, discriminative tasks use supervised
methods and generative tasks use unsupervised methods (see Venn diagram); however,
the separation is very hazy. For example, object recognition favors supervised
learning, but unsupervised learning can also cluster objects into groups.
Furthermore, as the field progresses, some tasks employ both methods, and some tasks
swing from one to the other. For example, image recognition started off as heavily
supervised, but became hybrid by employing unsupervised pre-training, and then moved
towards supervision again with the advent of dropout, ReLU, and adaptive learning
rates.

Training
During the learning phase, an unsupervised network tries to mimic the data it's
given and uses the error in its mimicked output to correct itself (i.e. correct its
weights and biases). Sometimes the error is expressed as a low probability that the
erroneous output occurs, or it might be expressed as an unstable high energy state
in the network.
In contrast to supervised methods' dominant use of backpropagation, unsupervised
learning also employs other methods including: Hopfield learning rule, Boltzmann
learning rule, Contrastive Divergence, Wake Sleep, Variational Inference, Maximum
Likelihood, Maximum A Posteriori, Gibbs Sampling, and backpropagating
reconstruction errors or hidden state reparameterizations. See the table below for
more details.

Energy
An energy function is a macroscopic measure of a network's activation state. In
Boltzmann machines, it plays the role of the cost function. This analogy with
physics is inspired by Ludwig Boltzmann's analysis of a gas's macroscopic energy
from the microscopic probabilities of particle motion, $p \propto e^{-E/kT}$, where
$k$ is the Boltzmann constant and $T$ is temperature. In the RBM network the
relation is $p = e^{-E}/Z$,[1] where $p$ and $E$ vary over every possible activation
pattern and $Z = \sum_{\text{All Patterns}} e^{-E(\text{pattern})}$. To be more
precise, $p(a) = e^{-E(a)}/Z$, where $a$ is an activation pattern of all neurons
(visible and hidden). Hence, early neural networks bear the name Boltzmann Machine.
Paul Smolensky calls $-E$ the Harmony. A network seeks low energy, which is high
Harmony.
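As a minimal sketch of the relation p(a) = e^{-E(a)}/Z, the following enumerates every activation pattern of a tiny binary network and normalizes by the partition function Z. The three-neuron weights `W` and thresholds `THETA` are hypothetical values chosen for illustration, and the energy form is assumed to be the one given in the comparison of networks.

```python
import itertools
import math

# Hypothetical 3-neuron network: symmetric weights with a zero diagonal.
W = [[0.0, 1.0, -0.5],
     [1.0, 0.0, 0.3],
     [-0.5, 0.3, 0.0]]
THETA = [0.1, -0.2, 0.0]  # hypothetical thresholds

def energy(a, w=W, theta=THETA):
    """E(a) = -1/2 * sum_ij w_ij a_i a_j + sum_i theta_i a_i."""
    n = len(a)
    pair = sum(w[i][j] * a[i] * a[j] for i in range(n) for j in range(n))
    return -0.5 * pair + sum(theta[i] * a[i] for i in range(n))

def boltzmann_distribution(n=3):
    """Return {pattern: p(pattern)} with p(a) = exp(-E(a)) / Z."""
    patterns = list(itertools.product([0, 1], repeat=n))
    unnormalized = {a: math.exp(-energy(a)) for a in patterns}
    z = sum(unnormalized.values())  # partition function Z
    return {a: u / z for a, u in unnormalized.items()}

probs = boltzmann_distribution()
lowest_energy = min(probs, key=energy)
most_probable = max(probs, key=probs.get)
```

Because the probability decreases monotonically with energy, the lowest-energy pattern is also the most probable one. Exhaustive enumeration is only feasible for very small networks, since the number of patterns doubles with each added neuron.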

Networks
This table shows connection diagrams of various unsupervised networks, the details
of which will be given in the section Comparison of Networks. Circles are neurons
and edges between them are connection weights. As network design changes, features
are added on to enable new capabilities or removed to make learning faster. For
instance, neurons change between deterministic (Hopfield) and stochastic
(Boltzmann) to allow robust output, weights are removed within a layer (RBM) to
hasten learning, or connections are allowed to become asymmetric (Helmholtz).

Hopfield: A network based on magnetic domains in iron with a single self-connected
layer. It can be used as a content-addressable memory.
Boltzmann: The network is separated into 2 layers (hidden vs. visible), but still
uses symmetric 2-way weights. Following Boltzmann's thermodynamics, individual
probabilities give rise to macroscopic energies.
RBM: Restricted Boltzmann Machine. This is a Boltzmann machine where lateral
connections within a layer are prohibited to make analysis tractable.
Stacked Boltzmann: This network has multiple RBMs to encode a hierarchy of hidden
features. After a single RBM is trained, another blue hidden layer (see left RBM) is
added, and the top 2 layers are trained as a red & blue RBM. Thus the middle layers
act as hidden or visible, depending on the training phase.
Helmholtz: Instead of the bidirectional symmetric connections of the stacked
Boltzmann machines, there are separate one-way connections that form a loop. It does
both generation and discrimination.
Autoencoder: A feed-forward network that aims to find a good middle-layer
representation of its input world. This network is deterministic, so it is not as
robust as its successor, the VAE.
VAE: Applies variational inference to the autoencoder. The middle layer is a set of
means & variances for Gaussian distributions. The stochastic nature allows for more
robust imagination than the deterministic autoencoder.
Of the networks bearing people's names, only Hopfield worked directly with neural
networks. Boltzmann and Helmholtz came before artificial neural networks, but their
work in physics and physiology inspired the analytical methods that were used.

History
1969 Perceptrons by Minsky & Papert shows that a perceptron without hidden layers
fails on XOR.
1970s (approximate dates) First AI winter.
1974 Ising magnetic model proposed by W. A. Little for cognition.
1980 Fukushima introduces the neocognitron, which is later called a convolutional
neural network. It is mostly used in supervised learning, but deserves a mention
here.
1982 Ising variant Hopfield net described as CAMs and classifiers by John
Hopfield.
1983 Ising variant Boltzmann machine with probabilistic neurons described by
Hinton & Sejnowski, following Sherrington & Kirkpatrick's 1975 work.
1986 Paul Smolensky publishes Harmony Theory, which is an RBM with practically the
same Boltzmann energy function. Smolensky did not give a practical training scheme;
Hinton did in the mid-2000s.
1995 Schmidhuber introduces the LSTM neuron for languages.
1995 Dayan & Hinton introduce the Helmholtz machine.
1995-2005 (approximate dates) Second AI winter.
2013 Kingma, Rezende, & co. introduce variational autoencoders as a Bayesian
graphical probability network, with neural nets as components.
Specific Networks
Here, we highlight some characteristics of select networks. The details of each are
given in the comparison table below.

Hopfield Network
Ferromagnetism inspired Hopfield networks. A neuron corresponds to an iron domain
with binary magnetic moments Up and Down, and neural connections correspond to the
domains' influence on each other. Symmetric connections enable a global energy
formulation. During inference the network updates each state using the standard
activation step function. Symmetric weights and the right energy function
guarantee convergence to a stable activation pattern. Asymmetric weights are
difficult to analyze. Hopfield nets are used as Content Addressable Memories (CAM).
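The content-addressable behaviour can be illustrated with a small sketch. It assumes +1/-1 neurons, Hebbian storage (each weight is the sum of products of the stored states), and asynchronous step-function updates; the function names `train_hopfield` and `recall` and the 8-neuron pattern are hypothetical.

```python
def train_hopfield(patterns):
    """Hebbian storage for +1/-1 patterns: w_ij = sum over patterns of s_i * s_j."""
    n = len(patterns[0])
    w = [[0] * n for _ in range(n)]
    for p in patterns:
        for i in range(n):
            for j in range(n):
                if i != j:  # no self-connections
                    w[i][j] += p[i] * p[j]
    return w

def recall(w, state, max_sweeps=10):
    """Update neurons one at a time with a step activation until nothing changes."""
    s = list(state)
    for _ in range(max_sweeps):
        changed = False
        for i in range(len(s)):
            field = sum(w[i][j] * s[j] for j in range(len(s)))
            new = 1 if field >= 0 else -1
            if new != s[i]:
                s[i] = new
                changed = True
        if not changed:  # stable activation pattern reached
            break
    return s

stored = [1, -1, 1, -1, 1, -1, 1, -1]
weights = train_hopfield([stored])
noisy = list(stored)
noisy[0] = -noisy[0]  # corrupt one bit of the stored pattern
restored = recall(weights, noisy)
```

Presenting the corrupted pattern retrieves the stored one, which is the sense in which the network acts as a memory addressed by content rather than by location.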
Boltzmann Machine
These are stochastic Hopfield nets. Their state value is sampled from a probability
distribution as follows: suppose a binary neuron fires with the Bernoulli
probability p(1) = 1/3 and rests with p(0) = 2/3. One samples from it by taking a
uniformly distributed random number y and plugging it into the inverted cumulative
distribution function, which in this case is the step function thresholded at 2/3.
The inverse function = { 0 if y <= 2/3, 1 if y > 2/3 }.
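The sampling step described above can be sketched as follows. The helper name `sample_bernoulli` is hypothetical, and the fixed seed is only there to make the illustration repeatable.

```python
import random

def sample_bernoulli(p_one, y):
    """Inverse-CDF sampling: return 0 if y <= 1 - p_one, otherwise 1."""
    return 0 if y <= 1.0 - p_one else 1

rng = random.Random(0)  # fixed seed, for repeatability only
draws = [sample_bernoulli(1 / 3, rng.random()) for _ in range(30000)]
fraction_firing = sum(draws) / len(draws)  # should be close to 1/3
```

With p(1) = 1/3 the threshold is 2/3, so uniform numbers above 2/3 map to a firing neuron and the rest map to a resting one.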
Sigmoid Belief Net
Introduced by Radford Neal in 1992, this network applies ideas from probabilistic
graphical models to neural networks. A key difference is that nodes in graphical
models have pre-assigned meanings, whereas Belief Net neurons' features are
determined after training. The network is a sparsely connected directed acyclic
graph composed of binary stochastic neurons. The learning rule comes from Maximum
Likelihood on $p(X)$: $\Delta w_{ij} \propto s_j (s_i - p_i)$, where
$p_i = 1 / (1 + e^{-x_i})$ and $x_i$ is the weighted input into neuron $i$. The
$s_j$ are activations from an unbiased sample of the posterior distribution, which
is problematic due to the explaining-away problem raised by Judea Pearl.
Variational Bayesian methods use a surrogate posterior and disregard this
complexity.
Deep Belief Network
Introduced by Hinton, this network is a hybrid of an RBM and a Sigmoid Belief
Network. The top 2 layers form an RBM, and the layers from the second downwards form
a sigmoid belief network. One trains it by the stacked RBM method and then throws
away the recognition weights below the top RBM. As of 2009, 3-4 layers seemed to be
the optimal depth.[2]
Helmholtz machine
These are early inspirations for variational autoencoders. A Helmholtz machine is 2
networks combined into one: forward weights perform recognition and backward
weights implement imagination. It is perhaps the first network to do both. Helmholtz
did not work in machine learning, but he inspired the view of a "statistical
inference engine whose function is to infer probable causes of sensory input".[3]
The stochastic binary neuron outputs a probability that its state is 0 or 1. The
data input is normally not considered a layer, but in the Helmholtz machine
generation mode, the data layer receives input from the middle layer and has
separate weights for this purpose, so it is considered a layer. Hence this network
has 3 layers.
Variational autoencoder
These are inspired by Helmholtz machines and combine a probability network with
neural networks. An autoencoder is a 3-layer CAM network, where the middle layer is
supposed to be some internal representation of input patterns. The encoder neural
network is a probability distribution $q_\phi(z \mid x)$ and the decoder network is
$p_\theta(x \mid z)$. The weights are named phi and theta rather than W and V as in
Helmholtz, a cosmetic difference. These 2 networks can be fully connected, or use
another NN scheme.
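The stochastic middle layer can be illustrated with a short sketch. It assumes a diagonal Gaussian encoder output parameterized by a mean and a log-variance per latent dimension and a standard normal prior, which are common choices rather than something stated in this article; the function names and the example values are hypothetical.

```python
import math
import random

def reparameterize(mu, log_var, rng):
    """Draw z = mu + sigma * eps with eps ~ N(0, 1), one value per latent dimension."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, log_var)]

def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, sigma^2) || N(0, 1) ), summed over latent dimensions."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, log_var))

rng = random.Random(0)  # fixed seed, for repeatability only
mu, log_var = [0.5, -1.0], [0.0, math.log(0.25)]  # hypothetical encoder outputs
z = reparameterize(mu, log_var, rng)   # sample that would be passed to the decoder
kl = kl_to_standard_normal(mu, log_var)
```

Writing the sample as a deterministic function of the mean, the variance, and independent noise is what lets a reconstruction error be backpropagated through the sampling step in an automatic-differentiation framework.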
Comparison of networks
Usage & notables
Hopfield: CAM, traveling salesman problem.
Boltzmann: CAM. The freedom of connections makes this network difficult to analyze.
RBM: pattern recognition; used in MNIST digits and speech.
Stacked RBM: recognition & imagination; trained with unsupervised pre-training and/or supervised fine-tuning.
Helmholtz: imagination, mimicry.
Autoencoder: language: creative writing, translation; vision: enhancing blurry images.
VAE: generates realistic data.

Neuron
Hopfield: deterministic binary state. Activation = { 0 (or -1) if x is negative, 1 otherwise }.
Boltzmann: stochastic binary Hopfield neuron.
RBM: same as Boltzmann (extended to real-valued in the mid-2000s).
Stacked RBM: same as RBM.
Helmholtz: same as RBM.
Autoencoder: language: LSTM; vision: local receptive fields; usually real-valued ReLU activation.
VAE: middle-layer neurons encode means & variances for Gaussians. In run mode (inference), the outputs of the middle layer are sampled values from the Gaussians.

Connections
Hopfield: 1 layer with symmetric weights. No self-connections.
Boltzmann: 2 layers (1 hidden & 1 visible); symmetric weights.
RBM: same as Boltzmann, with no lateral connections within a layer.
Stacked RBM: top layer is undirected and symmetric; other layers are 2-way and asymmetric.
Helmholtz: 3 layers; asymmetric weights; 2 networks combined into 1.
Autoencoder: 3 layers. The input is considered a layer even though it has no inbound weights. Recurrent layers for NLP; feedforward convolutions for vision. Input & output have the same neuron counts.
VAE: 3 layers: input, encoder, distribution sampler, decoder. The sampler is not considered a layer.

Inference & energy
Hopfield: energy is given by the Gibbs probability measure: $E = -\frac{1}{2}\sum_{i,j} w_{ij} s_i s_j + \sum_i \theta_i s_i$.
Boltzmann: same as Hopfield.
RBM: same as Hopfield.
Helmholtz: minimize KL divergence.
Autoencoder: inference is only feed-forward; previous UL networks ran forwards and backwards.
VAE: minimize error = reconstruction error + KLD.

Training
Hopfield: $\Delta w_{ij} = s_i s_j$, for +1/-1 neurons.
Boltzmann: $\Delta w_{ij} = e\,(p_{ij} - p'_{ij})$, derived from minimizing KLD; $e$ = learning rate, $p'$ = predicted and $p$ = actual distribution.
RBM: $\Delta w_{ij} = e\,(\langle v_i h_j \rangle_{\text{data}} - \langle v_i h_j \rangle_{\text{equilibrium}})$, a form of contrastive divergence with Gibbs sampling; $\langle \cdot \rangle$ denotes an expectation.
Stacked RBM: similar to RBM; train 1 layer at a time; approximate the equilibrium state with a 3-segment pass; no backpropagation.
Helmholtz: wake-sleep 2-phase training.
Autoencoder: backpropagate the reconstruction error.
VAE: reparameterize the hidden state for backpropagation.

Strength
Hopfield: resembles physical systems, so it inherits their equations.
Boltzmann: same as Hopfield; hidden neurons act as an internal representation of the external world.
RBM: faster, more practical training scheme than Boltzmann machines.
Stacked RBM: trains quickly; gives a hierarchical layer of features.
Helmholtz: mildly anatomical; analyzable with information theory & statistical mechanics.

Weakness
Boltzmann: hard to train due to lateral connections.
RBM: equilibrium requires too many iterations.
Stacked RBM: integer & real-valued neurons are more complicated.
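The RBM training rule in the comparison above can be sketched as a single contrastive-divergence step. This sketch makes several simplifying assumptions that are not from the article: biases are omitted, the equilibrium expectation is approximated by one Gibbs step (often called CD-1), and the names `cd1_step`, `hidden_probs`, and `visible_probs` are hypothetical.

```python
import math
import random

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def hidden_probs(v, w):
    """p(h_j = 1 | v) for weights w[i][j] (visible i, hidden j); biases omitted."""
    return [sigmoid(sum(v[i] * w[i][j] for i in range(len(v)))) for j in range(len(w[0]))]

def visible_probs(h, w):
    """p(v_i = 1 | h), using the same symmetric weights in the other direction."""
    return [sigmoid(sum(h[j] * w[i][j] for j in range(len(h)))) for i in range(len(w))]

def sample(probs, rng):
    return [1 if rng.random() < p else 0 for p in probs]

def cd1_step(v0, w, lr, rng):
    """Update w in place by lr * (<v_i h_j>_data - <v_i h_j>_reconstruction)."""
    ph0 = hidden_probs(v0, w)               # positive phase, driven by the data
    h0 = sample(ph0, rng)
    v1 = sample(visible_probs(h0, w), rng)  # one Gibbs step back to the visible layer
    ph1 = hidden_probs(v1, w)               # negative phase, driven by the reconstruction
    for i in range(len(w)):
        for j in range(len(w[0])):
            w[i][j] += lr * (v0[i] * ph0[j] - v1[i] * ph1[j])
    return w

rng = random.Random(0)              # fixed seed, for repeatability only
w = [[0.0, 0.0] for _ in range(4)]  # 4 visible x 2 hidden units, all weights zero
for _ in range(200):
    cd1_step([1, 1, 0, 0], w, lr=0.1, rng=rng)
```

Training on the single pattern [1, 1, 0, 0] pushes the weights of the two active visible units up and those of the two inactive units down. No lateral weights appear anywhere in the update, which is the restriction that makes each layer's units conditionally independent given the other layer.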
Hebbian Learning, ART, SOM
The classical example of unsupervised learning in the study of neural networks is
Donald Hebb's principle, that is, neurons that fire together wire together.[4] In
Hebbian learning, the connection is reinforced irrespective of an error and is
exclusively a function of the coincidence of action potentials in the two
neurons.[5] A similar version that modifies synaptic weights takes into account the
time between the action potentials (spike-timing-dependent plasticity, or STDP).
Hebbian learning has been hypothesized to underlie a range of cognitive functions,
such as pattern recognition and experiential learning.
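A minimal sketch of the plain Hebbian rule follows. It assumes rate-like activities for the pre- and postsynaptic neurons and a fixed learning rate; the function name `hebbian_update` and the example values are hypothetical. Note that no error signal enters the update.

```python
def hebbian_update(w, pre, post, lr=0.1):
    """Strengthen w[i][j] in proportion to the coincidence of pre[j] and post[i]."""
    return [[w[i][j] + lr * post[i] * pre[j] for j in range(len(pre))]
            for i in range(len(post))]

w = [[0.0, 0.0], [0.0, 0.0]]
# Only the first presynaptic neuron is active, while both postsynaptic neurons fire.
w = hebbian_update(w, pre=[1, 0], post=[1, 1])
```

Only the connections from the active presynaptic neuron are strengthened; connections from the silent one are left unchanged.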

Among neural network models, the self-organizing map (SOM) and adaptive resonance
theory (ART) are commonly used in unsupervised learning algorithms. The SOM is a
topographic organization in which nearby locations in the map represent inputs with
similar properties. The ART model allows the number of clusters to vary with
problem size and lets the user control the degree of similarity between members of
the same clusters by means of a user-defined constant called the vigilance
parameter. ART networks are used for many pattern recognition tasks, such as
automatic target recognition and seismic signal processing.[6]

Probabilistic methods
Two of the main methods used in unsupervised learning are principal component
analysis and cluster analysis. Cluster analysis is used in unsupervised learning to
group, or segment, datasets with shared attributes in order to extrapolate
algorithmic relationships.[7] Cluster analysis is a branch of machine learning that
groups data that has not been labelled, classified, or categorized. Instead of
responding to feedback, cluster analysis identifies commonalities in the data and
reacts based on the presence or absence of such commonalities in each new piece of
data. This approach helps detect anomalous data points that do not fit into any
group.
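As one concrete instance of cluster analysis, the sketch below implements k-means (Lloyd's algorithm) on a handful of 2-D points. The data, the choice of two clusters, and the initial centroids are hypothetical, and other clustering methods would group the data differently.

```python
def kmeans(points, centroids, iterations=10):
    """Lloyd's algorithm: alternate nearest-centroid assignment and centroid update."""
    centroids = [tuple(c) for c in centroids]
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            distances = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids]
            clusters[distances.index(min(distances))].append(p)
        # Move each centroid to the mean of its cluster (keep it if the cluster is empty).
        centroids = [
            tuple(sum(coord) / len(cluster) for coord in zip(*cluster)) if cluster else c
            for cluster, c in zip(clusters, centroids)
        ]
    labels = [
        min(range(len(centroids)),
            key=lambda k: sum((a - b) ** 2 for a, b in zip(p, centroids[k])))
        for p in points
    ]
    return centroids, labels

points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
centroids, labels = kmeans(points, centroids=[points[0], points[3]])
```

No labels are supplied at any point: the grouping emerges only from distances between the points. A new point that lies far from every centroid could be flagged as anomalous.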

A central application of unsupervised learning is in the field of density
estimation in statistics,[8] though unsupervised learning encompasses many other
domains involving summarizing and explaining data features. It can be contrasted
with supervised learning as follows: supervised learning intends to infer a
conditional probability distribution conditioned on the label of the input data,
whereas unsupervised learning intends to infer an a priori probability distribution.

Approaches
Some of the most common algorithms used in unsupervised learning include: (1)
Clustering, (2) Anomaly detection, (3) Approaches for learning latent variable
models. Each approach uses several methods as follows:

Clustering methods include: hierarchical clustering,[9] k-means,[10] mixture
models, DBSCAN, and the OPTICS algorithm
Anomaly detection methods include: Local Outlier Factor and Isolation Forest
Approaches for learning latent variable models include: the expectation–maximization
algorithm (EM), the method of moments, and blind signal separation techniques
(principal component analysis, independent component analysis, non-negative matrix
factorization, singular value decomposition)
Method of moments
One of the statistical approaches for unsupervised learning is the method of
moments. In the method of moments, the unknown parameters (of interest) in the
model are related to the moments of one or more random variables, and thus, these
unknown parameters can be estimated given the moments. The moments are usually
estimated from samples empirically. The basic moments are first and second order
moments. For a random vector, the first order moment is the mean vector, and the
second order moment is the covariance matrix (when the mean is zero). Higher order
moments are usually represented using tensors which are the generalization of
matrices to higher orders as multi-dimensional arrays.
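To make the idea concrete, the sketch below estimates the two parameters of a gamma distribution by matching its first two moments. The gamma distribution, the true parameter values, and the sample size are hypothetical choices for illustration; the article does not single out any particular model.

```python
import random

def gamma_method_of_moments(samples):
    """Match the first two moments of Gamma(shape, scale):
    mean = shape * scale and variance = shape * scale**2."""
    n = len(samples)
    mean = sum(samples) / n
    variance = sum((x - mean) ** 2 for x in samples) / n
    scale = variance / mean
    shape = mean / scale
    return shape, scale

rng = random.Random(0)  # fixed seed, for repeatability only
samples = [rng.gammavariate(2.0, 3.0) for _ in range(50000)]  # shape 2, scale 3
shape_hat, scale_hat = gamma_method_of_moments(samples)
```

The unknown parameters are written as functions of the moments, and the empirical moments are then plugged in, so no iterative optimization is needed.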

In particular, the method of moments is shown to be effective in learning the
parameters of latent variable models. Latent variable models are statistical models
in which, in addition to the observed variables, there is a set of latent variables
that are not observed. A highly practical example of latent variable models in
machine learning is topic modeling, a statistical model for generating the words
(observed variables) in a document based on the topic (latent variable) of the
document. In topic modeling, the words in the document are generated according to
different statistical parameters when the topic of the document changes. It has been
shown that the method of moments (tensor decomposition techniques) consistently
recovers the parameters of a large class of latent variable models under some
assumptions.[11]

The expectation–maximization algorithm (EM) is also one of the most practical
methods for learning latent variable models. However, it can get stuck in local
optima, and it is not guaranteed to converge to the true unknown parameters of the
model. In contrast, for the method of moments, global convergence is guaranteed
under some conditions.
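As an illustration of EM on a latent variable model, the sketch below fits a one-dimensional mixture of two Gaussians, where the latent variable is the component that generated each point. The data, the initial means, and the fixed iteration count are hypothetical, and the result depends on the initialization, in line with the local-optimum caveat above.

```python
import math
import random

def normal_pdf(x, mean, var):
    return math.exp(-(x - mean) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)

def em_two_gaussians(data, means, iterations=50):
    """EM for a 1-D mixture of two Gaussians; returns (weights, means, variances)."""
    weights, variances = [0.5, 0.5], [1.0, 1.0]
    means = list(means)
    for _ in range(iterations):
        # E-step: responsibility of each component for each data point.
        resp = []
        for x in data:
            joint = [w * normal_pdf(x, m, v) for w, m, v in zip(weights, means, variances)]
            total = sum(joint)
            resp.append([j / total for j in joint])
        # M-step: re-estimate the parameters from the responsibility-weighted points.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            weights[k] = nk / len(data)
            means[k] = sum(r[k] * x for r, x in zip(resp, data)) / nk
            spread = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, data)) / nk
            variances[k] = max(spread, 1e-6)  # guard against a collapsing component
    return weights, means, variances

rng = random.Random(0)  # fixed seed, for repeatability only
data = [rng.gauss(-2.0, 1.0) for _ in range(500)] + [rng.gauss(3.0, 1.0) for _ in range(500)]
weights, means, variances = em_two_gaussians(data, means=[-1.0, 1.0])
```

On well-separated data such as this, the estimated means land close to the generating values. With overlapping components or a poor starting point, the same procedure can settle on a worse solution.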

See also
Automated machine learning
Cluster analysis
Anomaly detection
Expectation–maximization algorithm
Generative topographic map
Meta-learning (computer science)
Multivariate analysis
Radial basis function network
Weak supervision
References
[1] Hinton, G. (2012). "A Practical Guide to Training Restricted Boltzmann Machines". Neural Networks: Tricks of the Trade. Lecture Notes in Computer Science. Vol. 7700. Springer. pp. 599–619. doi:10.1007/978-3-642-35289-8_32. ISBN 978-3-642-35289-8.
[2] Hinton, Geoffrey (September 2009). "Deep Belief Nets" (video).
[3] Dayan, Peter; Hinton, Geoffrey E.; Neal, Radford M.; Zemel, Richard S. (1995). "The Helmholtz machine". Neural Computation. 7 (5): 889–904. doi:10.1162/neco.1995.7.5.889. hdl:21.11116/0000-0002-D6D3-E. PMID 7584891. S2CID 1890561.
[4] Buhmann, J.; Kuhnel, H. (1992). "Unsupervised and supervised data clustering with competitive neural networks". [Proceedings 1992] IJCNN International Joint Conference on Neural Networks. Vol. 4. IEEE. pp. 796–801. doi:10.1109/ijcnn.1992.227220. ISBN 0780305590. S2CID 62651220.
[5] Comesaña-Campos, Alberto; Bouza-Rodríguez, José Benito (June 2016). "An application of Hebbian learning in the design process decision-making". Journal of Intelligent Manufacturing. 27 (3): 487–506. doi:10.1007/s10845-014-0881-z. ISSN 0956-5515. S2CID 207171436.
[6] Carpenter, G.A. & Grossberg, S. (1988). "The ART of adaptive pattern recognition by a self-organizing neural network". Computer. 21 (3): 77–88. doi:10.1109/2.33. S2CID 14625094. Archived from the original on 2018-05-16. Retrieved 2013-09-16.
[7] Roman, Victor (2019-04-21). "Unsupervised Machine Learning: Clustering Analysis". Medium. Retrieved 2019-10-01.
[8] Jordan, Michael I.; Bishop, Christopher M. (2004). "7. Intelligent Systems §Neural Networks". In Tucker, Allen B. (ed.). Computer Science Handbook (2nd ed.). Chapman & Hall/CRC Press. doi:10.1201/9780203494455. ISBN 1-58488-360-X.
[9] Hastie, Tibshirani & Friedman 2009, pp. 485–586.
[10] Garbade, Dr Michael J. (2018-09-12). "Understanding K-means Clustering in Machine Learning". Medium. Retrieved 2019-10-31.
[11] Anandkumar, Animashree; Ge, Rong; Hsu, Daniel; Kakade, Sham; Telgarsky, Matus (2014). "Tensor Decompositions for Learning Latent Variable Models". Journal of Machine Learning Research. 15: 2773–2832. arXiv:1210.7559. Bibcode:2012arXiv1210.7559A.
Further reading
Bousquet, O.; von Luxburg, U.; Raetsch, G., eds. (2004). Advanced Lectures on
Machine Learning. Springer. ISBN 978-3540231226.
Duda, Richard O.; Hart, Peter E.; Stork, David G. (2001). "Unsupervised Learning
and Clustering". Pattern classification (2nd ed.). Wiley. ISBN 0-471-05669-3.
Hastie, Trevor; Tibshirani, Robert; Friedman, Jerome (2009). "Unsupervised
Learning". The Elements of Statistical Learning: Data mining, Inference, and
Prediction. Springer. pp. 485–586. doi:10.1007/978-0-387-84858-7_14.
ISBN 978-0-387-84857-0.
Hinton, Geoffrey; Sejnowski, Terrence J., eds. (1999). Unsupervised Learning:
Foundations of Neural Computation. MIT Press. ISBN 0-262-58168-X.
