Module 8 (ANN1)
k-means
RBF networks
PNN
SOM
Non-Hierarchical Cluster Analysis: k-means
[Figure: two clusters A and B in the plane; + marks each cluster centroid]
http://www.elet.polimi.it/upload/matteucc/Clustering/tutorial_html/kmeans.html
Example: k-means Clustering
Data
ID   x1   x2
A1   1    1
A2   2    1
B1   4    5
B2   5    7
B3   7    7
[Figure: scatter plot of A1, A2, B1, B2, B3 in the (x1, x2) plane; + marks the centroids]
• 2 centroids (k = 2)
• Euclidean distance
Cluster Boundary
(Classifier)
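The example above can be traced with a minimal k-means sketch in plain Python. It uses the five data points from the table, k = 2, and Euclidean distance. The choice of the first two objects (A1, A2) as starting centroids is an assumption; the slides do not state the initialization. The function name `kmeans` is illustrative only.

```python
import math

def kmeans(points, centroids, max_iter=100):
    """Plain k-means (Lloyd's algorithm) with Euclidean distance."""
    centroids = [tuple(c) for c in centroids]
    labels = []
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda k: math.dist(p, centroids[k]))
            for p in points
        ]
        # Update step: each centroid becomes the mean of its members.
        new_centroids = []
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                new_centroids.append(
                    tuple(sum(col) / len(members) for col in zip(*members))
                )
            else:
                new_centroids.append(centroids[k])  # empty cluster: keep old centroid
        if new_centroids == centroids:  # converged, nothing moved
            break
        centroids = new_centroids
    return labels, centroids

data = {"A1": (1, 1), "A2": (2, 1), "B1": (4, 5), "B2": (5, 7), "B3": (7, 7)}
points = list(data.values())
# Assumption: the first two objects serve as initial centroids.
labels, centroids = kmeans(points, centroids=[points[0], points[1]])
for name, lab in zip(data, labels):
    print(name, "-> cluster", lab)
print("centroids:", centroids)
```

With this initialization the procedure settles on {A1, A2} and {B1, B2, B3}; a different start could in principle converge to a different partition.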
k-Nearest Neighbors
[Figure: query point in the (x1, x2) plane with neighborhoods for k = 3 and k = 6]
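A minimal sketch of k-nearest-neighbor classification, reusing the five points from the k-means example. Reading the class labels "A" and "B" off the ID prefixes is an assumption, as are the query points and the name `knn_predict`.

```python
import math
from collections import Counter

def knn_predict(query, samples, k=3):
    """Classify `query` by majority vote among its k nearest samples."""
    # samples: list of ((x1, x2), label) pairs
    nearest = sorted(samples, key=lambda s: math.dist(query, s[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Assumption: class labels taken from the ID prefixes of the data table.
samples = [((1, 1), "A"), ((2, 1), "A"), ((4, 5), "B"), ((5, 7), "B"), ((7, 7), "B")]

print(knn_predict((2, 2), samples, k=3))
print(knn_predict((6, 6), samples, k=3))
print(knn_predict((2, 2), samples, k=5))  # k = 5 uses all points, so the larger class wins
```

The last call illustrates why the choice of k matters: a large k smooths the decision toward the majority class.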
Kernel-based Nearest Neighbors
Kernel function
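f(A, B) → scalar value
f(A, B) = 0 if A = B
f(A, B) > 0 if A ≠ B
Gaussian Kernel φ
$$\Phi_j(x) = \exp\left(-\frac{\lVert x - \mu_j \rVert^2}{2\sigma_j^2}\right)$$
[Figure: two Gaussian kernels in the (x1, x2) plane, centered at µA and µB with widths σA and σB]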
$$y_{class} = f_{class}(x) = \frac{1}{M} \sum_{j \in CLASS} \frac{1}{(2\pi)^{D/2} \sigma_j^{D}} \exp\left(-\frac{\lVert x - x_{class,j} \rVert^2}{2\sigma_j^2}\right)$$
Optional: softmax
$$z_{class} = \frac{\exp(y_{class})}{\sum_{k=1}^{C} \exp(y_k)}$$
smoothed version of "winner-take-all"
[Figure: network with inputs x1 ... xd, basis functions φ1 ... φM, and outputs y1 ... yc]
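The class score and the optional softmax can be sketched as follows. Each class score is the mean of Gaussian kernels centered on that class's training patterns, using the usual Gaussian normalization 1/((2π)^(D/2) σ^D). A single shared width `sigma` for all kernels is an assumption made for brevity (the slides index σ by j). The training patterns, the query point, and the function names are illustrative.

```python
import math

def class_score(x, patterns, sigma):
    """y_class: mean of normalized Gaussian kernels centered on the class patterns."""
    d = len(x)
    norm = (2 * math.pi) ** (d / 2) * sigma ** d
    total = sum(
        math.exp(-math.dist(x, p) ** 2 / (2 * sigma ** 2)) for p in patterns
    )
    return total / (len(patterns) * norm)

def softmax(scores):
    """z_class = exp(y_class) / sum_k exp(y_k)."""
    m = max(scores.values())  # subtract the max for numerical stability
    exps = {c: math.exp(y - m) for c, y in scores.items()}
    z = sum(exps.values())
    return {c: e / z for c, e in exps.items()}

# Hypothetical training patterns per class and a query point.
training = {"A": [(1, 1), (2, 1)], "B": [(4, 5), (5, 7), (7, 7)]}
x = (2, 2)

y = {c: class_score(x, pts, sigma=1.0) for c, pts in training.items()}
z = softmax(y)
print("winner:", max(y, key=y.get))
print("scores:", y)
print("softmax:", z)
```

Taking the arg max of `y` is the hard winner-take-all decision; `z` gives the smoothed version that sums to one.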
Probabilistic Neural Network (PNN)
[Figure: probability density P(x) over molecular property axes, e.g. <lipophilicity> and <volume>]
Radial Basis Function (RBF) Network
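$$y_k(x) = \sum_{j=1}^{M} w_{kj} \Phi_j(x) + w_{k0}$$
Gaussian basis function φ
$$\Phi_j(x) = \exp\left(-\frac{\lVert x - \mu_j \rVert^2}{2\sigma_j^2}\right) = \exp\left(-\frac{(x - \mu_j)^T (x - \mu_j)}{2\sigma_j^2}\right)$$
Standardized Gaussian φ
$$\Phi_j(x) = \frac{1}{(2\pi)^{D/2} \sigma_j^{D}} \exp\left(-\frac{\lVert x - \mu_j \rVert^2}{2\sigma_j^2}\right)$$
D: dimension of X
[Figure: network with inputs x1 ... xd, M basis functions φ1 ... φM, weights w, and outputs y1 ... yc]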
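A minimal sketch of the RBF network's forward pass with unnormalized Gaussian basis functions. The centers, widths, weights, and biases below are made-up values for illustration; in practice they would be fitted to data. The function names are likewise illustrative.

```python
import math

def gaussian_basis(x, mu, sigma):
    """Phi_j(x) = exp(-||x - mu_j||^2 / (2 sigma_j^2))."""
    return math.exp(-math.dist(x, mu) ** 2 / (2 * sigma ** 2))

def rbf_forward(x, centers, sigmas, weights, biases):
    """y_k(x) = sum_j w_kj * Phi_j(x) + w_k0 for every output k."""
    phi = [gaussian_basis(x, mu, s) for mu, s in zip(centers, sigmas)]
    return [
        sum(w * p for w, p in zip(w_k, phi)) + w_k0
        for w_k, w_k0 in zip(weights, biases)
    ]

# Hypothetical network: 2 inputs, M = 2 basis functions, 2 outputs.
centers = [(1.5, 1.0), (5.0, 6.0)]    # mu_j
sigmas = [1.0, 2.0]                   # sigma_j
weights = [[1.0, -1.0], [-1.0, 1.0]]  # weights[k][j] = w_kj
biases = [0.0, 0.5]                   # biases[k] = w_k0

out = rbf_forward((1.5, 1.0), centers, sigmas, weights, biases)
print(out)
```

Because the outputs are linear in the weights once the basis functions are fixed, the output layer can be fitted by linear least squares; that step is not shown here.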
The Self-Organizing Map (SOM)
Data Analysis by Self-Organizing Maps
(Kohonen networks)
[Figure: data set X and SOM]
Properties of Kohonen Networks
[Figure: Kohonen network with inputs x1 ... x4 connected by weights w to an (A • B) neuron array (A = 6, B = 5); output 1/0]
[Figure: square and hexagonal neuron arrays showing the central (active) neuron with its first and second neighborhoods]
Toroidal Topology of 2D-Kohonen Maps
An “endless plane”
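On a toroidal map the grid wraps around at its edges, so the distance between two neurons is measured the short way around in each dimension. The sketch below assumes a city-block (Manhattan) grid distance and a 6 x 5 array; both choices, and the name `toroidal_distance`, are illustrative.

```python
def toroidal_distance(r, s, shape):
    """Grid distance between neurons r and s on a map whose edges wrap around."""
    total = 0
    for a, b, size in zip(r, s, shape):
        step = abs(a - b)
        total += min(step, size - step)  # take the shorter way around the torus
    return total

shape = (6, 5)  # assumed A x B neuron array
print(toroidal_distance((0, 0), (5, 4), shape))  # opposite corners are close on a torus
print(toroidal_distance((0, 0), (3, 2), shape))
```

On a flat map the two corner neurons would be 9 steps apart; with wrap-around no neuron sits on a border, which removes edge effects.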
Competitive Learning
1. Randomly select an input pattern x
2. Determine the “winner” neuron (competitive stage)
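$$i^* \leftarrow \arg\min_i \sum_{j=1}^{\dim} (x_j - w_{ij})^2, \quad i = 1, 2, \ldots, n$$
Weight update (normalization of w):
$$w_{ij}^{new} = \begin{cases} \dfrac{w_{ij}^{old} + \eta x_j}{\lVert w_i^{old} + \eta x \rVert} & \text{if } i \in N_{i^*} \\ w_{ij}^{old} & \text{if } i \notin N_{i^*} \end{cases}$$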
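• Neighborhood (N) correction
$$h(t, r, s) = \exp\left(-\frac{d_1(r, s)^2}{2\sigma^2(t)}\right)$$
$$\sigma(t) = \sigma_{ini} \left(\frac{\sigma_{fin}}{\sigma_{ini}}\right)^{t / t_{max}}$$
[Figure: neighborhood function h over neuron positions r and s, shown for a wide and a narrow σ]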
[Figure: weight vectors of Neuron 1 and Neuron 2 in the (x1, x2) plane, and normalized onto the unit sphere]
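The competitive-learning steps can be sketched as one training step: pick the winner by squared Euclidean distance, compute a Gaussian neighborhood around it with a width σ(t) that shrinks over time, move the weights toward the input, and renormalize them to unit length. Scaling the update by h (instead of a hard in/out neighborhood test) is an assumption that combines the update rule with the neighborhood correction. The city-block grid distance, the 3 x 3 map, the learning rate, and the toy data are also assumptions.

```python
import math
import random

def winner(x, weights):
    """Index i* of the neuron whose weight vector is closest to x."""
    return min(range(len(weights)),
               key=lambda i: sum((xj - wj) ** 2 for xj, wj in zip(x, weights[i])))

def sigma(t, t_max, sigma_ini, sigma_fin):
    """Neighborhood width, decaying from sigma_ini (t = 0) to sigma_fin (t = t_max)."""
    return sigma_ini * (sigma_fin / sigma_ini) ** (t / t_max)

def neighborhood(d, s):
    """h = exp(-d^2 / (2 s^2)) for grid distance d and width s."""
    return math.exp(-d ** 2 / (2 * s ** 2))

def train_step(x, weights, positions, t, t_max, eta=0.5, sigma_ini=2.0, sigma_fin=0.1):
    """One competitive-learning step; updates `weights` in place."""
    i_star = winner(x, weights)
    s = sigma(t, t_max, sigma_ini, sigma_fin)
    for i, w in enumerate(weights):
        # Assumed city-block distance between grid positions.
        d = sum(abs(a - b) for a, b in zip(positions[i], positions[i_star]))
        h = neighborhood(d, s)
        moved = [wj + eta * h * xj for wj, xj in zip(w, x)]
        norm = math.sqrt(sum(v * v for v in moved))
        weights[i] = [v / norm for v in moved]  # keep w on the unit sphere
    return i_star

random.seed(0)
positions = [(a, b) for a in range(3) for b in range(3)]  # 3 x 3 map
weights = []
for _ in positions:
    v = [random.random() + 0.1 for _ in range(2)]
    n = math.hypot(*v)
    weights.append([c / n for c in v])  # start with unit-length weights

data = [(1.0, 0.1), (0.1, 1.0), (0.7, 0.7)]  # toy input patterns
t_max = 200
for t in range(t_max):
    train_step(random.choice(data), weights, positions, t, t_max)
print([winner(x, weights) for x in data])
```

Early in training the wide neighborhood drags most of the map toward each input; as σ(t) shrinks, only the winner and its closest neighbors are adjusted.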
Learning Time
SOM Adaptation to Probability Distributions
[Figure: SOM adapting to a probability distribution; snapshots at learning time t = 0, 100, 200, 300, 400, 500]
[Figure: Voronoi tessellation with regions A and B]
SOM - Issues
• features are not invariant to, e.g., translations of the input signal
Mapping Chemical Space: “Drugs” and “Nondrugs”
[Figure: reaction scheme with substituents R1 to R4; conditions MeOH / RT]
[Figure: PCA projection (PC1 axis) compared with a Kohonen map]
Self-organizing neural networks demo
1) University of Bochum
http://www.neuroinformatik.ruhr-uni-bochum.de/ini/VDM/research/gsn/DemoGNG/GNG
2) SOMMER