
Journal of Statistical Physics, Vol. 34, Nos. 3/4, 1984

Dynamics of Self-Organization in Complex Adaptive Networks

D. d'Humières 1,2 and B. A. Huberman 1

Received August 20, 1983

We study the dynamical behavior of complex adaptive automata during unsupervised learning of periodic training sets. A new technique for their analysis is presented and applied to an adaptive network with distributed memory. We show that with general input pattern sequences, the system can display behavior
that ranges from convergence into simple fixed points and oscillations to chaotic
wanderings. We also test the ability of the self-organized automaton to recognize
spatial patterns, discriminate between them, and to elicit meaningful informa-
tion out of noisy inputs. In this configuration we determine that the higher the
ratio of excitation to inhibition, the broader the equivalence class into which
patterns are put together.

KEY WORDS: Automata dynamics; self-organizing networks; learning and recognition.

1. INTRODUCTION

The problem of handling large amounts of redundant data and extracting relevant information from them lies at the heart of both pattern recognition automata and models of the higher brain functions, such as learning and associative memory. As such, they have been the focus of intense efforts aimed at designing algorithms and architectures. Among the many avenues being explored, a promising one resorts to local and parallel computation by arrays of processors with delocalized memories.(1-3)
1 Xerox Palo Alto Research Center, Palo Alto, California 94304.
2 Permanent address: École Normale Supérieure, 24 rue Lhomond, 75231 Paris Cedex 05, France.

In spite of all this work, little is known about the dynamics of complex networks and their behavior under general circumstances. Issues such as
their self-organization, stability under parameter changes, and the extent of their parallel computing cannot be answered with ease, even in the simplest
nontrivial cases. And yet, even partial answers to these questions are of
relevance to VLSI design and the understanding of neural organization; the
latter now emerging as one of the central problems of neurobiology.
Although part of the scarcity of answers is due to the lack of general
enough experiments, a more serious problem has been posed by the
absence of a methodology with which to systematically analyze experimen-
tal data. We believe that recent advances in dynamical systems theory can
provide such a framework, and lead to a quantitative understanding of
these very important issues. Moreover, complex adaptive networks with
hierarchical structures lend themselves to techniques that have been devel-
oped in the study of nonlinear systems with few degrees of freedom.(4)
This paper reports on work that we have performed on the dynamical
behavior of complex networks composed of interconnected cells arranged
in a hierarchically layered structure. The systems that we dealt with are
such that they possess a fixed wiring configuration but variable connection
strengths, allowed to track the inputs so as to produce the best output
according to given criteria. In particular, as the connectivity of the network
changes when subjected to an input set, it is of interest to determine
whether an asymptotic or "learning" behavior can take place. By this we
mean a situation whereby the output produced by the system does not
change much with further presentations of the same pattern sequence.
Moreover, one would like to know how such convergence depends on
parameters such as excitation, lateral inhibition, or connectivity.
In order to answer these questions, we first present a stroboscopic
technique to analyze the time evolution of a layered adaptive network
during unsupervised learning of a periodic sequence of training sets (Section
2). The method is then applied to the experimental study of an adaptive
network with distributed memory described in Section 3. We show that
with general input pattern sequences, changing the relative levels of excita-
tion versus inhibition leads to fixed points, oscillatory states, and chaotic
wanderings during the adaptive process (Section 4). Moreover, using the
self-organized network after the adaptive process has taken place, we study
the size of equivalence classes as a function of excitation and inhibition.
We also present preliminary results on pattern recognition of incomplete
and noisy patterns (Section 5). A conclusion summarizes our findings and
poses some questions for future research, and an Appendix provides the
mathematical details of Section 3.
The results that we have obtained point to the usefulness of computer
experiments in order to gain insights into issues relevant to self-organization
in complex systems. Rather than optimizing an existing network or trying
to closely model neurobiological systems, we have concentrated on the general dynamic properties of adaptive automata, a problem which underlies the behavior of many other machines.

2. STROBOSCOPIC DYNAMICS OF LAYERED NETWORKS

2.1. General Description of Adaptive Networks


Consider the general layered network shown in Fig. 1, having p layers,
each layer containing n cells. Each layer l of the net is characterized at time
t by vectors I_l(t) and O_l(t), made up of the inputs and outputs of the layer's cells, an array S_l(t) which stores the information about the internal state of the network, and a transformation F relating the output of the layer at time t + τ to its input and its state at time t, i.e., O_l(t + τ) = F(S_l(t), I_l(t)). Each cell i is a device with n afferent inputs stemming from the n outputs of the preceding layer, i.e., I_l(t) = O_{l-1}(t), which in turn produces an output O_{l,i}(t) whose value depends on its internal state S_{l,i}(t), the input values, and the propagation rules given by F. The inputs to the first layer are given externally, and the information is transmitted from layer to layer, down to the last layer, which we will call the output layer.

Fig. 1. A network with two layers and four cells per layer. The dashed lines show the feedback path during the adaptive phase.
As long as the states of all the cells are fixed there are no fundamental differences between asynchronous and synchronous networks. This is no longer true, however, if the states are allowed to change in time through an
adaptive process. Since synchronous networks cover a wide range of
applications and are easier to model and simulate than asynchronous ones,
we will restrict our study to them. Moreover, we will assume that the
external inputs of the first layer are stable for long enough to insure that
both the state variables of the network and the output of the last layer are
stable. We will also change the external inputs of the first layer at a fixed rate (defined by an external clock with a period T ≫ τ). Under these conditions the network can then be sampled at the same clock rate in time intervals k·T, which are multiples of T. Therefore transients can be ignored and I_l(t), O_l(t), and S_l(t) can then be replaced by I_l^(k), O_l^(k), and S_l^(k), respectively, with

I_l^(k) = O_{l-1}^(k)    (2.1)

O_l^(k) = F(S_l^(k), I_l^(k))    (2.2)

Any adaptive process consists of a feedback mechanism through which the different outputs can act back on the internal state of the network. Here
the different outputs can act back on the internal state of the network. Here
we will restrict this process to feedback of the outputs of one layer on its
own internal state, as shown by the dashed lines in Fig. 1. The adaptive
process for the layer l is then determined by its initial state and by a
transformation G_l, relating its state at sampled time k + 1 to its state and its output at sampled time k:

S_l^(k+1) = G_l(S_l^(k), O_l^(k))    (2.3)

We should point out that the general network being considered here
has a material connectivity (or wiring) independent of the adaptive process.
The latter only changes the relative strength of the couplings as the input is
changed, thereby producing an effective change in connectivity. This is to
be contrasted with other automata where the rules are such that the wiring
itself is allowed to change with the adaptive process.
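As a concrete illustration of Eqs. (2.1)-(2.3), here is a minimal sketch of such a sampled, synchronous network in Python. The class, the learning rate, and the placeholder rules F and G are ours, not the paper's; the transformations actually used in the experiments are those of Section 3 and the Appendix.

import numpy as np

class Layer:
    """One synchronous layer: a state array S and the rules F (propagation,
    Eq. (2.2)) and G (adaptation, Eq. (2.3)). F and G here are simple
    stand-ins for the paper's transformations."""
    def __init__(self, n, rng):
        self.S = 0.01 * rng.random((n, n))  # state array S_l, one n x n filter

    def F(self, x):
        # Eq. (2.2): O_l^(k) = F(S_l^(k), I_l^(k)); linear filter plus rectification
        return np.maximum(0.0, self.S @ x)

    def G(self, y):
        # Eq. (2.3): S_l^(k+1) = G_l(S_l^(k), O_l^(k)); a Hebbian-like placeholder
        self.S += 0.01 * np.outer(y, y)

def present(layers, x):
    """One clock period: sampled feed-forward pass, then per-layer adaptation."""
    for layer in layers:
        y = layer.F(x)  # output of layer l
        layer.G(y)      # each layer adapts on its own output only
        x = y           # Eq. (2.1): I_l^(k) = O_{l-1}^(k)
    return x            # output of the last (output) layer

rng = np.random.default_rng(0)
net = [Layer(8, rng) for _ in range(3)]
out = present(net, rng.random(8))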

2.2. Stroboscopic Dynamics


In principle, the time evolution of the network is completely given by the transformations F, G_l, and the input sequence. Unfortunately, with the exception of trivial transformations on networks with few elements, our knowledge about the dynamics of systems with many degrees of freedom does not provide many useful insights. However, the layered and hierarchical structure of the network allows us to consider as equivalent all states producing the same output at the last layer for a given input vector.3 Thus, we only need to focus on the last output vector, thereby reducing the number of independent variables from p·n·(1 + m·n) to n (if the dimensionality of the state arrays S_l is m·n·n). For a network such as the one we studied (n = 144, p = 3, m = 2), this simplification reduces the number of independent variables (or degrees of freedom) by almost three orders of magnitude.

An additional simplification arises if the input patterns I_1^(k) are presented to the network in a periodic sequence, i.e.,

I_1^(k+K) = I_1^(k)    (2.4)
In this case we can use a method similar to the mappings at a period used in classical mechanics,(4) which consists in sampling the state of the network at each period of the input sequence. This can be achieved by recording, for example, the output vector for a given input, or the output vector with maximum length at each period. Therefore, analyzing the periodically sampled data, it is easier to draw conclusions about the system's behavior. In particular, we have the following:
(i) One point in the sampled hyperspace will indicate a cycle, either a
static fixed point, or a cycle with the same frequency as the input sequence.
(ii) Several points will indicate a more complicated motion, such as periodic behavior with a fundamental period which is a multiple of that of the input pattern sequence.
(iii) A closed trajectory will indicate quasiperiodic motion, i.e., dy-
namics in which the state of the network changes with a frequency which is
incommensurate with that of the input pattern sequence.
(iv) A cloud of points will indicate the presence of chaotic behavior.
By this, we mean dynamical behavior such that, although deterministic in
origin, the system can be best described by probabilistic methods. As we
show below, this behavior implies lack of convergence of the adaptive
process although not necessarily poor flow properties of the input patterns.

3 Thus we will consider as irrelevant any modification of the intermediate outputs or of the internal states which does not produce any change in the last output.

In spite of the simplification brought about by this periodic sampling of the network, we are still faced with the problem of following a vector in time in a high-dimensional hyperspace (144 in our experiment). As we are
interested in the existence of fixed points for the system, one possible
solution is to measure some distance between the successive outputs and
these fixed points. Since unfortunately we have no a priori idea of the fixed-point distribution in the hyperspace for a given set of parameters, we can
instead measure the distance between two successive outputs produced by
the same input pattern. With this technique, the absence of convergence for
this quantity will signal the lack of any fixed points for the system. On the
other hand, its convergence to zero will be a strong indication of the
existence of fixed points.
In what follows, the distance between two vectors V and W (either inputs or outputs) will be defined as one minus the usual coherence function between the two vectors, i.e., their inner product divided by their Euclidean measures:

(V, W) = 1 − V·W/(‖V‖ · ‖W‖)    (2.5)

Since V·W = ‖V‖·‖W‖ if and only if V = c·W (with c positive and ‖W‖ nonzero), two vectors differing by only a positive multiplicative constant will be considered as equal.4

4 One can easily check that this distance is not a true measure, as it does not satisfy the triangular inequality.
Thus, to describe the network we will measure, for each period κ of the training sequence, the maximum and the minimum value of the quantities

(O_p^(k+κK), O_p^(k+(κ+1)K))    (2.6)

evaluated at the last layer for k = 0 to K − 1, as a function of time κ, with the time unit equal to the period of the input sequence.
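This sampling procedure is short enough to state directly in code. The sketch below assumes the last-layer output vectors have been recorded, one per pattern presentation; a guard against zero-length outputs (which the bootstrapping mechanism of the Appendix is designed to avoid) is omitted.

import numpy as np

def distance(v, w):
    """Eq. (2.5): one minus the coherence of v and w."""
    return 1.0 - np.dot(v, w) / (np.linalg.norm(v) * np.linalg.norm(w))

def strobe_curves(outputs, K):
    """Eq. (2.6): for each period kappa of the training sequence, the minimum
    and maximum distance between last-layer outputs produced by the same
    input pattern in two successive periods. `outputs` is the flat list of
    last-layer output vectors, one per presentation, with sequence period K."""
    n_periods = len(outputs) // K
    curves = []
    for kappa in range(n_periods - 1):
        d = [distance(outputs[k + kappa * K], outputs[k + (kappa + 1) * K])
             for k in range(K)]
        curves.append((min(d), max(d)))  # the lower and upper curves of Figs. 4-6
    return curves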

3. EXPERIMENTAL ADAPTIVE MACHINE


In what follows we will describe a particular adaptive network which
we used in order to test the theoretical ideas presented in Section 2. Among the very many cell structures,(5-10) we chose the one shown in Fig. 2, which was introduced by Fukushima.(7) The details of the mathematical formulation of this algorithm are given in the Appendix. Here we will focus on its basic structure, along with the numerical values used for the simulations.
The cells we used were made of two spatial filters simultaneously fed by the n inputs to the layer. Each filter consists of n multipliers followed by a summing unit and is controlled by an adaptive module into which the outputs of all the cells are fed back, as shown by the dashed lines of Fig. 2.

Fig. 2. Schematic representation of a cell of the network.
The outputs from the two filters are then compared by a nonlinear
differential amplifier acting as a rectifier (a threshold device with a fixed
threshold equal to zero). In addition, the output of the second filter added
to 1 sets the inverse of the amplifier gain.
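Read off this description (and anticipating Eq. (A2) of the Appendix), the forward path of one layer of such cells reduces to a few lines. The function name and the dense matrix representation are ours.

import numpy as np

def layer_output(s1, s2, x):
    """Output of a layer of the two-filter cells of Fig. 2, cf. Eq. (A2):
    s1, s2 are the n x n coefficient matrices of the excitatory and
    inhibitory filters, x the n inputs to the layer."""
    u = s1 @ x  # filter 1: template comparison (excitatory connections)
    v = s2 @ x  # filter 2: "linear power" of the inputs (inhibitory connections)
    # Rectifying differential amplifier: threshold at zero, gain 1/(1 + v)
    return np.maximum(0.0, (u - v) / (1.0 + v))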
The effect of these filters is determined by the actual adaptive process.
In a pattern recognition automaton, the goal is to increase the distance
between the outputs produced by the different training patterns and to
broaden the equivalence classes associated with them. To achieve this, the
adaptive process is implemented as follows. The output of each cell is
locally compared to outputs from other given cells in the same layer. The
states of the cells producing local maxima are then changed by adding a
part of the input vector to the coefficients of the first filter, thus producing
filters better matched to this input. In this way the first set of filters acts as
a template comparator and is referred to as the excitatory set of connections. At the same time, a constant term related to the average input value
is added to the coefficients of the second filter, leading to a measure of the
"linear power" of all positive inputs, along with a normalization of the
output vectors. This is done in order to make them independent of the
number of times the first filter has been modified by the same input. This
second set of coefficients will be referred to as the inhibitory set of
connections. Finally, the gain of the differential amplifier is set by a quantity Q0 which measures the ratio of excitation to inhibition, or equivalently, the amount of information flowing through the network, as shown in the Appendix.

Fig. 3. Actual implementation of the network. All the cells inside the squares act as inputs for the cell at the top of the cones. The insert shows the random set of connections for the cells as indexed by a random permutation. The shaded diamond shows the range of comparison between outputs of the same layer during the adaptive process.
The automaton was then simulated in the following configuration:
(1) A three-dimensional structure made up of three layers (p = 3)
with the input and output vectors folded into 12 by 12 square matrices (i.e., n = 144).
(2) The connections between the inputs of one layer and the outputs
of the preceding one were separated into two sets. Through a first set of
connections, each cell of a given layer was connected to its 25 nearest neighbors in the previous one (see Fig. 3). A second set of connections had a random character and it connected each cell to another 25 cells of the previous layer through assignments made with a random number generator. This random set of connections, wired into the network and therefore independent of the adaptive process, was first introduced by Fukushima(7) in order to extend the range of interactions between cells in different layers. These cell-cell interactions were implemented through given matrices B_l, which thus have only 50 nonzero coefficients.
(3) The range of comparison between outputs of a layer during the
adaptive process was determined by the outputs of the 12 nearest neighbors
(a 5 × 5 diamond configuration within the same layer, as shown in Fig. 3
by the shaded area).
(4) Each layer was split into two parts, the first one remaining as an adaptive layer, updated by the output of a second layer with the same structure but given filters. This second step provided an edge enhancement process for the intermediate output vector, as shown in the Appendix. As far as lateral inhibition was concerned, each cell was connected to the 49 nearest neighbors of the adaptive layer in a similar fashion as the adaptive connections.
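To make the wiring of items (1)-(3) concrete, the following sketch builds the connection masks as data. The border handling of the 25-neighbor set and the treatment of overlaps between the two sets are our assumptions; the paper only states that each matrix B_l carries 50 nonzero coefficients per cell.

import numpy as np

n, side = 144, 12   # (1) each layer folds its 144 cells into a 12 x 12 matrix

def local_mask(radius=2):
    """(2) First connection set: the 25 nearest neighbors (a 5 x 5 square) in
    the previous layer. Cells near the border see fewer neighbors; the paper
    does not spell out its border handling, so this is an assumption."""
    B = np.zeros((n, n))
    for i in range(n):
        r, c = divmod(i, side)
        for dr in range(-radius, radius + 1):
            for dc in range(-radius, radius + 1):
                if 0 <= r + dr < side and 0 <= c + dc < side:
                    B[i, (r + dr) * side + (c + dc)] = 1.0
    return B

def random_mask(rng, k=25):
    """(2) Second, hard-wired random set: 25 additional cells of the previous
    layer per cell, drawn once and never changed by the adaptive process."""
    B = np.zeros((n, n))
    for i in range(n):
        B[i, rng.choice(n, size=k, replace=False)] = 1.0
    return B

rng = np.random.default_rng(0)
B = np.clip(local_mask() + random_mask(rng), 0.0, 1.0)
# The paper's B_l carry exactly 50 nonzero coefficients per cell; here the
# two sets may overlap, so a faithful version would redraw overlapping entries.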

4. RESULTS OF ADAPTIVE EXPERIMENTS
With the specifications given above, we studied the dynamics of the
network for two training sets of input patterns as a function of the ratio of
excitation to inhibition Q0. The first training set consisted of all the
horizontal and vertical full lines (12 dots long) which could be arranged
into the square input matrix, whereas the second set was composed of the
26 capital letters A to Z plus the ten digits 0 to 9, arranged in a 9 × 11 matrix within the 12 × 12 array.
Typically, the maximum and the minimum of the quantities defined by
Eq. (2.6) were measured over many periods (from 60 to 600) of the input
pattern sequence and their decimal logarithm was plotted as a function of
time (in units of pattern sequences). This is shown in Figs. 4-6. The lower curve corresponds to the minimum distance measured for some patterns and the upper one denotes the maximum distance.
For both sets of patterns, the best convergence properties for the
network, as measured by these curves, were found for Q0 ≈ 2. As expected, the time to reach a fixed point was longer for the more complicated set of input patterns. As Q0 was decreased or increased away from that value, we found that the convergence of the adaptive process was altered, as shown in Figs. 4-6.
In particular, with the input set composed of lines, we discovered (Fig.
5d) a periodic behavior in a very narrow range of parameter values, i.e.,
1.48 < Q0 < 1.51. It was characterized by rapid oscillations of the distance

after many passes of the complete set of inputs. This process represented a dynamical state of the network such that the relative strengths of its connections changed in a cyclic manner as the training set was presented over and over again. Furthermore, for values of Q0 between 1.46 and 1.48, we found (Fig. 5b) a chaotic regime, with the distance varying erratically between zero and small values for several periods of the input sequence. We should also point out that these periodic and chaotic behaviors were entangled with regimes for which the network flowed toward a fixed point, the ranges of existence for each of them being very narrow.

Fig. 4. The distance (V, W) as a function of time for a training set of lines. Data obtained for values of Q0 = 1.2, 1.8, 2.4, and 3.0. The time unit for this figure, and the following two, is defined as the time to process a complete set of input patterns.

Fig. 5. The distance (V, W) as a function of time for a training set of lines. Data obtained for Q0 = 1.45, 1.47, 1.48, 1.49, and 1.55.

Fig. 6. The distance (V, W) as a function of time for the alphabet training set. Data obtained for Q0 = 1.2, 2.0, and 2.2.
Another interesting phenomenon is illustrated in Fig. 4d. For long
times the network shows monotonic convergence towards a self-organized state with a simple fixed point, only to start unraveling itself at later times.
Although all these phenomena exist for different input sequences, the exact
numerical values of Q0 associated with them depend on the actual input
sequence; see, for example, the adaptive behavior for an alphabetic training
set depicted in Fig. 6a.
These results are to be contrasted with dynamical systems with few
degrees of freedom, in which the sequences of attractors one observes are
both simpler and immune to external perturbations. (11) The reason for this
difference seems to be due to the presence of a few patterns which during
the adaptive process start producing weaker outputs. As this process
continues, a particular pattern ends up producing a zero output vector, thus
leading to a bootstrapping procedure to recover from this situation. This in
turn produces a change in the distributed memory of the network in such a
way as to take it away from its fixed point. One can conclude from this
observation that minor perturbations can eventually drive a complex net-
work away from its fixed points.

5. EXPERIMENTS ON THE SELF-ORGANIZED NETWORK
In what follows we will describe experiments which are performed on
the network after the adaptive process took place. These experiments used
the network in a pattern recognition mode as a probe of its final state, thus
studying the filtering properties of the network and how they were related
to the training set of patterns.
With the state of the network encoded in its final state arrays S_l, we computed, and stored as templates, all the output patterns O_p^(k) produced by the different input patterns I_1^(k) of the training set. Typical output
patterns are shown in Fig. 7; as can be seen, they range from having only
one nonzero component (Figs. 7a and 7b) to having several cells with
positive values (Figs. 7c and 7d). We should point out that most of the
output patterns obtained for all values of Q0 showed this latter behavior.
For each pair of training input patterns {I_1^(k), I_1^(k')} we then measured both their mutual distance, as defined by Eq. (2.5), i.e.,

d_I = (I_1^(k), I_1^(k'))    (5.1)

and the respective mutual distance of their output vectors at the last layer:

d_O = (O_p^(k), O_p^(k'))    (5.2)
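Computing these quantities for all pairs of training patterns is immediate; a starred entry of Table I below corresponds to a pair with d_O < d_I. The function names in this sketch are ours.

import numpy as np

def distance(v, w):
    """Eq. (2.5), reused for Eqs. (5.1) and (5.2)."""
    return 1.0 - np.dot(v, w) / (np.linalg.norm(v) * np.linalg.norm(w))

def discrimination_pairs(inputs, outputs):
    """For every pair of training patterns, the input distance d_I (Eq. (5.1))
    and the last-layer output distance d_O (Eq. (5.2)). The final flag marks
    pairs with worse discrimination after processing (starred in Table I)."""
    pairs = []
    for k in range(len(inputs)):
        for kp in range(k + 1, len(inputs)):
            d_i = distance(inputs[k], inputs[kp])
            d_o = distance(outputs[k], outputs[kp])
            pairs.append((k, kp, d_i, d_o, d_o < d_i))
    return pairs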
The output correlations, as a function of Q0, produced by the pattern S
with other letters of the alphabet, are summarized in Table I. As Q0
increases from 1.2 to 2.2 one can easily see that the network evolves from a
state with very sharp discrimination between similar patterns, to a state
with broad class aggregation (or equivalence classes), as shown by the underlined values.5

Fig. 7. Input and output patterns for a character and a line, at Q0 = 1.2, 2.1, and 3. The stars (*) represent the position of the strongest output, the circles (o) the positions of outputs greater than average, and the dots (.) the positions of outputs less than average.
Moreover, using this procedure for input patterns other than the
training set, we found that this result also holds for incomplete patterns

5 The equivalence classes are arbitrarily defined in such a way that the distance between the last pattern in a class and the first one excluded is a maximum.
Table I. The input and output distances between the pattern S and other letters of the alphabet for different values of Q0. The distances equal to 1 (orthogonal patterns) are omitted for clarity. The numbers with a star correspond to distances between patterns lower at the output than at the input (worse discrimination after processing). The underlined values denote patterns belonging to the same equivalence class.

                         Output (Q0)
Pattern  Input   1.2     1.4     1.6     1.8     2.0     2.2
8        0.067   0.700   0.560   0.065*  0.037*  0.002*  0.001*
9        0.101   0.559   0.623   0.106   0.092*  0.012*  0.010*
6        0.101   0.672   0.344   0.095*  0.051*  0.140   0.016*
B        0.230   0.952   0.828   0.140*  0.080*
C        0.238   0.786   0.760   0.103*  0.267   0.081*
3        0.245   0.811   0.718   0.130*  0.080*
G        0.262   0.781   0.747   0.039*  0.278   0.105*
O        0.273   0.814   0.792   0.153*  0.296   0.108*
5        0.274   0.885   0.771   0.386
Q        0.320   0.800   0.987   0.856   0.282*  0.338   0.183*
D        0.388   0.970   0.794   0.222*  0.068*
2        0.396   0.701   0.808   0.275*  0.201*  0.198*
E        0.397   0.935   0.812   0.480
P        0.444   0.936   0.830
R        0.456   0.993   0.809
F        0.500   0.991
U        0.500   0.967   0.819   0.829   0.800
Z        0.593   0.544
W        0.678   0.827   0.998   0.967
7        0.647   0.760

or noisy inputs as well. Quite generally, we concluded that the higher the
ratio between excitatory and inhibitory connections, the larger the equivalence
class and therefore the easier it is for a given input to produce an output
correlated with the learnt patterns.
A possible explanation of these results, along with those of the previous section, can be derived from the fact that Q0 measures the differential gain of the cell and thus the amount of information flowing through the network. For low values of Q0, only a few elements of the total input information are propagated from layer to layer. This leads to a high network selectivity and a consequent destruction of important elements of the input patterns. On the other hand, for large values of Q0 most of the information is propagated through the network, leading to a buildup of
filters matched to some average input. This in turn results in broad aggregation of the patterns as processed by the machine. These results,
which are also relevant to cognitive processes in neurobiology, emphasize
the importance of quantitative measurements in the study of pattern
recognition by automata.

6. CONCLUSION
Complex automata are structures situated between the dynamics of systems with few degrees of freedom and the simplifying disorder encountered in many-
body systems like gases. As such they pose a special challenge when trying
to understand their dynamical properties as a function of given parameters
and inputs.
In this paper we have shown that it is indeed possible to study in a
quantitative fashion the dynamics of their self-organization. Through the
introduction of a general methodology, we were able to obtain crisp
information on issues that are central to the understanding of data process-
ing by machines and brains. Also, by performing experiments on a particu-
lar nonlinear adaptive network, we uncovered a rich variety of behaviors
and quantified them as a function of excitation, inhibition, and connectiv-
ity. In that fashion we discovered that in addition to regimes where
asymptotic learning can take place, there exist scenarios characterized by
periodic oscillations and chaos. Moreover, experiments on the recognition
properties of the automaton led to an understanding of the dependence of
equivalence classes on excitation and inhibition. Generally speaking, we
concluded that the higher the ratio of excitation to inhibition, the broader
the equivalence class into which patterns are lumped together. This finding
might be of relevance to both pattern recognition machines and neuro-
biology.
Last but not least, we should mention the issue of universality, i.e., to what extent our results depend on the particular set of training patterns or on the automaton being simulated. Whereas they indicate that the
behavior encountered in this study does not depend on a particular pattern
sequence or type, we have only tentative conclusions concerning indepen-
dence of network architecture. Although we believe that our findings are
likely to be found in any layered automaton obeying local computational
rules, more experiments will be necessary to test this hypothesis.

ACKNOWLEDGMENTS
We have benefited from useful conversations with T. Hogg and M. Kerszberg. D. d'Humières would like to thank the Xerox Palo Alto Research Center for its hospitality during his stay. This work was partially supported by O.N.R. contract N00014-82-0699.

APPENDIX

In this Appendix we present the mathematical details of the algorithm used to implement the cell structure shown in Fig. 2. Since each cell needs two filters and n coefficients per filter, the state array S_l of layer l can be separated into two n × n submatrices s_1,l and s_2,l, whose rows s_1,l,i and s_2,l,i store the filter coefficients of the cell i. The transformation F is then given by

F(S, V) = Φ(s_1 × V, s_2 × V)    (A1)

where Φ is a transformation of R^n × R^n into R^n (where R is the real field) such that the components U_i of U, V_i of V, and W_i of W = Φ(U, V) are related by

W_i = max{0, (U_i − V_i)/(1 + V_i)}    (A2)
and where × denotes the usual matrix product. The adaptive process for the layer l is then given by an initial state defined both by setting all coefficients S_l^(0) to zero and by

S_l^(k+1) = [s_1,l^(k+1), s_2,l^(k+1)] = G_l(S_l^(k), O_l^(k))    (A3)

with

s_1,l^(k+1) = s_1,l^(k) + Ψ_l(I_l^(k), O_l^(k))    (A3a)

and

s_2,l^(k+1) = s_2,l^(k) + Ξ_l(I_l^(k), O_l^(k))    (A3b)

where Ψ_l and Ξ_l are two transformations of R^n × R^n into R^(n×n) such that the elements M_i,j of M = Ψ_l(U, V) and N_i,j of N = Ξ_l(U, V) are related to two given parameters q1 and q2 (q1 > q2), to the components U_j of U and V_i of V, and to the elements B_l,i,j of given n × n matrices B_l, through the relations

M_i,j = q1 · δ_i · B_l,i,j · U_j    (A4)

N_i,j = q2 · (M_i × U)/(B_l,i × U)    (A5)

with

δ_i = 1, if V_i = max{V_j ; j ∈ E_l,i}    (A6)

and

δ_i = 0, if V_i < max{V_j ; j ∈ E_l,i}    (A7)
where the matrices B_l provide local intercell connections between consecutive layers when needed, and where the terms E_l,i are given sets of integers between 1 and n which determine the outputs from other cells in the same layer l to be locally compared to the output of cell i.
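Put together, Eqs. (A3)-(A7) amount to a winner-take-all update of the two filter rows of each locally maximal cell. The following sketch is our reading of those equations, not the authors' exact implementation; ties in the local maximum and zero denominators in Eq. (A5) are left untreated.

import numpy as np

def adapt_layer(s1, s2, x, y, B, E, q1, q2):
    """One step of Eqs. (A3)-(A7) for a single layer: x is the layer input U,
    y its output V, B the wired mask B_l, E the list of comparison sets E_l,i."""
    for i in range(len(y)):
        # Eqs. (A6)-(A7): adapt only the cells producing a local output maximum
        if y[i] >= max(y[j] for j in E[i]):
            M_i = q1 * B[i] * x            # Eq. (A4): row i of the increment M
            s1[i] += M_i                   # Eq. (A3a): excitatory filter update
            # Eq. (A5): constant term added to the whole inhibitory row
            s2[i] += q2 * np.dot(M_i, x) / np.dot(B[i], x)
    return s1, s2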
If the inputs remain always positive, the coefficients of the state array
grow indefinitely in time as the training sequence is repeated. At the same
time, the modifications induced by each pattern become decreasingly
important. Within this context, q1 determines the relative amount of change. Simultaneously, V_i becomes large compared to 1 in Eq. (A2), and Q0 = 1/q2 then provides a measure of the differential amplifier gain, or of the amount of information flowing through the network.
Lastly, we should mention that the exact implementation of the
algorithm is slightly more complicated than that described above. This stems from the need to deal with the problems of bootstrapping the
network out of a state characterized by a null output for any input. We
thus replaced Eqs. (A4) and (A5) by

M_i,j = q · δ_i · B_l,i,j · U_j    (A8)

N_i,j = q · δ_i · (B_l,i × U)    (A9)
whenever V_i was zero and still a maximum. In this case the cells in the
neighborhood of a given one produce a zero output for nonzero input, and
it becomes necessary to build up the connections with equal weights for the
two filters. This particular process is continued with the same input pattern
until the network produces a nonzero output.
Within this scheme, Eq. (A1) is also replaced by

F(S, V) = Φ(Φ(s_1 × V, s_2 × V), f × Φ(s_1 × V, s_2 × V))    (A1′)
where f is a given n × n matrix, usually labeled lateral inhibition in brain
modeling. This is equivalent to splitting each layer in two parts, the first
one remaining as an adaptive layer updated by the output of a second one
having the same structure but given filters. The first filter selects the input
with the same index as the cell (identity transfer matrix), and the coeffi-
cients of the second filter are given by f. This second step provides an edge
enhancement process of the intermediate output vector. For example, if f is a bidiagonal matrix filled with 1/2, then V − f × V is the discrete first-order approximation to the derivative of V along its components (V is essentially the sampling of a continuous function). This same process can construct higher-order derivatives or many similar output transformations.
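For a one-dimensional output vector, this fixed second sublayer can be sketched as follows; the construction of f with 1/2 on its two off-diagonals follows the example in the text.

import numpy as np

def edge_enhance(v):
    """Second, fixed sublayer of Eq. (A1'): identity excitatory filter and a
    lateral-inhibition matrix f with 1/2 on its two off-diagonals, so that
    f @ v averages the two neighbors of each component and v - f @ v is the
    discrete differencing operation described in the text."""
    n = len(v)
    f = np.zeros((n, n))
    for i in range(n):
        if i > 0:
            f[i, i - 1] = 0.5
        if i < n - 1:
            f[i, i + 1] = 0.5
    w = f @ v
    return np.maximum(0.0, (v - w) / (1.0 + w))  # Phi(v, f @ v), as in Eq. (A2)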

REFERENCES

1. J. A. Feldman and D. H. Ballard, Cognitive Science 6:205 (1982), and references therein.
2. T. Kohonen, Associative Memory (Springer, New York, 1978); see also Parallel Models of Associative Memory, G. E. Hinton and J. A. Anderson, eds. (L. Erlbaum Associates, Hillsdale, New Jersey, 1981).
3. G. Edelman and V. B. Mountcastle, The Mindful Brain (MIT Press, Boston, 1978); W. J. Freeman, in Synergetics of the Brain, H. Haken, ed. (Springer, New York, 1983).
4. V. I. Arnold, Mathematical Methods of Classical Mechanics (Springer, New York, 1978).
5. F. Rosenblatt, Principles of Neurodynamics (Spartan Books, Washington, D.C., 1962); M. Minsky and S. Papert, Perceptrons (MIT Press, Cambridge, 1969).
6. J. S. Koford and G. F. Groner, IEEE Trans. Information Theory 12:42 (1966).
7. K. Fukushima, Systems Computer Controls 6:15 (1975); Tech. Monograph (N.H.K.) 30:178 (1981).
8. M. Osborne, IEEE Trans. Comp. C-26:1302 (1977).
9. I. Morishita and H. Yajima, Kybernetik 11:154 (1972).
10. R. Takiyama, Pattern Recognition 15:405 (1982).
11. J. P. Crutchfield, D. Farmer, and B. A. Huberman, Phys. Rep. 92:45 (1982).
