Logical Categories of Learning
Logical Categories of Learning
1000-1011
http://www.emeraldinsight.com/Insight/viewContainer.do?containerType=Journal&containerId=357
Purpose—Bateson's model of classifying different types of learning will be analyzed from a logical
and technical point of view. While learning_0 has been realized for chess playing computers, learning I
turns out today as the basic concept of artificial neural nets (ANN). All models of ANN are basically
(non linear) data filters, which is the idea behind simple and behavioristic input-output models.
Design/methodology/approach—We will discuss technical systems designed on the concept of
learning 0 and learning I and we will demonstrate that these models do not have an environment, i.e.
they are non-cognitive and therefore "non-learning" systems.
Findings—Models based on Bateson's category of learning II differ fundamentally from learning 0 and
I. They cannot be modeled any longer on the basis of classical (mono-contextural) logics. Technical
artifacts which belong to this category have to be able to change their algorithms (behavior) by their
preprint_
own effort. Learning II turns out as a process which cannot be described or modeled on a sequential
time axis. Learning II as a process belongs to the category of (parallel interwoven) heterarchical-hier-
archical process-structures.
Originality/value—In order to model this kind of process-structures the polycontextural theory has to
be used—a theory which was introduced by the German-American philosopher and logician Gotthard
Günther (1900-1984) and has been further developed by Rudolf Kaehr and others.
Keywords: machine learning, polycontexturality, standpoint dependency
Paper type—conceptual paper
Introduction
Bateson himself summarizes his logical categories of learning as follows (Bateson,
p.293):
Zero learning is characterized by specificity of response, which—right or wrong—is
not subject to correction.
Learning I is a change in specificity of response by correction of errors of choice
within a set of alternatives.
Learning II is change in the process of Learning I, e.g., a corrective change in the set
of alternatives from which choice is made, or it is a change in how the sequence of ex-
perience is punctuated.
Learning III is a change in the process of Learning II, e.g., a corrective change in the
system of sets of alternatives from which choice is made. (We shall see later that to
demand this level of performance of some men and some mammals is sometimes
pathogenic.)
Learning IV would be a change in Learning III, but probably does not occur in any
adult living organism on this earth. Evolutionary process has, however, created or-
ganisms whose ontogeny brings them to Level III. The combination of phylogenesis
with ontogenesis, in fact, achieves Level IV.
2
of behavior must be ordered according to the Theory of Logical Types. Without the
assumption of repeatable context (and the hypothesis that for the organisms which we
study the sequence of experience is really somehow punctuated in this manner), it
would follow that all »learning« would be of one type: namely, all would be zero
learning." (Bateson, 1972, p. 288)
All technical models which are known today and which have been realized fulfill the
condition of a repeatable context. The reason is very simple: All technical models
have one feature in common—they have no environment and hence no changing con-
texts. For example: A robot working at an assembly line in a car manufacturing
process only has an environment from the standpoint of an observer of both the robot
and the assembly line. From a "standpoint of the robot", however, the robot does not
have an environment. Such a robot even does not have its own standpoint. All the
"environment" which is important for the functioning of the robot such as the screws
or the car body, where the screws have to be fixed, are parts of the robot program and
therefore belong to the robot and not to its environment—these robots neither have an
environment nor an own standpoint.
Standpoint dependency is a necessity for modeling situations with changing contexts!
Classical mathematics and logic—which form the basis for any technical construct
today—do not allow modeling of standpoint dependencies. Or, to phrase it in a
somewhat shortened way: So far as mathematics is concerned, the result of 2_times_2
does not depend on standpoints and by analogy all the classic standard and non-
standard logic conceptions are non-standpoint dependent calculi—or to put it in the
terminology of Gotthard Günther they are mono-contextural calculi.
3
The question arises:
a) What is the difference between learning I and learning
II from an algorithmic point of view?
And furthermore one has to ask:
b) Why "from an algorithmic point of view" and not from
the view of logical types?
The second question already has been answered by Bateson
himself, because …
"… the word »learning« undoubtedly denotes change of some
kind. To say what kind of change is a delicate matter…. Change
denotes process. But processes are themselves subject to Figure 1: pattern for
»change« …" (Bateson, 1972, p.283) "reversal learning"
and
" …the world of action, experience, organization, and learning cannot be completely
mapped onto a model which excludes propositions about the relation between classes
of different logical type…" (Bateson, 1972, p.307)
Processes and actions can only be modeled algorithmically with the intention to im-
plement the model into a machine (cf., Kaehr, 2003).
An answer to the first question is much more difficult and has been given by the Ger-
man-American philosopher and logician Gotthard Günther who introduced the Theory
of Polycontexturality into life sciences (Günther, 1976, 1979a, 1980). Before we trace
the main idea of this theory we have to take a short look on Bateson's Notes on Hier-
archies (Bateson, 1972, p. 307):
et cetera ...
"If C1 is a class of propositions, and C2 is a class of
propositions about the members of C1; C3 then being a
class of propositions about the members of C2; how then
shall we classify propositions about the relation between
these classes? For example, the proposition »As
level_3
members of C1 are to members of C2, so members of C2
are to members of C3« cannot be classified within the
unbranching ladder of types.
The whole of this essay is built upon the premise that the ? ? level_2
relation between C2 and C3 can be compared with the C4
relation between C1 and C2. I have again and again taken ?
a stance to the side of my ladder of logical types to ?2 C3
discuss the structure of this ladder. The essay is
? ?1
therefore itself an example of the fact that the ladder is level_1
not unbranching. C2
It follows that a next task will be to look for examples of
C1
learning which cannot be classified in terms of my
hierarchy of learning but which fall to the side of this
hierarchy as learning about the relation between steps of Figure 2: The hierarchy of
the hierarchy." [emphasis by the authors] logical types: C1, C2, … see text.
4
and it has characteristics that are a part of what it is. In other words, Aristotle's Law of
Identity strictly holds, i.e., everything that exists has a specific nature. What the
pyramid of different classes (or types) in Figure 2 depicts, is the structural pattern of
an absolute hierarchy where all elements are linked by a common measure. This is the
well known world of natural sciences which—from a epistemological point of view—
belongs to an ontology of identity. In other words, Bateson's categories of learning
describe the results of different processes with attributes observed during different
learning situations. From a technical point of view, however, the central question is:
How can we model the process of learning II ? What about the transitions between the
different levels of logical types? How can these transitions be modeled in a formal
mathematical way in order to develop and to implement algorithms which are able to
learn in the sense of learning II by their own efforts?
interpreted as:
A A
O, O: O (O) = ~ O (O) (1)
Osr(O)
O level 3 O Osr O E
O sr,
A
O: Osr(O) = ~ O (Osr) (2)
O level 2 O interpreted as: Osr(Osr) = ~ O (Osr) (3)
O level 1 O Osr(Osr) O (Osr)
(a) (b) (c)
Figure 3: The problem of self-reference from a monocontextural point of view
~ : negation, ∀ : universal quantifier, ∃ : existential quantifier
5
of the object. This has been achieved in Bateson's work—describing the results of
learning processes—by different logical categories which leads to the hierarchy of
logical types (logical domains) as shown in Figure 2. However, there are no logical
operators which allow the modeling between the different logical types (domains)—
operators which become necessary if the process of learning has to be modeled and
not only the result, the content of a learning process.
Figure 3a shows the different logical types of Figure 2 using the symbolic metaphor of
equation (1). The crucial point of Figure 3a is, to understand that the different logical
domains are not mediated, they are isolated, i.e., there are no logical operators that
allow transitions between the different logical types (domains) and their elements.
And as a matter of fact any system of n logical types can always be reduced (type
reduction) to only one logical type whereby the different processes which are the
object of modeling will be homogenized to sequential process-structures that always
obey the transitivity law. Therefore these processes are always hierarchically—and
never heterarchically—structured; and any attempt of a formal logical description of
cognitive-volitive processes ends up within the thicket of notorious circuli vitiosi (see
also: Günther, 1979a; Kaehr & von Goldammer, 1988, 1989).
Figure 3b represents the process of hetero-referencing from the operator (cognitive
system) to the operand (object), a process where an image of the object is created
from which the cognitive system references on itself in order to make a distinction
between itself and its environment. This is a self-referential process. From a logi-
cal point of view, this process is a vicious circle, i.e., a logical antinomy. This has
been shown in Figure 3c. Relation (1) in equation 3c expresses the fact, that an
order relation exists between an operator and its operand, i.e., the operand cannot
become an operator of it"self". Relation (2) refers to the hetero-referential aspect
of the process and relation (3) to the self-referential aspect. Needless to say that
relation (3) is in contradiction with the self-referential situation in Figure 3b and
within this context it can be seen, that self-referentiality cannot be modeled by
recursion as suggested frequently by artificial intelligence scientists. In other
words, self-reference cannot be modeled without antinomies and ambiguities
within the linguistic frame of classical standard logic—the classical standard logic
reveals a basic weakness as an intellectual tool for modeling self-referential
processes.
O i+2 O i+1
O i+1 Oi
Oi O i-1
6
In Figure 4 we have introduced a symbol for an exchange relation between an operator
O i+1 and operand Oi+1 which belong to different logical domains respectively. Within
the mono-contextural logical world no such exchange relation exists. The logical
domains are mediated provided the exchange relation is based on logical operations
between different logical domains (contextures). In other words, in Figure 4 different
logical places come into play—a situation which has no meaning in all classical stan-
dard and non-standard logic conceptual designs. Since we are not restricted to a lim-
ited number of contextures, Figure 4 represents an ensemble of any number of me-
diated contextures. Obviously here is the ladder to escape the eye—the black hole, the
abyss—of circularities. The question is, how can we work with such a ladder?
7
such as thinking or learning. What we can observe or experience are the actions, i.e.
the "products", the content of these processes but not the processes themselves.[2]
Why is it so?
The answer can be given with reference to McCulloch's undiscovered paper "A heter-
archy of values …" (McCulloch, 1945): The structure of all mental processes is an
interplay of heterarchical and hierarchical interwoven components. A heterarchical
process structure is defined as a process where the transitivity law cannot be applied
any longer and therefore these process-structures cannot be mapped sequentially, i.e.
these process-structures never can be measured! To express it inversely: For any
measurement the transitivity law strictly holds; its validity is—so to speak—a neces-
sity for all experimental processes of measurement.
It was Gotthard Günther who provided a basis for modeling such process structures.
His polycontextural theory not only contains a many-placed logic but also a theory of
heterarchical numbers (Günther, 1976) and the pre-logical theory of morphogrammtic
as well as the pre-semiotical theory of kenogrammatic (cf., Kaehr, 2003, 2004).
In the following we will demonstrate in a short and somewhat simplified way how a
decision making process can be rationalized within the language of Günther's poly-
contextural theory, a theory which has to be considered as the basis for a standpoint
dependent systems theory.
Again three standpoints are considered which will be indexed by natural numbers.
Each number stands not only for a standpoint but also for a logical place which repre-
sents a standpoint by at least one contexture, i.e., a logical domain.[3] The following
chain of negations which is very often taken as an example in the work of Gotthard
Günther will be interpreted:
p = N1, 2, 1, 2, 1, 2 p ( 2a )
and
p = N2, 1, 2, 1, 2, 1 p ( 2b )
Where p = N1, 2, 1, 2, 1, 2 p corresponds to
p = N1 (N2 (N1 (N2 (N1 (N2 p))))) =def N1 N2 N1 N2 N1 N2 p ( 3a )
and p = N2, 1, 2, 1, 2, 1 p corresponds to
p = N2 (N1 (N2 (N1 (N2 (N1 p))))) =def N2 N1 N2 N1 N2 N1 p ( 3b )
The different (global) negations in (2) will be executed from the right to the left. The
negation N1 and N2 are defined according to the table (4a, b):
p N1 p p N2 p
1 2 1 1
2 1 2 3
3 3 3 2
( 4a ) (4b )
At the end of such a negation circle the proposition p has a "history of reflection" as
Günther calls it in the foreword of his Beiträge…(2nd volume) (Günther, 1979). The
classical negation ( ~ ) never gains such a "history of reflection". While the inter-
contextural transitions (the rejections within the negation chain) correspond to the
cognitive aspects of a cognitive-volitive process. The designation of a standpoint, of a
contexture on the other side corresponds to the volitive aspects of a cognitive-volitive
process. For a more detailed discussion on cognition and volition it is referred to the
literature, especially to Günther's "Cognition and Volition" (Günther, 1979a).
UPSHOT: The classical standard logic as well as all (classical) non-standard logics
like modal-logic, probability logic, fuzzy logic, or paraconsistent logics, etc. are truth-
definite in the sense of an ontology of identity ("something is or is not"—any third is
excluded—cf. example above). Günther calls the sciences or languages based upon
these truth-definite logics positive sciences or languages. All natural languages as well
as the artificial languages like the classical standard- and non-standard-logics or math-
ematic are positive languages. Positive languages are characterized by their (intra-
contextural) negations which always imply indirectly the corresponding positive
proposition.
Günther's negative language (Günther, 1979b) can be considered as complementary
to the artificial positive languages. The negative language is characterized by a variety
of negations (negation chains or negation circles) which operate inter-contextural (not
intra-contextural) and which are mutually mediated. Therefore any inter-contextural
negation always refers to at least one further contexture, i.e., any rejection (negation)
of a contexture (standpoint or logical place) is always related to at least one further
contexture (standpoint or logical place) as it was demonstrated above (step 1 to 6). In
other words, a contexture (standpoint or logical place) can only be negated (rejected)
in relation to (at least) one further contexture. This corresponds to a process (not a
9
state!) where the positive appears not before a contexture (standpoint or logical place)
has been designated in the sense of an affirmation. From the view of the classical logic
these negations are meaningless since all classical standard- and non-standard logics
are mono-contextural, i.e., only one contexture (one standpoint, one logical place) ex-
ists which can be located only outside but not within the contexture.
Notes
1 2005 the economists Robert J. Aumann and Thomas C. Schelling got the Sveriges Riksbank Prize
in Economic Sciences in Memory of Alfred Nobel "for having enhanced our understanding of
conflict and cooperation through game-theory analysis".
2 This is, so to speak, the quintessence of Varela's closure thesis (Varela, 1979)—Closure Thesis:
"Every autonomous system is organizationally closed ... organizational closure is to describe a
system with no input and no output ..."
3 For an implementation Günther's heterarchically structured numbers, which he called dialectical-
or keno-numbers have to be used. This is of importance in present context because the heterarchi-
cally structured system of numbers prevents any formation of a hierarchy of logical types. Using
natural numbers instead of keno-numbers is only one of the simplifications which we use in the
presented example. We also have not mentioned the proemial relationship and its importance in
Günther's Theory of Polycontexturality. In his scientific essay Strukturelle Minimalbedingungen
einer Theorie des objektiven Geistes als Einheit der Geschichte (Günther, 1980, Band 3, p. 136-
182) Günther describes the logical complexity underlying any formal description of mental
processes. Both Günther's morphogrammatic which is a pre-logical theory and his kenogrammatic
which is a pre-semiotical theory also cannot be discussed within such a short report. For more
details it is referred to the literature (cf., Kaehr, 2004).
10
References
All articles marked by (*) or (#) are available at: www.vordenker.de or www.thinkartlab.com
Bateson, G. (1972), Steps to an Ecology of Mind, Intertext Books, London — International Textboook
Company Ltd., Chandler Publ. Company.
Bateson, G. (1979), Mind and Nature—A Necessary Unity, W. Collins Sons & Co. Ldt., Glasgow.
Günther, G. (1976-1980), Beiträge zur Grundlegung einer operationsfähigen Dialektik, Band 1-3,
Felix Meiner Verlag, Hamburg.
Günther, G. (1976), "Cybernetic Ontology and Transjunctional Operations", in: Günther, G., Beiträge
zur Grundlegung einer operationsfähigen Dialektik, Band 1, p. 249-328.(*)
Günther, G. (1979a), "Cognition and Volition", in: Günther, G., Beiträge zur Grundlegung einer ope-
rationsfähigen Dialektik, Band 2, p. 203-240.(*)
Günther, G. (1979b), "Identität, Gegenidentität und Negativsprache", Hegeljahrbücher, p. 72-88.(*)
English translation by J. Paul and J. Newbury (2005), in: www.vordenker.de
Kaehr, R. and von Goldammer, E. (1989), "Again Computers and the Brain", Journal of Molecular
Electronics, Vol.4, p. S31-S37.(*)
Kaehr, R. and von Goldammer, E. (1989), "Poly-contextural Modelling of Heterarchies in Brain Func-
tions", in: Cotterill, R.M.J. (ed.), Models of Brain Functions, Cambridge University Press, p. 483-
497.(*)
Kaehr, R. (2003), Derrida's Machine, in: Kaehr, R. (ed.), URL:
http://www.thinkartlab.com/pkl/media/index.htm (#)
Kaehr, R. (2004), Skizze-0.9.5: Strukturation der Interaktivität. Grundriss einer Theorie der
Vermittlung, in: Kaehr, R. (ed.), URL: http://www.thinkartlab.com/pkl/media/SKIZZE-0.9.5-medium.pdf (#)
McCulloch, W. St. (1945), "A Heterarchy of Values Determined by the Topolgy of Neural Nets", Bul-
letin of Mathematical Biophysics, Vol. 7, p. 89-93.(*) Reprinted in: McCulloch, W.St. (1988), Em-
bodiments of Mind, The MIT Press.(*)
Varela, F. (1979), "Principles of Biological Autonomy", in: Klir, G. (ed.), General Systems Research
Vol.II, North Holland Publ., Amsterdam, p.58.
von Goldammer, E. and Kaehr, R. (1990), "Problems of Autonomy and Discontexturality in the Theory
of Living Systems", in: Moeller, D.P.F. and Richter, O. (eds.), Analyse dynamischer Systeme in
Medizin, Biologie und Oekologie, Informatik-Fachberichte 275, Springer Verlag, Berlin, p. 3-12.(*)
11