84
84
BLANK PAGE
Volume 2
JOHN LYONS
Trinity Hall, Cambridge
43 CAMBRIDGE
Sg) UNIVERSITY PRESS
Published by the Press Syndicate of the University of Cambridge
The Pitt Building, Trumpington Street, Cambridge CB2 1RP
40 West 20th Street, New York, NY 10011-4211, USA
10 Stamford Road, Oakleigh, Melbourne 3166, Australia
discourse 657
15.3. Deixis, anaphora and the universe-of-
VOLUME I
1. A model of communication page 36
2. The triangle of signification 96
3. Truth-table for the one-place negation connective 143
4. Truth-tables for the two-place connectives 144
tion of classes 158
5. Venn diagrams illustrating the union and intersec-
VOLUME 2
SMALL CAPITALS
For sense-components and other more abstract elements, or correlates,
of meaning (cf. 9.9). :
Italics
1. For forms (as distinct from lexemes or expressions: cf. 1.5) in their
orthographic representation.
2. For certain mathematical and logical symbols, according to standard
conventions.
Single quotation-marks
1. For lexemes and expressions (cf. 1.5).
2. For the citation of sentences (i.e. system-sentences: cf. 1.6).
3. For titles of articles.
Double quotation-marks
1. For meanings (cf. 1.5). )
2. For propositions (cf. 6.2).
3. For quotations from other authors.
Asterisk
For technical terms when first introduced and occasionally thereafter to
remind the reader of their technical sense.
Notes
1, When a term has been furnished with an asterisk, single quotation-
marks are not used.
2. Single quotation-marks are omitted when a sentence, expression or
lexeme is numbered and set on a different line; but italics and double
quotation-marks are still used in such circumstances.
3. In quotations from other authors, the original typographical conven-
tions have usually been preserved. Occasionally adjustments have
been made in order to avoid confusion or ambiguity.
| Preface
When I began writing this book six years ago, it was my intention to
produce a fairly short one-volume introduction to semantics which
might serve the needs of students in several disciplines and might be of
interest to the general reader. The work that I have in fact produced is
far longer, though in certain respects it 1s less comprehensive, than I
originally anticipated; and for that reason it is being published in two
volumes.
Volume 1 is, for the most part, more general than Volume 2; and it is
relatively self-contained. In the first seven chapters, I have done my
best, within the limitations of the space available, to set semantics
within the more general framework of semiotics (here defined as the
investigation of both human and non-human signalling-systems); and I
have tried to extract from what ethologists, psychologists, philosophers,
anthropologists and linguists have had to say about meaning and
communication something that amounts to a consistent, if rather
eclectic, approach to semantics. One if the biggest problems that I have
had in writing this section of the book has been terminological. It is
frequently the case in the literature of semantics and semiotics that the
same terms are employed in quite different senses by different authors
or that there are several alternatives for what is essentially the same
phenomenon. All I can say is that I have been as careful as possible in
selecting between alternative terms or alternative interpretations of the
same terms and, within the limits of my own knowledge of the field, in
drawing the reader’s attention to certain terminological pitfalls. At one
time, I had hoped to be able to follow the practice of never using non-
technically any word that was also employed anywhere in the book in
some technical sense or other. I soon had to abandon this rather quixotic
ambition! Some of the most ordinary words of English (e.g. ‘case’,
‘feature’, ‘aspect’) are employed in a highly specialized sense in lin-
Xil Preface
guistics and related disciplines; and, however hard I tried, I found it
impossible to get by without them. I trust that the context (and the
device of using asterisks for introducing technical terms) will reduce, if
it does not entirely eliminate, ambiguity and the possibility of mis-
understanding.
The last two chapters of Volume 1 are devoted to structural semantics
(or, more precisely, to structural lexicology). This is a topic that I have
been concerned with, on and off, for the best part of 20 years; and,
although the so-called structuralist approach to semantics is no longer
as fashionable among linguists as it once was, I still believe that it has
much to contribute to the analysis of language.
Volume 2 may be read, independently of Volume 1, by anyone who 1s
already familiar with, or is prepared to take on trust, notions and dis-
tinctions explained in Volume 1. In Volume 2, which (apart from the
chapter on Context, Style and Culture) is concerned with semantics
from a fairly narrowly linguistic point of view, I have been tempted to
do something more than merely clarify and systematize the work of
others; and this accounts for the fact that the book, as a whole, has taken
me far longer to write than I had expected it to take. Five of the eight
chapters in Volume 2 -two of the three chapters on Semantics and
Grammar, the chapter on Deixis, Space and Time, the chapter on Mood
and Illocutionary Force, and the chapter on Modality — contain sections
in which, unless I am mistaken, there are a few ideas of my own.
Caveat lector ! | !
As I have said, the book is, in certain respects, less comprehensive
than I intended. There is nothing on etymology and historical seman-
tics, or on synonymy; and there is very little on the structure of texts
(or so-called text-linguistics), or on metaphor and style. If I had dealt
with these topics, I should have had to make my book even longer.
Sometimes one must stop even if one has not finished! :
As I write this Preface, I am all too conscious of having just moved
from Edinburgh where I have now spent twelve years, in one of the
finest Departments of Linguistics in the world. Throughout this time I
have benefited, in my writing and in my teaching, from the advice and
criticisms of my colleagues in several Departments. Many of them have
helped me, as far as the present book is concerned, by reading sections
of it for me in draft and commenting upon them or by discussing (and
in some instances originating) the ideas that have found their way into
my text: John Anderson, R. E. Asher, Martin Atkinson, Gillian Brown,
Keith Brown, John Christie, Kit Fine, Patrick Griffiths, Stephen Isard,
Preface X11
W. E. Jones, John Laver, Christopher Longuet-Higgins, J. E. Miller,
Keith Mitchell, Barry Richards, and James Thorne. Ron Asher and
Bill Jones have been especially helpful: each of them has read the whole
typescript; and Bill Jones has undertaken to do the index for me. Apart
from these Edinburgh and ex-Edinburgh colleagues, there are many
others to whom I am indebted for their comments on drafts of parts of
the book: Harry Bracken, Simon Dik, R. M. Dixon, Francoise Dubois-
Charlier, Newton Garver, Gerald Gazdar, Arnold Glass, F. W. House-
holder, Rodney Huddleston, R. A. Hudson, Ruth Kempson, Geoffrey
Leech, Adrienne Lehrer, David Makinson, P. H. Matthews, G. A.
Miller, R. H. Robins, Geoffrey Sampson, the late Stephen Ullmann,
Anthony Warner. There are doubtless many errors and inadequacies
that remain but without the aid of so many friends, whose specialized
knowledge in many of the relevant fields is far greater than my own, I
should have gone astray more often than I have done.
Like all teachers, I have learned more from my students over the
years than they have learned from me. It has been my privilege to con-
duct several research seminars and to supervise a fair number of Ph.D.
dissertations on semantics during the period when I was writing this
book. T'wo of my students I must mention by name, since I am very
conscious of having derived directly from them some of the points that
appear in the book: Marilyn Jessen and Claudia Guimaraes de Lemos.
I have no doubt, however, that others of my students are also responsible
for much of what IJ think of as being original in the second volume.
I owe a special debt of gratitude to Rena Somerville who, as my
secretary in the last few years (the best secretary that I have ever had),
has typed so many versions of certain sections of my manuscript that she
could probably reproduce at least the gist of them from memory! Much
of this work she has done at home in the evenings and at the week-end:
I trust that her family will forgive me for the time that I have stolen
from them in this way.
Without the specialized assistance provided by the Cambridge
University Press this book would never have seen the light of day.
Jeremy Mynott read both volumes in typescript and made many valuable
editorial suggestions. Penny Carter was responsible for the sub-editing
and had to cope with far more inconsistencies and handwritten changes
in the typescript than an author should have been allowed to make. I am
grateful to both of them for their help and their forbearance.
Finally, I must record my gratitude to my wife and children for their
willingness to put up with my frequent bouts of depression, ill-temper
XIV Preface
or sheer absent-mindedness while I was writing the book and the
postponement of so many promised outings and holidays. More par-
ticularly, I wish to thank my wife for the love and support that she has
always given me, in my writing as in everything.
DetYN 'a eo
' a o\
'
Det N V NP
! i
i''{!'t'‘‘ttt'i‘ (t3ttN ' t ! '
|
I‘t‘t‘it$;’t
NP Poss Adj N Det N
{
tence .
For example, just as the lexically ambiguous sentence ‘They drank the
port at midnight’ is presumably not interpretable as containing ‘port,’,
on any occasion of its utterance, so the grammatically ambiguous sen-
(7) You can’t get fresh fruit and vegetables these days
(8) They are eating apples.
That these utterances each have at least two interpretations is obvious
enough. It is also clear that their ambiguity does not depend upon
homonymy or polysemy, but is of the kind that is naturally accounted
for in terms of the notion of phrase-structure. Any reasonably compre-
hensive generative grammar of English would automatically assign to
the system-sentences in correspondence with these utterances, and to
indefinitely many sentences like them, at least two different phrase-
structure analyses. Their grammatical structure is to this extent un-
controversial.
Not all aspects of the phrase-structure of (4)-(8) are uncontroversial.
Phrase-structure, in certain systems of formalization at least, involves
both grouping and categorization (cf. 10.3). There is no problem, from
this point of view, in either (5) or (7), which demonstrate the fact that
grammatical ambiguity may, in certain cases at least, be determined
solely by differences in the way in which the forms are grouped to-
gether. But (6) is not like (5) and (7) in this respect. Under one inter-
pretation of (6) ‘with a stick’ is an adjectival phrase modifying the noun
‘man’. In the other interpretation, it is what would be traditionally
described as an adverbial complement of the verb-phrase. In both cases,
however, the phrase ‘with a stick’ is composed of a preposition and a
noun-phrase. The question is, therefore, whether the phrase should be
categorized differently in the two phrase-markers in the one case as an
adjectival phrase and in the other as an adverbial phrase. If this is done,
then (6) will be grammatically ambiguous in terms both of grouping and
of categorization. Furthermore, although we have assumed that (6) is
not lexically ambiguous, it is obviously arguable that the preposition
‘with’ differs in meaning in the two cases. We do not propose to argue
that ‘with a stick’ is or is not ambiguous in terms of its syntactic
categorization or in terms of the meaning of ‘with’. The point that is
being emphasized here is that grammatical ambiguity is at least partly
dependent upon the way in which the language-system is analysed; and
this is especially so in respect of the categorization of constituents in the
402 Semantics and grammar I
about (6). :
phrase-markers assigned by a generative grammar. Much the same
comments can be made about (4) and (8) as have just been made
HES
bring it into line with the standard version of Chomskyan transforma-
tional grammar, is the introduction of the notions of deep structure and
HES
Figure 9. The so-called standard theory
surface structure. Apart from the base, there are three boxes of rules:
transformational rules (‘I’-rules), semantic rules (S-rules) and phono-
logical rules (P-rules). The output of the base is a set of deep structures
(DS), to which the S-rules (Katz & Fodor’s projection rules) apply
and yield a set of semantic interpretations (SI). The output of the T-
rules, on the other hand, is a set of surface structures, to which the
phonological rules apply and derive for each sentence its phonological
representation (PR).
The general conclusion towards which Katz and Fodor (1963), and
more especially Katz and Postal (1964), were working was the thesis
that (apart from certain rules that were responsible for what was held to
be purely stylistic variation) all of the T-rules were obligatory; and
this thesis was taken over and made part of the standard version of
transformational grammar by Chomsky (196s). It carries as an imme- _
diate corollary, by virtue of the semiotic principle that meaningfulness
implies choice (cf. 2.1), the proposition that transformations do not
affect meaning.!8 It is only in so far as this proposition is held to be true
18 The implications of this proposition and the looseness with which it was
formulated are discussed in Partee (1971).
10.5. Generative semantics 413
that one can maintain the principle that all the information that is
relevant to the semantic interpretation of a sentence is present in its
deep structure. Acceptance of this principle is made explicit in figure 9,
it will be observed, by the absence of any path from SS to SI.
It is tempting, having accepted that deep-structure identity is a
sufficient condition of semantic identity, to take the further step of
making it a necessary condition also. In effect, this is what is done by
those calling themselves generative semanticists (cf. Lakoff, 19714), who
argue that there is no need to postulate any distinction between the deep
structure of a Sentence and its semantic interpretation. Their approach
: to the construction of an integrated model of linguistic description 1s
shown in figure 10.9
of sentences. |
theory of transformational grammar defines to be deep-structure sub-
jects, objects and predicates play any role in the semantic interpretation
discusses. :
between nouns and verbs. This point, as we shall see, is particularly
relevant in connexion with one of the languages (Nootka) that Sapir
English to treat }
so, it is more or less standard practice in conventional grammars of
and ,
or
(8) When I was in prison, I ate anything that was put in front of me
(9) When I was in prison, I ate everything that was put in front of me
shows that it is at least initially plausible to relate ‘any’ to ‘every’. But
there are problems here too.
There are many contexts in which there is a very clear semantic
difference between ‘any’ and ‘every’. For example,
(10) I don’t know anyone here
and
(11) I don’t know everyone here
differ far more obviously in their truth-conditions than (6) and (7) do.
But this can be accounted for, it is suggested, in terms of the relative
scope of the operators of negation and quantification, (10) being analysed
aS
(10a) (x)( ~(I know x)): “For all values of x, it is not the case that I
know x”’
and (11) as
(11a) ~((x) (I know x)): “It is not the case that, for all values of x,
I know x”’.
More generally, it has been proposed that ‘any’ is (or is the English-
language equivalent of) a quantifier of universal quantification which
has wider scope than any other operator of negation, quantification or
| modality. This principle accounts, not only for (10) and (11), analysed
along the lines of (10a) and (11a), but also for many other more complex
combinations of logical operators and their alleged English-language
equivalents.
However, there would seem to be certain exceptions to the principle
that ‘any’ always takes the widest possible scope in any clause in which
it occurs. For example, the following two sentences do not differ in
meaning as obviously as (10) and (11) do:
(12) Some remedies can cure anything
(13) Some remedies can cure everything.
Under one of its possible interpretations, (12) can be analysed, from a
11.4. Determiners, quantifiers and classtfiers 459
logical point of view, as meaning roughly “There is/are some x such
that, for all values of y, x can cure y’’; and this interpretation (in which
‘some’ is to be construed as having non-specific reference) cannot be
accounted for in terms of the principle that ‘any’ always has wider scope
than ‘some’.
There are also certain rather subtler differences between ‘any’ and
‘every’ which cast doubt upon the identification of ‘any’ with the
universal quantifier. First of all, ‘any’ and ‘every’ differ in certain con-
texts with respect to existential presupposition. For example, whereas
(14) I don’t know everyone with yellow lips
would not normally be uttered (except as a context-dependent denial:
cf. 16.4) unless the speaker was prepared to commit himself (if challen-
ged, on this score) to a belief in the existence of people with yellow lips,
(15) I don’t know anyone with yellow lips
is quite neutral with respect to the existence or non-existence of people
with yellow lips.
More serious than the problem of existential presupposition, how-
ever, is the fact that the truth-conditions of sentences containing ‘any’,
unlike the truth-conditions of otherwise identical sentences containing
‘every’ or ‘some’, are rather obscure. If it is asserted that every member
of a particular set of mountaineers, {a, b, c, d}, is capable of climbing
Everest, the person making this assertion 1s committed to the truth of the
following four propositions:
(16) “‘a can climb Everest’’
(17) “db can climb Everest”’
(18) “ec can climb Everest”’
(19) “d can climb Everest”’.
Anyone asserting that some member of the team is capable of climbing
Everest (where ‘some’ is to be construed non-specifically) is committed
to the truth of the disjunction of the same four propositions. At first
sight it might appear that the truth-conditions of
(20) Any member of the team can climb Everest
are the same as the truth-conditions of
(21) Every member of the team can climb Everest.
But, arguably, this is not so. Unlike (21), (20) neither says nor implies
460 Semantics and grammar II
that there is any more than one member of the team who is capable of
climbing Everest. In this respect, it is like
(22) Some member of the team can climb Everest.
But (20) differs, nonetheless, from both (21) and (22). The proposition
expressed by (21) entails, but is not entailed by, the proposition expres-
sed by (20); and the proposition expressed by (20) entails, but is not
entailed by, the proposition expressed by (22). So much is clear enough.
The problem is to formulate the truth-conditions of (20) in such a way
that they bring out these semantic differences; and no-one appears to
have done this yet.
Vendler (1967: 70-96) has emphasized the fact that when we utter
sentences like (20) what we do in effect is to throw down a challenge:
‘‘Pick some member of the team and I assert of the member that you
pick that he is capable of climbing Everest’”’. This analysis of the
meaning of ‘any’ cannot be accommodated, in any straightforward way,
within the framework of truth-conditional semantics. But it does have
the advantage that it accounts very naturally for the fact that ‘any’ tends
to occur in a variety of what may be referred to, loosely, as modal
contexts; and the particular grammatical or quasi-grammatical relation-
ship that holds between ‘some’ and ‘any’, which was mentioned earlier,
reflects this tendency.
Limitations of space prevent us from discussing the equivalents, or
near-equivalents, of ‘some’, ‘any’, ‘every’, ‘all’, etc., in other lan-
guages. All that can be said here is that, although they may not present
exactly the same problems of analysis as the English determiners and
quantifiers, there is no reason to believe that they are any more satis-
factorily analysable in terms of the existential and universal quantifiers
than the English determiners and quantifiers are.
Semanticists have devoted far less attention to classifiers* than they
have to determiners and quantifiers. The reason, no doubt, is that,
although very many of the world’s languages make use of classifiers, the
more familiar Indo-European languages do not. What is meant by the
term ‘classifier’ (in the sense in which it is being used here) is best
explained by means of an example.
If one wanted to translate into Tzeltal (a Mayan language spoken in
Mexico) the English phrases ‘three trees’ or ‘four men’, one would
have to use, in each instance, a three-word phrase, rather than a two-
word phrase (cf. Berlin, 1968: ‘o8-tehk te?’ (‘three trees’), ‘Can-tul
winik’ (“four men’’)). The first word in these phrases (‘o8’, ‘Can’) is a
11.4. Determiners, quantifiers and classifiers 461
numeral; the third (‘te?’, ‘winik’) is a common noun; and the second
(‘tehk’, “tul’) is a classifier. Classifiers are syntactically obligatory
elements in all such expressions; and the classifier that is employed
depends upon the nature of the entity or set of entities that is being
referred to. The classifier for plants is ‘tehk’; the one for human beings
is ‘tul’; the one for animal is ‘koht’; and so on.
Tzeltal is typical of what we may refer to, loosely but conveniently, as
classifier-languages, in that classifiers are obligatory in phrases con-
taining numerals. (Hence the alternative term, ‘numeral classifier’,
which 1s frequently used in the literature.) There are many languages,
including Mandarin Chinese and Vietnamese (cf. Chao, 1968; Emeneau,
1951), in which classifiers are also obligatory with demonstratives: i.e.
in phrases which might be translated into English as ‘that tree’, ‘this
man’, etc. In such languages, it is commonly, if not universally, the case
that there is a special pluralizing classifier, which occurs with demonstra-
tives (but not with numerals) and replaces the semantically appropriate
classifier that would be used in non-plural constructions (cf. Greenberg,
1972). For example, (i) ‘one book’, (ii) ‘three books’, (iii) ‘this book’,
and (iv) ‘these books’ are translated into Mandarin Chinese as (i) ‘i ben
shu’, (ii) ‘san ben shu’, (itt) “che ben shu’ and (iv) ‘che hsie shu’. The
word ‘ben’ is the classifier used, in general, for flat objects, whereas
“hsie’ can be used for any kind of plurality or collectivity. One way of
bringing out the difference is by translating the Chinese expressions
tivity book’. ,
into a kind of Quasi-English as follows: (i) ‘one flat-entity book’, (ii)
‘three flat-entity book’, (iii) ‘this flat-entity book’ and (iv) ‘this collec-
assumption. :
indicative. However, nothing of what is said below depends upon this
* On the copula in various languages, cf. Asher (1968), Christie (1970), Ellis &
Boadie (1969), Kahn (1973), Kiefer (1968), Lehiste (1969), Li (1972), Lyons
(1967) and several articles in Verhaar (1967-73). Of particular interest is
Kahn (1973), which is much broader than the title would suggest and includes
a valuable discussion of the philosophical issues.
472 Semantics and grammar ITI
ascriptive sentences; (ii) the subject and the complement of equative,
but not ascriptive, sentences are freely permutable. The difference
between equative and ascriptive sentences is clear enough if we com-
pare the sentences ‘John is the chairman’ with ‘John is intelligent’: ‘the
chairman’ is a nominal (NP) and ‘intelligent’ an adjective; and the
nominal, but not the adjective, is permutable with the subject-NP. (The
utterance Intelligent 1s fohn is of course acceptable; but it is stylistically
restricted, and it 1s associated with a very particular intonation-pattern.
Whether one should generate a system-sentence ‘Intelligent is John’ to
account for it is, to say the least, debatable.) Adjectives cannot occur as
equative complements (in sentences that have an NP-subject). It is the |
difference between the sentences ‘John is the chairman’ and ‘John 1s a
writer’ that is both less obvious and (it must be admitted) more contro-
versial; and this difference has been obscured in many treatments of
English by classifying both kinds of complements as nominal (in what
is arguably an equivocal use of the term ‘nominal’). ‘The occurrence of
the indefinite article form a@ is a purely automatic consequence of the
fact that the subject-NP is singular and ‘writer’ is a countable noun.
The complement in the ascriptive sentence is not, therefore, the NP-
expression ‘a writer’, but the N-expression ‘writer’.
Generally speaking, an equative complement (in a sentence with an
NP-subject) can be a proper name, a pronoun or a definite noun-phrase,
but not an adjective; and an ascriptive complement can be a noun or an
adjective, but not a pronoun or a proper name. True, there is the prob-
lem that definite noun-phrases may also occur as ascriptive comple-
ments. But this problem is solved, we assume, by invoking the notion of
grammatical ambiguity (cf. 10.4). We take ‘John is the writer’, and more
obviously ‘John is the author of this book’, to be grammatically am-
biguous in terms of the distinction between equative and ascriptive
sentences. This distinction also accounts, it should be noted, for two
different kinds of apposition*: cf. “The chairman, John Smith, proposed
a wote,of thanks’ ws. ‘The chairman,,a prominent local author, proposed
a vote of thanks’.
othe semantic distinction between equative and ascriptive structures
is/that the former are used, characteristically, to identify the referent of
one expression with the referent of another and the latter to ascribe to
the referent of the subject-expression a certain property. The equative
copula is, therefore, the linguistic correlate of the identity-operator in
mathematics or logic; and, whether or not it is “‘a disgrace to the human
race’, as Russell once remarked, that the same copula is used in many
12.2. Predicative structures 473
languages in both equative and ascriptive sentences (cf. Kahn, 1973: 4)
it is certainly important, in the semantic analysis of statements, to dis-
tinguish those which answer the question of the form What ts fohn?
from those which answer the question Who ts fohn? 'The sentence ‘John
is the chairman’ can be used to answer either of these two questions.
Similarly, Russell’s (1905) famous sentence ‘Scott is the author of
Waverley’ can be used either to answer the question Who 1s Scott? (and
this is the sense in which Russell took it: cf. 7.2) or, to make the point
very crudely, to comply with the instruction Tell me something about
Scott. The fact that an equative sentence with an NP-subject must have
an NP-complement is a natural, if not inevitable, reflex of the fact that
such sentences have as their most characteristic function that of identi-
fying an entity referred to by means of one expression with an entity
referred to by another expression. So too is the fact that the subject-NP
and the complement-NP are permutable.
Let us now turn our attention to sentence-nuclei containing a locative
complement: cf. (5) and (5a) above. Locative expressions are not
generally recognized as constituting a major class of sentence-con-
stituents on a par with nominals and verbs. In traditional grammar, they
are treated as just one of several subclasses of adverbs or adverbial
phrases. Unlike all other adverbials, however, locative expressions may
be used predicatively (as complements of the copula in English) with
first-order nominals as their subjects. That locative expressions may be
used in this way is hardly surprising. The location of the persons,
animals and things with which we interact in everyday life is no less
interesting and important to us than their actions and physical or other
properties. Where is X? 1s as natural a question as What is X doing? or
What is X like?; and the grammatical structure of English and other
languages reflects this, in that we can say of an entity where it is (or where
it has been, was or will be) without saying what it is like, what it is
doing, what is happening to it or anything else about it. Locative
adverbials may be used as readily as verbs, adjectives and nominals in
the nuclei of kernel-sentences; and they may also be used (like various
other kinds of adverbials) as extra-nuclear adjuncts.
The distinction between locative and possessive complements is, at
first sight at least, straightforward enough in English. It is worth noting,
however, that the term ‘possessive’, as it is traditionally employed by
linguists, is somewhat misleading: it suggests that the basic function of
the so-called possessive constructions that are found in many languages
is the expression of possession or ownership. Generally speaking, however,
474 Semantics and grammar III
a phrase like ‘X’s Y’ means no more than “‘the Y that is associated
with X’’; and the kind of association holding between Y and X is
frequently one of spatial proximity or attachment. It can be argued that
so-called possessive expressions are to be regarded as a subclass of
locatives (as they very obviously are, in terms of their grammatical
structure, in certain languages). We will not press this point here (cf.
Lyons, 1968: 388ff). We are more concerned to discuss the distinction
between locative adverbials and nominals and, with it, the distinction
between equative structures like (3) and predicative locative structures
like (5). In doing so, we shall see that there is a further distinction
to be drawn between entity-referring nominals and place-referring
nominals,
That there is a semantic difference between identifying the referent of
one nominal with the referent of another nominal, on the one hand, and
saying of the referent of a nominal that it is located in a certain place, on
the other, is obvious enough, provided that the referent of the nominal
is an entity, rather than a place. But nominals in English may also refer
to places (e.g., ‘London’, ‘that field over there’); and, just as it is
possible to identify two entities, so it is possible to identify two places
(cf. London 1s the capital of England). It is also possible to say of one
place that it is contained within (and is therefore part of) another place
(cf. London is in England). |
The difference between locative adverbials and place-referring nomi-
nals is not, in fact, clear-cut in all syntactic positions in English. For
example, the demonstrative adverbs ‘here’ and ‘there’ and the demon-
strative pronouns ‘this’ and ‘that’ are equally appropriate as substitutes
for ‘this place’/‘that place’ in an utterance like Thzs/that place is where we
agreed to meet. Furthermore, there are many utterances of this kind that
are ambiguous according to whether they are construed as having an
equative or a predicative nucleus. Indeed, since the verb ‘be’ serves as
both an equative and a locative copula in English, it is difficult at times
to be sure that a particular sentence expresses the proposition “‘X be
(identical with) Y”’ or “X be at/in Y”’, if both X and Y are places. For
example, London ts where I met him, unlike My friend is where I met him,
is in principle ambiguous, although it would normally be understood as
equative (cf. “London is the place where . . .’’ vs. “‘ London is in the
place where .. .’’). Of course, the difference between “‘X be (identical
with) Y”’ and “X be at/in Y”’ disappears in the limiting case in which X
and Y are spatially co-extensive; and it is because the boundaries of the
referents of such locative adverbials as ‘here’ or ‘there’ (or ‘where I met
12.3. Locative subjects 475
him’) are indeterminate that we cannot say that a sentence like ‘London
is where I met him’ necessarily has an equative nucleus.
The difference between entities and places is something that we shall
come back to in a later chapter (cf. 15.5). It is sufficient for our present
purpose to emphasize the fact that there are many nominal expressions
in English which can be understood as referring either to entities or to
places according to the context in which they are used. For example,
‘the church’ or ‘the house’ may refer to a physical entity, which,
though it is normally located in a particular place, would still be
identifiable as the same thing if it were moved to another place. But the
same expressions may also refer to places (or spaces) within which other
entities are located: cf. fohn is in the church. The question, therefore,
arises whether all such expressions (unlike those which may be used to
refer only to entities or only to places) should be regarded as inherently
ambiguous. If they are so regarded, their ambiguity might be most
satisfactorily accounted for by recognizing a deep syntactic distinction
in English between locative and non-locative nominals. Without pur-
suing this point any further for the present (cf. 15.5), we turn now to the
more controversial topic of locative subjects.
cold’. , :
(8) Loc (+Cop)-+A.
If this is the structure that underlies ‘It is cold in London’, the question
now arises whether it is also the structure that underlies ‘London is
things. |
borne in mind in any consideration of the traditional definition of com-
mon nouns as lexemes which denote classes of persons, places and
12.4. Valency
Our discussion of the structure of sentence-nuclei has been deliberately,
though tacitly, restricted so far by the assumption that it is the function
482 Semantics and grammar III
of propositions to ascribe properties to entities. As we saw towards the
end of the last section, however, it is only by stretching the sense of the
term ‘property’ considerably beyond its normal usage that we can refer
to the location of an entity as one of its properties (i.e. as one of its
distinctive attributes). In fact, it is only for a relatively small subset of
the propositions expressed by the sentences of English (and other
natural languages) that the notion of ascribing properties to entities is at
all satisfactory. We are more often concerned to describe the events and
processes in which persons, animals and things are involved than we are
to describe either the essential or contingent qualities of persons,
animals and things. Furthermore, much of what can be described in
terms of static properties or relations ~ in terms of the physical attri-
butes of an entity or its involvement in some state-of-affairs — can also
be described in the more dynamic terms of potentiality for action or
interaction. ‘To say that something has the physical property of hard-
ness, for example, is to say that it will resist pressure; to say that
something is located in such-and-such a place is to indicate where one
must go or direct one’s gaze in order to obtain or find the thing in
question.
It has been plausibly suggested, in several recent works on the
acquisition of language by children, that the earliest and most basic
grammatical constructions that are identifiable in children’s speech can
be accounted for as the product of what Piaget has called sensorimotor
intelligence (cf. 3.5).2 According to Piaget, cognition develops on the
basis of the child’s interaction with the persons and things in his
environment: it is by virtue of his operation upon them and their opera-
tion upon him that he comes to know their properties and to categorize
them conceptually. By the time that the child reaches the final stage in
the development, or maturation, of sensorimotor intelligence (when he is
between 18 and 24 months old), he can not only draw attention to, or
comment upon, persons and things in the environment that engage his
interest, but also comment upon their absence or disappearance and ask
where they are. From this it may be inferred that he has acquired a
conception of the existence and continuous identity, through time, of
what we are calling first-order entities. Among these, there are some that
he now knows to be, like himself, self-moving and others (himself, his
parents, etc.) that he knows to be both self-moving and capable of
operating, in various ways, upon other entities; and this, it is suggested,
. 3 'The Piagetian point of view is well explained in H. Sinclair (1972, 1973).
Cf. also Bates (1976), Brown (1973), Nelson (1974).
12.4. Valency 483
provides the basis for the child’s earliest intuitions of such semantically
and grammatically important notions as animacy and agency. An agent
is, initially at least, any entity that is capable of operating upon other
entities, effecting some change in their properties or their location; an
animate being is one that is able to move itself without the intervention
of any external agency.
The child’s conception of animacy and agency may or may not
develop in this way. An alternative view is that it is innate. But whether
it is innate or not, there is little reason to doubt that it is universal in
men; and it constitutes an important part of naive realism (cf. 11.3).
The conceptual framework within which we organize and describe our
perceptions of the physical world, whatever language we speak, is one in
which we can identify, not only states-of-affairs of shorter or longer
duration, but also events, processes and actions. |
There is, unfortunately, no satisfactory term that will cover states, on
the one hand, and events, processes and actions, on the other. We will
use the term situation* for this purpose; and we will draw a high-level
distinction between static and dynamic situations. A static situation (or
state-of-affairs, or state*) is one that is conceived of as existing, rather
than happening, and as being homogeneous, continuous and unchanging
throughout its duration. A dynamic situation, on the other hand, is
something that happens (or occurs, or takes place): it may be momen-
tary or enduring; it is not necessarily either homogeneous or continuous,
but may have any of several temporal contours; and, most important of
all, it may or may not be under the control of an agent. If a dynamic
situation is extended in time, it is a process* ; if it is momentary, it is an
event*; and, if it is under the control of an agent, it is an action*.
Finally, a process that is under the control of an agent is an activity*;
and an event that is under the control of an agent is an act*.
What precisely is involved in the notion of agency is a difficult ques-
tion; and one that we will not go into here.t We may think of the
paradigm instance as being one in which an animate entity, X, inten-
tionally and responsibly uses its own force, or energy, to bring about an
event or to initiate a process; and the paradigm instance of an event or a
_ process in which agency is most obviously involved will be one that
results in a change in the physical condition or location of X or of some
other entity, Y. Each of the features that have been singled out for
* On agency and causality, cf. Cruse (1973), Davidson (1967), Fodor (1970),
Givoén (1975), Kastovsky (1973), Kenny (1963), Kholodovié (1969), Miller &
Johnson-Laird (1976), Wierzbicka (1972).
484 Semantics and grammar ITI
mention here — animacy, intention, responsibility and the use of its own
internal energy-source — is separable, in non-paradigm instances, from
each of the others. It is a fair assumption, however, that languages are
designed, as it were, to handle the paradigm instances; and it is only to
be expected that the applicability of notions like agency should be
unclear in non-paradigm instances.
If it is the case that the child first operates, at the level of practical
intelligence, with a concept of agency as physical manipulation, he very
soon learns that he can obtain things that he wants, not only by grasping
them himself, but also by using language to get others to pass them to
him. Indeed, the child’s realization that he can influence the behaviour
of other agents by emitting appropriate vocal and non-vocal signals
indicative of his interests and desires comes long before there is any
evidence that he is beginning to master the grammar or vocabulary of
any particular language-system; and it is upon this basis that the child
develops his understanding of the instrumental function of such
utterance-acts as requests and commands.
We shall be going into this question in some detail in a later chapter
(16.2). Here we are concerned to point out that language-behaviour
itself is activity, in the course of which acts of various kinds are per-
formed, over and above the purely physical, or physiological, acts in-
volved in the production of utterance-signals. What is now commonly
referred to as the theory of speech-acts* rests upon this fact (cf. 16.1). :
So too does our decision to apply the term ‘situation’, not only (as is
customary) to the contexts, or settings, in which utterances are produced
(cf. 14.1), but also to the states, events and processes that are described
by utterances. We must be careful, of course, to make it clear what is
being referred to when we correlate a particular situation with a par-
ticular utterance. It is important to realize, however, that there is no
terminological equivocation involved. The sense in which we have
introduced the term ‘situation’ in this section is broader than, but it
subsumes, the sense in which it is used to refer to the context, or setting,
of an utterance. To make a statement is to engage, as an agent, in a
particular kind of situation, which is related, by virtue of the descriptive
function of language, to another situation; and the situation that is
described by the propositional content of a statement may itself be that
of making a statement or some other kind of utterance (cf. He said that
it was raining, He toldlasked me to open the door).
Of particular interest, in the present connexion, is the fact that
languages can be used, not only descriptively, but also instrumentally:
12.4. Valency 485
as tools with which to operate upon situations and change them. What
this means is that certain utterances may operate as components of
situations that are causally related to other situations. For example, the
command Open the door! may operate in some antecedent situation of
which the consequent situation of the door being open is the effect.
Generally speaking, the instrumental function of language involves
indirect, rather than direct, causation: the effect that is achieved is not
brought about directly by the utterance itself (as it is in the case of
utterances that are held to have a magical or sacramental character — cf.
Ali Baba’s Open sesame!), but by a secondary agent, to whom the
utterance is addressed. The initiating agent imposes his will upon the
effective agent by uttering an appropriate request or command. There
are, of course, various ways of indirectly causing something to happen
as an initiating agent. What is being emphasized here is the fact that the
instrumental function of language is explicable within the more general
account of agency, instrumentality and causation presented in this
chapter.
The distinction between static and dynamic situations is relevant to
the analysis of the grammatical category of aspect* in many languages
(cf. 15.6). It is also lexicalized in English in the opposition between such
verbs as ‘be’ and ‘have’, on the one hand, and ‘become’ and ‘get’, on
the other. The progressive aspect in English has as one of its semantic
functions that of representing a situation, not simply as existing, but as
happening, or developing, through time; and when it has this function,
it cannot be associated with a verb denoting a static situation (e.g.
‘know’). In so far as they both have a temporal extension, however,
states and processes (including activities) are similar. The fact that it
makes sense to enquire, with respect to a state or a process, “‘ How long
did it last?”’ is reflected by the fact that verbs denoting states or pro-
cesses may be used freely with such temporal adverbials as ‘for a long
time’. In this respect verbs denoting states and processes are in con-
trast with verbs denoting events (including acts), which occur with
punctual, rather than durative, adverbials; and in those languages in
which a syntactic distinction can be established between adjectives and
verbs, the majority of verbs, if not all, denote processes and events,
whereas the majority of adjectives denote states (cf. 11.3). There are
many languages, however, in which what is traditionally described as the
perfect tense of a dynamic verb may be used to represent a state as
having resulted from an antecedent event or process (cf. 15.6).
English, as we have seen, draws a fairly sharp distinction, within
486 Semantics and grammar III
dynamic situations, between processes or events that are under the
control of agents, and those that are not. All processes and events are
happenings, but only actions (i.e. activities or acts) are doings. To ask,
in English, What ts happening? is to presuppose merely that some
process is taking place, and this process may or may not be an activity.
To ask, on the other hand, What ts X doing? or What ts being done
(by X)? is to presuppose that the situation in which one is interested is
an activity. This distinction between non-agentive and agentive hap-
penings is based, we may assume, on a universal appreciation of what
constitutes agency in paradigm instances; and it may well have its
reflex in the grammatical and lexical structure of all languages. But
languages differ, to some degree at least, with respect to the way in
which the semantic notion of agency is grammaticalized and extended to
non-paradigm situations, This point will be explained and illustrated in
the following section. But first we must introduce some further gram-
matically relevant dimensions 1n terms of which we can classify situa-
tions.
For this purpose, we will make use of the general concept of valency*,
which derives from Tesniére (1959: cf. also Heger, 1971; Helbig, 1971)
and has now been quite extensively employed (especially in recent
Soviet work: cf. Kholodovi¢, 1969, 1974; Apresjan, 1974) in the typo-
logical comparison of different language-systems. The concept of
valency can be seen, as far as its ancestry within linguistics is concerned,
as something which takes over and extends the more traditional, but
more restricted, notions of transitivity and government. But it is also
quite clearly relatable to the predicate-calculus classification of predica-
tors in terms of the number of arguments that they take in well-formed
formulae (cf. 6.3): a one-place predicator could be described, from this
point of view, as having a valency of 1, a two-place predicator as having
a valency of 2, and so on. What is traditionally described as a transitive
verb is a verb which has a valency of 2 and governs a direct object.
But valency covers more than simply the number of expressions with
which a verb may or must be combined in a well-formed sentence-
nucleus. It is also intended to account for differences in the membership
of the sets of expressions that may be combined with different verbs. For
example, ‘give’ and ‘put’, in their most common uses, both have a
valency of 3, but they differ with respect to one of the three expressions
which (in the extended sense of ‘government’) they may be said to
‘govern: ‘give’ governs a subject, a direct object and an indirect object;
and ‘put’ governs a subject, a direct object, and a directional locative.
12.4. Valency 487
We will therefore say that they differ in valency: they are associated
with two distinct valency-sets*.
The number of distinct valency-sets in any language-system is quite
restricted ; and there would seem to be few, if any, verbs in any language,
with a valency of greater than 3. But there are in most languages, and
probably in all, grammatically productive mechanisms for decreasing or
augmenting what might be referred to as the intrinsic valency of a verb.
For example, transitive verbs in English are intrinsically bivalent; but
when they occur in the passive they are, like intransitive verbs, mono-
valent. The adverbial phrase ‘by John’, which occurs in the passive
sentence ‘The door was opened by John’ (in contrast with the nominal
‘John’ in the corresponding active sentence ‘John opened the door’) is
an adjunct and, as such, it does not belong to the sentence-nucleus. ‘The
passive of the transitive verb ‘open’, it will be observed, has the same
valency as the active of the intransitive verb ‘open’. The difference
between ‘The door was opened’ (without the adjunct ‘by John’) and
‘The door opened’ is that the former represents the situation as an act
in which the agent is not referred to, whereas the latter represents the
situation as an event (which may or may not be an act). The passive
voice, in languages in which this category is identifiable, is generally, if
not always, associated with a decrease of valency; and there are said to
be many languages in which the passive cannot be employed if the agent
is specified. One of the principal functions of the passive, in fact, would
seem to be that it provides for the description of an act or an activity
without specification of the agent or, alternatively, for the description of
a state which 1s represented as the result of some antecedent act.5
The converse process, whereby the intrinsic valency of a verb is
augmented rather than decreased, is found most obviously in those
languages in which there is a productive causative* construction (e.g.,
Turkish, Japanese, Georgian). This has the effect of increasing the
valency of the verb by 1, so that intransitive verbs become transitive, as
it were, and transitive verbs become trivalent. What is particularly
remarkable about these constructions is that the resultant derived
valency-set is usually identical with the intrinsic valency-set of other
verbs. For example, the valency-set of the causative of an intransitive
verb will be the same as the intrinsic valency-set of transitive verbs; and
the valency-set of the causative of a transitive verb will be the same as
the intrinsic valency-set of a verb which, like ‘give’ in English, takes
Since all locomotion necessarily involves both a source and a goal, (4)
and (5) can be combined to yield
(6) MOVE (ENTITY, SOURCE, GOAL).
12 'To the extent that “‘the order of elements in language parallels that in physical
experience or the order of knowledge’”’ (Greenberg, 1963: 103; cf. Friedrich,
1975) a language is iconic*, rather than arbitrary (cf. 3.4). Gruber (1967,
1975) has argued that subject—predicate constructions develop, ontogenetically
(and in some, but not all, languages), out of topic-comment constructions;
and he has linked this with the development of constative* out of prior
performative* constructions (cf. 16.1). Much the same view is taken by Bates
(1976) ; and it is relatable to earlier speculations about the origins of grammar.
It is arguable that grammar, and more especially syntax, develops by virtue
of the “freezing’’ of what was originally iconic into what is subsequently an
arbitrary “‘formal requirement”’ (see p. 500, n. 8 above) and the progressive
decontextualization of utterances (cf. 14.6).
13
The Lexicon
3) X+h->Y,
which holds between their stems:
(5) Nx + ly > Ay
may be read as saying “‘lexemes of Class Ay are derived from lexemes of
class Nx by the suffixation of -/y (to the appropriate stem)”’, where Nx
and Ay are arbitrarily labelled subclasses of nouns and adjectives.
There are many morphologically simple forms in English which
function as stems for both verbs and nouns (doubt, answer, skin, knife,
etc.) or both adjectives and verbs (dirty, clean, dry, etc.). These can be
brought within the scope of the notion of derivation by recognizing
conversion*, or zero-derivation* (1.e. derivation by means of the afhixa-
13.2. Complex lexemes 523
tion of an identity-element) as a morphological process.’ For example,
the nouns ‘release’ and ‘attempt’ might be said to be derived from the
(6) V, + 9->N, |
verbs ‘release’ and ‘attempt’ in accordance with the formula
(where @ stands for the identity-element). The reason why these nouns
are said to be derived from the corresponding verbs, and by means of
sufhixation, is that they belong to the same subclass of nouns as ‘exten-
sion’, ‘justification’, ‘arrangement’, etc., which are clearly deverbal and
derived by suffixation: deverbal nominalization is characteristically a
matter of suffixation in English. The formula given above as (6) can
therefore be seen as a particular instance of
(7) V+ VNq > Na,
where VNg is a class of deverbal nominalizing suffixes (of which the
identity-element, or zero, is one and -zon, -al, -ment, etc., are others).§
Conversion, or zero-derivation, is very productive in English; and it 1s
usually, though not always, clear which of the pair of lexemes related by
conversion is simple and which is complex in terms of the general
patterns of derivation manifest in the language.
So much by way of general background to the notion of derivation.
The formulae are purely ad hoc; but they will serve the present pur-
pose, and we do not want to go into more detail than is strictly necessary.
We are concerned with the theoretical status of complex (i.e. derived)
lexemes. Should they be listed in the lexicon and, if so, what informa-
tion should be associated with them? Two extreme views might be
maintained on this issue: (i) that no complex lexemes should be included
in the lexicon; (11) that every complex lexeme should be listed separately
in the lexicon and provided with its own lexical entry.
The arguments in favour of listing complex lexemes individually in
the lexicon, rather than deriving them by rule in the grammar, are well
* “Conversion’ is the term used by Quirk et al. (1972: 1009ff). Arguably, the
term ‘conversion’ — “‘the derivational process whereby an item is adapted to
a new word-class without the addition of an affix”’ — carries different implica-
tions from the term ‘zero-affixation’, which can be understood as implying
the addition to the stem of the identity-element functioning as an affix. For
discussion of the criteria that might be applied in deciding between these
alternatives in particular instances, cf, Haas (1957).
8 The reader is reminded of the ambiguity of the term ‘nominalization’:
(i) ‘‘creation of nouns’’; (ii) ‘“‘transformational process whereby nominals
(NPs) are constructed” (cf. 10.3, 11.1).
524 The Lexicon
known (cf. Matthews, 1974): derivational rules are characteristically less
productive than inflexional rules; and their syntactic and semantic effect
is, in many instances, unpredictable. To take the question of restricted
productivity first: what is of crucial importance here is the fact that
certain derived lexemes, which one might expect to exist and to be in
_ current use, not only have never been attested, but are rejected by
native speakers, even though they are morphologically regular and would
satisfy the appropriate formula. For example, there is no derived noun
in English that is related, syntactically and semantically, to the verb
‘salute’ as ‘dilution’ is related to ‘dilute’, ‘pollution’ to ‘pollute’, etc.
(cf. Matthews, 1974: 50). Not only is there no noun whose stem is
salution: there is no noun at all that fills this lexical gap. Conversely,
there are many lexemes in English which are morphologically, syntac-
tically and semantically similar to various kinds of complex lexemes, but
which cannot be derived synchronically from existing lexemes. For
example, ‘doctor’ and ‘author’ are reasonably classified as agentive
nouns (like ‘actor’, ‘painter’, etc.) and their stems are such that they
might be held to contain the agentive suffix -er/-or. But there is no verb
whose stem is doct- or auth-. Examples of derivational gaps of this kind
could be multiplied almost indefinitely.®
Let us now turn to the semantic problems involved in the generation
of complex lexemes within the grammar. It has often been pointed out
that the meaning of very many complex lexemes is more specialized than
that of the lexemes from which they appear to be derived. The reason
for this would seem to be that complex lexemes are like simple lexemes,
in that, once they are created or introduced into the language-system
and pass into general currency, they may be institutionalized and, by
virtue of their use in particular contexts, develop more or less specialized
senses. For example, the noun ‘recital’ is morphologically parallel to
such other deverbal nouns of action as ‘refusal’, ‘approval’, ‘acquittal’,
etc. The form of its stem and its function as a noun of a certain kind can
be accounted for by means of the formula
(8) Vr + al > Na,
where Vr is the class of verbs whose stems may take this particular
nominalizing suffix, Ng is the class of action nouns, and (8), like (6), is
but one of a set of formulae, all of which may be subsumed under (7)
® The existence of these two derivational gaps is readily explained in terms of
the historical development of English. The lexemes ‘doctor’ and ‘author’
come, ultimately, from Latin, as do many other such nouns,
13.2. Complex lexemes 525
above. Now there are contexts in which the syntactic and semantic
relationship between ‘recital’ and ‘recite’ is parallel to that which holds
between ‘refusal’ and ‘refuse’, ‘selection’ and ‘select’, ‘arrangement’
and ‘arrange’, etc.; and it is easy to see that, if the syntactic rules of
English were to generate a set of expressions of the form
(9) the N, of NP, (by NP,)
from structures underlying transitive sentences of the form
(10) NP, V NP,
and if the lexical entries for ‘recite’, ‘refuse’, ‘release’, ‘arrange’,
‘select’, etc., contained enough morphological information for us to
know that ‘refuse’ and ‘recite’ were members of V;r, ‘release’ was a
member of Vp, and so on, then it would be possible to generate ‘the
recital of the Lord’s Prayer (by the congregation)’, ‘the release of the
prisoners (by the terrorists)’, etc., within the grammar. So far so good.
But there are obvious problems which arise if we eliminate the noun
‘recital’ from the lexicon and generate it by means of a transformational
rule in the grammar. If Lawrence Olivier 1s billed to give a Shakespeare
recital, he will indeed recite Shakespeare, but if Yehudi Menuhin gives
a Mozart recital he will play, rather than recite, the music of Mozart.
Furthermore, it is only certain kinds of music that are played at recitals:
we would not expect to hear the Jupiter Symphony played at something
that was advertised as a Mozart recital. What constitutes and is referred
to in English as a recital is determined by accepted cultural conventions;
and one cannot be said to know the meaning of ‘recital’ unless one has
some knowledge of these conventions.
There are syntactic problems too. The expressions ‘the Shakespeare
recital’ or ‘the Olivier recital’ do not of course conform to the pattern
set forth in (9). The expression that would be accounted for by (8) and
(9) is ‘the recital of Shakespeare (by Olivier)’; and this is of doubtful
acceptability. Let us suppose then, it might be suggested, that there are
two lexemes, one of which is derived by nominalization (of restricted
productivity) and appears in such expressions as ‘the recital of the
Lord’s Prayer’ and the other of which is simple, like the noun ‘concert’.
Which of these two lexemes is found in ‘the poetry recital’? This ex-
pression is surely not ambiguous; and yet it can be related equally well
to ‘the recital of the Lord’s Prayer’ (cf. ‘the recital of poetry’), on the
one hand, and to ‘the sonata recital’, ‘the jazz concert’, etc., on the
other. It seems perverse to say that there are two distinct lexemes,
526 The Lexicon
‘recital,’ and ‘recital,’ ; and it is not even clear that there are two sharply
distinguishable senses involved. There is the further difficulty that, in
addition to ‘recital’, we also have ‘recitation’, which satisfies (7). But
‘recital’ and ‘recitation’ are not intersubstitutable in all contexts.
‘Recitation’ is perhaps more readily generated from sentential struc-
tures containing the verb ‘recite’ than is ‘recital’; but ‘recitation’ also
has its own specialized and institutionalized senses, which must be
accounted for in the lexicon.
Enough has been said to give some indication of the difficulties which
arise for the proposal that all derived lexemes should be generated by a
combination of syntactic and morphological rules. What then of the
alternative proposal, that every complex lexeme should be listed
separately in the lexicon? First of all, it should be noted that, in one way
or another, we must relate, syntactically and semantically, those simple
and complex lexemes which do enter into paired sets of expressions and
sentences like ‘John’s refusal of the job’: “John refused the job’, ‘their
solution of the problem’:‘They solved the problem’, etc. Chomsky
(1970) has suggested one way of doing this. In effect (and we need not
go into the details of the formalism upon which it depends), it provides
for the generation, by the base rules of the grammar, of both the
nominal and the sentential structures, NP,’s X of NP, and NP X NP,
where X is realized by a noun (with its appropriate complements) in the
one structure and a verb in the other. Certain lexemes would then be
listed in the lexicon as having alternative stem-forms refusal and refuse,
solution and solve, destruction and destroy, etc., according to whether
they occur in the nominal or the sentential structure. What is being
proposed, then, is to simplify the transformational rules of the grammar
by extending the rules of the base and by handling derivation within the
lexicon. Chomsky’s proposal is made, of course, within the framework
of his own theory of generative grammar. The validity of his criticisms
of what he calls the transformationalist, as opposed to the lexicalist,
account of derived lexemes is independent, however, of this fact; and
the same points as he makes have been made by scholars of a quite
different theoretical persuasion. What is currently referred to as the
controversy between transformationalists and lexicalists is a particular
version of the more general controversy, of longer standing, between
those who wish to account for the distribution and meaning of complex
lexemes by means of productive syntactic and morphological rules and
those who favour the listing of all such lexemes in the lexicon. Let us
grant that Chomsky’s proposal or some alternative formulation of the
13.2. Complex lexemes 527
lexicalist hypothesis is technically feasible. The question we are con-
cerned with here is whether it is desirable to include all the derived
lexemes of a language in the lexicon.
The answer to this question depends upon the answer to a prior
question: is it possible, or in practice feasible, to list all the derived
lexemes of a language? As we have seen, it is difficult, if not impossible,
to arrive at a satisfactory pre-theoretical test of grammatical accept-
ability (cf. 10.2). Our model of the language-system will inevitably
generate, as well-formed sentences, many strings of word-forms which
would be regarded as unacceptable by some native speakers: e.g.
‘Football was played by him yesterday’. So even the most obviously
productive syntactic rules of English, such as the rule (whatever its
precise formulation) which relates active and passive sentences, give
rise to problems of this kind: for the sentence ‘He played football
yesterday’ is unquestionably acceptable. The relevance of this point to
the status of complex lexemes ts that there appears to be no difference of
kind, pre-theoretically, between the productivity of what are universally
regarded as syntactic processes and the productivity of at least some
derivational processes.
The native speaker is as free to construct de-adjectival abstract nouns
with stems ending in -ness, for example, and to use them in certain
syntactically specifiable positions as he is to form passive sentences from
underlying active structures. It is not even clear that the existence of a
generally accepted alternative stem-form (whether this is also derivable
by means of a more or less productive rule or not) inhibits the operation
of the more general rule for deriving abstract nouns from adjectives.
However that may be, we cannot rely upon the existence of a particular
derived lexeme in some corpus of actually attested utterances of English
as either a necessary or a sufficient condition of the existence of that
lexeme in the language-system. Nor can we argue very convincingly
that the native-speaker’s formation of a derived lexeme, on those occa-
sions on which he does form one himself by applying the productive
derivational principles inherent in the language-system (rather than
looking it up, as 1t were, in his own internalized lexicon), results from
the exercise of some peculiarly creative ability. At least some part of
what is customarily held to fall within the scope of derivation appears to
be rule-governed in the same way that the construction of grammatically
acceptable utterances is.
We are faced, then, with a dilemma. Neither of the two extreme pro-
posals referred to earlier seems to be theoretically justifiable. This being
528 The Lexicon
so, it is reasonable to consider listing in the lexicon only those derived
lexemes that are morphologically, syntactically or semantically idio-
syncratic in some way and excluding from the lexicon any lexeme whose
stem-form and whose distribution and meaning can be accounted for by
means of productive rules. There will be problems, of course, in the
application of this criterion. One problem, as we have seen, is that it is
not always possible to assign determinate limits to the productivity of
certain derivational processes; and yet, if we do not restrict the condi-
tions under which the derivational rules apply, we will certainly generate
a host of unacceptable lexemes. This problem is inherent in the whole
process of constructing a generative model of the language-system. It
may well be that further research will lead to refinements in the specifi-
cation of the conditions under which the derivational rules operate, so
that ultimately the linguist will be able to claim, with greater justification
than he can at present, that his model generates all and only the pre-
theoretically acceptable complex lexemes of the language that he is
describing. It is only to be expected, however, that, just as there are very
many strings of word-forms of indeterminate acceptability, so there will
be a number of morphologically complex stems whose pre-theoretical
status with respect to acceptability is equally indeterminate. In such
cases, we can decide, as a matter of methodological principle, to let the
model itself resolve the question for us.
Consider, for example, all the adjectives in English whose stems end
in the suffix -able or -ible. Many of these, though by no means all, can be
accounted for in terms of a synchronically productive process of de-
verbal adjectivalization. In so far as this process is productive in present-
day English, it is restricted to transitive verbs. Let us begin, therefore,
by setting up the formula
(11) Vir + able > Az
where Vir is the class of transitive verbs and A; 1s an arbitrarily labelled
subclass of adjectives. This formula will account for the morphological
relationship between ‘read’ and ‘readable’, ‘drink’ and ‘drinkable’,
etc. Now it is well known that the unrestricted application of a formula
like (11) will yield a certain number of lexemes that would be rejected by
perhaps the majority of English speakers: ‘gettable’, ‘fetchable’, etc.
The first question that would confront us in relation to these putative
lexemes is that of deciding whether they are definitely unacceptable or
not in the dialect of English that we are describing. Let us assume, for
the sake of the argument, that some of them are definitely unacceptable
13.2. Complex lexemes 529
and that others are pre-theoretically indeterminate with respect to
acceptability. The next question is whether there is any phonological,
morphological, syntactic or semantic property in terms of which we can
predict the applicability of (11) to particular transitive verbs in the
lexicon. A priori, any one or any combination of several factors might be
relevant: whether a particular verb-stem has a certain phonological
structure or not, whether it is itself morphologically simple or complex,
whether it is recognizably of Latin or Germanic origin, whether the
verb belongs to a syntactically or semantically restricted subclass of
transitive verbs, and so on. There is, in fact, no obvious single property
or combination of properties of this kind in terms of which we can
predict the applicability or non-applicability of the derivational formula
Vir + able -» Az.!° At the same time, it is clear that the process that the
formula is intended to account for is extremely productive; and there
are certain morphologically specifiable subclasses of transitive verbs
(e.g., those whose stems end in -zge or -if/y) to which the formula seems
to apply without restriction, in the sense that none of the resultant
lexemes is definitely unacceptable. This being so, it would be un-
reasonable to take the view that all the adjectives whose stems end in
-able should be listed in the lexicon.
The simplest solution to the problem, though not necessarily the
most satisfactory, is to let the formula Vir + able + Az apply without
restriction in our model of the language-system. Alternatively, we might
decide to mark a certain number of transitive verbs (e.g., ‘get’, ‘fetch’)
as exceptions to this derivational process, allowing it to operate in all
other instances. If the rule which derives the members of Az is left un-
restricted, it will of course cover transitive verbs whose stems end in
-1ze or ~-ify, provided that they are listed in the lexicon as transitive
verbs or that they are themselves derived by rule and their syntactic
function is assigned to them as part of the process of derivation. But,
even if we were to shrink from admitting into the grammar of English,
with or without specific exceptions, the very general rule Vir -+ able —> Az,
we could still include a rule which made reference to the morphological
composition of the verb-stems ending 1n -zze and -zfy and the syntactic
10 Hasan (1971: 152) suggests: ‘‘this suffix -able can be used with that set of
verbs which can realize the process ‘reaction’ in an active transitive clause
where two participants are required but where the role ‘affected’ can be
mapped only onto the subject ... This explains why one may say Fim is a
likeable fellow but not Jim is a puzzleable fellow’’. Some such principle may
be operative; but it is not always clear what one may or may not say, on the
one hand, and what is here covered by the term ‘affected’, on the other.
530 The Lexicon
information associated with the lexemes of which they are to be stems.
The point is that it is, in principle, feasible to handle by rule as much of
the derivational regularity in a language as is empirically justifiable.
Any derived lexeme that is syntactically or semantically irregular, in
that its distribution or meaning is unpredictable by general rule from
the lexeme whose stem is the synchronic source of its morphological
derivation, must of course be listed in the lexicon. But it will not neces-
sarily be provided with a full lexical entry. There are different kinds and
degrees of derivational irregularity; and it is frequently the case that part,
though not all, of the syntactic function and meaning of an irregular
derived lexeme is predictable by rule. To return to the adjectives whose
stems end in -able. Many of these adjectives, and perhaps all of those
that we would wish to regard as being completely regular, can be inter-
preted in terms of a modalized* passive predicative phrase, the modality
in question being that of possibility or ability (cf. 17.1). For example,
‘His anger is justifiable’ means “‘His anger can be justified’’, or more
precisely ‘‘His anger is such that it can be justified’’; “His assets are
unrealizable’ means “ His assets (are such that they) cannot be realized”’;
and so on. Let us assume, then, that all the adjectives whose stems end
in -able and whose meaning and distribution is regular in terms of a
transformational rule, which derives them from an underlying modalized
sentential structure of the appropriate form, containing the transitive
verb whose stem is the form to which -able is suffixed, are removed from
the lexicon. Granted that this 1s the norm, we can distinguish various
kinds of derivationally irregular adjectives.
One adjective that is morphologically and syntactically regular in
terms of the formula Vir + able —- Az and only partly irregular from a
semantic point of view is ‘readable’. Anything that is readable is such
that it can be read. The adjective ‘readable’, however, is commonly
used, and perhaps most commonly used, to imply rather more than
‘capable of being read’’: a readable novel, for example, is normally
understood to be a novel that one can read with pleasure or interest.
This sense of ‘readable’, we may assume, results from its institutionali-
zation and may not be derivable by rule. ‘Readable’ must therefore be
provided with its own lexical entry and this specific sense of the lexeme
accounted for in box (iv): cf. figure 11. But no special morphological or
syntactic information need be given in the lexical entry. What we want
to say about ‘readable’ is that it is derived from ‘read’ (1.e. that its stem
is derived from the stem of ‘read’) by the suffixation of able. At first
sight, the most obvious way of doing this is to put into box (i) or (11)
13.2. Complex lexemes 531
something like read + able (where the plus-sign indicates the process of
suffixation). But there is nothing here to tell us that read is the stem of
‘read’. Suppose, therefore, we were to leave box (i) empty and to put
into box (ii) the address of ‘read’ (rather than its stem), together with an
indication of the process of suffixation: 1.e. 798 + able. Associated with
this method of representing the morphological composition of the stem
one would have, appropriately formalized, the convention that, in
default of any information to the contrary, the lexeme is morpho-
logically and syntactically regular. Looking up entry 798, we will find
the transitive verb ‘read’ with the stem read and, since there is no
specific information in box (1) of the entry for ‘readable’ indicating that
its stem is in any way phonologically irregular, we will form the stem
readable; and, in default of any further information to the contrary in
boxes (ii) and (iii), we will infer that it does indeed belong to the syn-
tactic class Az, that it forms abstract nouns with a stem ending in -zty
(cf. readability), and so on. Furthermore, the regular, though perhaps
less common, sense of ‘readable’ need not be assigned to it at all in the
lexicon, since it is derivable by the more general rule.
There are very many other adjectives with stems in -able for which a
similar treatment would seem to be required, if we are to capture, for-
mally, both their grammatical regularity and their semantic idio-
syncrasies. What has been outlined in the previous paragraph may not
be the most appropriate way of doing this. The point that we wish to
emphasize here is that by including in the entry for ‘readable’ a cross-
reference to the lexeme ‘read’, rather than simply to the form read, we
can in principle make use of this in giving the semantic information
associated with ‘readable’ in box (iv). It may well be that there are
other morphologically and syntactically regular adjectives of class Az
whose meaning is related to the verbs from which they are derived as
the meaning of ‘readable’ (in its more specialized sense) is related to the
meaning of ‘read’. If so, we can, and presumably should, account for this
in our formalization of the semanticinformation associated with ‘ readable’
in box (iv); and, even if the specialization of meaning involved in this
case is peculiar to ‘readable’, it is after all specialization. ‘Readable’ does
not mean, for example, “‘tasty”’ or “‘capable of rational demonstration ”’.
The native speaker’s knowledge of the meaning of ‘readable’ is pre-
sumably based upon his knowledge of the meaning of the verb ‘read’
and is supported, to the degree that ‘readable’ is derivationally regu-
lar, by the general grammatical and lexical structure of the language.
We have assumed that all the semantically regular adjectives with
532 The Lexicon
stems ending in -able can be paraphrased by means of a modalized
passive predicative phrase: ‘can be obtained’, ‘can be justified’, etc.
There are many such adjectives, however, which allow, or require, an
interpretation in which the modality of the associated predicative phrase
is that of necessity or obligation, rather than possibility. To say that
something is valuable, preferable, commendable, deplorable, enviable,
detestable, etc., is to say that it ought to be, rather than that it can be,
valued, preferred, commended, deplored, envied, detested, etc. All the
verbs from which these adjectives are derived are verbs of evaluation. It
is conceivable, therefore, that the derivation of the adjectives in question
might be handled within the grammar by virtue of a rule that is sensitive
to a distinction between transitive verbs of evaluation and other transi-
tive verbs. This would then be another class of semantically regular
adjectives with stems in -able for which no distinct lexical entries are
required. But there would be exceptions to the proposed subregularity.
‘Criticizable’, for example, differs in this respect from ‘deplorable’ and
‘detestable’. Let us suppose, therefore, that only one kind of complete
derivational regularity is allowed. We will, then, put ‘deplorable’,
‘enviable’, etc., into the lexicon and handle them in the way suggested
for ‘readable’. But in this case we should certainly wish to give recogni-
tion to the fact that the semantic specialization involved is found
throughout a significant number of lexemes and that it depends upon
the relationship between the modalities of necessity and possibility, or
obligation and permission, which is of importance elsewhere in the
grammatical and lexical structure of English (cf. 17.1). Just as an im-
perative sentence, like ‘Sit down!’ or ‘Come in!’, may be used, in the
appropriate circumstances, either to issue a command or to grant per-
mission, so there are certain derived adjectives with stems in -able which
are interpretable in terms of either necessity or possibility according to
the context. One example is ‘payable’. This bill 1s payable immediately
would normally be understood to imply the necessity, rather than the
possibility, of immediate payment; whereas This bill 1s payable at any
post office is paraphrasable as ‘‘ This bill.can be paid at any post office’.
If the imperative is normally associated with the modality of necessity,
but may in certain circumstances be associated with possibility or per-
mission, the converse appears to be the case as far as derived adjectives
with stems in -able are concerned. This point is mentioned here, simply
to give the reader some idea of the way in which a fuller and more
systematic account of the meaning of one class of semantically regular
derived lexemes might proceed.
13.2. Complex Lexemes 533
All the lexemes with stems ending in -able that have been referred to
so far have been morphologically and syntactically regular in terms of
the formula Vir + able -> Az. There are others that are morphologically
irregular in that there is no corresponding verb-stem from which their
stems can be derived. They fall into several subclasses. One such sub-
class is exemplified by ‘feasible’, ‘legible’, ‘edible’, ‘intelligible’, etc.
Although there are no verbs in English whose stems are feas-, leg-, ed-,
intellig-, etc., it is arguable that the adjectives whose stems are formed
from these bound* roots do in fact satisfy the formula Vir + able > Az
(-zble being a variant of -able);4! and some of them at least are seman-
tically regular. For example, ‘edible’ is related semantically to ‘eat’ as
‘justifiable’ is to ‘justify’ or ‘obtainable’ is to ‘obtain’; and it is less
specialized in meaning than the morphologically regular “eatable’. One
way of accounting for the distribution and meaning of ‘edible’ (and
also for the fact that it is morphologically regular with respect to
nominalization: cf. ‘edibility’) might be to put ed +- zb/e in box (i) and
‘eat’ + able in box (ii) of the lexical entry for ‘edible’. Given the
appropriate conventions, everything else is derivable by rule. “Legible’
is like ‘edible’, except that its meaning is somewhat more specialized
than ‘“‘can be read’’, but specialized in a different way from that of the
morphologically regular ‘readable’; and this would need to be indicated
in box (iv). ‘Edible’, ‘legible’, and many other lexemes, then, are
morphologically and syntactically regular in terms of the formula
Vir + able — Az.
There are, however, several lexemes with stems in -able (or -zble) that
do not satisfy this formula: ‘horrible’, ‘knowledgeable’, ‘reasonable’,
etc. In each case, the lexeme is an adjective, as the form of its stem
suggests; but none of them belongs to the class of adjectives which are
nominalized by virtue of the rule Vir + able/ible +- ity — Nz, as are all
the regular adjectives with stems in -able. Furthermore, many of them
are morphologically and semantically idiosyncratic in relation to the
lexeme from which they appear to be derived. For example, the stem of
‘knowledgeable’ is patently analysable as knowledge +- able. But know-
ledge never functions elsewhere in English as a verbal stem; and, even if
we were to recognize knowledge in this instance as a bound verbal stem
11 For present purposes, a free* form may be defined as one that may function
as a word, phrase or complete utterance, and a bound* form, in contrast, as
any form that is not free. The distinction between free and bound forms is of
particular importance in work that derives, directly or indirectly, from
Bloomfield (1935).
534 The Lexicon
(meaning “‘know”’ or whatever), the modality of possibility or necessity,
which we have associated with Vir-+-able —> Az, is irrelevant to the mean-
ing of ‘knowledgeable’. Since N + able > A is certainly not a syn-
chronically productive rule of modern English, there is little point in
treating ‘knowledgeable’ as anything other than a simple lexeme in the
lexicon, despite the fact that it is obviously related semantically to
‘knowledge’. The same holds for all the other adjectives with stems in
-able which do not satisfy the formula Vir + able > Az.
We will not proceed any further with our discussion of derived, or
complex, lexemes. What we have tried to do here is to show, with
reference to just one class of derived lexemes in English, that the prob-
lem which faces the linguist is not simply that of deciding whether a
particular lexeme can or cannot be generated by rule. He must account,
as systematically as he can, for various kinds and degrees of derivational
regularity. Little progress has yet been made towards the solution of
this problem within the framework of generative grammar, or indeed
within any coherent theory of the structure of language.!? As we saw
earlier, Chomsky has argued for what he calls the lexicalist treatment of
complex lexemes. But he has also accepted the possibility, in principle,
of “‘a compromise solution that adopts the lexicalist position for certain
items and the transformationalist position for others’’ (1972: 17). Such
proposals as we have made in our discussion of the question, informal
though they have been, are consistent with this kind of compromise
solution.
12 Householder (1959), and later Halle (1972), propose that there should be a
sub-grammar, within the grammar of English and other languages, whose
function it will be to define the set of derivationally well-formed complex
lexemes, a subset of which will be listed, in the lexicon, as having been actuali-
zed. For a range of views, cf. Brekle (1970), Chapin (1967), Chomsky (1970),
Dubois (1962), Guilbert (1975), Lees (1960, 1970), Lipka (1972), Ljung
(1970), Newmeyer (1971). An important work that antedates the rise of
generative grammar is Kurylowicz (1936). For various patterns of derivation
in English cf. Adams (1973), Marchand (1969).
13.3. Compound lexemes 535
more stems (with or without morphological modification). In so far as
the distinction can be drawn, in particular languages, between word-
forms and combinations of word-forms, a corresponding distinction can
be established between word-compounds and phrasal compounds. ‘The
distinction is less clear in English than it is in many other Indo-European
languages; and the inconsistency with which spaces and hyphens are
employed in written English reflects this fact. The principal criterion for
drawing the distinction, in spoken English, between word-compounds
and phrasal compounds is that of stress. Generally speaking, each word-
form in English (if it is a form of a lexeme belonging to one of the major
parts-of-speech) has a single primary stress; and the position of primary
stress in word-forms of more than one syllable is determined by the
morphological composition of the stem. Given that both simple and
derived stems in English have a distinctive stress-pattern, compound
stems which have a single primary stress (e.g., screwdriver, blackbird, boy-
friend, window box) may also be classified as word-stems, regardless of
whether they are conventionally hyphenated or not in the written
language.
But we are less concerned here with the distinction between word-
compounds and phrasal compounds, than we are with the distinction
between compound lexemes*, on the one hand, and what, for want of a
better term, we will call syntactic compounds*, on the other. Syntactic
compounds are like completely regular derived lexemes in that their
meaning and distribution can be accounted for in terms of the produc-
tive rules of the language-system; and, for that reason, they need not be
listed in the lexicon. Indeed, unlike derived lexemes, they could not in
principle be listed in the lexicon, since, in certain languages at least, they
are infinite in number.
Compound lexemes frequently originate as syntactic compounds and,
having become institutionalized, acquire a more or less specialized mean-
ing. An obvious example is ‘country house’, which is regular enough in
terms of the syntax of English, but which, as a compound lexeme in
British English, denotes a much smaller class of dwellings than does the
expression ‘house in the country’. If it were not for the fact that
‘country house’ has, ‘for historical reasons, come to be associated in
Britain with what once were and in some cases still are the non-
metropolitan residences of the aristocracy, there would be no reason to
treat it as a compound lexeme. It is interesting to compare ‘country
house’ in this respect with the French ‘maison de campagne’, which one
might expect to be its translational equivalent. ‘Maison de campagne’
536 The Lexicon
is also institutionalized as a ready-made or fixed expression, in so
far as it is regularly employed by those who normally live in the town in
order to refer to the house in the country in which they might spend their
week-ends and their holidays. But its meaning is less specialized than
that of “country house’ (the closest equivalent to which is probably
‘chateau’ — conventionally, but in most cases inappropriately, translated
into English as ‘castle’); and it is, for this reason, less obviously a com-
pound lexeme than ‘country house’.
The process by which syntactic compounds are institutionalized as
lexemes has been aptly called petrification (cf. Leech, 1974). This
metaphorical.term is intended to suggest two distinguishable aspects of
the process in question: solidification and shrinkage. As soon as any
regularly constructed expression is employed on some particular occa-
sion of utterance, it is available for use again by the same person or by
others as a ready-made unit which can be incorporated in further
utterances; and the more frequently it is used, the more likely it is to
solidify as a fixed expression, which native speakers will presumably
store in memory, rather than construct afresh on each occasion. In this
respect, frequently used syntactic compounds are like frequently used
regular derived lexemes. Solidification, then, is a natural consequence
of the normal use of language; and, just as naturally, though by no
means inevitably, it leads to the other aspect of the process of petrifica-
tion, shrinkage or semantic specialization. Just as a simple lexeme may,
by virtue of its use in particular contexts, become more restricted in its
sense and denotation than it was in some earlier period, so too may
derived lexemes and syntactic compounds. This is a more or less in-
evitable consequence of the normal use of language; and it creates both
practical and theoretical problems for the lexicographer. How does he
decide whether the process of petrification has-gone far enough, in any
particular instance, to justify the inclusion of a separate lexical entry?
And what kind of information should be associated with compound
lexemes in the lexicon?
It is easy enough to formulate the general criteria for inclusion in the
lexicon: a lexical entry is required for compound lexemes (and it is this
property which makes them lexemes) if and only if they are phono-
logically, morphologically, syntactically or semantically idiosyncratic. In
practice, this criterion is difficult to apply, because it is hard to draw a
sharp distinction between regularly constructed, but institutionalized,
syntactic compounds and petrified compound lexemes. We have assumed
that ‘country house’ 1s, on semantic grounds, to be classified as a com-
13.3. Compound lexemes 537
pound lexeme in British English. But it is indistinguishable in terms of
its phonological, morphological and syntactic characteristics from in-
numerable endocentric noun expressions in which the head-noun is
modified by another adjectivalized noun, like ‘week-end cottage’, ‘car
radio’, or “garden furniture’ (cf. 10.3); and many of these are certainly
institutionalized, in that they denote classes of things that have a more
or less distinctive role in present-day life. It is for cultural reasons that
“week-end cottage’ is of more frequent occurrence than, say, ‘week-day
cottage’. But if anyone should choose to live in town at the week-end
and in the country during the week, he would be quite free to refer to
his country residence (if it were a cottage) by means of the expression
‘week-day cottage’. This expression 1s fully acceptable and semantically
interpretable in terms of the productive rules of the language-system;
but it has not been institutionalized. It requires but little reflexion to see
that institutionalization, like petrification, is a matter of more or less,
rather than yes or no. Not only 1s ‘country house’ syntactically endo-
centric, in that its distribution is identical with that of ‘house’, but it is
semantically regular to the extent that its sense is related to that of its
component head-noun in terms of hyponymy. In this respect it differs
from ‘public house’ (in British English), and still more from ‘green-
house’ (which is identifiable, phonologically, as a word-compound). The
native speaker’s understanding of the sense and denotation of ‘country
house’ is presumably supported by, though it cannot be completely
explained by, his recognition of its internal syntactic structure and his
knowledge of the meaning of both ‘house’ and ‘country’. Furthermore,
the existence of ‘country house’ as a compound lexeme does not abso-
lutely prevent the construction by native speakers of the corresponding
syntactic compound whose distribution is identical: a sentence like ‘I
don’t like country houses’ is presumably ambiguous.
Granted that ‘country house’ is a compound lexeme, but that it is to
be related both grammatically and semantically to the simple lexemes
‘country’ and ‘house’, we come up against the theoretical problem that
there is no obvious way of doing this satisfactorily within the framework
of generative grammar. Conventional dictionaries sometimes adopt the
practice of incorporating lexical entries for compound lexemes within the
entries for one or other of the component simple lexemes, and this can
be seen as an informal way of indicating that there is some kind of
relationship between the compound and its lexical components. But the
nature of this relationship is not made explicit. Conventional dictionaries
trade very heavily, and justifiably in view of their aims, upon their users’ ~
538 The Lexicon
intuitive knowledge, not only of the grammatical structure of the lan-
guage, but also of the kinds of things that the language is normally used
to describe or to refer to. If we are to make explicit the degree to which
the meaning and distribution of the compound lexeme ‘country house’
is determined by the meaning and distribution of ‘house’ and ‘country’,
“we must first of all have some means of representing within its lexical
entry the fact that it is composed of ‘house’ and ‘country’ and that it is,
as far as its distribution is concerned, a regular endocentric noun-
compound. We must then be able to use this information in the semantic
part of the lexical entry to indicate the relation of hyponymy that holds
between the compound as a whole and the simple lexeme ‘house’. It is
not possible to do more than make some very tentative suggestions here
as to the way in which this might be done; and the principal reason is
that the treatment of syntactic compounds within the framework of
generative grammar is, if anything, even more problematical than is the
treatment of derivational morphology.
We have been assuming that endocentric noun-expressions like
‘week-end cottage’, ‘garden furniture’, ‘car radio’, etc., are to be
generated by the productive rules of the language-system; and this
would appear to be a reasonable assumption. It has frequently been
pointed out, however, that the semantic relationship between the head-
noun and the modifying noun in such phrases is extremely diverse. If
they are to be transformationally derived from some underlying senten-
tial structure in which the head-noun is part of the subject and the
modifying noun is part of the predicate, we must allow for the trans-
formational derivation of the same denominal adjectival modifier from
many different predicative expressions. This is not in itself an objection
to the transformational derivation of syntactic compounds. Indeed, the
fact that many regular syntactic compounds are, in principle, ambiguous
is naturally accounted for by deriving them from several distinct under-
lying structures. For example, “the London train’ might refer to a train
which is going to London or coming from London (cf. Has the London
train left yet? vs. Has the London train arrived yet?). The expression
‘London taxis’, on the other hand, will normally be understood to refer
to taxis which operate in London, as will ‘London buses’. But ‘the
London bus’ can also be used to refer to a bus which is going to or
coming from London. It is inconceivable that the syntactic and semantic
subclassification of either ‘London’ or ‘train’, ‘bus’ and ‘taxi’ in the
language-system could be such that these differences could be accounted
for by rule. In any case, it is surely only our knowledge of the fact that
13.3. Compound lexemes 539
trains normally operate between towns, taxis within towns and buses
both between and within towns that leads us to say that one interpreta-
tion is more normal than another in any particular instance. Once this
point is conceded, it is difficult to avoid the conclusion that every
syntactic compound is, in principle, highly ambiguous, although one
interpretation rather than the others, whether it is institutionalized or
not, may seem more natural or more normal in particular utterances.
In this respect, syntactic compounds are no different from very many
sentences, whose ambiguity usually goes unnoticed because they are
interpreted within a framework of shared ontological or contextual
assumptions. As we have seen, it is a debatable point whether sentences
can be said, in general, to have a determinate and integral number of
meanings (cf. 10.4). It is also matter for dispute how much of the
potential ambiguity of sentences is to be excluded by selection restric-
tions*, formalizable in our model of the language system, and how much
should be held to fall outside the scope of linguistics entirely (cf. 10.5).
Both of these questions are obviously relevant to the analysis of syntactic
compounds.
What we are concerned with here is a rather different question. On the
assumption that syntactic compounds are to be generated, within the
grammar, from underlying sentential structures, and that any given
syntactic compound may be derived from several sources, are any or all
of the ambiguities preserved in the formal representation of the syntactic
structure of the resultant compound? Is ‘London bus’, for example,
assigned the same syntactic analysis in the surface structure of sentences
regardless of whether it means “‘bus from London’’, “bus to London”’,
‘bus in London’’, etc.? As far as the interpretation of regular syntactic
compounds, like ‘London bus’, is concerned, this question might not
seem to be very important. After all, the different interpretations are
satisfactorily accounted for in the deep structure representation; the
deeper syntactic differences are not relevant to the distribution of the
noun expression ‘London bus’ throughout the sentences of the lan-
guage; and there are no correlated morphological or phonological
differences. What reason, then, is there to preserve the deeper syntactic
differences in the surface structure representation?
One reason is that it would enable us to classify compound lexemes in
the lexicon according to the subtype of syntactic compound with
which they are structurally identical. For example, on the assumption
that ‘country house’ as a compound lexeme, is structurally identical
with syntactic compounds like ‘country cottage’ under one of its
540 The Lexicon
interpretations, which are derived from a sentential structure in which
the adjectivalized noun occurs in a predicative locative expression (cf.
‘cottage which is in the country’ — ‘country cottage’), it could be
classified in the lexicon as an endocentric noun compound of the par-
ticular subtype composed of ‘house’ and ‘country’ (these two simple
lexemes being referred to by their lexical addresses). Part of the meaning
of ‘country house’, as well as its distribution, would then be predictable
by rule, in much the same way as the meaning and distribution of regu-
lar complex, or derived, lexemes is predictable by rule. If the meaning of
‘country house’ is (roughly) “‘house in the country belonging to (or
having once belonged to) an aristocratic family”’, all that would need to
be explicitly represented in the semantic part of the lexical ‘entry is
‘belonging to... an aristocratic family’’. Just how this part of the sense
of ‘country house’ might be represented,.in terms of universal or
culturally specific semantic components or otherwise, is of course
problematical (cf. 9.9). But this problem is there anyway, even for simple
lexemes. It is not produced as an artefact of the proposals that we are
making in relation to compound lexemes.
‘Country house’ will serve as an exemplar of what is a very large class
of compound lexemes. As we have seen, it is completely regular as far as
its phonological, morphological and syntactic properties are concerned;
and its status as a lexeme depends solely upon its idiosyncratic and un-
predictable semantic specialization. Its sense, we have decided for the
sake of the argument, is the product of three components “X.”’, ‘“Y”’
and ‘‘Z’’, where “‘X”’ is the sense of ‘house’, “Y”’ is the sense of the
expression “‘in the country” and “Z”’ is the idiosyncratic residue. It
would be possible, of course, to disregard the fact that ‘country house’
is a compound lexeme in the semantic part of its lexical entry. But we
are making the surely plausible assumption that the native speaker’s
knowledge of the meaning of ‘country house’ is determined, in part, by
his knowledge that it is composed of a noun-head, ‘house’, and a noun
modifier, ‘country’; and that these two simple lexemes are combined
according to the productive rules of the language to yield a particular
kind of endocentric noun compound, each lexeme having the sense, or
one of the senses, that it has elsewhere in the language. Let us assume,
therefore, that the compound lexeme ‘country house’ is represented in
the lexicon as a combination of the two lexemes 731 and 1321 (731 being
the address of ‘country’ and 1321 being the address of ‘house’) and
furthermore that it is characterized in the syntactic part of the lexical
entry as belonging to a particular subclass of endocentric noun expres-
13.3. Compound lexemes 541
sions, which we will arbitrarily label N,. This information is sufficient
to account for the phonological, morphological and syntactic regularity
of ‘country house’ in relation to its constituent simple lexemes.
What is now required is some convention, whereby we can interpret
this information, together with what other information is given in the
semantic part of the lexical entry, in order to derive its sense from the
sense of ‘country’ and ‘house’. We know from the grammar that N, 1s
a bi-partite phrasal construction N ++ N, in which the first constituent
is an adjectivalized locative modifier and the second constituent is the
head. If there were no entry for ‘country house’ in the lexicon, it would
be interpreted semantically, as an instance of N,, as an expression mean-
ing “‘house in the country”’; and this part of its sense we want to be able
to derive by rule. Let us assume, therefore, that for every compound
lexeme composed of n simple lexemes, there are m + 1 spaces set aside in
the semantic part of the lexical entry; that one of these is reserved for
the idiosyncratic part of the sense of the component; and that each of
the other spaces is associated with one of the constituents of the com-
pound. In the present instance, there will be three spaces for semantic
information. We will refer to them as the H-space, the M-space and the
S-space (where H, M, and S stand mnemonically for ‘head’, ‘modifier’
and ‘specialization’). Since ‘country house’, under the assumptions we
are making, is regular in so far as its sense is a function of the senses of
‘country’ and ‘house’ when they are combined in an expression of the
class N,, we will leave the H-space and the M-space empty in the seman-
tic part of the lexical entry. But we will put in the S-space, in whatever
format is adopted for this purpose, the information that ‘country house’,
as a compound lexeme, includes in its sense the component “ belonging
to an aristocratic family’’.
Granted that the meaning of the compound lexeme is the product of
“X”, “Y” and “Z”’, the values of X and Y will be determined (by
virtue of the convention that the absence of any information to the con-
trary implies regularity) as the sense of lexemes 731 and 1321, respec-
tively; and these will be combined with ‘‘Z’’, which is given in the
S-space.
Several points may now be made on the basis of the technique that
has just been outlined. First of all, it is obvious that there are, in prin-
ciple, several ways in which the components of the sense of a compound
lexeme may be combined to yield the sense of the compound as a whole.
The proposals that have been made here depend very heavily upon the
assumption that there is a limited and determinate number of ways in
542 The Lexicon
which the senses of lexemes may be combined, or amalgamated, and
that each of these may be associated systematically with particular kinds
of syntactic relations. It may well be, of course, that the meaning of even
regular syntactic compounds is too loosely related to that of their con-
stituent lexemes for the meaning of the compound to be predictable by
rule. But, if this is so, it will be impossible to generate by means of a
finite set of rules operating (in association with the grammar) upon a
finite lexicon the meaning of all the sentences of English. For the set of
regularly derivable syntactic compounds in English (and presumably in
all languages) is indefinitely large. The assumption that the meaning of
an expression is a function of the meaning of its constituents is taken for
granted in much of the most recent work in theoretical semantics; and it
is difficult to see how this assumption could be abandoned without
simultaneously abandoning the attempt to formalize the semantic struc-
ture of sentences.
Our proposals also depend upon the assumption that the way in which
the idiosyncratic part of the meaning of a compound lexeme is amalgama-
ted with the meanings of its constituent simple lexemes ts precisely
specifiable. That this assumption is perhaps untenable becomes clear if
we compare ‘country house’ with ‘washing machine’. The fact that
“washing machine’ in English (like ‘machine a laver’ in French) is now
a compound lexeme, or at least is well on the way to having acquired
lexemic status in the language-system, is suggested by the fact that such
utterances as Is that a washing machine or a dish-washer?, or even Is that a
washing machine or a dish-washing machine?, appear to be completely
acceptable. Structurally, ‘washing machine’ can be related to an in-
definitely large class of expressions (many of which are institutionalized):
‘reading lamp’, ‘gardening jacket’, “swimming costume’, etc. (We will
here discount the possibility that ‘washing machine’ is structurally
ambiguous, being comparable also with ‘running water’, ‘sleeping
partner’, ‘standing committee’, etc.). The meaning of ‘washing
machine’, in so far as it is regular, is “machine (used/usable) for wash-
ing’’. If the sense of ‘clothes’ is to be amalgamated with this, however,
it is obvious that it must be combined in some way as the object of
‘washing’: cf. ‘clothes-washing machine’, ‘machine for washing
clothes’. It follows that this information must be given in the S-space.
Alternatively, if what is put in the S-space is no more than “‘(used/usable)
for clothes”’ and if the convention is held to apply, according to which
the idiosyncratic part of the sense is combined with “‘machine for
washing”? in the same way that “for washing”’’ is combined with
13.3. Compound lexemes 543
‘‘machine”’, the resultant conjunction “‘ washing machine (used/usable
for clothes’) will be underspecified. But this is, once again, a more
general problem: ‘tennis dress’ normally means “‘dress for playing
tennis (in)”’, ‘bread-knife’ means “knife for cutting bread (with)’’; and
so on. In the case of ‘washing machine’ there is at least present in the
forms of the compound a form of the verb ‘wash’; and if “dress for
tennis” and “knife for bread’’ are considered specific enough to serve
as the meaning of ‘tennis dress’ and ‘bread knife’ (which, though they
are institutionalized as expressions, are surely not to be regarded as
lexemes), there is perhaps no reason why “machine for washing for
clothes” should not be regarded as a satisfactory analysis of the sense of
the compound lexeme ‘washing machine’. We can, however, leave open
the possibility that it may be necessary to specify in the S-space of a
compound that the idiosyncratic part of its meaning combines with that
of one, rather than another, of the constituent lexemes and that it does
so in a particular way.
Before we continue with the discussion of compound lexemes, it
should be pointed out that, if the way in which the idiosyncratic part of
the meaning of a compound lexeme is too loose and too diverse to be
brought within the scope of rules of the kind that operate in the deter-
mination of the meaning of regular syntactic compounds (as has often
been suggested), this point holds equally well for simple and complex
lexemes. They are no less likely to acquire, by institutionalization and
subsequent petrification, idiosyncratic restrictions than are compound
lexemes. What is at issue, in fact, is the whole question of lexical de-
composition: is it possible to represent the sense of lexemes, without
residue, as a compositional function of sense-components and, if so,
what are the combinatorial principles? ‘This is one of the most funda-
mental, and controversial, questions of theoretical semantics at the
present time. If it cannot be answered in the affirmative for compound
lexemes, it seems clear that lexical decomposition, as such, must be
rejected completely; and we have already seen that lexical decomposition
is suspect on other grounds (cf. 9.9).
A further point that must be made explicit before we proceed has to
do with the relevance, in relation to compound lexemes, of the distinc-
tion between homonymy* and polysemy* (cf. 13.4). A sentence like ‘I
hate the country’ is ambiguous in English: so too is ‘I hate the town’.
Under one interpretation, ‘the country’ and ‘the town’ are singular
definite referring expressions, and the lexemes ‘country’ and ‘town’
occurring in them denote such classes of entities (or places) as England,
544 The Lexicon
France, Germany, etc., and Manchester, Birmingham, Stratford-upon-
Avon, etc. Under the other interpretation, ‘the country’ and ‘the town’
_ do not refer to entities (or places) of this kind at all; their mode of
reference is similar to, if not identical with, that of singular mass nouns.
Now, it is clear that the compound lexeme ‘country house’ (unlike
‘house in the country’) is not ambiguous. If the ambiguity of ‘I hate the
country’ is accounted for by recognizing two distinct lexemes ‘country,’
and ‘country,’ (the first of which is not pluralizable and never occurs
except with the definite article), there is no problem, in so far as our
proposed technique for handling the relationship between ‘country
house’ and ‘country’ is concerned. ‘Country,’ and ‘country,’ will have
two different lexical addresses; and it will be the address of ‘country,’
that is given in the M-space. But if ‘country’ is treated as one lexeme
with two meanings, “‘country,’’ and “country,’’, we shall have to be
able to identify in the lexical entry for ‘country house’ which of the two
senses is involved. The distinction between homonymy and polysemy,
as we shall see later, is very difficult to establish on general grounds, and
may indeed rest upon ultimately untenable assumptions about the
discreteness of the senses of lexemes. However that may be, the point
that we have raised in this paragraph is an important one.
Having discussed, at some length, the nature of such compound
lexemes as ‘country house’, we can now move on to deal more briefly
with other kinds of compound lexemes, whose meaning cannot be
accounted for, even in principle, as the product of the meaning of a
regular syntactic compound, on the one hand, and of a more specialized,
idiosyncratic component, on the other. First of all, there are several
classes of compound lexemes which are phonologically and grammatic-
ally regular, but semantically irregular with respect to one or more of
the simple lexemes of which they are composed. For example, ‘public
school’ in British English is semantically regular with respect to its head:
‘public school’ is a hyponym of ‘school’, and this can be indicated in
the lexicon in the way that the hyponymy of ‘country house’ and .
‘house’ is indicated. But the sense of the adjective ‘public’ does not
enter into the meaning of the compound lexeme ‘public school’ as the
sense of ‘country’ is included in the sense of ‘country house’. ‘Public
school’ denotes a subclass of the class of institutions in Great Britain
denoted by ‘private schools’ (which, like ‘State school’, but unlike
‘grammar school’, ‘prep school’, etc., is a regularly derivable expres-
sion).
It might be argued, of course, that ‘public’, in this case, has a meaning
. 13.3. Compound lexemes 545
which it never has in any other context. Suppose we put ‘public,’ into
the lexicon as a homonym of ‘public,’ (the more common and more
freely combinable adjective) and associate with ‘public,’ whatever it is
in the sense of ‘ public school’ which distinguishes it from ‘State school’,
‘grammar school’, ‘prep school’, etc. ‘Public school’ would then be
regularly derivable; and it would not need its own lexical entry. But we
should have to indicate in the lexical entry for ‘public,’ the fact that it
can occur only as a pre-noun modifier and, moreover, that the only noun
it can modify is ‘school’. (Adopting this treatment would be like putting
a verb ‘auth’ into the lexicon with the information that its stem auth
must necessarily be combined with the derivational agentive sufhx to
form the stem of the noun ‘author’: cf. 13.2.) Another alternative is to
opt for polysemy rather than homonymy, saying that ‘public’ has (at
least) two distinct senses “‘public,’’ and “‘public,’”’. Once again, we
should have to indicate in the lexical entry for ‘public’ that, when it
means ‘‘public,’’, it must be combined with ‘school’. The theoretical
implications of adopting one, rather than the other, of these solutions
should not be overlooked. In the first case, by taking ‘public,’ as a
distinct lexeme and restricting its syntactic distribution in the way that
is required, we should be setting up, in effect, a one-member subclass of
adjectives in English. In the second case, we should be making the syn-
tactic distribution of ‘public’ a function of its meaning. There would
seem to be little point in thus complicating the syntactic description of
the language in order to handle what can be handled equally well by
putting into the lexicon an entry for ‘public school’, which relates it
semantically to ‘school’, but not to ‘public’. How might this be done?
Let us recall that the convention suggested for ‘country house’ was
that it should be regarded as being composed of two nouns, identified by
their lexical addresses, and that the semantic part of the entry for
‘country house’ should have three spaces: an H-space for the head, an
M-space for the modifier, and an S-space for the idiosyncratic part of
its meaning. Adopting the same format for ‘public school’ (and assum-
ing that it is identified as an endocentric compound), we would leave the
H-space empty (thus accounting for the hyponymy of ‘public school’
and ‘school’) and we would indicate by a special symbol, the identity-
symbol or zero (%), in the M-space that the modifier is semantically
vacuous. ‘The third space would then carry the idiosyncratic part of the
meaning of ‘public school’. The meaning of ‘public school’ would be
thus determined as “school which is and which Z’’. We have here used
the identity-symbol, or zero, to represent the semantically vacuous
546 The Lexicon
component: i.e. the part contributed by the adjective ‘public’. Several
questions arise in relation to this treatment.
First, it might be asked why, if ‘public’ makes no contribution to the
meaning of ‘public school’, it is nonetheless recognized as a constituent
of the compound lexeme. The answer is that we need to get the form
public, just as we need to get the form school, from the lexicon; and we
must either put the form public in the entry for ‘public school’ or obtain
it from the lexical entry for ‘public’. If we identify ‘public school’ as
public + 523 (i.e. as a combination of a form and a simple lexeme — 523
being the address of ‘school’), we are saying that there is no relationship
of any kind, other than accidental coincidence of form, between the
simple lexeme ‘public’ and the modifier of ‘school’ in the compound
lexeme ‘public school’. The alternative, which we have adopted, is
based on the assumption that the existence in the vocabulary of English
of a simple lexeme ‘public’ and the fact that ‘public school’ is of the
same syntactic type, and has the same stress-pattern, as regularly
derivable phrases composed of an adjective and a noun, is sufficient to
justify this treatment. It must be admitted, however, that in default of
somewhat arbitrary.
any positive morphological reasons in favour of this solution, it is
(i) Each of the participants must know his role* and status*. Lin-
guistically relevant roles are of two kinds: deictic and social. Deictic
roles derive from the fact that in normal language-behaviour the speaker
addresses his utterance to another person (or other persons) who are
2 According to Goffman (1964): “‘It hardly seems possible to name a social
variable that doesn’t show up and_have its little systematic affect upon speech
behaviour: age, sex, class, caste, country of origin, generation, region,
schooling; cultural cognitive assumptions; bilingualism, and so forth’’. For
exemplification and discussion, cf. Bauman & Sherzer (1974), Bright (1966),
Fishman (1965, 1968, 1971, 1972a, b), Giglioli (1972), Gumperz & Hymes
(1971), Hymes (1964, 1974), Pride (1970), Pride & Holmes (1972).
14.2. Communicative competence 575
present in the situation and may refer to himself, to the addressee(s) or
to other persons and objects (whether they are in the situation or not),
not by means of a name or description, but by means of a personal or
demonstrative pronoun, whose reference is determined by the participa-
tion of the referent in the language-event, at the time of the utterance.
Deictic roles are grammaticalized in many, though not all, languages in
what is traditionally called the category of person*. We will discuss this
in more detail in the chapter dealing with deixis* (15.1). Here it is
sufficient to say that in English the use of ‘I’ (and ‘we’) is determined,
in normal language-behaviour, by the speaker’s assumption of the role
of speaker in relation to the addressee(s) and by his referring to himself
as the person fulfilling this deictic role. The addressee must be able to
identify the referent of ‘I’ and also the referent of ‘you’; and this
implies that he knows that he is being addressed. Many of the non-vocal
paralinguistic phenomena which accompany and are integrated with
spoken utterances have this vocative* function of inviting a particular
person to assume the role of addressee; and names, titles or special
terms of address based on social status may be used, and in some situa-
tions are obligatory, in order to identify the addressee (cf. 7.5).
Social roles are culture-specific functions, institutionalized in a society
and recognized by its members: for example, the function of being a
doctor, a parent, a teacher, a customer, a priest. These roles are typically
reciprocal: doctor-to-patient and patient-to-doctor, parent-to-child and _.
child-to-parent, and so on. The most obvious effect of social role, as a
contextual variable, lies in its determination of terms of address: as when
‘Sir’, ‘Doctor’ or ‘My lord’ (in the courtroom) are used with vocative
function in English. The speaker in using such expressions accepts, and
shows that he accepts, his role vis-a-vis the addressee. In many lan-
guages there is a richly differentiated set of terms of address which the
speaker must control if he is to produce appropriate utterances in
various situations. Social role may also determine the selection of per-
sonal pronouns and associated components of the grammatical structure
of utterances. A clear instance of this is the use of the so-called royal
first-person plural pronoun by a monarch, the Pope or a bishop in a
number of European languages (“‘ We have taken unto ourself [sic] . . .’’),
and, in Japanese, the use of a special first-person pronoun by the Em-
peror. Generally speaking, however, it would seem to be status, rather
than role, which is the determining factor in the selection of pronouns.
For example, the fact that in the Russian army, before the Revolution,
an officer would address a private soldier as ‘ty’ (roughly comparable
576 Context, style and culture
with the French ‘tu’ and the German ‘du’), but be addressed by the
soldier as ‘vy’ (cf. French ‘vous’, German ‘Sie’), is explicable in terms
of more general principles based on status (cf. Friedrich, 1966). Role
normally implies status. There are, however, many aspects of language-
behaviour that are systematically determined by social role: the use of
various characteristic expressions by a judge addressing the jury or a
preacher addressing the congregation, by lovers in situations of inti-
macy, by a person saying his prayers, and so on. Role may also be the
primary determining factor in the switch from one dialect to another, or
even from one language to another, in situations of diglossia* —a
phenomenon that will be referred to later in this section.
By social status is meant the relative social standing of the partici-
pants. Each participant in the language-event must know, or make
assumptions about, his status in relation to the other; and in many
situations status will also be an important factor in the determination of
who should initiate the conversation. The participants may not agree
about their relative status; each speaking to the other as superior-to-
inferior, or more commonly perhaps (and in a way that is often con-
ventionalized in language by means of an accepted code of politeness) as
inferior-to-superior; or one treating the other as an equal, while he is
himself addressed as a superior or inferior. Societies vary considerably,
of course, in the degree to which status is explicitly recognized as such
and institutionalized in dress, titles and so on; and the degree to which
language-behaviour is determined by status also varies from one
language to another. But there is probably no language for which it is —
totally irrelevant.
Once again, the most obvious correlate of social status in language-
behaviour, as far as the utilization of the language-system is concerned,
is in the use of particular terms of address and personal pronouns. It is
supported and confirmed by such paralinguistic phenomena as eye-
movements, gestures, posture and physical contact or proximity (cf. 3.2).
The importance of status in the selection of certain terms of address in
American English has been demonstrated in a now classic paper by
Brown and Ford (1961); and their work has been carried further by
Ervin-Tripp (1969). In many European languages, though not in
Modern English, the selection of a second-person singular pronoun is
determined, partly at least, by relative social status: the particular deter-
mining factors vary, however, from one language to another, and indeed
from one social group to another within the various language-communi-
ties. In Japanese and Korean social status and deictic role jointly
14.2. Communicative competence 577
determine the selection of all the personal pronouns; and status (to-
gether with other factors) governs the selection of particular forms of
certain verbs (cf. Martin, 1964; Harada, 1975). But status, like role, also
determines, and probably in all languages, the selection of a wide range
of stylistic factors in phonology, grammar and vocabulary; and the socio-
linguistic literature contains many illustrations of this from all over the
world.
Sex and age are so often determinants of, or interact with, social
status that they may be conveniently mentioned here. The terms of
address employed by a person of one sex speaking to a person of another
sex, or by a younger person speaking to an older person, may differ from
those which would be employed in otherwise similar situations by people
of the same sex or of the same age. This phenomenon is so pervasive and
SO apparent even to the casual observer of language-behaviour that
exemplification is unnecessary. The Women’s Liberation movement has
recently drawn attention to some of the linguistic difficulties which stand
in the way of their achieving social equality with men: notably, to the
fact that few of the major languages of the world provide a general
term of address for a woman which is not determined by her marital
status. The sex of the participants is grammatically relevant in many
languages. In Thai men employ one first-person pronoun and women
another, and there are other systematic differences of grammatical
structure; and in a number of other languages in various parts of the
world there are more extensive grammatical differences, as well as
differences in phonology and vocabulary, between the language of men
and women (cf. Haas, 1944; Grootaers, 1952). In the Romance and
Slavonic languages the sex of the participants determines the form of
certain adjectives and certain verb forms according to the category of
gender; and this, it should be noted, unlike the gender agreement which
holds between third-person pronouns or noun phrases and verbs or
adjectives, is wholly a matter of contextual appropriateness. For example,
Je suis heureux and Fe suis heureuse in French (“‘I am happy’”’) are both
grammatically well-formed; the first utterance, however, would normally
be produced by a man or boy, the second by a woman or girl. The
qualification implied by the use of the word ‘normally’ is, as always,
necessary. What counts is not, in principle, the actual sex of the partici-
pants, but the sex that is ascribed to them or they ascribe to themselves
in the situation. A man might be playing a woman’s part in a play, for
example; and there are other obvious situations in which a man might
appropriately say Fe suzs heureuse.
578 Context, style and culture
(ii) The participants must know where they are in space and time. At
first sight, this might appear to be an unnecessary condition to impose
upon the appropriateness of utterances. Consider, however, an utterance-
token like We are having a fine summer here in Queensland this year pro-
duced by someone in Edinburgh in December. It is grammatically and
semantically well-formed, but situationally inappropriate; and it is for
this reason that it is uninterpretable (except, again, under rather special
circumstances). One cannot be having a fine summer during winter in a
place where one is not. The situational inappropriateness of the utter-
ance derives from the fact that ‘here’ is a deictic adverb which refers to
the place where the speaker is (or believes himself to be) at the time of
utterance, and the tense of the verb, as realized in the form are having,
refers to a period of time which contains the point of time at which the
utterance is made. ‘The speaker of a language must control and be able to
correlate at least two different systems of spatiotemporal reference: one
is the deictic system whose co-ordinates are created by the act of utter-
ance itself (cf. 15.1); the other is a culture-specific system for referring
to time and place that is lexicalized in the language he is speaking.
The appropriate use of greetings such as Good afternoon! or Happy
Christmas! is similarly dependent upon the speaker’s knowledge of the
time at which he is producing them. In order to be able to employ them
correctly the speaker must know (in addition to certain other facts) what
counts as afternoon or Christmastide and whether it is indeed the after-
noon or Christmastide at the time of utterance. He can of course
deliberately violate the normal conditions governing the use of such
greetings. For example, he might say Good afternoon! in the middle of
the morning to a colleague arriving late to work; and his utterance will
be understood as situationally appropriate, but ironical. Irony depends
upon and presupposes the participants’ knowledge of the normal con-
ditions of situational appropriateness.
The speaker and addressee are normally in the same spatiotemporal
location; and it is probably true to say that all languages are designed, as__
it were, to operate in such circumstances. Problems of spatiotemporal
reference arise when the participants are separated in space and time.
We have only to think of the difficulties we encounter in this respect
when we make a long-distance telephone-call (e.g., from Great Britain
to the United States). The speaker can either adopt the spatiotemporal
co-ordinates of his own location (greeting the addressee, let us say, with
Good afternoon!) or he can project himself into the spatiotemporal loca-
tion of the addressee (saying Good morning!). But the speaker is not
14.2. Communicative competence 579
completely free with respect to the possibility of projection into his
addressee’s spatiotemporal location: there are restrictions. For example,
if we are in London, speaking (in English) to someone in New York, we
can say, appropriately, either We are going to New York next week or
We are coming to New York next week. We can also say We are going
there next week, and even We are coming there next week (where the deictic
adverb ‘there’ refers to New York). What we cannot say without |
violating the rules which govern the use of ‘here’ is We are coming here
next week (with ‘here’ referring to New York). We are coming here next
week is a perfectly grammatical utterance (more clearly so perhaps than
We are coming there next week, which some speakers of English find un-
acceptable). But it is situationally inappropriate. The use of ‘come’,
unlike the use of ‘here’, allows the speaker to project himself into a
deictic context centred on the addressee.
The conditions under which deictic projection* is permitted (if I may
introduce a term for the phenomenon just illustrated) would seem to
vary, to some degree at least, from one language to another. For example,
the French ‘venir’ and the Italian ‘venire’ (“to come’’) cannot be used
in deictic projection as freely as the English ‘come’ can. Similarly, in
Classical Latin it was possible, when writing letters, to use the so-called
epistolary past tense in referring to events taking place at the time of
writing; and this practice is explained in terms of the writer’s projection
of himself into the situation the receiver would be in when the letter
arrived. The past tense cannot be used in this way in English.
The non-deictic system of spatiotemporal reference was described
above as culture-specific. It is important to realize that there may be
alternative, and even conflicting, systems used by different groups
within a language-community. The Jewish New Year and the Christian
New Year do not coincide; Christmas is celebrated somewhat later by
the members of some of the orthodox churches than it is by the members
of other Christian sects; and so on. The interpretation of phrases like
‘over the New Year’ and ‘at Christmas’, in terms of some external and
neutral system of temporal reference, may vary accordingly. Even more
striking are the discrepancies which arise in different parts of the
English-speaking world in the correlation of local seasonal reference
(e.g., ‘this summer’) with standard calendar reference (e.g., ‘in July’ or
‘in December’). The situational inappropriateness of We are having a
fine summer here this year, said in Edinburgh in December, depends in
part on its violation of the system of local seasonal reference. Our ideal
omnicompetent speaker of English must be able to control and inter-
580 Context, style and culture
relate, appropriately, the deictic system, and a whole set of secular and
religious holidays or feasts. Whether knowledge of the kind should be
included within linguistic competence is a moot point (cf. Leech, 1969:
118). But it certainly belongs to language-competence as this is manifest
in the appropriate or inappropriate use of English.
(v) The participants must know how to make their utterances appro-
priate to the subject-matter; and the importance of subject-matter as a
determinant in the selection of one dialect or one language rather than
another in bilingual or multilingual communities has been stressed by
582 Context, style and culture
such writers as Haugen (1953), Weinreich (1953) and Fishman (1965).
More recently, however, Fishman (1972c) has pointed out that the
greater appropriateness to subject-matter of one language rather than
another in multilingual settings ‘may reflect or be brought about by
several different but mutually reinforcing factors”; and he has suggested
that the selection of one language rather than another may be simply a
consequence of the fact that “certain socio-culturally recognized spheres
of activity are, at least temporarily, under the sway of one language or
variety’.
Crystal and Davy (1969) introduce the term province* for ‘‘the
features of language which identify an utterance with those variables in
an extralinguistic context which are defined with reference to the kind of
occupational or professional activity being engaged in’’; and they make
the point that “‘subject matter, in so far as this is a question of the use of
distinctive vocabulary, is but one factor among many which contributes
to a province’s definition, and in any case has predictive power only
in a minority of extremely specialist situations”. This is undoubtedly
correct.
It does not follow, however, that the semanticist should not be con-
cerned with subject-matter as a contextual variable. Its importance is
revealed as soon as we consider the practical problems of disambiguating
utterances which contain lexemes with more than one sense: e.g., That
plant is an eyesore. If the conversation in which this utterance occurs is
concerned with the layout or appearance of a garden it will presumably
be taken to have a different meaning from the meaning that the same
utterance (i.e. as a token of the same type) would have in a conversation
devoted to the architectural merits of a group of factory buildings. Ad-
mittedly, other situational variables might suffice, in particular instances,
to disambiguate such utterances. But, in principle, our omnicompetent
speaker can talk about anything, whatever occupational or professional
activity he happens to be engaged in at the time and whatever social role
he happens to be performing. The fact that his choice of vocabulary will
be very largely determined by subject-matter may well imply that the
selection of one word rather than another falls outside the scope of
stylistics (“the description of the linguistic characteristics of all situa-
tionally-restricted uses of language’’: Crystal & Davy, 1969: go). But
we cannot, as semanticists, neglect the fact that the speaker can assume,
and normally does so unconsciously, that particular lexemes will be
interpreted by the addressee in one sense rather than another by virtue
of the subject-matter of the utterance in question and previous utter- _
14.2. Communicative competence 583
ances in the conversation. So far, however, little progress has been made
in giving a theoretically satisfying account of this phenomenon.
When research in machine-translation was being actively pursued in a
number of centres in different countries throughout the world some
years ago, it was suggested by certain scholars that homonymous or
polysemous lexemes could be disambiguated by means of a computer-
program which would scan a text and determine its subject-matter in
terms of the occurrence in the text of a preponderance of lexemes from a
certain area of the vocabulary; and this technique is now regularly
employed, with a fair measure of success, in automatic indexing and
information retrieval. In its most sophisticated and_ linguistically
most interesting form, the proposal to disambiguate homonymous or
polysemous lexemes in this way presupposes an analysis of the lexi-
cal structure of the language-system on thesaurus*, or field-theory*
principles (cf. 8.2). It might be assumed, for example, that the noun
‘plant’ would be shown in the thesaurus as belonging to at least two
fields, the one field containing such lexemes (in one of their senses) as
‘vegetable’, ‘bush’, ‘flower’, ‘lawn’, ‘garden’, ‘grow’, ‘prune’,
‘weed’, and the other field containing such lexemes as ‘factory’,
‘machine’, ‘manufacture’, ‘equipment’, ‘building’. The idea under-
lying this approach to the contextual resolution of lexical ambiguities is
intuitively attractive. It is doubtful, however, whether any purely
mechanical, or algorithmic, procedure for disambiguation can be devised
along these lines, even presupposing the existence of an ideal thesaurus
(cf. Bar-Hillel, 1964: 178). Nonetheless, it seems to be an inescapable
fact that the participants’ awareness of the subject-matter is a potential
and frequently relevant disambiguating factor in everyday language-
behaviour, whether this can be accounted for in terms of the co-
occurrence in a text of a relatively large number of lexemes from the
same semantic field or not.
There is another aspect of subject-matter, which relates to the ex-
pressive* function of language (cf. 2.4). This is the selection by the
speaker of elements which make the utterance appropriate to his attitude
towards, or his emotional involvement in, what he is talking about. He
may be ironical, enthusiastic, sceptical, reserved, scornful, sentimental ;
and so on. Although the speaker’s attitude towards the subject-matter
may be influenced by such other situational factors as degree of formality
and the interpersonal relations subsisting between him and the addressee,
it is, in principle, distinguishable from these other factors. For example,
some speakers might avoid using what are generally regarded as obscene
584 Context, style and culture
words in more formal situations, and in informal situations when
addressing a member of the opposite sex, but might use them quite
freely, in relation to the same subject-matter, when talking informally to
some of their own sex; and their employment of such words might be
indicative of their attitude towards the subject-matter, as well as having
the particular social function of promoting solidarity.
(vi) The participants must know how to make their utterance appro-
priate to the province* or domain* to which the situation belongs. ‘The
term ‘province’ has already been introduced, under (v), with its defini-
tion by Crystal & Davy (1969). The term ‘domain’ is taken from Fish-
man (1965), who defines a domain as a “cluster of social situations
typically constrained by a common set of behavioral rules”’ and relates
it to “those ‘generally termed’ spheres of activity which have more
recently been independently advanced by others interested in the study
of acculturation, intergroup relations and bilingualism’’. Yet a third
term that is quite widely used in the recent literature of linguistics and
stylistics is register*, which has been defined in terms of systematic
variation “‘by use in relation to social context’? (Leech, 1966: 68; cf.
Halliday, McIntosh & Strevens, 1964: 77; Strang, 1968: 21). ‘Register’,
however, is commonly held to subsume, not only the phenomena
covered by ‘province’ and ‘domain’, but also subject-matter.
Scholars who have been concerned with systematic variation of the
kind that we are referring to here would be among the first to admit that,
whatever technical terms they may employ, their theoretical discussions
and classification of the phenomena are tentative and provisional. Fish-
man (1965) relates the concept of the domain of language-behaviour, on
the one hand, to subject-matter, and on the other, to locale* and role-
relations. He points out that “most major social institutions are associa-
ted with a few primary locales’’. For example, the domain of the family
is primarily associated with the home; the domain of religion is primarily
associated with the church; the domain of employment is primarily
associated with the office or factory; and so on. Within each domain a
variety of characteristic reciprocal role-relations (and their converses)
can be identified: mother-to-father, wife-to-husband, parent-to-child;
priest-to-parishioner; secretary-to-boss; etc. The locale of the utterance
and the role-relations of the participants tend to be mutually reinforcing
and congruent; and they also tend to be congruent with the subject-
matter. But they can be incongruent; and, in such cases, one can
investigate which of the components, if any, is dominant in the determina-
14.2. Communicative competence 585
tion of the structure of the utterance. “If one meets one’s clergyman at
the race track the impact of the locale on the topics and role-relation-
ships that normally obtain is likely to be quite noticeable’? (Fishman,
1972c: 22). Fishman is mainly concerned with the establishment and
validation of a theoretical framework within which one can describe, and
perhaps explain, systematic variation in language-behaviour (code-
switching*) in diglossic or multilingual communities. The contextual
variables that he and other sociolinguists have discussed in connexion
with the notion of domain are equally important, however, in the analysis
of the situational appropriateness of utterances in what are normally
regarded as monolingual communities.
‘Province’ (as used by Crystal & Davy, 1969: 71ff) is narrower in
scope than ‘domain’; and it fits into a somewhat different analysis of the
major situational variables. Province features are defined “‘ with reference
to the kind of occupational or professional activity being engaged in’’,
and they are said to “provide no information about the people involved
in any situation — about this social status or relationship to each other,
for example’’. Conversation is regarded as a province, but the point is
made that “conversation is different from all other provinces in that it
is the only case where conventional occupational boundaries are irrele-
vant’’. Other provinces in English include the language of public
worship, advertising, newspaper reporting, science and law; and samples
of texts in some of these provinces are discussed in detail by the authors.
utterances. |
that is implied by ‘decontextualization’: they are derived from utter-
ances by the elimination of all the context-dependent features of
14.2. Communicative competence 589
Spoken utterances of everyday conversation tend to be heavily con-
text-dependent, as well as being characterized by errors and other
performance-phenomena, which, we are assuming, are eliminable by
regularization. One aspect of context-dependence is manifest in what is
traditionally called ellipsis. A conversation consisting entirely of gram-
matically complete text-sentences would generally be unacceptable as a
text; and it is part of the language-competence of a speaker of the
language (if not of his linguistic competence in the narrower sense) that
he should be able to produce grammatically incomplete, but con-
textually appropriate and interpretable, sentence-fragments*. For
example, the utterance As soon as I can (produced with the appropriate
stress pattern and intonation) might occur in a text in reply to an
utterance (intended and taken as a question) such as When are you
leaving? The grammatical structure of the context-dependent sentence-
fragment As soon as I can, and at least part of its meaning, can be
accounted for by describing it as an elliptical, appropriately con-
textualized, version of the utterance I’m leaving as soon as I can. Ellipsis,
then, is one of the most important and one of the most obvious effects of
contextualization; and decontextualization, in the case of sentence-
fragments such as the one just illustrated, consists in supplying some
element or elements from the preceding co-text.®
Ellipsis is not the only phenomenon to be taken into account in the
decontextualization of text-sentences or sentence-fragments. There is a
whole range of other phenomena, including the use of pronouns, the
definite article, word-order, sentence connectives and such prosodic
features as stress and intonation. Any of these features may suffice to
make a text-sentence or sentence-fragment context-dependent. For
example, the text-sentence I haven’t seen him before cannot be inter-
preted unless the referent of the pronoun ‘he’ can be correctly identified
by the hearer; and the referent will normally have been mentioned in
the preceding co-text. The different, but related, text-sentence I haven't
seen him before (where the pronoun ‘he’, in its form him, bears heavy
stress) is also context-dependent; but the referent of ‘he’ need not have
been mentioned in the co-text. The referent might be some person in
the situational context, who is identified paralinguistically by the speaker
as he makes the utterance (e.g., with a gesture of the hand or a movement
of the head). There is some disagreement among linguists as to how
3 It is not being suggested that everything that would be described traditionally
as a sentence-fragment is to be treated as the product of ellipsis (cf. Allerton,
1975; Shopen, 1973).
590 Context, style and culture
many of these phenomena should be accounted for as part of the struc-
ture of system-sentences. Here we are concerned to emphasize that the
grammatical and semantic coherence of text-sentences and text-frag-
ments within a text is but one aspect of the global problem of contextual
appropriateness; and that it cannot be handled without taking into
account situational factors and the non-linguistic features of utterances
and their co-text.
If linguistic semantics is taken to be that branch of semiotics which
deals with the way in which meaning (of all kinds) is conveyed by lan-
guage, it must be accepted that a comprehensive theory of linguistic
semantics will need to be based upon, or include, a theory of contextual
appropriateness, It is arguable, however, that, at the present time at
least, the construction of such a comprehensive theory of linguistic
semantics is too ambitious a task. There are various ways in which we
can set about constructing a partial theory of linguistic semantics, or a
set of partial theories, each of which will abstract from, or take for
granted, facts which other theories try to systematize and explain. One
such partial theory, which might be appropriately described as a theory
of microlinguistic semantics, would be restricted to the analysis of the
meaning of maximally decontextualized system-sentences. It would be
concerned with the sense and reference of linguistic expressions, with
the range of semiotic functions (or speech-acts: cf. 16.1) that can be
performed by the utterance of particular sets of sentences, with the
implications and presuppositions which hold between the propositions
expressed by sentences (assuming that the sentences are uttered under
certain standard conditions) and with the validation of these proposi-
tions in terms of truth-conditions holding in some actual or possible
world. It would not be concerned, except incidentally and minimally,
with socio-culturally determined variation, with textual coherence or
with the other aspects of contextualization mentioned in this section.
Much of the recent work in the formal analysis of meaning in language
falls within the scope of microlinguistic semantics as we have just
defined this field. Provided that it is appreciated that the distinction of
microlinguistics from sociolinguistics or stylistics is a purely methodo-
logical distinction, based upon the linguist’s regularization, standardiza-
tion and decontextualization of utterances, there 1s much advantage to
be gained from the deliberate neglect in microlinguistic semantics of
contextual appropriateness. Within the restricted framework of micro-
linguistic semantics, we can give a satisfactory account of the sense of
most lexemes in the vocabularies of languages and, no less important,
14.2. Communicative competence 5g!
we can investigate the way in which the grammatical structure of
system-sentences determines their meaning and their characteristic
semiotic function in utterances; and this is clearly one of the central
tasks of linguistic semantics.
Microlinguistic semantics, as it has been developed so far, deals
primarily with descriptive meaning. Language, however, is not merely
an instrument for conveying factual information; it also serves a variety
of social and expressive functions. Indeed, as we have already seen, it is
difficult, in the last resort, to draw a sharp distinction between the
descriptive and the interpersonal functions of language (cf. 2.4). No
satisfactory and comprehensive theory of semantics can afford to neglect
social and expressive meaning in language; and in doing so it must draw
fully upon the notion of contextual appropriateness. If this is held to
fall within the scope of sociolinguistics or stylistics, then at least this
part of sociolinguistics and stylistics is to be included in linguistic
semantics; and it should always be borne in mind that methodological
distinctions within linguistics do not necessarily reflect any inherent
differences in the internalized system of rules which underlie language-
behaviour.
In a previous chapter, it was pointed out that the distinction fre-
quently drawn by semioticians between semantics and pragmatics* was
of uncertain applicability in the analysis of meaning in natural languages
(cf. 4.4). One way of drawing it by definition (with respect to micro-
linguistics) is to say that microlinguistic semantics deals with the
meaning of maximally decontextualized system-sentences and that micro-
linguistic pragmatics studies the meaning that these sentences have
when they are uttered (as text-sentences) in particular classes of con- }
texts. One can perhaps study the meaning of propositions, and their
truth-conditions in relation to possible worlds, without invoking the
notion of the context-of-utterance. But one cannot get from sentences
to the propositions expressed by them (even supposing that we would
wish to do so) without taking account of certain contextual features (cf.
Stalnaker, 1972: 383). It is for this reason that we have said that
microlinguistic semantics deals with the meaning of maximally, rather
than fully, decontextualized system-sentences. The context-dependence
of many system-sentences (and hence the necessity of invoking prag-
matic concepts in the analysis of their meaning) is especially clear in the
case of sentences containing deictic elements (cf. 15.1).
592 Context, style and culture
14.3. Conversational implicatures and presupposition
The term implicature* was introduced into the philosophy of language
by Grice in his William James lectures in 1967/8 (cf. Grice, 1975). It is
now quite widely employed, not only by philosophers, but also by
linguists.
As we have already seen, the term ‘implication’ is normally used in
philosophical semantics to refer to the truth-functional relation of
material implication; and this in turn is distinguished from strict im-
plication, or entailment (cf. 6.2). In everyday usage, the words ‘implica-
tion’ and ‘imply’ are used in what appears to be a quite different sense
from that which is associated by definition with the operation of material
implication under the standard interpretation of the propositional cal-
culus (hence the so-called paradoxes of material implication) and in
: what is certainly a much broader sense than that borne by the terms
‘entailment’ and ‘entail’ in philosophical semantics. Grice’s notion of
implicature 1s intended to cover at least some of the difference between
the broader, everyday, notion of implication and the narrower, philoso-
phical, notion of entailment. He is also concerned to show how implica-
tures co-operate with, and supplement, material implication in the
everyday use of language: we will not go into this part of his programme.
The notion of implicature rests upon a distinction between what is
actually said and what is implied (but not entailed) in saying what is
said. As we shall see later, there are various senses in which the verb
‘say’ can be interpreted. At least two of these are relevant in the present
connexion: “‘say,’’ and “‘say,”’ (cf. 16.1). For example, if someone says,
(i.e. utters a token of the utterance-type that is conventionally repre-
sented as) It 1s cold in here he would normally be saying, (i.e. asserting
the proposition) that it is cold where he is. Considerable attention has
been devoted by Grice and other philosophers of the so-called ordinary-
language school (cf. 6.1) to the analysis of the conditions under which
in saying X one can be held to have said,, and to have meant, that p
(where X is an utterance-signal and.p is a proposition). They have also
been much concerned with explicating the notion of meaning in terms
of which it is reasonable to assert that one can say (i.e. “‘say,’’) that p
without meaning that p. It turns out that in one sense of ‘mean’ we
cannot say that p without meaning that p; and the reasons why this is so
will occupy us later (cf. 16.1). There is a different, but undoubtedly
related, sense of ‘mean’, however, in which it is possible, in saying that
p, to mean that q (p % q), instead of, or in addition to, p. For example,
in saying, that it is cold where one is (by saying, Jt 7s cold in here) one
14.3. Conversational implicatures and presupposition 593
might mean, or be implying, that the heating should be turned up, that
one’s host pays more heed to his fuel bills than to the comfort of his
guests, and so on. Given that certain conditions are satisfied, we shall be
entitled to say (in terms of Grice’s notion of implicature) that these
various additional propositions are implicated*, though not asserted:
they are implicata* of the utterance It 1s cold in here (under certain con-
textual conditions).
Grice distinguishes two kinds of implicature: conventional and con-
versational. The difference between them is not always clear-cut in
particular cases. In principle, however, the difference seems to be that,
whereas a conventional implicature depends upon something additional
to what is truth-conditional in the normal (i.e. conventional) meaning of
words, a conversational implicature derives from a set of more general
conditions which determine the proper conduct of conversation. It is the
so-called conversational implicatures with which we are concerned here;
and henceforth the terms ‘implicature’ and ‘implicate’ will be used
without qualification in this narrower sense.
The conditions from which implicatures derive are formulated by
Grice as maxims, grouped under the four headings of quantity, quality,
relation, and manner.
Quantity. (i) Make your contribution as informative as is required (for
the current purposes of the exchange); (ii) do not make your contribu-
tion more informative than is required.
Quality. Try to make your contribution one that is true: (1) Do not
say what you believe to be false; (i1) Do not say that for which you lack
adequate evidence.
Relation. Be relevant.
Manner. Be perspicuous: (1) Avoid obscurity of expression; (11) Avoid
ambiguity; (iii) Be brief (avoid unnecessary prolixity); (1v) Be orderly.
All of these maxims relate in a fairly obvious way to the more general
purpose of promoting the efficient communication of propositional
information. They are inherently restricted, therefore, to what we have
identified as the descriptive function of language. But much, if not most,
of the semantic information contained in everyday language-utterances
is social and expressive, rather than descriptive (cf. 2.4). In so far as
Grice’s maxims are inapplicable in the analysis of utterances whose
function is something other than that of augmenting the addressee’s
store of propositional knowledge, they need to be supplemented and
qualified in various ways. It has been pointed out, for example, that
politeness and consideration for the feelings of one’s addressee may
594 Context, style and culture
impose requirements that are in conflict with any or all of Grice’s
maxims (cf. R. Lakoff, 1973).
The usefulness of Grice’s maxims is further reduced by the generality,
not to say vagueness, with which they are formulated. We have already
looked at some of the problems that arise in connexion with the quanti-
fication of semantic information (cf. 2.3). As for relevance and perspi-
cuity, it is, if anything, far more difficult to evaluate utterances in terms
of these two properties than it is to quantify the amount of semantic
information in an utterance. The fact that Grice’s maxims have not
been, and perhaps cannot be, fully formalized makes his notion of
implicature rather less precise than a logician would like it to be.* It is
undeniable, however, that, whether they are fully formalizable or not,
the pre-theoretical notions that Grice has dealt with in his formulation
of the maxims of quantity, quality, relevance and manner have an im-
portant explanatory role to play in the semantic analysis of texts.
By appealing to the maxim of quantity, for example, we can account
for the fact that, if X says to Y .
(1) Have you finished your homework and put your books away?
and Y replies
(2) I have finished my homework,
X can reasonably infer that Y has not put his books away. Presented
with the conjunction of p and q, Y has deliberately chosen to assign a
truth-value to just one of the conjuncts, p, when he might have assigned a
truth-value to the whole conjunction, p & g (by saying yes), if not only p,
but also g, were true. Given that X has no reason to believe that Y is
violating the maxim of quantity (or any of the other maxims), X is
entitled to assume that g is false. At the same time, it is obvious that p
does not entail ~g. Nor can Y be held to have asserted ~g (or, alter-
natively, to have denied q: cf. 16.4). He has merely implicated ~q; and
he has done so by his failure to assert g (in a context in which he could
be expected to assert q).
Taken together, the maxims of quantity and quality can be invoked,
as we shall see later, to. account for the fact that, if someone says J think
it’s raining or It may be raining, he can be held to have implied that he
does not know for certain that it is raining (cf. Caton, 1966; Ducrot,
1972). According to the maxim of quantity, we should be as informative
4 They are partially formalized, and fully discussed in relation to the notions of
presupposition and implicature, in Gazdar (1976).
14.3. Conversational implicatures and presupposition 595
as we need to be. The proposition “It may be raining”’ is less informa-
tive than the proposition “It is raining’’, since it is compatible with both
“It is raining”’ and “It is not raining’’. The speaker would presumably
have said It 1s raining, without qualifying in any way his own commit-
ment to the truth of the proposition “It is raining’’, if he had known for
certain that it was raining. For knowledge that p is true constitutes
adequate evidence for asserting p. It follows that, by saying either J think
it’s raining or It may be raining, the speaker can normally be held to
imply (i.e. to implicate) that he does not have the evidence that would
enable him to make the more informative assertion It 1s raining. On the
other hand, if the speaker, having said It is raining, is asked Why do you
think it’s raining?, he can quite reasonably, though at first sight illogic-
ally, reply, I don’t think it’s raining: I know tt is, It is interesting to note,
in this connexion, that, in the everyday use of language, not only Jt may
be raining and I think it’s raining, but also It must be raining and I know
it’s raining, involve a weakening of the speaker’s commitment to the
truth of the proposition “It is raining’? (cf. 17.2). This too can be
explained in terms of the Gricean maxims: if the speaker’s evidence is
unimpeachable or his commitment to the truth of p so firm that there
is no doubt at all in his mind that p is true, he will not feel obliged to
make explicit the fact that this is so. By being more informative, in this
respect, than he need be, he draws the addressee’s attention to the
possibility that the evidence for p is not as strong as it ‘might be.
It is characteristic of implicatures that derive from the maxim of
quantity, though perhaps not of others, that they can be explicitly can-
celled or qualified by the speaker ; and in this respect they differ sharply
from entailments. For example, if X said to Y
(3) I tried to telephone Fohn yesterday,
it would normally be reasonable for Y to infer that X had failed to con-
tact John. But this implicature can be cancelled without contradiction.
If Y asks X
® For a different view cf. Hull (1975), Keenan & Hull (1973).
14.3. Conversational implicatures and presupposition 599
A fourth important class of utterances in connexion with which the
notion of presupposition has been invoked recently by linguists is that
of utterances containing so-called factive* verbs (cf. 17.2). Anyone who
says
(21) fohn realizes that tt is raining
(in order to make a statement) is committed by his use of the verb
‘realize’ to the truth of the proposition expressed by the complement-
clause: he presupposes that it is raining. There are very many English
verbs, notably ‘know’, that are factive in this sense; and most of them
(but not ‘know’) can be grouped together in terms of several syntactic
characteristics that they share (cf. Kiparsky & Kiparsky, 1970). This
kind of presupposition, it will be observed, also holds constant under
negation and interrogation (except under certain conditions of context-
dependency that we will temporarily disregard). Hence the peculiarity
of an utterance like
(22) I don’t know that it 1s raining
(with don’t unstressed) — provided that it is construed as meaning
(23) “That it is raining is a fact of which I am unaware’’,
rather than
(24) “‘I am inclined to doubt that it is raining’’.’
If (22) is construed as meaning (24), it is, of course, perfectly acceptable.
The four kinds of presupposition that have been mentioned so far
differ from one another in various ways. But each of them can be seen as
involving a fairly natural sense of the pre-theoretical term ‘ presupposi-
tion’. In each case it is reasonable to say that the speaker, in making an
assertion or asking a question, assumes or presupposes that something is
so. For example, if X says to Y
(25) Why does God tolerate man’s wickedness?
’ There are circumstances in which (22) is interpretable as meaning (23): cf.
Let us suppose (for the sake of argument) that I don’t know that it’s raining, in
the utterance of which the speaker may be well aware of the fact that it is
raining and yet invite his addressee to operate, hypothetically, with the
assumption that he, the speaker, is not aware of this fact. In such circum-
stances (22) is not peculiar under this interpretation; otherwise it is. As for
(24): it is perhaps only in certain dialects of non-literary English that (22)
will support this interpretation ; and when it does, it may have a characteristic
rhythm and intonation-contour.
600 Context, style and culture
Y can quite reasonably say, in ordinary non-technical English,
(26) I do not accept the presuppositions that you are making
or
(27) I do not accept the presuppositions of your question.
In saying either (26) or (27), Y may be challenging any or all of the
following propositions (and perhaps others): ‘“‘God exists”’, ‘Mankind
is wicked’’, ‘‘God tolerates man’s wickedness’’. The difference between
(26) and (27), it will be noted, is that, whereas (26) treats presupposition
as a relation between persons and propositions (i.e. what they hold to
be true and would be prepared to assert), (27) treats presupposition as a
relation between utterances and propositions. Given that the verb
‘presuppose’, in its pre-theoretical sense, is more or less synonymous
with ‘assume’, we can perhaps legitimately infer that, in its pre-
theoretical sense at least, ‘presuppose’, like ‘assume’, denotes primarily
a relation between person and utterances (i.e. utterance-signals as tokens
of certain types: cf. 1.6). In other words, ‘presuppose’, in its pre-
theoretical sense, would seem to be primarily a verb of propositional
attitude: it seems to be more like ‘assume’ (or “believe’) than ‘entail’.
It is hardly surprising that the truth-conditional definition of pre-
supposition, in which presupposition is taken to be a relation that is
logically comparable with entailment, should fail to apply to everything
that falls within the scope of the pre-theoretical notion of presupposition.
Faced with the fact that the truth-conditional definition of presupposi-
tion, which, for the moment at least, we are assuming to be applicable to
statements like The King of France 1s bald, cannot be applied to the
other kinds of presupposition, some scholars have opted for a distinction
between so-called semantic presupposition, defined in terms of truth-
conditions, and so-called pragmatic presupposition (cf. Keenan, 1971).®
However, in attempting to formalize the so-called semantic notion of
presupposition in such a way that it 1s both coherent and distinct from —
entailment, they have been forced to extend the two-valued propositional
calculus. One extension consists in admitting a third truth-value into
the system. Another, which comes closer perhaps to formalizing the
notion of presupposition that Strawson invoked against Russell, consists
: in allowing formally for the possibility of truth-value gaps*: i.e. the
8 The term ‘so-called’ is used here simply to draw attention to the fact that in
the present work ‘semantic’ is not restricted, in opposition with ‘pragmatic’,
to what can be handled by means of a truth-conditional theory (cf. 4.4, 6.6,
14.2).
14.3. Conversational implicatures and presupposition 601
possibility that certain statements should lack a truth-value. We will not
go into either of these alternatives. It has been cogently argued, recently,
that the so-called semantic approach to the definition of presupposition
fails to handle certain crucial cases without otherwise unmotivated
adjustments and qualifications (cf. Kempson, 1975; Wilson, 1975). It
has also been argued that much of what proponents of truth-conditional
definitions of presupposition have taken to be presupposition is in fact
entailment.
So far we have assumed that the truth-conditional definition of pre-
supposition is indeed applicable to such statements as The King of
France is bald. But Russell, of course, took the view that the proposition
it expresses is false; and there are many that would agree with him.
What is not emphasized as strongly as it ought to be in linguistic treat-
ments of presupposition is that it is rather pointless arguing whether a
statement has a determinable truth-value or not, unless we know what
statement we are in fact discussing and what its thematic structure 1s.
Not only is this rarely made clear, but the notion of presupposition has
all too often been discussed, by linguists and by philosophers, in terms of
sentences, rather than utterances. As we saw earlier, several thematically
distinct statements (as well as several thematically distinct utterances of
other kinds) may be put into correspondence with the same system-
sentence (cf. 12.7). What is being said about what — what comment is
being made about what topic — depends upon the thematic structure of
the utterance. In so far as there is any pre-theoretical dispute as to
whether the proposition expressed by The King of France 1s bald is false,
on the one hand, or neither true nor false, on the other, this can be
explained, at least partly, by the possibility of taking ‘the King of
France’, in different tokens of this utterance-type, to be thematic or not.
If it had been previously asserted by X that no currently reigning
European monarch happened to be bald and then Y said, in all serious-
ness, The King of France is bald, X could quite reasonably retort That’s
not true — there is no King of France. Even if it turned out that Y was
referring to Giscard d’Estaing, it would be reasonable, in this context, to
say that what he said was false, by virtue of the failure of its existential
presupposition. The reason is that, in the context that we have just
constructed, ‘the King of France’ is not the theme. It has been asserted
by X that the class of reigning European monarchs contains no bald-
headed member. Y’s counter-assertion that this class contains the King
of France is reasonably described as false, independently of whether the
person that Y is referring to happens to be bald or not. For the point at
602 Context, style and culture
issue, in this context, is not whether a particular person is bald or not,
but whether there are any bald-headed, currently reigning European
monarchs (cf. Cooper, 1974: 36ff). It is only when a referring expression
is thematic that failure of the existential presupposition results in what
Strawson and those who take the same view as he does would call a
truth-value gap; and Strawson himself appears to have come, more
recently, to the same conclusion (cf. Strawson, 1964b).
One further point is worth making about The King of France is bald.
This is that, if ‘the King of France’ is construed, not as a definite
description, but as a title whose relationship to its bearer is like that of a
name to its bearer (cf. 7.5), the existential presupposition is one that
cannot be captured by means of the proposition “‘There is a king of
France’’.® The fact that this is so further limits the usefulness of truth-
conditional definitions of presupposition, even in respect of the existen-
tial presupposition of referring expressions, for which, at first sight, they
seem to be especially appropriate. Names are not true of their bearers in
the way that expressions that denote the defining property of a class are
true of the members of that class. And yet, when names (and titles that
are arbitrarily associated with their bearers) are used as referring expres-
sions, they do not differ from definite descriptions as far as their existen-
tial presuppositions are concerned. What counts is whether there is a
referent that is appropriately referred to by means of the expression in
question. This is a more general condition than is the satisfaction of a
particular set of truth-conditions.
We will say no more about such classic examples as The King of
France is bald. 'The existential presuppositions of referring expressions
were fully discussed in a previous chapter (7.2): it suffices here to re-
emphasize the importance of drawing a distinction between correct and
successful reference and to insist upon the fact that reference is always,
in principle, context-dependent.
So too, it would seem, is any linguistically useful notion of presupposi-
tion. Given that the truth-conditional definition of presupposition is, to
say the least, of very restricted coverage and cannot be applied to actual
or potential utterances unless certain assumptions are made about the
thematic structure of the utterances and about the contexts in which
® Suppose, for example, that, although France was still a republic, X had
conferred upon him (not necessarily by any official institution in France) the
courtesy-title ‘the King of France’. X could then be correctly referred to by
means of this expression and the proposition “‘X is the King of France”
would be true, but not “X is (a) king of France’”’, and still less ‘‘ X is (a) king’’.
14.3. Conversational implicatures and presupposition 603
they occur, there would seem to be little point in drawing a theoretical
distinction between two kinds of presupposition in terms of the dis- :
tinction between semantics and pragmatics — a distinction which is, in
general, of doubtful value as far as the analysis of the structure of natural
languages is concerned (cf. 4.4).
In addition to the four kinds of presupposition that have been men-
tioned so far, there are others that have been discussed recently by
linguists, which further extend the notion. McCawley (1968) has said
that the adjective ‘buxom’ carries with it the presupposition that who-
ever it is applied to is female, so that
(28) My neighbour is buxom
will be understood as implying that the referent of ‘my neighbour’ is |
female. Similarly, it has been suggested, the meaning of ‘bachelor’ can
be split into two parts: what is presupposed, that the entity to which
‘bachelor’ ts applied is male, adult and human, and what is asserted,
that the entity in question 1s not married (cf. Fillmore, 1971a). G. Lakoff
(19714) has said of
(29) John told Mary that she was ugly and then shé insulted him
(where she and him bear heavy stress) that it carries the presupposition
that to tell someone that she is ugly is to insult her. Keenan (1971) says
of the French utterance
(30) Tu es dégotitant
(“You are disgusting’) that it (pragmatically) presupposes that ‘‘the
addressee is an animal, child, socially inferior to the speaker, or per-
sonally intimate with the speaker’’. Fillmore (1971b) has said of |
(31) John accused Harry of writing the editorial
that it presupposes that John regarded the writing of the editorial as
something reprehensible; and of
(32) Please open the door
that it presupposes that, at the time of the utterance, the door is shut and
the addressee is in a position to comply with the request that is addressed
| to him. |
This is a somewhat heterogeneous set of examples. In each case it is
reasonable to say that the term ‘ presupposition’ is being used in a way
that is consistent with its everyday pre-theoretical sense. But any
604 Context, style and culture
theoretical concept of presupposition which covers all these cases of
what may be pre-theoretically classified as presupposition is likely to be
too broad to be of any real value. It might be suggested, for example, that
the presuppositions of an utterance are the conditions that it must
satisfy, if it is to be interpretable and appropriate, in the context in
which it occurs. This kind of definition would certainly cover everything
that has been classified in the recent literature as presupposition. But it
would cover much else besides, including everything in the context that
determines the form or interpretation of an utterance.
One way of narrowing the definition is by talking of propositions to
whose truth the speaker is committed, rather than of conditions that the
utterance must satisfy. We might say, for example, that an utterance
presupposes a proposition p if and only if the speaker assumes that p is
true and assumes that the addressee also assumes that p is true (cf.
Karttunen, 1973). The problem then arises as to what is meant by
assuming that a proposition is true. For example, one can presumably
assume that one’s addressee 1s one’s social inferior and demonstrate by
one’s behaviour that one has made this assumption without having
entertained at any time the specific proposition “The addressee is my
social inferior’’. The point is that there is a distinction to be drawn, in
principle, between the belief that something is so and the belief that a
certain proposition is true. We will not go into the problems of making
this distinction precise. It is worth pointing out, however, that, in saying
(21) fohn realizes that it 1s raining, the speaker 1s committed to the truth
of the specific proposition ‘It is raining’’, which 1s part of the propo-
sitional content of the utterance. But, even if we feel entitled to say that
by uttering (29) John told Mary that she was ugly and then shé insulted
him the speaker commits himself to the truth of a proposition, we cannot
be sure what proposition this is. Is it “’T’o tell someone that he is ugly
constitutes an insult” or “To tell a girl that she is ugly constitutes an
insult’’? Or is it some other proposition, more general or more specific?
There is no way of telling from the utterance itself and the meaning of
_ its verbal and non-verbal component.
_ Most of the definitions of presupposition to be found in the recent
literature take the presuppositions of an utterance to be a set of propo-
sitions. An alternative (though not necessarily incompatible) view is that
they are the conditions that must be satisfied before the utterance can be
used felicitously to perform its function as a statement, a question, a
promise, a request, etc. (cf. Fillmore, 1971b). This notion of the
felicity-conditions* of an utterance is something that we shall come back
14.3. Conversational implicatures and presupposition - 605
to (cf. 16.1). Here it is sufficient to make two points. The first is that the
felicity-conditions of an utterance need not be described as propositions
_ to whose truth the speaker subscribes, though they can be, and fre-
quently are, so described. The second, and more important, point is
that, in saying that the presuppositions of an utterance are necessary
conditions for its felicitous use, we are still operating with a very broad
notion of presupposition, unless we distinguish between various kinds
of felicity-conditions. It has been argued by Cooper (1974) that the
conditions that count as presuppositions are all ontological, in that they
have to do, not necessarily with existence, but with whatever kind of
ontological satisfaction is appropriate to the entity, state-of-affairs, event,
process, etc., in question. ‘I'he existential presuppositions of expressions
that refer to individuals (first-order entities: cf. 11.3) would thus be no
different in kind from the presuppositions of utterances containing fac-
tive verbs. Referring expressions presuppose that certain entities exist;
and existence is the ontological condition for first-order entities: factive
utterances presuppose that certain states-of-affairs obtain, that certain
events occur, etc.; and obtaining, occurrence, etc., is the ontological
condition for states-of-affairs, events, etc. (which, in so far as they may
be referred to as entities in particular languages, are second-order
entities: cf. 11.3).
This view of presupposition has the advantage that it provides a.
unified and theoretically motivated account of most of what has been
considered, pre-theoretically, to be a case of presupposition. It does so
by emphasizing reality rather than truth. For example, instead of saying
that (29) presupposes the truth of some specific proposition, it says that
(29) presupposes that a certain event (viz. John’s insulting Mary) took
place. Similarly, instead of saying that (g) presupposes the truth of
‘There is-a King of France’’, it says that (9) presupposes the existence
of some entity that is identifiable (in context) by means of the expres-
sion ‘the King of France’; and, as we have seen, this is a more defensible
point of view.
What has just been said of presuppositions can also be said of im-
plicatures. Earlier in this section we described the implicata of an
utterance as propositions. But it will now be obvious that it is often more
plausible to say that a speaker implicates that something exists, is so or
has occurred than that he implicates some determinable set of specific
propositions. What then is the difference between implicature and
presupposition?
Pre-theoretically, the difference would seem to be that, whereas what
606 Context, style and culture
is presupposed is what the speaker takes for granted and assumes that
the addressee will take for granted as part of the contextual background,
what is implicated 1s what the addressee can reasonably infer, but is not
necessarily intended to infer, in the context in which the utterance
occurs, from what 1s said or 1s not said. There is nothing in this pre-
theoretical account of the difference between them, it will be observed,
to prohibit the possibility of one and the same fact being both pre-
supposed and implicated. Hence the various attempts that have been
made recently to subsume presuppositions under the notion of implica-
ture and to account for their presence in terms of Grice’s maxims of
quantity, quality, relation and manner. So far, however, there is far from
being general agreement as to the feasibility of accounting for pre-
supposition in this way.
It is generally agreed that implicatures can be cancelled or qualified
in particular contexts. If it is conceded that presuppositions cannot (and
our pre-theoretical characterization of presupposition would suggest
that they cannot), there would be at least this difference between
implicatures and presuppositions. This difference has been challenged
by several scholars (cf. Wilson, 1975). It has been challenged, however,
on the basis of a truth-conditional theory of semantics, a controversial
view of negation and entailment, and the failure, or refusal, to draw a
distinction between sentences and utterances, on the one hand, and
between propositions and facts, on the other. Needless to say, most of
the argumentation in this area (including such argumentation as there
has been in this section) is very heavily theory-dependent. That being so,
it is almost impossible to compare one view of presupposition with
another, within a common terminological and conceptual framework,
without thereby prejudicing the decision one way or the other. What has
been said about implicature and presupposition in this section is no
more than a very general, non-technical and, for the most part, pre-
theoretical introduction to the two notions. There is by now a quite
these notions.1° ,
considerable technical literature devoted to the problems of formalizing
tion. |
linguistic form. But they obviously do not satisfy the rest of the defini-
language-behaviour. |
stylistics, is not generalizable to the most typical and most basic kind of
15.1. Person-deixis
The term ‘ deixis’ (which comes from a Greek word meaning “pointing”
or “‘indicating’’) is now used in linguistics to refer to the function of
personal and demonstrative pronouns, of tense and of a variety of other
grammatical and lexical features which relate utterances to the spatio-
temporal co-ordinates of the act of utterance.1 As employed by the
Greek grammarians, the adjective deictic* (‘deiktikos’) had the sense of
‘‘demonstrative’’, the Latin ‘demonstrativus’ being the term chosen by
the Roman grammarians to translate ‘deiktikos’ in the works of the
Stoics, of Dionysius Thrax and of Apollonius Dyscolus, which laid the
foundations of traditional grammar in the Western world. It is worth
noting that what we now call demonstrative pronouns were referred to
as deictic articles in the earlier Greek tradition and that the Greek word
‘arthron’, from whose Latin translation, ‘articulus’, the technical term
, article* derives, was no more than the ordinary word for a link or joint.
It was only in the later tradition that the Greek equivalent of ‘pronoun’
was used; and this fact is of some significance. The point is that in early
Greek, no sharp distinction can be drawn, in terms of their forms or
syntactic and semantic function, between demonstrative pronouns, the
definite article and the relative pronoun: the term ‘article’ was at first
applied to them all, and it was chosen, presumably, because they were
regarded as connectives of various kinds.
The term ‘pronoun’ carries quite different implications from ‘article’.
It suggests that the characteristic function of pronouns is to operate as
substitutes for nouns. But to say that pronouns deputize syntactically
and semantically for nouns and that this is their primary, or basic,
1 On deixis in general, cf. Antinucci (1974), Benveniste (1946, 1956, 1958a),
Buhler (1934), Collinson (1937), Fillmore (1966, 1970), Frei (1944), Hjelmslev
(1937), Jakobson (1957), Kurylowicz (1972). The account of deixis given here
draws, eclectically, upon a variety of additional sources, not all of which have
been listed in the Bibliography.
15.1. Person-deixis 637
function is seriously misleading in two respects. First of all, it fails to
draw the distinction between nouns and nominals (cf. 11.3): pronouns
are referring expressions, and they are syntactically equivalent to
nominals, not nouns. Secondly, to say that pronouns are primarily sub-
stitutes, whether for nouns or nominals, is to imply that their anaphoric*
function is more basic than their deictic function. The difference
between deixis and anaphora, and the connexion between them, will be
discussed in the present chapter (15.3); and we shall see that it is deixis
that is the more basic of these two kinds of pronominal reference. The
term ‘pronoun’ is now so well entrenched in the technical vocabulary of
linguistics that it would be futile to attempt to dispense with it. We must
be wary, however, of its traditional implication of substitutability for
nouns (or nominals).
The fact that the Latin-based term ‘demonstrative’ has been special-
ized in linguistic terminology in the sense that the Greek grammarians
gave to ‘deiktikos’, enables us to employ the terms ‘deictic’ and
‘deixis’ in a wider sense; and this is now common practice in linguistics.
As we shall see, deixis covers not only the characteristic function of the
demonstrative pronouns, but also tense and person, and a number of
other syntactically relevant features of the context-of-utterance. Deixis
is also involved in the philosophical notion of ostension*, or ostensive |
definition* (cf. 7.6); and it is worth noting that ‘ostensive’, ‘deictic’
and ‘demonstrative’ are all based upon the idea of identification, or
drawing attention to, by pointing. So too is Peirce’s term ‘indexical’,
which has been employed in the recent philosophical literature in roughly
the sense that we are assigning to ‘deictic’ (cf. 4.2).
By deixis* is meant the location and identification of persons, objects,
events, processes and activities being talked about, or referred to, in
relation to the spatiotemporal context created and sustained by the act of
utterance and the participation in it, typically, of a single speaker and at
least one addressee.
The grammaticalization and lexicalization of deixis is best understood
in relation to what might be called the canonical situation of utterance:
this involves one-one, or one—many, signalling in the phonic medium
along the vocal-auditory channel, with all the participants present in the
same actual situation able to see one another and to perceive the associ-
ated non-vocal paralinguistic features of their utterances, and each
assuming the role of sender and receiver in turn (cf. 2.2, 3.1, 3.2). There
is much in the structure of languages that can only be explained on the
assumption that they have developed for communication in face-to-face
638 Deixis, space and time
interaction. This is clearly so as far as deixis is concerned. Many utter-
ances which would be readily interpretable in a canonical situation-of-
utterance are subject to various kinds of ambiguity or indeterminacy if
they are produced in a non-canonical situation: if they are written rather
than spoken and dissociated from the prosodic and paralinguistic fea-
tures which would punctuate and modulate them (there are limitations,
as we have seen, upon the principle of medium-transferability: cf. 3.3);
if the participants in the language-event, or the moment of transmission
and the moment of reception, are widely separated in space and time; if
the participants cannot see one another, or cannot each see what the
other can see; and so on. Some of the complications which arise in
language-behaviour by virtue of the spatiotemporal separation of the
participants were mentioned in the previous chapter (14.2).
The canonical situation-of-utterance is egocentric* in the sense that
the speaker, by virtue of being the speaker, casts himself in the role of
ego and relates everything to his viewpoint. He is at the zero-point of the
spatiotemporal co-ordinates of what we will refer to as the deictic con-
text (cf. 14.1). Egocentricity is temporal as well as spatial, since the role
of speaker is being transferred from one participant to the other as the
conversation proceeds, and the participants may move around as they
are conversing: the spatiotemporal zero-point (the here-and-now) 1s
determined by the place of the speaker at the moment of utterance; and
it is this, as we shall see, which controls tense* (cf. 15.4).
The grammatical category of person* depends upon the notion of
participant-roles and upon their grammaticalization in particular lan-
guages. The origin of the traditional terms ‘first person’, ‘second
person’ and ‘third person’ is illuminating in this connexion. The Latin
word ‘persona’ (meaning “‘mask’’) was used to translate the Greek
word for ‘‘dramatic character”’ or “‘role’’, and the use of this term by
grammarians derives from their metaphorical conception of a language-
event as a drama in which the principal role is played by the first person,
the role subsidiary to his by the second person, and all other roles by the
third person. It is important to note, however, that only the speaker and
addressee are actually participating in the drama. The term ‘third
person’ is negatively defined with respect to ‘first person’ and ‘second
person’: it does not correlate with any positive participant role. The
so-called third-person pronouns are quite different in this respect from
the first-person and second-person pronouns.
That there is a fundamental, and ineradicable, difference between
first-person and second-person pronouns, on the one hand, and third-
15.1. Person-detxts 639
person pronouns, on the other, is a point that cannot be emphasized too
strongly. One of the questions that we raised, but did not answer, in our
discussion of reference in a previous chapter was whether personal pro-
nouns are, in principle, dispensable (cf. 7.2). As we shall see presently,
third-person personal pronouns are obviously dispensable in favour of
demonstrative pronouns; and there are many languages that do not have
third-person personal pronouns comparable with the English ‘he’,
‘she’, ‘it’ and ‘they’. There is perhaps no language, however, in which
there are no first-person and second-person pronouns. But is it possible,
or feasible, for a language without first-person and second-person pro-
nouns to operate as a natural semiotic system under essentially the same
conditions as do the actual languages that we are familiar with (cf.
4.4)!
It is clear that first-person and second-person pronouns, as such, are
not essential. Many languages grammaticalize the category of person by
inflecting the main verb. Latin will serve as a familiar example. The
sentence ‘Odi profanum vulgus’ (“I hate the common herd’’) has no
first-person pronoun in it: it is the form odi which indicates (though not
by means of any isolable segment or morpheme) that the speaker would
normally be referring to himself if he were to utter this sentence. Latin
grammaticalizes the category of person by means of morphological
variation in the verb-form only in so far as the subject of the verb is
concerned. There are other languages, however, in which the verb is
inflected for the category of person with respect to both the subject and
the object in the case of transitive verbs and with respect to the subject,
the direct object and the indirect object in the case of verbs with a
higher valency* (cf. 12.4). All these languages, it would appear, also
have first-person and second-person pronouns, which are used in cer-
tain constructions. However, let us admit, for the sake of the argument,
that personal pronouns as such are completely dispensable, provided
that the category of person is grammaticalized morphologically in the
verb-form. Let us also discount, in the present connexion, certain well-
known differences between languages as to the way in which they
grammaticalize the category of person: whether they have a distinction
between an inclusive (‘‘I and you”’) and an exclusive (“I and he/they’’)
first-person plural; whether they have different kinds of second-person
or third-person pronouns; and so on. The question that we are con-
cerned with transcends these differences between languages. What we
are asking is whether it is possible, or feasible, for a language to dispense
completely with the grammatical category of person. For simplicity of
640 Deixis, space and time
exposition, however, we will talk throughout in terms of personal
pronouns. The point to be borne in mind is that the category of person
depends crucially upon the grammaticalization of the participant-roles,
and more especially upon the grammaticalization of the speaker’s
reference to himself as the speaker.
As we have seen, there are in English three grammatically distinct
kinds of singular definite referring expressions: proper names, definite
noun-phrases and pronouns (7.2). Now it is probably true that all lan-
guages have a class of expressions (or provide the means for constructing
and using such a class of expressions) which on semantic grounds can
be described as proper names, though in many languages they cannot be
distinguished, in terms of their internal grammatical structure, from
-noun-phrases constructed according to the productive grammatical
rules of the language. Furthermore, it is intuitively clear that a language
with proper names could dispense with personal pronouns. To see that
this is so, all we have to do is to make minor adjustments to the grammar
of English, so that, in what we will call Quasi-English, someone whose
name is ‘John Smith’ will not say J am hungry, but John Smith be
hungry (it being understood that speakers normally refer to themselves
and to the addressee by name) and the addressee will respond, not with
Are you?, but with Be John Smith? It will be noted that we have put be
rather than are and ts in these Quasi-English utterances in order to
eliminate variation with respect to the category of person from the forms
of the verb. So far, so good. The obvious practical difficulty, of course, -
is that the addressee might not know the name of the speaker. But this
is soluble, in principle, in various ways. If the speaker had reason to
believe that the addressee might not know his name, he could point to
himself (or identify himself paralinguistically in some other way) whilst
making the utterance. Alternatively, he could reply to the addressee’s
enquiry Who be John Smith? by saying Fohn Smith be the person speaking,
provided that it is understood, by an existing convention, that it is in
this way that speakers identify themselves in such circumstances. It is
also intuitively clear that personal pronouns could be dispensed with in
favour of definite descriptions. Provided that the conventions exist and
are understood, John Smith might say The person speaking be hungry.
As we shall see presently, there are certain logical problems attaching
to the analysis of such utterances. But it seems clear that, given the
existence of the appropriate conventions, the expression ‘the person
speaking’ or ‘the speaker’ (or even ‘the person here’) could serve their
purpose of referring to the speaker in terms of his participant-role; and
15.1. Person-detxis 641
‘the listener’, or ‘the addressee’, could equally well replace the second-
person pronoun ‘you’. .
Although philosophers and logicians have generally discussed the
status of personal pronouns in relation to proper names and definite
descriptions, it is more interesting for the linguist to consider this
question from a somewhat different point of view. As we have seen,
there are many languages in which participant-roles are grammaticalized
or lexicalized, at least partly, in terms of social status or social roles (cf.
10.1). Let us, therefore, construct a rather different version of Quasi-
English, in which there are neither proper names nor personal pronouns,
but a special subset of definite descriptions (included in the full set of
definite descriptions existing at present in English) whose application in
referential and vocative function is determined by social status. Since
the principle is unaffected by the number of degrees and dimensions of
status that are lexicalized in a language-system, we will, for simplicity,
admit just one dimension and two degrees: superior and inferior,
lexicalized in the opposition ‘master’: ‘servant’. T'wo points should be
emphasized at the outset: first, that none of the assumptions that we
shall make about the conventions which determine the applicability of
‘master’ and ‘servant’ in Quasi-English is at all unreasonable in the
light of what we know of the operation of actual language-systems in
particular societies; and second, that ‘master’ and ‘servant’ are ordinary
countable nouns, which (like ‘man’, ‘tree’, ‘book’, etc.) may be used
with a determiner in singular definite noun-phrases and without a
determiner as vocative expressions. This version of Quasi-English 1s
identical with ordinary English except that it lacks the grammatical
category of person.
Let us now establish the conventions for the use of ‘master’ and
‘servant’ in vocative and referring expressions. First, it may be assumed
that in most cases of social interaction it will be clear to any arbitrary
pair of participants whether they are of equal social status or not and, if
they are of unequal status, which of them is the superior and which the
inferior. Social superiority may depend upon social role (parents being
superior to their children, teachers to pupils, and so on), sex (women
being superior to men), age (an older person being superior to a younger
person), and various other factors. What the socio-cultural correlates of
status are is of no consequence, provided that they are identifiable; that
this is a plausible assumption is clear from the fact that there are many
languages (e.g., Japanese or Korean) in which status is grammaticalized
in this way. When there is conflict between any two correlates (e.g., when
642 Deixis, space and time
an older man is talking to a much younger woman), and whenever the
participants are in any doubt as to their relative status, this conflict or
doubt will be resolved by their operating with an assumption of social
equality. And the convention which determines the use of vocative and
referring expressions in cases of social equality (a convention which
operates in many languages) is that each will refer to himself as an
inferior, and will address and refer to the other as a superior.
Given these conventions: if John Smith is of superior status, he will
say The master be hungry, and his addressee will respond with Be the
master? (in place of Are you?); if John Smith is of inferior status, he will
say The servant be hungry and his addressee will say Be the servant? ; and
if they are of actual or assumed equal status John Smith will say The
servant be hungry and his addressee Be the master? So too for vocative
expressions: the English utterances It’s raining, Sir[Fohn|my friend will
be translated into Quasi-English as It be raining, master when said by an
inferior or an equal, and as It be raining, servant when said by a superior
to an inferior.
We have now constructed a sociolinguistically plausible language-
system based on English, but lacking personal pronouns. It might be
objected that the noun-phrases ‘the master’ and ‘the servant’ are in-
directly related to participant-roles; and this is true. But it does not
follow from this fact that they are personal pronouns, or even that they
grammaticalize the category of person. Under the assumptions that have
been made, the conditions that determine the reference of ‘the master’
and ‘the servant’ when they refer to the speaker or hearer are no dif-
ferent in kind from the conditions which determine their reference in
context-independent utterances. Nor can we say that ‘master’ and
‘servant’ differ in sense or denotation, according to whether the sen-
tences ‘The master be hungry’ and ‘The servant be hungry’ are
uttered in order to make an assertion about oneself (or one’s addressee)
or about some other person. These sentences, considered as system-
sentences of Quasi-English, are no more ambiguous or indeterminate in
meaning than is the English sentence ‘The master is very kind’, which
a generation or so ago, if not to-day, might have been uttered equally
well by a servant addressing the master of the house or by some other
person with reference to the master of the house.
That Quasi-English is a possible natural language would seem to be
proved by the fact that the correlates of status mentioned above interact
with person in determining the situational appropriacy of personal pro-
nouns and honorific expressions of address and reference in many
15.1. Person-deixis 643
languages; and in certain situations, even in English, honorific expres-
sions can substitute for personal pronouns. All that we have done in
constructing this version of Quasi-English is to generalize this possi-
bility and to simplify the conventions for deciding status.
If the arguments put forward above are valid, they show that it is
possible, in principle, for a natural language to use definite descriptions
instead of personal pronouns; and furthermore, to use definite descrip-
tions which, unlike ‘the speaker’ and ‘the hearer’, do not directly
identify their referents in terms of their participant-roles. This does not
mean that the notion of participant-roles is irrelevant to the interpreta-
tion of utterances in a language of the kind envisaged: clearly they are.
But they are not grammaticalized or lexicalized in the structure of
sentences. Throughout this work, we are concerned to maintain the
distinction between sentence-meaning and utterance-meaning. One
reason for establishing this distinction in the first place derives from the
fact that the same sentence may be uttered to perform various speech-
acts (cf. 16.1). Another reason is the related fact that the utterance or the
context-of-utterance may contain non-linguistic information which
contradicts the information that 1s linguistically encoded in the utter-
ance-signal (cf. 3.1). For example, the meaning of a sentence like ‘John
is a brave man’ is not affected by its being uttered ironically (the irony
being indicated paralinguistically). The same principle applies in the
analysis of the sentences of the version of Quasi-English that we have
just envisaged. Given the conventions which determine the interpreta-
tion of utterances in context, particular text-sentences are translatable
from Quasi-English into English, and conversely, in much the same way
as particular text-sentences are translatable from any one actual language
into another. Translation between any two languages always operates, in
principle, with respect to contextualized utterances; and the fact that the
Quasi-English The master be hungry is translatable into English some-
times as J am hungry, sometimes as You are hungry and sometimes as The
master is hungry gives us no grounds whatsoever for saying that the
Quasi-English system-sentence ‘The master be hungry’ is ambiguous.
Comparison of a language like Quasi-English with such actual lan-
guages as English or French (or indeed any of the actual languages that
have been studied and described by linguists) brings out clearly the
distinctive character of person-deixis. It is tempting for logicians, and
linguists making use of formal logic in the analysis of natural languages,
to begin by attempting to eliminate from their representation of the
meaning of the sentences of particular languages all the deictic features
644 Deixis, space and time
which make the truth-value of the propositions expressed by those sen-
tences dependent upon the context-of-utterance. We have already noted
that this involves the elimination of tense from the formal representa-
tion of the structure of propositions (cf. 14.1); and we will return to this
question later. Let us concentrate here upon the elimination of person-
deixis.
Suppose John Smith says J am hungry and that he does so, in what
may be described loosely as normal conditions, in order to make a de-
scriptive statement about himself. Has he expressed the same proposition
as some other person who says, at more or less the same time and again
in normal conditions, fohn Smith is hungry? The answer to this question
turns, in part, upon the way in which we choose to define the term
‘proposition’. It is easy to envisage circumstances under which it is
reasonable to reformulate the propositional content of an utterance like
I am hungry in terms of the propositional content of an utterance like
John Smith 1s hungry: we frequently do this when we report what others
have told us. Having heard John Smith say I am hungry, we might very
well say to someone else fohn Smith is hungry and, if asked to justify this
assertion, we might say He told me so or He said that he was. But this
process of reformulation depends upon our ability to interpret the
original utterance in the light of our knowledge of the identity of the
speaker; and we cannot in general eliminate the deictic features of an
utterance-token without adding or removing information in the process
of conversion. This will become clearer in our discussion of illocu-
tionary force* and subjective modality*, to which the notion of speaker-
involvement is central (cf. 16.1, 17.2).
It may be noted at this point, however, that, although there are cogent
reasons for saying that, if John Smith says (or believes) that he is hungry
and someone else says (or believes) that John Smith is hungry, both
John Smith and the other person have said (or believe) the same thing,
there are equally cogent reasons for denying that this is so. It is arguable
that the beliefs that we have about ourselves and the propositions that
we express about ourselves are necessarily different from the beliefs that
others have about us or the propositions that they express about us. The
philosophical problems attaching to the notion of self-knowledge need
not concern us. We shall see later, however, that, as far as their semantic
interpretation is concerned, there is much in common between first-
person pronouns and reflexive pronouns. For example,
15 T am assuming that (8) is ambiguous: i.e. that the second clause means either
‘‘Harry (also) nominated John”’ or ‘‘ Harry (also) nominated himself’’. There
may be some disagreement as to whether it can sustain the former inter-
pretation.
15.3. Detxts, anaphora and the universe-of-discourse 667
tinguishing between such sentences as (9) and (10); and Castafieda
(1968) has drawn attention to the fact that there is a close connexion
between the third-person reflexive pronoun, as it 1s employed in propo-
sitions that attribute self-knowledge to others, and the first-person
role of speaker. ,
pronoun which the speaker uses to refer to himself in his participant-
16 Cf. Bach (1970), Dik (1973), Dougherty (1969), Fauconnier (1974), Jacken-
doff (1972), Kuno (1972a), Langacker (1969), Lees & Klima (1963), McCawley
(1969), Partee (1970, 1975a), Postal (1971), Ross (1969b).
17 The raised arrows are token-quotes indicating token-reflexivity in Reichen-
bach’s (1947) sense (cf. 1.4).
668 Deixis, space and time
sight it might appear to be. It is not co-referential with any antecedent
expression; it refers to, but is not co-referential with, a preceding lin-
guistic form. Textual deixis is frequently confused with anaphora, by
virtue of the traditional formulation of the notion of pronominal
reference (according to which, as we have seen, a pronoun is said to
refer to its antecedent) and the common failure to distinguish clearly
between linguistic and non-linguistic entities. There is no need to give
further examples of this kind of textual deixis: the text of the present
book is full of them. It should be noted however that the sense in which
forms and text-sentence occur in the co-text is different from the sense
in which lexemes or expressions occur in the co-text.
At one remove from what might be called pure textual deixis, though
not as clearly distinct from it as anaphora, is the relationship which
holds between a referring expression and a variety of third-order entities,
such as facts, propositions and utterance-acts (in the more abstract sense
of ‘utterance-act’ noted in 1.6). This may be exemplified by means of
the following text: (X says) I’ve never even seen him (and Y responds)
That’s a lie. It is clear that ‘that’ does not refer either to the text-sen-
tence uttered by X or to the referent of any expression in it. Some
philosophers might say that it refers to the proposition expressed by the
sentence uttered by X; others, that it refers to the utterance-act, or
speech-act (cf. 16.1), performed by X. However, under either of these
analyses of the reference of ‘that’, its function seems to fall somewhere
between anaphora and deixis and to partake of the characteristics of
both. Let us say that its function is that of impure textual deixis. It is not
always easy to draw the distinction between pure and impure textual
deixis in particular instances.
‘This’ and ‘that’, in English, may be used deictically to refer not only
to objects and persons in the situation and to linguistic entities of
various kinds in the text or co-text, but also to refer to events that have
already taken place, are taking place or are going to take place in the
future. The conditions which govern the selection of ‘this’ and ‘that’
with reference to events immediately preceding and immediately fol-
lowing the utterance, or the part of the utterance in which ‘this’ and
‘that’ occur, are quite complex. They include a number of subjective
factors (such as the speaker’s dissociation of himself from the event he is
referring to), which are intuitively relatable to the deictic notion of
proximity/non-proximity, but are difficult to specify precisely. What
does seem clear, however, is that the use of the demonstratives in both
temporal and textual deixis, and also in anaphora, is connected with
15.3. Detxts, anaphora and the universe-of-discourse 669
their use in spatial deixis. ‘This is more obviously so in many languages
other than English. For example, in Latin the distal demonstrative ‘ille’
(“that’’) is used anaphorically to refer to the referent of the more
remote of two possible antecedents and the proximal demonstrative ‘hic’ |
(“‘this’’) to refer to the referent of the nearer of two possible antecedents;
and they can frequently be translated (into somewhat stilted English) as
‘the former’ and ‘the latter’, respectively. The same is true of the
German ‘jener’: ‘dieser’, the Spanish ‘ése’ (‘aquel’):‘éste’, the French
‘celui-la’:‘celui-ci’, the Turkish ‘o’:‘bu’, and so on. It is the notion of
relative proximity in the co-text to the moment of utterance that con-
nects anaphora and textual deixis with temporal reference; and it is the
more general principle of localization* (cf. 15.7) that relates temporal
reference, in many languages at least, to the more basic notion of spatial
deixis.
As we saw in the previous section distinctions of proximity are
lexicalized or grammaticalized in the pronoun-systems of many lan-
guages; and they are commonly combined with other distinctions, based
on status, sex, size, shape, etc. In so far as they are used deictically, it is
the function of pronouns to draw the attention of the addressee to
referents in the situation, identifying these referents for the addressee in
terms of their position relative to the zero-point of the deictic space, on
the one hand, and of their status, sex, size, shape, etc., on the other.
What now concerns us 1s the way in which the basically deictic distinc-
tion of proximity operates in anaphora. A simple example will serve.
In English, as we have seen, the third-person pronouns are neutral
with respect to proximity, but distinguished in terms of gender. Turkish,
by contrast, has three demonstratives distinguished in terms of deictic
proximity, but neutral with respect to gender. Latin also has three
demonstratives, distinguished in terms both of proximity and of gender
(and number). Both Turkish and Latin, as was pointed out above, make
use of their proximal and distal demonstratives for anaphoric reference.
The effect of these differences in the descriptive content of pronouns is
readily seen if we consider the way in which we might translate the
pronouns in a short text in English into Latin and Turkish respectively:
John and Mary came into the room: hé (i.e. ‘the male-one’’) was laughing,
but shé (“the female one’’) was crying. The Latin version might translate
‘he’ with ‘ille’ (‘‘that-male-one’’) and ‘she’ with ‘haec’ (“‘this-female-
one”’); in Turkish ‘he’ might be translated with ‘o’ (“‘that-one’’) and
‘she’ with ‘bu’ (‘‘this-one’’), Suppose now that we reverse the order of
the conjoined nominals ‘John’ and ‘Mary’ in the English text. This has
670 Detxts, space and time
no effect upon the choice of pronouns in English: Mary and Fohn came
into the room: shé was crying, but hé was laughing. In both Latin and
Turkish, however, if we reverse the order of the antecedents, by virtue
of the lexicalization of deictic proximity in the anaphoric demonstra-
tives, we will now translate ‘he’ as ‘hic’ (“‘this-male-one’’) and ‘bu’
(‘‘this-one’’), and ‘she’ as ‘illa’ (“that-female-one’’) and ‘o’ (“‘that-
one’’).
The example that has just been given is very simple. But it does
illustrate clearly the way in which deictic distinctions can be used to
identify the antecedents of anaphoric expressions. Anaphora involves
the transference of what are basically spatial notions to the temporal
dimension of the context-of-utterance and the reinterpretation of deictic
location in terms of what may be called location in the universe-of-
discourse*. ‘The notion of previous mention, which is commonly invoked
in discussions of anaphora, depends upon the temporal relation which
holds (in a spoken text) between the anaphoric expression and its ante-
cedent. The basically deictic component in an anaphoric expression
directs the attention of the addressee to a certain part of the text or
co-text and tells him, as it were, that he will find the referent there. It is
not of course the referent itself that is in the text or co-text. The
referent is in the universe-of-discourse, which is created by the text and
has a temporal structure imposed upon it by the text; and this temporal
structure is subject to continuous modification. To say that the referent
has a textual location implies, then, that it will be found in a certain part
of the universe-of-discourse as this is structured, temporally, by the
text; and subsequent reference to this referent by means of an anaphoric
expression will identify the referent in terms of the textual location of
the antecedent. Let us suppose, for the purpose of illustration, that the
English demonstratives ‘this’ and ‘that’, in anaphoric expressions, do
no more than simply encode the distinction of temporal proximity in
relation to the moment of utterance. ‘This animal’, used as an anaphoric
expression, will direct the attention of the addressee to the most proxi-
mate referent in the universe-of-discourse satisfying the sense of
‘animal’; ‘that animal’ will refer to a textually more remote referent;
and ‘the animal’ will refer to some animal which has a textual location,
in the sense explained, but will give the addressee no information about
its textual location. No such information will be required of course, if
it is the only animal that has been previously mentioned; and no in-
formation will be required, if there is a generally accepted convention
that, in default of any specific information as to the textual location of
15.3. Detxis, anaphora and the universe-of-discourse 671
the referent, it is taken to be the most recently mentioned entity that is
being referred to.
Things are not quite as simple as this illustration might suggest. The
anaphoric use of ‘this’ and ‘that’ in English involves other considera-
tions besides the relative proximity of the antecedent to the moment of
utterance; and ‘this’ ws. ‘that’ cannot be used, as the Latin ‘hic’ vs.
‘ille’ and the Turkish ‘bu’ ws. ‘o’ can, to mean ‘‘the latter”’ vs. ‘“‘the
former’’. One cannot say John and Mary came into the room: that person
was laughing, but this person was crying, to mean “John and Mary came
into the room: John was laughing, but Mary was crying’’. It is not
being maintained, however, that the anaphoric use of demonstratives, in
English or in any other language, is totally predictable from their deictic
use. The important point is that, independently of whether particular
languages make anaphoric use of demonstratives or not, what logicians
commonly refer to as the universe-of-discourse (or point-of-reference* :
cf. 6.6) is not simply an unstructured set of potential referents, each of
which is equally accessible throughout a text or conversation. Some of
the potential referents are more salient than others; and saliency is in
part determined by recency of mention. In so far as recency of mention
is itself a deictically based notion and is encoded, in one way or another,
in the anaphoric pronouns used 1n particular languages, anaphora rests
ultimately upon deixis.
However, it requires but little reflexion to see that the potential
referents in the universe-of-discourse cannot be indexed solely, or even
primarily, in terms of recency and relative order of previous mention.
The limitations of human memory are such that, without having imme-
diate access to a transcript of all that has been said previously (or,
alternatively, to some continually updated computer-file), we could not
operate with a system of anaphoric reference which employed expres-
sions meaning, for example, “the twelfth most recently mentioned
entity” or ‘‘the twelfth entity mentioned in the present text’’. The tem-
poral structure imposed upon the universe-of-discourse by the succession
of referring expressions in texts 1s, therefore, of very limited duration;
and the anaphoric use of the basically deictic distinction of proximity
to the zero-point of the context-of-utterance is determined by this fact.
Furthermore, salience in the universe-of-discourse is not simply a
matter of recency of previous mention. Indeed, there need not have been
any previous mention. As Isard (1975) points out, if a child reaches
towards the lion’s cage in order to pat what he takes to be a friendly big
cat, the zoo-keeper can say
672 Deixis, space and time
(11) Be careful, he might bite you,
without there having been any previous reference to the lion. In this
case, the lion is present in the context-of-utterance; and, although the
form he would probably be unstressed, the reference of ‘he’ might well
be described as deictic by virtue of the almost inevitable paralinguistic
accompaniment of eye-gaze and gesture. Other examples can be pro-
duced, however, which show that a potential referent is salient in the
universe-of-discourse, even though it is not present in the situation-of-
utterance and has not been mentioned previously by either the speaker
or the addressee. For example, I might offer my condolences to a friend,
whose wife has just been killed in a car-crash, by saying
(12) I was terribly upset to hear the news: I only saw her last week.
Naturally enough in these circumstances, there is no need for me to
specify what news I am referring to or who the referent of ‘she’ is.
Examples like (12) show us very clearly that entities need not have been
mentioned previously in order for them to be salient in the universe-of-
discourse. If the notion of anaphora is so defined that it presupposes the
occurrence of a correlated antecedent expression in the text or co-text,
then ‘she’ is obviously not anaphoric in (12). And yet its function in
(12) appears to be no different from the function it has in
(13) I know Mrs Smith very well: I only saw her last week.
In both (12) and (13) ‘she’ refers to the currently most salient person;
and, since we know or believe that the person we are referring to is a
woman, we are obliged by the grammatical and lexical structure of
English to use ‘she’, rather than ‘he’ or ‘it’.
Many scholars, including Buhler (1934), would say that the reference
| of ‘she’ in (12) is deictic, rather than anaphoric, on the grounds that it
involves pointing to something in the intersubjective experience or
common memory of speaker and addressee, rather than to something in
the external situational context (cf. Crymes, 1968: 62-3). It is obvious,
however, that the notion of intersubjective experience, or common
memory, is the more general notion, without which anaphoric reference,
as it is traditionally conceived, cannot be explained. Such writers as
Kristeva (1969) and Barthes (1970) have insisted that what is commonly
referred to as intersubjectivity should be more properly described as
intertextuality*, in that the shared knowledge that is applied to the
interpretation of text is itself the product of other texts (cf. Ducrot &
15.3. Deixis, anaphora and the universe-of-discourse 673
Todorov, 1972: 446; Culler, 1975: 139). Up to a point this is true; and
especially in so far as literary texts are concerned. But not all of the
intersubjective knowledge that is exploited in the interpretation of texts
derives from what has been previously mentioned; and, in the last
resort, there would seem to be no reason to deny that the reference of
‘she’ in (12) is anaphoric.
Both deixis and anaphora are far more complex than the somewhat
schematic account of them given here might suggest. What has been said
will be sufficient, it is hoped, to justify the assertion that deixis is more
basic than anaphora. Anaphora presupposes that the referent should
already have its place in the universe-of-discourse. Deixis does not;
indeed deixis is one of the principal means open to us of putting entities
into the universe-of-discourse so that we can refer to them subsequently
(cf. Isard, 1975).?8 ,
There is much in the more recent work on pronouns that we have
deliberately left on one side in our treatment of deixis and anaphora and
will do no more than mention here. The standard approach to the
analysis of pronominal reference by logicians is to treat pronouns as the
natural-language correlates of the variables that might be used instead
of constants in the well-formed formulae of the predicate calculus or of
some other logical calculus (cf. 6.3). It has been pointed out by Partee
(1975a) that although the pronouns-as-variables* analysis works well
with certain sentences, there are others for which it is far from appro-
priate. For example,
(14) No-one drives when he is drunk
expresses a proposition which (if we neglect the difference between
‘no-one’ and ‘nothing’, on the one hand, and between ‘he’/‘she’ and
‘it’, on the other) may be represented, loosely, as :
(14’) “No x drives when x is drunk”’.
This illustrates the pronouns-as-variables analysis; and in a more formal
representation of the structure of (14’) the variable x would be shown as
being bound* by the universal quantifier in both positions of occurrence
(cf. 6.3).
example: ,
To be contrasted with (14) are such sentences as Karttunen’s (1969)
18 Strictly speaking, it is not the entities themselves that are put into the
universe-of-discourse, but their intensional correlates (cf. Lyons, 1977).
674 _ Detxis, space and time
(15) The man who gave his paycheck to his wife was wiser than the
man who gave it to his mistress.
Under the interpretation of (15) that concerns us here, ‘it’ is not co-
referential with ‘his paycheck’; and there will have been no previous
mention of the entity to which ‘it’ refers. For the analysis of (15),
Partee (1975a) proposes a treatment which, following Geach (1962), she
calls a pronouns-of-laziness* treatment. It is characteristic of pronouns-
of-laziness that they can be substituted for expressions that are identical,
but not necessarily co-referential, with antecedent expressions. In this
respect, therefore, the pronouns-of-laziness analysis is the one that is
formalized by the earliest version of pronominalization in Chomskyan
transformational grammar: ‘it’ would be substituted for ‘his paycheck’,
by means of an optional transformational rule, at some stage in the
syntactic derivation of (15); and the question of co-reference would
simply not arise. Failure to apply this rule of pronominalization would
result in the derivation of
(16) The man who gave his paycheck to his wife was wiser than the
man who gave his paycheck to his mistress,
which, under the interpretation that concerns us here, is equivalent to
(15). The use of a pronoun-of-laziness may be seen as a purely stylistic
or rhetorical device, which enables the speaker or writer to avoid repeti-
tion of the antecedent.
We will not go further into the pronouns-as-variables analysis or the
pronouns-of-laziness analysis. All that needs to be said here is that
neither of them appears to be capable of handling everything that the
other can handle, although there are many sentences whose meaning can
be accounted for equally well by either. This point has been well argued
by Partee (1970, 1975a). But it must also be emphasized that neither the
pronouns-as-variables analysis nor the pronouns-of-laziness analysis is
particularly successful in handling either the deictic reference of pro-
nouns or their anaphoric reference to entities that have not been pre-
viously mentioned; and many of the sentences whose meaning can be
accounted for by the pronouns-as-variables analysis or the pronouns-of-
laziness analysis can also be handled in terms of an analysis which takes
deixis to be basic. Granted that no uniform treatment of the relation
between pronouns and their antecedents is possible, it is arguable that
the use of pronouns as deictics is more basic than their use as variables
or their use as pronouns-of-laziness.
15.3. Deixis, anaphora and the universe-of-discourse 675
It may be added that the pronouns-as-deictics analysis can be
naturally extended to handle such sentences as
(17) John wants to catch a fish and to eat it for supper,
which have also been much discussed recently. The problem here is that
‘it’ is used in the second clause as an expression with singular definite
reference, whereas (under the relevant interpretation) ‘a fish’ in the
first clause does not refer to any specific individual and carries no
presupposition of existence. And yet the notion of co-referentiality
appears to be relevant to the interpretation of (17) in a way in which it
was not relevant to the interpretation of either (14) or (15). What seems
to be involved here is the treatment of some hypothetical entity as if it
were an actual entity. As we have already seen, there can be anaphoric
reference to entities that have not been previously mentioned, provided
that they are in the universe-of-discourse; and hypothetical entities are
treated exactly like actual entities in this respect. Even though no
previous reference has been made to some particular fish, the hypotheti-
cal entity whose existence and whose being caught by John are pre-
conditions of the actualization of the possible world-state described by
the proposition ‘‘John wants to eat the fish that he catches’’ enters into
the universe-of-discourse as a potential referent no less readily than does
some actual entity that has been introduced by means of detxis or the
use of a non-deictic referring expression.
Throughout this section we have concentrated upon pro-nominals. It
is arguable, however, that anaphora involving other kinds of forms and
expressions (with the exception of pro-locatives, to which we will come
presently) cannot be accounted for in the way that we have accounted
for the anaphoric reference of pro-nominals. For there are many other
sets of forms and expressions (not all of them classifiable as pronouns)
whose function is best accounted for in terms of the notion of gram-
matical substitution: e.g., the pro-noun one in such sentences as ‘I want
the red scarf not the blue one’; the pro-verbal ‘do’ in such sentences as
‘I will go to the party if you do’; etc.
Mention should also be made at this point of the so-called classifiers*,
whose pronominal function was referred to in an earlier chapter (cf.
11.4). Many theoretical discussions of the nature and use of pronouns
fail to draw attention to the fact that in many languages classifiers are
used anaphorically as pro-nouns in much the same way as the form one
is employed in English.
That the use of pro-nouns, pro-verbals, etc., cannot be explained in
676 Deixis, space and time
terms of co-reference, or in terms of their reference to entities in the
universe-of-discourse, is evident from the fact that they are not referring
expressions at all. Their function is closer to, if not identical with, that
of the so-called pronouns-of-laziness mentioned above. There is, how-
ever, one other set of expressions whose function is very similar to what
we have described as the basic function of pro-nominals: such adverbs
of place and time as ‘here’ vs. ‘there’ and ‘now’ vs. ‘then’. Because of
their anaphoric function, they have frequently been classified as pro-
nominal adverbs: they are more suitably described as pro-locatives* and
pro-temporals*.
As we saw in the previous section, there is a very close connexion
between the deictic function of the demonstratives ‘this’ and ‘that’,
and the locative adverbs ‘here’ and ‘there’. The same connexion is to
be noted as far as the pro-locative, anaphoric function of ‘here’ and
‘there’ is concerned: cf.
(18) I was born in London and I have lived here/there all my life
and
(19) I was born in London and this/that is where I have lived all my
life.
In both (18) and (19) the use of the pro-locative ‘here’ vs. ‘there’ and
the pro-nominal ‘this’ vs. ‘that’ is simultaneously deictic and anaphoric,
since the selection of one expression rather than the other is determined,
under normal conditions of utterance, by whether the speaker is in
London or not at the time. Similarly, in both
(20) You mustn’t come at six: that’s when John is coming
and
SELF’’). }
‘above’, ‘below’, etc. “X is above’, when used deictically, would be
understood to mean ‘‘X is outside and upwards of here”’ (or, equiva-
lently, as “X is outside and upwards of the space containing
The second point to be stressed is that many, if not all, of the locative
expressions involving dimensionality and orientation can be seen as
being implicitly, or covertly, directional: i.e. as being dynamic, rather
than static. It has been argued that, in order to account for the meaning
of such overtly directional sentences as
(14) Gwyneth walked through the kitchen,
we have to recognize expressions which are used to refer to the path*
(or route) that is taken on the journey from a source to a goal (cf.
700 Detxis, space and time
Bennett, 1975: 18ff). Once the notion of a path is recognized in such
obviously directional expressions as ‘through the kitchen’ in (14), it
can also be identified in the analysis of what appears to be locative
expression:
(15) Gwyneth is through the kitchen.
The place referred to is here identified by means of the route that must
be taken in order to get there. It is in this sense that a locative expression
may be implicitly, or covertly, dynamic.
This point is more significant than it might seem to be at first sight.
It has already been noted that the asymmetrical and dynamic construc-
tion ‘How far is it from X to Y?” can be seen as more basic than the
more symmetrical and static construction ‘‘What is the distance
between X and Y?” Distance is always measured from a point-of-
reference* (to use this term in a sense that is related to, though some-
what different from, the sense in which it is employed in model-
theoretic semantics: cf. 6.6). To say that X is three miles away from Y
implies that one must travel three miles from the point-of-reference, Y,
in order to arrive at X. If there is no other point-of-reference, explicit or
implicit, in the context of an utterance like
(16) The church ts three miles away,
it is generally the location of the speaker that is taken to be the starting-
point of the actual or imaginary journey. Locative expressions involving
orientation with respect to “‘up”’ vs. ““down”’ and “front” vs. “‘back”’
are similar to expressions like ‘three miles away’ in that there is always
a point-of-reference explicitly or implicitly referred to in the utterances
containing them. It is always possible to give a dynamic interpretation
in terms of a journey from the point-of-reference.
Furthermore, visual perception can be seen in terms of the metaphor
of travel: one looks into the distance at, or towards, an object; one’s
gaze travels to and reaches, or grasps, the object. The ambiguity of (12),
“The church is behind the town-hall’, which we accounted for in terms
of confrontation, has been explained in terms of the difference between
(i) ‘The church is on the other side of the town-hall from where I am
looking”’ and (ii) “The church is on the other side of the town-hall from
the road”’, with (i) involving a deictic point-of-observation (“seen from
here’’) and (ii) a non-deictic point-of-orientation (cf. Leech, 1969: 167,
182). If perception is interpreted as making a kind of visual journey from
a point-of-reference, as source, to the object of perception, as goal, the
15.5. Spatial expressions 701
fact that a point-of-observation and a point-of-orientation, if explicitly
mentioned, are referred to by means of a source-expression (‘from
here’, ‘from the road’, etc.) is automatically explained. So too is the
fact that many locative expressions can be seen as covertly, or implicitly,
dynamic. What we may now refer to as the action-schema of the journey
involving a source, a goal and, optionally, a path is of very general
applicability in the semantic and grammatical analysis of languages (cf.
12.6, 15.7).
The shape, dimensionality and orientation of entities (and spaces) is
crucial in the analysis of the meaning of such positional and qualitative
adjectives in English as ‘long’:‘short’, ‘far’:‘near’, ‘high’:‘low’,
‘deep’: ‘shallow’, ‘wide’:‘narrow’, and ‘thick’:‘thin’. Positional
adjectives can always be regarded, from a semantic point of view at least,
as being transformationally derivable from predicative locative struc-
tures: for example, a distant building is one that is relatively far from
some point-of-reference. A long building, on the other hand, is one that
is extended in a particular spatial dimension; and its length can be
regarded as one of its physical properties independent of its location in
space. Since ‘high’ and ‘low’, unlike most of the antonymous adjectives
with which we are concerned, can be used in either kind of construction,
phrases such as ‘a high window’ are, in certain contexts, ambiguous:
(i) “a window that is relatively far up (from a point-of-reference)”’ vs.
(ii) ‘a window that is significantly extended in the vertical dimension”’.
The distinction between the positional and the qualitative sense of an
adjective like ‘high’ rests upon the distinction between distance and
extension. But these notions, as we have seen, are interrelatable by
virtue of their correlation in terms of measurement: if a street is a
hundred yards long, measured from one extremity to the other, the two
extremities are a hundred yards apart. There is also an obvious percep-
tual correlation between extension and distance; and in so far as per-
ception can be analysed in terms of the notion of a visual journey, the :
static qualitative notion of extension can be based upon the more dyna-
mic notion of distance.
In the discussion of the factors which control the selection of the
correct qualitative adjectives of extension, it is convenient to neglect,
initially, the question of orientation (inherent, canonical and actual).
This immediately eliminates ‘high’ (and ‘tall’) vs. ‘low’; but not, as we
shall see, ‘deep’ vs. ‘shallow’.
The first question to be asked in relation to three-dimensional
physical objects is whether they have a maximal dimension (1.e. whether
702 Deixis, space and time
their extension is significantly greater in any one dimension than it 1s in
the other two). If an object has no maximal extension, it cannot be said
to have length. We do not say of a tennis ball that it is longer than a golf
ball, and we would not normally say that it is wider, broader, thicker or
deeper. Instead, we use the most general qualitative adjectives of spatial
extension, which denote overall size regardless of shape and dimen-
sionality: ‘big’ vs. ‘little’, ‘large’ vs. ‘small’.
If an object has a maximal dimension, this is identified as its length;
and the question of overall shape becomes criterial for the labelling of
the non-maximal dimensions. If the extension of the object in the other
two dimensions is negligible in relation to its length, we then collapse
these two dimensions, as it were, in the single dimension of thickness:
we talk, for example, of a long thick pole. If the object is significantly
extended in one of the other dimensions, then the opposition of ‘wide’
vs. ‘narrow’ comes into play, but only if one of the two non-maximal
extensions is significantly greater than the other; and it is this dimension
to which ‘wide’ vs. ‘narrow’ applies. The third dimension, if identified
and described, is subject to a further, and in part independent, criterion:
whether the object is regarded as hollow or not (i.e. whether it has an
interior space). If it is hollow, it may be described as being deep or
shallow; if solid, as being thick or thin. In summary, then, we can say
that any unoriented three-dimensional object whose dimensions are
identifiable and significantly different in extension will have length as its
first dimension, width as its second dimension and either thickness or
depth as its third dimension.
For two-dimensional spaces (or figures) the same general considera-
tions of shape and maximality apply to determine the first dimension as
being that of length. In labelling the second non-maximal dimension,
| however, we choose between width and thickness, according to whether
the non-maximal dimension is treated as being significant or not in its
extension. We can describe a line as long and thin (as we can so describe
a pole or stick), but not as long and narrow. To say that something is a
line is to say that it is essentially unidimensional, approximating to the
ideal geometrical line. In contrast, we would describe a street as long and
narrow, rather than as long and thin: the non-maximal dimension of the
street is, of necessity, significant in its extension.
So far we have been dealing with unoriented entities and spaces. As
soon as we take orientation into account, we must concede primacy to
the vertical dimension. The general principle is that verticality (whether
inherent, canonical or actual) is dominant over maximality. Height (and
15.6, Aspect 703
tallness) are measured upwards from the point-of-reference, which is
commonly, though not necessarily, ground-level. Depth 1s measured
downwards from the point of reference: but it applies only to spaces or
to hollow entities; and this correlates with the use of ‘deep’ vs. ‘shallow’
for the third dimension of unoriented hollow entities.
The vertical dimension is not necessarily the maximal dimension.
Whether it is or is not, orientation is again dominant in the assignment
of length, width and depth or thickness to the non-vertical dimensions.
If the entity has an inherent or canonical front, it has width (from side
to side) and depth or thickness (from front to back); if it has no inherent
or canonical front, it has length (from end to end) and width (from side
to side). The same rectangular building may be described in terms of
width and depth, according to whether it is treated as having a canonical
front or not; and a dual-purpose piece of furniture may be described as
long and wide, when it is being used or considered as a table, but as wide
and deep when it is thought of as a desk with a canonical orientation in
the horizontal plane. The important point to note is that it is contradic-
tory to say that the width of an object exceeds its length: we simply do
not talk about its length if, by virtue of its canonical orientation, we have
assigned width to its greater horizontal extension.
This very brief account of the meaning of qualitative and positional
spatial adjectives in English is incomplete; and it may be inaccurate in
certain details. But it will serve to show how intimate is the connexion
between the lexicalization of distinctions of shape and the lexicalization
of dimensionality and orientation in English.
There is no reason to believe that English is in any way untypical in
this respect. German (cf. Bierwisch, 1967) and French (cf. Greimas,
1966: 35) are very similar, as are the other Indo-European languages. As
for shape, and more especially dimensionality, it should also be men-
tioned that there are many languages in which this appears to be one of
the principal factors determining the selection of classifiers (cf. 11.4). It
is frequently the case that one classifier is used for saliently one-
dimensional objects, another for saliently two-dimensional objects and
a third for saliently three-dimensional objects; and there are other
places in the structure of language in which these distinctions are
grammaticalized or lexicalized (cf. Friedrich, 1970).
15.6. Aspect
That the notion of aspect*, if not the term ‘aspect’ itself, is less familiar
to non-linguists than are the notions of tense and mood is largely a
704 Deixts, space and time
matter of historical accident. The Stoics had realized that, in addition
to precedence and successivity, there was another factor involved in the
determination of what Aristotle, and subsequently the Alexandrians,
referred to as tense (i.e. “time’’); and they identified this as being what
we would now describe as an aspectual distinction of completeness vs.
incompleteness (cf. Robins, 1967: 29). Unfortunately, in the later
development of the Greco-Roman grammatical tradition, which has in-
fluenced, and in many respects distorted, the grammatical analysis of the
majority of the world’s better-known languages and the way they are
taught in our schools and universities, the terms ‘perfect’ and ‘im-
perfect’ (which derive from the Latin translations of ‘complete’ and
‘incomplete’) came to be used in collocation with ‘tense’. Furthermore,
the definitions of the so-called present, perfect, imperfect and plu-
perfect tenses, not only in Greek and Latin, but also in other languages,
tend to obscure the difference between past vs. present vs. future, on the
one hand, and perfect vs. imperfect, on the other.
What is traditionally referred to as the present tense of Latin or
Greek is more appropriately described as being the present imperfect:
it is present (or, better still, both non-past and non-future) in tense and
imperfect in aspect. The so-called perfect and imperfect tenses contrast
with the so-called present tense in being present perfect and past im-
perfect, respectively; and the misleadingly named pluperfect (‘further
[in the past] than the perfect’’) 1s the past perfect. The morphological
structure of both Greek and Latin supports this two-dimensional
classification. So too, as far as Greek is concerned, does syntactic and
semantic analysis. In Latin, however, the present perfect forms were
also used in contexts in which, in the historical, as distinct from the
experiential, mode of description, Greek would use the so-called aorist
(egrapse), literary French would use the so-called past definite (2/ écrivit),
Russian (under certain rather more specific conditions) would use the
so-called past perfective (on naptsal), and so on. The fact that the
so-called perfect tense of Latin (scripsit) has this double function — in
which respect it is quite different from the so-called perfect tense of
English (he has written), literary French (il a écrit) or Standard German
(er hat geschrieben), but similar to the similarly named forms of collo-
quial French and some southern dialects of German — is responsible, no
doubt, for much of the confusion surrounding the terms ‘perfect’ and
‘perfective’ in linguistics (cf. Comrie, 1976: 16ff).
There can be no question of entering here upon a full-scale treatment
of aspect. Our purpose is merely to introduce the reader to some of the
15.6. Aspect 705
aspectual distinctions that are grammaticalized in languages and to
emphasize the importance of these distinctions in the construction of a
general theory of the structure of language. It is, as we have said, largely
a matter of historical accident that the notion of aspect does not figure
as prominently in traditional grammar as does the notion of tense. Aspect
is, in fact, far more commonly to be found throughout the languages of
the world than tense is: there are many languages that do not have tense,
but very few, if any, that do not have aspect. Furthermore, it has been
argued recently that aspect is ontogenetically more basic than tense, in
that children whose native language has both, come to master the former
more quickly than they do the latter (cf. Ferreiro, 1971). We will not go
into this issue. But we would re-emphasize the fact that aspect has
frequently been confused with tense in the standard treatments of
particular languages; and that this has had its effect upon the more
theoretical discussions of temporal relations in language by linguists,
philosophers and psychologists. The main difference between tense and
aspect, as we have already seen, is that, whereas tense is a deictic cate-
gory, which involves an explicit or implicit reference to the time of
utterance, aspect is non-deictic (cf. 15.4). It has also been pointed out,
however, that the distinction between tense and aspect is hard to draw
with respect to what is sometimes described as relative, or secondary,
tense. We will say no more about this.
The term ‘aspect’ is currently used by linguists as the rather un-
satisfactory, but conventionally accepted, translational equivalent of the
| term that is employed in Russian (‘vid’) to refer to the opposition of
perfective and imperfective in the Slavonic languages. Usually, though
not invariably, it is extended to cover a variety of other oppositions, in
so far as they are grammaticalized in the structure of particular lan-
guages — oppositions based upon the notions of duration, instantaneity,
frequency, initiation, completion, etc. We have tacitly adopted this
relatively broad conception of aspect, according to which the opposition
| between the progressive and the non-progressive forms in English (cf.
he 1s writing vs. he writes), the opposition between the simple past and
the imperfect in literary French (il écrivit vs. il écrivatt), the opposition
between the progressive and the aorist forms in Turkish (okuyor, ‘“‘he
is writing’’, vs. okur, ““he writes regularly/habitually”’) and comparable
grammaticalized oppositions in other languages, are all legitimately
classified as aspectual.
A distinction is sometimes drawn between aspect and Aktionsart*.
The specialized employment of the German term ‘Aktionsart’ (which,
706 Deixis, space and time
in origin, meant nothing more than “kind of action’’) rests upon one or
other of two more general distinctions: (i) the distinction between
grammaticalization and lexicalization; and (11) the distinction, within
morphology, between inflexion and derivation. The fact that neither of |
these two distinctions is itself clearcut, coupled with the further fact
that, in so far as they are partially, but not wholly, coincident, some
scholars operate with the one and some scholars with the other, has been
responsible for a good deal of confusion in the use of the term “‘Aktion-
sart’ (cf. Comrie, 1976: 6). Partly for this reason and partly because
‘Aktionsart’ is in itself a very unsatisfactory term, in that (a) it is more
naturally applied to the denotata of verbs, rather than to some semantic
property of the verbs themselves, and (b) the term ‘action’ (traditional
though it is in this sense) is too narrow, we will make no further use of
the term ‘Aktionsart’. We will introduce instead the term ‘aspectual
character’. The aspectual character of a verb, or more simply its
character*, will be that part of its meaning whereby it (normally) denotes
one kind of situation rather than another. For example, ‘know’ differs
from ‘recognize’ in English, as ‘kennen’ differs from ‘erkennen’ in
German or ‘znatj’ from ‘uznatj’ in Russian, by virtue of its aspectual
character. ‘Know’ (like ‘kennen’ and ‘znatj’) normally denotes a state,
whereas ‘recognize’ (like ‘erkennen’ and ‘uznatj’) normally denotes an
event. It is generally accepted nowadays that any discussion of aspect
from a semantic point of view must also take account of what we are
referring to as the character of particular verbs. It is well known, for
example, that certain subclasses of verbs in English do not normally
occur in the progressive aspect (cf. Leech, 1971: 14ff; Palmer, 1974:
7off). Aspect and character are interdependent in this way because they
both rest ultimately upon the same ontological distinctions.
As we saw earlier, most verbs, in all languages, are inherently dynamic,
in that they normally denote either events (including acts) or processes
(including activities), rather than states (cf. 11.3). Indeed, this distinc-
tion between dynamic and static situations is an important part of what
we have taken to be the ontological basis for the syntactic and morpho-
logical distinction that some, but not all, languages draw between verbs
_and adjectives. As we also saw earlier, however, even in those languages
in which there is a sharp grammatical distinction between verbs and
adjectives, a minority of verbs may be stative, rather than dynamic, in
character: cf. ‘know’, ‘have’, ‘belong’, ‘live’, ‘contain’, etc., in English.
Stative verbs constitute the most important subclass of verbs that do not
normally occur in the progressive aspect in English. Stativity*, then, is
15.6. Aspect 707
lexicalized, rather than grammaticalized, in English: it is part of the
aspectual character of particular verbs. It is grammatically relevant,
however, in that it is incompatible with progressivity*, which is gram-
maticalized in English, as it is in Spanish, Italian, Icelandic, Irish, etc.,
but not in French, German, Latin, Greek, Russian, etc. (cf. Comrie,
1976: 32ff). Whether a language grammaticalizes either stativity or pro-
gressivity (or both, or neither) is something that cannot be predicted in
advance of an empirical investigation of the grammatical and semantic
structure of the language. The incompatibility of stativity and pro-
gressivity is explicable, however, in terms of the language-independent
ontological distinction of static and dynamic situations. Such contrasting
pairs as the stative She has a headache and the non-stative She is having a
bath (comparable, semantically, with She 1s washing her hair) are es-
pecially instructive in this connexion. Even more instructive perhaps
is She 1s having a headache (or She is having one of her headaches), which
can be given several different interpretations according to the context in
which it occurs, but which must necessarily be construed as describing a
dynamic, rather than a static, situation.
Stativity and progressivity are but two of the semantic notions to
which reference is commonly made in general treatments of aspect (cf.
Comrie, 1976; Friedrich, 1974). Others are duration, completion,
habituality, iteration, momentariness, inception and termination. All of
them, it will be noted, are non-deictic temporal notions. It is our main
purpose in this section to show how they relate to, and depend upon, the
subclassification of situations that we first invoked in connexion with
the concept of valency (cf. 12.4). In all that follows we will take for
granted the validity of the distinction between events, states and pro-
cesses, on the one hand, and of the distinction between acts and activities
(as distinguishable kinds of actions), on the other. Events, it will be
recalled, are non-extended dynamic situations that occur, momentarily,
in time; processes are extended dynamic situations that last, or endure,
through time; states are like processes in that they too last, or endure,
through time, but they differ from processes in that they are homo-
geneous throughout the period of their existence; acts and activities are
agent-controlled events and processes, respectively (cf. 12.4). :
Certain consequences derive immediately from this classification. One
of them has been mentioned already: the incompatibility of stativity
with progressivity.22 Another is the possibility of grouping states and
22 This is a different explanation from the one given in Lyons (1968: 315f),
which is rightly criticized by Bennett (1975: 111).
708 Deixis, space and time
processes together, in contrast with events, in terms of the notion of
duration. In so far as aspect is concerned, either one of these two binary
distinctions might be taken to be more basic than the other and gram-
maticalized in particular languages. If we take the notion of markedness
into account, however, we see that there are, in principle, six, rather than
two, possibilities:
(i) stative vs. non-stative
(ii) dynamic vs. non-dynamic
(iii) stative vs. dynamic
(iv) durative vs. non-durative
(v) punctual vs. non-punctual
(vi) durative vs. punctual
We also see that the progressive vs. non-progressive distinction that is
grammaticalized in English cannot be identified with any of these six:
progressivity involves both dynamicity, unlike (iv), and durativity,
unlike (ii), Whereas progressivity is a natural enough concept, so that
states and events may be grouped together negatively in contrast with
processes, in a two-term system of aspects which grammaticalizes the
distinction between a marked progressive and an unmarked non-
progressive (as in English), it 1s difficult to see any positive reason for
bringing both events and states together in contrast with processes. To
the six possibilities just listed we may therefore add a seventh,
(vii) progressive vs. non-progressive
rather than a further three. All but (111) and (vi), it will be noted, are
privative oppositions of the kind that give rise to the structurally import-
ant phenomenon of markedness (cf. 9.7).
In the present state of our knowledge of the grammaticalization of
aspectual distinctions throughout the languages of the world, it is
impossible to say with any degree of confidence just how many of the
seven potential two-term systems actually exist. But (iv), (v) and (vi1)
would seem to be quite common. English, as we have seen, 1s but one of
several European languages that exemplifies (vii); French, and many
other languages, exemplify (iv) in the past tense at least; and, according
to what is probably the standard analysis these days, Russian exemplifies
(v), in that the so-called perfective positively represents a situation as an
event, whereas the corresponding imperfective, being the unmarked
term, simply fails to represent it as an event and therefore only nega-
tively, as it were, has anything to do with durativity. The difference
15.6. Aspect 709
between (iv) and (v) is complicated, however, by the fact that what is the
marked member of an aspectual opposition in the experiential mode of
description may be the unmarked member in the historical mode, and
conversely (cf. 15.4). The perfective is generally taken to be the seman-
tically marked member of the perfective-vs.-imperfective opposition in
Russian and the other Slavonic languages (cf. Comrie, 1976: 112). But
its relatively high frequency of occurrence in written texts (in which
respect it is comparable with the Greek aorist and the French simple
past) would tend to suggest that, as far as the historical mode of descrip-
tion in the past tense is concerned, the perfective is unmarked (cf.
Friedrich, 1974: 30).
However that may be, the English progressive, the French imperfect,
the Russian imperfective, and comparable aspects in other languages —
comparable in that they can be accounted for in terms of (iv), (v) or (vii)
— may all be used, in the historical mode of description, to represent one
situation as a state or process within which some other situation,
represented as an event, is temporally located. Such sentences as the
following exemplify this possibility (cf. Comrie, 1976: 3):
(1) John was reading when Mary came in
(2) Jean lisait quand Marie entra (French)
(3) Ivan éital kogda Maria voSla (Russian).
Mary’s entry is represented as an event occurring at some point within
the period during which John’s reading was going on.
It is important to realize that, in languages in which the distinction
between events and processes is grammaticalized in the aspectual sys-
tem, whether a situation is represented as the one or the other does not
depend upon some absolute measure of duration. What is, both objec-
tively and as perceived by the speaker, the same situation may be repre-
sented as either a process or an event according to whether the speaker is
concerned with its internal temporal structure or not. It is for this
reason that either
(4) Ilrégna pendant trente ans
or
(5) Il régnait pendant trente ans, |
both translatable into English as
(6) He reigned for thirty years,
710 Deixts, space and time
may be used in French to describe what is objectively the same situation
(cf. Comrie, 1976: 17). Looked at from one point of view, a thirty-year
reign is just as much an event as is a sudden explosion or a flash of
lightning. It all depends upon whether the person who refers to the
situation in question 1s concerned to treat 1t in one way rather than the
other. As we saw earlier, in the historical mode of description dynamic
situations are normally represented as events and in the experiential
mode as processes; and this accounts for some of the gaps and asym- .
metries to be found in particular languages (cf. 15.4). It also accounts for
certain stylistic nuances associated with the use of one aspect rather than
another in particular languages.
The fact that there is an element of subjectivity involved in sub-
classification of situations as events, states and processes does not
invalidate the temporal distinctions upon which this subclassification is
based. If a situation is represented in one way rather than another, then
it becomes subject to the logic of temporal relations which determines
the acceptability of certain combinations of aspectual notions and the
unacceptability of others (cf. Miller & Johnson-Laird, 1976: 442ff).
Among the more important of the principles which govern the logic of
aspectually relevant temporal relations the following may be taken as
axiomatic: (i) given the undimensional directionality of time and our
punctual conceptualization of events (i.e. as second-order entities with
position, but no magnitude, in the continuum of time), two or more
events may be ordered in terms of precedence and successivity, but one
event cannot be included, wholly or partly, within another; (ii) by virtue
of our everyday assumptions about time (notwithstanding our commit-
ment to the theory of relativity), two or more events can be represented
as absolutely simultaneous; (111) since states and processes are extended
in time, but events are not, an event may be included, as a point, within
the temporal extension of a state or process; (iv) two (or more) states or
processes may be ordered, not only in terms of precedence and success-
ivity, but also in terms of co-extension or (total or partial) inclusion.
These principles, it will be noted, lend themselves very naturally to a
localistic formulation (cf. 15.7); and they go a long way towards account-
ing for the use of the major aspectual distinctions that are gram-
maticalized in the more familiar European languages.
But they do not go all the way. We must now draw upon the additional
notion of phase*, not in the rather specialized sense in which the term
‘phase’ has been employed (in contrast with ‘aspect’) by such scholars
as Joos (1964) and Palmer (1974), but in a sense which is closer to that
15.6. Aspect 711
which it bears in everyday usage (cf. Comrie, 1976: 48). Durative situa-
tions (i.e. states and processes), unlike events, may have — and, unless
they are either omnitemporal or eternal (cf. 15.4), necessarily will have —
both a beginning and an end (at different points in time). Furthermore,
if they are temporally bounded, in that they have a beginning and an
end, they will have, between their beginning and their end, indefinitely
many temporal phases. States differ from processes (including activi-
ties), as we have seen, in that the former are homogeneous and un-
changing throughout their successive phases, whilst the latter are not.
There is a sense, however, in which processes, no less than states, can be
said to consist of successive homogeneous phases. Just as ‘‘John has
loved Mary from t; to ¢;”” entails “John loved Mary at ¢;’”’, so “John has
been running from % to t;’’ entails “John was running at t,”’, where ty
refers to any of the infinity of points or periods of time between #; and t;
(i.e. ti < te < tj). This similarity between states and processes, which
was noted by Aristotle, has been emphasized recently by a number of
scholars, including Ryle (1949), Kenny (1963) and Vendler (1967).
They have pointed out further that, within the class of processes,
there is a subclass, which Vendler calls accomplishments*, to which the
notion of completion is applicable. It makes sense to enquire with respect
to various kinds of processes “Has it stopped?’’. But the question ‘‘ Has
it finished?’ carries with it the more specific presupposition that the
process to which reference is being made 1s one that proceeds towards a
climax, or natural terminal point. For example, the process of deciding
is an accomplishment (in this rather specialized sense of the term
‘accomplishment’) which has as its terminal point the event (in this
case an act) of reaching a decision. Since deciding, unlike running or
singing (but like running a race or singing a song), is an accomplishment,
it makes sense to ask such questions as “‘How long did John take to
decide?”’. Accomplishments take time and are completed in time,
rather than merely going on and coming to an end in time. Furthermore,
whereas “‘ John is deciding”’ implies “John has not (yet) decided”, “‘John
is running” does not imply “John has not (yet) run’’, but rather “John
has run”’ (or “‘ John has been running’”’); and “ John has decided”’ implies
‘John is no longer deciding’, whereas “John has run’”’ does not imply
‘John is no longer running”’ (unless it is known that John had a certain
amount, or stint, of running to do, such that it makes sense to say “John
has finished running” or even “John has done his running”’). These
logical differences make clear the importance of taking into account
the distinction between accomplishments and non-accomplishments
712 Deixis, space and time
in theoretical discussions of the nature of aspect (cf. Dowty, 1972A).
The terms ‘perfective’ and ‘perfect’ owe their origin to the tradi-
tional view that the basic function of the aspects to which these labels
are attached in particular languages is to express completion. As we
have just seen, however, the notion of completion is inapplicable to
non-accomplishments. To this extent, therefore, the traditional terms
‘perfective’ and ‘perfect’ are potentially misleading.
Accomplishments are processes which have as their end-point an
event. Vendler (1967) also recognizes, and distinguishes from accom-
plishments, a subclass of situations which he describes as achievements*.
For example ‘arrive’ is an achievement-denoting verb. So too are
‘remember’, ‘forget’, ‘die’ and such phrases as ‘win the race’, ‘eat up
his dinner’, etc. Achievements are events, rather than processes. It
follows that an achievement-denoting verb cannot normally be used
with a period-of-time adverb; and it cannot normally be used in a dura-
tive aspect (e.g., in the English progressive). Nor does it make sense to
ask with respect to what is presented as an achievement and therefore
instantaneous, ““How long did it take?’”’. There are many apparent
exceptions to these generalizations. Most of them are to be accounted for
by the fact that particular kinds of achievements are frequently associ-
ated with particular kinds of activity whose successful performance
results in the achievement; and what appears to be an achievement-
denoting verb is to be taken, in the context, as an activity-denoting verb.
For example, John is winning might be interpreted as meaning “‘John is
performing in such a way that he is likely to win’’. What it cannot mean
is “‘John is in the process of winning’’. For winning is not a process.
Somewhat different are The train 1s now arriving at platform six and They
are now reaching the top, in which the durative verb-expressions, ‘be
arriving’ and ‘be reaching’, describe the terminal phase of processes
which will naturally issue in the achievements of arriving at platform six
and reaching the top respectively. Different again are such apparent
counter-examples as He is (always) forgetting something or other, in
which (under the most likely interpretation of this utterance) the pro-
gressive aspect has an iterative function: a series of events (in this case
achievements) is represented as if it were a process in progress.
It has now become clear that such notions as duration, completion,
momentariness, inception and termination (which were listed earlier as
being among the more common notions to which reference is made in
general discussions of aspect) are not all applicable to every kind of
situation. If each of them was grammaticalized in some distinctive way
15.6. Aspect 713
in the aspectual system of a particular language, there would be severe
restrictions upon the combination of certain aspects with verbs having a
certain aspectual character. Some languages do have a rich set of distinct
aspects. It is not uncommon, however, for there to be no more than two or
three formally distinct aspects, the distribution of which is rather wider
than the terms that are employed to label them would tend to suggest.
It may then happen, and frequently does, that one and the same aspect
will be interpreted differently according to the character of the verb.
For example, the most typical use of the Ancient Greek aorist, in
which respect it is similar to the Russian perfective or the literary French
simple past, is to represent a dynamic situation as an event. Since static
situations cannot, in the nature of things, be represented as events, one
might suppose that stative verbs could not be used in Ancient Greek in
the aorist aspect. But some of them at least could be; and it is interesting
to note that in many such instances the aorist 1s most naturally inter-
preted as ingressive (or inchoative) — i.e. as indicating entry into the state
that the verb normally denotes. Even in English, the simple past of the
stative verb ‘know’ can be interpreted as ingressive in an appropriate
context: cf. J knew immediately what he had in mind. As static situations
cannot be represented as events, so events cannot be represented as
durative processes; and, once again, it might be expected that there
would be a prohibition upon the use of event-denoting verbs in a dura-
tive aspect. It has already been pointed out, however, that achievement-
denoting verbs may be used in the progressive aspect in English to
describe the terminal phase of a process which naturally issues in the
achievement that the verb denotes: cf. He is dying.
Many other examples could be given, from different languages, of the
occurrence of verbs in aspects that are semantically incompatible, at
first sight at least, with the aspectual character of the verbs in question.
Such examples are commonly regarded as exceptions to any general
statements that can be made about the meaning of particular aspects, or
alternatively, in atomistic rather than structural accounts, as evidence
that a particular aspect has several distinct functions. It would be
foolish to suggest that there are no arbitrary exceptional and unpredict-
able phenomena, here as elsewhere, in the use that is made of the
grammatical resources of particular languages, or, alternatively, that a
particular aspect cannot have more than one meaning. To a very con-
siderable extent, however, the more specific aspectual meaning that a
verb-form has can be seen as the product of the central, or basic, func-
tion of its aspect and its character.
714 Deixis, space and time
It may be added, in this connexion, that, although we have talked
throughout of the aspectual character of verbs, it is not only verbs, but
also nouns and adjectives, that have, or may have, a particular aspectual
character. That this is so will be obvious from our earlier discussion of
the ontological basis for the grammatical distinction between various
parts-of-speech and their syntactically relevant subclasses (cf. 11.3). The
second-order noun ‘explosion’, for example, denotes an event, whereas
‘peace’ denotes a state: hence the difference in acceptability between
The explosion occurred at three o'clock and Peace occurred at three o'clock.
Rather less obvious perhaps is the fact that, especially when we are
talking about the aspectual character of verbs, we must be careful to
make it clear that the term ‘verb’ in this context denotes expression-
classes and not merely lexeme-classes (cf. 11.1). The lexeme ‘read’, for
example, normally denotes an activity (i.e. an agent-controlled process),
whereas ‘read the book’ (containing a singular definite referring expres-
sion) will normally, though not necessarily, be construed as denoting an
accomplishment (terminating in completion): hence the difference in
acceptability between I shall have read the book by three o'clock and I
shall have read by three o'clock, in contrast with the absence of any such
difference in acceptability between I shall have been reading the book by
three o'clock and I shall have been reading by three o'clock. A more
detailed treatment of aspect and aspectual character would need to go
into the question of the interdependence that holds between aspect, on
the one hand, and number, countability and specificity of reference, on
the other (cf. Quirk et al., 1972: 148). As far as certain languages are
concerned it would also have to take into account negation (cf. Russian
Kto-to pozvonil, ‘“Someone telephoned”’ vs, Nikto ne zvonil, ‘‘ No-one
telephoned’”’: perfective vs. imperfective) and the grammatical category
of case (cf. Pettersson, 1972). Few parts of a language-system illustrate
better than its aspect-system does the validity of the structuralist slogan:
Tout se tient (“Everything hangs together’’: cf. 8.1).
But our treatment of aspect must, of necessity, leave much on one
side. We will bring it to an end (if not a conclusion!) by mentioning
briefly a number of specific points of particular importance in semantics.
The first has to do with the nature of what is traditionally called the
perfect (as distinct from the perfective) in grammatical descriptions of
particular languages (cf. Comrie, 1976: 52ff; Friedrich, 1974: 16ff).
This is commonly, and perhaps always in origin, a stative aspect, with
the more specific feature that it is used to represent the state that it
denotes as being consequent upon the completion of the process which
15.6. Aspect 715
the verb (in other aspects than the perfect) denotes. Generally speaking,
this is true of the English perfect, as far as process-denoting verbs are
concerned: cf. [ have learnt my irregular verbs now. What is normally a
stative verb in one language is often translatable by means of the perfect
aspect of a process-denoting verb in some other language (cf. Comrie,
1976: 57). In languages that have both stative verbs and a stative
(resultative) perfect there is still a semantic difference, in principle, not
only between the perfect and the non-perfect of a particular stative verb
(cf. I know my irregular verbs now vs. I have known my irregular verbs
since I was in the fourth form or I have known my irregular verbs in the
past, but I don’t know them any more), but also between the perfect and
the non-perfect of what may be thought of, from this point of view, as
corresponding, stative and non-stative verbs. ““X knows Y”’ does not
entail ““X has learned Y’’; and, strictly speaking, ‘“‘X is dead’’ does not
entail “X has died’’, though “X has died”’ entails “X is dead’’ (pro-
vided that we exclude the possibility of resurrection or rebirth: Christ has
died, but he now lives; Fohn Smith has died several times on the operating
table, but each time he has been brought back to life and now he 1s living a
full and useful life in Surbiton). In practice, however, we tend to operate
with a less strict notion of implication, than entailment (cf. 14.3). And
one of the consequences is that, just as the perfect of ‘get’ has now come
to be used in English as if it were semantically equivalent to the non-
perfect of the stative verb ‘have’ (cf. ‘He has/has got a cold’), so what is,
from a diachronic point of view, a stative perfect may develop into a
stative non-perfect. What is lexicalized in English in the contrasts
between ‘have’ and ‘get’ and between ‘be’ and ‘have’ (if this is indeed
lexicalization: 13.1) is grammaticalized in the aspect-systems of many
languages.
It has often been pointed out that the English perfect has certain uses
which make it more like a tense than an aspect; and there is something
to be said for this view. There is a very general tendency for aspects like
the perfect, which are retrospective in the sense that they carry an
implicit reference to a point or period of time preceding that of their
primary temporal reference, to develop into simple past tenses or, more
specifically, tenses referring to the recent past (cf. Anderson, 1973b: 37).
This has happened, for example, in modern spoken French. As far as
English is concerned, however, the basically aspectual nature of the
perfect is manifest in the well-known restrictions that govern the com-
bination of the perfect with point-of-time adverbs (having definite,
rather than indefinite, reference). It is also manifest in the interesting
716 Detixis, space and time
property of the English perfect, usually interpreted in terms of the
notion of current relevance, which makes Queen Victoria has visited
Brighton, under conditions of non-emphatic stress and for anyone who
presupposes that Queen Victoria is dead, less acceptable, or less im-
mediately interpretable, than Brighton has been visited by Queen Victoria
(cf. Palmer, 1974: 53; Chomsky, 1969; McCawley, 1973).
The next point has to do with the related notions of habituality and
interaction (or frequency). The term ‘habitual’ is hallowed by usage;
but it is something of a misnomer in that much of what linguists bring
within its scope would not generally be thought of as being a matter of
habit. For example, it would be absurd to say that the apple-trees at the
bottom of the garden are in the habit of shedding their fruit in October,
as one might say of John Smith that he habitually goes to church, or
changes his shirt, on Sundays. The term ‘habitual’, then, is conven-
tionally applied by linguists to situations (and, derivatively, to the
aspects that describe such situations) to which a much broader, but
intuitively related, set of terms is applicable, including ‘customary’,
‘frequent’, ‘regular’, ‘usual’ and even ‘normal’; and ‘iterative’ or
‘frequentative’ are commonly employed in the same sense. It is often
explained: first, that the regular iteration of an event creates a series
which may be represented as a unitary durative situation with many of
the properties of a state, and that this accounts for certain uses of the
aspects in particular languages; and, second, that, in so far as “some-
times’’ shades into “‘often’’ and “often’’ may approximate asymptotic-
ally to “‘always’’, a habitual aspect is appropriately used for the expres-
sion of so-called timeless truths like “Cows eat grass’’. We have already
had occasion. to comment upon the different senses that ‘timeless’ can
bear and, in particular, upon the distinction between “‘omnitemporal”’
and ‘“‘eternal’’ and the relevance that this has to gnomic and generic
statements (cf. 15.4). It so happens that in many, and perhaps most,
languages the same aspect is, or may be, used both for habitual situa-
tions (in the broadest technical sense of ‘habitual’) and in generic state-
ments: the imperfective in Russian, the simple non-progressive in
English, the so-called aorist in Turkish, and so on. In Swahili, however,
a generic and a habitual sense of “‘Cows eat grass’’ can apparently be
distinguished in terms of a grammaticalized aspectual distinction (cf.
Ashton, 1947: 38; Closs, 1967). It is also worth noting that there
are languages in which habituality and progressivity go together (e.g.,
in the past-tense imperfect in French and Latin) and languages in
which they are grammaticalized differently (e.g., English and 'Turkish).
15.6. Aspect 717
Mention of generic states in the previous paragraph brings us to the
final point. This has to do with what is variously referred to as a dis-
tinction between permanent and transitory, or between essential (or
absolute) and contingent, states (cf. Comrie, 1976: 103ff). Some such
distinction is grammaticalized in several languages, whether by means
of a distinction in the case of predicate nouns and adjectives (as in
Finnish, Czech and, to a relatively minor extent nowadays, Russian) or
by means of a difference in the ascriptive or locative copula that is used
in certain constructions (as in Scottish Gaelic, Irish, Spanish and
Portuguese). Brazilian Portuguese is particularly interesting in that it
has extended the ‘ser’ vs. ‘estar’ distinction, which Spanish draws with
respect to predicative adjectives (cf. Es guapa: Estd guapa, ‘‘She is
pretty”’ vs. “She looks pretty (today)’’), to a wide range of predicative
constructions including locatives (cf. Thomas, 1969: 226). Further-
more, it affords proof that, as appears to be so also in Swahili, a systema-
tic distinction must be drawn between what is contingently, though
perhaps always, the case and what is essentially, generically or (in some
non-statistical sense) normally the case.?3 If something belongs in a
certain place, even though it is not, never has been and, as far as one
knows, never will be in that place, it is possible to assert, without odd-
ness or contradiction, the following proposition: “‘X is (é) at/in Y, but
X is (estd) never at/in Y (and never has been, and never will be at/in
Y)’’. The theoretical importance of examples like this, from particular
languages, is that they demonstrate the inadequacy, not to say irrele-
vance, of universal quantification as a means of accounting for the
meaning of statements having to do with the essential properties or
normal location of things.
The treatment that has been given of aspect and of the aspectual
_ character of verbs in this section has been selective, rather than com- |
prehensive.*4 It has been biased towards the discussion of a relatively
small number of semantically relevant general points, each of which
could do with considerably more exemplification and, at times, qualifica-
tion than it has been possible, in the space available, to provide.*> We _
23 Many convincing examples to support this point are given by De Lemos
(1975), to whom I am indebted for what I have learned of the operation of the
‘ser’ vs. ‘estar’ distinction in Portuguese.
*4 Friedrich (1974) suggests that the three most basic oppositions in terms of
which a variety of aspectual systems can be analysed are durative vs. non-
durative, completive vs. non-completive and stative vs. non-stative
25 For further discussion and exemplification, in addition to the works cited in
the text, cf. Bull (1963), Heger (1963), Hirtle (1975), Johanson (1971), Klein
(1974), Schopf (1974), Verkuyl (1972), Wunderlich (1970).
718 Deixts, space and time
turn now to the thesis of localism, which, though it can be, and has been,
defended on a much wider front, has also been put forward with
particular reference to aspect (cf. Anderson, 1973b).
15.7. Localism
The term localism* is being used here to refer to the hypothesis that
spatial expressions are more basic, grammatically and semantically, than
various kinds of non-spatial expressions (cf. Anderson, 1971, 19734).
Spatial expressions are linguistically more basic, according to the
localists, in that they serve as structural templates, as it were, for other
expressions; and the reason why this should be so, it is plausibly sug-
gested by psychologists, is that spatial organization is of central import-
ance in human cognition (cf. Miller & Johnson-Laird, 1976: 375ff).
Stronger and weaker versions of localism can be distinguished
according to the range of grammatical categories and constructions that
are brought within its scope. At its weakest, the localist hypothesis 1s
restricted to the incontrovertible fact that temporal expressions, in many
unrelated languages, are patently derived from locative expressions. For
example, ‘‘nearly every preposition or particle that is locative in English
is also temporal”’; the prepositions for, since and till, which are temporal
rather than spatial in Modern English, “derive historically from loca-
tives’’; and ‘‘those prepositions which have both spatial and temporal
use developed the temporal meaning later in all instances’”’ (Traugott,
1975). What is true of prepositions and particles is true also of very many
verbs, adverbs, adjectives and conjunctions, not only in English, but in
several languages.
The spatialization of time is so obvious and so pervasive a phenom-
enon in the grammatical and lexical structure of so many of the world’s
languages that it has been frequently noted, even by scholars who would
not think of themselves as subscribing to the hypothesis of localism. It is
more characteristic of what is clearly identifiable as localism that it
should treat the grammatical categories of tense and aspect from this
point of view. Tense, as we have seen, is a deictic category; and there is
an obvious parallel between spatial and temporal deixis. As ‘here’ and
‘there’ can be analysed as meaning “‘at this place’’ and “‘at that place”’,
respectively, so ‘now’ and ‘then’ can be analysed as meaning “‘at this
time’”’ and “‘at that time’’. Moreover, by virtue of the interdependence
of time and distance (in that what is further away takes longer to reach),
there is a direct correlation between temporal and spatial remoteness
from the deictic zero-point of the here-and-now (cf. 15.1). It is not sur-
15.7. Localism 719
prising, therefore, that the tense-systems of various languages should
make use of what are locative, and more specifically deictic, forms or
expressions in order to draw such distinctions as past vs. non-past; and
we shall see later that what is commonly regarded as past tense (and has
been so treated in this chapter) is perhaps better analysed, in certain
languages at least, in terms of the more general notion of modal remote-
ness (cf. 17.3).
Aspectual distinctions are even more obviously spatial, or spatializ-
able, than tense-distinctions. Situations are second-order entities whose
relation to time is comparable to the relation that first-order entities
have to space. Events occur at particular points in time, whereas states
and processes endure throughout a certain period of time (cf. 15.6). Asa
point is contained within a line, so an event can be located within the
duration of a state or process: cf. Our first child was born (at a time) when
we were very hard up; He was run over (at a time) when he was crossing the
road. It is no less natural, it would appear, to say of a first-order entity
that it is in some state or process at a particular time and, in doing so, to
make use of the spatial analogy, or metaphor, of location or containment:
cf. ohn is in a state of blissful ignorance; Fohn ts in the process of cleaning
_ his teeth. There are many languages in which the aspectual notions of
progressivity or stativity (and, more especially, contingent stativity) are
expressed by means of constructions that are patently locative in origin
(cf. Anderson, 1973b; Comrie, 1976: 98ff). Indeed, the English pro-
gressive derives, diachronically, from what was once an overtly locative
construction: cf, I’ve been a-courting Mary Fane, where the dialectal, and
now rather archaic, form a-courting can be accounted for in terms of its
derivation from at (on, or in) courting.
Localists commonly treat temporal location as being less concrete
than spatial location, but more concrete than various kinds of so-called
_ abstract location (cf. Anderson, 1971: 100ff; Pottier, 1974: 56). For
example, looked at from this point of view, ‘in despair’ can be said to be
less concrete than ‘in London’. Several kinds of abstract location have
already been referred to in the course of this work. Some of them may be
mentioned again in connexion with the hypothesis of localism.
It was pointed out in chapter g that an understanding, not only of such
directional opposites as ‘up’:‘down’ and ‘front’: ‘back’, but of oppo-
sition in general, is based upon the analogical extension of distinctions
‘‘which we first learn to apply with respect to our own orientation and
the location or locomotion. of other objects in the external world”’ (cf.
9.2). The use of particular prepositions, verbs or adverbs in English |
720 Deixis, space and time
provides plenty of evidence for this view: prices are said to go up or
come down, according to whether they increase or decrease; if X is
judged to be better than Y, X is said to be above or in front of (or to
precede) Y; and so on. Much of what is commonly thought of as being
metaphorical in the use of language can be brought within the scope of
the thesis of localism.
It is less obvious perhaps that much of what is not usually thought of
as being metaphorical can also be brought within the scope of localism.
As we have seen, acquiring a certain property or entering a certain state
stands in the same semantic relation to having that property or being in
that state as arriving at (or coming into) a place does to being in that
place (cf. 9.2). To say that X has become Y or has acquired Z is tanta-
mount to saying, on the localistic interpretation, that X has passed from
the state of not being Y or not having Z to the state of being Y or having
Z. Similarly, to say that X has ceased to be Y or has lost Z is tanta-
mount to saying that X has passed from being Y or having Z to not
being Y or not having Z. The process whereby someone or something
passes from one state to another may be accounted for in terms of the
localistic notion of a journey* and, as far as the grammatical structure of
languages is concerned, in terms of the valency-schema MOVE (ENTITY,
SOURCE, GOAL), introduced in a previous chapter (cf. 12.6). Analysed in
this way, all verbs denoting a change of state may be regarded as verbs-
of-motion (cf. Miller & Johnson-Laird, 1976: 526ff).
If an entity travels from a source to a goal, it usually does so by
taking a particular route, or path*, which may or may not be referred to
(cf. such expressions as ‘through the wood’, ‘along the Embankment’,
‘by way of Beachy Head’) in descriptions of the journey (cf. Bennett,
1975: 18). Journeys are initiated by the entity’s departure from the
source and terminated by the entity’s arrival at the goal. Both departure
and arrival are achievements (in Vendler’s sense: cf. 15.6). If the source
and the goal are conceived as areas, rather than points, departure and
arrival will be achievements that have the more particular logical pro-
perty of being describable as border-crossings* (cf. Jessen, 1974). By
generalizing these localistic notions from the paradigm case of so-called
concrete locomotion (in which a first-order entity moves from one
physical location to another in some measurable interval of time) to
various kinds of abstract locomotion, the implicational relations that
hold between such pairs of prepositions as ‘‘X has learned Y’’ and
‘*X (now) knows Y”’’ or between “X has forgotten Y”’ and “X no longer
knows Y’’, on the one hand, and between “‘X has arrived in Y’’ and
15.7. Localism 721
‘“X is (now) in Y”’ or between “X has departed from Y”’ and “X is no
longer in Y’’, on the other, can be brought together within a common
framework. By means of the achievement of learning Y (which may or
may not be the termination of some process), one crosses the border
between the ignorance and the knowledge of Y; by forgetting Y, one
crosses the border in the reverse direction (i.e. going out of, rather than
coming into, the state of knowledge). Among the many sets of expres-
sions in English whose aspectual character and implicational relations
can be accounted for within this localistic framework are ‘go to sleep’,
‘(be) asleep’, ‘wake up’, ‘(be) awake’; ‘get’, ‘have’, ‘lose’; ‘get
married’, ‘(be) married’, ‘get divorced’; ‘be born’, ‘live’, ‘die’, ‘(be)
dead’. In fact, all that was said in the previous section about aspect and
aspectual character could be reformulated in terms of the localistic
notion of travelling from one place or state to another place or state.
It will also be clear from our earlier discussion of causativity and
transitivity how these two notions relate to the hypothesis of localism
(cf. 12.5). The very term ‘transitive’, as we have seen, derives from the
paradigm instance of an agentive situation in which both the operative
schema AFFECT (AGENT, PATIENT) and the factitive schema PRODUCE
(CAUSE, EFFECT) are applicable. The hypothesis of localism has always
laid great stress upon the natural association of the valency-roles
SOURCE, CAUSE and AGENT, on the one hand, and of GOAL, EFFECT and
PATIENT, on the other — an association that is manifest in, and according
to the localists accounts for, certain apparent coincidences in the use of
such grammatical cases as the ablative* or dative*, or prepositions (or
postpositions) meaning ‘“‘from”’ or “to”’, in many unrelated languages
(cf. Anderson, 1971; Anderson & Dubois-Charlier, 1975).
Causes, according to the view of causality that we have adopted here,
are second-order entities; as such, they may be conceived, localistically,
as the sources of their effects. Similarly, at a higher level of abstraction,
the antecedent proposition, p, may be thought of as the source from
which the consequent proposition, g, proceeds in a complex conditional
proposition: i.e. “if p, then q’’ can be interpreted, localistically, as ‘gq
comes from p’’. This would suggest that the semantic and gram-
matical parallelism that holds between causal, conditional and temporal
clauses can be accounted for, not only in terms of the successivity in
time of causally related situations, but, more particularly, in terms of the
localistic notion of a journey (cf. 12.5).
| Even instrumental adverbials and adverbials of manner, which, like
locative, temporal and causal adverbials are characteristically adjuncts,
722 Detixis, space and time
rather than nuclear constituents, in simple sentences, may be brought
together, from a localistic point of view, and analysed in terms of the
notion of a path (cf. Anderson, 1971: 171). It is noticeable, in this con-
nexion, that the form how in English (in both its relative and its interro-
gative function: cf. ‘This is how he did it’ and ‘How did he do it?’)
subsumes both instrumental adverbials and adverbs of manner (cf.
‘with a knife’ and ‘carelessly’); and in each instance it can be para-
phrased with a phrase containing the word ‘way’ (cf. ‘This is the way in
which he did it’ and ‘In what way did he do it?’). It is presumably for
this reason that Pottier (1974: 197) classifies the French adverb ‘ainsi’
(“‘thus”’’) as the notional (i.e. abstract or third-order) correlate of the
spatial deictic ‘ici’ (“here’’) and the temporal deictic ‘maintenant’
(““now’’). As we saw earlier, with respect to what is commonly, but
perhaps wrongly, thought of as the universal circumstantial role of
instrument, there may be several alternative templates for the gram-
matical and lexical categorization of what is objectively the same situa-
tion (cf. 12.6). At least one of these templates, however, would seem to
be spatial in origin.
Two kinds of states that have been discussed more frequently, per-
haps, than others in terms of the notion of abstract location are those of
possession and existence. This is because there are very many unrelated
languages throughout the world in which overtly locative constructions
are used in sentences that would be translated into English as ‘John has
a book’ (or ‘The book is John’s’) and ‘There are unicorns’, or ‘ Uni-
corns exist’ (cf. Allen, 1964; Asher, 1968; Boadi, 1971; Christie, 1970; __
Li, 1972; Lyons, 1967, 1975). In everyday usage the term ‘possession’
is more or less equivalent to ‘ownership’ (though jurists may draw a
sharp distinction between the two terms): whatever X is said to possess
may be described as his property. In traditional grammatical usage
‘ possession’ and ‘possessive’ are construed much more broadly. Indeed,
it can be argued that they are highly misleading: it is only a minority of
what are traditionally called possessive constructions that have anything
. to do with property or possession; and there is no reason to believe that
this minority constitutes a particularly important, or basic, subclass of
the total class. However that may be, there can be no doubt that the
localist interpretation of so-called possessive constructions is eminently
plausible on both semantic and syntactic grounds. The question Where
1s the book? can be answered, equally well, with either J?’s on the table or
John has it; and there is no reason to treat the verb ‘have’, here and
_. elsewhere, as anything other than a transformationally inserted variant
15.7. Localism 723
of the locative copula. We saw earlier that there is nothing paradoxical,
untraditional though it may be, about the postulation of underlying
locative subjects (cf. 12.3). We can say that ‘John’ (or ‘at John’) is the
underlying locative subject in ‘John has a book’, as ‘on the table’ is the
underlying locative subject in ‘There is a book on the table’; in ‘John
has the book’, on the other hand, the underlying personal locative ‘at
John’ is perhaps best regarded as being predicative (cf. ‘The book is on
the table’).
Existential constructions are, if anything, even more obviously of
locative origin than so-called possessive constructions are. It can be
argued, in fact, that existence is but the limiting case of location in an
abstract, deictically neutral, space (cf. Lyons, 1975); and that it is the
deictically neutral, so-called existential, sense of the adverb ‘there’ that
is adjectivalized (and amalgamated with a pronominal element) in the
definite article (cf. 15.2). But whether we take this view of the matter or
not, the locative basis of existential constructions, in many, if not all,
languages, is hardly open to doubt and has been quite widely recognized.
Once the locative source of existential constructions is postulated, the
way lies open for a localistic interpretation, not only of such existential-
causatives* (traditionally said to manifest an object-of-result) as ‘God
created Adam’, ‘John painted a picture’ (cf. 12.5), but also of the
linguistic representation of such central logical notions as quantification,
negation, knowledge and truth. Since we are not concerned, in this work,
to push the hypothesis of localism to the limits of its coverage or to
develop it in any detail, we will leave quantification and negation on one
side (cf. Anderson, 1975) and confine ourselves to a brief, and some-
what speculative, localistic account of knowledge and truth.
The concepts of knowledge, truth and existence are interconnected in
that the proposition “‘X knows p”’ cannot be true unless the proposition
p itself is true (cf. 17.2); and p cannot be true, of some world (or world-
state), unless the situation that it describes actually exists in the world
(or world-state) in question (cf. 6.5). Now propositions, which we have
classified as third-order entities, may be treated either as purely abstract,
non-psychological, entities or, alternatively, as objects of knowledge and
belief. Truth is the third-order correlate of what for first-order entities
is existence in space; and a statement like That ts so (where ‘that’ refers
to a proposition) is structurally comparable with a statement like
X exists (where X refers to a first-order entity). This, in a nutshell, 1s
one version of a localistic theory of truth; and (although we shall not
develop this point in the present work) it will be seen later that this con-
724. Deixis, space and time
ception of truth fits in with the logical analysis of utterances with which
we shall be operating, according to which every statement contains an
‘“it-1s-so’’ component (cf. 16.5).
If propositions are treated as psychological entities, rather than as
purely abstract third-order entities, then it is natural to treat as their
location the persons (or the minds, or brains, of the persons) who have
what philosophers might describe as a propositional attitude (know-
ledge, belief, etc.) with respect to them. It is obvious that the process of
communicating propositional information is readily describable, as is
the process of transferring possession (cf. Miller & Johnson-Laird,
1976: 558ff), in terms of the localistic notion of a journey: if X com-
municates p to Y, this implies that p travels, in some sense, from X to Y.
It does not follow, of course, that Y will believe p; and, even if Y does
believe p, he cannot be said to know p, unless p is true. At most, he can
be said to have been acquainted with p. It may be suggested, therefore,
that ‘“‘p is at X”’ (where X is a person) is the underlying locative struc-
ture that is common to “X knows p’’, “X believes p’’, ‘“X has p in
mind’’, etc. There is much in the structure of particular languages,
however, to suggest that ‘““X knows p”’ is comparable with “X has Y”’
and should therefore be regarded as the most typical member of the
class of propositions subsumable under “‘p is at X”’. We will not develop
this admittedly rather speculative point in the present work. It may be
mentioned, however, that, as the localistic notion of truth fits in with the
tripartite analysis of utterances that is to be adopted later, so the localistic
analysis of propositional knowledge that has been adumbrated here can
be extended to cover many of the constructions in which modality (both
epistemic and deontic: cf. 17.2, 17.5) is objectified, or propositionalized,
in natural languages.
At the beginning of this section the point was made that the hypothesis
of localism can be maintained in either a stronger or a weaker form. It is
only in a relatively strong version of localism that the linguistic expres-
sion of truth and modality, not to mention negation and quantification,
would be brought within its scope; and nothing that is said about these
notions elsewhere in this book depends upon a localistic interpretation of
them. There is much else that has been referred to in this section —
notably aspect, the grammatical category of case, and existential and }
possessive constructions — for which a localistic analysis would be far
more widely accepted.
16
Mood and illocutionary force
16.1. Speech-acts
Throughout much of this book so far we have been mainly concerned
with the descriptive function of language: i.e. with the way language is
used to make statements*.! But language also serves as an instrument for
the transmission of other kinds of information. Not all the utterances we
produce are statements; and statements, as well as questions, commands,
requests, exclamations, etc., will contain a certain amount of non-
descriptive information, which may be characterized, broadly, as
expressive* (or indexical*) and social* (cf. 2.4). Furthermore, the trans-
mission of descriptive information is not usually an end in itself. When
we communicate some proposition to another person, we do so, nor-
mally, because we wish to influence in some way his beliefs, his attitudes
or his behaviour.
To produce an utterance is to engage in a certain kind of social inter-
action. This is a fact that, until recently, logicians and philosophers of
language have tended to overlook, though it has often been stressed by
linguists, psychologists, sociologists and anthropologists. One of the
most attractive features of the theory of speech-acts, which was intro-
duced into the philosophy of language by J. L. Austin, is that it gives
explicit recognition to the social or interpersonal dimension of language-
behaviour and provides a general framework, as we shall see, for the
discussion of the syntactic and semantic distinctions that linguists have
traditionally described in terms of mood* and modality*.
Austin’s theory of speech-acts* was developed over a number of
years; and in its final version (in so far as Austin himself succeeded in
producing a final, or definitive, version before his death) it is deliberately
7 For example in Ancient Greek the form émén (‘‘truly’’, ‘‘verily’’) was regu-
larly used, with or without the accompanying performative verb-form
hémnumi (‘‘I swear’’), in oaths.
§ Austin began this task himself and introduced a number of general classes of
speech-acts (cf. Fraser, 1974).
738 Mood and tllocutionary force
and governed by, felicity conditions of the same kind as those which
govern other forms of behaviour and social interaction in that society.
At this point, it is convenient to introduce the notion of what have
been called parenthetical verbs*. Verbs such as ‘suppose’, ‘believe’ and
‘think’ may be used parenthetically in the first person of the simple
present tense “to modify or weaken the claim to truth that would be
implied by a simple assertion’? (Urmson, 1952). Their function, as
described by Urmson, is illustrated by sentences like
(9) She’s in the dining-room, I think;
and it is comparable, if not identical, with what was referred to in
chapter 3 as the prosodic and paralinguistic modulation* of utterances
(cf. 3.1).
The similarity between performative and parenthetical verbs will be
obvious. In fact, it would seem to be desirable to widen the definition of
parenthetical verbs offered by Urmson so that it also includes performa-
tive verbs used parenthetically. Sentences such as the following
(10) I'll be there at two o’clock, I promise you
illustrate the parenthetical use of performative verbs. In uttering a
sentence like (10), the speaker adds to the first clause, with which he
performs the illocutionary act of promising, a second clause which
makes explicit the nature of his speech-act; and the parenthetical ‘I
promise you’ confirms, rather than establishes, the speaker’s commit-
ment (cf. [’ll be there at two o'clock — that’s a promise). Now, as (10) is
related, both semantically and grammatically, to
(11) I promise (that) I'll be there at two o’clock,
so (9) is related to
(12) I think (that) she’s in the dining-room;
and it is arguable that
(13) I promise to be there at two o’clock
is semantically, if not grammatically, equivalent to (11). Just how these
sentences are related, grammatically and semantically, is a controversial
question. It has been argued by J. R. Ross (1970) and others that all
sentences contain an underlying performative verb of saying: we will
come back to this question later (16.5).
16.1. Speech-acts 739
The parallelism between parenthetical and performative verbs was
noted by Benveniste (1958a), independently of both Austin and Urm-
son; and Benveniste emphasized their non-descriptive role as markers of
subjectivity (“indicateurs de subjectivité’’) — i.e. as devices whereby the
speaker, in making an utterance, simultaneously comments upon that
utterance and expresses his attitude to what he is saying. This notion of
subjectivity* is of the greatest importance, as we shall see, for the under-
standing of both epistemic* and deontic* modality (cf. 17.2, 17.4).
In a related article, Benveniste (1958b) also draws attention to what
he calls delocutive* verbs. These may be defined as follows: a verb ‘x’
is delocutive if it is morphologically derived from a form «x and if it
means “‘to perform the (illocutionary) act that is characteristically per-
formed by uttering x (or something containing x)’. This definition is
hardly precise enough, as it stands (cf. Ducrot, 1972: 73ff): but it will
serve for our present purpose. The important point to note is that x is a
form that is uttered in the performance of the act that is denoted by the
lexeme ‘x’ and that there is a morphological relationship between x and
the forms of ‘x’. For example, the Latin ‘salutare’ (“to greet’’) is
morphologically related to the stem-form of ‘salus’ and thus to Salus!
(“‘Greetings!’’); the French ‘remercier’ (“to thank’’) is morphologic-
ally related to merci (cf. Merci!, “Thank you!’’); the English ‘to wel-
come’ is morphologically related to the form welcome (cf. Welcome!).
Of course, ‘salutare’ does not mean “‘to say Salus!’’, any more than
‘remercier’ means ‘“‘to say Merci!’ or ‘to welcome’ means “‘to say
Welcome!’’. But one way of greeting a person in Latin was to say Salus/,
as one way of thanking someone in French is to say Merci! and one way
of welcoming someone in English is (or was) to say Welcome! Moreover,
in each case the utterance that serves as the basis for the morphological
derivation of the verb denoting the more general act of greeting, thank-
ing or welcoming is one that is (or was) characteristically used for this
purpose. The conventionalization of the utterance of x is prior to the
creation of the lexeme ‘x’ or to the association with the pre-existing
lexeme ‘x’ of the sense ‘‘to perform the act that is characteristically
performed by uttering x’’. Up to a point, therefore, we are justified in
saying that the more general sense “‘to greet”’ developed out of the more
specific sense ‘‘to say Salus/”; and so for the more general sense of
‘remercier’, ‘to welcome’, etc.
What must be emphasized, however, is that the verb ‘say’, which we
have used in referring to the more specific sense of ‘salutare’ does not,
and cannot, simply mean “‘utter’’; and the reason why this is so is
740 Mood and illocutionary force
crucial for a proper understanding of both delocutive and performative
verbs, and of the connexion between them.
It is a commonplace of the philosophical discussion of language that
the verb ‘say’ (and more or less comparable verbs in other languages)
has several distinguishable senses. Austin himself (1962: g2ff) analyses
the act of saying (“in the full sense of ‘say’ ’’) into three component acts:
(i) the act of “uttering certain noises’’; (11) the act of “‘uttering certain
vocables or words, i.e. noises of certain types belonging to and as
belonging to a certain vocabulary, in a certain construction”’; (iii) the
act of using the product of (11) “with a certain more or less definite sense
and a more or less definite reference (which together are equivalent to
meaning)’’. It is easy to see that Austin’s analysis is, from the linguistic
point of view, either incomplete or imprecisely formulated (e.g., it is not
made clear, under (i), how much of the vocal signal is covered by the
non-technical term ‘noise’, and no attempt is made, under (ii), to dis-
tinguish between forms, lexemes and expressions); and the technical
terms that Austin does introduce at this point (notably ‘phatic’ and
‘rheme’) tend to be used quite differently by linguists. But his general
intention is clear enough; and, as far as it goes, his analysis would seem
to be on the right lines. At least these three kinds of acts are involved in
the complex act of saying.
Vendler (1972: 6ff) draws a broad distinction, as others have done,
between saying something in the full sense of the word (let us call this
“‘say,’’) and saying something in the weak sense which is “roughly
equivalent to uttering, mouthing or pronouncing”’ (let us call this
‘‘say,’’); and he points out that “‘no illocutionary act will be performed
if, for one thing, the speaker does not understand what he is saying or,
for another, he does not intend to perform such an act, that is, does not
intend his audience to take him to be performing one”’ (p. 26). It is in-
herent in the notion of performing an illocutionary act (i.e. of saying in
the sense ‘“‘say,”’) that the speaker shouJd both understand and mean
what he says (in the sense “‘say,’’). It might be argued that we are
frequently held to be responsible, in a court of law for example, for the
unintended consequences of our actions. This is true, but irrelevant. If
we are judged guilty of breach of promise by virtue of the utterance of
something that we did not intend to be taken as a promise, our guilt is
established, at law, in terms of the eminently practical principle that we
must be deemed to have made a promise if we have ostensibly performed
an act which is conventionally interpreted as making a promise and if
there is no clear indication at the time that we are not to be taken
16.1. Speech-acts 741
seriously. But it is one thing to be deemed to have made a promise; it is
another to have actually made a promise. If we indicate clearly (i.e. in a
way that any reasonable person could be expected to interpret correctly)
that our utterance is not to be taken seriously as a promise, we shall not
only not have made a promise, but we shall not even be deemed to have
made a promise. What was said earlier about promising, and more
generally about the performance of any illocutionary act, is to be con-
strued in terms of this proviso. One cannot unwittingly or unintention-
ally say something 1n the sense “say,” merely by saying something in
the sense “say,’’. Furthermore, as Vendler points out, whereas ‘say’ in
the sense “say,” 1s an activity-verb, in the sense “‘say,’’ it is an
accomplishment-verb: it follows that the truth of the proposition “X
says,...” at a particular point in time carries no implications whatsoever
with respect to the truth of the proposition ‘‘X says, ...’’ at the same,
or any subsequent, point in time (cf. 15.6).
This distinction between “say,” and “‘say,”’ is by no means sufficient
to support all the weight that it is sometimes expected to bear: in
particular, it will not of itself suffice for drawing the distinction between
direct and indirect discourse from a semantic point of view.® Apart from
the various problems that philosophers have discussed in their attempts
to make precise all that is involved in “‘say,’’, there are quite serious
problems attaching to the interpretation of “‘say,’’; and these have not
been so extensively discussed. It is clearly of some importance, for
example, to distinguish between the type-token identity that is relevant
to the notion of repetition and the type-token identity that is relevant to
mimicry. Repetition and mimicry are two quite different kinds of
replication* (cf. 1.4). When we assert truly that X has correctly repeated
Y’s utterance, we abstract from all sorts of phonetically describable
differences in the utterance-signals. Voice-quality is certainly not rele-
vant to the specification of the truth-conditions of the sentence “Mary
repeated what John had said’. For Mary to make an attempt to replicate
John’s characteristic voice-quality in response to his request that she
should repeat what he had said would be, to say the least, supereroga-
tory. So too would be her attempt to replicate the paralinguistic, and
even some of the prosodic, features in his utterance. The truth-condi-
tions of ‘Mary repeated what John had said’ (unlike those of ‘Mary
imitated what John had said’) are presumably identical with the truth-
® For some interesting comments on the relationship between corresponding
direct and indirect discourse constructions cf. Banfield (1973), Partee (1972),
Zwicky (1971).
742 Mood and illocutionary force
conditions of ‘John said X and so did Mary’ (where X is a form or an
utterance-signal and ‘and’ is construed to mean ‘‘and subsequently’’).
But it is remarkably difficult to establish, other than by methodological
fiat, what these truth-conditions are.!° It is presumably a necessary
condition of the relevant kind of type-token identity that the two tokens
of X should contain the same forms in the same order. But this is rarely,
if ever, a sufficient condition; and it is not clear that there is any deter-
minate set of additional conditions that would be appropriate to decide,
for all occasions of its utterance, whether ‘John said X and so did Mary’
is being used to assert a true proposition or not. In short, “say,” is far
from being as straightforward as one might think. So too is the distinc-
tion that philosophers frequently invoke between what is said (in the
sense ‘‘say,’’) and the manner of saying it.
Crude though it is, the distinction between “say,’’ and ‘“‘say,”’ may
be used to throw light on the nature of performative and delocutive
verbs, and on the nature of the relationship between them. As we have
seen, the utterance by X of J promise can never of itself be a condition of
the truth of the proposition “X promises’’. However, in so far as I
promise serves as a performative formula whose utterance (in the
appropriate circumstances) 1s associated by convention with the act of
promising (i.e. of committing oneself, under pain of dishonour, social
disapproval or some other such sanction, to some future act or course of
action), as the utterance of Hello! is associated with the act of greeting
and the utterance of Welcome! with the act of welcoming, ‘“‘X said,
I promise’ will generally be held to imply “X promised’’, just as ‘X
said Hello!/Welcome! (to Y)”’ will generally be held to imply ‘“‘X greeted/
welcomed Y’’. It is arguable, therefore, that the performative use of
I promise is logically, if not historically, prior to the descriptive use of the
verb ‘to promise’ and that the token-reflexivity of particular utterances
of I (hereby) promise . . . is a secondary consequence of this fact (cf.
Ducrot, 1972: 73ff). However that may be, the semantic connexion
between the Latin delocutive verb ‘salutare’ and the performative
formula Salus! is obviously no different, as far as the distinction between
“say,” and “‘say,’’ is concerned, from the semantic connexion that
10 By the term ‘methodological fiat’ I am referring to the more or less deliberate
process of standardization* that is an inevitable part of linguistic analysis and
description (cf. 14.2). There are of course constraints upon the linguist’s
fiat: up to a point native speakers will agree that two utterances are tokens of
the same type, the one being a repetition of the other. But dialectal and
stylistic variation are such, in most language-communities, that the question
is not always pre-theoretically decidable.
16.1. Speech-acts 743
holds between the descriptive sense of the verb ‘to promise’ and the
performative formula J promise. The fact that the performative formula
I promise, unlike Salus! (or Hello! or Thanks!), contains the first-person
singular form of the corresponding descriptive verb, and may thus be
construed as token-reflexive (cf. 1.4), is, from this point of view, irrele-
vant.
As we have seen, in Austin’s later doctrine all utterances, including
Statements, are taken to be performative utterances. Much of the
original motivation for introducing the term ‘performative’, therefore,
disappears in the subsequent development of the theory of speech-acts.
But the distinction between explicit and primary performatives remains;
so too does the distinction between the performative and the purely
descriptive use of such verbs as ‘say’ and ‘promise’. Each of these two
points requires a final brief comment.
It is not absolutely clear on what grounds Austin draws his distinction
between explicit and primary performatives: in particular, it is not clear
whether an explicit performative must necessarily contain a performa-
tive verb. (The reader should note at this point that, whenever the term
‘performative’ is employed as a noun in this book, it is to be construed
as an abbreviation for ‘performative utterance’. In this respect, we base
our usage upon Austin’s (1962: 6). Other writers treat the noun ‘per-
formative’ as an abbreviation for ‘performative verb’; and this can
occasionally lead to confusion.)"' If we take seriously the criterion of
“making explicit (which 1s not the same as stating or describing) what
precise action is being performed”’ (cf. Austin, 1962: 61), it is obvious
that, in principle, the element that makes explicit the illocutionary force
of an utterance need not be a verb. For there is no reason to suppose that
only verbs have the function of making things explicit. Indeed, it need
not be a word, or even a particle: it could be some prosodic or para-
linguistic feature. But Austin certainly argues throughout as if the only
way in which the illocutionary force of the utterance can be made
explicit is by means of a performative verb (in the first-person singular) ;
and his examples all suggest that this is so. It very much looks, 1n fact, as
if Austin is covertly and perhaps illegitimately restricting the interpreta-
tion of “‘making explicit’’. Exegesis is rendered the more difficult in that
Austin, like most philosophers and many linguists, does not explain how
16.3. Questions
It has been argued that questions can be analysed satisfactorily as sub-
types of mands (cf. Hare, 1949; Lewis, 1969: 186). According to this
proposal Who 1s at the door? might be analysed as an instruction to the
addressee to name (or otherwise identify) the person at the door and
Is he married? as an instruction to assert one of the component simple
propositions of the disjunction “He is married or he is not married’”’.
Essentially the same proposal has been made more recently, within the
framework of generative grammar, by several linguists; and it has been
quite widely accepted. The advantage of this analysis of questions is
that it would enable us to handle the illocutionary force of the three main
classes of utterances in terms of the two primitive notions of asserting
and issuing mands. There are, however, a number of objections to the
proposal that questions should be analysed as instructions to make a
statement. None of these objections is perhaps conclusive. Taken
together, however, they point the way to an alternative, and more
general, analysis of the meaning of questions.”
17 For other approaches to the analysis of questions from a logical and lin-
guistic point of view: cf. Aqvist (1965), Bach (1971), Baker (1970), Hamblin
754 Mood and tllocutionary force
‘The first point to note is that the grammatical structure of what we
will call yes-no* questions (1.e. questions to which we can respond
appropriately with the words ‘yes’ or ‘no’ in English, and their equiva-
lents in other languages) is, in many languages, similar to that of
declarative sentences. In fact, the difference between questions and
statements is commonly drawn solely in the non-verbal component of |
utterances; and it is one that can be associated with an intonation pattern
or paralinguistic modulation of the utterance which expresses the
speaker’s doubt. This fact would suggest that the difference between
declarative sentences and interrogative sentences (in those languages in
which such a distinction is drawn in the verbal component of sentences)
results from the grammaticalization of the feature of doubt. It would be
generally agreed that one of the felicity-conditions attaching to the
appropriate utterance of questions (other than so-called rhetorical
questions) is that the speaker should not know the answer to his ques-
tion. It is for this reason that certain authors prefer to analyse questions
as meaning, not “Assert that such-and-such is so”’, but “‘ Bring it about
that I know that such-and-such is so”’ (cf. Aqvist, 1965; Householder,
1971: 85; Hintikka, 1974b); and it is worth noting that ‘Tell me that
such-and-such is so” can be interpreted in either of these two ways.
What is at issue is whether, in uttering a question, the speaker necessarily
assumes that his addressee knows the answer. If he does not make this
assumption he can hardly impose upon the addressee the obligation to
supply the answer.
The second point to be made is that, if yes-no questions were a sub-
class of mands, one might expect that the response No would indicate the
addressee’s refusal to comply with the mand (1.e. his refusal to state
whether something is or is not so). But this is not the case. If the
addressee says No in response to a question of the form Is the door open?,
he is answering the question. But if he says No in response to what is
clearly a mand, such as Open the door, he is refusing to do what he is
being commanded or requested to do.
Finally (and this is the most important point), it does not seem to be
essential to the nature of questions that they should always require or
expect an answer from the addressee. It is true that, in normal everyday
conversation, we generally expect the questions that we utter to be
answered by our addressee. But this is readily explained in terms of the
(1973), Hudson (1975), Hull (1975), Keenan & Hull (1973), Prior & Prior
(1955), Rohrer (1971).
16.3. Questions 755
general conventions and assumptions which govern conversation. If I
say [ wonder whether the door 1s open or I don’t know whether the door is
open, which (like the question Is the door open?) express my doubt as to
the state-of-affairs which obtains, the addressee can appropriately re-
spond to my utterance, if he is in a position to do so, by resolving my
doubt. Given that this is so, all we need to assume in order to account
for the fact that questions normally expect and obtain an answer is a
conventional association between the utterance of a question and the
expectation of an answer from the addressee. In principle, however, this
association is independent of the illocutionary force of questions.
What seems to be required, in fact, is a distinction between asking a
question of someone and simply posing* the question (without neces-
sarily addressing it to anyone). When we pose a question, we merely give
expression to, or externalize, our doubt; and we can pose questions
which we do not merely expect to remain unanswered, but which we
know, or believe, to be unanswerable. To ask a question of someone is
both to pose the question and, in doing so, to give some indication to
one’s addressee that he is expected to respond by answering the question
that is posed. But the indication that the addressee is expected to give an
answer is not part of the question itself.
The advantage of this analysis of questions is that it 1s more general
than their analysis as mands. It covers, not only information-seeking
questions, but various kinds of rhetorical and didactic questions without
obliging us to treat these as being in any respect abnormal or parasitic
upon information-seeking questions (cf. Bellert, 1972: 59-63). Ithasthe
further advantage that it puts factual questions into more direct corres-
pondence with statements and what are traditionally described as
deliberate questions (e.g., Should I wash my hair to-night?, What am I
to do?) into more direct correspondence with mands and other kinds of
directives. Corresponding statements and factual questions, on the one
hand, and corresponding mands and deliberative questions, on the
other, can be said to have the same phrastic and tropic, but to differ in
their neustic. This is not simply a difference between the presence and
the absence of an element meaning “I-say-so’’; it is the difference
between the presence of an I-say-so element and the presence of an
I-don’t-know element.
One of the inadequacies of the analysis of questions as mands which
has not so far been mentioned is its failure to account satisfactorily for
the difference between wondering whether something 1s so and asking
oneself whether something is so. According to Hare (1971: 85):
756 Mood and illocutionary force
(1) Lf wonder whether that ts a good movie?
which he classifies as an indirect interrogative, is “very similar in
meaning”’ to
(2) I ask myself “Is that a good movie?”’
Similar in meaning they may be, but there is an important difference
between them; and one should not be misled by the fact that in certain
languages the verb used to refer to, or give expression to, an act of
wondering ts a reflexive form of the verb meaning “‘ask”’. The equivalent
of (1) in French, for example, is
(3) fe me demande si c’est un bon film.
But the expression ‘se demander’ does not normally mean “‘to ask
oneself’’; the most common French expression used to refer to, or to
perform, acts of asking oneself whether something is so is ‘se poser la
question’ (“to put the question to oneself’’). The difference between
wondering and asking oneself is the difference between simply posing a
question and putting a question to oneself as the addressee with the
intention of answering it. For one can ask questions of oneself, in
soliloquy and discursive reasoning, just as one can make statements or
issue mands to oneself; and to ask a question of oneself is to perform a
mental or illocutionary act which is governed by the same felicity-
conditions as those which govern information-seeking questions
addressed to others. If Sherlock Holmes asks himself whether his visitor
is married or single, he does so with the expectation and intention, after
considering the evidence, of answering the question which might be
formulated, in an utterance, as ,
(4) Is he married? |
If Sherlock Holmes merely wonders whether his visitor is married he
poses the same question, but he does not necessarily expect to be able to
answer it.
Wondering, like entertaining a proposition, is first and foremost a
mental act: indeed, it is one way of entertaining a proposition. In order
for wondering to be converted into an illocutionary act by means of
utterance, it must be the speaker’s intention to tell the addressee that he
has a particular proposition in mind and that he is entertaining it in
what we may refer to as the dubitative mode.'® Otherwise the utterance
18 'The term ‘mode’ is quite commonly used in this sense by philosophers. It is
related to, though distinguishable from, the sense in which it was employed
16.3. Questions 757
is at most informative, rather than communicative (cf. 2.1); and illo-
cutionary acts, as we have seen, are necessarily communicative (cf. 16.1).
I wonder whether that 1s a good movie may or may not be used (like Do you
know whether that is a good film?, I don’t know whether that 1s a good film,
Can you tell me whether that is a good film?, etc.) to ask, indirectly, a
question of one’s addressee.
So far all the questions that we have actually discussed have been of
the yes—no type. But there is another class of question which, following
Jespersen (1933: 305), we will call x-questions*. (Jespersen’s term for
yes—no questions is ‘nexus-question’: cf. also Katz, 1972: 207.) As
Jespersen points out, in x-questions “we have an unknown quantity x,
exactly as in an algebraic equation”’ and “‘the linguistic expression for
this x is an interrogative pronoun or pronominal adverb’’. Since the
interrogative pronouns and adverbs in English are words, which, in their
written form, typically begin with wh- (who, what, when, where, etc.),
x-questions are commonly referred to in the literature as wh-questions;
and wh- is sometimes treated, by linguists, as the orthographic form of
an interrogative morpheme which, when it is combined with indefinite
pronouns or adverbs, has the effect of converting them into interrogative
elements whose characteristic function it is to be used in x-questions
(cf. Katz & Postal, 1964; Katz, 1972: 204ff).
Not only x-questions, but also yes—no questions, can be treated as
functions which contain a variable (or “unknown quantity’’, to use
Jespersen’s phrase). When we ask a question of our addressee, what we
are doing, in effect, is inviting him to supply a value for this variable. A
yes—no question, like Js the door open?, contains a two-valued variable. It
is equivalent to the bipartite disjunctive question Js the door open or not? ;
and it can be appropriately answered with either Yes (which implies the
proposition expressed in the statement The door is open) or No (which
implies the proposition expressed by The door is not open). A factual
yes—no question presupposes* (in one of the senses of this term: cf.
14.3) the disjunction of two propositions, each of which has associated
with it an it-is-so tropic. Similarly, a deliberative yes—no question (e.g.,
Shall I get up?) presupposes the disjunction of a corresponding positive
or negative proposition associated with a so-be-it tropic.
An x-question is a many-valued function, which presupposes the
earlier for the two ways of describing situations (cf. 15.4). The term ‘mode’,
like ‘mood’, derives from the Latin ‘modus’, which being a word of very
general meaning (‘‘manner’’, ‘‘way’’, etc.) acquired several distinct technical
uses.
758 Mood and tllocutionary force
disjunction of a set of propositions (positive or negative according to the
form of the question), each member of the set differing from the others
in that it supplies a different value for the variable. For example,
(5) Who left the door open?
presupposes the disjunction of the set of propositions expressed by the
statements that could be made by uttering
(6a) John left the door open
(6b) That little boy left the door open
(6c) Uncle Harry left the door open
etc.
More particularly, (5) presupposes the proposition expressed by
(7) Someone left the door open;
and the indefinite pronoun ‘someone’ (in its non-specific interpretation:
cf. 7.2) can also be thought of as a variable whose range of possible
values depends upon the universe-of-discourse. If the addressee re-
sponds to (5), uttered as a question (whether it is asked of him or merely
posed), by making the statement
(8) No-one left the door open,
he is denying (7) and thereby refusing to accept one of the presupposi-
tions of (5); he is not answering the question, but rejecting it. If, on the
other hand, he replies to (5) by uttering (7) — that is to say, by asserting
what (5) presupposes — he is evading, rather than answering, the ques-
tion (cf. Katz, 1972: 213).
We will not discuss the grammatical structure of interrogative sen-
tences in detail. One point should be made, however, in connexion with
the grammatical relationship between (5) and (7). In most of the Indo-
European languages, the forms of the interrogative pronouns and
adverbs are related etymologically to indefinite pronouns and adverbs:
cf. English who, whom, what, etc.; French quz, que, quand, etc.; Russian
kto, cto, etc.; Greek tis/tis, pote/pote, etc.; Latin quis, quando, etc. In
many languages, including English, a set of indefinite pronouns and
adverbs has been created by affixing an adjectival modifier meaning
‘“‘some’’ to the original common pronominal or adverbial element, or to
some replacement of it: cf. English someone, something; French
quelqu’un, quelque chose (where the numeral meaning “‘one”’ and a noun
meaning ‘“‘thing’’ replace the original pronominal element); Russian
16.3. Questions 759
kto-to “‘someone (specific)”’, kto-nibudj “someone (non-specific)”’ ;
Latin, guidam “‘someone (specific)”’, aliquis ‘someone (non-specific) ”’;
English somewhere, French quelque part (where the noun ‘part’ replaces
the original adverbial element). In Classical Greek the same forms are
used as both interrogative and indefinite pronouns (tus, tz, tina, etc.); and
the difference between them is one of accentuation (the indefinite pro-
noun normally being unaccented) and their position of occurrence in the
sentence.
These various morphological relationships clearly depend upon the
grammaticalization of some semantic property which is common to what
may be regarded as corresponding statements and questions. We have
seen that the question (5) presupposes the proposition expressed by the
statement (7), with ‘someone’ taken in its non-specific interpretation.
But consider now the effect of questioning the proposition expressed by
(7), not by means of (9)
(9) Did anyone leave the door open?
(where ‘anyone’ may be regarded as a grammatically determined variant
of non-specific ‘someone’), but by means of the prosodic (and para-
linguistic) modulation of (7). This prosodic (and paralinguistic) modula-
tion we will symbolize with a question-mark:
(10) Someone left the door open?
This utterance (which might be naturally used in English to express
doubt, surprise, etc.), if it is taken as a question, might be appropriately
answered with Yes or No. But it is easy to see that it might also be con-
strued as an x-question, presupposing the proposition expressed in (7)
and expecting the addressee to respond by supplying a value for
‘someone’.
Given that this is so, it is also easy to see that there are various ways
in which languages might systematically distinguish between yes—no
questions containing an indefinite pronoun (or adverb), x-questions and
indefinite statements, like (7), without necessarily grammaticalizing the
difference between all three classes of utterances, or any two of them, in
the verbal component. Suppose, for example, that we were to associate a
falling-intonation with statements and a rising-intonation with ques-
tions, and that we were to assign heavy stress to the indefinite pronoun
(or adverb) in x-questions. This of itself would be sufficient to maintain
the distinction between the three classes of utterances. Needless to say,
the relationship between statements and the two kinds of questions 1s
760 Mood and illocutionary force
rarely, if ever, systematically made solely in the non-verbal component
of languages. But it is important to realize that it need not be gram-
maticalized in terms of a structural difference between declarative and
interrogative sentences; and furthermore that, if the distinction between
declarative and interrogative sentences is grammaticalized, particular
languages might well employ prosodically (and paralinguistically) modu-
lated declarative sentences, containing indefinite pronouns or adverbs,
for x-questions, reserving interrogative sentences for yes—no questions;
or alternatively that they might employ non-verbally modulated declara-
tive sentences for yes—no questions and use interrogative sentences
containing special pronominal or adverbial forms for x-questions. In
other words, the distinction between yes—no questions and x-questions
is a logical, or semantic, distinction that is universal, in the sense that it
can be drawn independently of the grammatical and lexical structure of
particular languages; but the difference between two kinds of interroga-
tive sentences, and even the difference between interrogative and
declarative sentences, is not.
The morphological relationship between interrogative and relative
pronouns (and adverbs) that holds in most Indo-European languages
including English is also worth commenting upon briefly in connexion
with x-questions. The forms who, when and which (cf. also German
‘welcher’, Latin ‘qui’, etc.), which are diachronically related to, if not
identical with, the interrogative/indefinite pronouns of the earlier Indo-
European languages, are found in both restrictive* and non-restrictive*
relative clauses in most dialects of modern English: cf.
(11) That man, who broke the bank at Monte Carlo, is a mathemati-
cian
(12) The man who/that broke the bank at Monte Carlo 1s a mathema-
tician.
Non-restrictive relative clauses, like (11), which are set off by commas
in written English and are at least potentially distinguishable by rhythm
and intonation in the spoken language, do not concern us here.!® Nor do
(14) The someone broke the bank at Monte Carlo man is a mathema-
tician,
16.4. Negation ,
So far we have treated the assertion of a negative proposition (‘‘it is the
case that not—p’’) as equivalent to the denial of the corresponding
positive proposition (‘it 1s not the case that p’’). Both of these are
symbolized in the propositional calculus as ~p (cf. 6.2). As soon as we
start considering propositions containing a modal operator of possibility,
however, it becomes clear that a distinction needs to be drawn between
21 Strictly speaking, one should draw a distinction between wanting the door to
be open and wanting ‘‘The door is open”’ to be true (and also, though less
obviously, between causing the door to be open and causing ‘‘The door is
open” to be true). But the distinction is not relevant to the argument.
16.4. Negation 769
the negation of the modal operator and the negation of a simple propo-
sition within the scope of the modal operator (cf. 6.3): e.g., ~nec p (‘it
is not necessary that p’’) vs. nec ~p (“it is necessary that not—p’’).
There is also a clear difference of meaning in utterances which result
from the negation of a performative verb and the negation of its comple-
ment:
(1) Lf don’t promise to assassinate the Prime Minister
(2) I promise not to assassinate the Prime Minister.
It is only (2) that can be said in the performance of the illocutionary act
of promising; and in this case it would be a promise to refrain from
doing something. Utterance (1), on the other hand, might be a statement
with which the speaker explicitly refuses to make, or denies that he is
making, a promise. Assertion, like promising, is an illocutionary act; and,
as (1) differs from (2), so the statement
(3) The door zs not open
differs from the statement
(4) I do not say that the door is open.
There is no way of representing this difference in the propositional
calculus, which does not allow for the negation of the assertion-sign.
In modal logic, the difference between the negation of the modal
operator and the negation of the proposition within the scope of the
modal operator is commonly referred to in terms of a difference between
external and internal negation; and the difference between (1) and (2),
or between (3) and (4), has been described in the same terms (cf. Hare,
1971: 82). It is possible, of course, to negate both the performative verb
and the complement, as in
(5) I do not promise not to assassinate the Prime Minister
(6) I do not say that the door 1s not open.
And, just as ~nec ~? is not equivalent to p (but to poss p) so (5) and
(6) are not equivalent to I promise to assassinate the Prime Minister and
The door is open. In other words, if one negative is external and the
other internal, two negatives do not make a positive. The relationship
_ between I do not say that the door 1s not open and The door is open is
therefore different from the relationship that holds in the propositional
calculus between ~ ~p and p.
This much about negation is relatively uncontroversial. But it is
770 Mood and illocutionary force
arguable that there are at least two kinds of external negation; and that
the difference between them can be accounted for by assigning one to
the neustic and the other to the tropic. It is negation of the neustic (the
I-say-so component) that is exemplified by (1) and (4). We will refer to
this kind of negation as performative negation*. By negating the neustic
we express our refusal or inability to perform the illocutionary act of
assertion, promising, or whatever it might be. But to do this is in itself
to perform an illocutionary act: an act of non-commitment. Acts of
non-commitment are to be distinguished, on the one hand, from saying
nothing and, on the other, from making descriptive statements. Con- ,
sider, for example, the circumstances under which it might be appro-
priate to utter a declarative sentence like
(7) I don’t say that John is a fool.
If someone has asserted or implied that X goes around saying that John
is a fool, X can deny that this is so by uttering (7) as a descriptive
statement; and in this case don’t will bear heavy stress, as it normally
does (as we shall see presently) in denials*. X’s utterance-act might then
be reported by means of another descriptive statement such as
(8) X said that he didn’t/doesn’t say that Fohn is a fool.
But X might also utter (7), without stressing don’t, not to deny that he
goes around making a particular assertion, but to express, or indicate,
his refusal, on this very occasion, to put his signature, as it were, to
“IT say so’’; and this is a positive act, which might be reported by
(9) X wouldn’t/couldn’t say that/whether Fohn was a fool,
rather than by (8) or
(10) X didn’t say that Fohn was a fool.
The theory of speech-acts, as it has been developed so far, does not seem
to allow for acts of non-commitment. They are nonetheless of frequent
occurrence in the everyday use of language; and their perlocutionary
effect is characteristically different from that of statements. If we express
our refusal to assert that p is so, by means of an act of non-commitment,
we will often create in the mind of the addressee the belief, which he did
not previously hold, that p may in fact be true;?* especially if the situa-
tion is such that we might be expected to assert that not—p 1s so.
22 We might even be held to invite the inference that p is true: cf. Zwicky &
Geis (1971) for the cases of what they call invited inference.
16.4. Negation 771
When we negate the tropic (the it-is-so component) of a statement,
we are still making a statement; but it is a different kind of statement
from that which is made by negating the phrastic. We are denying* that
something is so: we are rejecting a proposition (either positive or nega-
tive) that we might be expected to accept as true. One reason for
believing that we are expected to accept that p holds might be that our
interlocutor has just explicitly presented it to us in a statement or a non-
open question: but this is not the only reason. The situation may be
such, or the proposition itself of such a nature (a generally accepted
truism), that there need be no previous assertion to which our denial is
linked. Nevertheless, the proposition that is accepted or rejected will
always be, in some sense, in the context. It seems reasonable therefore
to draw a distinction between context-bound* and context-free* state-
ments, and to account for the difference between the denial of p and the
simple assertion of not—p in terms of this distinction. We will say that
to reject p by means of a context-bound statement is to deny* that p
holds and to accept p by means of a context-bound utterance is to con-
firm* that p holds. In either case p may be positive or negative. Denials*
and confirmations* are thus the two major subclasses of context-bound
statements.
Just as a negative sentence may be uttered to deny a positive propo-
sition, SO a positive sentence may be uttered to deny a negative proposi-
tion. But this does not invalidate the distinction between the negation of
the tropic and the negation of the phrastic that we have just drawn. If
(3) is uttered as a denial, it will bear heavy stress on the negative particle
(and, if not is contracted with the verb, upon isn’t). If
(11) The door 1s open
is uttered as a denial of the proposition expressed by the context-free
assertion of (3) it will have a heavy stress on the form zs. So too if it is
uttered to confirm, rather than simply to assert, that the door is open.
The distinction between context-free and context-bound statements is
systematically maintained, in English, by stress; it is not generally
represented in the verbal component of utterances.
The distinction between negation of the phrastic, which we will
henceforth refer to as propositional negation*, and negation of the tropic,
which we will call modal negation*, is something that must be taken
account of in the treatment of presupposition. According to what is
probably the most generally accepted criterion for at least one class of
presuppositions (cf. 14.3), one proposition, p, is said to presuppose
172 Mood and tllocutionary force
another proposition q, if q is entailed, or strictly implied, by both p and
its negation: 1.e. if both p entails g and ~~? entails g, then p presupposes
q (cf. Keenan, 1971). Granted that the context-free assertion of a propo-
sition, p, or its propositional negation, ~p, commits the speaker to a
belief in the truth of any proposition, qg, that is presupposed by , this
does not hold for the denial of p. If someone were to assert that the
present King of France is bald, we could quite reasonably deny this by
saying:
(12) The present King of France is not bald: there is no King of France.
Similarly, though it would be irrational for someone to utter
(13) L don’t know that the earth ts round
as a context-free assertion (‘know’ being a factive* verb which normally
presupposes the truth of the proposition expressed by its complement:
cf. 17.2), there is nothing wrong with the utterance of (13) as a denial.*8
It might further be argued that there are two kinds of propositional
negation: one of which converts the proposition into its contradictory*
and the other into its contrary*. The logical distinction between contra-
dictories and contraries has already been referred to, in a previous
chapter, in connexion with the difference between gradable and un-
gradable opposites (9.1). An utterance like
(14) I don’t like modern music
is more likely to be interpreted as expressing a proposition that is the
contrary, rather than the contradictory, of
(15) “I like modern music’’.
But both interpretations are possible. Since the conversion of a propo-
sition into its contrary seems to depend upon the negative being more
closely associated with the predicate, than with the subject—predicate
23 As was noted earlier (14.5), tokens of (13) may also be interpreted as meaning
‘‘I am not sure that the earth is round”’ or even “‘I am inclined to believe
that the earth is not round”’. In such cases don’t will not normally bear heavy
stress (though ts may if the utterance is context-bound). It is suggested later
in this section, in the discussion of (29), that examples like (13), as well as
examples containing such verbs of propositional attitude as ‘think’ and
‘believe’ (which are normally handled in terms of the notion of transferred
negation), are perhaps best analysed as containing performative negation.
There is a close parallel between the negation of a performative verb as in (1)
or (4) and the negation of a parenthetical* verb (cf. 16.1); and the same general
notion of positive non-commitment seems to be relevant to both.
16.4. Negation 773
link, or nexus* (to use Jespersen’s term), within the proposition, we
might appropriately distinguish the two kinds of propositional negation
as predicative negation* and nexus negation*. Both types of negation are
found in
(16) I don’t not like modern music,
which is equivalent to
(17) I don’t dislike modern music.
It seems to be the case that the application of propositional negation to a
gradable expression (e.g., ‘like’) will always tend to produce a contrary,
rather than a contradictory, whether the language-system lexicalizes the
contrary (e.g., ‘dislike’) or not.
If we recognize two kinds of propositional negation, as suggested in
the previous paragraph, we are going against the traditional logical
maxim that negation of the predicate is equivalent to negation of the
proposition. Furthermore, it might be maintained that utterances like
(16) and (17) would rarely, if ever, occur as context-free assertions; and,
if this is so, there is no need to distinguish nexus negation from modal
negation. Four kinds of negation may well be too many, but it does seem
clear that at least three kinds are required in order to handle utterances
with an it-is-so tropic (1.e. statements and factual questions).
One may now ask whether the same three kinds of negation are to be
found in utterances with a so-be-it tropic. There is of course a clear
difference between explicitly refusing to tell someone to do something
and telling him not to do something. We therefore need to allow for at
least two kinds of negation: performative negation and either modal or
propositional negation. One of the points that we shall have to deal with
in our discussion of deontic modality* is how acts of non-commitment
with a so-be-it tropic are related to permissions* and exemptions* (cf.
17.4). For the present, however, the following will serve as an analysis
of the meaning of utterances in which the speaker explicitly refuses to
commit himself to the imposition of an obligation: “‘I do not say so — so
be it — (that) p”’.
The main problem that concerns us here, however, is to decide
whether prohibitions, like
(18) Don’t open the door,
contain a negated tropic or a negated phrastic. And this is a tricky
problem. Does (18), construed as a prohibition, mean “I (hereby) im-
pose upon you the obligation not to make it so that p holds”’ or “I
774 Mood and tllocutionary force
(hereby) impose upon you the obligation to make it so that not—p
holds’’? As we have already seen, prohibitions are not normally intended
or taken as instructions to carry out, but to refrain from carrying out,
some course of action. The speaker does not want the addressee to bring
about a state-of-affairs of which not—p is true; for this state-of-affairs
already exists. The reason why he issues his prohibition is that he thinks
that, in default of the prohibition, the addressee will, or may, bring
about a state-of-affairs of which p (the contradictory of not—p) would be
true. It seems preferable, therefore, to treat prohibitions as having a
negated tropic: i.e. as resulting from modal negation. This analysis, it
should be noted, groups prohibitions with denials, rather than with
context-free negative assertions. It also suggests that corresponding
positive and negative jussive sentences (e.g., ‘Open the door’ and
‘Don’t open the door’), in their most characteristic use, will be con-
strued as contradictories, rather than contraries; and, in general, this
seems to be correct.
At the same time, it must be recognized that negative jussive sen-
tences, used to issue prohibitions, are sometimes intended and taken as
directives to bring it about that not—p holds. This is especially clear
when not—p? is the contrary, rather than the contradictory, of p. For
example,
(19) Don’t trust him
can be understood as being the contrary of
(20) Trust him,
equivalent to
(21) Distrust him.
If the difference between contradictory and contrary statements 1s
associated with the distinction between modal and propositional nega-
tion, we should perhaps allow for propositional negation in directives as
well as statements. Generally speaking, however, prohibitions will
involve modal negation; and they are to be analysed as ‘I say so — let it
not be so — (that) p”’, rather than as “‘I say so — so be it — (that) not—p”’.
Two further points may now be mentioned, each of which has been
the subject of considerable discussion recently. The first is the inter-
action between negation and information-focus* (cf. 12.7). When con-
trastive stress occurs elsewhere than at the end of the clause in English,
this is an indication that the clause has marked, rather than neutral,
16.4. Negation 775
information-focus; and, if this happens in a negative sentence, the
sentence will tend to be interpreted (on particular occasions of its
utterance) as carrying rather specific presuppositions (cf. Quirk e¢ al.,
1972: 382; Jackendoff, 1972: 252). For example, the utterance Mary
didn’t come (with contrastive stress on Mary) is comparable with Jt was
not Mary (but someone else) that came: but the two utterances are not
equivalent, since it is also possible to say J don’t know whether anyone
came (but) Mary didn’t come (i.e. it is possible to cancel what is normally
taken to be a presupposition of Mary didn’t come, the proposition
‘Someone came”’: cf. 14.3). It is obvious that stress, and also intona-
tion, must be taken into account in any comprehensive discussion of
negation; and, when they are taken into account, further complexities
become apparent. The question of information-focus is clearly of some
relevance to the notion of modal negation; but the connexion between
them is, in certain respects, obscure.
The second point has to do with what has been called transferred
negation* (cf. Quirk et al., 1972: 789). The term suggests (as do
‘negative transportation’ or ‘negative raising’ — which are the terms
most commonly used in Chomskyan transformational grammar) that the
phenomenon to which it applies involves some kind of displacement of
the negative operator. For example,
(22) L didn’t think he would do it
is rightly said to have two interpretations: roughly, (i) “It is not the
case that I thought he would do it”’ and (ii) ‘‘I thought that he would
not do it”. The former of these is unusual, however, in colloquial
English and almost inevitably requires heavy stress on didn’t: and this
fact suggests that it is a modal negation that is involved. Under the
second interpretation, which with normal stress and intonation is by far
the more common, (22) is generally said to be equivalent to
(23) I thought (that) he would not do it
and to be derivable, in a transformational grammar, from the same
underlying structure, the negative element having been transferred from
the subordinate to the main clause.
Whether (22) and (23) are equivalent is a moot point. It is important
to notice, in this connexion, that the construction in question (which is
to be found in many languages) is especially common with verbs
denoting belief and assumption; and furthermore that it is far more
common with first-person subjects than it is with third-person and
and |
776 , Mood and illocutionary force
second-person subjects. What is interesting about (22) is that, although
it has a first-person subject, it is in the past tense. In this respect it
stands mid-way between
(24) L don’t think he'll do it
1'The term ‘predicate’ is here being used in the sense in which it 1s custo-
marily employed by logicians (cf. 6.2, 12.2).
17.1. Necessity and possibility 789
not but be true (i.e. it is true in all possible worlds) by virtue of the
sense of ‘bachelor’ and ‘married’. In which case, we can say that the
truth of (8), and therefore of (6), is guaranteed by the truth of
(9) nec ((x) B(x) > ~ M(z)).
The difference between (8) and (9) is that (8) states it as being simply a
matter of fact that all bachelors are unmarried, whereas (g) makes it a
matter of logical necessity.
Either (8) or (9), it should be emphasized, is sufficiently strong to
validate (6); and we need not be concerned here with the problem of
deciding whether the relationship between ‘bachelor’ and ‘married’
that is built into the lexical structure of English is such that we would be
justified in treating the proposition expressed by the sentence ‘All
bachelors are unmarried’ as analytic. It is to avoid, or circumvent, such
problems that Carnap (1956) casts all his meaning-postulates* in the
form of non-modal universally quantified propositions like (8).
What we want to explicate here is the meaning of ‘must’ in sentences
like
2 The so-called root modals are treated by Ross (1969a), G. Lakoff (1972) and
others who have adopted the auxiliaries-as-main-verbs analysis as being
trivalent rather than bivalent (i.e. as being di-transitive, rather than transi-
tive: cf. 12.4) in the case of deontic utterances that are given a performative,
rather than constative, interpretation. On this and related points, cf. Huddles-
ton (1974).
17.2. Epistemic modality and factivity 793
deontic modality (cf. 17.4). Not only is (16) a deontic interpretation of
‘Alfred must be unmarried’. So too is
(18) “I (hereby) oblige Alfred to be unmarried”’,
which is performative, rather than constative, in the original sense of
‘performative’ (cf. 16.1). The fact that both epistemic and deontic
modality can be interpreted either subjectively or objectively means that
Kurylowicz’s account of the distinction between epistemic and deontic
modality cannot be correct, as it stands. Nor can the proposed analysis
in terms of the difference between an intransitive (epistemic) and a
transitive (deontic) use of modal verbs. Why, then, has it seemed
plausible to so many linguists to think of epistemic modality as being
more subjective than deontic modality? This is a question that requires
an answer.
(1) Kx(p),
where Kx is the operator of epistemic necessity; and that (1) is related
to p in terms of epistemic necessity as nec (p) 1s related to p in terms of
alethic necessity. If Kx(p) is true, then p must be true, in some sense of
‘must’ that it is the task of epistemic logic to explicate.
Many philosophers would say that the relationship between Kx(p)
and p is one of presupposition* and not simply implication, since the
truth of p is a necessary condition, not only of the truth of “X knows
that p’’, but also of the truth of its negation, “X does not know that
p”. In saying He does not know that Edinburgh 1s the capital of Scotland
(provided that it is intended as a context-free statement, rather than a
denial: cf. 16.4), the speaker 1s committed to the truth of the proposition
‘Edinburgh is the capital of Scotland’’, as he is committed to the truth
of the same proposition by saying (as an assertion) He knows that
Edinburgh is the capital of Scotland.®? Any predicator (verb, adjective,
etc.), which behaves like ‘know’ in this respect can be said to have the
property of factivity*, or to be a factive* predicator. These terms were
introduced into linguistics by Kiparsky & Kiparsky (1971), who pointed
out that there is a wide range of factive predicators in English in addi-
married), |
as a fact by the addressee (J agree; Yes, I know, etc.); it can be hypothe-
sized in a real conditional statement; and it can be referred to by the
complement of a factive predicator (1 knew that Alfred must be un-
doubt. }
the proposition expressed in his utterance; and both of them may
well originate, ontogenetically, in the same psychological state of
(43) . poss p
which may be read as (42) “‘Possibly/perhaps, it is the case that p”’ and
(43) “I say that it is possibly the case that p”’. Each of these operators
can be negated; so can the propositional variable and either the un- -
qualified neustic or the unqualified phrastic.
Under this system of representation, possibility is treated as a two-
valued variable, which in one position alternates with interrogation as a
qualifier of the I-say-so component and in the other position operates as
a qualifier of the it-is-so component.
The system also allows for utterances with both subjective and objec-
tive modality:
(44) poss poss p
which may be read as “‘Possibly/perhaps, it is possibly the case that p”’
and is exemplified by one interpretation of
(45) Perhaps it may be raining.
It therefore allows for a total of twenty-four notationally distinct and
semantically non-equivalent modal formulae (in addition to eight non-
modal formulae) each containing the same propositional variable; and
each of these can in fact be realized in English, as the reader can verify
for himself by constructing appropriate situations-of-utterance. This
number can be increased if possibility in either or both positions is
treated as a scale along which a greater or less number of degrees are
recognized; and English provides the means for indefinitely finer and
finer gradations of subjective and objective epistemic modality. We can
not only say, as has been noted, I¢ 1s probably raining, but also It is very
probably raining, It ts almost certainly raining, and so on.
Of the twenty-four formulae that can be constructed in terms of
possibility and negation within a framework which distinguishes the
neustic from the tropic, only four find their equivalents in standard
possibility-based systems of modal logic:
(46) . poss p
(47) . ~ poss p
(48) . poss ~ p
(49) . ~ poss ~p
17.2. Epistemic modality and factivity 805
The reason is that in standard systems of non-epistemic and non-deontic
modal logic all statements are construed, in our terms, as having pre-
fixed to them an unqualified I-say-so component. Subjective epistemic
modality, as was mentioned in the previous section, is something that
logicians have not been concerned with. It is of such importance,
however, in the ordinary use of language that it should be explicitly
recognized in any formalization of linguistic semantics and distin-
guished, in principle, from objective modality.
At the same time, the relationship between subjective and objective
epistemic modality is such that it seems appropriate to handle both in
terms of the same modal operator, assigning to it two places of occur-
rence. Its function in the neustic position is to express different degrees
of commitment to factuality; and in this respect it qualifies the illocu-
tionary act in much the same way that a performative verb parenthetic-
ally qualifies, or modulates, the utterance of which it is a constituent in
an explicitly performative utterance or a primary performative with a
performative clause tagged on to it. Looked at from this point of view,
It may be raining (construed as a subjectively modalized utterance)
stands in the same relationship to It’s raining, I think or I think it’s
raining, as Is it raining? does to Is it raining, I wonder? or I wonder
whether it’s raining and It’s raining to It’s raining, I tell you or I tell
you it’s raining. Relationships of this kind are quite naturally expressed
within a system which treats interrogation and subjective modality as
qualifications of the I-say-so, or performative, component of the utter-
ance.
As epistemic possibility seems to be more basic, in English at least,
than epistemic necessity, so subjective epistemic modality seems to be
more basic than objective epistemic modality. This is the most natural
conclusion to be drawn from the fact, mentioned in the previous section,
that few linguists have even considered the possibility that epistemic
modality could be anything other than a matter of the speaker’s attitude
towards the propositional content of his utterance; and most discussions
of mood and modality in linguistics seem to take it for granted that
epistemic modality is subjective, in this sense. Hence the common, but
strictly speaking false, statement that the modal verbs cannot occur with
epistemic function in conditional sentences in English. Provided that an
utterance like
(50) If it may be raining, you should take your umbrella
is taken to express objective, rather than subjective, epistemic modality,
806 Modality
it is interpretable and fully acceptable. What is excluded from condi-
tional clauses is the expression of subjective epistemic modality; and
this is for the obvious reason that everything that comes within the scope
of the conditional operator (“if p, then q’’) must be part of the propo-
sitional content.
Utterances like (50) are undoubtedly rare in English; and the reason
is that it is much more natural to use modal verbs for subjective, than
for objective, epistemic modality. Far more natural than (50) is :
(51) If tt ts possible that it will rain, you should take your umbrella;
or, better still,
(52) If there ts a possibility of rain you should take your umbrella.
The constructions used in (51) and (52) are more appropriate than is the
construction used in (50) for the expression of objective epistemic
modality; and it is consistent with their status that, unlike the construc-
tion used in (50), they can contain a tense other than the simple present
and they can be recursively combined without limit. The objectification*
of epistemic modality is a precondition of our being able to talk about
past or future possibilities (cf. Yesterday there was some possibility that it
would rain today) and of there being one epistemic modality within the
formulae ,
scope of another (cf. It is possible that it is necessary that .. .).
What might be proposed, therefore, is that of the following two
(53) poss. p
and
(54) . poss p
(53) is more basic than (54), as far as the everyday use of language is
concerned; and that (54) can be thought of as being derived from (53)
by a process of objectification.® Indeed, we can go one step further, from
(57) . . (~poss p) :
(56) . . (poss p)
(58) .. (poss ~ p)
(59) .-(~poss ~ p)
The difference between .possp and ..(possp) is the difference
between “I say so —it is possibly so—that p” and “‘I say so — it is
so — that possibly p’’.
In most dialects of English not more than one modal verb can occur
within the same clause. But both a modal verb and a modal adverb may
be combined. When this happens a distinction is to be drawn between
modally harmonic* and modally non-harmonic* combinations. For
example, ‘possibly’ and ‘may’, if each is being used epistemically, are ,
harmonic, in that they both express the same degree of modality,
whereas ‘certainly’ and ‘may’ are, in this sense, modally non-harmonic.
It has been pointed out by Halliday (1970a: 331) that the adverb and
the modal verb may, and normally do, ‘reinforce each other’ in a
modally harmonic combination; so that, in an utterance like
(60) He may possibly have forgotten
which, under the interpretation that concerns us here, differs but little
from either
(61) He may have forgotten
or
(62) He has possibly forgotten,
808 Modality
there is a kind of concord running through the clause, which results in
the double realization of a single modality. In contrast with (60), how-
ever,
(63) He may certainly have forgotten
and
(64) Certainly he may have forgotten
cannot be construed as having but a single modality running right
through the clause. Since the adverb and the modal verb are non-
harmonic they cannot but be independent; and one must be within the
scope of the other.
As far as (64) is concerned, ‘may’ is obviously within the scope of
‘certainly’, since (64) means “‘It is certainly the case that he may have
forgotten’’, rather than ‘‘It may be the case that he has certainly for-
gotten’’. The interpretation of (63) is rather less clear cut. What is
clear, however, is that, whether it is interpreted as “It is certainly the
case that he may have forgotten”’ or as “It may be the case that he has
certainly forgotten’’, no more than one of the two modal expressions can
express subjective epistemic modality (though they may both express
objective epistemic modality) and it is the one which expresses subjec-
tive epistemic modality that has the wider scope. These two principles
are very important and we shall draw upon them later: (i) subjective
modality always has wider scope than objective modality; (ii) no simple
utterance may contain more than a single subjective epistemic modality
(though this single modality may be expressed, as in (60), in two or more
places). Both of these principles are explicable in terms of the fact that
subjective epistemic modality has to do with the qualification of the
performative component of the utterance.
We shall return to the question of epistemic modality in various
places in later sections of this chapter. But there will not be space to
deal with it fully. One point should be borne in mind throughout.
Although it might appear that a statement is strengthened by putting
the proposition that it expresses within the scope of the operator of
epistemic necessity, this 1s not so, as far as the everyday use of language
is concerned. It would be generally agreed that the speaker is more
strongly committed to the factuality of “It be raining” by saying Jt zs
raining than he is by saying It must be raining. It is a general principle,
to which we are expected to conform, that we should always make the
strongest commitment for which we have epistemic warrant*. If there is
17.3. Tense as a modality 809
no explicit mention of the source of our information and no explicit
qualification of our commitment to its factuality, it will be assumed that
we have full epistemic warrant for what we say. But the very fact of
introducing ‘must’, ‘necessarily’, ‘certainly’, etc., into the utterance
has the effect of making our commitment to the factuality of the propo-
sition explicitly dependent upon our perhaps limited knowledge. There
is no epistemically stronger statement than a categorical assertion, sym-
bolized here as .. p.
7 Palmer (1974: 38) says of sentences like ‘Yesterday John was coming to-
morrow’: ‘‘ The past tense . . . is epistemic — there was a time at which the
statements that John is coming tomorrow or John comes tomorrow were
valid’’. This notion of validity is captured in the analysis presented here by
the relativization of factuality to the intensional world indexed by wi.
17.3. Tense as a modality 815
events, or states-of-affairs, has been philosophically controversial (cf.
Prior, 1955: 240); and many philosophers would deny that we can make
statements about the future at all, on the grounds that we cannot have
knowledge, but only beliefs, about future world-states. What purports to
be a statement describing a future event or state-of-affairs is therefore,
of necessity, a subjectively modalized utterance: a prediction rather than
a statement. Looked at from this point of view, the difference between
It will be raining (tomorrow) and It may be raining (tomorrow) depends
upon the speaker’s subjective evaluation of the probability of ‘‘it be
raining’”’ being true of w;. But we have already seen that It may be raining
(tomorrow), like It may be raining (now), can in principle be interpreted
as being either subjectively or objectively modalized; and the same is
true of It will be raining (tomorrow). The speaker can treat the future as
known, as a fact that belongs to w; (= wo), whether he is epistemologic-
ally justified in doing so or not. He can say, without doing violence to the
structure of English, I know that it will rain tomorrow (or I know that it
is going to rain tomorrow); and he can also embed a future-tense clause as
the complement of the factive verb ‘know’ in attributing knowledge of
the future to another, as in He knows that it will rain tomorrow. As we
have seen, the use of a factive verb in a context-free assertion commits the
speaker to the truth of the proposition expressed by the complement
clause (cf. 17.2). We must therefore allow for the possibility of seman-
tically well-formed descriptive statements with the logical structure
(14) (to) (te) (tp)|to = ts <
Utterances of this kind have the illocutionary force of acts of telling (cf.
He told me that it would be raining); and they are to be distinguished, in
principle, from predictions, wishes, promises, etc., all of which are
modalized in one way or another.
It is nonetheless a linguistically important fact that, unlike the
prophets referred to by Augustine, we are seldom in a position to lay
claim to knowledge of the future; and it is no doubt for this reason that
reference to future world-states is grammaticalized in the category of
mood, rather than tense, in many languages. The so-called future tenses
of the Indo-European languages, as we have already seen, are of secon-
dary development (cf. 15.4). There was no future tense in Proto-Indo-
European. But there was probably a fairly rich system of moods,
including, in addition to the indicative and the imperative, a subjunctive, .
an optative, and a desiderative. The desiderative*, as the term implies,
was used for the expression of wants, needs and desire. The optative*,
816 Modality
however, which was preserved as a formally distinct mood in Greek and
Sanskrit was never, as far as we know, restricted to wishes (although the
term ‘optative’ might suggest that it was). It was the mood of contra-
factivity and remote possibility, as the subjunctive was the mood of
non-factivity and less remote possibility and the indicative the mood of
factivity and straightforward assertion. What appears to be the same
triple distinction of contra-factivity, non-factivity and factivity is gram-
maticalized in certain American Indian languages (e.g., Hopi) which,
unlike the Indo-European languages, lack the category of tense entirely
(cf. Whorf, 1950); and it has often been suggested that such languages,
which grammaticalize modal, rather than either deictic or non-deictic
temporal, distinctions are in some sense more subjective than languages
that grammaticalize tense. Interesting though this question is, we will
not pursue it here. Our main purpose is to emphasize the connexion
between subjective epistemic modality and reference to the future.
That there is a connexion is perhaps obvious enough, independently
of any arguments that might be based upon the grammatical structure of
particular languages. But there is, in fact, ample linguistic evidence.
What is conventionally regarded as the future tense (in languages that
are said to have a future tense) is rarely, if ever, used solely for making
statements or predictions, or posing and asking factual questions, about
the future. It is also used in a wider or narrower range of non-factive
utterances, involving supposition, inference, wish, intention and desire.
Furthermore, the future vs. non-future distinction is frequently neutral-
ized in subordinate or negative clauses, in participles and nominaliza-
tions, in association with all moods other than the indicative, and in
various other constructions: this fact would suggest that the opposition
of the future to the present is less central in the structure of the lan-
guages than is the opposition of the past to the present (which Is, in any
case, an opposition that is more widely grammaticalized throughout the
languages of the world). So too would the fact that the so-called future
tense is in many languages constructed according to a different pattern
of formation than is the past or the present tense.
There is also a good deal of diachronic evidence to support the view
that reference to the future, unlike reference to the past, is as much a
matter of modality as it is of purely temporal reference. ‘Throughout the
history of the Indo-European languages what are traditionally described
as future tenses have invariably been created, independently in different
languages, from word-forms or phrases that were originally used to
express, not futurity as such, but various kinds of non-factivity. We
17.3. Tense as a modality 817
cannot go into the details of the historical development of the forms that
came to be used for reference to the future in different languages. It is
sufficient to point out that the most common sources of the future
tenses in the Indo-European languages are, on the one hand, the sub-
junctive, which we have classified as the mood of generalized non-
factivity, and, on the other, word-forms or phrases expressing the more
specific non-factive notions of intention and desire. Future tenses out-
side the Indo-European languages, in so far as their historical develop-
ments can be traced, commonly appear to have had a similar origin. The
fact that the so-called future tense in English is formed by means of an
auxiliary verb, ‘will’, which originally expressed intention, is by no
means an isolated historical accident. Nor is the fact that English sen-
tences containing the so-called future tense may be used to express
various kinds of subjective epistemic modality including inference,
supposition and prediction. There is a demonstrable historical con-
nexion between reference to the future and non-factivity in too many
languages for it to be regarded as a matter of accident that languages
should rarely, if ever, distinguish systematically between statements of
fact about the future and subjectively modalized predictions.
There is also ample synchronic and diachronic evidence for the con-
nexion between prediction and deontic* modality, which we shall be
discussing in a later section. It was pointed out earlier (16.2) that there
is a sense in which directives necessarily refer to the future; and also
that in many languages the functions of the subjunctive merge with those
of the imperative. The relationship between the subjunctive and the
imperative is traditionally accounted for in terms of the notion of will.
But there is no formal difference in the Indo-European languages
between the two kinds of subjunctive that are traditionally distinguished
by grammarians as the subjunctive of will and the subjunctive of likeli-
hood (or possibility); and it is arguable that there is no difference of
meaning between them either.
In many languages throughout the world, the notions of possibility
and obligation are associated with the same non-factive, or subjunctive,
mood; and this is commonly also the mood of prediction, supposition,
intention, and desire, as it is in many of the Indo-European languages.
The so-called subjunctive of will is related to the imperative and to
deontic modality in traditional treatments of the Indo-European system
of moods and tenses; and the so-called subjunctive of likelihood to the
future tense and to prediction. But, as we have just seen, the future tense
in many languages, within and outside the Indo-European family, has
} 818 Modality
developed from forms that originally expressed desire and intention; and
desire and intention can hardly be separated from will. The notions of
will and likelihood, as these notions are employed in standard treatments
of the Indo-European system of mood, can therefore both be subsumed
under the more general notion of non-factivity.
It was because the subjunctive was the mood of non-factivity that it
could be used, in contrast with the indicative, for the expression of either
likelihood or will: i.e. in either epistemic or deontic utterances. We do
not need to say that the Proto-Indo-European subjunctive as such had
two distinct meanings, in order to account for the possibility of its being
used in predictions, like the future, or in directives, like the imperative.
The distinction between a prediction and a directive, like the distinc-
tion between a prediction and a promise is a matter of illocutionary force;
and this is not necessarily grammaticalized in the structure of primary
performatives (cf. 16.1). The sentence ‘You will be here at three
o'clock’ can reasonably be said to have the same meaning whether it is
uttered as a prediction or a directive, just as the sentence ‘I will be here
at three o’clock’ can be said to have the same meaning whether it is
uttered as a prediction or a promise.
So far we have tacitly assumed that all non-actual world-states must
be either past or future in relation to wo, the world as it actually is at fo.
But this is not so. We can make contra-factive assertions about states of
the world as it might have been at to. We can say, for example,
(15) If he hadn’t missed the plane, he would now be in London.
The reason for mentioning such utterances in this section is that in
many languages, including English, the grammatical category of past
tense is regularly used to convert a non-factive into a contra-factive
utterance.® Unlike (15), the following is non-factive, rather than contra-
factive:
(16) If he hadn’t missed the plane, he will now be in London.
Similarly in Latin, which, unlike Greek and Sanskrit, did not preserve a
distinctive mood of contra-factivity and remote possibility (the opta-
tive), the past tense of the subjunctive could be employed in contrast
with the present tense of the subjunctive to distinguish between non-
factive and contra-factive statements. ®
wicz (1964). Mood in Latin and Ancient Greek is treated from the viewpoint
of generative semantics (and on the basis of postulated underlying abstract
performative verbs: cf. 10.5, 16.5) by Lakoff (1968) and Lightfoot (1975)
respectively. On the non-factivity of the Spanish subjunctive, in certain con-
texts, cf. Rivero (1971). .
820 Modality
particular case of “‘there’’ vs. “here’’).1° Under this interpretation, tense
would bea specific kind of modality; and modality would be more closely
related to deixis. What is conventionally described as the present tense
would be the product of non-remoteness (“now’’) and factivity; past-
ness and futurity would not be defined directly in terms of temporal
indices (to = t; > tj; and tp = t; <t,;), but in terms of remoteness
(“then’’) and either factivity or non-factivity, the so-called past tense
being the product of remoteness and factivity, and the so-called future
tense being the product of non-remoteness and non-factivity; and
contra-factivity would be the product of remoteness and non-factivity.
There are certain obvious advantages in an analysis of this kind. It
would more directly reflect the difference in the epistemic status of the
past and the future. This difference is often expressed in the philoso-
phical literature by saying that, whereas the past is closed, the future is
open; and that, as a consequence, whereas statements about the past are
either true or false at the time of their utterance, statements about the
future (or, at least, about the occurrence of contingent events in the
future) are neither true nor false, but indeterminate in truth-value, at
the time of their utterance (cf. Gale, 1968b: 169ff). Regardless of
whether we say that statements about so-called future contingents have a
determinate truth-value or not (and this is a philosophically contro-
versial question with which we need not be concerned), we can hardly
deny that there is a difference in the epistemic status of the past and the
future; and, as we have seen, there are languages in which it is the
difference in the epistemic status of the situations being described,
rather than their temporal location in relation to the zero-point of
utterance that is of primary importance in determining the selection of
one so-called tense rather than another. For such languages (e.g., Hopi),
if not for all, what is often described as tense (cf. Hockett, 1958: 237) is
surely more properly regarded as the grammaticalization of epistemic
modality (i.e. as mood).
We will not pursue this question any further. What has been said will
be sufficient to show what is meant by saying that tense is a kind of
modality; and it is at least conceivable that different language-systems
should attach greater or less weight to temporal, rather than epistemic,
considerations in the grammaticalization of what is traditionally de-
scribed in terms of past, present and future. Certain systems of tense-
logic give recognition to the’epistemic difference between the past and the
10 This is the view that Joos (1964) takes of the past vs. non-past opposition in
English.
17.3. Tense as a modality 821
future, but they all seem to take the temporal notions of presentness and
precedence to be basic in the logical definition of tense (cf. Rescher &
Urquhart, 1971: 52ff). It may very well be that different kinds of tense-
logic are appropriate for the formalization of the tensed propositions
that are expressible in different languages and that, for certain languages
at least, tense-logic and epistemic logic will be indistinguishable. We
shall continue to operate with the temporal indices ¢o, ¢; and tj and with
the notions of pastness and futurity that are definable in terms of them.
In doing so, however, we recognize that something other than temporal
considerations may be involved.
The notational system developed, in this chapter, for the representa-
tion of tense is well adapted for the representation of what we described
earlier as the Augustinian view of tense: the view that past, present and
future are all located (in memory, observation or anticipation) in the
experiential present. The normal condition would therefore be t; = to:
i.e. the intensional world (w;) from which, as it were, we are asked to
look at the extensional world (w;) is the one that is defined as being
identical, temporally, with the speaker’s world. Deviation from this
norm would involve deictic projection into a past (t < to) or a future
intensional world (cf. 15.4). This would account for the use of so-called
secondary tenses in such texts as: Fohn was in a quandary (to = ti > ty)
—it was raining (t, >t; =t,;)—he had caught a cold on the previous
occasion (t, > t; > t;)- he would see her (anyway) on the following day
(to > t; < t;). Here, it will be noted, everything after John was in a
quandary (which has primary tense, defined by the normal condition
to = t;) involves secondary tense (cf. to > ti).
Of particular interest, in this connexion, is the analysis of
(18) It was raining
as
(19) (fo .) (te .) (tip)/to > te = by
(i.e. aS meaning, as it were, “It was a fact that it is raining’’) rather
than as
(20) (to.) (ta -) (stp)/to = te > by
(i.e. as “It is a fact that it was raining’’). It so happens that in English,
though not in all languages, the past tense can be used either as a
primary or a secondary tense: the difference between (19) and (20)
would come out quite strikingly in a language like Latin, which would
use a so-called indirect discourse construction for (19).
822 Modality
Granted that the notational system developed here for the representa-
tion of tense accounts neatly enough for the Augustinian view of tense
(and for deictic projection, or empathetic deixis, in so far as this deter-
mines the use of the so-called secondary tenses), it may be seen as an
objection to the system that it makes truth and factuality relative to some
temporally indexed intensional world. But truth, according to our
everyday view of the matter, 1s timeless (i.e. eternal); and, as was pointed
out above, for anyone who takes this view, “‘It is the case that p”’ will be
a timeless proposition, regardless of whether p itself is time-bound or
timeless (cf. 15.4).
There is no difficulty in representing the notion of objective and
eternal truth within the notational system that we have devised here. All
that is required is the restriction of the value of t; to timelessness: let us
symbolize this as tx. Tense would then be simply a matter of the
relation between ¢o and ¢;; and (18) would be analysed as
(21) (to.) (t¢-) (4p) /to > te; ty = be
(i.e. “It is timelessly true that at t; < fo it be raining”’).
The condition t; = ta would have exactly the same meaning (whatever
that might be) as the traditional phrase ‘sub specie aeternitatis’ (‘‘from
the viewpoint of eternity’’).
The problem lies in trying to make sense, philosophically, of a
formula like (21). This is a problem that faces anyone who tries to
reconcile the objectivity and eternity of truth with the subjectivity of
temporal becoming. It is more acute, of course, for a theologian who,
like St Augustine, wishes to reconcile the omniscience of God with the
openness of the future and with free will. But it is a problem which
arises, independently of all such considerations, in any serious analysis
of our everyday assumptions.
In so far as we lay claim to factual knowledge of the past and the
future, we assume that the factual status of what we know at any point
in time is unchanging and eternal, even though some of the propositions
that we know are tensed propositions (cf. 15.4). If it makes sense to say
_ _I know that it was raining yesterday and I know that it will rain tomorrow,
we must allow that both 4 = ¢) and t= to are assumed to hold
simultaneously by the person making either of these utterances. It is
part of what some philosophers would refer to as the logic of the verb
‘know’ that anyone using it commits himself to the objectivity and
eternity of what he claims to know. If, for any reason, he subsequently
17.4. Deontic modality 823
comes to the view that what at f, he claimed to know is not true, he will,
of course, say that he was wrong in what he said earlier, rather than that
what was true in wij = wo has now become false. It is by making this
kind of practical adjustment that we resolve, in everyday matters, the
problem of reconciling the eternity of truth with the temporality and
subjectivity of the viewpoint from which we assert that something is
true. We make the same kind of adjustment as we switch from the
experiential to the historical mode of discourse and from the dynamic to
the static conception of time (cf. 15.4).
The notational system that has been introduced in this section
neither resolves, nor does it beg without resolving, the philosophical
controversy between the temporalistic and the timeless theories of truth,
which “has raged for over two thousand years, with able philosophers
lining up on different sides’’ (cf. Gale, 1968a: 137). What the system
does, and is intended to do, is to make possible the treatment of tense
from an epistemic point of view. It allows us to relativize truth to some
time-bound intensional world (t; 4 t). But it does not force us to do
this: we can always hold to the notion of absolute truth (4; = tx), if we
wish. There are considerable advantages, however, as far as the analysis
of tense in natural languages is concerned, in allowing for the maximum
degree of flexibility in the assignment of values to t.
as
relate this so-be-it component to the modal notions of necessity and
possibility, bearing in mind the fact that deontic modality, like predic-
tion, involves a reference to a future world-state and that it is connected
in some way with intention, desire and will. If we assign tense-operators
to the neustic, tropic and phrastic of directives, we can represent the
logical structure of a command like
(2) Open the door
Tavistock Publications. |
Ardener, E. (ed.) (1971). Social Anthropology and Language. London:
Bibhography 851
Argyle, M. (1972). ‘Non-verbal communication in human social interaction’.
In Hinde (1972: 243-69).
Asher, R. E. (1968). ‘Existential, possessive, locative and copulative sentences
in Malayalam’. In Part 2 (1968) of Verhaar (1967-73). Pp. 88—111.
Ashton, E. O. (1947). Swahili Grammar, 2nd ed. London: Longman.
Atkinson, M. A. & Griffiths, P. G. (1973). ‘Here’s here’s, there’s here and
there’. In Edinburgh Working Papers in Linguistics, 3. 29-73.
Austin, J. L. (1962). How To Do Things With Words. Oxford: Clarendon Press.
Austin, J. L. (1970). Philosophical Papers, 2nd ed. London & New York:
Oxford University Press. (First edition, 1961.)
Ayer, A. J. (1936). Language, Truth and Logic. London: Gollancz. (Second
edition, revised, 1946.)
Babcock, 5S. (1972). “Periphrastic causatives’. Foundation of Language 8. 30-43.
Bach, E. (1968). ‘Nouns and noun-phrases’. In Bach & Harms (1968: 91-122.)
Bach, E. (1970). ‘Problominalization’. Linguistic Inquiry. 1. 121-2.
Bach, E. (1971). ‘Questions’. Linguistic Inquiry. 2. 153-66.
Bach, E. & Harms, R. (eds.) (1968). Universals in Linguistic Theory. New
York: Holt, Rinehart & Winston.
Bacon, John (1973). ‘The semantics of generic THE’. Journal of
Philosophical Logic 2. 323-39.
Baker, C. L. (1970). ‘Notes on the description of English questions’.
Foundations of Language 6. 197-219.
Baker, C. L. (1975). ‘The role of the part-of-speech distinctions in
generative grammar’. Theoretical Linguistics 2. 113-31.
Banfield, A. (1973). ‘Narrative style and the grammar of direct and indirect
discourse’. Foundations of Language 10. 1-39.
Bar-Hillel, Y. (1964). Language and Information. Reading, Mass. :
Addison-Wesley.
Bar-Hillel, Y. (1967a). ‘Dictionaries and meaning rules’, Foundations of
Language 3. 409-14. (Reprinted in Bar-Hillel, 1970: 347-53.)
Bar-Hillel, Y. (1967b). Review of Fodor & Katz (1964). In Language 43.
526-50. (Reprinted in Bar-Hillel, 1970: 243-69.)
Bar-Hillel, Y. (1970). Aspects of Language. Jerusalem: Magnes.
Bar-Hillel, Y. (ed.) (1971). Pragmatics of Natural Language.
Dordrecht-Holland: Reidel.
Barthes, R. (1970). S/Z. Paris: Seuil.
Bartsch, R. (1972). Adverbialsemanttk. (Linguistische Forschungen, 6.)
Frankfurt: Athenium.
Bartsch, R. & Vennemann, T. (1972). Semantic Structures. Frankfurt:
Athendum.
Bates, E. (1976). Language and Context: the Acquisition of Pragmatics. New
York: Academic Press.
Bauman, R. & Sherzer, J. (eds.) (1974). Explorations in the Ethnography of
Speaking. London & New York: Cambridge University Press.
Bazell, C. E. (1964). ‘Three misconceptions of ‘‘grammaticalness’’’.
Monograph Series on Language and Linguistics 17. 3-9. (Reprinted in
O’Brien, 1968: 25-31.)
852 Bibliography
Bazell, C. E. et al. (eds.) (1966). In Memory of 7. R. Firth. London:
Longman.
Beekman, J. & Callow, J. (1974). Translating the Word of God. Grand
Rapids, Michigan: Zondervan.
Bellert, I. (1970). ‘On the semantic interpretation of subject-predicate
relations in sentences of particular reference’. In Bierwisch & Heidolph
(1970: 9-26).
Bellert, I. (1972). On the Logico-Semantic Structure of Utterances. Wroclaw:
Ossolineum.
Bene3, E. & Vachek, J. (eds.) (1971). Stilisttk und Soziolinguisttk. Miinchen: List.
Bennett, D. C. (1972). ‘Some observations concerning the locative directional
distinction’, Semiotica 5. 109-27.
Bennett, D. C. (1975). Spatial and Temporal Uses of English Prepositions.
London: Longman.
Bennett, J. (1973). ‘The meaning-nominalist strategy’. Foundations of
Language 10. 141-68.
Bennett, J. (1976). Linguistic Behaviour. Cambridge: Cambridge University
Press
Benveniste, E. (1946). ‘Structure des relations de personne dans le verbe’.
Bulletin de la Société de Linguistique 126. 1-12. (Reprinted in Benveniste,
1966: 225-36.)
Benveniste, E. (1956). ‘La nature des pronoms’. In M. Halle et al. (eds.)
For Roman Jakobson. The Hague: Mouton. (Reprinted in Benveniste,
1966: 251-7.)
Benveniste, E. (1958a). ‘De la subjectivité dans le langage’ (from Journal de
Psychologie). In Benveniste (1966: 258-66). |
Benveniste, E. (1958b). ‘Les verbes délocutifs’ (from A. G. Hatcher & K. L.
Selig (eds.) Mélanges Spitzer. Bern: Francke). (In Benveniste, 1966: 277-85.)
Benveniste, E. (1966). Problémes de Linguistique Générale. Paris: Gallimard.
(English translation, Problems in General Lingutstics. Coral Gables, Fla.:
University of Miami Press, 1971.)
Benveniste, E. (1967). Le Vocabulaire des Institutions Indo-Européennes.
Paris: Minuit. (English translation, Indo-European Language and Society.
London: Faber & Faber, 1973.)
Berlin, B. (1968). Tzeltal Numeral Classifiers: A Study in Ethnographic
Semantics. The Hague: Mouton.
Berlin, B., Breedlove, D. E. & Raven, P. M. (1966). ‘ Folk taxonomies and
biological classification’. Science 154. 273-5. (Reprinted in Tyler, 1969.)
Berlin, B., Breedlove, D. E. & Raven, P. H. (1974). Principles of Tzeltal
Plant Classification. New York & London: Academic Press. }
Bierwisch, M. (1967). ‘Some semantic universals of German adjectivals’.
Foundations of Language 3. 1-36.
Bierwisch, M. (1970). ‘Semantics’. In Lyons (1970: 166-84).
Bierwisch, M. & Heidolph, K. E. (1970). Progress in Linguistics. The Hague:
. Mouton.
Biggs, C. (1975). ‘Quantifiers, definite descriptions and reference’. In
Keenan (1975: 112-20).
Bibliography 853
Bloomfield, L. (1931). Review of Ries (1931). In Language 7. 204-9.
Bloomfield, L. (1935). Language. London: Allen & Unwin. (American
edition, 1933.)
Boadi, L. A. (1971). ‘Existential sentences in Akan’. Foundations of Language
7. 19-29.
Boas, F. (1911). Introduction to the Handbook of American Indian Languages.
Washington D.C. (Reprinted, Washington, D.C.: Georgetown University
Press, 1965. Also, Lincoln, Neb.: University of Nebraska Press, 1965.)
Bolinger, D. L. (1952). ‘Linear modification’. Publications of the Modern
Language Association of America 67. 1117-44. (Excerpts reprinted in
Householder, 1972: 31-50.)
Bolinger, D. L. (1965). ‘The atomization of meaning’. Language 41. 555-73.
Bolinger, D. L. (1967a). ‘The imperative in English’. In To Honor Roman
Jakobson. The Hague: Mouton.
Bolinger, D. L. (1967b). ‘Adjectives in English’. Lingua 18. 1-34.
Bolinger, D. L. (1968). ‘Judgements of grammaticality’. Lingua 21. 34-40.
Bolinger, D. L. (1971). The Phrasal Verb in English. Cambridge, Mass. :
Harvard University Press.
Bonomi, A. & Usberti, B. (1971). Sintasst e Semantica nella Grammatica
Trasformazionale. Milano: Saggiatore.
Botha, R. P. (1968). The Function of the Lexicon in Transformational
Grammar. The Hague: Mouton.
Botha, R. P. (1973). The Justification of Linguistic Hypotheses. The Hague:
Mouton.
Bowers, J. S. (1975). ‘Adjectives and adverbs in English’. Foundations of
Language 13. 529-62.
Boyd, J. & Thorne, J. P. (1969). ‘The semantics of modal verbs’. Fournal of
Linguistics §. 57-74.
Braine, M. (1963). ‘The ontogeny of English phrase structure’. Language 39.
I-14.
Bréal, M. (1897). Essai de Sémantique. Paris. (English translation, Semantics.
London: Holt, 1900. Republished, New York: Dover, 1964.)
Brekle, H. E. (1970). Generative Satzsemantik und Transformationelle Syntax
im System der Englischen Nominalkomposition. Miinchen: Fink.
Bresnan, J. (1970). ‘On complementizers: towards a syntactic theory of
complement types’. Foundations of Language 6. 297-321.
Bright, W. (ed.) (1966). Sociolinguistics. The Hague: Mouton.
Brooke-Rose, C. (1958). 4 Grammar of Metaphor. London: Secker &
Warburg.
Brown, R. (1958). Words and Things. Glencoe, IIl.: Free Press.
Brown, R. (1973). A First Language. Cambridge, Mass.: Harvard University
Press.
Brown, R. & Ford, M. (1961). ‘Address in American English’. Journal of
Abnormal and Social Psychology 62. 375-85. (Reprinted in Hymes, 1964:
234-44.)
Brown, R. & Gilman, A. (1960). ‘The pronouns of power and solidarity’.
In Sebeok (1960: 253-76). (Reprinted in Giglioli, 1972: 252-82.)
854 Bibliography
Browne, W. (1970). ‘Noun phrase definiteness in relatives: evidence from
Macedonian’ & ‘More on definiteness markers: interrogatives in Persian’.
Linguistic Inquiry 1. 267-70 & 359-63.
Bruner, J. S. (1974/5). ‘From communication to language’. Cognition 3.
255-87.
Bruner, J. S. (1975). ‘The ontogenesis of speech acts’. Journal of Child
Language 2. 1-19.
Bihler, K. (1934). Sprachtheorie. Jena: Fischer. (Reprinted Stuttgart:
Fischer, 1965.)
Bull, W. E. (1963). Time, Tense and the Verb. Berkeley & Los Angeles:
University of California Press.
Burling, R. (1965). ‘How to choose a Burmese classifier’. In M. Spiro (ed.)
Context and Meaning in Cultural Anthropology. New York: Free Press.
Pp. 243-65.
Burton-Roberts, N. (1975). ‘Nominal apposition’. Foundations of Language
13. 391-419.
Callow, K. (1974). Discourse Considerations in Translating the Word of God.
Grand Rapids, Michigan: Zondervan.
Carnap, R. (1956). Meaning and Necessity, 2nd ed. Chicago: Chicago
University Press.
Carter, R. M. (1976). ‘Chipewyan classificatory verbs’. International Journal
of American Linguistics 42. 24-30.
Castafieda, H. (1968). ‘On the logic of attributing self-knowledge to others’.
Fournal of Philosophy 65. 439-56.
Caton, C. E. (ed.) (1963). Philosophy and Ordinary Language. Urbana, HL:
University of Illinois Press.
Caton, C. E. (1966). ‘Epistemic qualification of things said in English’.
Foundations of Language 2. 37-66.
Cedergren, H. J. & Sankoff, D. (1974). ‘Variable rules: performance as a
statistical reflection of competence’. Language 50. 333-55.
Chafe, W. L. (1968). ‘Idiomaticity as an anomaly in the Chomskyan
paradigm’. Foundations of Language 4. 109-27.
Chafe, W. L. (1970). 4 Semantically Based Sketch of Onondaga. (Indiana
University Publications in Anthropology and Linguistics, 25. 1.)
Chafe, W. L. (1971). Meaning and the Structure of Language. Chicago &
London: University of Chicago Press.
Chao, Y. R. (1968). A Grammar of Spoken Chinese. Berkeley & Los Angeles:
University of California Press.
Chapin, P. (1967). ‘On affixation in English’. In Bierwisch & Heidolph
(1970: 49-63).
Chatman, S. (ed.) (1971). Literary Style: A Symposium. London: Oxford
University Press.
Chomsky, N. (1957). Syntactic Structures. The Hague: Mouton.
Chomsky, N. (1965). Aspects of the Theory of Syntax. Cambridge, Mass.:
MIT Press.
Chomsky, N. (1968). Language and Mind. New York: Harcourt, Brace &
World. (Enlarged edition, 1972.)
Bibhography 855
Chomsky, N. (1969). ‘Deep structure, surface structure and semantic
interpretation’. In Steinberg & Jakobovits (1971: 183-216). Reprinted in
Chomsky (1972: 62-119).
Chomsky, N. (1970). ‘Remarks on nominalization’. In Jacobs & Rosenbaum
(1970: 184-221). (Reprinted in Chomsky, 1972: 11-61.)
Chomsky, N. (1972). Srudies on Semantics in Generative Grammar. The
Hague: Mouton.
Chomsky, N. (1976a). The Logical Structure of Linguistic Theory. New York
& London: Plenum. (Partly revised version of unpublished 1955
manuscript, with a new Introduction.)
Chomsky, N. (1976b). Reflections on Language. London: Temple Smith.
Christie, J. J. (1970). ‘Locative, possessive and existential in Swahili’.
Foundations of Language 6. 166-77.
Christophersen, P. (1939). The Articles: A Study of their Theory and Use in
English. Copenhagen: Munksgaard & London: Oxford University Press.
Cicourel, A. V. (1973). Cognitive Sociology. Language and Meaning in Social
Interaction. Harmondsworth: Penguin.
Clark, E. V. (1973). ‘What’s in a word’. In Moore (1973: 28-63).
Clark, E. V. (1974). ‘Normal states and evaluative viewpoints’. Language 50.
316-32.
Clark, E. V. (1977). ‘From gesture to word: the natural history of deixis’. In
J.S. Bruner & A. Garton (eds.) Human Growth and Development. London:
Oxford University Press. .
Clark, E. V. & Garnica, O. K. (1974). ‘Is he coming or going?’. fournal of
Verbal Learning and Verbal Behavior 13. 556-72.
Clark, H. H. (1973). ‘Space, time, semantics and the child’. In T. E. Moore
(1973: 28-64).
Cohen, L. J. & Margalit, A. (1970). ‘The role of inductive reasoning in the
interpretation of metaphor’. Synthese 21. 483-500. (Reprinted in
Davidson & Harman, 1972: 722~—740.)
Cole, P. (1974). ‘Indefiniteness and anaphoricity’. Language 50. 665-74.
Cole, P. & Morgan, J. L. (eds.) (1975). Syntax and Semantics, 3: Speech
Acts. New York & London: Academic Press.
Closs, E. (1967). ‘Some copula construction in Swahili’, fournal of
African Linguistics 6. 105-31. .
Collinson, W. E. (1937). Indication: A Study of Demonstratives, Articles and
Other ‘ Indicaters’. (Language, Monograph 17.) Baltimore: Linguistic
Society of America.
Comrie, B. (1976). Aspect. London, New York & Melbourne: Cambridge
University Press. |
Conklin, H. C. (1972). Folk Classification: A Topically Arranged Bibliography
of Contemporary and Background References Through 1971. New Haven:
Department of Anthropology, Yale University.
Cooper, David E. (1974). Presupposition. The Hague & Paris: Mouton.
Coseriu, E. (1970). Sprache: Strukturen und Funktionen. 'Tibingen: Gunter
Narr.
Cresswell, M. J. (1974). ‘Adverbs and events’. Synthese 28. 455-81.
826 Bibliography ,
Cruse, D. A. (1973). ‘Some thoughts on agentivity’. Journal of Linguistics
Q. 11-23.
Crymes, R. (1968). Some Systems of Substitution Correlations in Modern
American English. The Hague: Mouton.
Crystal, D. & Davy, D. (1969). Investigating English Style. London: Longman.
Culler, J. (1975). Structuralist Poetics. London: Routledge & Kegan Paul.
Dahl, O. (1969). Topic and Comment: A Study in Russian and General
Transformational Grammar, (Slavica Gothenburgensia, 4.) Stockholm:
Almaqvist & Wiksell.
Dahl, O. (1970). ‘Some notes on indefinites’. Language 46. 33-41.
Dahl, O. (1971). ‘Tenses and world-states’. Gothenburg Papers in Theoretical
Linguistics, 6.
Dane, F. (1968). ‘Some thoughts on the semantic structure of the sentence’.
Lingua 21. §5-9.
Dascal, M. & Margalit, A. (1974). ‘A new ‘“‘revolution”’ in linguistics? —
‘“Text-grammars’”’ vs. ‘‘Sentence-grammars’”’’. Theoretical Linguistics 1.
195-213.
Davidson, D. (1967). ‘The logical form of action sentences’. In N. Rescher
(ed.) The Logic of Decision and Action. Pittsburgh: University of
Pittsburgh Press. Pp. 81~120.
Davidson, D. & Harman, G. (eds.) (1972). Semantics of Natural Language.
Dordrecht-Holland: Reidel.
Davidson, D. & Harman, G. (1975). The Logic of Grammar. Encino,
California: Dickenson.
Davy, D. & Quirk, R. (1969). ‘An acceptability experiment with spoken
output’. fournal of Linguistics 5. 109-20.
De Lemos, C. G. (1975). The Use of ‘Ser’ and ‘Estar’ in Brazilian
Portuguese with Particular Reference to Child Language Acquisition.
University of Edinburgh, Ph.D. Dissertation.
Dik, S. (1968). Co-ordination. Amsterdam: North-Holland.
Dik, S. (1973). ‘Crossing co-reference again’. Foundations of Language 9.
306-26.
Dik, S. (1975). ‘The semantic representation of manner adverbials’. In
A. Kraak (ed.) Linguistics in the Netherlands. Assen & Amsterdam:
Vangorevm.
Dixon, R. M. W. (1968). “Noun-classes’. Lingua 21. 104-25.
205-23). :
Dixon, R. M. W. (1971). ‘A method of semantic description’. In Steinberg
& Jakobovits (1971: 436-71).
Dixon, R. M. W. (1973). ‘The semantics of giving’. In Gross et al. (1973:
Akademisk Forlag.
(Travaux du Cercle Linguistique de Copenhague, 17.) Copenhague: —
109~26). |
Robins, R. H. (1971). General Linguistics, 2nd ed. London: Longman.
Rohrer, C. (1971). ‘Zur Theorie der Fragesatze’. In Wunderlich (1971:
(1970: 222—72). ,
Ross, J. R. (1970). ‘On declarative sentences’. In Jacobs & Rosenbaum
University Press. -
Tarski, A. (1956). Logic, Semantics, Metamathematics. London: Oxford
& Winston.
Tyler, S. A. (ed.) (1969). Cognitive Anthropology. New York: Holt, Rinehart
Asterisks indicate technical terms; bold type denotes the page number
where a technical term ts introduced.
638 534
canonical situation of utterance, 637, | complex* lexemes, 521, 523, 524,