Categorial Features a Generative Theory of Word Class Categories
Categorial Features a Generative Theory of Word Class Categories
CATEGORIAL FEATURES
A Generative Theory of Word Class Categories
C A T E G O R I A L FE A T U R E S
A GENERATIVE THEORY OF WORD
CLASS CATEGORIES
PHOEVOS PANAGIOTIDIS
University of Cyprus
University Printing House, Cambridge CB2 8BS, United Kingdom
www.cambridge.org
Information on this title: www.cambridge.org/9781107038110
© Phoevos Panagiotidis 2015
This publication is in copyright. Subject to statutory exception
and to the provisions of relevant collective licensing agreements,
no reproduction of any part may take place without the written
permission of Cambridge University Press.
First published 2015
Printed in the United Kingdom by Clays, St lves plc
A catalogue record for this publication is available from the British Library
Library of Congress Cataloguing in Publication data
Panagiotidis, Phoevos.
Categorial features : a generative theory of word class categories / Phoevos Panagiotidis.
pages cm – (Cambridge studies in linguistics ; 145)
ISBN 978-1-107-03811-0 (Hardback)
1. Grammar, Comparative and general–Grammaticalization. 2. Categorial grammar.
3. Language, Universal. I. Title.
P299.G73P36 2014
415–dc23 2014020939
ISBN 978-1-107-03811-0 Hardback
Cambridge University Press has no responsibility for the persistence or accuracy of
URLs for external or third-party internet websites referred to in this publication,
and does not guarantee that any content on such websites is, or will remain,
accurate or appropriate.
Contents
ix
x Contents
4 Categorial features 78
4.1 Introduction 78
4.2 Answering the old questions 78
4.3 Categorial features: a matter of perspective 82
4.4 The Categorization Assumption and roots 89
4.4.1 The Categorization Assumption 89
4.4.2 The interpretation of free roots 93
4.4.3 The role of categorization 95
4.4.4 nPs and vPs as idioms 97
4.5 Categorizers are not functional 98
4.6 Nouns and verbs 100
4.6.1 Keeping [N] and [V] separate? 101
4.6.2 Do Farsi verbs always contain nouns? 103
References 189
Index 204
Preface
The project resulting in this monograph began in 1999, when I realized that
I had to answer the question of why pronouns cannot possibly be ‘intransitive
determiners’, why it is impossible for Determiner Phrases (DPs) consisting of a
‘dangling D head’ (a turn of phrase my then PhD supervisor, Roger Hawkins,
used) – that is, made of a Determiner without a nominal complement – to exist.
The first answer I came up with was Categorial Deficiency, extensively argued
for in Chapter 5. Back then, however, Categorial Deficiency of functional
heads was just an idea, which was expounded in my (2000) paper. The case
for it was limited to arguments from biuniqueness and the hope was that it
would eventually capture Head Movement. The paper was delivered at the
April 2000 Spring Meeting of LAGB, in the front yard of UCL, in the open:
the fire alarm, this almost indispensable element of British identity and social
life, went off seconds after the talk started. It did not look good. However,
Categorial Deficiency did find its way into my thesis and the (2002) book
version thereof.
There were more serious problems, though: I quickly realized that ‘uninter-
pretable [N]’ and ‘uninterpretable [V]’ mean nothing if we have no inkling
of the actual interpretation of ‘interpretable [N]’ and ‘interpretable [V]’.
This inevitably brought me to the question of the nature of categorial features
and what it means to be a noun, a verb and an adjective. Surprisingly, this was
an issue very few people found of any interest, so for a couple of years or
so I thought I should forget about the whole thing. This outlook changed
dramatically in 2003, when Mark Baker’s book was published: a generative
theory of lexical categories with precise predictions about the function and
interpretation of categorial features. On the one hand, I was elated: it was about
time; on the other, I was disappointed: what else was there to say on lexical
categories and categorial features?
Quite a lot, as it turned out. Soon after my (2005) paper against syntactic
categorization, I had extensive discussions with Alan Bale and, later, Heidi
Harley. These were the impetus of my conversion to a syntactic decomposition
xiii
xiv Preface
Marantz took the time and the effort when I needed his sobering feedback
most, when I was trying to answer too many questions on idiomaticity and root
interpretation. Discussions with Sandeep Prasada, and his kindly sharing his
unpublished work on sortality with me, provided a much-needed push and the
opportunity to step back and reconsider nominality. Gratitude also goes to
Marc Richards, the man with the phases and with even more patience. Luigi
Rizzi has been a constant source of support and insight, through both gentle
nudges and detailed discussions. David Willis’ comments on categorial Agree
and its relation to movement gave me the impetus to make the related discus-
sion in Chapter 5 bolder and, I hope, more coherent.
I also wish to thank the following for comments and discussion, although
I am sure I must have left too many people out: Mark Aronoff, Adriana
Belletti, Theresa Biberauer, Lisa Cheng, Harald Clahsen, Marijke De Belder,
Carlos de Cuba, Marcel den Dikken, Jan Don, Edit Doron, Joe Emonds,
Claudia Felser, Anastasia Giannakidou, Liliane Haegeman, Roger Hawkins,
Norbert Hornstein, Gholamhosein Karimi-Doostan, Peter Kosta, Olga Kva-
sova, Lisa Levinson, Pino Longobardi, Jean Lowenstamm, Rita Manzini, Ora
Matushansky, Jason Merchant, Dimitris Michelioudakis, Ad Neeleman, Rolf
Noyer, David Pesetsky, Andrew Radford, Ian Roberts, Peter Svenonius,
George Tsoulas, Peyman Vahdati, Hans van de Koot, Henk van Riemsdijk.
I also wish to thank for their comments and feedback the audiences in Cyprus
(on various occasions), Utrecht, Pisa, Potsdam, Jerusalem, Patras, Paris, Athens
and Salonica (again, on various occasions), Cambridge (twice, the second time
when I was kindly invited by Theresa Biberauer to teach a mini course on
categories), Chicago, Stony Brook, NYU and CUNY, Florence, Siena, Essex,
Amsterdam, Leiden, York, Trondheim, Lisbon and London.
Needless to say, this book would have never been completed without
Joanna’s constant patience and support.
My sincere gratitude goes out to the reviewers and referees who have looked
at pieces of this work: from the editor and the referees at Language who
compiled the long and extensive rejection report, a piece of writing that
perhaps influenced the course of this research project as significantly as key
bibliography on the topic, to anonymous referees in other journals, and to the
reviewers of Cambridge University Press. Last but not least, I wish to express
my gratitude to the Editorial Board of the Cambridge Studies in Linguistics
for their trust, encouragement and comments.
Finally, I wish to dedicate this book with sincere and most profound
gratitude to my teacher, mentor and friend Neil V. Smith.
1 Theories of grammatical category
1.1 Introduction
In this first chapter, we will review some preliminaries of our discussion on
parts of speech and on the word classes they define. As in the rest of this
monograph, our focus will be on lexical categories, more specifically nouns
and verbs. Then I will present a number of approaches in different theoretical
frameworks and from a variety of viewpoints. At the same time we will discuss
the generalizations that shed light on the nature of parts of speech, as well as
some necessary conceptual commitments that need to inform our building a
feature-based theory of lexical categories.
First of all, in Section 1.2 the distinction between ‘word class’ and ‘syntactic
category’ is drawn. The criteria used pre-theoretically, or otherwise, to distin-
guish between lexical categories are examined: notional, morphological and
syntactic; a brief review of prototype-based approaches is also included.
Section 1.3 looks at formal approaches and at theories positing that nouns
and verbs are specified in the lexicon as such, that categorial specification is
learned as a feature of words belonging to lexical categories. Section 1.4
introduces the formal analyses according to which categorization is a syntactic
process operating on category-less root material: nouns and verbs are ‘made’
in the syntax according to this view. Section 1.5 takes a look at two notional
approaches to lexical word classes and raises the question of how their insights
and generalizations could be incorporated into a generative approach. Section
1.6 briefly presents such an approach, the one to be discussed and argued for in
this book, an account that places at centre stage the claim that categorial
features are interpretable features.
1
2 Theories of grammatical category
1
A distinction already made in Anderson (1997, 12).
4 Theories of grammatical category
both future will and infinitival to belong to the same part of speech, the
category Tense, they belong to different syntactic categories, if syntactic
categories are to be defined on the grounds of distribution and distinct syntactic
behaviour.
Of course, one may (not without basis) object to applying distinctions
such as ‘part of speech’ versus ‘syntactic category’ to functional elements.
However, similar considerations apply to nouns – for example, proper nouns
as opposed to common ones, as discussed already in Chomsky (1965). Proper
and common nouns belong to the same part of speech, the same word class;
however, their syntactic behaviour (e.g., towards modification by adjectives,
relative clauses and so on) and their distribution (e.g., whether they may merge
with quantifiers and determiners . . .) are distinct, making them two separate
syntactic categories. This state is, perhaps, even more vividly illustrated by the
difference between count and mass nouns: although they belong to the same
word class, Noun, they display distinct syntactic behaviours (e.g., when
pluralized) and differences in distribution (e.g., regarding their compatibility
with numerals), as a result of marking distinct formal features.2
The stand I am going to take here is pretty straightforward: any formal
feature may (and in fact does) define a syntactic category, if syntactic categor-
ies are to be defined on the grounds of syntactic behaviour and if syntactic
behaviour is the result of interactions and relations (exclusively Agree rela-
tions, according to a probable hypothesis) among formal features. At the same
time, only categorial features define word classes – that is, parts of speech.
This will turn out to hold not only for lexical categories like noun and verb, as
expected, but for functional categories as well.
Henceforth, when using the term ‘category’ or ‘categories’, I will refer to
word class(es) and part(s) of speech, unless otherwise specified.
2
An anonymous reviewer’s comments are gratefully acknowledged here.
1.2 Approaching the part-of-speech problem 5
number tense
case aspect
gender agreement3
Of course, here too, some semantic interpretation is involved, albeit indirectly:
for instance, the correlation of nouns with number, on the one hand, and of
verbs with tense, on the other, does not appear to be accidental – or, at least, it
3
Agreement with arguments, subjects most typically.
1.2 Approaching the part-of-speech problem 7
4
Gender systems typically fall somewhere in between (Corbett 1991).
5
Nordlinger and Sadler (2004) and Lecarme (2004) argue that nominals (certainly encased inside
a functional shell) can be marked for independent tense – that is, bear a time specification
independent from that of the main event (and its verb). However, Tonhauser (2005, 2007)
convincingly argues against the existence of nominal Tense, taking it to be nominal Aspect
instead.
8 Theories of grammatical category
rephrase the noun–verb distinction along the terms of whether the expression
of their argument structures is obligatory or not:6
(3) NOUN VERB
6
Fu, Roeper and Borer (2001) influentially explain away such ‘complications’ by claiming that
process nominals contain verb phrases (VPs). Certainly, the expression of argument structure in
nominals can be a more intricate affair than Indo-European facts suggest: Stiebels (1999)
discusses Nahuatl, a language where all sorts of derived nominals, not just those with an event
reading, express their argument structure via affixes common with their base verbs.
1.2 Approaching the part-of-speech problem 9
(2003, secs. 1.1–1.3) elaborates on the issues with this approach and with the
prototypical approach in general, principally along the lines of prototypes
predicting very little. Thus, a verb like persist encodes time-stability by
definition, whereas a noun like tachyon has time-instability encoded in its
meaning. Of course, the existence of nouns like tachyon, which express non-
time-stable concepts does not contradict protypicality: tachyon would qualify
as a non-protypical noun. Similar facts hold for non-prototypical verbs
expressing more or less time-stable concepts. This is precisely the problem
of what prototype-based theories of word classes actually predict. Consider,
for instance, the mid-section of the time stability continuum, where non-
prototypical relatively time-stable ‘verbal’ concepts, non-prototypical rela-
tively non-time-stable ‘nominal’ concepts and ‘adjectival’ ones (between
nouns and verbs, by definition) co-exist: the question is what conceptual
mechanism decides which category concepts populating that middle area are
assigned to? Is category-assignment performed at random? This is a matter that
Rauh (2010, 313–21) also raises, although departing from a slightly different
set of theoretical concerns; she goes on to argue for discrete boundaries
between categories.
A more interesting issue is one mentioned above: prototypical (like rock)
and less prototypical (like theory) nouns and prototypical (like buy) and less
prototypical (like instantiate) verbs all behave in the same fashion as far as
grammar itself is concerned (Newmeyer 1998, chap. 4). Clearly, to the extent
that prototypicality matters for the mechanism of the Language Faculty per se,
and to the extent that prototypicality is reflected on the grammatical behaviour
of nouns, verbs and adjectives, prototype effects spring from factors external to
the syntax.7
The limited role of prototypicality as far as the grammar-internal behaviour
of more prototypical or less prototypical members of a category is concerned
is acknowledged in Croft (1991, 2001), who argues that prototypicality correl-
ates with two kinds of markedness patterns across languages. First, prototypi-
cality correlates with structural markedness, in that items deviating from the
semantic prototype (e.g., referential expressions that denote events, like hand-
shake or wedding, or object-denoting words used as predicates, like ice in The
water became ice) tend to occur with additional morphemes. Interestingly, this
is a generalization about the functional layer around an event-denoting noun or
an object-denoting predicate, not about the lexical elements themselves.
7
I am grateful to a reviewer for this discussion.
1.2 Approaching the part-of-speech problem 11
The task at hand, the one that the theory of categorial features to be presented
here will take up, is to explain these differences in a principled manner. We
have already discussed the shortcomings of prototype-based analyses: they
neither predict the identical grammatical behaviour of prototypical nouns like
stone to that of non-prototypical ones like theory, nor do they explain the
differences between nouns and verbs. Still, most generative theories of
category are not faring any better (Baker 2003, chap. 1): they are also descrip-
tive part-of-speech systems that make no predictions about either the syntactic
behaviour or the semantic interpretation of an element x belonging to a
12 Theories of grammatical category
Noun þ
Verb þ
Adjective þ þ
Preposition
1.3 Categories in the lexicon 13
So, let us comment on the system represented in (5): we have the cross-
categorization of four categories by means of two binary features. Hence,
instead of three (or four) primitive lexical categories, we have two primitive
binary features: [N] and [V] with their (expected) values. The attributes [N]
and [V] are not identified with any sort of interpretation, at least not in the
original Amherst system and not in any clear-cut way, and the values
apparently mark the positive versus negative value of such an attribute,
whatever it may represent. The picture, however, certainly looks elegant, as
now it is possible to cross-categorize lexical categories. Having said that, this
cross-categorizing cannot become truly significant, or even useful, until we
resolve the question of what these features, and their values, stand for. To
make this clearer, it is very difficult to get the batteries of properties character-
istic of nouns and verbs in (4) to result from the system in (5). Following Baker
(2003), I agree that this is the main problem with the Amherst feature system:
it is a purely taxonomic system that does not predict much. Should we take up
a more clement attitude to it, the Amherst system establishes cross-
categorizations at an abstract level: nouns share a [þN] feature with adjectives,
verbs a [þV], again with adjectives, and the two appear to be completely
dissociated. Although these are hardly trivial predictions, they are clearly not
robust enough to capture the picture in (4) or, even, part thereof. One reason
for this problem is exactly that we do not know what the attributes [N] and [V]
stand for. All we have at this point is nouns and adjectives forming a natural
class, both being [þN], and verbs and adjectives forming a natural class
because of their [þV] value; finally, prepositions also belong to a natural
class with nouns (both categories being [V]) and verbs, too (due to a
common [N] value) – but not with adjectives (cf. Stowell 1981, 21–2).
Departing from the natural classes defined by the Amherst system of
categorial features [N] and [V], Stowell sets out in the first chapter of his
(1981) thesis to show that these are classes to which syntactic and morpho-
logical rules actually refer. These classes are given in (6).
(6) Natural classes according to the Amherst system
8
The lexical–functional distinction and biuniqueness are examined in Chapter 5 of this book.
1.3 Categories in the lexicon 15
9
That is, the projection of V and that of N.
16 Theories of grammatical category
memorized them as such and because they are part of the lexicon – the lexicon
containing everything in language that must be memorized: ‘a list of excep-
tions, whatever does not follow from general principles’ (Chomsky 1995,
234). In other words, lexical elements come from the lexicon labelled for
a category: noun, verb and so on. The lexicon contains entries like dog
[þN, V] and do [N, þV]. Of course, this is roughly the way most
traditional grammars view the matter, as well.
Baker (2003) explicitly announces that he aims to give content to the
features [N] and [V], rather than view them as convenient labels that create
taxonomies. He revises the system in (5) by positing two privative features,
like Déchaine (1993), instead of the received binary ones. Importantly,
these are expressly hypothesized to be LF-interpretable features as well
as to trigger particular syntactic behaviours. The table in (9) summarizes
this:
(9) Semantic interpretation Syntactic behaviour
Sortality is what makes nouns nominal. Baker (2003, 290) essentially treats
a sortal concept as one that canonically complies with the principle of
identity: a sortal concept is such that it can be said about it that it is the
same as or different from X.10 Furthermore, sortality is understood as the
very property that enables nouns to bear referential indices. At the same
time, [V], which makes verbs, is taken to encode predication. Baker also
argues that verbs are the only lexical categories that can stand as predicates
without the mediation of a functional category Pred. The predicative nature
of verbs is correlated with them arguably being the only lexical category
that projects a specifier.
Baker’s (2003) system yields the following lexical categories:
(10) Nouns [N] ! sortal concepts, referential indices in syntax
Verbs [V] ! predicates, with (subject) specifiers in syntax
Adjectives ! ‘other’ concepts, pure properties
10
We will return to a more detailed discussion of sortality in Chapter 4.
1.4 Deconstructing categories 17
and quantifiers). His account, which forms the basis from which our theory of
categorial features departs, is revised in Chapter 4, extended in Chapters 5 and
6 and further pored over in the Appendix.
(11)
See Marantz (2000) – also Halle and Marantz (1993) – for the background
regarding the categorial decomposition in Distributed Morphology. We will
return to this framework, in which the theory presented here is couched, in
Chapter 3.
Expanding upon the rationale of this approach, we can say that the noun–
verb distinction is purely configurational. Nouns and verbs are essentially
structures containing a root and being characterized purely and exclusively
by the functional environment around the said root. So, what matters is the
syntactic positioning of the root. To illustrate, if a root (say SLEEP) is inserted
within a functional complex like the following, then it becomes a verb:
(13)
1.5 Notional approach: Langacker and Anderson 19
11
I gratefully acknowledge comments by an anonymous reviewer on Cognitive Grammar.
20 Theories of grammatical category
However, THING does not stand for a physical object (this would bring us back
to naïve ontologically informed taxonomies of school grammars) but, rather, as
a ‘region in some domain’ (Langacker 1987, 189). Immediately it becomes
evident, at least intuitively, how rock, theory and wedding can all be nouns:
they define areas in different conceptual domains. Of course, a lot depends on
what these conceptual domains are and, most crucially, how exactly they are
mapped on words – see Acquaviva (2009a) for criticism and Acquaviva and
Panagiotidis (2012) for discussion. However, understanding categories as con-
ceptualizations instead of ontologies constitutes a major step in understanding
what they mean and how they work.
As far as the other ‘basic’ categories are concerned, adjectives and adverbs
conceptualize ATEMPORAL RELATIONS (Langacker 1987, 248), whereas
verbs are understood as conceptualizing PROCESSES. Regarding the gener-
alization in Section 2.5 – namely, that the majority of words meaning physical
objects are nouns cross-linguistically – Langacker (2000, 10) captures it by
appealing to the prototypicality of THING when it comes to object concepts
and to the protypicality of PROCESS when it comes to ‘an asymmetrical
energetic interaction involving an agent and a patient’. The details are very
interesting and certainly intricate; however, they are not readily translatable to
a formal framework like the one employed here.
Turning now to Anderson (1997), the picture is similar to that presented by
Langacker’s approach, with three important differences: Anderson’s frame-
work is a formal one and expresses generalizations by encoding them as
features. He understands categories as ‘grammaticalisations of cognitive – or
notional – constructs’ (Anderson 1997, 1), thus making his theory more
compatible with the underpinnings of a generative theory: he essentially claims
that categories are ways in which grammar, which forms a separate module
from general cognition, translates or imports (‘grammaticalizes’) concepts.
Third, he formalizes the different categories, the different grammaticalizations
of concepts, using two features and a relation of preponderance between them:
a feature P, standing for predicativity or, rather, predicability, and a feature N,
standing for the ability to function as an argument. The resulting categories for
English are the following:
Explicating, auxiliaries {P} are only usable as predicates, names {N} only as
arguments, and functors { } as neither. Adjectives {P:N} are the result of a
balanced relationship between predicability and the ability to function as
arguments, whereas in verbs {P;N} predicability takes preponderance over
possible argumenthood and vice versa in nouns {N;P}. To my mind it is
precisely this ‘preponderance’ factor that makes the relation between two
features very hard to formalize any further. Moreover, in Anderson’s system
there is again a conflation between the category by itself, say ‘verb’, and the
functional layer around it. For instance, saying that verbs by themselves may
function as arguments is misleading, as verbs can only function as arguments
when they are embedded within a nominalizing functional shell, in which case
they form part of a mixed projection, such as gerunds, nominalized infinitives
and the like.
2.1 Introduction
A theory of categorial features cannot overlook the question of whether the
lexical classes we recognize as nouns, verbs and adjectives are universal or
particular to a subset of languages. This task becomes more important if one
considers the frequent charge that categories like ‘noun’ and ‘verb’ have been
analytically imposed by Eurocentric grammarians and linguists (mostly) onto
non-Indo-European languages: this has been a matter of debate for at least the
past 80 years.1 This chapter critically reviews some of the points of contention
in the literature on the universality of lexical categories, beginning with the
need to make sure we know what we are talking about when we talk about
nouns, verbs and adjectives. The reason such a caveat is necessary, followed
by some preliminary clarifications and statements on methods and termin-
ology, is because different scholars tend to mean different things when they
talk about nouns, verbs and adjectives – especially, but not exclusively, when
they use them in a mainly pre-theoretical fashion. After these necessary
refinements, the universality itself is discussed, as well as a detailed justifica-
tion of why it was decided not to include adjectives (and adpositions) among
lexical categories in this book.
The universality question is introduced in Section 2.2, and two methodo-
logical guidelines – that is, to distinguish verbs from their functional super-
structure and to use morphological criteria cautiously – are introduced and
examined in Section 2.3. Section 2.4 reviews evidence from Tagalog and Riau
Indonesian that has been claimed to demonstrate grounds for a single undiffer-
entiated lexical category in these languages. Section 2.5 revisits the Nootka
debate and Baker’s (2003) treatment of it, further suggesting that the debate is
1
This debate was recently vigorously revived, or rather ‘updated’, in the wake of Everett’s (2005)
claims that Pirahã is radically and profoundly different from other languages. See Nevins,
Pesetsky and Rodrigues (2009) for a careful assessment.
24
2.2 Are nouns and verbs universal? How can we tell? 25
2.2 Do all languages have nouns and verbs? How can we tell?
We set out in the previous chapter to develop a UG theory of grammatical
category. This entails making claims about the Language Faculty. More
specifically, claims about the existence of (particular) categorial features
within the purported ‘pool’ of UG-available features will be raised; we will
then have to go beyond typological generalizations that could reflect
(a) extralinguistic factors (e.g., all languages have a word for ‘mother’ or
‘sun’ for obvious reasons) or, worse, (b) a methodological bias that seeks
to impose Indo-European categories on languages working in distinctly
non-Indo-European ways.
If our desideratum of a theory of category founded on LF-interpretable
syntactic features is to have any substance at all, we need to ensure first that
differences between nouns and verbs are universal and subsequently to
inquire as to whether the two lexical categories are indeed manifested in all
natural languages.2 This question is not a matter of pure data collection,
however. If the noun–verb distinction is so persistent and fundamental, it
needs to be shown why this is the case and what kind of conceptual distinc-
tion it reflects and encodes. Moreover, on the methodological side of things,
we need to move away from the situation realized by languages like English:
in languages like English, the noun–verb distinction is also one between two
broad syntactic categories – that is, between two categories that regulate or,
at least, affect syntactic distribution: whether X will appear in such and such
a position depends (also) on its word category membership – see Rauh (2010,
325–39). However, as will be seen, this is not the case cross-linguistically: in
some languages, verbs and nouns have a seemingly free distribution; in other
languages nouns and verbs are embedded within layer upon layer of func-
tional material, material which, typically, is morphologically attached to the
lexical element. Finally, a great number of languages possess mixed projec-
tions. In this section we will examine cases of allegedly free distribution
between nouns and verbs, as well as verbs and nouns embedded inside a lot
2
This second statement would be highly desirable but it is not a necessary condition for the
universality of categorial features.
26 Are word class categories universal?
a lot more structure. Illustrating this, let us have a look at an equally straight-
forward example from Turkish, a token of what traditionally is described as a
‘verb’; notice that, as expected from an agglutinating language, it consists of a
single word:
(2) unut- ma- y- acağ- ımız
forget Neg Fut 1 st .Pl
roles the word painting can have in each of the sentences, the different
categories it can belong to and the diverse structures it can participate in.
Thus, whether there are two instances of –ing, an aspectual one and a nom-
inalizer, or just one –ing, an aspectual marker, painting in (4) is a prototypical
noun denoting an object concept, while in (8) it is a prototypical verb denoting
a dynamic event, an activity. Even more interestingly, in the examples
in-between, each instance of painting is embedded in rich (mixed) projections
and involves complex and differentiated structures.3 So, we have a process
nominal, still a noun, in (5), a gerund with verbal and nominal properties in (6)
and a verbal element in (7).
(4) The painting is on the wall.
(5) [Helena’s painting of horses in her free time] proved more than a hobby.
(6) [Helena(’s) painting horses in her free time] proved more than a hobby.
This means that in such languages we should be able to freely insert lexical
elements in both ‘nominal’ and ‘verbal/clausal’ functional projections. We will
see that this is most probably not attested in any natural language. Interest-
ingly, however, we will argue that it appears that a slightly different state of
3
Again, these matters will be looked at in detail in Chapter 6.
2.4 Identical (?) behaviours 29
affairs could well be instantiated in some languages – that is, that we should be
able to freely insert roots in both nominal and verbal lexical projections.
Summarizing, when we talk about the noun–verb distinction in a language
and, consequently, the universality of nouns and verbs, we may discuss three
very different empirical matters:
(10) Noun–verb distinction in a given language
a. Whether there is extensive nominalization of verbs (denominal verbs and
mixed projections) and extensive verbalization of nouns (deverbal nouns).
This is a red herring: verbs and nouns are real in this language, although
they can be morphosyntactically recategorized by a suitable functional
entourage.
b. Whether all roots can become both nouns and verbs. If yes, then
categorization is certainly a grammatical process: nouns and verbs are
grammatical constructs, syntactic categories, not specifications on lexical
items (i.e., word classes). Essentially, the noun–verb distinction still
stands.
c. Whether (9) holds – that is, whether all lexical elements can be used
interchangeably in all contexts. This would be an example of a language
where a noun–verb distinction, and lexical categorization, would be
pointless.
as in Germanic languages like English or Dutch (Don 2004), some roots make
better nouns or only nouns. In Tagalog, the version of this is slightly different:
not all roots can be inserted in any morphological environment. Himmelmann
(2008) illustrates this by discussing ma–, a polysemous prefix. The marker
ma– expresses states (e.g., magandá ‘beautiful’ < gandá ‘beauty’) when
affixed to a specific set of roots; however, it expresses accomplishments
(e.g., magalit ‘become angry’ < galit ‘anger’) when affixed to a different set
of roots. So, magandá cannot mean ‘become beautiful’. The distinction
between the two classes of roots appears to be arbitrary – that is, it is not the
case that roots like gandá (‘beauty’) can only denote states. To wit, using a
different marker, the infix –um–, gandá may also express an accomplishment:
g-um-andá indeed means ‘become beautiful’. Finally, using a different mor-
phological process – stress shift – roots like galit can express states: galit
(‘anger’) versus galít (‘angry’). The above show that Tagalog roots cannot be
freely inserted in any grammatical context. But what about categories like
nouns and verbs? Is it meaningful to talk about such ‘Eurocentric’ (Gil 2000)
categories in Tagalog? The answer seems to be affirmative. Himmelmann
(2008), again as cited in Rauh (2010, 343–4), makes a very interesting point
with respect to what he calls ‘V-words’. Tagalog roots expressing things,
animate beings and actions are nouns when used in isolation, and they become
verbs, ‘V-words’, when they undergo Voice affixation. 4 Although it is unclear
whether the resulting words are simplex or denominal verbs, I think that
processes like Voice affixation make talking about a noun–verb distinction
in Tagalog meaningful. Baker (2003, 185–9) follows a very similar path to
show that a noun–verb distinction is meaningful in a number of languages,
including Tukang Besi – whose behaviour closely resembles that of Tagalog.
There exists, however, an example of a language that has been claimed to
exactly fit the bill in (9). Gil (1994, 2000, 2005, 2013) has consistently claimed
that Riau Indonesian is a case of a language as described under c. in (10): all
lexical material, all lexical elements, can be used interchangeably in all
contexts; they can be freely inserted pretty much everywhere.5 This purported
radical state of affairs would render the noun–verb distinction in the particular
language completely unnecessary and spurious, reducing it to a sort of analyt-
ical and methodological straightjacket. Consider the most famous example of
this lack of lexical categories, as presented in Gil (2005, 246):
4
The apparent ‘default’ nature of nouns will be examined in Chapter 4, Section 4.6.
5
See Gil (n.d.) for a thorough and well-rounded presentation of the Riau Indonesian variety from a
sociolinguistic viewpoint, a matter that has raised some controversy.
2.4 Identical (?) behaviours 31
Gil (2005) claims that in Riau Indonesian any of the glosses above is an
appropriate translation of the two-word sentence makan ayam/ayam makan –
depending on the context, of course. So, indeed, Riau Indonesian is presented
as a category-less language in the most thorough and dramatic fashion.6
Yoder (2010) put this claim to the test, going back to the original naturalistic
data in Gil’s publications, data originating from ‘recorded spontaneous speech’
(Yoder 2010, 2).7 He went on to review the 154 examples presented in Gil’s
published work on Riau Indonesian until 2010 with a very simple goal: to
detect whether there is any correlation between nouns, verbs and adjectives, on
the one hand, and the syntactic functions argument, predicate and modifier, on
the other (Yoder 2010: 7).
His methodology is simple but solid: departing from a comparative view-
point, he took all the words in Gil’s 154 examples and checked them against
the category to which these words belong to in Standard Indonesian, an
extremely similar and much better described variety, which does make cat-
egorial distinctions. Matching the Riau words to the Standard equivalents, he
‘found a total of 271 nouns, 161 verbs, and 55 adjectives’ (Yoder 2010: 7). He
then revisited each of the 154 examples and examined whether items
6
We must be very cautious with what could be called the ‘Hopi fallacy’ – that is, claims of
exceptionality stemming from an exoticized (re)presentation of a language – on behalf of field
linguists studying it. See, for instance, Nevins, Pesetsky and Rodrigues (2009) on the progres-
sively exoticized grammatical descriptions of Pirahã.
7
Gil does not seem to systematically ask for native speaker intuitions or to elicit responses. This
is, to my mind, the major methodological obstacle in substantiating claims of Riau Indonesian
being category-less.
32 Are word class categories universal?
Yoder proceeded to tackle the non-default cases and found that the majority
thereof fall under five morphosyntactic patterns, which are also attested in
Standard Indonesian: (i) nominal modifiers (belonging to all three categories)
following the noun they modify; (ii) equative clauses with a nominal or an
adjectival predicate; (iii) three instances of noun phrases headed by a phonolo-
gically null noun (like English four reds and a yellow); (iv) use of a verbalizing
prefix N-; (v) use of a verbalizing suffix –kan attaching on adjectives.8
In the light of the above, I think we can safely consider Riau Indonesian as a
run-of-the-mill language, at least as far as its lexical categories are concerned,
pending a more in-depth investigation of its grammar, based on more empirical
evidence, compounded by elicited responses and – crucially – speaker intu-
itions. Until then, Yoder (2010) makes a very convincing point that this variety
has nouns, verbs and adjectives both as syntactic categories and as word
classes, just like Standard Indonesian.
8
The three functions that do not fall under either the default cases or the five non-default patterns
involve a noun in predicate position, a noun in verbal modifier position and a verb in argument
position. None of these is typologically outlandish, let alone unheard of.
2.5 The Nootka debate (is probably pointless) 33
broadly speaking, most analyses arguing that languages like Nootka and
Lillooet Salish do not distinguish between nouns and verbs are usually informed
by the fact that lexical elements can all function as predicates in isolation and
they can be differentiated with respect to their syntactic role only via particular
functional elements. Evans and Levinson (2009, 434–5) eloquently summarize
the case for the non-existence of a noun–verb distinction as follows:
A feeling for what a language without a noun-verb distinction is like comes
from Straits Salish. Here, on the analysis by Jelinek (1995), all major-class
lexical items simply function as predicates, of the type ‘run,’ ‘be_big,’ or
‘be_a_man.’ They then slot into various clausal roles, such as argument (‘the
one such that he runs’), predicate (‘run[s]’), and modifier (‘the running
[one]’), according to the syntactic slots they are placed in. The single open
syntactic class of predicate includes words for events, entities, and qualities.
When used directly as predicates, all appear in clause-initial position,
followed by subject and/or object clitics. When used as arguments, all lexical
stems are effectively converted into relative clauses through the use of a
determiner, which must be employed whether the predicate-word refers to an
event (‘the [ones who] sing’), an entity (‘the [one which is a] fish’), or even a
proper name (‘the [one which] is Eloise’). The square-bracketed material
shows what we need to add to the English translation to convert the reading in
the way the Straits Salish structure lays out.
There are two matters to review in the summary above, which echoes Jelinek’s
extensive work on Salish.
First, we have the fact that in most Salish languages arguments need a
Determiner, as illustrated in (16) for Lillooet Salish and in (17) for Nootka.9
This holds whether they are doubled by argument clitics on the predicate or
not – that is, by whether they are true arguments or DP adjuncts (Jelinek 1996).
However, does this necessarily entail that ‘when used as arguments, all lexical
stems are effectively converted into relative clauses through the use of a
determiner’?10 Should this be so, then it would be a strong argument for all
lexical stems in Salish being inherently predicative. Not necessarily, however.
In Romance, Determiners are generally necessary for nouns to function as
arguments and predicate nouns may surface without a Determiner – this is
what happens in Salish, too.11 Moreover, we do not know what lexical stems
9
In Makah, nouns like wa:q’it (‘frog’) can function as arguments with or without a Determiner
suffix –oiq; however, verbs must be suffixed with –oiq in order to do so (Baker 2003, 179).
10
Emphasis added.
11
Of course, Salish is not Romance! All that is highlighted here is the obvious fact, also reviewed
in Chierchia (1998) and elsewhere, that obligatory Determiners in argument positions, even
with proper names, cannot be taken as direct evidence for the inexistence of nouns.
34 Are word class categories universal?
Once we look at the present and the future tenses, it becomes obvious that the
suffix –dI is the exponence of two different elements: of a past tense copula
and of past tense. The conclusion is that we must be very careful before
making claims of non-distinctness, given that the morphological exponence
of functional categories within which nouns and verbs are embedded can
mislead us. As repeatedly pointed out above, the essential task is threefold:
first, to ensure that homophony of affixes is just that – a matter of exponence
and forms; second, to correctly distinguish nouns and verbs from their func-
tional entourage; and, third, to correctly tell whether the root embedded within
the various functional structures is a categorized root (i.e., lexical noun or
verb) or an uncategorized root.
Let us, however, turn to some real data from Lillooet Salish and Nootka
(Baker 2003, 173–89). Lillooet Salish can use any lexical category as a
predicate, something to be expected if all lexical categories are indeed
predicates (Higginbotham 1985), using the same (?) agreement-like
suffix:12
(15) Lillooet Salish: predicates
a. Qwatsáts-kacw
leave-2ndsg
‘You leave/left.’
b. Smúlhats-kacw
woman-2ndsg
‘You are a woman.’
c. Xzúm-lhkacw
big-2ndsg
‘You are big.’
So far, the situation could be similar to what happens with Turkish in (13).
Nevertheless, as already mentioned, in Salish languages there is a determiner-
like element that can nominalize anything, while whatever appears in the first
position of a sentence can act as a predicate (Baker 2003, 175).
(16) Lillooet Salish: arguments
a. Qwatsáts-Ø ti smúlhats-a
leave-3rdabs the woman-the
‘The woman left.’
12
The examples are from Baker (2003, 175).
36 Are word class categories universal?
b. Smúlhats-Ø ti qwatsáts-a
woman-3rdabs the leave-the
‘The one who left is a woman.’
What is worth observing above is that in (16), the predicate – that is, the first
word in the sentence – takes a null subject agreement marker (third-person
absolutive) that links it with a constituent nominalized by a determiner(-like
element). Once more, the situation in (13) springs to mind, where –dI can be
either a past tense marker (with the verbal stem bayıl) or a past tense copula
(with the noun muhtar). Nootka in (17) is, on the other hand, much more
interesting, as the suffix attaching on both ‘work’ and ‘man’ in the first
position is one encoding indicative Mood.
Abstracting away, the workings of roots illustrated above could in reality
be no different to the ones in English: the category of lexical elements
becomes visible through the morphology surrounding the roots and by
these elements’ syntactic position. If one goes for the view that some roots
can make only nouns or only verbs, the true question here is whether there
are Salish roots that cannot be made into first-position predicates, as, for
example, boy and – for most speakers – cat cannot be made into verbs in
English. Baker (2003, 177 et seq.) proceeds to ‘isolat[e] the lexical heads
from their functional support systems, to see if the noun-nonnoun contrast
reemerges in those environments’. Revisiting candidate languages for a
lack of noun–verb distinction, he shows that in Greenlandic and Nahuatl
only nouns incorporate, despite the wide use of substantivized adjectives
as nouns in other contexts; that in Kambera, Samoan, Tongan and
Niuean (Austronesian) only true nouns can appear as arguments without
a determiner (see also footnote 9). Revisiting Lillooet Salish, Baker
(2003, 182) follows Davis (1999) in foregrounding a very important
observation: while the determiner ti–a can nominalize anything, as shown
in (16), only true nominals can take the demonstrative ti7 plus the deter-
miner ku:
2.6 Verbs: everywhere, not always as a word class 37
So, even in Lillooet Salish there are lexical elements that display an unam-
biguously nominal behaviour. This confirms our suspicions that in the b.
example of (16) there is more structure than meets the eye; after all, subjects
like ti qwatsáts-a a (det leave det) ‘the one leaving’ has been standardly
analysed as a reduced relative.13
13
Or, perhaps, a mixed projection (see Chapter 6).
38 Are word class categories universal?
The most exciting case of a language with complex verbs would be Jingulu,
an Australian language discussed in Pensalfini (1997) and Baker (2003,
90–4). Jingulu possesses only three verbal elements – ‘come’, ‘go’ and
‘do/be’ – and there are no (other) lexical verbs, unlike Japanese and Farsi,
which do possess a few lexical verbs. ‘Come’, ‘go’ and ‘do/be’ in Jingulu
may stand as the only verbal predicates in a sentence and they are strictly
obligatory; additionally, they are the only necessary elements in a sentence.
In order for other verbal concepts to be expressed, one of these three
elements must be combined with what looks like a bare root. Here are some
examples from Baker (2003, 90–1):
(21) Jingulu verbs
a. Ya-angku
3rdsg-will.come
‘He will come.’
b. Jirrkiji-mindu-wa
run-1stincl.du-will.go
‘You and me will run off.’
c. Ngaruk baka-nga-rriyi
dive 1stsg-will.go
‘I’ll dive down.’
d. Ngaruk baka-ngayi arduku
dive 1stsg-will.do carefully
‘I’ll submerge (something) carefully.’
Pensalfini (1997) argues that ‘come’, ‘go’ and ‘do/be’ are light verbs.
I understand that ‘light verbs’ in this instance can possibly be equated to
semi-lexical verbs (see Chapter 4). More concretely, they could be the lexi-
calization of verbalizers themselves, in the manner of Folli and Harley (2005),
who claim that three flavours of v exist, vCAUS, vDO, and vBECOME, each
2.6 Verbs: everywhere, not always as a word class 39
Appealing to the Farsi case is valuable, because there is consensus that its
complex predicates are made of light verbs and a non-verbal element. Neverthe-
less, both elements contribute to the properties of the overall complex predicate.
In other words, although Jingulu roots affect the type of the complex predicate’s
thematic grid, this is no evidence that these roots are verbal. I therefore believe
we can adhere to an analysis in the spirit of Pensalfini (1997); after all, even in
inflected languages with solid lexical verb paradigms, like Greek, verbs may
display an articulated structure in which the verbal or verbalizing element is
kept distinct from the acategorial root material (Panagiotidis, Revithiadou and
Spyropoulos 2013). At the same time, languages like Jingulu, Japanese and
Farsi and many others lend support to the syntactic decomposition approach to
lexical categories, which will be described and presented in Chapter 3.
14
More precisely, Jingulu ‘come’, ‘go’ and ‘do/be’ would be the exponence of verbalizers fused
with Tense heads, as glossed in (21).
40 Are word class categories universal?
15
The interested reader is referred to the works cited in Baker’s (2003) survey for further details
and a more in-depth scrutiny of the matter.
2.8 What about adjectives (and adverbs)? 41
unmarked, are restricted to two positions: (i) predicate positions, with the
mediation of a functional predicator Pred, just like nouns and other syntactic
constituents, yielding examples like The flag is red; I painted the wall green,
and (ii) modifying positions: The red flag; A house with green walls.
In order to formalize adjectives as the unmarked lexical category, Baker
(2003, chap. 4) is restricted by his two-membered privative feature system
(i.e., [N] and [V]), so he has to argue that adjectives do not bear any categorial
specification – that is, no categorial feature whatsoever. This claim has at least
two consequences:
a. They have no ‘special characteristics’ of their own as a category
(Baker 2003, chap. 4), unlike nouns and their relationship with
determiners and ‘counting’ or verbs and their argument structure
and their relation with tense;16
b. As practically admitted in Baker (2003, chap. 5), adjectives are
essentially identical to roots: their categorial specification is precisely
the lack of any categorial specification, if category is encoded by
features like [N] and [V].
The above entail at least that adjectives are different from nouns and verbs. Of
course, in Baker’s theory they are different because they are the unmarked,
basic lexical category. Even if this basic intuition is wrong, as I will propose,
the fact remains that adjectives are different: either because they are catego-
rially featureless roots (which I will disagree with) or because they are not a
fundamental and universal lexical category like nouns and verbs (which seems
to be going in the right direction).
16
This decision is of course reminiscent of Déchaine’s (1993) treatment of prepositions. See
Section 1.3 in Chapter 1.
17
After all, what would categorize a root into an adjective if no categorial features for
adjectives exist?
2.8 What about adjectives (and adverbs)? 43
230) calls them the ‘elsewhere case’ among lexical categories. Now, if
adjectives are indeed the unmarked lexical category, expressing, say, pure
properties, then we would expect
a. adjectives to be universal: if categorially unmarked roots surface as
the category adjective, then all languages should possess this
category, even more so than verbs: after all, in Baker’s system, a
language without [V] on lexical elements could potentially exist,
having all predication mediated by the functional predicator Pred;
b. adjectives not to have any ‘special’ characteristics of their own
whatsoever.
However, a lot of work on adjectives seems to be pointing in the opposite
direction and evidence that adjectives are not a universal category is abundant.
Besides those discussed in Dixon (1982), Schachter (1985), Hale and Keyser
(2002, 13–14) and Amritavalli and Jayaseelan (2004), more languages seem
not to possess an adjective category, let alone one that is the elsewhere case.
If adjectives are indeed the unmarked category, what are we to make of
Chichewa which has six adjectives (Baker 2003, 246–9) or of Hausa which
is reported to possess no more than a dozen? And there also exist languages
without an adjective category; in these languages, three scenarios are typically
attested.
First, in languages like Japanese, the ‘adjective category’ is split in two
(Miyagawa 1987) and adjective-like words behave as if they belong to two
distinct classes with precious little in common. See also the discussion of
Japanese ‘adjectival nouns’ in Iwasaki (1999, chap. 4).
In Korean (and Navajo) adjectives behave as a subclass of the verb category.
Citing Haspelmath (2001, 16542), ‘in Korean, property concepts inflect for
tense and mood like verbs in predication structures [see (22) below], and they
require a relative suffix . . . when they modify a noun [see (23) below], again
like verbs’.18 The examples below (Haspelmath 2001, 16542) illustrate the
situation:
(22) a. salam-i mek-ess-ta
person-nom eat-past-declarative
‘The person ate.’
18
It is true that Korean property verbs do not take the present-tense suffix –nun, but I am not sure
this is enough to make them a lexical category on a par with nouns and verbs, as opposed to a
subclass of verbs or a verbal category.
44 Are word class categories universal?
b. san-i noph-ess-ta
hill-nom high-past-declarative
‘The hill was high.’
Chichewa also seems to overwhelmingly use nouns in the way other languages
use adjectives.
Turning to languages with a distinct adjectival category that constitutes
an open class, this does not possess any of the hallmarks of an unmarked
lexical category, let alone of a category that is posited to be the direct
manifestation of a root without any categorial features. In Romance,
Germanic and Greek a large number of adjectives, if not the majority, are
derived from nouns and verbs, as opposed to being derived directly from
roots. A good case in point is that of Slavic, which makes extensive and
productive derivation of adjectives via suffixes for a number of purposes –
for example, for the resulting adjectives to function as possessives in lieu of
simple genitives. This is elegantly exemplified in Russian, as described in
Valgina, Rosental and Fomina (2002, sec. 145), which the review below
follows closely.20
19
Both examples are from Amritavalli and Jayaseelan (2004, 29), who also make the conjecture
that ‘there is . . . an implicational relationship between the absence of adjectives and the
prevalence of the dative experiencer construction’. As in Chichewa, there are in Kannada ‘a
few indisputable underived adjectives, such as oLLeya “good”’.
20
I am grateful to Svetlana Karpava and Olga Kvasova for discussing the Russian facts with me
and for lending me their native speaker intuitions. Karpava kindly translated Valgina, Rosental
2.8 What about adjectives (and adverbs)? 45
and Fomina (2002, sec. 145), from which examples are drawn, as well as from Karpava
(personal communication, April 2013) and from Kvasova (personal communication,
April 2013).
46 Are word class categories universal?
adjectives, that display concord with the noun inside a DP are all functional
elements: quantifiers, demonstratives, possessives and articles. This is starkly
illustrated in Modern Greek.
(27) Concord
a. olus aftus tus neus typus
all.m.acc.pl these.m.acc.pl the.m.acc.pl new.m.acc.pl type.m.acc.pl
‘All these new types.’
b. oli afti i nea sodia
all.f.acc.sg this.f.acc.sg the.f.acc.sg new.f.acc.sg crop.f.acc.sg
‘All of this new harvest.’
21
This is precisely Déchaine’s (1993) argument for treating prepositions as not encoding any
categorial features; see Section 2.9.
2.8 What about adjectives (and adverbs)? 47
Class 1 Degree expressions (very, as, too, that, how) c-select an Adjectival
Phrase (AP), which they must precede: they take the AP as a complement. This
suggests that Class 1 Degree heads are members of the adjectival projection
line, possibly of the so-called Extended Projection of A.22 Class 1 Degree
expressions need much (a dummy A) in order to combine with non-APs: [[DegP
Deg much] [XP]]; they tolerate neither topicalization nor freer linearization
options – something to be expected if they take APs as complements qua heads
which are members of the projection line of A. Class 1 Degree expressions can
be claimed to be to the category A what D is to N and T to V.
In contrast, Class 2 Degree expressions (more, less, enough, a little, a good
deal) have no c-selectional requirements and they also combine with Prepos-
ition Phrases (PPs), Verb Phrases (VPs) and so on, apparently as adjunct
modifiers:
(29) Class 2 Degree expressions
22
Grimshaw (1991, 2003).
48 Are word class categories universal?
23
The view that Class 1 Degree expressions are specific to adjectives will be qualified and revised
in Chapter 5.
24
Corver (2005), more reasonably in my view, takes –ly to be a copular element. This perhaps ties
in nicely with the Davidsonian insights in Larson (1998).
2.9 The trouble with adpositions 49
Consider the following examples from Panagiotidis (2003a), where the form of
can behave both as lexical (a. examples) and as functional (b. examples):
(30) a. My favourite picture is [of the Vice-Chancellor].
b. * My favourite student is [of Chemistry].
Lexical of behaves like a θ-assigning lexical head and its phrasal projection
does not have to be adjacent to the N, as shown in (30)a.; it can also extrapose,
as in (31)a.25 The reverse is true for functional of and its projection: its
licensing depends on whether the N student in the b. examples can assign a
θ role to it, as the alternation between Chemistry student and student of
Chemistry in (32) also suggests. It is unclear how this state of affairs can be
captured in a satisfactory and non-stipulative way in an analysis of adpositions
as complex structures. However, both the solution of splitting the category
Adposition in two and the one assuming adpositions to be a hybrid category in
the best of cases give the impression of merely restating the facts at a more
abstract level.
Déchaine (1993, 32–6), on her way to arguing for adpositions as the
elsewhere lexical category, observes that there is no English compound headed
by an adposition. So, although adpositions (or, perhaps, particles) may be part
of a compound like up-root, under-dog or over-cast, they can never be the
head of a compound, unlike nouns, verbs and adjectives. Her second general-
ization is twofold: on the one hand, there is no affix that derives adpositions;
on the other hand, very few derivational affixes attach to adpositions. Déchaine
(1993, 33) lists English words derived from an ‘adpositional’ stem: off-ing,
about-ness, under-ling, upp-ity. However, she argues adpositions to be a
lexical category, and not a closed-class either. At this point, we could say that
adpositions are indeed not word classes but complex and articulated syntactic
structures in which certain, one or more, lexical items are visible.
In this monograph, we will assume without any further discussion that,
despite all the complications, adpositions are complex structures à la Sveno-
nius (2007, 2008), which include both clearly defined functional heads like
25
Oga (2001) independently reached the conclusion that there are two types of of. Tremblay
(1996) shows exactly the same to be the case with with.
2.10 Conclusion 51
2.10 Conclusion
First of all, the noun–verb distinction is indeed universal: nouns and verbs are
distinct syntactic categories and, in a large number of languages, they are also
distinct word classes. Two examples of languages that have been claimed to
possess a single undifferentiated lexical category, Tagalog and Riau Indones-
ian, turn out to actually make a distinction, at least between nouns and verbs.
On top of this, and with the possible exception of Nootka, there are roots that
can only become nouns, not verbs. A syntactic categorization approach,
correctly calibrated, should capture this, together with the universal presence
of nouns and verbs.
When we say that verbs are universal we do not necessarily refer to those
exemplified by latinate verbs in English – for example, donate. Single-word
verbs in inflecting languages already contain a lot of structure, structure that is
actually visible in such languages as Japanese, Farsi and Jingulu: these and
other languages combine a light verb with some lexical material to yield
complex verbal predicates. The essential point here is that verbs, including
light verbs, are universal. In Chapter 4 I will actually argue that it is v heads,
verbalizers, that are universal.
Despite what surface descriptions may suggest (confounded and misled
either by morphological complexity or by the radical absence of familiar
morphological distinctions), verbs and nouns cannot be used interchangeably.
Repeating the initial point here: roots may be inserted in a number of gram-
matical contexts categorizing them, but categorized nouns and verbs are not
52 Are word class categories universal?
3.1 Introduction
As already stated in the first chapter, the approach to category in this monograph
is committed to conceiving of word classes as encoding different categorial
features. These features are to be understood not as taxonomic ones creating
word classes of a morphological nature – for example, like declension and
conjugation classes – but as genuine LF-interpretable features, as instructions
to the Conceptual–Intentional Systems. Put differently, we will be very serious
in taking [N] and [V] to be ordinary syntactic features: interpretable, with
uninterpretable versions, triggering syntactic operations and imposing syntactic
constraints. In order to do this, we will embrace an approach to categorization
which takes it to be a syntactic process. Arguing for syntactic categorization will
enable us to work with categorial features as bona fide syntactic features and also
to account for cross-linguistic typological data – for example, categorization
behaviours such as the ones outlined in the previous chapter.
The following section poses the question of how words are made, of where
morphology is located. The point of view assumed is that of a syntactician
seeking to understand the workings of categorial features, and lexical and
functional categories more specifically. After a brief and very sketchy critique
of lexicalism the Distributed Morphology take on word-making is introduced.
The following two sections, 3.3 and 3.4, zoom in on some details of this
approach – that is, how Distributed Morphology can explain idiosyncrasies in
the meaning of morphologically complex words, if all structure-building,
including word-building, is done in syntax, a system typically conceived of
as yielding compositional interpretations of the items it combines. Section 3.5
examines conversions in order to reveal how zero-derived verbs can have very
different derivational histories, being either the product of root categorization,
like the verb hammer, or of recategorization, like the denominal verb tape.
Section 3.6 introduces the very basics of Phase theory in a simplified form and
relates its implementation of cyclicity in grammar with respect to how we can
53
54 Syntactic decomposition and categorizers
1
Although the analysis here is couched within Distributed Morphology, I believe that any
consistently realizational morphological framework can be employed equally well, as long as
it incorporates
a. a separationist distinction (and/or a dissociation) between syntactic feature structures
and their morphological exponence (Beard 1995) and,
b. syntax-all-the-way-down, as in Marantz (1997) and Harley and Noyer (2000) – that
is, taking the same combinatorial mechanism to lie behind both word-building and
sentence-building.
3.2 Where are words made? 55
2
Lieber and Scalise (2007) provide a very informative and thorough overview of the Lexical
Integrity Principle and the different flavours of lexicalism.
56 Syntactic decomposition and categorizers
(1) Word-making
Lexical/
‘morphological’ Syntactic
word-making word-making
In the picture adumbrated in the table in (1) words can be created in three
distinct ways (or, using a more Y-model-oriented metaphor, in three different
places). First, there are roots already coming from the lexicon with a categorial
specification. Thus, some nouns and verbs are roots specified for [N] and [V],
as nouns and verbs, and their category is learned; it is memorized. Examples of
such roots categorized in the lexicon would include cat, dog, coffee, bake, run,
kill and so on. Second, some nouns and verbs are created via lexical word-
making, via derivational morphology of a lexical nature – say, by combining
roots with nominal ([N]) and verbal ([V]) affixes through non-syntactic, mor-
phological processes: for example, quant-ify, cert-ify, cert-ifi-cate, free-dom,
geek-dom, the denominal zero conversion giving the verb water and so on.
Finally, syntactic word-making is also possible. Therefore, some nouns and
verbs are assembled during the syntactic derivation by means of syntactic
processes that combine nouns, verbs and adjectives with category-changing
syntactic heads (which also may surface as affixes). Examples of this would be
gerunds (train-ing) and adjectives like kind-ness, bleak-ness and so on, where
–ness behaves like a nominalizing syntactic head according to the criteria in (1).
Weak lexicalism, exemplified by the picture presented in the table above, is
a response to empirical necessity: strong lexicalism fails to capture important
generalizations because it assigns all word-making to the lexicon or to the
purported morphological combinatorial component. At the same time, weak
lexicalism is not particularly desirable from a conceptual viewpoint, as there
are now three distinct ways of making words; it is also empirically problematic
as Halle and Marantz (1993) and Marantz (1997) discuss at some length.
Until the mid-nineties, most syntactic analyses in the major generative frame-
works would assume that syntax manipulates words and/or morphemes. For
instance, in most of Chomsky’s work, such as in Chomsky (1957, 1995) (to name
but two landmark cases), the syntactic derivation manipulates words and
3.2 Where are words made? 57
The picture is quite telling: there is no single relation between a noun and a
verb. So, the verb sweat means ‘make sweat’ but the verb butter does not mean
‘make butter’; the verb box means ‘put in a box’ but in the verb brush the root
brush is about the instrument of applying x on a surface y – and so on. In
other words: there is a diverse number of types of verbs when it comes to the
relation they have with co-radical nouns and adjectives. As Levin (1993)
points out, these different types of verbs are systematic; they are most likely
the result of where the verbal root appears in the argument structure of the verb
(Hale and Keyser 2002). Thus, once we have a closer look, we realize that
there is more regularity in the possible meanings of noun–verb pairs than is
initially apparent. Broadly speaking, the root in the verbs in (2) behaves as a
direct object in sweat and push and as a predicate designating an end state in
change of state verbs like clear and open. Equally interestingly, in Location
verbs (e.g., box) the root behaves something like the object of a Preposition
Phrase, designating the background. In Locatum verbs (e.g., butter), however,
the root behaves like the subject of a PP, the figure to be placed against
a background. Finally, in an instrumental verb like brush the root acts as a
manner adjunct, an instrument adjunct, more precisely.
One issue that can be noted already is the complexity of some verbs’
argument structure: if we are to generate such structures ‘morphologically’,
we must make sure we do not gratuitously duplicate the workings of syntax
or, at least, those mechanisms familiar from the workings of phrasal syntax.
For instance, it is very interesting that change of state verbs seem to embed a
small clause consisting of the object x of the verb and the end state predicate,
and that the end state predicate is the root that ‘names’ – to echo Harley
(2005b) – the verb. The argument structure of an instrument verb such as brush
60 Syntactic decomposition and categorizers
seems to be even more complex: there, the root naming the verb is the
instrument, the modifying adjunct. Moreover, the surface direct object of the
verb brush sets the background: for example, we brush the roast thoroughly
(with a honey and mustard mix) but we do not #brush a honey and mustard
mix thoroughly although we can brush a honey and mustard mix on the roast
thoroughly; such structural alterations strongly suggest that displacement of
the syntactic sort is at play.
A second, and even more vital, point is that syntactic terminology like
‘subjects’, ‘objects’, ‘adjuncts’, ‘predicates’ turns out to be quite useful when
talking about the position of a root within the verb’s argument structure.
Briefly, and following the lead in Hale and Keyser (1993, 2002), we can claim
that different argument structures are actually different syntactic structures.
Idiosyncrasy is only apparent, as the different types of verbs like those in (2) –
and many more – boil down to where the root is to be found within the
argument structure of the verb: that is, inside a bona fide syntactic structure.
In this respect, we are justified in using syntactic terminology to discuss
argument structure because it looks like (such) verbs contain some hidden
syntax, what Hale and Keyser (1993) call L-syntax and Ramchand (2008) calls
First Phase Syntax. Harley (2005b) and Marantz (2005) explain in consider-
able detail how to analyse verbs in this way, within a syntactic decomposition
framework that embraces a realizational–separationist view on the syntax–
morphology relation – that is, Distributed Morphology.
3
We are not looking at the whole panorama of lexical idiosyncrasy here – for example, the non-
compositional interpretation of compounds – and we just focus on these aspects of the problem
that are more pertinent to matters of syntactic categorization.
4
Examples in (3) are from Chomsky (1970) and those in (4) are from Panagiotidis (2005) –
originally from Clark and Clark (1977).
3.4 There are still idiosyncrasies, however 61
The above pairs (and many more) share a root, but it is extremely improbable
that the meaning of the verb could be guessed – let alone be derived compos-
itionally – by that of the noun, or vice versa, in the way things go with sweat,
push, box, brush and so on above. Incidentally, as argued in Acquaviva and
Panagiotidis (2012), examples like those in (3) and (4) render very problematic
the idea that roots are underspecified but still meaningful elements which give
rise to distinct interpretations depending on their immediate syntactic context
(Arad 2005): ‘even if we argue for impoverished and semantically under-
specified roots, we are still left with the empirical problem . . . that roots too
often do not capture a coherent meaning . . .. This renders unlearnable the
purported “common semantic denominator” roots are supposed to express’
(Acquaviva and Panagiotidis 2012, 109). To wit, what could be the ‘common
semantic denominator’ between revolution (as ‘uprising’ or ‘regime change’)
and revolve? Something as vague as ‘change’? Surely this is hardly coherent,
and the interested reader is again referred to Acquaviva and Panagiotidis
(2012) and, also, Borer (2009).
The stand taken in Marantz (1997, 2000) and elsewhere on pairs like (3) and
(4) is that they involve underlying syntactic structures which receive idiosyn-
cratic interpretations. Now, idiosyncratic interpretations of variously-sized
syntactic objects are not really an oddity: they are everywhere and they are
called idioms. Trivially, break a leg, kick the bucket, raining cats and dogs
are all syntactic phrases, vPs according to Svenonius (2005). However, they
have idiosyncratic meanings.
The idiom idea is definitely useful: lexical idiosyncrasies are not the result of
a ‘lexical/morphological’ way of making words, but the result of idiomatic
interpretation of otherwise ordinary structures. There are two questions, how-
ever: first, why can the meanings of words (nouns and verbs) made of the same
root be so distinctly, even crazily, different and to such a considerable extent?
Second, as Hagit Borer (personal communication, June 2009) pointed out,
phrasal idioms can typically also receive a fully compositional interpretation
when properly contextualized: kick the bucket may also simply describe a
bucket-kicking event. However, the verb castle is impossible to interpret
compositionally – for example, as a Location verb – and can only mean the
chess move it idiosyncratically names. Why should such a difference between
phrasal and ‘lexical’ idioms exist?
Panagiotidis (2011) seeks to answer both questions above by arguing the
following points. If it is indeed the case that roots by themselves are
characterized by impoverished or, perhaps, inexistent semantic content, then
non-compositional and idiosyncratic interpretations of syntactic material
62 Syntactic decomposition and categorizers
directly involving roots are the only interpretive option: after all, how could
compositional interpretation deal with the un- or under-specified meaning of
roots? In other words, idiomaticity is the only way to interpret constituents
containing roots, exactly because of the semantic impoverishment/deficiency
of roots. This point is already alluded to in Arad (2005) and fleshes out
the conception of the categorizing phrase in Marantz (1997, 2000) as the
limit of (compulsory) idiomaticity. However, systematic idiomaticity of
the categorizer projections is not due to the categorizer acting as some sort
of a limit, below which interpretation is/can be/must be non-compositional.
Rather, idiomaticity – that is, matching a structure with a memorized mean-
ing stored in the Encyclopedia (Harley and Noyer 1998) – stems from the
fact that the first phase (an nP or a vP)5 contains a root, an LF-deficient
element, which would resist any compositional treatment anyway. In other
words, the semantically deficient character of the root blocks the application
of a rule-based compositional interpretation. Therefore, inner versus outer
morphology phenomena (Marantz 2006) are due to the semantic impoverish-
ment of roots: once roots have been dispatched to the interfaces with the rest
of the complement of the categorizer, compositional interpretation may
canonically apply in the next phase up – see also the next chapter and
Panagiotidis (2011, 378–9).
Keeping these points in mind, we can now continue with the rest of this
chapter, which will be dedicated to reviewing the literature on syntactic
categorization. This will set the context in which the theory of categorial
features to be posited applies.
3.5 Conversions
Syntactic decomposition can advance our understanding of why some verbs
have a predictable, compositional relationship with their corresponding
nouns (as discussed in Section 3.3 above) whereas others display only an
arbitrary and idiosyncratic one (as reviewed in Section 3.4). The very short
answer, which follows Kiparsky’s (1982) analysis of the two levels of
morphological processes, is that the former are denominal verbs, whereas
the latter are root-derived. Staying with Arad (2003), whose account we
will closely follow in this section, we will call (true) denominal verbs tape-
type verbs and those verbs that are directly derived from a root hammer-type
verbs. The structural differences between them will be argued to account
5
See Chapter 4 for details.
3.5 Conversions 63
In the tree to the left, the nominalizer head n takes a root complement,
nominalizing it syntactically. In the tree to the right, the root hammer is a
manner adjunct to an xP (schematically rendered) inside the vP, more or less
along the lines of what was said about brush in Section 3.3 above, a configur-
ation responsible for the manner interpretation of the root hammer within the
verb hammer. Crucially, the verb does not mean ‘hit with a hammer’ but it
takes up a broader interpretation, roughly ‘hit in a hammer-like fashion’ –
maybe using a stone, a shoe, a fist, or similar.6
On the other hand, verbs like tape behave differently. These seem to be truly
denominal, formed by converting a noun into a verb, by recategorizing the
noun and not by categorizing a root. By hypothesis, the verbalizing head takes
as its complement a structure that already contains a noun – that is, an nP in
which the root tape has already been nominalized.
(6)
6
Once more, this raises the question of roots’ inherent content, if any, as an anonymous reviewer
points out: what is the content of hammer? The answer I am partial to is ‘none’; more on this
line of reasoning and on ways the semantics could be executed can be found in Acquaviva
(2009b), Borer (2009), Panagiotidis (2011) and, in detail, Harley (2012a).
64 Syntactic decomposition and categorizers
with tape’ – not with rope, pushpins, paperclips or even bandages. In other
words, the meaning of the verb tape is not idiosyncratic but it wholly and
predictably relies on the meaning of the noun tape, which it contains embed-
ded in an adjunct position. This difference cannot be due to the verbalizer v,
which is possibly the same in both cases. It therefore must be due to the fact
that the verbalizer makes a verb out of root material in (5), yielding the verb
hammer, whereas in (6) it makes a verb out of a ready-made noun, tape.
This entails that hammer-type verbs can receive special meanings, whereas
tape-type verbs do not.7
The general question here concerns the specifics of how these differently
derived verbs, those categorizing a root and those recategorizing a noun, come
to be. This question reduces in part to the whole issue of what is the limit of
‘special meaning’ in a syntactic tree. Inspecting the trees here, and as is the
consensus in research, the limit of special, idiosyncratic meaning in a syntactic
tree is the first categorizer projection as in Arad (2003, 2005) and Marantz
(1997, 2000, 2006). Having said that, good arguments of an empirical nature
are levelled against this thesis in Anagnostopoulou and Samioti (2009), Borer
(2009), Harley (2012a) and Acquaviva and Panagiotidis (2012): the above
seem to point towards the direction that, at least in the verbal/clausal domain,
it is the Voice Phrase (or an even higher projection) that constitutes the ‘limit’
of idiosyncrasy.
If the first categorizer projection is the limit of idiosyncratic interpret-
ation, then the verb hammer has a special meaning because categorization,
brought about by the v head in (5), includes the x projection with its
hammer root adjunct. On the contrary, the verb tape in (6) includes an x
projection with an nP adjunct – that is, an already formed noun, an already
categorized constituent. The nP has already received an interpretation of
‘(sticky) tape’, so the range of possible interpretations of the vP containing
it is significantly limited: ‘stick x on y with tape’. The obvious problem that
emerges now is why the first categorizer projection should be the limit of
‘lexical’ idiosyncrasy. Why is it not the first projection containing the root,
like those indicated as xP in (5) and (6)? What is special about the first
categorizer projection?
7
I have deliberately chosen not to represent the position of the internal argument of hammer and
tape in the diagrams (5) and (6) purely for reasons of exposition. I would, however, think they
are merged as the specifier of x or v. The choice depends on the nature of the argument, as
discussed in Marantz (2005).
3.6 Phases 65
3.6 Phases
In order to answer the question of why the first categorizer projection is the
limit of ‘lexical’ idiosyncrasy, we need to appeal to cyclicity in syntax.
The first syntactic cycle, which typically, but not necessarily, coincides
with the word domain, manipulates impoverished roots; hence it is impossible
for this first cycle to yield compositional interpretations. Once the interpret-
ation of the first cycle is ‘idiomatically’ fixed, then its interpretive output may
be used by higher cycles to yield compositional interpretations.
Let us follow the consensus in syntactic literature that syntactic operations
are local and cyclical. A specific way to conceive cyclicity, one that makes
precise and falsifiable empirical predictions, too, is through Phase Theory, as
proposed and developed in Chomsky (2000, 2001, 2008) and a lot of subse-
quent work. Let us see how thinking in terms of phases may enable us to
conceive the ‘character’ of the categorizer projection. According to Marantz
(2006), a categorizer phrase (vP or nP) is a Phase. The other two phases are
VoiceP8 and Complementizer Phrase (CP). So, phases are both interpretive
(meaning) units and phonological–prosodic ones – but see Hicks (2009) for
solid arguments from binding that we need to separate LF-phases from
PF-phases.
Let us turn to an example of a derivation with phases. Suppose we start
building a tree – say, a clausal one:
(7)
8
There exists some terminological confusion on what the label v stands for. For the work cited and
followed here, it is the verbalizing categorizer, the ‘verbalizer’. However, in the work mainly
concentrating on Phase Theory, beginning with Chomsky (2000), et seq., v essentially stands for
Kratzer’s (1996) Voice: a causative–transitive or passive head which hosts the external argument
and of which the transitive version may assign accusative case, as per Burzio’s Generalization.
Now, both approaches take v (the categorizer or the Voice head) to be a phase head. Things
become slightly more confusing in that Chomsky, and others, seem to consider the two elements,
v and Voice, as one unitary head. I will here be concerned with the phase status of the
categorizing v (and n) and will remain agnostic about that of Voice. However, see Anagnosto-
poulou and Samioti (2009) and Harley (2012b) for arguments on why v and Voice should be
kept distinct.
66 Syntactic decomposition and categorizers
(8)
The syntactic derivation will from this point on only be able to see the edge
of the phase: the topmost head – that is, v in (8) – and its specifier (none in the
example above). The rest of the phase (shaded) will not be visible:
(9)
(10)
Once the VoiceP phase is interpreted, again, only its edge, the Voice
head and the specifier hosting the Agent argument are visible for further
syntactic computations. The derivation continues on to the next phase, CP,
and so on.
3.7 Roots and phases 67
The verb hammer is made from root material (including some syntactic head x,
as the root HAMMER behaves like a manner adjunct) and the verbalizer v; no
interpretation or form is matched with the syntactic structure until vP, the first
Phase, is completed. This is why hammer has an idiosyncratic meaning,
exactly like any other word made directly out of a root, as opposed to words
containing already categorized constituents in their structure. In the case of the
verb tape, the root and the nominalizer n combine first (making a noun). This is
a phase: it is sent to LF and Morphology/PF, where the meaning and the form
of the nP tape are fixed. This nP phase can then participate in more syntactic
structure: in (11) this nP merges as an adjunct with structure that is eventually
verbalized.9
9
The question here is whether we can embed the root TAPE directly under a v projection. The
answer is that syntactically this is possible, and ‘looser’ usages of a verb tape are reported by
some speakers; the whole thing boils down to whether Morphology makes available the form(s)
68 Syntactic decomposition and categorizers
So, now we can explain why vP and nP are the limit of ‘special meaning’
in a syntactic tree: vP and nP are the first Phases. They are the first cycle
after the derivation begins, provided of course that they categorize root
material and not already categorized constituents – which of course
would already be phasal constituents. Thus, far from acting as limits of
idiosyncrasy, categorizer projections may receive a compositional interpret-
ation if they recategorize already categorized material, just as is the case
of denominal tape with its [vP v [xP nP x]] structure. What causes idiosyn-
crasy is the semantic deficiency/impoverishment of roots within their
first phase.
Further illustrating this point – namely, that verbs like hammer consist of a
single phase – compare also the morphophonology and meaning of the near-
homophones dígest (noun), digést (verb) and dígest (verb). The noun dígest
and the verb digést are each derived directly by a categorizing head, an n and a
v respectively, taking the projection that contains the root as its complement.
Meanings of the respective noun (nP) and verb (vP) are distinct, as expected
from the discussion above, each of them directly embedding the impoverished/
semantically deficient root digest. Stress, a morphophonological property,
is also distinct (N dígest vs V digést), a property decided and fixed at phase
level, as well:10
Furthermore, the noun dígest can be converted into a verb, yielding the
denominal verb dígest; this is a process identical to that giving the verb tape.
Notice in the phrase marker below that the meaning of denominal dígest is
again compositionally derived from the meaning of the noun and the contribu-
tion of y. The denominal verb dígest is therefore interpreted as ‘make a dígest’;
it also preserves the stress pattern, fixed after the nP phase was despatched to
the interfaces:
necessary to express this direct verbalization of the root TAPE as a word. For discussion of
blocking and gaps, see Embick and Marantz (2008) and references therein.
10
Again, in (12) and (13) I have deliberately left out the internal argument. See also footnote 7.
3.7 Roots and phases 69
Moving away from English and zero-derived conversions, let us observe the
behaviour of two more elements as inner and outer morphemes and let us see
how root deficiency and phases can capture their non-compositional behaviour
when they are directly associated with roots below phase level. We begin with
the Japanese causativizer –(s)ase (Marantz 1997). Next to compositional
causatives, like suw-ase (‘make somebody smoke’) below, Japanese causati-
vizers may also yield idiomatic, non-compositional interpretations, like tob-
ase (‘demote someone to a remote post’) below. As illustrated below, the
causativizing element –(s)ase in the case of suw-ase (‘make somebody
smoke’) combines with a vP phase, whose interpretation has already been
fixed as ‘smoke’, and the result is predictable and compositional. In the case of
tob-ase (‘demote someone to a remote post’), the causativizer merges with a
root tob- (which also yields the verb ‘fly’) and derives an unpredictable and
specialized verb tob-ase (‘demote someone to a remote post’) when it eventu-
ally becomes categorized by a v head.11
(14) Outer and inner causativizers
A similar picture emerges with Greek syn- (Drachman 2005). The interpret-
ation of syn- (as an outer morpheme) can be described as comitative and it is
11
As Marijke De Belder (personal communication, 2011) notes, even in idiomatic tob-ase, there is
clear causation involved. This is interesting but, as will be illustrated for Greek syn-, inner
morphemes need not preserve any of the meaning they have when they are used as outer,
‘compositional’, morphemes.
70 Syntactic decomposition and categorizers
productively used to yield verbs and nouns that have a ‘with X’ interpretation.
So, the verb syn-trog-o means ‘eat with somebody’, from the verb tro(g)-o
(‘eat’), and so on. Still, syn- is also used as an inner morpheme, with no
discernible semantic contribution whatsoever. Hence, syn-tax-i, which means
‘syntax’ or ‘pension’. Words like syn-tax-i belong to words that ‘have mor-
phological structure even when they are not compositionally derived’ (Aronoff
2007, 819). Another example of this non-contribution is the noun syn-graf-eas
(‘author’), whose morphological structure is transparently related to that of
graf-eas (‘scribe’) – still, syn-graf-eas cannot mean ‘someone who acts as a
scribe together with someone else’. To wit, ‘co-author’ in Greek would be
something like syn-[syn-graf-eas].12 The diagram below illustrates the differ-
ence between syn- as an outer morpheme, with a compositional comitative
interpretation, combining with a phasal vP in syn-trogo (‘eat with sb’), and
syn- as an inner morpheme in syn-graf-eas (‘author’), where it combines with a
root awaiting the phasal nP in order to receive its idiosyncratic interpretation.
(15) Outer syn- and inner syn- in Greek
12
When I was writing those lines I was confident that syn-[syn-graf-eas] was nothing but a
slightly contrived coinage. While revising the chapter, I ran an exact Google search of συν-
συγγραφέας out of curiosity: it yielded about 21,500 results.
3.8 On the limited productivity (?) of first phases 71
mechanism that makes words and phrases, then we would expect word-making
to be significantly more productive, whereas it is nowhere near so, compared
with phrasal productivity. The short answer to this problem is the role of stored
forms, Vocabulary Items in Distributed Morphology, associated (or not) with
particular structures.
Indeed, syntactic composition at the first, categorial phase is unlike ‘phrasal
syntax’ – that is, syntactic operations above the ‘word level’, involving higher
phases. For instance, there are no verbs cat, hair or tree, for most speakers of
English at least; there is a difference between father (which is only a ‘becom-
ing’ verb) and mother (which is only a manner/instrument verb) and there is no
verb #father meaning ‘to treat as a father would’ – see Clark and Clark (1977)
for scores of similar examples; not all roots can be used to derive Location/
Locatum verbs; and so on.
Of course, productivity, or lack thereof, is a morphological matter:13
whether Vocabulary contains an appropriate form to insert in a given structural
environment. So, perhaps syntax is after all able to construct a Location
verb ‘butter’, next to the well-known Locatum one, or even to construct a
manner verb ‘butter’. The problem in these cases would be whether Vocabu-
lary (or Morphology, more generally) can make available the form(s) that will
express such syntactic structures (see also footnote 9). Syntax is always free
and always combinatorial; at the same time, a lot of Morphology is inevitably
about a repository of memorized forms. It is very characteristic that in lan-
guages with few inflectional restrictions, such as English, spontaneous coinage
of words – that is, novel morphological expressions of syntactic structures at
the level of the word or whereabouts – is easier than in Romance or Greek,
where all sorts of inflectional criteria must be met and where zero derivation is
not available: in English, in the right context, even a manner verb butter is
possible – for example, in a coinage like #The vegetable oil was bad quality;
it buttered (‘coagulated like butter’).
Invoking morphological restrictions – that is, what is listed – is far from
ad hoc. Indeed, there are gaps and morphological idiosyncrasies every-
where, even in the bona fide syntactic inflectional morphology: two well-
known conundrums of this sort, hardly unique, from my native Greek are
13
Aronoff and Anshen (1998, 243) raise a crucial point with respect to productivity – namely, that
it is far from being an absolute notion: ‘Some linguists treat morphological productivity as an
absolute notion – a pattern is either productive or unproductive – but there is a good deal of
evidence for the existence and utility of intermediate cases, . . . so we will assume . . . that affixes
may differ continuously in productivity, rather than falling only into the polar categories of
completely productive and completely unproductive . . .’.
72 Syntactic decomposition and categorizers
14
These matters are discussed in more detail in the following chapter.
3.9 Are roots truly acategorial? Dutch restrictions 73
with Hindi/Urdu apparently working the same way (Rajesh Bhatt and David
Embick, personal communication, October 2010). That is, the lack of verbs
like boy and #cat in English is an empirical fact to be explained, possibly
invoking grammar-external factors.15
More systematic and solid criticism of acategorial roots comes from Don
(2004). He looks at conversions in Dutch to argue that ‘the lexical category of
roots should be lexically stored’ (Don 2004, 933). More precisely, he claims
that hammer-type derivations – that is, with a root directly yielding a noun
and a verb – are not to be found and that all conversions involve denominal
derivations (like tape) or deverbal ones (like the noun throw). Don makes two
arguments, a morphological one and a phonological one. We will review them
in turn.
Don’s morphological argument against category-less roots (in Dutch) is
very simple and comes from examining two properties of zero-derived
noun–verb pairs in Dutch: irregular morphology on verbs and gender on nouns
(Don 2004, 939–42). He first states that four logical possibilities present
themselves when it comes to cross-classifying zero-derived noun–verb pairs,
as follows:16
Interestingly, only the first three options in (16) are attested in the language:
15
This state of affairs is possibly different to the Romance and Greek cases, where the existence of
‘nominal’ and ‘verbal’ roots looks like an illusion of rich morphology, like the result of roots
being embedded within ‘their functional support systems’, their morphological entourage. This
is also suggested by Panagiotidis, Revithiadou and Spyropoulos (2013), where it is argued that
all but a handful of verbal stems in Greek contain an overt verbalizing morpheme whose
exponence depends on complex (sub-)regularities.
16
Dutch has two genders: common (non-neuter), taking a de definite article, and neuter, taking a
het definite article.
74 Syntactic decomposition and categorizers
17
It is quite common for morphological processes to determine the gender of their output. Don
(2004) cites Beard (1995) for examples of this in different languages.
18
Crucially, no statement can be made on the pairs in (17): they could be V-to-N or N-to-V
conversion pairs, verbs being regular and nouns bearing common gender. They could, for that
matter, be directly derived from an acategorial root.
3.9 Are roots truly acategorial? Dutch restrictions 75
19
See Lowenstamm (2008) for how gender defines different flavours of n.
76 Syntactic decomposition and categorizers
The above looks like the kind of morphological constraints we find in language
after language, as opposed to being the offshoot of the purported categorial
specification of roots. Additional evidence that the gap Don discusses is of a
morphological and morphophonological nature comes from Don’s (2004,
942–5) phonological argument for categorially specified roots. Reviewing
evidence in Trommelen (1989), he observes an interesting constraint on Dutch
verbal stems, summarized in (25):
(25) Phonological properties of verbs in Dutch
Simple syllable Complex syllable
structure structure
With identical noun numerous: bal, lepel, some: oogst, feest, fiets
kat
Without identical numerous: win, kom, No examples
noun vang
Apparently, verbs with complex syllable structure are denominal. Looking
at the gap – that is, verbs with complex syllable structure but no correspond-
ing nouns – we can perceive a picture where verbal stems can only have
simple syllable structure – unless they are denominal conversions. Without
getting too deeply into matters of Dutch morphology, two options present
themselves.
a. The Dutch lexicon assigns a verbal category only to roots with a
simple syllable structure. This claim is odd even within a lexicalist
framework, because it regulates the category membership of a root, a
matter ultimately tied to concepts (Acquaviva and Panagiotidis 2012),
according to syllable structure.
b. Root forms are inserted late: roots in the complement of v are spelled
out as forms with a simple syllable structure. Restrictions on the form
of roots categorized as verbs are the result of Dutch grammar-internal
requirements on forms.
3.10 Conclusion 77
If the latter is the case, then the Dutch restrictions, which have to do with the
exponence of roots and categorizers in all cases, are not deep generalizations
about the nature of roots but constraints that are morphological in nature: they
are surface requirements, rather than the result of a universal principle that
roots be lexically categorized.
3.10 Conclusion
Now that we have carefully separated the workings of syntax from those of
morphology, we have a complete picture of how lexical decomposition and,
more relevant to our topic of category here, syntactic categorization work: the
picture is more or less complete. Thus, we can replace the outline in (26):
(26) Lexical/‘morphological’ Syntactic
word-making word-making
4.1 Introduction
This chapter contains the core of the proposal presented here, one that weds
syntactic categorization with a new explanatory theory of categorial features
as LF-interpretable entities. Interpretable categorial features, borne by
categorizing heads, define the fundamental interpretive perspective of their
complement, thus licensing root material. The discussion begins in Section 4.2
with a recasting of the question regarding the difference between nouns and
verbs as one about the difference between n (the nominalizer) and v (the
verbalizer). Section 4.3 introduces categorial features [N] and [V] as
perspective-setting features and in Section 4.4 Embick and Marantz’s (2008)
Categorization Assumption is used as a guide in order to explain the role of
categorization induced by categorial features, its nature as well as why it
is necessary to categorize roots. In Section 4.5, calling upon evidence from
semi-lexical heads, it is argued that categorizers are not functional heads but,
on the contrary, the only lexical heads in a grammar, as they are the only
elements that can categorize root material, setting a perspective on it. Perhaps
more crucially, it is also argued that categorizers are the only necessary
elements, even in the absence of any root, on which functional superstructures
can be built. Section 4.6 examines empirical evidence from languages like
Farsi in order to put to the test the view that perhaps [N] and [V] could be more
closely related to each other, with [N] possibly being the default value of a
perspective-setting feature.
78
4.2 Answering the old questions 79
elements can and cannot have, the syntactic role of roots within the First
Phase – and so on. We also saw that syntactic decomposition elevates
categorizers such as n and v to a pivotal role in the derivation of nouns and
verbs respectively. Consequently, what we call nouns and verbs are essentially
(subconstituents of) nPs and vPs. Before moving on, let us illustrate this point,
based on Marantz (1997, 2000, 2006) and as discussed in the previous chapter.
Take a verb like bake. Essentially, the verb is a subconstituent of a vP.
(1)
However, saying that nouns and verbs are really (parts of) nPs and vPs does
not answer a fundamental question, the one we set out to answer in the first
chapter. More precisely: suppose that a noun is a morphological unit created
around the head of nP, an n constituent typically, and a verb a morphological
unit created around the head of vP, a v node in typical cases as in (1) above.
If the above is the case, then the problem of what distinguishes nouns from
verbs now becomes one of what distinguishes n from v.
(2) What distinguishes n from v? What is their difference at LF?
(1994), Embick and Noyer (2001), Arad (2005, chap. 2).1 However, the
universal relevance and cross-linguistic significance of the noun–verb
distinction extend well beyond both morphological exponence and –
crucially – word-class membership. Nouns and verbs are not mere
morphological epiphenomena: thus, a claim that [N] and [V] are
morphological and/or post-syntactic features is a highly implausible one.
c. All categorial features are uninterpretable, in a fashion similar to that in
which Case features are analysed in Chomsky (1995). If this is so, then
grammatical category would accordingly be a grammar-internal
mechanism with no direct interpretive effect. In the same way that there is
no uniform semantic interpretation for all nominative or accusative DPs,
similarly there would be no uniform interpretive identity for all nouns or
all verbs. As I will sketch below, and as implied in Chapter 1 already, this
would be undesirable given the research on the semantic differences
between (bare) nouns and (bare) verbs.
d. [N] and [V] are LF-interpretable, as suggested in Déchaine (1993) and
explicitly argued in Baker (2003). This looks like the option worth
exploring, perhaps the only one. Of course, this is a take on word classes
that has been more or less embraced by Anderson (1997) and linguists
working in functionalist frameworks, as reviewed in Chapter 1.
1
See Alexiadou and Müller (2007) for an account of such features as uninterpretable syntactic
features.
82 Categorial features
Before concluding this section, I must make a reference, albeit a brief one
and without doing justice to it, to the system of lexical categories put forth in
Pesetsky and Torrego (2004). More or less in the spirit of Stowell (1981) and
Déchaine (1993), they conceive lexical categories noun, verb and adjective as
grammar-internal entities that can be completely defined in a contextual way,
bringing together the most interesting ingredients of approaches b. and c. in (3)
above: so, nouns, verbs and adjectives are morphological epiphenomena and,
consequently, a grammar-internal business. There are no categorial features; in
which of the three forms a predicate manifests itself is regulated contextually –
that is, by the superimposed syntactic structure, but also locally, according to
the features and selectional requirements of a very low tense head, a TO:
(4) Contextual determination of lexical categories (in Pesetsky and Torrego
2004, 525):
Predicates are morphological verbs when associated with a TO that seeks
uninterpretable Tense features (i.e., Case, in the Pesetsky–Torrego system);
predicates are morphological nouns when associated with a TO that seeks
interpretable Tense features;
otherwise, predicates are morphologically adjectives.
The approach is very elegant and simple; it also fleshes out the programmatic
desire we briefly mentioned in Chapter 1 to strip lexical categories of all kinds
of interpretive characteristics. However, I think that a fully contextual defin-
ition of lexical categories, to the extent that it can be implemented in the way
of (4) or otherwise, is undesirable for exactly this reason: it strips lexical
categories of all interpretive characteristics, which – let me repeat this – results
in missing generalizations of paramount importance. So, I will pursue the goal
set out here, to discover the LF-interpretation of categorial features.
Number Tense
Case-marked Case-assigning
gender and so on agreement with arguments
argument structure covert argument structure overt
determiners particles
From the discussion in Chapter 1 it is also worth recalling two simple but crucial
points. First, ‘prototypical’ members of each lexical category (e.g., rock or tree
for nouns) share exactly the same grammatical properties as ‘non-prototypical’
ones (e.g., theory, liberty and game for nouns); this is treated in length in
Newmeyer (1998, chap. 4). Second, the same concept, as sometimes expressed
by an identical root, like sleep, can appear both as a noun and as a verb.
As also announced in Chapter 1, and following Baker (2003, 296–7), we
have to bear in mind that category distinctions must correspond to perspectives
on the concepts which roots and associated material are employed in order to
express: category distinctions are certainly not ontological distinctions,
whether clear-cut or fuzzy. Crudely put:
(6) conceptual categorization 6¼ linguistic categorization
84 Categorial features
Thus, although all physical objects are nouns cross-linguistically, not all
nouns denote concepts of physical objects (David Pesetsky, personal com-
munication, September 2005): thus, rock and theory cannot belong together
in any useful, or even coherent, conceptual category. The concepts expressed
by rock and theory can, however, be viewed by the Language Faculty in the
same way. This would entail that grammatical categories, such as ‘noun’ and
‘verb’, are particular interpretive perspectives on concepts, that there is a way
in which rock and theory are treated the same by grammar, even if they share
no significant common properties notionally. This stance is essentially taken
in Langacker (1987), Uriagereka (1999), Baker (2003, 293–4) and Acqua-
viva (2009a, 2009b). Finally, understanding categorization as grammar
imposing interpretive perspectives on concepts, we can tackle the question
I raised in Section 1.2.6 of Chapter 1 – namely, what conceptual mechanism
decides which category concepts are assigned to. The reply is ‘grammar does
the categorization’, giving us sleep the noun and sleep the verb, built from
the same concept (and root), but encoding different interpretive perspectives,
or even cross-category near-synonyms like fond the adjective and like
the verb.2
Building on the above points, we can now turn to the question of what the
actual interpretations of [N] and [V] are. The question is of course now recast
as one regarding the different interpretive perspectives that categorial features
impose on the material in their complement. In principle, we could adopt
Baker’s (2003) interpretations of [N] as sortal and [V] as predicative.
However, as discussed in the Appendix, there are several issues with the
way Baker imports sortality into his system and links it with referentiality.
Even more damagingly, [V] as a feature encoding predicativity that also forces
the projection of a specifier is also multiply problematic. I am therefore
departing from Baker’s interpretation for the two categorial features and
I am proposing the following alternative interpretations for [N] and [V]:
(7) LF-interpretation of categorial features
An [N] feature imposes a sortal perspective on the categorizer’s
complement at LF.
A [V] feature imposes an extending-into-time perspective on the categorizer’s
complement at LF.
2
I wish to thank an anonymous reviewer for comments on categorization.
4.3 Categorial features: a matter of perspective 85
principally revolves around the criterion of identity, and I will explore instead
the notion of sortality as implemented in Prasada (2008) and Acquaviva (2009a).
Prasada (2008) notes that sortality incorporates three criteria: application,
identity and individuation. The criterion of application ‘means that the repre-
sentation is understood to apply to things of a certain kind, but not others.
Thus, the sortal dog allows us to think about dogs, but not tables, trees, wood
or any other kind of thing’ (Prasada 2008, 6). In this respect, the criterion of
application differentiates (sortal, but not exclusively) predicates from
indexicals like this and from elements with similar functions. The criterion
of application also incorporates the received understanding of bare nominal
expressions as kinds in Chierchia (1998), as it ‘provides the basis for thoughts
like dogs, [which] by virtue of being dogs, remain dogs throughout their
existence’ (Prasada 2008, 7), for as long as external conditions permit them
to maintain their existence (for a short time, like puppy, or for a long one, like
water and universe). Very interestingly, this is precisely the meeting point with
the intuitions in prototype theory and in functionalist literature (see Chapter 1)
as far as the ‘time stability’ of nouns is concerned: while it turns out that
concepts denoted by nouns are not themselves necessarily time-stable – as
cogently pointed out in Baker (2003, 292–4) – nouns are, however, viewed by
the Language Faculty as time-stable, irrespective of the actual time stability of
the concepts they denote. This is where the notion of interpretive perspective
becomes crucial.3
Regarding the criterion of identity, I will adopt its reinterpretation in
Acquaviva (2009a). The discussion in Acquaviva (2009a, 4) goes like this: if
we take a kind (e.g., the kind person), it has instances (i.e., persons) which are
particulars and which do not themselves have instances. In this way, being a
person is different from being tall: only the property person identifies a type of
entity. At the same time, the property of being tall is characteristic of all the
entities it is true of, but it does not define a category of being. This distinction
leads us back to Baker’s (2003, 101–9) discussion of the criterion of identity:
the criterion of identity essentially defines something which may replace A in
the relative identity statement ‘x is the same A as y’. Acquaviva (2009a, 4)
continues by pointing out the following: a concept that defines what it means
3
Acquaviva (2009a, 1–5) contains more detailed and in-depth discussion of nominal concepts as
such, which goes beyond this sketch of the two criteria of application and identity. Here we will
be satisfied with Prasada’s (2008) two criteria of application and identity as being enough to
define sortality for our purposes: namely, exploring what ‘nominality’ means in grammar and –
crucially – what the interpretation of a nominal feature [N] is.
86 Categorial features
4
Acquaviva distinguishes between a criterion, which is a necessary and sufficient condition, and a
condition. I am here using both terms loosely and interchangeably.
5
When not associated with a concept, [N] would trigger a pronominal interpretation; see Pana-
giotidis (2003b, 423–6). As an anonymous reviewer points out, if [N] is about identity and if it is
present inside pronouns, then it is hard to explain the workings of expletive pronouns, such as it
in it rains. I have no coherent answer to this problem.
6
The criterion of individuation – namely, that ‘two instances of a kind are distinct because they
are the kinds of things they are’ (Prasada 2008, 8) – does not apply to mass nouns. However, it
may play a role in the object bias in the acquisition of nouns (Bloom 2000, chap. 4) and the
perceived prototypical character of objects over substances in terms of nominality.
7
There is no one-to-one correspondence between Ramchand’s (2008, 38–42) process projection
and the verbalizer projection here. However, it is crucial that procP, which ‘specifies the nature
of the change or process’ (40), is understood as the essential ingredient of every dynamic verb.
4.3 Categorial features: a matter of perspective 87
Here, then, we have two essential ingredients for the interpretation of categor-
ial features: perspective-setting, and the relevance of temporality (as opposed
to predicativity) for verbs and their distinctive feature.8 In a similar vein,
Acquaviva (2009a, 2) notes that because ‘verbal meaning is based on event
structure [. . .], it has a temporal dimension built in. Nominal meaning, by
contrast, does not have a temporal dimension built in.’ If we replace ‘meaning’
with ‘perspective’ in Acquaviva’s quote, we can make the claim that [V]
encodes an actual perspective over the concept with which it is associated
and that this perspective is of the said concept as extending into time: this is
why verbs and their projections are the basic ingredients of events. From this
point of view, we can actually call Vs and VPs subevents, with the feature [V]
contributing the temporal perspective to event structures.
Some consequences of the way features [N] and [V] are interpreted, if we
go by (7), include the following: first of all, we can now explain why objects –
but, also, substances – are typically conceived as sortal concepts in the way
sketched above; they smoothly satisfy both criteria of application and iden-
tity. This is compatible with the canonical mapping of such concepts onto
nouns cross-linguistically and – as already pointed out – object and substance
concepts are typically expressed as nouns. At the same time, together with
Uriagereka (1999) and Baker (2003, 290–5), we expect dynamic events
(activities, achievements, accomplishments) to be conceived typically as
extending into time, hence the canonical mapping of such concepts onto
verbs. If, on top of everything, dynamic events are compositionally derived
from states and states are, very roughly, equivalent to VPs, then a theory of
event structure, such as the ones in Rappaport Hovav and Levin (1998),
Borer (2005), Levin and Rappaport Hovav (2005) and Ramchand (2008),
receives added justification along the lines of verbal constituents being
8
I have long thought that Langacker (1987) and Uriagereka’s (1999) understanding of nouns as
regions and spaces could be unified with Acquaviva’s (2009a) treatment of nominal concepts as
‘unbroken’ – that is, as having no temporal parts (as discussed below). However, I presently can
offer no true insight on this prospect.
88 Categorial features
Speakers know that only a part of the event had a property and another part
had the other property, but this is disregarded in a structure which predicates
two contradictory properties of the same subject. The nature of the nouns’
referents as occurrents is only disregarded, not changed; this becomes obvi-
ous when we explicitly state that the whole subject is there at a given time:
(2) a. the iron became heated (all of it) 6¼ the argument became heated
(all of it)
b. the wedding moved from A to B 6¼ they married first in A then
in B
In (2a), all of it may not refer to all of the event’s temporal parts . . . In (2b),
the right-hand side is not a good paraphrase of the left-hand side because
marry is a telic verb and so the sentence entails that the event is completed,
first in A, then in B. To explicitly describe the referents of argument and
wedding as lacking temporal parts, thus, conflicts with their lexical semantics.
Yet the sentences in (1) are natural even though they sideline temporal
constitution. Reference to temporal constitution is thus inessential for nouns
referring to occurrents. This, then, is a clear difference: verbal reference has a
temporal dimension built in (in terms of actionality, not tense; this applies to
permanent states as well as to bounded events); nominal reference does not,
and can do without such a dimension even when referring to occurrents.
In any case, the truly important part of (7) is that [N] or [V] on n and v encode
different perspectives, rather than different inherent properties of the concept
itself as expressed by the root. This is actually a most welcome consequence of
syntactically decomposing lexical categories in the light of pairs like N work –
V work, which brings us to the answer of what distinguishes n from v:
(9) Categorial features [N] and [V] are interpretable on the n and v categorizers
respectively.
This –o– morph is an elsewhere form and its sole purpose is to satisfy the
requirement that Greek stems be morphologically bound. Interestingly, if
that was all that there was – that is, a morphological restriction on stems –
acategorial roots could easily be part of syntactic structures by getting –o–
attached onto them as stems: we would expect –o– to have a wider
distribution. However, –o– is illicit in all environments where the root
is expected to be categorized – that is, where the roots will then have
to combine with functional structure. Thus we have pairs like the
following:
9
Perhaps it might be possible to correlate the cross-linguistic avoidance of roots as free
morphemes with the Categorization Assumption itself; however, this is not a claim I would
make here.
10
N here stands for ‘nominal’.
4.4 The Categorization Assumption and roots 91
(12) vrox-o-lastix-a
vrox-o-lastix-n.neut.pl
‘rain tyres’
lastix-a vrox-is
lastix- nom.neut.pl vrox-n.fem.sg.gen
‘rain tyres’
11
The caveat ‘overwhelmingly’ is inserted because I am not sure light verbs exist in Greek.
12
The list is incomplete and will be revised and expanded: we need also to include inner,
subcategorial, morphemes, which merge directly with root material, as seen in the previous
chapter. See Chapter 5, footnote 12.
92 Categorial features
(16) the role of categorial features [N] and [V] in the licensing of roots.
First, we take seriously the hypothesis that roots are unexceptional syntac-
tically, by being merged inside the complement of material below the
categorizers or, perhaps, by projecting their own phrases, too, as in Marantz
(2006) and Harley (2007, 2009) – but see De Belder (2011) for argumenta-
tion against roots projecting phrases. This is the zero hypothesis, as stated
in (14): if roots are manipulated by syntax, then they should behave like
all other LIs (‘lexical items’), unless there is evidence to the contrary.
Having said that, (14) goes against what we have in (10) and (13), unless
of course we try to stipulate (10) and (13) via some kind of c-selectional
restrictions. Still, it is not the case that all roots are directly selected by n
and v: as seen in Chapter 3, some roots may be deeply embedded inside
more material.
Second, the thread in (15) is about carefully considering the LF interpret-
ation of roots in isolation, so to speak, about examining their own conceptual
content; in this I will assume that the semantic content of the root is seriously
underspecified/impoverished.
Third, (16) announces that we will employ our newly developed under-
standing of categorial features on categorizers as providing the necessary
perspective (e.g., sortal or temporal) through which the root will be interpreted;
we will also explain why the absence of such an interpretive perspective
prevents roots from participating in legitimate syntactic objects. In other
words, I argue that categorizers exclusively provide the grammatical ‘context’
of Marantz (2000) for the root to be assigned an interpretation and/or a
matching entry at the interface with the Conceptual–Intentional/SEM systems.
Let us now look at the second and third threads, (14) and (15), in more
detail.
4.4 The Categorization Assumption and roots 93
Verbs:
kavaš (‘conquer’, ‘subdue’, ‘press’, ‘pave’, ‘pickle’, ‘preserve’, ‘store’, ‘hide’)
kibeš (‘conquer’, ‘subdue’, ‘press’, ‘pave’, ‘pickle’, ‘preserve’)
13
Acquaviva also discusses the issue of morphological class membership information and the
problems arising if we argue that it is (not) encoded on the root.
94 Categorial features
all these words with their diverse and unrelated meanings should be listed as
possible interpretations of the root KBŠ. However, if roots themselves are
polysemous, then a root (say, KBŠ) must also lexically encode information on
which of its many meanings is available in which syntactic environment.
Whatever the precise amount of semantic content that characterizes roots,
what is important for the discussion here is that roots on their own have
minimal semantic content or, as Arad (2003; 2005, chap. 3) proposes, that
they are severely underspecified. This could be understood to result in free
roots not being adequately specified to stand on their own as legitimate LF
objects.14 Consider that syntactic objects at LF – say, a vP phase – consist
purely of interpretable and/or valued UG features and roots. Now, by hypoth-
esis, syntax uses the operation Agree in order to eliminate uninterpretable and/
or unvalued UG features before the phase is completed (i.e., before the
derivation is sent off to the interface). But what about roots? Roots are possibly
UG-extraneous elements. Even if this claim is too strong, roots do not form
part of UG, as they can be borrowed or even coined, pre-theoretically speak-
ing, at least. Roots are essentially ‘imported’ into the syntactic derivation.
However, FLN (the Faculty of Language in the Narrow sense) must somehow
manipulate roots, apparently in order to be able to express a variety of concepts.
If no roots are manipulated by FLN in a particular derivation, we get expres-
sions made up exclusively of UG features like ‘This is her’, ‘I got that’, ‘It is
here’ and so on (cf. Emonds 1985, chap. 4). Therefore, the ability of FLN to
manipulate roots enables it to denote concepts and, ultimately, to be used to
‘refer’ (Acquaviva and Panagiotidis 2012).
However, roots being possibly extraneous to FLN, and given that they
probably do not contain any UG features (as is the emerging consensus), they
need to be somehow dealt with by FLN: categorization is exactly the way this
is achieved. In a nutshell: uncategorized roots are FLN-extraneous; either just
because of this or also because they are semantically underspecified them-
selves, uncategorized roots would not be recognized at the interface between
syntax and the Conceptual–Intentional/SEM systems.
14
The semantic underspecification of (uncategorized) roots can be understood as the reason
they are not legitimate objects at LF, but more needs to be said on the matter. For instance,
Horst Lohnstein (personal communication, 2010) points out that a root like spit could still
be interpretable in some sense even if it is underspecified, unlike kbš; the ‘basic meaning’ of
spit as a root is more clearly circumscribed. However, even a relatively straightforward root
like spit means much less when it is uncategorized than, for example, the verb it names. See
Rappaport Hovav and Levin (1998) for discussion; see also Acquaviva and Panagiotidis
(2012).
4.4 The Categorization Assumption and roots 95
The specific claim made here is that syntax does not use a special operation
to ‘acclimatize’ roots but embeds them within a categorizer projection, whose
categorial features provide an interpretive perspective in which Conceptual–
Intentional systems will associate the root with conceptual content. The short
of it is that
(18) (Free/uncategorized) roots are not readable by the Conceptual–Intentional/
SEM systems.
15
This is already stated in Marantz (2000): ‘To use a root in the syntax, one must ‘merge’ it
(combine it syntactically) with a node containing category information.’ I of course take [N] and
[V] to be the said ‘category information’ here.
16
Non-phasal heads could also provide a context for the interpretation of roots; categorizers could
in principle be non-phasal heads. Note that if the objections in Anagnostopoulou and Samioti
(2009) are correct, this is exactly the way things are: categorizer phrases are not phases but they
are (inside) the complements of phase heads.
96 Categorial features
Merge creates a syntactic object from the root and its object ‘tomatoes’, with
the root projecting: a syntactically unexceptional object. However, if this is
embedded under functional structure without the licensing ‘intervention’ of a
categorizer, then the resulting syntactic object will lack interpretive
perspective, because of the SEM-deficient root Grow. This is why the
categorial feature on the categorizer, [V] on v or [N] on n, is necessary: it
assigns an interpretive perspective to the object as extending into time or as
sortal, therefore enabling the resulting vP or nP – the so-called First Phase – to
be interpreted. At the same time, the root-categorizer object, associated with
an interpretive perspective, can be matched with a vocabulary item (grow or
growth) and an appropriate ‘lexical’ concept, a ‘meaning’ (cf. Aronoff 2007).
(20)
17
For overviews, see Ackema and Neeleman (2004) and Marantz (1997, 2000, 2006).
98 Categorial features
(23) There is only one class of ‘lexical elements’ that qualify as atomic
‘nouns’: n heads.
4.5 Categorizers are not functional 99
The fact that what we used to classify as ‘lexical heads’ are in reality the
categorizers themselves can be glimpsed from a very privileged angle once
we consider semi-lexical categories (Corver and van Riemsdijk 2001).
Semi-lexical elements are lexical nouns and verbs that do not carry any
descriptive content, including English one (as in the right one), ‘empty
nouns’ (Panagiotidis 2003b) and at least some types of light verbs. Emonds
(1985, chap. 4) already analyses semi-lexical elements – which he calls
grammatical nouns and grammatical verbs – as instances of N and V heads
without any descriptive, concept-denoting features. This line of analysing
semi-lexical heads is taken up and developed in van Riemsdijk (1998a), Haider
(2001), Schütze (2001) and Panagiotidis (2003b). I think it is very interesting
that Emonds’ definition, dating from 1985, of such elements as lexical elem-
ents that only bear formal features is exactly what we would think categorizers
themselves are in a lexical decomposition framework: consider, for instance,
Folli and Harley’s (2005) postulation of three v heads caus, do and become.
Indeed, Emonds (1985, 159–68) claims that different lexical entries of gram-
matical nouns and grammatical verbs can be distinguished from each other by
virtue of their formal features only, as they are completely devoid of any
descriptive content.
It seems, then, that when we deal with semi-lexical elements, the root
supplying the descriptive content – whichever way it does it – is absent. So,
a first straightforward conclusion would be that in order to have a ‘noun’ (nP)
18
As a consequence, adpositions and functional elements such as determiners should not contain
roots. Received lore within Distributed Morphology would indeed take all functional elements
to be bundles of features realized by Vocabulary Items. As for adpositions, the lexical material
they contain – that is, the ‘heavy’ nominal and adverbial material briefly mentioned in Section
2.9 of Chapter 2 – must already be categorized. I am grateful to Carlos de Cuba (personal
communication, April 2008) who first raised this question in 2008, and to an anonymous
reviewer.
100 Categorial features
or a ‘verb’ (vP) a root is not necessary, whereas categorizers always are. This is
a conclusion that Harley (2005a) arrives at, in her treatment of one: she
convincingly shows that one is, precisely, the Vocabulary Item inserted when
an n head is not associated with a root, under certain morphosyntactic
conditions.
The above strongly suggests that (category-less) roots and subcategorial
material are – syntactically speaking – optional and that a well-formed syntac-
tic representation can be constructed using just categorizers and a functional
structure superimposed on them: we can build syntactic structures using only a
categorizer as its ‘seed’; roots are not necessary. We will return to this
observation in the next chapter.
19
Some relevant options had already been explored by Déchaine (1993) in her three-feature
system for lexical and functional categories.
4.6 Nouns and verbs 101
In Hebrew, Russian and Greek nouns can be borrowed exactly as they are,
modulo phonological adaptation. Observe, for instance, the following loan-
words into Greek:
(25) Loanwords into Greek: nouns
rok ‘rock music’
solo ‘solo’
zum ‘zoom’
indriga ‘intrigue’ – from Italian intriga
At the same time, when borrowing roots to make verbs, special verbalizing
morphology must be added to these root forms in order to turn them into
legitimate verbal stems – recall the discussion in Section 4.4.1:
(26) rok-ar-o ‘I rock.’
sol-ar-o ‘I do a solo.’
zum-ar-o ‘I zoom in.’
indrig-ar-o ‘I intrigue sb.’
On the basis of similar evidence from Hebrew and Russian, Arad argues that
this asymmetric behaviour is the result of a VoiceP (as opposed to a bare vP)
being the minimum verb. She goes on to argue that although a nominalized
root is a noun, a verbalized root is not a verb: this results in the paradox of a
verbalized root being smaller than a verb. Corroborating this is the fact that, in
Hebrew, ‘roots make nouns more easily than verbs’ (Arad 2005, 56), a fact that
also seems to hold true for Greek, even beyond the domain of borrowing.20
However, taking a better look at (26) above, it turns out that the extra
morphology, the –ar– form, attached to (borrowed) roots to make verbs is
not an exponence of Voice but, rather, of v itself. Panagiotidis, Revithiadou
and Spyropoulos (2013) observe that the –ar–-type elements that make verbal
stems out of roots co-exist with Voice, Aspect, Tense and Agreement morph-
ology, and they are obligatory, even when combining with native roots: it is
impossible to form verbs from roots without one of these pieces.21 Even so, the
fact that verbalizers, v heads, overwhelmingly tend to be morphologically
expressed in Greek (and, possibly, in Hebrew and Russian, too), unlike the n
head, is something that in turn must be explained: the issue of the markedness
of verbs re-emerges.
20
Nouns vastly outnumber verbs in any given dictionary, after all.
21
There is actually a very limited number of verbs, all of Ancient Greek origin, that do not show
an overt piece of verbalizing morphology; these include graf-o (‘I write’), vaf-o (‘I paint/dye’),
trex-o (‘I run’), idri-o (‘I found/establish’).
4.6 Nouns and verbs 103
Farsi Complex Predicates, substituting for verbs, are multiply exciting. What
we are going to focus on here, however, is the categorial status of the preverbs.
The preverbs in (27) all look like nouns, making Farsi Complex Predicates
apparently similar to English periphrases like have/take a shower, have/take a
look, make a call/a statement/a mistake, take a picture and the like. The Farsi
situation is much more intricate. Karimi-Doostan (2008a, emphasis added)
explains why:
22
Farsi is a subject–object–verb (SOV) language.
104 Categorial features
23
All examples are from Karimi-Doostan (2008b).
4.6 Nouns and verbs 105
5.1 Introduction
Arguing for two privative/unary categorial features, [N] and [V], as setting
particular interpretive perspectives marks a significant step in our understanding
of the workings and status of categorizers n and v respectively. However, the
theory developed here goes well beyond providing an interpretation for
the behaviour of categorizers. In this chapter I will put forward the hypothesis
that uninterpretable categorial features are borne by functional heads. In
Section 5.2 the case against dedicated ‘category’ features for functional elem-
ents is made and in Section 5.3 the idea that functional heads are satellites of
lexical ones, members of a lexical category’s functional entourage, is rehearsed,
an idea familiar from such analytical concepts as Grimshaw’s (1991) Extended
Projections, van Riemsdijk’s (1998a) M-projections and Chomsky’s
(2001) supercategories. Section 5.4 correlates this idea with the notion of
biuniqueness – namely, that functional elements may appear in the superstruc-
ture of only one lexical head and in Section 5.5 I argue against the claim that
biuniqueness entails that functional heads bear the categorial specification of the
lexical head in their ‘Extended Projection’. Categorial Deficiency, the hypoth-
esis that uninterpretable categorial features flag functional heads, is introduced
in Section 5.6 and is refined in Sections 5.7 and 5.8. In Section 5.9 we review
the crucial consequences of Categorial Deficiency: how uninterpretable categor-
ial features triggering Agree operations can account for some well-established
characteristics of phrase structure and how uninterpretable categorial features
are the Agree probes that regulate labelling of (some) projections after the
application of Merge. The final section concludes the chapter.
106
5.2 The category of functional categories 107
the features themselves (e.g., [T], [C], [D] etc.) would be interpretable at LF.
Again, this appears to form among theoretical syntacticians the received, albeit
largely unexamined, view, a view made explicit, for example, in textbooks
such as Adger (2003). Still, in the spirit of looking for an explanatory theory of
category, as opposed to settling for a purely taxonomic grouping of grammat-
ical elements, we must seriously ask the question of what kind of interpretation
features like [T], [C] or [D] would have.
(2) What is the interpretation of categorial features on functional heads?
Answering this question is both easy and difficult: easy because one can
straightforwardly come up with specific ideas on the interpretation of the [T]
feature – for example, anchoring in time; [C] would most likely encode
illocutionary force and [D] would encode referentiality and, possibly, also
deixis. However, if we are guided by a desire for thoroughness, we will soon
stumble upon the fact that answering (2) can be a rather complex affair,
punctuated by a number of complications.
An initial complication involves the categorial features of functional cat-
egories beyond Tense, Complementizer and Determiner – that is, of categories
such as Focus, Topic, Mood, Aspect, Voice (in the clausal/verbal projection
line), and Quantifier, Number, Classifier (in the nominal projection line).
Surely, these are all natural classes with identifiable LF interpretations.
Adopting a coherent theory of features where features consist of an attribute
and a range of values, we could easily capture functional categories as natural
classes defined by their features. In the case of Complementizer, for instance,
the [C] feature could be reformulated as an ‘illocutionary force’ attribute with
declarative, interrogative and so on being different values thereof. Adger
(2010) offers a cogent and detailed discussion on the details of implementing
a system like that. Hence, we could treat categorial features for functional
categories as ordinary LF-interpretable features consisting of an attribute
and possible values. Different attributes – for example, ‘illocutionary force’,
‘reference/deixis’, ‘anchoring in time’ – would define different functional
categories, Complementizer, Determiner and Tense respectively. The above
would give us the following: (a) the LF-interpretable content of functional
categories, and (b) why, for example, a [Q](uestion) head and the Comple-
mentizer that belong to the same natural class: because they bear the same
illocutionary force feature, albeit differently valued.
However, would these features truly be our categorial features? If we answer
this in the affirmative, then we actually commit ourselves to a view, one with
some currency, that a functional head bears at most one interpretable feature:
5.2 The category of functional categories 109
1
There is some debate even about which would be the zero hypothesis, especially from an
acquisition perspective: a universal fixed repertory of functional categories which are encoded
in UG or forced by the Conceptual–Intentional systems, or a pool of UG features for the learner
to assemble her own repertory of functional categories. See also Borer (2005) for insightful
discussion.
110 Functional categories
stand as a functional head during the derivation. This view essentially echoes
the insights in Chametzky (2003, 213–19).
5.4 Biuniqueness
Besides their acting as satellites of lexical heads, there is a second property
of functional elements that a proper account of their categorial features
should capture – namely, what Felix (1990) calls biuniqueness, harking back
to at least Martinet. Biuniqueness, according to Felix, is a general require-
ment that functional elements merge in the projection line of only one kind of
lexical head. For instance, D can only merge in the projection line of N, T in
that of V, and so on. For mixed projections, which constitute a principled
exception to this, see Bresnan (1997), Alexiadou (2001) and the following
chapter.
There is a recurrent theme in the generative literature with respect to how to
best capture biuniqueness, while also successfully addressing the question of
the category of functional categories. Ouhalla (1991), Grimshaw (1991) as
well as van Riemsdijk’s (1998a) Categorial Identity Thesis all converge on the
following hypothesis:
(3) Functional categories bear the categorial specification of the lexical head in
their projection line.
Illustrating, Aspect and Tense heads will all bear a [V] feature, whereas
Number heads and Determiners must also bear an [N] feature. Any version
of feature matching consequently guarantees that D (an [N] category) will
never select V; T (a [V] category) will never select N – and so on.
But why only talk about nouns and verbs in (3)? We have of course
discussed a number of objections to the uniformly lexical status of adpositions
and even to their forming a coherent category in Chapter 2, Section 2.9; we
have also argued in Section 2.8 of the same chapter that adjectives, even if they
are a uniform lexical category, are not of the same ilk as nouns and verbs.
However, these objections are not enough if we wish to exclude adpositions
and adjectives from the discussion of biuniqueness: we would now need to
show that there are no functional categories biuniquely associated with P and
A. Interestingly, it has already been argued that what makes biuniqueness
essentially a relation between nouns and the nominal functional layer and
between verbs and the verbal/clausal functional layer is the fact that ‘while
V[erb]/N[oun] are uniquely selected by Functional heads, P[reposition]/A
[djective] aren’t’ (Déchaine 1993, 32).
112 Functional categories
Let me begin with adpositions, which are relatively easy to deal with.
Déchaine (1993, 34) puts it curtly: ‘I know of no examples of P c-selected
by a Functional head.’ Assuming that some adpositions are indeed lexical –
say, behind, aboard, underneath, opposite, regarding and the like – these are
typically related to nominal functional material; they are probably specifiers of
functional elements in a nominal superstructure, therefore of satellites of N.
This is one of the main topics in van Riemsdijk (1998a), who examines the
behaviour of Germanic ‘lexical P-DP-functional P’ structures like auf den Berg
hinauf (‘on the mountain upon’), the c-selection relations of prepositions with
both the verbs selecting them (e.g., depend on) and the nouns inside their
complement (e.g., in 1996, on Sunday). He concludes that prepositions
‘should, in a sense yet to be made precise, be considered (extended) projec-
tions of nouns, at least when they are transitive. One aspect of this decision is
that a nominal projection embedded in a prepositional shell does not constitute
a maximal projection DP; instead, there is a transition from D’ to P’, induced
by the prepositional head’ (van Riemsdijk 1998a, 31). Sticking with this
conclusion and not going into any more detail here, as in Chapter 2 the reader
is referred to Emonds (1985, 156–7), where Prepositions are equated to
Complementizers, to Baker (2003, 303–25), who considers Adpositions
to be instances of Kase, to Svenonius (2007, 2008), Botwinik-Rotem and
Terzi (2008) and Terzi (2010), for whom adpositions are complex structures
parallel to argument structures.
Turning to adjectives, and in order not to repeat the discussion in
Chapter 2, Section 2.8, on Degree Phrases and the question of the extent
to which they are part of an Extended Projection of adjectives, I think a
point made by Déchaine (1993, 35) actually sheds a lot of light on Degree
heads, as opposed to Degree adjuncts:2 ‘any stative predicate can take a
degree modifier, as long as it is gradable’. This generalization is already
captured in Maling (1983) and becomes a reasonable working hypothesis
when one considers Greek Degree expressions. Recall from the discussion
of Neeleman, Doetjes and van de Koot (2004) that we have to distinguish
between two classes of Degree expressions. On the one hand, we have what
they call ‘Class 2’ Degree expressions – Degree adjunct modifiers like
more, less, enough, a little, a good deal – which are promiscuous in that
they adjoin to PPs, VPs and so on. Contrasting with them, we also have
‘Class 1’ Degree heads like very, as, too, that, how, which appear to belong
2
See Section 2.8.2 in Chapter 2.
5.4 Biuniqueness 113
3
Although it translates as ‘more’, pio is not a morpheme purely for the formation of periphrastic
comparatives: it combines with indeclinable adverbs like kato (‘down’), mesa (‘inside’), pera
(‘far away’) and ektos (‘outside’) which are not derived from adjectives. Moreover, its behaviour
is identical to that of a number of Degree prefixes intensifying the adjective: kata- (‘over’), olo-
(‘all’), pan- (‘total’), theo- (‘god’), yper- (‘super’), tris- (‘thrice’), tetra- (‘fourfold’), penta-
(‘fivefold’) and so on – which I will not discuss here for the sake of brevity.
114 Functional categories
The behaviour of the two expressions with VPs and PPs is as expected in
Neeleman, Doetjes and van de Koot (2004). However, when it comes to noun
phrases, the picture changes. If the noun is inherently gradable, then pio is
felicitous on its own, without a dummy adjective poly (‘much’) as its comple-
ment, suggesting that this Class 1 Degree expression can actually take a
nominal complement, as long as it is plausible to perceive it as gradable
(Déchaine 1993, 35). Anthropos (‘human being’) is such a gradable noun, so
‘pio anthropos’ (‘more of a human being’) in (5)d. can be felicitous either in
the context of discussing evolution (e.g., homo erectus is more of a human than
homo habilis) or moral qualities and/or being humane. Even more interest-
ingly, just as Number can coerce the nominal in its complement into a kind
and/or countable reading (cf. the ambiguous ‘we need three coffees here’), the
Degree expression pio can coerce an inherently non-gradable noun to behave
as such, just like spiti (‘home/house’) in (5)d. above, which can be uttered
comparing the appearance of a particular residence (a converted warehouse,
a hut etc.) to a prototypical home or house.4 So, at least in Greek,
Class 1 Degree expressions cannot be claimed to be to the category A what
D is to N and T to V. In other words, the relation between Degree and
Adjective is not a categorial one; better put, to the extent that the Degree–
Adjective relation (somehow) refers to category, it is not a biunique one
between Degree and Adjective. Putting this together with the brief review on
adpositions, we can argue for the following.
(6) Biuniqueness is a relationship only between nouns and the nominal
functional heads (D, Num etc.) and between verbs and the verbal/clausal
functional heads (Voice, Asp, T, Mood etc.).
Now, the thesis in (3), a more articulated version of the intuition about
‘supercategories’, effectively captures the insights in Chametzky (2000),
Hegarty (2005) and Hinzen (2006) that functional elements are satellites of
lexical ones, and that functional categories do not exist as primitives of the
grammar. However, a solution to the problem of ‘the category of functional
categories’ like the one in (3) leads to a rather undesirable result – namely, that
we cannot distinguish functional from lexical categories on the basis of
categorial features only: we hardly want to say that, for example, T or Asp
are verbs by virtue of their [V] feature. In other words, the very same feature
4
Identical facts hold for the intensifying prefixes in footnote 3, with a lot of room for innovation
and speaker variation – morphophonological matters, allomorphy and the usual morphological
idiosyncrasies notwithstanding.
5.4 Biuniqueness 115
5
Panagiotidis (2003a) links the semi-lexical elements’ lack of denotation to their inability to θ-mark.
6
In Ouhalla’s (1991) analysis lexical and functional categories can only be distinguished according
to their position: lexical categories occupy the bottom of a projection line – an important matter
we will return to. Similarly, in Grimshaw (1991), a numerically valued feature F stipulates the
different position of the head within an Extended Projection: a value of 0 places the head at
the bottom, rendering it both lexical by definition and the head of the Extended Projection, a value
of 1 places it higher and a 2 value places it at the top of the Extended Projection. The role of this
F0. . .Fn feature is then purely one of deciding the order of con-categorial heads within an
Extended Projection, while its interpretation at LF is at best unclear.
116 Functional categories
According to the idea that functional categories bear the categorial specifi-
cation of the lexical head in their projection line, the above tree should contain
four [V] features, and we are now facing the puzzle of how all of them are to
be interpreted at LF. In other words, if we take (3) at face value, we end up
treating categorial features exactly like taxonomic markers that are there purely
to guarantee biuniqueness, which is nothing but a special case of c-selection
(more on this below). Put differently, what is important is the fact that,
according to (3), all three functional elements and the lexical verb in (7) are
specified for a categorial [V] feature. If [V] is not a taxonomic feature and, in
accordance with the discussion in the previous chapter, if it encodes an
extending-into-time perspective, what happens at the interface with the
Conceptual–Intentional/SEM systems, where this interpretive perspective is
encoded four times inside a TP? To what end would this happen?
Summarizing, if we follow the path indicated in (3), then we must also
explain what the role of categorial features on functional heads is; furthermore,
we also need to suggest a general enough and language-independent way in
which to capture the fundamental lexical–functional distinction. Hence,
accounts invoking identity between the categorial features of lexical and
5.6 Categorial Deficiency 117
Let us clarify the above: if v, the categorizing head, bears a [V] feature, then Asp,
T and so on will all bear an uninterpretable [uV] feature. Identical facts hold
for the nominal functional heads Num and D: they must also bear an uninterpret-
able [uN] feature. It follows that not all categorial features are interpretable:
those on functional heads are not. The following diagram, a reinterpretation
of the hypothetical state of affairs in (8), illustrates Categorial Deficiency:
(10) One [V] feature, three [uV] features
7
I wish to thank an anonymous reviewer for raising this matter.
120 Functional categories
8
See also Section 5.8 below.
9
I am grateful to David Willis for raising and discussing this matter.
122 Functional categories
10
Of course, this is not a problem in analyses such as Jouitteau (2005) and Richards and Biberauer
(2005), where the whole vP might be pied-piped to SpecTP, at least for EPP purposes – but
much more would need be said on the conditions under which this may happen, if at all.
5.8 Categorial Deficiency, roots, categorizers 123
Hence, if Num, Voice and all other functional heads lack categorial features
altogether, they would be unable to set interpretive perspectives on root
material. Everything else we said about functional heads as satellites of lexical
ones, as parts of a supercategory, would most probably still hold, as well.
Given these considerations, and the purported explanatory benefits of Categor-
ial Deficiency, the question is framed thus: is biuniqueness, which potentially
could be worked out on purely semantic and/or conceptual terms, a sufficient
condition to justify positing uninterpretable categorial features on functional
124 Functional categories
11
Emphasis here should be placed on ‘worked out’: on a more personal note, I have heard this
objection raised several times since Panagiotidis (2000). Fleshing out exactly how biuniqueness
can be reduced to semantic requirements and/or conceptual restrictions remains at a program-
matic stage in most cases, with the exception of Borer (2005). However, in order to embrace
Borer’s execution, one would also have to fully embrace her system on the workings of
functional material in its entirety – no grafting of her solutions onto other analyses is possible.
12
There is a potentially important distinction to be made here; however, I will not pursue it in
detail. If functional heads are bundles of UG-features flagged by [uX], an uninterpretable
categorial feature, then we could hypothesize that inner morphemes (see Section 3.7 of
Chapter 3) – that is, subcategorial elements like low applicatives, low causativizers, particles
and the like – are precisely acategorial bundles of UG-features not flagged by any categorial
features, interpretable or uninterpretable. There is a lot more to be said on this conjecture, so
I will leave it as is in this work.
5.9 Categorial Deficiency and Agree 125
Further clarifying for the purposes of the discussion here, ‘Probe’ and ‘Goal’
can be understood either as bundles of features or as the features themselves,
but we will take Probes and Goals to be features here. We will also assume that
we do not need an unvalued feature (Case or other) to flag the F’ (the Goal) as
active: simple matching between F and F’ should be enough (at least for our
purposes here) to guarantee the possibility of Agree between them, if all other
conditions are respected.13
At this point something needs to be said about the c-command condition
in (13). If we seriously subscribe to a bottom-up model of structure-building via
successive applications of Merge, the c-command condition must be adhered to:
(14) The Probe must always c-command the Goal – never the other way round:
Chomsky (2000, 122), Richards and Biberauer (2005, 121) and Donati
(2006).
In the reverse scenario, if Agree could occur with the Goal c-commanding the
Probe, we would have derivations where a potential Probe is merged and has
to ‘wait’ for a suitable Goal. Of course, as mentioned, ‘waiting’ is in principle
possible until phase level is reached: until the spell-out of a phase, all syntactic
operations are system-internal and not evaluable at either level of representa-
tion – that is, at either interface. Thus, phase-internally, the relative structural
configuration between F and F’ is of no consequence: see Richards (2004),
Baker (2008) and Hicks (2009, chap. 2). However, here we will adhere to a
strict version of the c-command condition – that is, for F to c-command F’ as in
(14). The reason we adhere to this stricter version is because we conceive it as
a more special case of the following requirements, already in Chomsky (2000,
133–4; 2004, 109), Hegarty (2005, 32) and Donati (2006):
a. that the Probe is always a head: a lexical item (LI) rather than a
syntactic object (SO);14
b. that the Probe (F) projects.
Thus, after the application of Merge:
(15) The Probe, a head, projects.
The above can resolve labelling in a large number of cases (and, if further
refined, possibly in all of them). Here the focus will be on how (15) works in
13
Although ‘matching’ should be clear, on an intuitive plane at least, it is true that a formalization
of the notion of matching is indeed needed: see Hegarty (2005), Adger (2006).
14
That is, an object already assembled by a previous application of Merge.
126 Functional categories
resolving the label in those cases where two LIs (lexical items) merge in order
to ‘begin a tree’.
Let us now turn to Agree relations between uninterpretable [uX] and
interpretable [X] categorial features. Let us informally call this instance of
Agree ‘categorial Agree’ and make it explicit:
(16) If [X] is an interpretable categorial feature, [uX] serves as a Probe for the
Goal [X], and not vice versa: [X] cannot ever act as a Probe for [uX] and
[uX] can never act as a Goal.
This unattested, and presumably impossible, state of affairs can be ruled out if
it is independently impossible to merge all the lexical heads together in the
absence of functional structure (see also footnote 19). Perhaps even more
elegantly, it can be claimed that [uV] in (18) creates a defective intervention
effect by preventing [uN] from probing beyond it.16
A more general problem is that concerning Agree as a feature valuation
process. Given the situation in (17), we are forced to argue for uninterpretable
15
On the ordering of functional heads, see Hegarty (2005).
16
I wish to thank an anonymous reviewer for raising this matter.
5.9 Categorial Deficiency and Agree 127
versions of [N] and [V] in order to ensure matching with their interpretable
versions on lexical heads and, consequently, biunique projection lines. Arguing
for an unvalued [ucat(egory)] feature on functional heads will not derive the
desired empirical results, as this [ucat] feature could be valued by either [N] or
[V], assuming these to really be [cat:N] and [cat:V]. I of course understand
that, unless we adopt the framework in Pesetsky and Torrego (2004, 2005),
the notion of valued uninterpretable features such as [uN] and [uV] sits uncom-
fortably with current notions on Agree as feature valuation (rather than feature
checking). Here, however, it seems necessary to argue for [uN] and [uV] in order
to preserve biuniqueness – that is, in order to prevent, for example, Tense heads
from appearing in the projection lines of nouns and the like.17
By (16), this structure would contain no Goal for the three Probes, the [uV]
features, on each of the three functional heads. Although details and (im)
17
At this point, a serious empirical question emerges – namely, what we are to make of functional
elements like Aspect, which appear in the projection line of both verbs and nouns. The account
developed here can accommodate such marginal cases under the idea that particular grammars
allow the inclusion of a [uN] feature in bundles containing aspectual features. Alternatively,
whenever we encounter aspectual heads within a nominal, we can perhaps demonstrate that we
are dealing with a mixed projection (see the next chapter).
128 Functional categories
possible scenarios to salvage a derivation like the one in (19) are discussed
below, the bottom line is that there can be no syntactic structure without lexical
material, without a categorizer encoding an interpretable categorial feature.
We now turn to explaining how categorial Agree can also exclude mid-
projection lexical heads sandwiched by functional material. Let us construct
the beginning of a simplified sample derivation which would eventually yield a
mid-projection lexical verb, one sandwiched by functional material:
(20)
A first answer (to be revised below) to why (20) is impossible can be given
as follows. Both will (e.g., a Tense head) and the Aspect head are Probes for
categorial Agree, as they bear [uV], uninterpretable [V] features. However,
they do not c-command a suitable Goal – for example, a lexical verb drink
bearing the interpretable [V] feature. Consequently, there is a violation of
(14) – and (16): of the two potential Probes, neither will nor Asp c-command
their potential Goal drink; an Agree relation cannot be established and the
derivation crashes. A welcome result of an explanation along these lines is that
the impossibility of mid-projection lexical heads can now be expressed without
seeking recourse to a templatic schema, cartographic or similar, where lexical
heads ‘must’ be stipulated to appear at the bottom of the tree.
Assume that both D and N are LIs – that is, monadic elements drawn
from the Numeration and not previously constructed by an application of
Merge. By (15) above, either of them may project because they both qualify
as potential heads of a projection; thus, between the two, it is the Probe of an
5.9 Categorial Deficiency and Agree 129
Agree relation that must project. Put otherwise, the label will be decided on the
basis of which of the two LIs, D or N, is a Probe for Agree. Of course, it has
been common knowledge since the formulation of the DP-hypothesis that in a
case like (21), it is the Determiner that projects, which in turn suggests that,
by (15), D is the Probe in some Agree relation. However, it is not self-evident
what kind of Agree relation could hold between a Determiner and a Noun. It
is at this point that categorial Agree turns out to have explanatory value.
Determiners, being functional, bear [uN] features; lexical nouns by definition
bear [N] features.18 So, the answer to the question of why D is a Probe – so as to
project – is because it hosts an uninterpretable categorial feature. This feature
makes it a Probe for categorial Agree and the noun, bearing [N], is its Goal.
Therefore, D, the Probe, projects after it is merged with N, because the
Determiner is functional, thus bearing a [uX] categorial feature, a [uN] more
specifically:
(22)
18
Well, nominalizers, actually.
19
However, categorial Agree obviously cannot predict the headedness of cases where two lexical
LIs merge. Could this be because directly merging two lexical LIs is an impossibility? The
ubiquity of applicative heads (Pylkkänen 2008), subordinating conjunctions (Cormack and
Smith 1996), relators (Den Dikken 2006) and the like might suggest so, but I wish to remain
agnostic here.
130 Functional categories
with a functional SO. Consider the following structure, where the LI the
merges with the nominal SO three little ducks:
(24)
Once more the Determiner the is a Probe for (categorial) Agree, because it
bears an uninterpretable categorial feature [uN]. The functional SO (e.g., a
Number Phrase) with which it merges contains only one instance of an
interpretable categorial feature, [N] on ducks. This is because the [uN] feature
on three has already been checked off and eliminated after the previous
application of Merge that created the SO three little ducks.20 Therefore, the,
a Probe for (categorial) Agree, projects. If this story is correct, then when a
functional LI and an SO merge, the functional LI will always project: this
straightforwardly predicts that functional LIs merging with a syntactic object
can never be specifiers. Epigrammatically put:
(26) When a functional [uX] LI and an SO merge, the functional LI, a probe for
categorial Agree, projects as the head of the new constituent;
(27) When a functional [uX] LI and an SO merge, the functional LI, a probe for
categorial Agree, cannot be a specifier.
At the same time, when two functional SOs merge, headedness of the resulting
object, its label, cannot be decided on the basis of categorial Agree. This is
because both SOs will, by hypothesis, each have had all their uninterpretable
categorial features already checked. Hence, a different instance of Agree will
be relevant for deciding the label of the resulting object.
20
Actually, in all probability three is the specifier of a Number Phrase with a null Num head, but
this is immaterial to the point made here.
5.9 Categorial Deficiency and Agree 131
(20)
On the basis of (14), it was argued that (the [V] feature of) drink cannot act
as a Goal for (the probing [uV] features of) will and Asp. However, there may
be a way to account for the impossibility of structures like (20) in a more
principled way. First, consider the fact that the phrase marker above is the
result of two applications of Merge. Let us then backtrack to the first applica-
tion of Merge that putatively gave the SO [will Asp], with which drink is
supposed to merge.
(28)
Looking at (28) the following question emerges: the exclusion in (16) notwith-
standing, why is it actually the case that (the [uV] feature of) will cannot probe
(that of) Asp? Or vice versa? Both will and the aspectual head are probes for
Agree. What prevents the [uV] feature of Asp or that of will from serving as a
Goal for categorial Agree? The condition in (14) would not be violated, as will
and Asp c-command each other.
The short answer to the above question in a conception of Agree as a
valuation process is that unvalued (and, presumably, uninterpretable) features
cannot serve as Goals, as they cannot provide the Probe with a value; recall
that this is stipulated in (16) but stems in a principled way from Chomsky
(2001, 6; 2004, 113) and much subsequent work. However, it is perhaps useful
to examine what would happen in examples like (28) if it could indeed happen
to be the first application of Merge in a workspace.21
The first problem in a situation like that in (28) would be which of the [uV]
features on each LI is the Probe. This cannot be resolved, as, in principle, both
[uV] features can be Probes for categorial Agree. Therefore, in principle,
both will and Asp would be possible to project. This in turn would result in
(a) optional labelling for the resulting constituent, or (b) an intrinsic failure of
the system to determine the head, or (c) co-projection of both will and Asp.
I take it with Chomsky (1995, 244) that the last two options are impossible and
21
I am grateful to Marc Richards and an anonymous reviewer for discussing interpretability of
Goals with me.
132 Functional categories
I would think that the first one, that of optional labelling, is highly undesirable,
too. In other words, the above scenarios run afoul of the received understand-
ing of how labels are set after Merge: when merging X and Y, we expect either
X or Y to project, as discussed in detail in Chomsky (2000, 2001, 2004).
Moreover, if Hinzen (2006, 187–9) is correct in saying that the label is
determined by lexical properties of the elements involved and that label
determination is not part of the definition of Merge, then it is plausible to
think that such relevant properties would interact with Agree and that there
would exist no context-independent mechanisms to decide the label of a newly
merged constituent.
Therefore, the relevance of the Probe–Goal relationship for determining labels –
(15): the Probe, a head, projects – and the ubiquity of (categorial) Agree due to
categorial Deficiency in (9) lead us to the following empirical generalization:
(29) We cannot begin a tree by merging two functional LIs.22
22
This has as a consequence that the claim in Abney (1987) – and much subsequent work – that
pronouns are bare – that is, nounless, Determiner constituents – must be false: a pronoun like
[DP D Num], in the absence of an n, will be impossible. However, following Panagiotidis (2002)
and Déchaine and Wiltschko (2002) in that all pronouns contain at least a minimally specified
nominal head, the tension is resolved in favour of the generalization here.
23
In the framework followed here I am not sure that two lexical LIs – that is, two categorizers –
can merge. What would something like an [n v] be? This is why no example is provided in the
‘[X] LI merges with [X] LI’ row. See also footnote 19.
5.10 Conclusion 133
5.10 Conclusion
In this chapter I have proposed that functional heads are essentially category-
less. On top of that, I have argued against categorial labels for functional heads
and against functional categories as word classes like nouns and verbs are,
claiming that functional heads are flagged by uninterpretable categorial fea-
tures, that they are categorially deficient. Building on this hypothesis I have
managed to derive the Categorization Assumption, the part where functional
heads cannot directly merge with roots, more specifically. Biunique relations
between functional and lexical elements as well as the nature of nominal super-
categories/Extended Projections/M-Projections also fall out from Categorial
Deficiency. Finally, the categorially deficient nature of functional heads – that
is, their encoding uninterpretable categorial features – was argued to make
them Probes for Agree relations with lexical heads; these Agree relations can
shed light on labelling and the position of lexical material inside a tree, as well
as on the impossibility of purely functional structures without any lexical
material.
6 Mixed projections and functional
categorizers
6.1 Introduction
The problem of mixed projections – that is, projections combining both
nominal and verbal/clausal functional subconstituents – is discussed in this
chapter. Our theory of categorial features combined with the existence of
uninterpretable categorial features can capture the existence and function of
mixed projections without novel theoretical assumptions. Section 6.2 intro-
duces the problem and Section 6.3 presents two empirical generalizations on
mixed projections: Categorial Uniformity and Nominal External Behaviour.
The following section reviews evidence against freely mixing nominal and
verbal functional elements – that is, without any concerns regarding categorial
uniformity or biuniqueness. Section 6.5 introduces functional categorizers,
Switch elements, as heads that both recategorize their complement, bearing
an interpretable categorial feature, and belong to the functional projection of
a lexical head. In Section 6.7 the type and size that the complement of a
functional categorizer can take is examined, as well as the phasal status of
Switch heads. Whether mixed projections all behave externally as nominals
is reviewed in Section 6.8, on the basis of the behaviour that verbal nouns in
Korean and Japanese display. Section 6.9 attempts to explain away differences
between mixed projections that consist of the same type of subconstituents –
say, Tense Phrases – by appealing to the different feature specification of the
heads within the verbal/clausal functional subtree. The need for functional
categorizers next to the better-known ones – that is, n and v – is addressed in
Section 6.10 and the last section concludes the chapter.
134
6.2 Mixed projections 135
categorizers (n and v), the same way one can argue that temporal relation
features make Tense heads. Categorial features are interpreted as fundamental
interpretive perspectives on the material in the categorizers’ complement.
Uninterpretable versions of categorial features flag functional heads and give
rise to a number of familiar phrase-structure phenomena: the biunique relation
between lexical and functional elements and the position of lexical material
at the bottom of a projection line, they also play a central role in labelling after
the application of Merge.
Given all of the above, we might expect syntactic projections to be either
nominal or verbal/clausal. In other words, we expect biuniqueness all the way
up the tree: essentially, the categorial feature [N] or [V] of the categorizer
should guarantee the categorial uniformity of the whole projection line. Graph-
ically expressed, we expect, for instance, that projections be exclusively made
up of functional elements with a [uV] specification, forming the verbal/clausal
entourage of a lexical verb.
(1)
The obvious question is how this system can accommodate mixed projec-
tions. Generally speaking, mixed projections are a problem for any system of
phrase structure that (implicitly or, as in this case, explicitly) admits total
categorial uniformity of projections. At the same time, most conceptions of
phrase structure would, for instance, prevent a D taking a TP complement or a
T taking a DP complement, and they would do so for very good reasons, pace
Alexiadou (2001) and (in a sense) Borer (2005). This is why the existence and
properties of mixed projections must be seriously addressed.
Of course, mixed projections have already been variously addressed, time
and time again; they have indeed posed a serious problem for syntactic
theories. In order to get a sample of the ways mixed projections have been
dealt with, let us briefly review the types of analyses proposed for Poss–ing
gerunds (after Hudson 2003), maybe the best-studied mixed projections.
A first analysis comes from Jackendoff (1977), who develops an insight in
Chomsky (1970): Poss–ing gerunds are exocentric NPs consisting of a VP.
This could be roughly understood as an NP without an N head but dominating
136 Mixed projections and functional categorizers
a VP, a state of affairs that was possible given the labelling and bar-level
conventions in the 1970s and, more importantly, something that captures a key
intuition: that Poss–ing gerunds, like most mixed projections, ostensibly
behave externally as nominal, despite their having a verb at their heart – this
is an intuition we will extensively scrutinize here. Baker (1985) argues that
gerunds are NPs headed by –ing, an affix which is then lowered onto the verb,
via a version of Affix-hopping; the underlying suggestion is that –ing is
nominal, which is a commonly shared assumption. Abney (1987), updated
and refined in Yoon (1996a), claims that in gerundive projections Det directly
selects IPs or VPs.
A more elaborate solution for the violation of biuniqueness in mixed
projections like gerunds is offered in Pullum (1991), who, within the HPSG
framework, proposes the weakening of the Head Feature Convention, thus
allowing the mother phrase and its head to have different values for
N and V. In a similar vein, in the HPSG account of Lapointe (1993), the NP
and VP nodes have ‘dual’ lexical categories <X|Y>, where X and
Y determine external and internal properties respectively: nominal externally
and verbal internally. Finally, in Bresnan (1997) – to which we will return – an
LFG account is offered, in which a single c-structure N (the gerund) maps to
both an N and a V position in f-structure.
Summarizing, the old chestnut of mixed projections has been attacked from
two viewpoints: either writing categorial duality into their head, as in Jackend-
off (1977), Pullum (1991), Lapointe (1993) and Bresnan (1997), or arguing for
a structure where an abstract nominal element selects a VP, as in Baker (1985),
Abney (1987) and Yoon (1996a). Here we will combine both lines of
reasoning in order to tackle the problem: thankfully, treating categorial fea-
tures as ordinary LF-interpretable features (and not as flags merely identifying
grammar-internal entities) combined with Categorial Deficiency enables us to
do exactly that.
Before presenting the analysis, let us first look at mixed projections in a
descriptive way and let us adumbrate the generalizations we can extract from
looking at empirical evidence.
1
This decision is not made on purely methodological considerations regarding the status of
adjectives. Mixed projections that combine adjectival and nominal characteristics are, well,
adjectives (recall the discussion in Section 2.8 of Chapter 2); mixed projections combining
adjectival and verbal characteristics will have to be set aside for future research, once there is a
more concrete picture of the categorial status of adjectives.
138 Mixed projections and functional categorizers
in the case of English Poss–ing gerunds, and (6), with verbal elements above
the nominal ones (a matter to which we will return in Section 6.8.1).
If Phrasal Coherence is a first generalization about mixed projections,
there is also a second generalization that can be surmised by surveying the
literature on mixed projections: Borsley and Kornfilt (2000), Malouf (2000),
Hudson (2003) more specifically: mixed projections externally behave as
nominals.
(8) Nominal External Behaviour: mixed projections externally behave as
nominal constituents.
(12) How can two categorially different subtrees, a nominal one and a verbal one,
be combined to form a single projection?
not Tense that ‘goes with’ a verb. Instead, a T category makes a root a verb: a
root inside TP will surface as a verb (Alexiadou 2001, 19). Similarly, it is not
the case that a Determiner ‘goes with’ a noun. Instead, a D category makes a
root a noun: a root inside DP will surface as a noun (Alexiadou 2001, 19). This
hypothesis is illustrated in the simplified trees below:
(14)
constituents that are clearly and fully nominal. Illustrating this with Greek
examples, consider kind readings of nouns in (15) and mass readings, as in
(16), where there is no need for any Determiner:
(15) Eyine papia
became duck
‘S/he became a duck.’
(16) Efaye papia
ate duck
‘S/he ate duck.’
In (15), which would be felicitous in the context of a fairy tale or similar, papia
(‘duck’) has a kind reading: the sentence is about something or someone
switching kind into the kind ‘duck’. In (16), papia (‘duck’) refers to duck
meat. In both cases there is no determiner layer present but papia is unam-
biguously a noun. What is even more interesting is that even NumP, another
functional projection which could be held accountable for categorizing roots,
is also most possibly perfunctory in the examples above, as neither example
involves individuation, to begin with; see Borer (2005, chap. 4, passim) for
very extensive discussion on the function and the semantics of individuation
and Number.
For the reasons reviewed above, the free-mixing version of how to capture
mixed projections will not be pursued here. Now, a second way to understand
how mixed projections are possible is to say that the two categorially uniform
subparts of a mixed projection are linked by a category-changing head, a
special type of categorizer. This is the path to be explored here, in a way that
will bring together the insights in two distinct schools of thought on mixed
projections. On the one hand, we will follow Jackendoff (1977), Pullum
(1991), Lapointe (1993) and Bresnan (1997), who encode categorial duality
into a head, our purported special categorizer; on the other hand, we will also
incorporate elements from analyses positing a structure where an abstract
nominal element selects a VP, as in Baker (1985), Abney (1987) and Yoon
(1996a): again, our special categorizer.
and Grohmann 2009).2 This Switch head ‘mediates’ between the nominal
and the verbal half, by virtue of its categorial feature makeup. More precisely,
Switches are categorizers, like n and v, hence they bear interpretable cat-
egorial features. What makes them special, and suitable as mediators between
categorially different functional layers of structure, is that they also bear
uninterpretable categorial features.
(17) Switches are categorizers that bear both interpretable [X] (i.e.,
‘categorizing’) and uninterpretable [uX] (i.e., ‘functional’) categorial
features.
Let us first consider how Switches would work and then discuss their nature.
I will exemplify on English Poss–ing gerunds.
Suppose, essentially following Reuland (1983) and Hazout (1994), that
gerundive projections contain a head Ger. This Ger head takes a verbal
complement but is selected by a Determiner or a Determiner-like element,
initiating a switch of categorial identity within the projection line. More
specifically, Ger takes a verbal complement, an AspP, as indicated by –ing.3
I take this Ger head not to be an ad hoc category or, even, a gerund-specific
element, but a functional categorizer as described in (17), a Switch. In
keeping with our analysis in the previous chapter, and in line with what was
claimed about the Categorial Deficiency of functional elements, we can now
simply claim that a Switch bears a [uV] feature: Switch behaves as a
verbal/clausal functional head that can participate in verbal/clausal projection
lines by virtue of its uninterpretable categorial feature, which probes its
complement for a [V] feature. Actually, van Hout and Roeper (1998) suggest
that the verb head overtly climbs up to Ger, but this is probably wrong, when
adverb placement is considered (Ad Neeleman, personal communication,
February 2007).
Now, this Ger head (our Switch) appears, like nouns do, in the comple-
ment of the possessive Determiner head of the null variety, which assigns
Genitive Case to SpecDP – as already claimed in Abney (1987), Borsley and
Kornfilt (2000, 105) and elsewhere. This Ger head will then contain an [N]
feature, given that Determiners bear a [uN] feature. Note that the intuition that
2
Lapointe (1999) was the first to talk about ‘category switchover points’. The term ‘switch’ is also
used in Schoorlemmer (2001) to describe the point in the structure of nominalized infinitives
where category changes.
3
On why –ing itself cannot be the nominalizing morpheme, contra Abney (1987), Milsark (1988)
and others, see Ackema and Neeleman (2004, 175–81). Johnson (1988) claims it to be a Tense
affix but here we side with Siegel (1998) in considering it to be just an aspectual marker.
144 Mixed projections and functional categorizers
So, Ger bears an [N] feature but is not a noun. Naturally, given the discussion
in Chapter 4, the [N] feature on the Ger/Switch can be explained if the Ger
head is a categorizer and, more specifically, a nominalizer, an n head.
4
This is what Abney (1987) and van Hout and Roeper (1998) also seem to argue for.
6.5 Switches as functional categorizers 145
Thus, a nominal Switch, like ‘Ger’, contains a [uV] feature besides its [N]
one. It is a functional categorizer.
The hypothesis of Switch as a functional categorizer should be quite
straightforward by now; essentially, a Switch can be identified with the
category-changing abstract phrasal affix in Ackema and Neeleman (2004,
172–81). Such category-changing affixes are postulated to attach on
projections of various levels. More precisely, Ackema and Neeleman discuss
nominalizing affixes attaching on verbal projections of various sizes, a point
to which we will return below. As regards its exponence, the category-
changing affix is phonologically null in head-initial languages and con-
versely so in head-final ones; this is a prediction made by a principle of Input
146 Mixed projections and functional categorizers
Correspondence (Ackema and Neeleman 2004, 140) that maps structures onto
morphological forms, and is borne out in a number of languages (176–81).
Now, the above claims immediately beg the following questions:
a. whether it is possible for two categorial features to co-exist on a
single head;
b. how come this co-existence does not induce a categorial clash;
c. what it means (LF-wise) for a syntactic head to be specified as [N] [uV].
First, as insistently claimed here, categorial features are not taxonomic-
classificatory markers; they are LF-interpretable features. Rather than flags used
purely to classify words and delineate constituents, categorial features
are genuine instructions to be interpreted at the interfaces, as expected from
formal features under Chomsky’s (1995) Full Interpretation. A Switch like the
so-called Ger head is a nominal categorizer by virtue of its interpretable,
perspective-setting [N] feature. At the same time, Ger bears a [uV] feature,
eliminable via categorial Agree: it can therefore be part of a verbal projection,
behaving like a ‘functional category’. Thus, the complement of an [N][uV]
Switch element will be recategorized, in a way familiar from simplex categor-
izers (recall the denominal verb tape): the complement of Switch will be
interpreted in the sortal perspective that [N] imposes. Additionally, the [uV]
Probe on Switch will search for a [V] target. There is no categorial clash
whatsoever, given that the [uV] feature of the Switch, along with the ones on
Asp, Voice and so on, will be eliminated before Spell-Out; at the same time, this
feature guarantees that a Switch acts like a functional element: given its [uV]
feature it cannot take root material directly as its complement.
This state of affairs immediately derives Phrasal Coherence in (5). In order
to illustrate this point, let us follow the derivational history of a mixed
projection using abstract phrase markers like we did in (13).
(22)
First, a v (bearing [V]) is merged with the root material and a number of
[uV] functional heads (FHs) recursively merge, giving a categorially uniform
verbal/clausal subtree through successive applications of categorial Agree
between the [uV] features and [V].
Then an [N][uV] head, the functional categorizer or Switch, is merged; its
[uV] feature probes for a [V] and agrees with it.
6.5 Switches as functional categorizers 147
(23)
However, the next head to be merged must be [uN], not [uV], on the grounds
of a version of minimality.5 We cannot merge a [uV] functional head with a
projection headed by a Switch because the Switch’s interpretable [N]
feature would intervene between a [uV] and the [V] on v, as illustrated below:
(24)
Consequently, the derivation will proceed as follows: the Switch head will
participate in the lower verbal/clausal subtree and effectively ‘begin’ the
nominal subtree dominating it, ‘switching’ the categorial identity of the deriv-
ation, it being a categorizer after all. The empirical result is that now Phrasal
Coherence is readily captured, with the functional categorizer acting both as
the Switching element (by [N]) and as the ‘glue’ (by [uV]) between the two
categorially distinct subtrees:
(25)
5
Uninterpretable [uV] creating an intervention effect for a probing [uN], and vice versa, is
independently necessary, as discussed in Chapter 5 in the context of deriving biuniqueness from
categorial Agree. This of course suggests that [N] and [V] are in fact values of an attribute
[perspective]. Thus, [N] must actually be [perspective:sortal] and [V] must be [perspective:
temporal], exactly as discussed in Chapter 5. I will, however, continue using [N] and [V] as
shorthand, for convenience.
148 Mixed projections and functional categorizers
Let us now briefly review how this proposal relates to the previous ones in
the literature. A primary virtue of the account presented here is that it does not
treat mixed projections as exceptional constructions resulting from weakened
labelling conditions – for example, allowing a V-headed constituent project as
an NP – or from the projection of categorially dual elements. Recall that this
second option is impossible, already by a version of Baker’s Reference–
Predication Constraint (2003, 165): the same syntactic node cannot be both
‘nominal’ and ‘verbal’. A Switch, being only nominal (as its [uV] will be
eliminated before it gets a chance to be interpreted), respects that.
Second, our analysis only utilizes the two categorial features independently
argued for and it invokes the concept of an uninterpretable feature as a Probe
for Agree to derive both ‘ordinary’ (i.e., categorially homogeneous) and
‘exceptional’ (i.e., mixed) projections. Focusing on the Switch head, it is
indeed a categorially dual head, as expressed by the intuition in Jackendoff
(1977), Pullum (1991), Lapointe (1993) and Bresnan (1997) – making the best
of the explanatory merits of the Agree system. Looking at the abstract phrase
marker in (25) or, more concretely, at the one of a Poss–ing gerund in (21), we
can observe that mixed projections are simply structures created when a
functional nominalizer, the Switch, selects a verbal projection, continuing
the line of reasoning of Baker (1985), Abney (1987) and Yoon (1996a).
However, in our case this nominalizer’s selection for a verbal projection comes
not as an ad hoc stipulation but due to Categorial Deficiency, precisely because
the nominalizer, Switch, is itself a functional element bearing a [uV]
feature.6
Summarizing, we capture the duality of mixed projections as the result of
the co-occurrence of an interpretable [N] and an uninterpretable [uV] feature
on the same syntactic node: mixed projections are not special in any way that
has to do with their phrase-structure status, and heads that are interpreted as
both nominal and verbal are not necessary in order to explain mixed
projections.
6
See Section 6.10 of this chapter on why we need functional categorizers in mixed projections,
instead of ordinary ones such as n and v.
6.6 Morphologically overt Switches 149
7
ERG ¼ Ergative Case, FN ¼ Functional Nominalizer, D ¼ Determiner, INESS ¼ Inessive Case.
8
Kornfilt (1997) is a classic description of the language in English.
150 Mixed projections and functional categorizers
In both (28) and (29), what follows the purported nominalizing suffixes, –me–
and –tığ– respectively, are purely nominal functional elements: nominal agree-
ment and (accusative) Case. Concentrating on (29), the important element in
examples like it is the underlying form of –tığ– (i.e., dIk) which Borsley and
Kornfilt (2000) gloss as ‘factive’. The interesting twist here is that the particu-
lar form also encodes past tense and that dIk is a form similar, although non-
identical, to the verbal past morpheme dI. Moreover, this ‘factive’ element also
comes in a future tense version, AcAk, which is identical to the verbal future
tense suffix. Hence, the example below forms a minimal pair with (29):
(30) Ben [siz-in tatil-e çık-acağ-ınız-ı] duy-du-m.
I you-gen vacation-dat go.out-fn.fut-2pl-acc hear-past-1sg
‘I heard that you will go on vacation.’
also to encode Tense features, making it both a genuine member of the verbal/
clausal projection line – and one carrying a temporal specification, too – and a
nominalizer at the same time. We could further speculate that this type of
Switch encoding temporal features is made possible in Turkish because
functional nominalizers, affixes in Ackema and Neeleman (2004, 172–81),
must be overt in head-final languages: it just happens that they additionally
carry temporal features.
However, unpublished work by Tosun (1999) offers a more elegant account
of how Turkish functional nominalizers of the ‘factive’ denomination end up
with past and future Tense specifications. First, she addresses analyses
according to which the ‘factive’ nominalizers, dIk and AcAk, are actually Tense
morphemes followed by a version of the Complementizer ki. In the case of dIk
this would entail analysing the form as dI (past) þ k (the Complementizer):
these forms would be the exponents of Tense þ Complementizer sequences.
However, evidence suggests that independent Tense/Aspect morphemes are
not allowed inside subordinate nominalized clauses:9
(31) gid-iyor-du-m in main clauses
go-imperf-past-1sg
‘I was going.’
(32) git-tiğ-im in embedded clauses
go-fn.past-1sg
(33) * gid-iyor-duğ-um in embedded clauses
go-imperf-fn.past-1sg
The above examples illuminate the following state of affairs: although two
Tense/Aspect morphemes may co-exist in Turkish main clauses, as in (31), no
such thing is possible in nominalized embedded ones (33), where an independ-
ent Tense head is not available. Actually, it is impossible to express aspectual
information in ‘factive’ nominalizations, as the ungrammaticality of (33)
demonstrates, and the correct form in (32) appears to encode Tense features
only via the nominalizer, which is dIk in this example. Tosun (1999, 7) claims
exactly this: that the ‘factive’ nominalizer – more precisely, a form like dIk
(i.e., –tığ–) in our (32) – ‘bears both tense and nominal features’. So, the above
examples take the same direction as Borsley and Kornfilt (2000, 108) in that
there is a single head encoding both nominal and temporal features.
Tosun (1999), however, argues against the existence of a single syntactic
head by looking at the availability of object shift and scope ambiguity with
9
I have adapted the glossing to reflect the working hypotheses here.
152 Mixed projections and functional categorizers
10
In the spirit of van Riemsdijk (1998b) and Hegarty (2005), one could perhaps speculate that
fusion is only possible between heads bearing identical uninterpretable categorial features – that
is, if both heads are marked as [uN] or as [uV]. Unfortunately, in the context of this study I can
only offer this as mere speculation.
6.7 Switches and their complements 153
11
Most of the work on phases looks only at uniform verbal/clausal projections (CPs). The question
of whether or not (categorially uniform) DPs are phases has been addressed to a lesser extent,
with some representative discussion in Svenonius (2004) and Hicks (2009, chaps. 4 and 5).
12
This statement on ‘plain’ nominalized infinitives is refined and revised below, in Section 6.9.2.
Panagiotidis and Grohmann (2009) also address the matter; the discussion in this section revisits
some of their arguments and observations.
154 Mixed projections and functional categorizers
13
Schoorlemmer (2001) also explains away an apparent violation of Categorial Uniformity: in
nominalized infinitives nominal and verbal properties look like they can be interspersed, with
direct objects of the verb showing up as phrases headed by van (‘of’).
The validity of the structures outlined in (37) and (38) is corroborated by the
fact that in Dutch an adjective modifying the nominalized infinitive may
precede an adverb (Ackema and Neeleman 2004, 174):
(39) Deze zanger is vervolgd voor dat constante [stiekem succesvolle
liedjes jatten]
this singer is prosecuted for that constant [sneakily successful
songs pinch.inf]
The structure for the example above is given in the phrase marker below:
(40) Adjective plus a verbal/clausal subtree with an adverb
The reverse, an adverb preceding the modifying adjective, is not possible, however.
(41) *. . . dat constant stiekeme [succesvolle liedjes jatten]
that constantly sneaky successful songs pinch.inf
‘This singer is prosecuted for constantly sneakily pinching successful songs.’
Again, the structure for the ungrammatical adverb–adjective order is given in
the phrase marker below:
(42) An impossible state of affairs: an adverb within the nominal subtree
In (44) a Voice Phrase includes the verb tocar (‘play’) and its object in
accusative Case, which is assigned by the Voice head. It must be that the
Switch takes that VoiceP as its complement, because the external argument
de Maria is expressed as an adjunct on the resulting nominal constituent, in a
manner reminiscent of that of ‘possessive’ subjects in English Poss-ing
gerunds. In (46) the verbal/clausal constituent is large enough to include a
Tense head, one that assigns nominative to a post-verbal subject yo (‘I’).14
A first conclusion is therefore that a Switch can appear in different positions
even in the same language: the complement of a Switch head in (36) is smaller
than the one in (35). Similarly, the complement of the Switch head in (43)
seems to be roughly the size of a VoiceP, excluding the nominative-assigning
structure, whereas in (45) the complement of the Switch seems to be the size
of (at least) a TP. Interestingly, in many languages, including Modern Greek,
a Determiner can appear in the syntactic nominalization of a full CP comple-
ment, a mixed projection:
14
At first glance, the nominative in the context of an infinitive is unexpected. Even if it is not a
complete TP that is nominalized here and even if the source of nominative is not T, it still
remains the case that the verbal/clausal constituent in (45) is large enough to contain a
postverbal subject. Yoon and Bonet-Farran (1991) discuss the Case-marking of Spanish infini-
tival subjects both in nominalized and in ‘sentential’ infinitives, arguing that Nominative is not a
default Case, but indeed is the result of Case-marking.
6.7 Switches and their complements 157
Before proceeding, note that neither of the above accounts suffers from the
paradoxical state of affairs of an analysis where D directly selects a CP. A [DP
D CP] constituent would either violate biuniqueness, as it has D (a [uN] head)
select a CP, or would force us to concede that all Complementizers are
nominal – see Roussou (1990, sec. 4.1), Davies and Dubinsky (1998, 2001)
and Manzini (2010) for discussion of the (non-)nominal character of (all)
Complementizers.
So, let us now turn to languages with bona fide nominalizations of complete
CPs, such as Polish and Greek.15 Roussou (1990) already discusses the
implausibility of an account involving an empty noun, on the grounds that this
15
Borsley and Kornfilt (2000, 116–17) discuss Kabardian as one more language where full CPs
can be nominalized and be assigned Case.
158 Mixed projections and functional categorizers
noun cannot host an adjective or, indeed, any other element associated with
nouns. Expanding on this observation by Roussou, nothing can intervene
between the article to (the neuter gender and default form) and the CP in
structures like the one in (47). Moreover, the role of eN, the phonologically
empty non-descriptive noun of Panagiotidis (2003b), is very well understood in
languages like Greek: its lack of interpretation underlies a number of pronom-
inal and elliptical structures, and their functions.16 However, in the context of
nominalized CPs eN would have to mean ‘fact’, as in Plann (1981) and Yoon
and Bonet-Farran (1991, 364–5), ‘matter’, ‘question’ and the like. This range of
different interpretations for the empty noun eN does not look feasible.
This last point takes us to the relevant Polish and Greek structures. In both
languages, all subordinate CPs can be nominalized: declarative and interroga-
tive, in either indicative or subjunctive; in Polish demonstratives to, tego and
tym are used, whereas in Greek the default article to is used. Some data from
the two languages clearly illustrate this:17
(50) [To, że Maria zmienia pracę] Jan oznajmił. Polish
that comp Maria is.changing job Jan announced
‘Jan announced that Mary is changing her job.’
[To oti i Maria alazi dhulia] anakinose o Yanis. Greek
the comp the Maria is.changing job announced the John
‘John announced that Mary is changing her job.’
The nominalized declarative CPs have been fronted to show that
they form a single constituent together with the Det-element introducing them.
Note that the examples above are compatible with both of the alternatives
in (49), as a ‘fact’ empty noun between the demonstrative or the article and the
clause would be compatible with their interpretation – keeping in mind, of
course, the objections on the distribution, interpretation and compatibility with
adjectives that characterize true eN in Greek. These objections become even
more relevant in examples like the ones below:
(51) Jan rządał [(tego), żeby Maria zmieniła pracę]. Polish
Jan demanded that comp Maria changed job
‘Jan demanded that Maria change her job.’
[To na fiyis] ine efkolo. Greek
the irrealis leave.2sg is easy
‘For you to leave is easy.’
16
In the spirit of Harley’s (2005a) analysis, eN would have to be identified with an n head not
associated with any root material; see the discussion in Section 4.5 in Chapter 4.
17
Polish examples and discussion from Borsley and Kornfilt (2000, 113–14).
6.7 Switches and their complements 159
(52) Jan zastanawia się nad [tym, czy kupić nowy samochód]. Polish
Jan wondered over that whether buy new car
‘Jan wondered whether to buy a new car.’
Eksetazis [to an tha fiyi]. Greek
examine.2sg the if will leave.3sg
‘You are considering whether s/he will go.’
(53) Jan zastanawia się nad [tym, kiedy kupić nowy samochód]. Polish
Jan wondered over that when buy new car
‘Jan wondered when to buy a new car.’
Epaneksetazo [to pote tha fiyi].
re-examine.1sg the when will leave.3sg
‘I am re-examining when s/he will leave.’
Epaneksetazo [to pion tha kalesis].
re-examine.1sg the who will invite.2sg
‘I am re-examining who you will invite.’
In (51) we have nominalized subjunctive CPs – thus the purported empty noun
would have to be interpreted not as ‘fact’ but as something else, although it is
unclear as to what exactly. In (52) the demonstrative tym in Polish and the
article to in Greek introduce nominalized subordinate yes/no questions and in
(53) an embedded wh-question with when and – for Greek – with who.
It becomes obvious from the above that no noun can mediate between the
D-element and the clause in nominalized CPs, even an abstract one meaning
‘fact’ and the like. At the same time, a Switch, a functional nominalizer,
inserted very high so as to take a complete CP as its complement, is compatible
with all the empirical facts presented and discussed in this section.
Pausing for a moment, we must ask why we have to argue that complements of
Switch must be the size of phases. The reasons will have to be of a theory-
internal nature but, hopefully, should reflect some more generally received
160 Mixed projections and functional categorizers
ideas about how syntactic structures are dealt with at the interfaces. Consider
(21) and suppose that Ger (our Switch) indeed takes an AspP complement,
as illustrated in the diagram. Is an Aspectual Phrase complete in any sense?
Does it constitute a complete interpretive unit? The answer is most likely
negative: Aspectual Phrases apparently denote time intervals in which events
unfold, but these time intervals are neither anchored in time nor suitably
ordered with respect to an instance and/or another time interval.
Why would this matter? It would if we expect that the two uniform subtrees
in a mixed projection would be able to stand as complete interpretive units. In
previous work, I have precisely argued for this – namely, that the complements
of Switch must be complete interpretive units by themselves. In Panagiotidis
and Grohmann (2009) a central claim is that complements of Switch must be
the size of Prolific Domains (Grohmann 2003), whereas in unpublished work
I take complements of Switch to be the size of a phase, claiming that
‘Switches will be merged with phases and induce themselves a phase’,
exactly as in (54) above.
This last claim brings us to an interesting dilemma. If Switch heads bear
interpretable categorial features, interpretive perspective-setting features, then
we expect them to be phasal heads, like ‘ordinary’ categorizers n and v
(see Chapter 4): in other words, we expect them to induce a phase themselves.
So, the claim that ‘Switches . . . induce themselves a phase’ is consistent
with what we have seen so far on categorial features and phasehood. However,
if a Switch head is phasal, its complement certainly need not be: for instance,
the complement of a phasal head like a Complementizer is certainly not a
phase. Being more precise about the phasehood of categorizers, both n and v
and functional Switch, categorial features on them are sufficient to impose
an interpretive perspective on material that cannot be otherwise interpreted –
for example, root material – and to complete a phase with it. Thus, categorial
features on both a categorizer (n and v) and a Switch could surely also make
a phase out of any other material consisting of UG features, including func-
tional structures. It therefore is not necessary for the complement of Switch,
a phasal head, also to be a phase. A fortiori, if Richards (2007) is correct, it
cannot be a phase: phase and non-phase heads must actually alternate.18
18
Furthermore, if Richards (2007) is indeed correct, then denominal structures (e.g., the verb tape
in Chapters 4 and 5) cannot have a [v [nP]] structure and, respectively, nominalizations cannot
have an [n [vP]] structure: both n and v are phase heads. This would suggest that a projection
must intervene between them, but this is an empirical matter that will not be explored here.
6.8 Are all mixed projections externally nominal? 161
Some suggestions about what these intermediate projections could be may be extracted from
Alexiadou’s (2001) account on different types of nominalizations.
162 Mixed projections and functional categorizers
there are no unambiguous exceptions to (8) and the state of affairs in (55).
Nevertheless, weaker challenges to Nominal External Behaviour exist and
apparent exceptions to Nominal External Behaviour are actually attested, even
if they are not examples of verbal/clausal external behaviour. Following the
discussion in Panagiotidis (2010), I will present and discuss an example of a
mixed projection that externally seems to behave as a non-nominal constituent:
Japanese and Korean verbal nouns.
19
Yoon (1996b) discusses –um and points out that it can also be used as a lexical nominalizer,
presumably an n, directly attached on verbs: for example, cwuk (‘die’) is nominalized as cwuk-
um (‘death’ or ‘dying’). Now, in (59) I argue that verbal nouns in Korean (and Japanese) also
involve an n head, which is (crucially for the analysis here) phonologically null. If this is the
case, then Korean –um can be the morphological exponence
(a) either of a functional categorizer, a Switch, when it nominalizes TPs,
(b) or of an n, when it nominalizes vPs in the absence of a Switch further up the tree.
In the structure in (62) there exist both a (low) nominalizer n and a (high) functional
nominalizer, a Switch, so n surfaces as a null morpheme.
6.8 Are all mixed projections externally nominal? 163
Now, unlike the other mixed projections reviewed in this chapter, VNs
cannot be arguments but are typically embedded within modifying expressions
with a temporal interpretation, as this Japanese example from Shibatani
(1990, 247) illustrates:
(57) [Sensei-ga kaigai-o ryokoo]-no sai . . .
teacher-Nom abroad-Acc travel.vn-gen occasion
‘On the occasion of the teacher’s travelling abroad . . .’
Alternatively, VNs can combine with a copula/light verb (the equivalent of do) to
yield the Light Verb Construction (Yoon and Park 2004); in these cases they
contribute the predicative content to the complex verbal predicate. Summarizing:
(58) Verbal nouns
a. are morphologically simplex nouns that may be modified by adjectives;
b. also contain a verbal/clausal layer that may license an Agent and assign
accusative and nominative Case, but not adverbs;
c. must contain a high nominal layer, which enables them to be complements
of temporal adpositions and which makes them possible (incorporated?)
arguments for light verbs.20
Let us put all those ingredients together and see what kind of structure
emerges. Our rationale here will be that the diverse characteristics of verbal
nouns must be the results of features structurally interacting with each other, in
the spirit of the analysis in Ahn (1991), albeit with different results. Thus, we
start with the fact that VNs display nominal morphology, precisely the way
that English gerunds always contain a verbal morphological chunk. Under the
fairly innocuous assumption that adjectives adjoin to noun phrases, we would
get the following structure as the bottom of a VN tree:
(59) A nominal subtree
Apparently, the nominal chunk in (59) is verbalized immediately above the nP.
Here a first crucial dilemma emerges, especially for our theory of categorial
features, which enables grammars to possess both categorizers, n and v, and
functional categorizers – that is, Switches: which of the two links an nP like
the one in (59) with the verbal/clausal functional material directly dominating it?
In principle, both a v and a Switch specified as [V][uN] would be possible here
20
This last point makes sense if one considers that in complex verbal predicates the non-light verb
element is necessarily non-verbal – recall the discussions about Farsi and Jingulu in Chapter 2.
164 Mixed projections and functional categorizers
and both would do the same job.21 If there is some economy metric according to
which a v head, consisting of a single [V] feature, is more economical than a
[uN][V] head, then we can perhaps argue that an ordinary categorizer, a v, will
do here. The intuitive idea behind this assumed economy metric here is as
simple as not inserting uninterpretable and unvalued features unless we have
to. Already suggested in the discussion below (17) is that a lexical categorizer
can categorize lexical material but not functional constituents and that a func-
tional categorizer, a Switch, will not be used to convert purely lexical material
(see also Section 6.10). Incidentally, a VN, which keeps its ability to take
adjectives, is in this analysis nothing more than a transparent denominal verb,
like tape. One of the ways it differs from tape, however, is that there is no
morphological fusion of the material under n with the material under v, and the
two heads remain separate with v being silent:
(60) A verbalized nP
What immediately follows is a fully blown verbal clausal layer, possibly a full
TP, if Nominative in Japanese and Korean is assigned by T, and given that
nominative arguments are possible with VNs. A Voice head, assigning
Accusative, is also required, in order to capture examples like (56) and (57)
above, where accusative Case is assigned.
(61) A verbalized nP with its verbal/clausal projections
21
Assuming of course that, for the sake of the argument, the unattested functional verbalizers, [V]
[uN] Switch heads, exist. Borsley and Kornfilt (2000, 120) claim that ‘there are no nominal
properties that reflect a nominal functional category located below a verbal functional category’,
having stipulated that ‘[c]lausal constructions with nominal properties are a consequence of the
association of a verb with one or more nominal functional categories instead of or in addition to
the normal verbal functional categories, appearing above any verbal functional categories’
(102). This is equivalent to saying that no functional verbalizers, [V][uN] Switch heads, exist.
6.9 The properties of mixed projections 165
22
More discussion on VNs and mixed projections used exclusively as modifiers can be found in
Panagiotidis (2010) and in references therein.
166 Mixed projections and functional categorizers
23
See Siegel (1998) and Pires (2006, chap. 1) on why Acc–ing and Pro–ing gerunds are not
mixed projections but bare TPs.
6.9 The properties of mixed projections 167
The Aspect node can be correlated with the licensing of adverbs, the
obligatory propositional reading, the providing of a specifier for Object Shift
and, of course, the possibility for perfect aspect.
24
All examples in this subsection are from Schoorlemmer (2002). The phrase markers in (65)
and (68) have been adapted in order to reflect this framework, which incorporates the Switch
head hypothesis.
168 Mixed projections and functional categorizers
The above class of ‘plain’ nominalized infinitives contrasts with one that has
very different properties, although it also constitutes a ‘low’ nominalization,
with the Switch taking a ‘small’ verbal/clausal complement. Such ‘plain’
nominalized infinitives, when modified by an adjective, may have event
readings. Compare the following with (63):
(66) ‘Plain’ nominalized infinitives with adjectives: event reading possible
Het gebeurde tijdens [het snelle tenten opzetten van Jan]
It happened during the quick tents pitch.inf of Jan
‘It happened during John’s quick pitching of tents.’
blueprint: every time two types of mixed projections appear to differ along a
battery of properties, the first attempt at explaining their differences should be
to examine if this battery can be correlated with the presence versus the
absence of a particular functional head inside the verbal/clausal subtree.25
In the case of low nominalizations in Dutch examined here, this was achieved
by correlating a battery of properties (event/proposition reading, adverbs vs
adjectives, Object Shift, specificity and aspect) with the presence or absence of
an Aspect head.
However, not all differences among mixed projections can be explained
along the lines of the presence versus the absence of a category: sometimes
structures containing the same verbal/clausal subconstituents can display very
different behaviours.
25
Of course, this is the line of reasoning followed in Section 6.7.1, where we teased apart TP
‘expressive’ nominalized infinitives and AspP ‘plain’ ones in Dutch.
170 Mixed projections and functional categorizers
Case and support non-null subjects, hence the two Tense heads have different
feature content. Similar facts hold for Korean and Japanese verbal nouns: both
overt subjects and nominative assignment are possible. What we have in
Spanish and Korean/Japanese is a type of Tense head that is reminiscent of
that in ‘absolute’ Greek gerunds (Panagiotidis 2010), with ‘quasi-independent
temporal reference’.26 In any case, differences can be accounted for by the
different feature makeup of the Tense head. Finally, the inability of adverbs to
be licensed within the TP of verbal nouns must again be a result of the feature
content or even the absence of the relevant functional head.
26
‘Absolute’ gerunds are the closest Modern Greek has to infinitives: their Tense head is not
morphologically expressed, it can license temporal (not just aspectual) adverbs, sanction quasi-
independent temporal reference, license periphrastic perfect tenses with an auxiliary, license a
pro subject, and assign nominative to an overt subject (Panagiotidis 2010, 173).
27
But recall footnote 18.
6.10 Why functional categorizers? 171
This is a question to which I have not been able to provide a final, rock-solid
answer, so what follow are comments and intuitions of a more speculative
nature, with some statements verging on the programmatic.
First of all, the difference between recategorizations and mixed projections
can generally be described as follows: when a verb is nominalized or a noun is
verbalized, it is lexical material (nP and vP) that participates as the lower
subtree; it is purely lexical material that undergoes the conversion. On the
other hand, in mixed projections, it is both functional and lexical material that
participates as the lower subtree: in mixed projections we ‘recategorize’ a
chunk of functional structure.
I think that we need to explicitly state a, so far elusive, principle to capture
this situation: categorization of root and lexical material is the job of a lexical
categorizer, whereas (re)categorization of functional material is the job of a
functional head, a functional categorizer. In other words, we need to find a
way, following on from general principles, that bans what looks like a counter-
intuitive state of affairs: a lexical head interrupting a purely functional struc-
ture. Future research must derive the impossibility of the following structure,
where the Switch in (25) has been replaced by a lexical nominalizer n:
(70)
Let us now see if any insight can be offered to help ban the likes of (70).
Generally, if n and v could freely take functional complements, then denominal
verbs and deverbal nouns and mixed projections, on the other hand, should
behave in exactly the same fashion; if this is indeed the case then we can
perhaps do away with functional categorizers. A second point is this: n and v
(or, more acrimoniously, the features [N] and [V]) appear to be necessary and
universal; all languages have n and v. If the lexical categorizers n and v were
responsible for mixed projections in addition to (re)categorizations, then
functional categorizers would be unnecessary. However, I am not confident
that all languages have mixed projections – but this is truly a matter of
empirical enquiry. Finally, arguing that the element which makes mixed
projections is a functional element with a particular feature structure – that
172 Mixed projections and functional categorizers
is, [X][uY] and, possibly, only [N][uV] – enables us to better frame the
problem of the apparent absence of a functional verbalizer, of a [V][uN]
Switch.
6.11 Conclusion
This chapter brings together two approaches to mixed projections, one arguing
them to be categorially dual constituents and one conceiving of them as
projections of a nominalizing head. I claim that they are indeed categorially
dual constituents as a result of their containing a functional categorizer: in the
same way that a lexical categorizer recategorizes lexical constituents, a func-
tional categorizer recategorizes functional ones.
Evidence for the existence of functional categorizers was presented from
head-final languages: Basque and Turkish. The existence of functional cat-
egorizers in natural language grammars was shown to be consistent with the
theory of categorial features presented and advanced here: it results from the
possibility of a categorial feature [N] to co-exist with an uninterpretable
categorial feature [uV] as parts of the same feature bundle, of the same head.
This nominalizing verbal/clausal functional head is understood to be a phasal
head and can take verbal/clausal constituents of a variety of sizes. Moreover,
as expected, verbal/clausal constituents of the same size – say, Aspect
Phrases – may contain elements with different feature content. This derives
the different behaviour of different mixed projections, as expected from a
formal approach: different constituents and different features entail distinct
grammatical behaviours and distinct batteries of grammatical properties.
The puzzle of what looks like the exclusive nominal external behaviour of
mixed projections has been recast as the result of only nominalizing Switch,
an [N][uV] head, existing. This merely rephrases the problem, one that future
research will resolve either by deriving empirical evidence that verbalizing
Switch heads – that is, [V][uN] elements – are possible or by deriving their
impossibility on independent principles.
7 A summary and the bigger picture
7.1 A summary
The theory in this monograph attempts to explain why nouns are different from
verbs and why they are most probably universal. In order to achieve this goal,
it builds upon a series of empirical discoveries, theoretical advances and
methodological principles of more than 50 years of work in generative gram-
mar: empirical discoveries like those regarding the nature of long-distance
dependencies, currently captured as the operations Agree and (internal) Merge,
aka ‘Move’; theoretical advances such as analysing the properties of lexical
items on the basis of their formal features – this being a theory of categorial
features, after all; and methodological principles like the functional–lexical
distinction.
At the same time, research in this monograph has deliberately striven to
synthesize insights, concepts and findings from a variety of frameworks and
approaches to grammatical structure and to language in general. Investigating
mixed projections, I turned to both LFG and HPSG in order to gain an
understanding of how these structures are organized and whether they are
exceptional or the consequence of principles applying everywhere else –
Categorial Deficiency, in our case. Far more importantly, in order to gain an
understanding of what kind of interpretive content each of the word classes
‘noun’ and ‘verb’ could have, I turned to functionalist and typological
approaches, like Baker (2003) did. Crucially, Langacker (1987) and Anderson
(1997) also proved major influences, in their combining notional approaches to
lexical categories with a firm conviction that their interpretation is one of
perspective, conceptualization or ‘grammaticalization’ of concepts – as
opposed to a viewpoint according to which lexical categories form large
pigeonholes into which different concepts are sorted.
Finally, this theory of categorial features attempts to deconstruct word class
categories. Beginning with a criticism of weak lexicalism, I embrace syntactic
decomposition of categories, in the version developed within the Distributed
173
174 A summary and the bigger picture
8.1 Introduction
There is a non-negligible point of criticism that can potentially be raised against
the theory of word class categories presented and discussed in this book, one that
views nouns and verbs as by-products of two fundamental LF-interpretable
features that set interpretive perspectives. The point would be roughly as
follows: Baker (2003) has already developed in detail a theory of lexical
categories based on two unary categorial features; moreover, Baker’s theory is
based on a wealth of empirical data. Why is a theory like the one presented here,
one that builds on assumptions, arguments and discoveries by Baker, necessary
at all? Why do we need yet another generative theory of word class categories?
First of all, Baker’s (2003) book exclusively discusses lexical categories:
nouns, verbs, and adjectives. Here, departing from categorial features as
makers of the word classes ‘noun’ and ‘verb’, we see that their uninterpretable
versions are necessary elements in the creation of functional heads and,
consequently, in the assembling of supercategories (Extended Projections);
we also argue that Agree relations among categorial features both affect
structure-building and play a central role in defining the label of a projection
after the application of Merge. We finally make good on our understanding of
the workings of categorial features so as to derive the (ultimately unexcep-
tional) nature of mixed projections. To be fair, some of the issues mentioned
are insightfully touched upon in Baker (2003, 2008), but not in the systematic
fashion we scrutinize them and account for them here.
A second point is that Baker’s theory, while recognizing the importance of
categories and their unary features being LF-intepretable, does not place
an emphasis on features themselves but on nouns, verbs and adjectives as
virtually primitive lexical categories. This is, of course, to be expected since,
in his system, [N] features exclusively appear on lexical nouns (and, possibly,
Determiners) and [V] features on lexical verbs (and, possibly, a functional
predicator Pred). Categorial features are not particularly active in his theory:
179
180 Notes on Baker (2003)
they merely create the above categories, nouns with their referential indices
and verbs with their specifiers; they then quietly wait to be interpreted at LF.
A third matter, of which this Appendix will be dedicated to providing an
overview, is the fact that a number of arguments and theses in Baker’s (2003)
theory of lexical categories (and his understanding of categorial features) are
the object of extensive controversy. Some of those arguments and theses
are quite central to his theory; others are primarily of methodological concern
or matters of interpreting empirical data. The purpose of this Appendix is
to discuss those controversial points in Baker’s (2003) account that merit a
rethink; it will also remind the reader of how the theory presented here
addresses these points, hopefully in a satisfactory fashion.
Baker (2003, 34–9; 2003, chap. 3 passim) sets out to explain away these and
more complex examples by taking predicate nominals to be always embedded
within a predicate-making functional projection: Bowers’ (1993) Pred.
1
Functional projections like CP and DP may also bear referential indices (Baker 2003, 139).
8.3 Predication and specifiers 181
This Pred head acts as the predicator while its specifier hosts the subject of
predication. According to this recasting of nominal predicates, these actually
have the following structure:
(3) Baker’s predicative noun configuration
2
Again, Chierchia (1998) provides the more or less standard account on what happens with
argumenthood in Determiner-less languages. Also to be consulted: Cheng and Sybesma (1999),
Massam, Gorrie and Kellner (2006), Willim (2000), Bošković (2008).
182 Notes on Baker (2003)
8.4 Are adjectives the unmarked lexical category? Are they roots?
Baker (2003, 190–2) claims that there is ‘nothing special’ about adjectives:
adjectives are exactly the lexical category which is not marked for any
categorial features. This entails that a non-categorized root, one not bearing
an [N] feature (forcing a ‘referential index’ for Baker) or a [V] feature
(resulting in the ‘projection of a specifier’), must surface as an adjective.
Accordingly, adjectives are the lexical categories that lack any particular
properties – that is, that lack categorial features: they are neither predicates
(like verbs) nor referential (like nouns). A very prominent consequence
of this take on adjectives, should it be true, is that adjectives would be
essentially uncategorized – in stark contradiction to the Categorization
Assumption. On top of that, as remarked in Chapter 2, if category-less roots
surface as adjectives, expressing ‘pure properties’, then all languages should
possess an adjective category. This is of course a claim Baker (2003, 238–63)
dedicates extensive discussion to, despite ample typological evidence to the
contrary: again, see Chapter 2. Furthermore, adjectives, as a default category
virtually indistinguishable from uncategorized roots, should at least some-
times surface as morphologically simple elements, with each root available in
a language having at least one adjective derived from it. However, recall that
adjectives in most languages involve more rather than fewer functional
layers – for example, they display concord with the noun inside a DP, just
like functional elements like quantifiers, demonstratives, possessives and
articles. Second, very few adjectives in Romance, Germanic and Greek are
underived, leading us to the implication that Adjective is definitely not the
unmarked category, let alone one co-extensive with roots. Third, if adjectives
are not marked by any categorial feature whatsoever, how are adjective-
making derivational affixes marked? Fourth, not all roots in a language
derive an adjective, contrary to what should be expected for an unmarked
(uncategorized) word class.
Finally, adjectives are taken not to project full phrase structure; thus they
can take no complements when used as attributive adjectives (Baker 2003,
195–6): ‘attributive adjectives cannot take complements, and cannot appear
with a fully fledged degree system, they can be a little more than just a head . . .
Apparently, it is possible for one head to adjoin to another to make a new head
within an attributive modifier, but that is all’ (196 footnote 5). Under a version
of the generalized X’ schema, the above sounds very odd – namely, banning a
whole lexical category from taking complements on the grounds of its being
unmarked. Effectively, this is a ban on Adjective Phrases, at least as attributive
184 Notes on Baker (2003)
3
The typology in (6) suggests that Pred (Bowers 1993) could be perhaps the ‘verbalizing’
equivalent of ‘category-shifting functional heads’ (‘functional nominalizers’), to which switch
(Chapter 6) belongs. This is an exciting topic for future research.
8.6 Co-ordination and syntactic categorization 185
There are several things to be said about the above table, which effectively
summarizes some key consequences of Baker’s account. First, adjectives
and degree elements ending up lumped together with adpositions is truly
counterintuitive, although it is a claim consistent with the discussion through-
out Baker’s monograph. Second, and crucially, there is no way to capture the
lexical–functional distinction using categorial features only; the lexical–
functional distinction is tacitly assumed to constitute a kind of an unexplained
primitive. Because of this blind spot, some basic questions remain
unanswered: how do we distinguish a verb from a Pred head? How do we
distinguish a noun from a Determiner? And so on. Third, a further distinction
between opaque (category-shifting) categories and transparent categories that
are members of the supercategory/Extended Projection to which the lexical
head belongs is made. Unfortunately, this difference between opaque and
transparent categories remains unexplained, though one account is offered in
Chapter 6 of this monograph: opaque categories must be Switch heads.
My purpose is of course neither to engage in nit-picking nor to vindicate the
theory presented here over its predecessor and a valuable source of intuition,
discoveries and methodological solutions. On the contrary, I wish to show that,
when proposing a theory of lexical categories or, in our case, of categorial
features, we need to look at the consequences for elements beyond lexical
categories.
Without getting into too much detail here, if VPs are – roughly speaking –
events and predicate nouns and adjectives in isolation are predicates, then the
examples in (7) can be accounted for without appealing to PredPs.
Let us now turn to the final issue, that regarding theoretical commitments.
In Chapter 5 of his book, Baker mainly discusses whether grammatical
category is inherent or syntactically assigned, as supported here. He argues
extensively against the syntactic categorization hypothesis, while he acknow-
ledges that his theory runs in parallel with it (Baker 2003, 267). In the
pages that follow (268–75), he offers a solid, coherent and very convincing
critique against radical categorylessness. He first calls upon evidence from
Incorporation to support the finding that distinct categorial behaviour in syntax
exists even in the absence of ‘any functional superstructure dominating the
4
Ellipsis and gapping are notorious for introducing much noise when co-ordinated, and are
set aside.
8.6 Co-ordination and syntactic categorization 187
5
Interestingly, all these points of criticism have been addressed by recent research on syntactic
decomposition and categorization, including this book. De Belder (2011) and De Belder and Van
Koppen (2012) contain some exciting work on compounding.
6
There is no escape from a complex structure for verbs – see also Chapter 4.
188 Notes on Baker (2003)
standard syntactic decomposition for verbs, such as Folli and Harley (2005) or
even Ramchand (2008). In other words, how is verb ¼ V þ A different from
verb ¼ v þ ROOT, especially if adjectives are not marked for categorial
features? Generalizing somewhat, and bringing nouns into the discussion,
how is Baker’s structure for predicative nouns in (3), repeated below for
convenience as (9), different from the one in (10)?
(9) Baker’s predicative noun configuration
The question is anything but facetious. For Baker’s view to have explana-
tory weight as a hypothesis which is in contrast with syntactic decomposition,
it must first be shown that the Pred and the N nodes in (9) are indeed distinct.
This is very difficult to ascertain within Baker’s framework because according
to it nouns by definition do not project specifiers. Consequently, predicative
N will always be strictly adjacent to Pred and, if Pred is phonologically null
and/or morphologically non-distinct from the N head, then it is extremely
difficult to establish the existence of both. Even if this is successfully done,
it is also very hard to show that both the Pred and the N nodes in (9) are
categorially marked.
So, it turns out that the differences between (9) and (10) are indeed very
narrow.
References
Abney, Steven P. 1987. The English Noun Phrase in Its Sentential Aspect. PhD thesis,
MIT, Cambridge, Mass.
Ackema, Peter, and Ad Neeleman. 2004. Beyond Morphology: Interface Conditions
on Word Formation. Oxford Studies in Theoretical Linguistics 6. Oxford
University Press.
Acquaviva, Paolo. 2009a. ‘The Roots of Nominality, the Nominality of Roots’. http://
ling.auf.net/lingbuzz/000824
2009b. ‘Roots and Lexicality in Distributed Morphology’. In York Papers in Lin-
guistics, edited by Alexandra Galani, Daniel Redinger and Norman Yeo, 2:1–21.
University of York. http://ling.auf.net/lingbuzz/000654
Acquaviva, Paolo, and Phoevos Panagiotidis. 2012. ‘Lexical Decomposition Meets
Conceptual Atomism’. Lingue E Linguaggio XI (2): 105–20.
Adger, David. 2003. Core Syntax: A Minimalist Approach. Core Linguistics. Oxford
University Press.
2006. ‘Remarks on Minimalist Feature Theory and Move’. Journal of Linguistics 42:
663–73.
2010. ‘A Minimalist Theory of Feature Structures’. In Features, Perspectives on
a Key Notion in Linguistics, edited by Anna Kibort and Greville G. Corbett,
185–218. Oxford University Press.
Ahn, Hee-Don. 1991. Clausal Architecture in Korean and English. PhD thesis,
University of Wisconsin, Madison.
Alexiadou, Artemis. 2001. Functional Structure in Nominals: Nominalization and
Ergativity. Amsterdam: John Benjamins Publishing Company.
Alexiadou, Artemis, and Gereon Müller. 2007. ‘Class Features as Probes’. In Inflec-
tional Identity, edited by Asaf Bachrach and Andrew Nevins, 101–55. Oxford
University Press.
Amritavalli, R., and K. A. Jayaseelan. 2004. ‘The Genesis of Syntactic Categories
and Parametric Variation’. In Generative Grammar in a Broader Perspective:
Proceedings of the 4th Asian GLOW, edited by James Hye Suk Yoon, 19–41.
Seoul: Hankook Publishing Company.
Anagnostopoulou, Elena, and Yota Samioti. 2009. ‘Domains for Idioms’. Paper
presented at the Roots Workshop, June, University of Stuttgart.
Anderson, John M. 1997. A Notional Theory of Syntactic Categories. Cambridge
Studies in Linguistics 82. Cambridge University Press.
189
190 References
Arad, Maya. 2003. ‘Locality Constraints on the Interpretation of Roots: The Case
of Hebrew Denominal Verbs’. Natural Language and Linguistic Theory 21:
737–78.
2005. Roots and Patterns: Hebrew Morpho-Syntax. Dordrecht: Springer Verlag.
Aronoff, Mark. 1994. Morphology by Itself: Stems and Inflectional Classes. Linguistic
Inquiry Monographs 22. Cambridge, Mass: MIT Press.
2007. ‘In the Beginning Was the Word’. Language 83: 803–30.
Aronoff, Mark, and Frank Anshen. 1998. ‘Morphology and the Lexicon: Lexicalization
and Productivity’. In The Handbook of Morphology, edited by Andrew Spencer
and Arnold M. Zwicky, 237–47. Blackwell Handbooks in Linguistics. Oxford:
Blackwell.
Baker, Mark C. 1985. ‘The Mirror Principle and Morphosyntactic Explanation’.
Linguistic Inquiry 16: 373–415.
1988. Incorporation: A Theory of Grammatical Function Changing. University of
Chicago Press.
2003. Lexical Categories: Verbs, Nouns, and Adjectives. Cambridge Studies in
Linguistics 102. Cambridge University Press.
2008. The Syntax of Agreement and Concord. Cambridge Studies in Linguistics 115.
Cambridge University Press.
Barner, David, and Alan Bale. 2002. ‘No Nouns, No Verbs: Psycholinguistic
Arguments in Favor of Lexical Underspecification’. Lingua 112: 771–91.
Basilico, David. 2008. ‘Particle Verbs and Benefactive Double Objects in English: High
and Low Attachments’. Natural Language and Linguistic Theory 26: 731–73.
Beard, Robert. 1995. Lexeme-Morpheme Base Morphology: A General Theory
of Inflection and Word Formation. SUNY Series in Linguistics. Albany: State
University of New York.
Bloom, Paul. 2000. How Children Learn the Meanings of Words. Cambridge, Mass.:
MIT Press.
Bobaljik, Jonathan. 1995. Morphosyntax: The Syntax of Verbal Inflection. PhD thesis,
MIT, Cambridge, Mass.
Bobaljik, Jonathan, and Höskuldur Thráinsson. 1998. ‘Two Heads Aren’t Always
Better than One’. Syntax 1: 37–71.
Borer, Hagit. 2003. ‘Exo-Skeletal vs. Endo-Skeletal Explanations: Syntactic Projec-
tions and the Lexicon’. In The Nature of Explanation in Linguistic Theory, edited
by John Moore and Maria Polinsky, 31–67. Stanford, Calif.: CSLI Publications.
2005. In Name Only. Oxford Linguistics 1. Oxford University Press.
2009. ‘Roots and Categories’. Paper presented at the 19th Colloquium on Generative
Grammar, April, University of the Basque Country, Vitoria-Gasteiz, Spain.
Borsley, Robert, and Jaklin Kornfilt. 2000. ‘Mixed Extended Projections’. In
The Nature and Function of Syntactic Categories, edited by Robert Borsley,
101–31. New York: Academic Press.
Bošković, Željko. 2008. ‘What Will You Have, DP or NP?’. In Proceedings of NELS
37, 101–14.
Botwinik-Rotem, Irina, and Arhonto Terzi. 2008. ‘Greek and Hebrew Locative
Prepositional Phrases: A Unified Case-Driven Account’. Lingua 118: 399–424.
Bowers, John. 1993. ‘The Syntax of Predication’. Linguistic Inquiry 24: 591–656.
References 191
Corver, Norbert. 1997. ‘The Internal Syntax of the Dutch Extended Adjectival
Projection’. Natural Language and Linguistic Theory 15: 289–368.
2005. Copular -Ly. Unpublished ms. University of Utrecht.
Corver, Norbert, and Henk C. van Riemsdijk, eds. 2001. Semi-Lexical Categories:
The Function of Content Words and the Content of Function Words. Studies in
Generative Grammar 59. Berlin: Mouton de Gruyter.
Croft, William. 1991. Syntactic Categories and Grammatical Relations: The Cognitive
Organization of Information. University of Chicago Press.
2001. Radical Construction Grammar: Syntactic Theory in Typological Perspective.
Oxford University Press.
Davies, William D., and Stanley Dubinsky. 1998. ‘Sentential Subjects as Complex NPs:
New Reasons for an Old Account of Subjacency’. In CLS 34–1: Papers from the
Main Session, edited by M. Catherine Gruber, Derrick Higgins, Kenneth S. Olson
and Tamara Wysocki, 34: 83–94. Chicago Linguistic Society.
2001. ‘Functional Architecture and the Distribution of Subject Properties’. In Objects
and Other Subjects: Grammatical Functions, Functional Categories, and
Configurationality, edited by William D. Davies and Stanley Dubinsky, 247–79.
Dordrecht: Kluwer.
Davis, Henry. 1999. On Nouns and Nominalizations in Salish. Unpublished ms.
University of British Columbia, Vancouver.
De Belder, Marijke. 2011. Roots and Affixes: Eliminating Lexical Categories from
Syntax. Utrecht: LOT.
De Belder, Marijke, and Marjo Van Koppen. 2012. ‘One Module, Different Levels of
Merge: ANN Compounds in Dutch’. Paper presented at the Brussels Conference
on Generative Linguistics 7, 17–18 December.
Déchaine, Rose-Marie. 1993. Predicates across Categories. PhD thesis, University
of Massachusetts, Amherst.
Déchaine, Rose-Marie, and Martina Wiltschko. 2002. ‘Decomposing Pronouns.
Linguistic Inquiry’. Linguistic Inquiry 33: 409–22.
Den Dikken, Marcel. 2006. Relators and Linkers: The Syntax of Predication, Predicate
Inversion, and Copulas. Linguistic Inquiry Monographs 47. Cambridge, Mass.:
MIT Press.
Diesing, Molly. 1992. Indefinites. Cambridge, Mass.: MIT Press.
Dixon, Robert M. W. 1982. Where Have All the Adjectives Gone? And Other Essays in
Semantics and Syntax. Berlin: Walter de Gruyter.
Don, Jan. 2004. ‘Categories in the Lexicon’. Linguistics 42: 931–56.
Donati, Caterina. 2006. Labels and Merge. Paper presented at the Interphases
Conference, May, University of Cyprus, Nicosia.
Doron, Edit. 2003. ‘Agency and Voice: The Semantics of the Semitic Templates’.
Natural Language Semantics 11: 1–67.
Drachman, Gaberell. 2005. ‘A Note on “Shared” Allomorphs’. Journal of Greek
Linguistics 6: 5–37.
Dubinsky, Stanley, and Sylvester Ron Simango. 1996. ‘Passive and Stative in
Chichewa: Evidence for Modular Distinctions in Grammar’. Language 72: 749–81.
Embick, David. 2000. ‘Features, Syntax, and Categories in the Latin Perfect’. Linguistic
Inquiry 31: 185–230.
References 193
Embick, David, and Alec Marantz. 2008. ‘Architecture and Blocking. Linguistic
Inquiry’. Linguistic Inquiry 39: 1–53.
Embick, David, and Rolf Noyer. 2001. ‘Movement Operations after Syntax’. Linguistic
Inquiry 32: 555–95.
Emonds, Joseph E. 1985. A Unified Theory of Syntactic Categories. Studies in
Generative Grammar 19. Dordrecht: Foris Publications.
Evans, Nicholas, and Stephen C. Levinson. 2009. ‘The Myth of Language Universals:
Language Diversity and Its Importance for Cognitive Science’. Brain and
Behavioral Sciences 32: 429–92.
Everett, Daniel. 2005. ‘Cultural Constraints on Grammar and Cognition in Pirahã:
Another Look at the Design Features of Human Language’. Current Anthropology
46: 621–34.
Family, Neiloufar. 2008. ‘Explorations of Semantic Space. The Case of Light Verb
Constructions in Persian’. Paper presented at the Conference on Complex Predi-
cates in Iranian Languages, 5 July, Université Sorbonne Nouvelle-Paris III, France.
http://iranianlinguistics.org/complexpredicates/presentations/family-2008.pdf
Felix, Sascha. 1990. ‘The Structure of Functional Categories’. Linguistische Berichte
125: 46–71.
Folli, Rafaella, and Heidi Harley. 2005. ‘Consuming Results in Italian and English:
Flavours of v’. In Aspectual Inquiries, edited by Paula Kempchinsky and
Roumyana Slabakova, 1–25. Dordrecht: Springer Verlag.
Folli, Rafaella, Heidi Harley and Simin Karimi. 2003. ‘Determinants of Event
Type in Persian Complex Predicates’. In Cambridge Occasional Papers in
Linguistics, edited by Luisa Astruc and Marc Richards, 1:100–20. University of
Cambridge.
Fox, Danny. 2000. Economy and Semantic Interpretation. Cambridge, Mass.: MIT Press.
Francis, Elaine J., and Etsuyo Yuasa. 2008. ‘A Multi-Modular Approach to Gradual
Change in Grammaticalisation’. Journal of Linguistics 44: 45–86.
Friedmann, Naama. 2006. ‘Speech Production in Broca’s Agrammatic Aphasia:
Syntactic Tree Pruning’. In Broca’s Region, edited by Yosef Grodzinsky and
Katrin Amunts, 63–82. Oxford University Press.
Fu, Jingqi, Tom Roeper and Hagit Borer. 2001. ‘The VP within Process Nominals:
Evidence from Adverbs and the VP Anaphor “Do-So”’. Natural Language and
Linguistic Theory 19: 549–82.
Fyndanis, Valantis. 2009. Functional Categories in Greek Agrammatism. PhD thesis,
Aristotle University of Thessaloniki.
Galani, Alexandra. 2005. The Morphosyntax of Verbs in Modern Greek. PhD thesis,
University of York.
Gil, David. n.d. What Is Riau Indonesian?. Unpublished ms. Max Planck Institute for
Evolutionary Anthropology, Leipzig. http://sastra.um.ac.id/wp-content/uploads/
2010/01/PU-David-Gil-Riau-Indonesian-.-.-..pdf
1994. ‘The Structure of Riau Indonesian’. Nordic Journal of Linguistics 17:
179–200.
2000. ‘Syntactic Categories, Cross-Linguistic Variation and Universal Grammar’. In
Approaches to the Typology of Word Classes (Empirical Approaches to Language
194 References
204
Index 205
criterion gender, 80
of application, 85 Ger head, 143, 146, 160
of identity, 85 not a noun, 144
of individuation, 88 German, 137
c-selection, 121 Germanic, 37
gerund, 135, 138–9, 143–4, 161, 165–6
Dagaare, 137 Goal, 127
defective intervention effect, 126, 147 grammatical noun and verb, 99
Degree, 47, 112 Greek, 45, 71, 90, 101, 112, 156–8, 165,
denominal verb, 63, 67–8, 76, 170 170, 184
descriptive content, 99, 174 Greenlandic, 187
Diesing effect, 152, 168
Distributed Morphology, 54, 58, 71 Head Feature Convention, 136
Dutch, 73, 76, 137, 153, 165–6 Hebrew, 93, 102, 137
Hindi, 73
Economy, 57, 164
empty noun, 99, 157–8 idiom, 61
Encyclopedia, 62 lexical, 93, 97
English, 49–50, 72, 139, 144, 166 idiosyncrasy, 58, 60–1, 64, 97
entity, 85, 88 Incorporation, 186
Eurocentrism, 24, 30, 177 indexical, 85
event, 87–8 Indonesian
exocentricity, 135 Riau, 30
exoticization, 28, 31, 177 Standard, 31
Extended Projection, 115, 119 inflected language, 27
deriving of, 126 inner morpheme, 69–70, 96, 98
interpretation of [N]
factive nominalization, 150 as sortality, 86
Farsi, 37, 39, 103 interpretation of [V]
feature, 57 as extending into time, 86, 89
affixal, 117 interpretive perspective, 3, 21, 83, 87, 89, 92,
as interface instruction, 2 96, 123
categorial, 80 provided by categorial features, 95
distinctive, 80 Italian, 90, 137
motivation for, 2
taxonomic, 12, 80 Japanese, 37, 43, 69, 138, 162, 165, 187
First Phase, 62, 68 Jingulu, 38
non-compositional, 97
Full Interpretation, 80, 146 Kabardian, 157
functional categorizer, 143, 145, 170 Kannada, 44
overt, 148 Kase, 149
functional category, 114, 118 Kikuyu, 137
functional head kind, 85
as satellite of lexical elements, Korean, 43, 152, 162, 165
110
cannot categorize, 91 label, 129
functional nominalizer, 148 language acquisition, 119
functional projection, 127 language attrition, 120
Fusion, 152 language disorder, 120
206 Index