Intro To Transformational Grammar
Intro To Transformational Grammar
Contents
Preface 1 The Subject 1.1 Linguistics as learning theory . . . . . . . . . . . . . . . . . . . . 1.2 The evidential basis of syntactic theory . . . . . . . . . . . . . . Phrase Structure 2.1 Substitution Classes . . . . . . . . 2.2 Phrases . . . . . . . . . . . . . . . 2.3 X phrases . . . . . . . . . . . . . . 2.4 Arguments and Modiers . . . . 2.5 Deriving Phrase Structure Rules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii 1 2 8 17 18 24 35 48 59 69 73 76 84 90 102 102 106 106 118 121 126 134 147 149
Positioning Arguments 3.1 Obligatory Speciers . . . . . . . . . . . . . . . . . . 3.2 Movement . . . . . . . . . . . . . . . . . . . . . . . . . 3.3 The Underlying Order of Complements . . . . . . . 3.4 Small Clauses and the Derived Subjects Hypothesis 3.5 Argument Movement . . . . . . . . . . . . . . . . . . 3.5.1 Passive . . . . . . . . . . . . . . . . . . . . . . 3.5.2 Raising . . . . . . . . . . . . . . . . . . . . . . 3.6 Quantier Float . . . . . . . . . . . . . . . . . . . . . 3.7 Innitives . . . . . . . . . . . . . . . . . . . . . . . . . 3.7.1 The CP/IP status of innitives . . . . . . . . 3.7.2 On the Control relation . . . . . . . . . . . . 3.7.3 Argument movement and PRO . . . . . . . . 3.7.4 PRO and Control: A Summary . . . . . . . . 3.8 Argument Movement and a typology of verbs . . . .
Contents
4 Verb Placement and Features 4.1 The Classic account . . . . . . . . . . 4.2 Movement to C0 . . . . . . . . . . . . 4.3 Verb Second word order . . . . . . . 4.4 The Pollockian revolution . . . . . . 4.5 AGREE . . . . . . . . . . . . . . . . . . 4.6 AGREE and Case . . . . . . . . . . . . 4.6.1 Low Subjects . . . . . . . . . . English . . . . . . . . . . . . . Subject Inversion Languages 4.6.2 Case and Agreement . . . . . 4.6.3 A constraint on Agreement . 4.7 Summary . . . . . . . . . . . . . . . . 5 157 158 167 172 181 195 214 220 220 224 230 235 239
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
Determiner Phrases and Noun Movement 241 5.1 The DP Hypothesis . . . . . . . . . . . . . . . . . . . . . . . . . . 243 5.2 Noun Movement . . . . . . . . . . . . . . . . . . . . . . . . . . . . 247 273 275
ii
Preface
These are the always evolving notes from an introductory course on syntactic theory taught at the University of Massachusetts at Amherst. Its target audience is rst-year graduate students, but no background exposure to syntax is presupposed. The course is taught together with a matching introductory course on semantics which, in the Fall of 2007, is led by Angelika Kratzer. These notes augment a set of readings, which are: Chomsky, Noam. 1965. Aspects of the theory of syntax. Cambridge, Massachusetts: M.I.T. Press, Chapter 1. Stowell, Timothy. 1981. Origins of phrase structure. Doctoral Dissertation, Massachusetts Institute of Technology, Chapters 13. Pollock, Jean-Yves. 1989. Verb movement, UG and the structure of IP. Linguistic Inquiry 20:365424. Vikner, Sten. 1995. Verb movement and expletive subjects in the Germanic languages. Oxford: Oxford University Press, Chapter 3. Larson, Richard. 1988. On the double object construction. Linguistic Inquiry 19:335392. Johnson, Kyle. 1991. Object positions. Natural Language and Linguistic Theory 9:577636. Kratzer, Angelika. 1996. Severing the external argument from its verb. In Phrase structure and the lexicon, ed. Johan Rooryck and Laurie Zaring, 109137. Dordrecht, The Netherlands: Kluwer Academic Publishers. Chomsky, Noam. 1981. Lectures on government and binding . Dordrecht, The Netherlands: Foris Publications, Chapter 3.
Preface
Chomsky, Noam. 1986. Knowledge of language. New York, New York: Praeger Publishers, Chapter 3. Reinhart, Tanya, and Eric Reuland. 1993. Reexivity. Linguistic Inquiry 24:657720. Kayne, Richard S. 1994. The antisymmetry of syntax. Cambridge, Massachusetts: M.I.T. Press, Chapters 15. Chomsky, Noam. 1995. Bare phrase structure. In Government binding theory and the minimalist program, ed. Gert Webelhuth, 383439. Oxford: Oxford University Press. Fox, Danny, and David Pesetsky. 2004. Cyclic linearization of syntactic structure. Theoretical Linguistics 31:146.
iv
1
The Subject
Linguistic theory, and so syntactic theory, has been very heavily inuenced by learnability considerations in the last fty-some years, thanks largely to the writings of Noam Chomsky. If we decide that syntactic theory is charged with the duty of modeling our knowledge of language, then we can make some initial deductions about what this knowledge, and therefore our model of it, should look like from some simple observations. This knowledge must interact efciently with the cognitive mechanisms involved in producing and comprehending speech, for instance. It must also be acquirable by any normal infant exposed to speakers of the language over six or so years. A number of considerations combine to make the task of acquiring knowledge of a language look very difcult indeed: the complexity of the acquired grammar, the amount of information that needs to be acquired, the attenuated nature of the information available to the child, etc. It is made even more puzzling by the fact that children appear to complete this task with relative ease in a comparatively short period of time and that the course of acquisition appears to go through a set schedule of stages. There is clearly a problem: If languages are as complex as we think they are, then how can these impossibly complex objects possibly be learned?
1. The Subject
XP { (ZP), X } X { X, (YP) } X { X0 , (WP) } Understand {, } to signify that and are sisters, and () to indicate that is optional. Let W, X, Y and Z range over kinds of lexical items (e.g., noun, verb, preposition, and so on). And, nally, let mean: consists of. The groups here, known as phrases, are the XP and X in the formulas. These constraints, then, leave to the learner only the matter of lling in the variables W, X, Y and Z, and discovering their linear order. As the child goes from step to step in matching the grammar he or she is constructing with the information coming in, these are the only decisions that have to be made. If we imagine that this set of options were to be operationalized into a concrete decision tree, then we could see this as constituting a kind of simplicity metric. It would constitute a procedure for searching through the space of learnable grammars that imposes an order on the grammars, enabling a deterministic method for converging at a particular grammar when exposed to a particular linguistic environment. Additionally, X Theory provides an absolute cap on the possible phrases and, in this respect, constitutes an inviolable constraint as well. If every language learner is equipped with this X Theory, then they will converge more or less on the same GL when presented with the information that being in the environment of speakers of L provides. If there are differences in the GL s that learners converge on, these will trace back to different decisions these learners have made about the identity of W, X, Y and Z, or how their linear order is determined. If the rest of a model that incorporates these constraints is correct, then, it should allow any language learner to pick out a GL very close to the GL giving shape to the speech in that learners environment. Lets consider another example involving transformational rules, one that Chomsky often points to. Transformational rules map one syntactic representation, D-structure, to another, S-structure, typically by way of moving constituents. Interestingly, it appears that all such rules are structure dependent. That is, they make reference to the relative structural positions of the moved thing and the position it is moved to. They dont, for example, make reference to points in a string on the basis of their position relative to some numerical count of formatives. Thus Wh-Movement moves maximal projections that meet certain criteria to particular positions in a phrase marker. And this op-
1. The Subject
the glass of the Poverty of the Stimulus argument, but from the perspective of language variability. He links limits on language variability to a universal ability he sees in human psychology to acquire a language. Humboldts goal is an explanation for the observed limits in variability of the grammars of extant languages. One might imagine that there are explanations for these limits that do not involve, as Humboldt proposes, constraints imposed by human psychology. Similarities in extant languages might reect their common ancestry: if all languages descend from a common one, then features that are shared among them could simply be vestiges of the ancestral language that historical change has left untouched. This is the thesis of monogenesis. I think its possible to read Sapir as advancing this alternative. Sapir is commonly associated with the position exactly opposite to Humboldts; in Sapirs words: Speech is a human activity that varies without assignable limit as we pass from social group to social group, because it is a purely historical heritage of the group, the product of long-continued social usage. (Sapir 1921, p. 4) But, perhaps because of his vagueness, its possible to credit Sapir with a more sophisticated view. One that assigns the universal properties of languages to the detritus of historical change: For it must be obvious to any one who has thought about the question at all or who has felt something of the spirit of a foreign language that there is such a thing as a basic plan, a certain cut, to each language. . . . Moreover, the historical study of language has proven to us beyond all doubt that a language changes not only gradually but consistently, that it moves unconsciously from one type towards another, and that analogous trends are observable in remote quarters of the globe. (Sapir 1921, pp. 120-121) Perhaps the common properties of extant (and known) languages are a function of two facts: all languages descend from a common language, and the forces that cause languages to change are not fully random they preserve certain features and change others only according to some basic plan. If historical relatedness is to explain the common traits that extant languages have,
1. The Subject
tion of the language acquisition procedure becomes available. This course is meant to illustrate these emerging pictures in detail.
1. The Subject
([bnIk]) is not. Presumably this ability to distinguish admissible from inadmissable forms is due to the knowledge speakers have of their language, and so it is an important piece of information about how that knowledge is constituted. A typical way of characterizing this distinction goes as follows. The phonology of a language permits many forms that are not exploited by the lexicon of that language (e.g., [blIk]). Which of these forms are used and which are not is completely extragrammatical. By contrast, because the phonology of a language limits the forms that are available to that language (e.g., English prevents the onset cluster [bn]) these forms (e.g., [bnIk] in English) will be blocked from its lexicon. The absence of these forms is determined by the grammar; they are said to be ungrammatical, and when they are cited, they are prexed with the diacritic * to indicate their status. The same distinction can be elicited for sentences, although because of the larger number of forms involved it is more difcult to recognize a novel sentence. Consider, by way of illustration, the pair of sentences in (1). (1) a. Whenever the earth revolves around its equator, the moon begins to rotate about its axis. b. Whenever the earth revolves around its equator, the moon begins itself to rotate about its axis.
I judge (1b) to be an impossible English sentence, and (1a) to be a possible one. Because I read very little science ction, I think its likely that both sentences are novel for me, but I do not have the certainty about this that I have about blick and bnick. I recognize that there are considerably more sentences that I have encountered than there are words Ive encountered, and consequently I also recognize that it is likelier that I will mistake a sentence as novel than it is that I will mistake a word as novel. Nonetheless, most linguists would agree that the contrast in (1) is of the same kind that distinguishes blick from bnick. It does seem unlikely that the distinction could be reduced to one of novelty. After all, I am roughly as certain of the novelty of (1a) as I am of the novelty of (1b) and yet this does not affect the strength of my judgement concerning their Englishness. It seems probable that my ability to judge the difference between (1a) and (1b) traces back to an ability my syntactic knowledge gives me to judge well-formedness. This distinction between grammatical and ungrammatical forms is important because it seems to tap directly into a speakers linguistic knowledge. Studying corpora cannot provide what is needed to see this distinction; corpora conate ungrammatical and grammatical but non-occurring forms. For
10
Upon reection, most speakers will recognize that (2a) has two meanings. It can assert that the time of my decision to marry was Tuesday, or it can assert that what my decision was was to marry on Tuesday. As we will see, this ambiguity reects the fact that (2) maps onto two sentences, whose difference in syntactic structure is responsible for the two meanings. The rst meaning corresponds to a structure which groups the words as sketched in (3a) on the following page, whereas the second interpretation corresponds to the syntactic structure shown in (3b). Unlike (2a), (2b) seems to have only the second of these two meanings. It can assert that my decision was for my daughter to marry on Tuesday, but it does not seem to say that the time of my decision was Tuesday. At present, this difference in (2a) and (2b) is thought to be due to constraints of sentence processing, and not the well-formedness conditions of sentences. The relevant difference between these examples is the number of formatives between the word decided and the prepositional phrase on Tuesday. As that number grows beyond what can be held in working memory, the processor is forced to start making decisions about how to parse the initial portions of the string. These decisions favor a parse in which later material is made part of more deeply embedded phrases. Thus, in the case of (2b) it favors the structure in (4b) over that in (4a) on the next page. On this account, then, it is not that there is a
11
1. The Subject
(3) a. NP I VP decided to marry b. NP I V decided S VP S to marry on Tuesday S VP PP on Tuesday
difference in the syntactic well-formedness conditions which causes speakers differing judgments about (2a) and (2b). Instead, because of the relative difculty that (2b) presents to the on-line processor, one of the syntactic representations associated with this string (i.e., (4a)) becomes difcult to perceive. This effect of the on-line processor is what Kimball called right association.4 In general, judgments of well-formedness will not be able to distinguish those sentences that do not conform to the constraints of the grammar from those that do conform to those constraints but present problems for the online processor.5 There is no simple way of distinguishing these cases; they can be separated only through analysis. In the case of (2), the decision that the effect is not grammatical but, instead, the result of the processor comes partly from nding no good grammatical way of distinguishing the cases and partly from nding that manipulating factors relevant for the processor determines whether the effect materializes. Another similar difculty involves the fact that the meanings which sentences convey are typically bound to the context of a larger discourse. Inevitably, then, grammaticality judgments are going to be confounded with whether or not there is a discourse in which that sentence could function. Suppose, for
4 See Kimball (1973), Frazier (1978) and Gibson (1998). 5 Chomsky and Miller (1963) is an early, and still useful, examination of this distinction.
12
instance, that you are trying to determine the distribution of a process called VP Ellipsis, which allows a sentence to go without a normally required verb phrase. VP Ellipsis is responsible for allowing the bracketed sentence in (5) to go without a verb phrase in the position marked . (5) Jerry annoyed everyone that [S Sean did ]. If you expose English speakers to the examples of VP Ellipsis in (6), you may nd that they judge them ungrammatical. (6) a. * Whomever she did got better. b. * Everything for her to was hard.
One might be tempted by these examples to the hypothesis that VP Ellipsis is blocked within subjects. But if the examples in (6) are embedded into an appropriate discourse, English speakers will nd (6a) well-formed while (6b) remains ungrammatical. (7) a. Whomever Sally didnt tutor got worse but whomever she did got better.
13
1. The Subject
(8) Two Laws of Elicitation a. The sentences for which you elicit a grammaticality judgement should be embedded in a discourse that makes the meaning that sentence would have salient. b. Every suspected ungrammatical sentence should be part of a minimal pair, the other member of which is grammatical.
b. * Everything for him to do was easy and everything for her to was hard. The problem with (6a) is that recovering the meaning of the elided VP cannot be done without a larger context, and the grammaticality of sentences with VP Ellipsis in them depends in part on recovering the meaning of the elided VP. There is nothing syntactically ill-formed with the VP Ellipsis in (6a), however, as we see when this context is provided. By contrast, neither the context in (7b) (nor any other that I have found) improves the goodness of (6b). There is something ill-formed about the syntax of this example. These two problems are similar. In both, the difculty is in distinguishing judgments of ungrammaticality from other types of ill-formedness. The effect of these difculties can be lessened if the following two practices are used in eliciting judgments. First, embed the sentences whose well-formedness you wish to determine in discourse contexts that make the meaning these sentences should have available and salient. This helps remove the second problem. Second, for every sentence you suspect to be ungrammatical, present your informant with a matching sentence which you suspect to be grammatical. These two sentences the suspected grammatical and the suspected ungrammatical one should differ minimally. Your aim should be to remove all differences between these two sentences except for the factor that you suspect is responsible for the ungrammaticality. This will help mitigate processing effects, as the two sentences will end up matched in length and close to matched in complexity. It will also help remove any other confounds which might be responsible for the ungrammaticality of the sentence you wish to test. These practices are rarely used, unfortunately. As a result, the history of syntactic research is littered with dead ends and wrong turns that have resulted from errors in the empirical base. Dont fall victim to these errors. Wherever you can, follow the Two Laws of Elicitation. In these lectures, I will
14
15
1. The Subject
moving error and making the data you report comparable to the data someone else gathers. Grammaticality judgments, then, will be the central evidence used here in uncovering the principles that constitute a speakers syntactic knowledge. There is one other kind of datum that is important to the syntactician. As weve seen, the syntax of sentences is intimately tied to the meanings they convey. It is the semanticists job to discover the principles that allow users of language to extract these meanings. One of the central principles of semantics is the law of compositionality. (10) The Law of Compositionality The meaning of a string of words, , is derivable from the meanings of the words that contains. As a result of the Law of Compositionality, there is a regular and productive relationship between the syntactic structure of a sentence and the meaning it conveys. This makes it possible to use the meaning a sentence has to draw conclusions about the syntax of that sentence. This requires the assistance of the semanticist, of course, for an intimate knowledge of the rules of semantic composition are needed to draw these conclusions. In recent years, this source of evidence has grown in importance, and it will be an important component of these lectures.
16
2
Phrase Structure
Our goal is to model the processes by which arrangements of words are recognized as forming grammatical sentences. As sketched in the previous chapter, this involves discovering how those processes vary from speaker to speaker, for only in this way will we get a handle on what features of these processes are universal and on how they are permitted to vary. I presuppose that readers of these notes have some familiarity with English, and so well begin this task by investigating those processes that are responsible for the English speakers grammaticality judgments. Our rst observation is that we can get very far in this task using very little information about the words involved themselves. A great deal about the processes that determine well-formed arrangements of words can be characterized using nothing more than the morpho-syntactic category that the words belong to. This can be appreciated by virtue of the Novel Form argument, which is laid out in (1).1 (1) a. If enough information is introduced with a novel word to enable the individual learning that word to recognize its category, then b. The individual knows which arrangements it can grammatically combine in. c. Hence, it must be category membership to which these processes refer.
2. Phrase Structure
If I introduce you to the word [blowrsnIk] (bloresnick), for instance, and tell you that it refers to unwanted facial hair, you will be able to determine that only (2b) of the examples in (2) is grammatical. (2) a. b. c. d. It bloresnicked. He removed the long bloresnick. She nds Sammy bloresnick. He made his face bloresnick.
There is a close connection between the category a word belongs to and its meaning, an issue we will revisit shortly. It is tempting to think that morphosyntactic category could, perhaps, be reduced to something about the meanings of the words involved. At present, however, there is no consensus about how to recast categories into wholly semantic terms and, given the differences in how languages seem to map meanings onto categories, there are real problems to overcome in doing so. Its also not clear that the pattern of grammaticality judgments English speakers give to (2) rests on differences in how their meanings are arrived at. Why cant the meanings of it and bloresnick in (2a) combine in the same way that they seem to in (3a), for instance? And why cant (2c) get a meaning like that assigned to (3b)? (3) a. It was bloresnick. b. She nds Sammy to be bloresnick.
There is information, then, about the kinds, or types, of words that is not identical to the kinds, or types, of things they refer to, and this information seems to underlie our knowledge about grammatical combinations of words. Lets see how far we get in modeling grammaticality judgments just by paying attention to morphosyntactic category.
18
Substitution Classes
we were to look at a large assortment of strings, we would be able to discover patterns that distinguish the grammatical sentences from the ungrammatical sentences. We would discover that categories have a particular distribution in the grammatical strings. Zellig Harris argued that morpho-syntactic category should be dened in just these terms.2 Specically, noun, verb and so on are substitution classes of vocabulary items. They are substitution classes in the sense that there is a set of positions within a sentence into which any member of that class can be substituted preserving the grammaticality of the sentence. For instance, any word that can be grammatically placed in the spot marked with in (4) falls within the subset of vocabulary items we know as nouns. (4) the exists
This is indicated by considering the lists of sentences in (5)-(10). (5) The lamp exists. The girl exists. The sky exists. The streetcar exists. . . . (6) * The happy exists. * The blue exists. * The short exists. * The at exists. . . . (7) * The in exists. * The out exists. * The from exists. * The on exists. . . .
2 Harris (1946) is an accessible introduction to this procedure.
19
2. Phrase Structure
(8) * The swim exists. * The have exists. * The ate exists. * The broke exists. . . . (9) * The slowly exists. * The apparently exists. * The always exists. * The decidedly exists. . . . (10) * The every exists. * The much exists. * The no exists. * The a exists. . . . As can be seen, this technique picks out a list of words that match what the grammar school curriculum calls nouns, and segregates them from the others. A similarly discriminating environment can be devised for each category. For each (major) word class, Ive given a distinguishing environment in (11). (11) a. have b. the c. dance d. in e. must eaten: Adverb thing: Adjective it: Preposition orange: Determiner there: Verb
Understand (11), and (4) as well, as abbreviating the following claim: there is a sentence that is grammatical which contains X Y, and for which replacing a word of category category into uniquely preserves grammaticality. So, for instance, (11a) should be understood as claiming that all the ways of completing the sentence in (12) involve lling with an adverb. (12) They have eaten rutabagas.
20
Substitution Classes
On this view, morpho-syntactic categories are simply partitions of the vocabulary into equivalence classes. The labels noun, verb and so on are merely convenient names for the resulting subsets of vocabulary items. There are a few things about the distinguishing environments in (4) and (11) that should be noted. First, they dene substitution classes solely on the basis of adjacent items. We might elevate this to a hypothesis. (13) Morpho-syntactic categories can be dened on the basis of what words they can be adjacent to. Second, the environments in (11) partition the vocabulary in ways that your language arts curriculum may not have. For instance, the Determiner class picked out by (11d) does not include much or many. There arent grammatical sentences that contain in much thing or in many thing as a substring. One reaction to this would be to allow much and many to belong to different word classes than every, the, a, and so on. We could admit the two additional word classes, Detmass and Detpl ur al , dened over the environments in (17). (14) a. in b. in syrup: Detmass oranges: Detpl ur al
This is a straightforward application of the procedure for dening morphosyntactic category that Harriss program offers, and it is one direction that syntactic theorists go. There is another reaction to these data, however, and it is the one I shall follow. Its clear by comparing the environments that dene Determiner and Detpl ur al that what distinguishes them is whether the word that follows is plural or singular. The difference between singular and plural is a semantic one, and so we should tie the difference between Determiners and Detpl ur al eventually to a semantic primitive. It is also a semantic difference, although a less familiar one, that distinguishes the Determiner and Detmass categories. Words such as syrup refer to entities which do not contain clearly delineated atomic parts, whereas words like oranges do. If one recursively divides a group of oranges into its parts, there will come a denitive point when we are down to the individual oranges that we will no longer be dividing a group of oranges. A group of oranges is made up of things that one could count. The same is not true of syrup. It is not clear how to nd the things that are grouped together to make syrup. Words that refer to entities that can be piled together, or taken apart, in the way that oranges can are called count nouns, while those that cannot are called mass nouns. The difference between the Determiner and
21
2. Phrase Structure
Detmass classes is just whether the term that follows them is mass or count. This is a semantic distinction.3 There is a clearly semantic generalization to be captured in distinguishing these classes of determiners, and we should strive to capture these generalizations in our grammar. There are generalizations hidden in the environments in (4) and (11) as well, but it is not at all clear that these are semantic generalizations. To see these generalizations, consider the following series of distinguishing environments for the word class noun, each of which is very similar to (4). (15) a. b. c. d. e. f. g. h. i. j. k. the eats some knows exists a few is every ate exists no some has put every a screamed drove few and so on
The generalization in this list is that the words anking the environment in which nouns are restricted are themselves of a word class; each member of this list ts the schema in (16). (16) determiner verb Each of the environments in (11) can be similarly converted into a generalization that makes reference to morpho-syntactic category. (17) verb: Adverb a. verb b. determiner noun: Adjective noun: Preposition c. verb
3 But it is still, apparently, one that arbitrarily maps onto lexical items, as Brendan Gillon reminds me. English decides to treat furniture as a mass noun, for instance, even though a group of furniture is, arguably, composed of clearly delineated parts. The point here is that there is a way of dening the mass/count distinction. It doesnt need to be rendered in terms of substitution classes. There remains the issue of how to align this semantic denition of the distinction with our, hopefully, evolving folk metaphysics.
22
Substitution Classes
d. preposition noun: Determiner e. Infl preposition: Verb (nb: The word must belongs to a morpho-syntactic category with a small set of members; Ive labeled it infl in (17e). Well soon encounter the evidence for this category.) At present it is not possible to reduce this generalization to a semantic one. That is, there is no known method of dening morphosyntactic categories in semantic terms.4 At present, the best that can be done is to dene morpho-syntactic categories in the terms that Zellig Harris gave us: substitution classes. The generalizations underlying (4) and (11) are at present irreducibly morpho-syntactic, then. Notice that converting (11) to (17) claims that the particular lexical items chosen will not matter. But, as weve just seen, it does matter: whether the noun in (17d) is count or mass or singular or plural will determine which of the Determiner, Detmass and Detpl ur al classes are well-formed in this position. To take the step that denes substitution classes in terms of other substitution classes, then, requires factoring out the semantic information and introducing, as a consequence, certain ungrammatical strings. One reaction to the differences among Determiner, Detmass and Detpl ur al , then, is to segregate the kinds of information that together determine the distribution of words into a syntactic component and a semantic component. This is the path we shall take. We assign to the semanticist the task of explaining the wholly semantic part of this job: why, for instance, much can be left-adjacent to a mass noun but not a count noun. In general, it is not trivial to know when it is the semantics or the syntax that is responsible for coccurence restrictions like those in (4) and (11), and the line is constantly being questioned. Harris, it seems, believed that virtually none of it was semantic, whereas present-day categorial grammarians push in the direction of removing an independent syntactic contribution. Ill chart a course that is somewhere in the middle. Morpho-syntactic categories, then, are dened syntactically. They are subsets of the vocabulary that can be substituted for each other in particular positions within a grammatical sentence preserving grammaticality. Moreover, the particular positions can be characterized in terms of adjacent morphosyntactic categories. The rst step in characterizing the grammaticality judgments of some speaker is recognizing that the vocabulary of that speaker is partitioned in this way.
4 For a recent attempt to dene some of the major categories in terms that verge on semantic, see Baker (2003).
23
This string can also be cordinated with another identical string of categories. Cordination involves the use of words called conjuncts, words such as and, or, nor, etc. Thus, we nd examples like (19) but not (20). (19) the happy woman and an unhappy man (20) a. * the angry and an unhappy man b. * the and an unhappy man
Finally, with respect to all these distributional tests, the strings D+N+P(reposition)+N, N+P+N, Adj+N, N, and (innitely) many others also pass. We need some way of describing the fact that these strings are the same, and different from, say, P+N which has a distinct distributional pattern. That is, this family of strings is a substitution class in the same sense that morphosyntactic categories are. Families of strings like this are called phrases, and we can write a Phrase Structure Rule to describe which strings belong to such a family. In the case at hand, this rule might look like (21). (21) P (D) (Adj) N Understand material enclosed within ( ) to be optional; (21) therefore generates the set of strings: D+Adj+N, D+N, Adj+N and N. This leaves out the strings D+N+P+N and N+P+N. But these strings involve another phrase, made up of the string P+N. This string, along with any string that conforms to the template P or P+P or P+P or P+P+P has the dening distribution in (22). (22) &P A a. I stood around. b. I knew the man by Mary.
24
Phrases
c. I remain disliked by Mary. d. I stood next to Mary. Like Ps, Ps may be cordinated with other Ps, but not with other Ps, as the following examples illustrate. (23) Under the bed and behind the sofa are usually good places to nd money in my house. b. * The dining room table and behind the sofa are usually good places to nd money in house. a.
Hence, just as with Ps, this family of strings constitutes a substitution class. Putting these observations together, we come up with the Phrase Structure rules in (24). (24) a. P (D) (Adj) N (P) b. P P (P) c. P P (P) (P) (P)
Material enclosed in { } offers a set of exclusive choices: exactly one of the members of the enclosed list must occur. In (25) this means that either P, P or, because these are both optional, nothing may occur after P to form a PP. Note that (24a) and (25) together have the property of being recursive. This is an important aspect of phrase structure rules for it is the primary means by which we describe the indenite length, and cardinality, of sentences. These two phrase structure rules are able to characterize innitely many and innitely long strings of words. This is a correct result, for we are, as far as our linguistic abilities go, capable of forming grammaticality judgments about an innity of sentences and about sentences of innite length. We have other properties that prevent us from doing this, of course. Our memories and attention are too ephemeral for even very long sentences; and even if we were able to overcome these cognitive limitations, our death will bring an eventual end to any sentence, or series of sentences, that we are evaluating. But there is no reason to think that this limitation is a linguistic one. We should let our model of grammaticality judgments characterize an innity of sentences, as well as permit sentences of innite length, and let the actual limits on the lengths and
25
2. Phrase Structure
(29) a. P V (P) (P) (P) b. P P (P) c. P (D) (Adj) N (P)
numbers of sentences that we evaluate be determined by other factors. The recursiveness of phrase structure rules is a step in that direction. Still another phrase structure rule is required to account for the fact that the family of strings that include V, V+P, V+P, V+P+P, and an innite set of other such strings is a substitution class. The environment that denes them is (26). (26) In a. I should eat rutabagas. b. I will talk to Mary. c. I will tell Mary about rutabagas.
In is a morpho-syntactic category that includes should, will, must, would, can, could and a few other words. Its an abbreviation for inection; the words that belong to it are tied to the inectional classes that verbs belong to, as we shall see. Like Ps and Ps, cordination treats members of this family as equivalent and distinct from Ps and Ps. (27) a. b. c. Mary walked and talked. Mary visited Paul and kissed Barry. Mary talked to Paul and met with Barry.
These facts call for a Phrase Structure rule like the following: (28)
P V (P) (P)
We have now arrived at the three rules in (29) below. There is a common property to all these rules. In each case, all of the constituents are optional, except one. Thus, a verb is the only necessary member of a P, a noun the only requisite member of an P and a preposition is all thats required to make a P. This is just another way of observing that the environments that dene these phrases are also environments in which a word class is dened. Further, the converse also turns out to be true: whenever there is a preposition, there is a P, wherever a noun is found, there is an NP, as so on. More precisely, the
26
Phrases
environments that dene a phrase will always include an environment that denes some category. Thus, nouns and P, prepositions and P, verbs and P are in one-to-one correspondence. This is a very pervasive property of Phrase Structure rules. Phrase Structure rules vary to a considerable degree across languages, but this property of them seems to always hold. Well confront two apparent counterexamples from English shortly, but these are probably only apparent counterexamples. So far as I am aware, there is no clear counterexample to this generalization. This property of Phrase Structure rules is known as endocentricity. The word that must be a member of the phrase is its head. Finally, it is common practice to name the phrases after their heads, so well rename P, NP, P PP and P VP. Thus, we now have the rules in (30). (30) a. NP (Det) (Adj) N (PP) (NP) b. PP P (PP) c. VP V (NP) (PP)
In addition to these three Phrase Structure Rules, well need quite a few others. Indeed, the principle of endocentricity leads us to expect that for every category, there will be a Phrase Structure rule that builds a phrase headed by that category. For example, corresponding to the category Adjective, there is a rule that builds adjective phrases; (31) is a good rst approximation. (31) AP A (PP) The presence of PPs within Adjective phrases is supported by the existence of strings like: (32) a. b. She is interested in syntax. She is interested. He seems happy with linguistics. He seems happy.
The cordination test also treats A and A+PP strings as being the same, as (33) indicates. (33) a. b. She is happy and interested in syntax. He seems bored but happy with linguistics.
Well also need a Phrase Structure rule that tells us how these various phrases are put together to form a sentence. (34) looks roughly right.
27
2. Phrase Structure
(34) S NP In VP The morpho-syntactic category that sentences are in a one-to-one relation with is In,5 and so in keeping with the convention of naming phrases after their heads, we should change (34) to (35). (35) IP NP In VP With this rule we have nally come to the task of characterizing the grammaticality judgments of English speakers. For any speaker of English whose vocabulary has been partitioned into noun, verb, preposition, adjective, determiner and In, (35), with the rules in (30), characterizes those strings of words that will be judged grammatical. This is just a rst step, of course. We have hundreds of pages left. In fact, its possible to see something wrong with (35) right away. It says that no sentence can fail to have an In between NP and VP, but if In are just words such as can, could, will, and so on this is obviously wrong. There are grammatical sentences aplenty that fail to have these words in them. (36) is an example. (36) Jerry walked. Where is the In between Jerry and walked in this sentence? If we look hard, we nd that sentences are, in fact, in a one-to-one correlation with a category, but that category includes not just words, but bound morphemes as well. Consider the sentences in (37). (37) a. b. c. d. e. f. g. h. Jerry leaves. Sally left. Sam has left. Sarah had left. Martha should leave. George might have left. Laura desires [Sal to leave]. Larry remembers [Jim leaving].
5 In this case, however, unlike what we found for the VP, NP and AP rules, In is not the only obligatory member of a sentence. It is presently controversial whether sentences are the only phrases that have this property. We will see in later lectures that there are syntactic reasons for the obligatory presence of NP, and likely semantic ones for the obligatory presence of VP.
28
Phrases
The boldfaced terms have similar distributions: they are found either immediately preceding the verb (if they are free) or afxed onto the following verb (if they are bound). Every sentence has one of these, and so these terms meet the criteria of being the head of a sentence. To explain how it is that those Ins which are bound morphemes materialize afxed onto the following verb, we will have to invoke a process that goes beyond phrase structure rules. Let us put off doing this. As we gather more detail about the shapes of grammatical English sentences, we will need to make quite a number of additions to these rules. In fact, to be exhaustive about this proves to be a task beyond what we can manage here; we should consider this an open-ended process. Nonetheless, I want to gather a little more detail than we now have. Ill begin by adding a couple of phrases to our inventory. One of these is a sort of sentence found in examples like (38). (38) a. b. c. d. Mary said that John likes chocolate. Mary recalled the rumor that John likes chocolate. That John likes chocolate bothers Mary. Jerry is angry that John likes chocolate.
Note that the strings following the word that meet the conditions imposed by the rule that builds IPs. The word that is called a complementizer and it is the head of the phrase found in these sentences. This phrase, or clause as sentencelike phrases are often called, is a Complementizer Phrase (CP). CPs conform to the requirements of the following Phrase Structure rule.6 (39) CP C IP Other complementizers are if and whether, as found in the following examples. (40) a. b. I wonder if Mary likes chocolate. I asked whether Mary likes chocolate.
Having introduced this constituent, we will now need to revise our previous Phrase Structure rules to include the positions where they may lie. This yields the following battery of rules.
6 The conclusion that sentences are headed by In, and that the subordinate sentences that include that are headed by complementizers, is reached in Chomsky (1986a). In the generative literature that precedes this work, these two types of sentences were given various treatments. Bresnan (1972) suggested that that-clauses were of the same category as sentences; she called them Ss. Jackendoff (1977) argued that sentences are larger VPs, headed by a verb. The IP/CP model is now standard, though the issue is periodically revisited.
29
2. Phrase Structure
(41) NP I VP CP b. NP (D) (AP) N (PP) (CP) c. VP V (NP) (PP) (CP) d. AP A (PP) (CP) a. IP
Note the option of having a CP in place of an NP at the beginning of a sentence. Note too that Ive brought the NP rule into conformity with the principle of endocentricity. Our earlier rule (i.e., NP (D) (Adj) N (PP) ) permitted an adjective without an adjective phrase. Ive replaced (Adj) with (AP) to correct for this. Well see the empirical support for that change shortly. The second phrase well need are ones headed by adverbs. Adverbs are a word class that is sometimes dened on the position left adjacent to adjectives, as in (42). (42) a. b. a. b. A very happy child The extremely large boat I have deliberately misled. I have noticeably erred.
Interestingly, to a large extent, the set of adverbs that can be immediately preceding a verb is a proper subset of those that can immediately precede an adjective. (44) (45) a. A deliberately angry child b. The noticeably large boat a. * I have very misled. b. * I have extremely erred.
Here is one of those places where we must judge whether we have two morphosyntactic categories or we have a semantic distinction. To judge from these few examples, it seems that the adverbs which indicate an extent or measure are ne in the pre-adjectival position but not in the preverbal position. This description of the contrast makes reference to the meanings of the words involved, and so it could reect a semantic fact. On the other hand, it may be that we are looking at a contrast due to morpho-syntactic category, but one that correlates with (or perhaps is dened by) these semantic factors. I dont
30
Phrases
know how to decide between these alternatives. I will gamble that this is a category distinction like the others we are examining. Lets call the category of words that Ive characterized as denoting an extent, degree, and reserve the term adverb for the others. The class of Degree words can also show up in combination with adverbs, as illustrated by (46). (46) a. b. Mary very quickly walked through the full classroom. Mary extremely loudly declared her interest in Dutch phonemics.
Indeed, wherever an Adverb can be found, so also can a Deg+Adverb string. Our phrase structure rule for adverb phrases will be (47), therefore. (47) AdvP (Deg) Adv If we admit the category Degree, then the principle of endocentricity tells us that there must be a phrase that has the same distribution. It is very difcult to identify a (non-trivial) string that has the same distribution as Degree words. Those strings are most easily seen in so-called comparative constructions, a complex phenomenon that we will hopefully manage to skirt in these lectures. As a stop-gap, let us do with the questionable rule in (48). (48) DegP Deg We should modify (47) accordingly. (49) AdvP (DegP) Adv The existence of degree phrases and adverb phrases now requires that we modify some of our other rules so that they can be positioned within sentences correctly. Adjective phrases will have to be changed to allow for degree phrases within them, as in (50). (50) AP (DegP) A (PP) (CP) And renements to the rule that characterizes verb phrases will be necessary in order to position adverb phrases within them. (51) VP (AdvP) V (NP) (PP) (CP) There are some other changes to the VP rule that are necessary. Note, for instance, that VPs may occur immediately following a verb, as in (52). (52) a. b. Mary has walked. Mary has talked to John.
31
2. Phrase Structure
c. Mary has visited Gary.
Interestingly, if the verb heading a VP is followed by another VP, nothing else may follow the head verb. For instance, Mary has on the platform walked is ungrammatical. We need, therefore, to modify the VP Phrase Structure rule in such a way that the head verb is followed by a VP, or by the expansion previously arrived at, but no combination thereof. This can be done with the aid of curly brackets in the following way: (53) VP (AdvP) V (NP) (PP) (CP) VP
Further, it is possible to nd APs embedded within VPs; (54) provides some examples. (54) a. b. Sally remains angry at Jim. Frank is happy with himself.
When APs follow verbs, they may be preceded by, at most, a PP, as in (55). (55) Jerry seems [PP to Bill ] [AP happy with his rutabagas]. So we change the rule that characterizes VPs to: (56) VP (AdvP) V
(NP) (PP) (CP)
VP (PP) AP
Finally, consider that part of the NP rule that introduces determiners. Determiners include words like the, a, that (not to be confused with the complementizer that), every, some, all, etc. Interestingly, its very rare that we nd determiners combining with other words to form a phrase that combines with a following noun. A couple of examples which might constitute cases of this sort are given in (57). (57) a. b. all but three dogs more than most people
I dont know precisely what the Phrase Structure rule is that determines which strings may stand in this position. Nonetheless, one common approach to these cases is to imagine that determiners head their own anemic phrases, which are then positioned within NPs. We will revisit this idea, but for now lets imagine that determiner phrases are made up of nothing but determiners.
32
Phrases
(58) DP Det Well therefore need to update the phrase structure rule that forms NPs. But before we do this, lets consider strings like those in (59). (59) a. b. c. Marys book the mans toy the man on the tables nose
These examples involve a possessive or genitive phrase. Note that this phrase is an NP with the morpheme s appended to the end. Further, note that this genitive phrase never co-occurs with a DP, as (60) illustrates. (60) a. * the Marys book b. * the the mans toy c. * a the man on the moons nose
One very typical explanation for this is to understand determiners and possessives as competing for the same position. In this situation, that can be done by rigging the NP phrase structure rule in such a way that it either produces a DP or a genitive phrase in the same position. This is done with the curly braces abbreviation in (61). (61) NP (DP) (NPs) (AP) N (PP) (CP)
One nal Phrase Structure rule is required by the sorts of examples weve so far reviewed. This is the Phrase Structure rule that generates cordinated phrases. This can be done with the following. (62)
Conj
This rule says that a phrase of any category can be made up of two other such phrases with a conjunct stuck between them. Conjuncts, recall, are and, or and but. Summarizing, weve now introduced the following battery of Phrase Structure rules: (63) a. IP b. NP NP CP I VP (AP) N (PP) (CP)
(DP) (NPs)
33
2. Phrase Structure
c. VP (AdvP) V d. e. f. g. h.
(NP) (PP) (CP)
VP (PP) AP
DP Det DegP Deg AdvP (DegP) Adv AP (DegP) A (PP) (CP) CP C IP (NP) i. PP P (PP) j. Conj
An interesting property of the phrases dened in (63), a property which Harris discussed, is that they all distinguish the phrases they are dening from the word class which that phrase matches. One might wonder why there is this distinction. We have decided that the way to dene category is as classes of words that can substitute into a certain collection of positions. Weve also used this very denition for phrases. And the principle of endocentricity tells us that there is a one-to-one correspondence between category and kind of phrase. What we end up with, then, is a situation where the positions that dene a phrase are always a proper subset of the positions that dene a category. We might wonder why the positions dening a phrase are always a proper subset of those that dene a word class; why arent they the very same positions? Why, for instance, isnt the rule for noun phrases something like (65) rather than (65)? (64) NP (DP) (AP) N (PP) (65) N (DP) (AP) N (PP) Harris argues that phrases need to be distinguished from word class, and that this is a general property of phrase structure rules. He points out, for example, that while singular nouns are in the same substitution class as are plural ones, a plural noun cannot substitute for a singular one when it combines with the plural morpheme. He assumes that phrase structure rules control inectional morphology, and therefore, that in addition to the rules we have discovered, there is also a rule such as (66) that produces plural nouns. (66) NP N s
34
X phrases
(We will adopt something similar in a few chapters.) His point, then, is that this rule should not be (67) because that would wrongly produce doubly pluralized nouns such as (68). (67) N N s (68) kitses [kIts1z] This is the reason, then, why our phrase structure rules look like (64) and not (65). Or, to put it somewhat differently, we do not want these rules to be recursive with respect to their head. The phrases weve encountered so far all have this property. But interestingly, it turns out that not all phrases do. Some phrases are headed by other phrases. And these phrases, it turns out, are identical. We turn to these cases next.
2.3 X phrases
There are substitution classes that pick out strings which are recursive on themselves. That is: these phrases are headed by themselves. These phrases are found inside those weve identied so far. For example, in the position marked by in (69), we nd the family of strings in (70). Some examples are in (71). (69) Det V (70) {N, AP N, N AP, N PP, AP N PP, N PP AP, AP AP N, N PP PP, AP AP N PP, AP AP N PP PP, . . . } (71) the woman left. the happy woman left. the woman unhappy with the lecture left. the happy woman with a hat left. the woman with a hat unhappy with the lecture left. . . . And cordination also reveals that this set of strings forms a family: (72) The woman and happy man left. The happy woman and man with a hat left. . . .
35
2. Phrase Structure
Now this family of strings does not appear to be the family we have called NP. There are two, related, reasons for this. First: there are grammatical strings from the second family which cannot be substituted for instances of the rst family, as (73) indicates. (73) a. The woman left. b. * Woman left.
Second: a close inspection of the set that the second family is made up of indicates that it does not share Harriss property. This family is recursive with respect to itself. Unlike NPs, which can have only one instance of a DP inside them, the phrase weve discovered here can contain any number of strings of the same kind as itself. So, we set up something like (74). (74) a. NP b. c. d. e. (NPs) (DetP) N
N AP N N N AP N N PP NN
Note how these rules encode the optionality of AP and PP differently than the optionality of DP. And note, further, that they are all endocentric on N. They also leave out the position of CP; this is because tting CPs into this structure poses a problem. We will return to it in just a moment. We nd the existence of very similar subphrases within VPs as well. Consider, for instance, the environment in (75), which permits the family of strings in (76), as (77) exemplies. (75) NP (76) CP {V, AdvP V, V AdvP, V PP, AdvP V PP, V PP AdvP, V AdvP PP, AdvP V PP PP, AdvP AdvP V, AdvP AdvP V PP, V PP PP, . . . }
(77) Sally said that Jerry left. Sally quickly said that Jerry left. Sally quickly said to Peter that Jerry left. Sally said to Peter quickly that Jerry left. Sally said quickly to Peter that Jerry left. Sally carefully said to Peter on Tuesday that Jerry left. . . .
36
X phrases
And, as before, cordination recognizes this family. (78) Sally shouted and whispered that Jerry left. Sally loudly shouted and whispered that Jerry left. Sally shouted to Peter and quietly whispered that Jerry left. . . . Again, this subphrase is recursive and headed. So we have something like: (79) a. b. c. d. e. VP V V AdvP V V V AdvP V V PP VV
These rules leave out the expansions of VP which introduce NPs, CPs, APs, and VPs. Moreover, the rst of these rules says that VPs are Vs and nothing more, which raises the obvious question why we posit Vs here at all. We would get the same result by dispensing with the rst of these rules, and replacing V with VP throughout the remainder. We will soon see, however, that in certain situations there is a term that can show up which appears to be dominated by VP but not V. Ill keep these rules in anticipation of that situation. A similar situation arises in Adjective Phrases too. If we examine the environment in (80) we discover that it characterizes the set of strings in (81). (80) V CP (81) {A, Deg A, Deg Deg A, A PP, Deg A PP, Deg A PP PP, . . . } (82) Sean is happy that syntax is cool. Sean was happy on Tuesday that syntax is cool. Sean was very happy on Tuesday in this class that syntax is cool. . . . As before, this family is recursive and headed. And, as before, it is visible to cordination as well. (83) A child happy with her guardian and well-rested is unlikely to cause trouble. A child happy with her guardian and completely well-rested is unlikely to cause trouble. A child thoroughly unhappy in a zoo and angry at her guardian is
37
2. Phrase Structure
likely to cause trouble. . . .
Note that I have left out CP, as in the other rules; and, as with the VP rule, these rules characterize AP as consisting of just an A and nothing else. Both matters well take up shortly. There is a feature of this method of representing these subfamilies that I would like to draw attention to now. It allows for two separate parses of examples such as (85). (85) the considerate gift and donation It is possible to produce this string either by grouping considerate and gift into one N and conjoining that with an N consisting of just donation, or it is possible to conjoin gift and donation into one N and then group that phrase with considerate into an N. It is easy to represent these two parses by way of phrase marker trees, which graphically elucidate the constituent structure of strings. The two ways of producing (85) are represented by the trees in (86) below. (86) a.
DP D D the AP A A N N NP N N and N N donation
considerate gift
38
X phrases
b.
DP D D the AP A A considerate N N N gift NP N and N N donation
We might note that there are two meanings attached to this string as well, having to do with how the meaning of considerate is combined with the meanings of the rest of the parts. A loose paraphrase of these two meanings might be as given in (87). (87) a. the things which are considerate and which are, rst, a gift and, second, a donation b. the things which are, rst, a considerate gift and, second, a donation There is some reason for thinking that these two syntactic representations map onto those two interpretations. For one thing, the number of meanings and the number of parses matches. For instance, if we add one more adjective to the left of the cordinated nouns, as in (88), our rules allow for a total of three parses (shown in (90) on the following page) and there are three meanings as well (as indicated in (89)). (88) (89) the considerate big gift and donation a. the things which are considerate and big and are also a gift and a donation. b. the things which are considerate and are also a big gift and a donation c. the things which are a considerate big gift and a donation
39
2. Phrase Structure
(90)
DP D D the
considerate
N donation gift
considerate A big
40
X phrases
Furthermore, the meanings vary in a predictable way with the linear order that these terms are arranged in. Thus, for instance, putting the second adjective to the left of the cordinated nouns creates the three meanings listed in (89), whereas putting the second adjective to the left of just the rightmost noun, as in (91), produces just two readings: they are (92). (91) the considerate gift and big donation (92) a. the things which are considerate and both a gift and a big donation b. the things which are a considerate gift and a big donation This is predictable in the sense that our characterization of these strings would deliver just the two parses for (91) shown in (93) on the next page. This correspondence should give us some courage that we are on the right track in characterizing the innite strings under discussion in terms of recursive phrases. It provides a set of structures that are in correspondence with what look like a parallel set of meanings. Our next step should be to esh out this correspondence, but we have some work still to do in characterizing these basic facts about grammaticality judgments. So lets return to that task. The strings belonging to Adverb Phrases are so simple that it is difcult to know whether they contain the substructure weve found in the other phrases. Nonetheless, they do have a recursive part and this might be construed, on analogy with these other cases, as evidence for substructure: (94) Sally carefully spoke. Sally very carefully spoke. Sally very, very carefully spoke. . . . The cordination phenomenon also seems to suggest subphrases, at least if our decision about the meaning-form mapping made above is correct. (95) Sally spoke [almost [very rapidly] and [quite softly] ]. So, lets convert the AdvP rule to (96). (96) a. AdvP Adv b. Adv DegP Adv c. Adv Adv
Like the AP and VP rules, this battery of rules equates AdvP with Adv and so makes mysterious why they are called different things. The rule building sentences, IPs, is similarly meager. But it too shows some signs of the subfamilies which we have discovered in NPs, APs and VPs. This is indicated by cordination in examples such as (97).
41
2. Phrase Structure
(93) a.
DP D D the AP A A considerate N N gift NP N N and AP A A N N AP A A considerate N N gift and AP A A big big N N N donation N N N donation
b.
DP D D the
NP
(97) Jerry [can speak loudly] but [cant speak clearly]. And, when we add to our observations that adverbs can fall to the left of In, we discover the recursive ag of these intermediate families:7 (98) Jerry evidently wont speak. Jerry evidently deliberately wont speak.
7 But havent we already characterized strings like the third and the fth in (98) as adverbs introduced by a recursive V rule? Do we really need to also let sentence-nal adverbs be introduced by a recursive I rule? The answer typically given is: yes. But you might want to decide for yourself what the answer to this should be.
42
X phrases
Jerry evidently wont speak deliberately. Jerry evidently occasionally deliberately wont speak. Jerry evidently wont speak occasionally deliberately. (These are all somewhat strained, I grant you, but I think still grammatical.) This calls for a change along the lines in (99). (99) a. IP NP CP I
b. I AdvP I c. I I AdvP d. I I VP Note how in this battery of rules, unlike the others weve formulated, the X rule that terminates the recursion has more than just the head of the phrase in it. In this case it also introduces the VP. This is required because VPs are not recursively introduced, and the method we have adopted of representing recursion in these phrases is built into the structure of the substitution classes. Actually something similar is true for the rules that build APs, NPs and VPs as well. In the case of VPs, the NP and CP parts of their family are not recursively introduced. So we should change the terminal expansion to: (100) V V (NP) (CP) And similarly, the CP parts of the AP and NP families are not recursively introduced, so the terminal expansions of these families should be changed to: (101) A A (CP) N N (CP) So this corrects the omission of CP and NP in our original formulation of these rules, though, as foreshadowed above, this will produce a difculty. To see this difculty, consider how our structural method of stopping the recursion relates the terms that are within some phrase. We expect that those terms which are introduced in the terminal expansion X X . . . (that is, the non-recursively introduced terms) will form the most inclusive substitution class of the phrase involved. There are some kinds of phenomena which suggest that this expectation is fullled. There are processes, for example, in which a rather surprisingly short string can substitute for one or another of the families we have discovered. This happens under conditions of anaphora.8
8 Anaphora refers to processes in which a phrase in one position refers, in some fashion or other, to the same things that another phrase, in a different position, refers to. For instance, in
43
2. Phrase Structure
For example, the term one prosodically looks like a word, but semantically derives its meaning by being anaphoric to an N. (102) a. I will examine the blue book about language if you will examine the brown one. one = book about language I will examine the big blue book about language if you will examine the small one. one = blue book about language I will examine the long book about language if you will examine the one about Quarks. one = long book
b.
c.
The reason we think that one not only semantically is an N, but is also syntactically an N, is because of the contrast in (103), a contrast which also supports our treatment of the non-recursive parts of NP. (103) I will examine the long proof that language exists if you will examine the short one. one = proof that language exists b. * I will examine the long proof that language exists if you will examine the one that it doesnt. one = long proof a.
What this contrast indicates is that one must stand in for the noun and the CP that follows, and cannot stand in for the noun by itself. This is explained if one can stand in for an N, because there is no N under the current rule set that fails to contain these both. It isnt, incidentally, that there is some semantic constraint on one that prevents it from standing in for something that has the meaning of a single noun, because that is possible in cases such as (104). (104) I will examine the book on the shelf if you will examine the one on the table. The difference between (104) and (103b) is just whether the material that combines with one is allowed to be a sister to an N or not: PPs are (look at (102c)), and CPs arent.9
the sentence: Mary knows that she is smart, it is possible for she to refer to the same individual that Mary refers to. In such a case, we say that she is anaphoric to, or with, Mary. 9 We will see, in just a moment, that this does not turn out to be a distinction that hinges on the PP/CP difference, however so be forewarned. In particular, it will emerge that the semantic
44
X phrases
Similarly, the V family can be anaphorically connected to other Vs, but in this case the phonological manifestation of the anaphor is silence, which will be designated with in what follows. (105) a. b. c. Although Sally shouldnt , Jerry must leave town. = leave Although Sally can carelessly , Jerry must carefully read Aspects. = read Aspects Because Jerry frantically read Aspects after dinner, Sally did just before class. = frantically read Aspects
This process of anaphora called VP Ellipsis, though it might be more accurate to call it V Ellipsis reveals that the non-recursive parts of the VP family are trapped within the smallest subfamily. (106) a. * Although Sally shouldnt Chicago, Jerry must leave New York. = leave b. * Although Sally didnt that she was tired, Jerry said that he would sleep. = say
These processes also indicate that there are at least some PPs that must be part of the terminal expansions of N and V. (107) a. ?? I will listen to this long examination of quarks, if you will listen to the one of syntax. one = long examination b. * Although Sally didnt about George, Jerry will carelessly talk about Sal. = carelessly talk
So we should change these rules to: (108) N N (PP) (CP) V V (NP) (PP) (CP) This way of distinguishing the recursive and non-recursive parts also predicts that the non-recursive parts will always come between the head of their phrase and the recursive parts. This seems true sometimes, as in (109) and (110).
function of the PP or CP determines how it behaves with regard to this test, and not the mere fact that the phrase is a PP or CP.
45
2. Phrase Structure
(109) (110) a. Jill ate it at noon. b. * Jill ate noon it. a. Jill ate sptzle at noon. b. * Jill ate at noon sptzle. a. b. Jill ate the rotting kumquats. Jill ate at noon the rotting kumquats.
But for other cases it seems uncertain, or downright wrong, as in (111) and (112). (111) (112)
a. ?? Jill said [that you shouldnt eat kumquats] at noon. b. Jill said at noon [that you shouldnt eat kumquats].
This then, is the difculty in trying to place CPs, and certain NPs, within VP (and other phrases too, as well see). Lets set this problem aside momentarily. It will be the focus of a lot of our work in the chapters that follow. There is a similarity to the organization of the family of substitution classes that make up NP, VP, AP, AdvP, and IP. The other phrases: PP, CP, DegP and DP are too anemic for us to see that structure, so we dont know, empirically, whether or not they have it.10 But, following Chomskys injunction that we contribute to solving the learnability problem, we would do well to accept as the null hypothesis that they are in fact organized along the same guidelines. This is because doing so is a step towards shrinking the space of grammars through which the learning device has to search. Here, then, is an illustration of how explanatory adequacy can help guide the inquiry. It provides a way to choose among a range of descriptively adequate grammars a null hypothesis: the one to start with. If we give all phrases the shape that NPs, VPs, etc. do, we end up with a family of substitution classes like that below. (113) CP C C C IP NP (DetP) (NPs) N IP (NP) (CP) I
I I VP VP V V AdvP V
N AP N
10 We will eventually see that PP and DP do, but it requires more exotic constructions than we are now prepared for.
46
X phrases
NN
PP
AP CP
PP V V AdvP CP
(IP)
(AP) (CP)
These all conform to the following shapes.11 (114) X Skeleton: XP (ZP) X X QP X X X WP X X (YP) (UP) ZP is called the Specier of XP, WP, QP are called Adjunct(s), and YP and UP are called the Complements of X.12 It should be said that these rules leave out considerable detail. In particular, there are a wide range of things that can stand in adjunct position which are not indicated in these rules. For example, V can have an AP adjoined to it, as in (115). (115) Sandy [VP [VP [VP saw a man] today] [AP angry at her]].
11 Except the V rule that introduces verbs. This rule allows for there to be three phrases within the V. In fact, however, it appears that at most two phrases are possible. The full three V+XP+YP+ZP sequence allowed by this rule never actually arises. 12 This terminology comes from Jackendoff (1977), as do most of the basic ideas sketched in this section about how phrases are organized. Jackendoff s work is a detailed working out, and considerable extension, of proposals in Chomsky (1970), where there is also a proposal about the underlying building blocks of morpho-syntactic category. Chomskys work has its seeds in Harris (1946).
47
2. Phrase Structure
And, as noted earlier, an N can have certain kinds of CPs adjoined to them; see (116). (116) the [N [N [N book] [CP that Mary read] ] [CP which no one will admit writing]] If you have a native English speaker handy, it wont take much time to discover many other kinds of combinations that have been left out. I will continue to leave out this detail, invoking it where necessary as we go along.
These are odd because the meanings they deliver are so bizarre. But they are still recognizable as grammatical strings of words. But some combinations which these rules allow seem to go bad in a very different way; consider (118). (118) a. b. c. d. e. Jerry laughed Mary. Sam gave it at Jill. Sally died that you should eat better. Jim claimed to Kris. Jerry slapped.
These dont go together into weird meanings; they dont go together at all. Whats wrong, here, is that weve matched up verbs with the material that follows them incorrectly. As can be seen by comparing (118) with (119), if the verbs are followed by different material, the results are grammatical. (119) a. Jerry laughed.
48
Here, then, is something more particularly about the words themselves that seems to be relevant to the procedure that recognizes grammatical strings. To capture what goes on here, we must do more than know what the category of the words being combined is. Theres another respect in which the particular choices of words seems to play a role in the syntax. Consider the different semantic contributions the NP Tuesday makes in (120). (120) a. b. I danced Tuesday. I remember Tuesday.
In the rst case, Tuesday says when I danced happened. We say in this case that Tuesday is a modier. It modies the sentences meaning by restricting the events denoted by I danced to just those that transpire on Tuesday. But this is not the role it has in the second case. Here Tuesday refers to the thing remembered. We say in this case that it is an argument of the relation that remember denotes. A similar contrast can be seen in the pair in (121). (121) a. b. I kissed her on the bus. I put her on the bus.
Again, on the bus is a modier in the rst case. It locates the event described by I kissed her; it indicates that this event took place on board the bus. In the second case, by contrast, it names a locations related to the referents of I and her by put. It is an argument. The semantic role an argument has in a sentence is determined by the word for which it is an argument. The meaning that modiers contribute to the sentence theyre part of is considerably more constant. Theres a way of talking about argumenthood that is commonplace, and which we inherit from Gruber (1965). Gruber, and Fillmore (1968) in a similar paper, was concerned with the problem of verb meanings, and in particular with nding a theory that restricted the kinds of argument types that verbs permit. He speculated that there was a nite, in fact quite small, set of argument types, or roles, that could be put together by verbal meanings. He argued
49
2. Phrase Structure
that the roles which verbs combined were always ones that have to do with the metaphysics of motion. For example, a verb like send involves three terms, one that can be seen as indicating the Source of the motion, another that denotes the moved term and a third that names the Goal, or endpoint, of that motion. Gruber called the role borne by the term undergoing motion Theme. (122) Sandy sent his book to Sean. Source Theme Goal
On Grubers thesis, then, verbs are just words that name different kinds of motion. The arguments of verbs will therefore always be the objects, locations, and manners of movement that are involved in motion. Arguments will have roles in the motion relation named by a verb, and what those roles are will be determined by the verb. Those roles Gruber called theta roles, after the Theme role. You will often see theta role expressed as -role. We say, then, that verbs assign -roles to their arguments. There are cases where it is not obvious that motion underlies the relation named by a verb. Nothing has to move in the events described by (123), for example. (123) a. Sandy showed his book to Sean. b. Sandy pleases Sean.
But in these cases, one might imagine that there is a kind of abstract motion involved. The image of his book might be transferred from Sandy to Sean in (123a), for example; and pleasure might be thought to have moved from Sandy to Sean in (123b). But there are cases which even metaphorical extensions of the logic of motion look unlikely to characterize. (124) are some. (124) Sandy nds Sean unpleasant. Sandy is unhappy. Londres refers to London in Portuguese. This means that something is wrong. Sean became unhappy. They know something.
This way of constraining the meanings of verbs has been pretty rmly abandoned, I believe, as a consequence. In its place, a method has been pursued that tries to see the meanings verbs have as the result of combining a small number
50
(126)
We say of these cases that verbs select or s-select a question or declarative. Note that some verbs are compatible with either, as is say. Though it is hard to see these differences as tting the functions that roles typically name, I will use the language of -roles to describe this relation too. Now that we have in view this distinction between arguments and nonarguments, lets return to the contrast between (118) and (119), repeated below. (118) a. b. c. d. e. a. Jerry laughed Mary. Sam gave it at Jill. Sally died that you should eat better. Jim claimed to Kris. Jerry slapped. Jerry laughed.
(119)
51
2. Phrase Structure
b. c. d. e. Sam gave it to Kris. Sally died. Jim claimed that you should eat better. Jerry slapped his thigh.
In all these examples, Ive chosen phrases that cannot get an interpretation as a modier. The contrast between (118) and (119), then, hinges on giving the verbs involved the right argument. The post-verbal phrases differ with respect to their category. What we discover from these examples is that verbs are picky with respect to what category their arguments belong to. This can also be seen in examples, such as those in (127) and (128), where the difference in the category of an argument is not tangled up so much with the meaning of that argument. (127) (128) a. Jerry pleases Mary. b. * Jerry pleases to Mary. a. * Jerry talks Mary. b. Jerry talks to Mary.
We say that verbs are subcategorized by the category of their argument. Or this term has been relexicalized that verbs subcategorize their arguments. Sometimes this is also described as a verb c-selecting its argument.13 Jackendoff (1977) argues that arguments (when they follow the head they are an argument of) are necessarily in complement position. This is supported by contrasts like: (129) (130) a. Although Sally didnt Tuesday, she will dance Monday. b. * Although Sally didnt Tuesday, she will remember Monday. a. Although Sally wont on the bus, she will kiss her in the car. b. * Although Sally wont on the bus, she will put her in the car.
Because the phrase following the verb is an argument in (129b) and (130b), it must be within the V which elides, whereas in (129a) and (130c), the phrase following the verb is a modier and can therefore remain outside the ellipsis. Jackendoff s thesis is also supported by similar contrasts involving do so anaphora, which, like V Ellipsis, nds Vs. (131) a. Sam talked to Mary on Tuesday, and Sally did so on Thursday.
52
a. ?* Sam talked to Mary and Sally did so to George b. * Gerry eats chocolate, and Sandy does so marzipan. c. * Mag proved that she loved chocolate, and Holly did so that she loved marzipan.
If we examine the positions that do so may stand in, we will nd that it has the same distribution as Vs: it may appear between a subject NP and a sentence nal PP, as in (131). In this respect, then, do so is like one in that it is a lexical expression of a certain phrase. If this is granted, then the ungrammaticality of the examples in (132) indicates that the material following the verb in these cases must be within the smallest V. This would explain why this material cannot be positioned outside of do so. And what distinguishes the cases in (131) and (132) is that the phrases following do so in (132) are arguments, whereas those in (131) arent. To the extent that we can tell, the same is true with respect to arguments of other classes of words. Its difcult to be certain of the argument-status of terms which combine with nouns, for instance, but Jackendoff s claim seems correct here as well.14 (133) Ill listen to your long, careful discussion of it, if youll listen to my short one. one = careful discussion of it b. * Ill listen to your long, careful discussion of it, if youll listen to my short one of it. one = careful discussion c. Ill listen to your long, careful discussion in class, if youll listen to my short one in the ofce. one = careful discussion a.
The contrast between (133a) and (133b) will follow if of it must be positioned within the smallest N. The contrast between (133b) and (133c) corresponds to the differing argument-status of the PPs involved: of it is more strongly perceived as an argument of discussion than is in class. As with the do so and V
14 There is evidence that nouns assign -roles only when they have a verb-like use; that is, when they are used to describe processes or events, and not when they are used to name things (see Grimshaw 1990). The examples in (133) are constructed with this in mind. We return to this aspect of nouns in a later chapter.
53
2. Phrase Structure
Ellipsis facts, then, this contrast supports the hypothesis that arguments and modiers are t into phrases in different positions.15 Okay, to summarize: were looking for a way to factor into our procedure for recognizing grammatical sentences enough of the meanings of the words involved to guarantee that Verbs and Nouns (and perhaps other words) combine with the arguments they select and subcategorize. Moreover, when these arguments follow them, we must nd a way of guaranteeing that they are in the non-recursive X: the complement position. We can ensure that these arguments are in the non-recursive part of the X if we force them to bear a -role, and allow -roles to be assigned only to complement positions. We need also to describe the fact that when a verb has a -role, there must be an argument present in the syntax which bears that role. It is customary to divide this task into two parts, which can be expressed as follows:16 (134) The Theta Criterion a. For every -role there is a position to which that -role is assigned. b. For every -position, there is something with an appropriate semantic value that occupies that position (i.e., the argument). It is usual to strengthen the Theta Criterion to a bijection, because of cases like (135). (135) Sally showed John doesnt mean Sally showed John himself. Without constraining the Theta Criterion to a bijection, we might expect (135) to get such an interpretation since presumably the NP John could name the object which bears both the Theme and Goal -roles. So we change this to (136).17 (136) The Theta Criterion
15 Baker (1978) is, perhaps, the rst to argue from one anaphora for this conclusion about where arguments are positioned within NPs. 16 The Theta Criterion comes from Chomskys important Lectures on Government and Binding. 17 The Theta Criterion is also often formulated in terms of a bijective relation between -roles, or -positions, and arguments. That is, it is sometimes written to say: For every -role (or positions) there is exactly one argument and for every argument there is exactly one -role (or position). (In Lectures on Government and Binding it is formulated in various ways, including these two.) The difference between this alternative formulation and the one Ive given here is that mine does not force every argument to receive a -role, whereas the alternative does. Ive decided to place this requirement in another principle, which well come to shortly.
54
Here we might imagine either that there actually is an object in these cases that bears the -role, but that argument is unpronounced. Alternatively, we might conjecture that something relaxes the condition which forces every -role to be assigned to a position holding an argument. The common wisdom is that both possibilities exist we will return to this issue in some detail later. For now, let us imagine that there is a lexically determined process which allows -roles for certain predicates to not be assigned. Some have suggested that (134b) and (137) should be collapsed, and in particular, that there is a means by which the categorial type of some argument can be determined from its -role.18 Grimshaw (1979) provides a way of viewing this hypothesis which has gained some popularity. Her idea is that one of the functions that makes up the learning device assigns a categorial status to arguments on the basis of their -role. She calls this function Canonical Structural Realization (CSR). She sketches how this function might work by way of examples that compare CPs with NPs. So lets look as some of the facts she considers. Note rst that CPs may distinguish themselves as according to whether they denote Propositions, Exclamatives or Questions. Lets use the language of -roles to describe these types of clauses. These -roles can sometimes be borne by NPs too: (139) a. John asked me what the time is the time (Question)
18 The reason c-selection is usually thought to be derivable from s-selection, rather than the other way round is tied to Chomskys epistemological priority argument, see Pesetsky (1982).
55
2. Phrase Structure
b. c. Ill assume that hes intelligent his intelligence how hot it is Bill couldnt believe the heat (Proposition) (Question)
In these cases, then, the verbs s-select either Q(uestion), P(roposition) or E(xclamative) and c-select either an NP or CP. There are other verbs, however, which s-select these very same -roles, but c-select only CPs. (140) a. b. c. John wondered Ill pretend what the time was *the time (Question) (Proposition) (Exclamative)
that hes intelligent *his intelligence how hot it was *the heat
Bill complained
Here then, we have a special instance of the difference in s-selection and cselection that needs to be overcome if one is to be derived from the other. Grimshaws suggestion is that the CSR of Questions, Propositions and Exclamatives is CP and that those verbs which allow these -roles to be borne by NPs are learned on a case-by-case basis. Thus, this is a partial collapse of cselection to s-selection. And it predicts that every verb that s-selects a Q, P or E will c-select a CP; that is, there should be no verbs that express these -roles with an NP only. This seems to be correct. Whether or not this project can be maintained for the situation involving the relation between CPs and NP and the -roles they bear, I dont think a parallel story holds for the complements of other categorial type. Moreover, the scheme Grimshaw proposes wont help determine which verbs select non-nite as opposed to nite clauses, which also seems to be a rather language particular fact. The problem we face here is the same problem we face when we try to understand if morphosyntactic word-class can be expressed in solely semantic terms. It remains unknown what the relationship between word-class and semantic type is, and consequently, we are not in a position to yet understand what the connection, if any, is between c-selection and s-selection. From now on let us assume that c-selection is at least in part independent of s-selection, and determined on a verb-by-verb basis. Interestingly, however, it looks like the thesis that c-selection can be derived from s-selection fares better when external arguments are concerned. To begin
56
(141)
By contrast, when the subject -role is incompatible with the meanings that CPs yield they are banned from Specier position:
That John left To have to leave Leaving
bothers Mary. To have to leave makes Mary happy. Leaving The fact
(142)
John
Subject arguments do not seem to be c-selected by the verbs involved. The role they bear is sufcient to determine what category they can be. Only complements are c-selected. This will require weakening (137) to something like (143). (143) a. A word c-selects its complements. b. If a word c-selects Y, then it -marks Y. a. If a verb has a -role, then there is exactly one syntactic position to which that -role is assigned. b. A -marked position must be occupied by something with the appropriate semantic value. c. A verb c-selects its complements. d. If c-selects then s-selection (aka -role assignment).
The statements in (144a) and (144b) are the Theta Criterion, whereas those in (144c) and (144d) concern the relation between c-selection and s-selection which weve just reviewed. The Theta Criterion insists that for every -role that some term has, there will be a unique position occupied by an argument
19 With the exception of cases like Under the bed is a slipper, plausibly instances of impersonal constructions with inversion; see Stowell (1981) and Rochemont and Culicover (1990).
57
2. Phrase Structure
in the sentence holding that verb. (144c) and (144d) determine whether that argument will be c-selected or not. To force arguments to be within the smallest X, it will now be sufcient to force the -position for that argument to be within the smallest X. We want this effect for complement arguments only we dont want to force subject arguments into X so one way of doing this would be to restrict those positions that are c-selected to just those within the smallest X. This would mean that wed have two principles: one that determines the c-selected position for verbs, and another, yet to be determined, which locates the s-selected position for subjects. Were going to see, however, that the procedure for locating the -positions for both subject and object arguments is the same, or very nearly so, and so we wont take precisely this course. Instead, we will follow a popular view of these principles that is rst found in Chomskys Lectures on Government and Binding. He formulates there what he calls The Projection Principle, which is responsible for mapping the argument structure of a verb or head more generally into a syntactic representation. I will formulate his principle as (145).20 (145) The Projection Principle i. For , a position, if is a sister to X0 ,21 then X0 c-selects s contents. ii. If s-selects , then and are sisters. The second part of the Projection Principle does what we are in search of. It forces arguments of a verb to be in the lowest X, for only in that position will it be a sister to the verb.22 Note that this principle is not restricted to verbs and their projections, it spreads what weve discovered about VPs to all other categories. This, so far as I know, is correct. As presently formulated, the second part of the Projection Principle wrongly forces subjects into the smallest V of the verb that assigns it a -role. We will see, however, that this problem is only apparent. Once we discover what is truly responsible for assigning the subject its -role, this problem is resolved (or, depending on how certain particulars play out, mutated into a different problem).
20 Chomskys own formulation builds in various other properties that we will encounter later on; see in particular the discussion in Chomsky (1981, pp. 3448). 21 X0 is the position in a phrase marker into which a word is inserted. It is a way of explicitly referencing the (smallest) head of a phrase. 22 X and Y are sisters if every phrase including one includes the other.
58
59
2. Phrase Structure
(Note that the range of categories possible in these positions is relatively free, suggesting that there are few, if any, constraints on category type.) Since the inventory of categories varies from language to language we might, just to be safe, factor this information out of the phrase structure rules into a language particular set of statements of the form in (146). (146) a. If modies N, then must be . . . b. If modies V, then must be . . . . . .
The . . . will carry lists of category types. If this project is successful, then the Phrase Structure rules of English collapse in full to the X Skeleton. Some have argued for a picture of language variation that makes the hierarchical arrangements of constituents that the X Skeleton, together with the Projection Principle and Theta Criterion and whatever yields (146), completely immutable.24 All that varies across languages is the linear order in which the terms that follow the arrows in the X Skeleton may have. So, the phrase structure component of the grammar might have nothing more than (147) in it, where , should be understood as representing both the string + and the string +. (147) a. XP {(P), X} b. X {X, P} c. X {X0 , (P), (P)}
What the morpho-syntactic category of , and are is fully determined by the c-selection properties of X0 and the language particular principles governing modier types (i.e., (146)). Note, incidentally, that Ive somewhat arbitrarily set the highest number of things that can share the head with the smallest X to two. That is, Ive decided that there can be at most two complements. That seems to be supported for verbs and adjectives, when can have at the same time a PP and CP complement, for instance, but no combination of complements that goes beyond two. In later chapters we will revisit this issue. The linear arrangements of these constituents must then be determined by the language particular part of the grammar. There is evidence from language typology that whatever it is that determines the order of Specier and X is
24 See Travis (1984), for example.
60
25 See Greenberg (1963) and Dryer (1992). 26 is an X Projection of iff dominates and they have the same head. 27 See also Saito (1985) and Saito and Fukui (1998).
61
2. Phrase Structure
(149)
XP VP ?? WP MP V V V IP I VP WP MP V V I XP IP I I
(150)
The categorial values for MP will be determined by the c-selection specication of the verb involved. The categorial values for XP will be determined by the -role it receives. And the categorial values for WP will be whatever (147) for the language in question allows to modify Vs. We havent yet discovered what sits in the Specier of VP, so this spot is marked with ??. The linearization parameters in (148) produce these phrase markers in the following way. Setting Specier to rst in (149) linearizes XP and ?? so that they precede I and V respectively. Setting Projection-of- to last makes every other I and V, as well as I and V, follow the phrase they are sisters to. As a consequence WP, MP and VP precede the phrase they are complements to or modiers of. In (150), by contrast, Specier is set to last, which linearizes XP and ?? so that they follow I and V respectively. As with (149), Projection-of- is set to last in (150) and the consequence for the position of WP, MP and VP is the same. Restricting the linearization options to just those in (148) blocks certain phrase markers. It blocks languages, for instance, in which the complement
62
This phrase marker linearizes V (a projection of V) last relative to WP, but linearizes V (also a projection of V) rst relative to its complement. Clearly there are languages of this unexpected type; English seems to look precisely like (151). This proposal, then, seems clearly too restrictive. Nonetheless, it will be our starting point. In the chapters that follow we will explore ways of loosening this model so that it is enabled to account for the range of language types we do see without losing the trends in linear organization that Greenberg and Dryer have discovered. What we have now is not yet complete enough to really engage this problem. In fact, the linearization scheme in (148) is itself not yet complete enough to generate the strings we want to associate with the phrase markers it allows, for example those in (149) and (150). All (148) does is linearize the phrases within a sentence. It does not determine how the strings of words within those phrases are linearized relative to the other phrases. To see this, consider a phrase marker like that in (152) on the following page, in which lower-cased letters should be understood as representing words. This phrase marker arises
63
2. Phrase Structure
(152)
YP Y y M
m
XP X MP M WP W W
w
X x
OP O O
o
by imposing the X Skeleton and setting Specier to rst and Projection-of- also to rst. What we would like is for this to be sufcient to generate the string ymw xo . Instead, however, all that these settings give us is the information in (153).
64
Whats required is something to determine how the information in (153) determines the linear order of y relative to the words within MP and X, the linear order of m relative to the words in WP, and the linear order of x relative to the words within OP. Lets turn to that now.28 Recall that in dening morpho-syntactic category, we entertained the hypothesis that looking at only adjacent terms would be sufcient for dening the relevant substitution classes. As it happens, in dening phrases we have also obeyed this constraint. As a result, phrases are always strings of adjacent terms. Lets elevate this too to an hypothesis: (154) Contiguity Let = { 1 , 2 , . . . , n } be terminals dominated by . The string formed from cannot contain if is not dominated by . If Contiguity holds, it is possible to determine from (153) what the consequent linearization for all the words in (152) is. If the words in MP must be adjacent to each other, then (153a) is enough to know that y precedes all those words (i.e. m and w ). Similarly, if the all the words in YP must form a contiguous string, and all the words in X must too, then from (153c) it is possible to deduce that every word in YP (= y , m and w ) must precede every word in X (= x and o ). All that is required is an explicit statement that the words within a phrase, , are linearized with respect to the words in phrase in the same way that is linearized to . This can be done with (155). (155) < =def. precedes . { , } =def. or .29
a. For all words, x and y , within a phrase marker, either x < y or y < x .
28 The relationship between phrase markers and the strings they map onto is investigated in a vast literature, where a wide array of different formalizations can be found. Two widely read classics are Chomsky (1985) and Lasnik and Kupin (1977). For a simple introduction to the formalisms of graph theory (from which phrase markers are drawn) and their projection onto strings, see Partee, Meulen, and Wall (1990). 29 and are said to be sisters in this case, and is called and s mother.
65
2. Phrase Structure
b. Let X and Y be points on a phrase marker. If X < Y, then x < y for all x dominated by X, and all y dominated by Y. (155a) merely makes explicit that all the words in a phrase marker must have a linear relation to every other word in a phrase marker. (155b) determines how these linear relations are derived from the language particular orderings imposed upon phrases. It also derives Contiguity. On this view, then, the phrase structures of languages are the result of four xed universals the X Skeleton, the Theta Criterion, the Projection Principle, and the linearization principles in (155) plus the language particular pieces of information in (156). (156) a. Specifying the categories of modiers (i.e., (146)). b. Setting the headedness parameter (i.e., (148)). c. A vocabulary of lexical items s-select and c-select arguments. Theres a sense, then, in which languages do not actually have Phrase Structure rules. They are merely the epiphenomena that emerge when the various factors of Universal Grammar and language particular information are combined. This theory, if correct, meets the criterion of explanatory adequacy. It provides both inviolable constraints (i.e., X Theory, the Theta Criterion, the Projection Principle and (155)) and an evaluation metric (i.e., (156) and the language particular vagaries of vocabulary). Notice how the evaluation metric this proposal embraces is quite different from the simplicity metric suggested in Chomskys early work. The evaluation metric here involves learning the word-by-word selection requirements and xing parameter values in the headedness linearization procedure in (156). This proposal has the following form: inviolable constraints come in the form of immutable principles, while the evaluation metric (once lexical idiosyncrasies are removed) consists of principles with a menu of parameters that are set on a language particular basis. Theories that have this general form are said to belong to the Principles and Parameters framework. This conception of what explanatory grammars might look like was suggested by Noam Chomsky and his collaborators in the late 1970s, and much of the work of the 80s and early 90s has this form. In 1981, Chomsky published an ambitious book in which he organized much of the work of that time into a principles and parameters form. This book, Lectures on Government and Binding, serves as a rough starting point for much of my exposition in these lectures.
66
67
3
Positioning Arguments
Our grammar of English presently consists of a series of principles that determine the shapes that phrase markers may have. Those principles make reference to the morpho-syntactic category that words belong to, and also to the semantic relation that arguments and modiers have to the things that they combine with. These principles are collected in (1). (1) a. X Skeleton: XP {(P), X} X {X, P} X {X0 , (P), (P)} b. The Theta Criterion i. For every -role there is exactly one position to which that role is assigned. ii. For every -position, there is exactly one thing with an appropriate semantic value that occupies that position (i.e., the argument). c. The Projection Principle i. For , a position, if is a sister to X0 , then X0 c-selects s contents. ii. If s-selects , then and are sisters. d. If a word c-selects Y, then it -marks Y.
3. Positioning Arguments
e. Modication Parameters i. If modies N, then must be AP, PP or CP ii. If modies V, then must be AdvP, PP, CP or NP . . . The X Skeleton forces a certain shape on all arrangements of words, grouping them into phrases. The Projection Principle in conjunction with (1d) ensures that -positions are sisters to the terms that assign the -roles, and that when the term assigning a -role is a head, that the argument it assigns a -role to is also c-selected. This leaves the problem of correctly determining the subjects -position a problem whose solution we are working towards. The Theta Criterion ensures that for every -role associated with some predicate, there will be exactly one -position in the syntax, and that this position will be occupied by an argument. This, together with the Projection Principle will correctly place objects deeper than non-arguments within the phrase that contains them. Finally (1e) lists the categorial status that modiers may have, depending on the term that is being modied. In addition to these principles, we discussed a method for mapping the groupings of words these principles control into strings of words. In particular, I suggested that there were two linearization parameters that control how the phrases within some other phrase are given a linear order. One of those parameters determines whether Speciers of phrases come before everything else in the phrase they are Speciers for, or come after everything else. A second parameter determines whether the phrase that contains the head precedes or follows the phrase it combines with. These parameters are repeated in (2). (2) If is an immediate daughter of , an X projection of , then choose a value in < > for: a. = Specier of : <rst in , last in > b. = the term is an X Projection of: <rst in , last in >, modulo (2a)
These parameters express an hypothesis about how languages can vary. They say, for instance, that the Specier of IP will always come on the same side of the rest of the material in IP that the Speciers of every other phrase in that language do. And they predict that languages will put the heads of phrases always to the left or the right of their complements, no matter what category that phrase is. The objects of verbs, in other words, should fall on the same side of the verb that the objects of adjectives, or prepositions, do. I remarked
70
that these cross-linguistic predictions are somewhat spottily supported. Well return to this matter in a later chapter and take a closer look at the facts. The linearization parameters are coupled with a constraint on linearization that enforces Contiguity. That constraint requires that all the words within a phrase, , follow or precede all the words in a phrase in accordance with how the linearization parameters arrange and . If precedes , then every word in will precede every word in , and if follows , then every word in will follow every word in . These principles are all declarative statements. They express constraints on the syntactic organization that sentences may have. Whats missing from this picture is the engine that produces that organization. Nothing in this grammar tells us what gets those words grouped into phrases; we just have a series of hypothesized generalizations about what the resulting groups of words must look like. Ill briey sketch now a method of generating these structures. I do this because itll be handy to have something concrete in front of us when we think about how the representations for sentences are created. I wont give any reason for thinking this particular procedure is correct, and so I dont suggest you believe it. Well have a chance to examine some of its claims later on. Until then, let us use this as a crutch as we limp from language data to speculations about the constraints those data are evidence for. The procedure Ill sketch is very loosely built upon Chomskys Aspects of the Theory of Syntax. The rst thing well need is a device that assembles the phrases that the X Skeleton constrains. For that purpose, I will dene three functions, all named group, that differ with respect to the number of arguments they take. These functions simply arrange the N0 , N, VP and other such symbols into phrase markers of the sort that the phrase structure rules from the previous chapter built. These functions are dened in (3) on the next page. The second thing well need is a way of matching the words in a sentence to the positions produced by group in the phrase marker it constructs. This procedure is called Lexical Insertion, and its spelled out in (4). Together, group and Lexical Insertion will produce a wide array of phrase markers. The principles in (1) can then be thought of as lters on that set of phrase markers. They will permit only those that arrange the symbols N0 , VP and so on in the ways that the X Skeleton permits, and they will arrange arguments and modiers in ways that conform to the Projection Principle and Theta Criterion. The phrase markers that group and Lexical Insertion form are sometimes called Deep Structures, or D-structures. We can
71
3. Positioning Arguments
(3) Let be any of the symbols X0 , X, XP, where X ranges over the morphosyntactic categories in the language, and let , and be or group() or group()() or group()()(). a. group() =def. { } b. group()() =def. { , } c. group()()() =def. { , , } (4) Lexical Insertion Insert into every X0 in a phrase marker a morpheme of category X.
think of the principles in (1) as well-formedness conditions on D-structures. There are a variety of problems for the model that we took note of as we developed it. Lets list the more salient of these here so that we do not forget them. (5) a. For many phrases, the distinction between XP and X has not been motivated. We have not seen, for instance, what phrase lies within the Speciers of VP, AP, PP, AdvP, or CP. For all of these phrases, we have only seen evidence for Xs, a kind of X that is recursive and another that introduces complements. b. The Projection Principle describes the conditions under which -roles are assigned, but it only correctly delivers -roles to objects. Under what conditions is the -role delivered to a subject argument? c. The CP complements to verbs present a paradox. They appear to be in complement position with respect to tests like do so anaphora and V Ellipsis, but they are positioned linearly as if they are not in complement position. They seem to appear following modiers of the verb. d. Our present system does not place any constraints on the linear order that two complements take with respect to each other. We should expect to nd, for instance, that the objects of verbs could come in any linear order. Instead, what we seem to nd is that a NP object to a verb precedes a PP object, and that a CP object of a verb follows all other objects of the verb (this is part of the problem in (5c)). e. The head of an IP can be an inectional morpheme, and when it is, our rules will position that inectional morpheme before, and
72
Obligatory Speciers
separated from, all verbs in the sentence. This is wrong. The inectional morpheme appears instead on an immediately following verb. We will tackle the rst four of these problems in this chapter. The problem in (5e) will have to wait until the following chapter. Before doing that, however, there is another difculty that crept into our model when we transitioned from phrase structure rules to the system of constraints on D-structures that we presently have. Our rst task will be to look at that difculty.
This might be due to a semantic effect, though there is considerable language variation here whose source is not known. Another possibility is that there is a determiner present even in (6a), but that it is silent. This would allow for the possibility that Specier of NP is obligatorily lled, accounting, then, for the ungrammaticality of (6c). Lets leave the status of the specier of NP open, for the moment. Well have a chance to revisit this issue when we examine more closely the structure of noun phrases. Phrases in Specier of IP, however, buck the trend and are always obligatory. This was one of the facts encoded in our original phrase structure rules that has been lost in the transition to the X Skeleton plus constraints system that we are now employing. In cases where the IP contains a subject argument,
73
3. Positioning Arguments
the obligatoriness of this subject is plausibly derived in the same way that the obligatoriness of complements is: by the Theta Criterion and the Projection Principle. Somehow or other, the Projection Principle is going to have to be xed so that it guarantees that there is a -position for the subject argument, and the Theta Criterion will force an argument into this position. If the position happens to be specier of IP, then this will guarantee the presence of something in specier of IPs whose verbs assign a subject -role. But, interestingly, even in IPs whose verbs do not have a subject -role, the presence of something in Specier of IP is obligatory. The verb seem, for instance, has only one -role, and that is assigned to its clausal complement. And yet, as (7) shows, an IP containing this verb must surface with something in its specier position. (7) a. It seems that we are behind. b. * Seems that we are behind.
The it in (7a) appears to have no meaning whatsoever, and is merely present in order to occupy specier of IP. It is called an expletive or pleonastic term, to indicate its lack of semantic content. To recapture this bit of information, Chomsky proposes in Lectures on Government and Binding adding another statement to the Projection Principle which simply requires that specier of IP be lled. This is known as the extension to the Projection Principle, or much more commonly, the EPP. (8) Extension of the Projection Principle (EPP) The Specier of IP must have a phrase in it.
In the normal case, when the verb of a sentence has a subject -role, the Extended Projection Principle will be satised by the presence of an argument phrase, whose presence will also satisfy the Theta Criterion. But in the relatively rare case when the verb does not have a subject -role, it will still demand the presence of something, and the expletive is invoked as a consequence. Notice that this system restricts the use of the expletive to just those circumstances where there is no subject -role. When the verb of some sentence has a subject -role, the Theta Criterion will require that the Specier of IP be occupied by an argument and expletives, by virtue of their semantic vacuity, cannot function as arguments. This, then, is why (9) does not allow the it in specier of IP to be interpreted as an expletive. (9) It discovered the problem
74
Obligatory Speciers
In fact, the distribution of expletives is extremely limited. They are found only in Specier positions to which no -role is associated. As weve seen, theyre not found in Specier positions associated with -roles, but they are also not found in complement or adjunct positions. Thus an example like (10) is ungrammatical on either of the parses indicated in (11). (10) *She slept it. (11) a.
NP she I ed V sleep IP I VP V NP it
b.
NP
IP I VP V V V sleep NP it
she I ed
The parse in (11a) is already blocked by the Projection Principle, the rst clause of which requires that things in complement position be c-selected and -marked by the neighboring head. This isnt the case in (11a). I dont know of anything in the literature that is explicitly designed to exclude (11b), so I suggest that something along the lines of (12) is responsible. (12) {X X, } iff modies X. This bijection limits modiers to sisters of Xs. It is redundant with the Projection Principle, which blocks modiers from complement position. But it also blocks modiers from being in Specier positions, something that would otherwise be available. It also forces sisters to Xs to be modiers, and this is the use we have of it here. Because expletives, by virtue of being semantically vacuous, cannot modify, they will be banned from this position. There are other ways of blocking (11b), of course, and there is no particular reason to believe that this is the correct method. But lets adopt this principle until something better comes along.
75
3. Positioning Arguments
There is one last fact in this domain that requires addressing. This is that when there is no subject -role, only an expletive can satisfy the Extended Projection Principle. Placing an argument in the Specier of IP in such a case is ungrammatical, as a comparison between (7) and (13) indicates. (7) (13) It seems that we are behind. * Jerry seems that we are behind.
Many formulations of the Theta Criterion are designed to capture this fact, requiring that there be a -marked position for each argument. We have a different option. Because modication is restricted to just adjunct positions, semantically contentful phrases in specier positions are not going to be able to modify. If the only other way a meaningful phrase can be put together with the rest of a sentence is by way of a -role, then the ungrammaticality of (13) will follow. Under either way of thinking, the point is the same. If NPs in Specier of IP can combine semantically with the rest of the material in the sentence only by way of a -role, then that NP will have to have no semantic content (be an expletive) or get a -role. Lets return now to our original set of problems: (a) how do subjects get their -roles, (b) when there are two or more complements, what determines their order and (c) what are we to do about the paradoxical position of CP objects?
3.2 Movement
At present our system imposes a linear order on complements and non-complements. Because complements are trapped inside the lowest X and non-complements are forced out of that X, complements will precede non-complements when they both follow the head. More generally, (14) holds. (14) The Projection Principle entails: If an argument, X, and a non-argument, Y, both fall linearly on the same side of the head, then X will come closer to the head than Y. Its this generalization that the problem with respect to CP objects is tangled up in. We nd that object CPs must be part of the smallest V they are an object of this is what is responsible for the inability of object CPs to be outside the V that do so occupies in examples like (15) and yet they tend to follow modiers that are able to stand outside this V, as (16) indicates.
76
Movement
(15) (16) * Max said that pigs y and Sally did so that snakes swim. a. Max said yesterday that pigs y. b. * Max [said that pigs y] yesterday.
The evidence concerning the hierarchical position of CP objects is at odds with what (14) says about their linear order. In fact, as briey noted in the previous chapter, this problem arises with other kinds of objects as well. Object NPs also seem to be forced to reside in the lowest V, as indicated by their inability to be stranded by do so anaphora as in (17). (17) * Max ate the rotting kumquats, but Sally hasnt done so the delicious strawberries.
And, as expected, it does seem that (14) holds for some kinds of NP objects, as in (18) and (19). (18) (19) a. Jill ate it at noon. b. * Jill ate at noon it. a. Jill ate squash at noon. b. * Jill ate at noon squash. a. b. Jill ate the rotting kumquats at noon. Jill ate at noon the rotting kumquats.
But in other situations, its less clear, or downright wrong, as in (20) and (21). (20) (21)
a. ?? Jill said [that you shouldnt eat kumquats] at noon. b. Jill said at noon [that you shouldnt eat kumquats].
And a similar situation arises in cases where the object is a PP as well. The ungrammaticality of (22) indicates that object PPs are trapped within the lowest V they are the object for, and yet as (23) indicates, they can follow modiers of this V. (22) (23) * Max put it on the table and Sally has done so on the chair. a. b. Max put it on the table carefully. Max put it carefully on the table.
There are other situations in which violations of (14) arise. One that has engendered a lot attention are constituent questions of the type that (24) illustrates.
77
3. Positioning Arguments
(24) Which linguist has given up?
One feature of such questions is that they have an NP, or other phrase, that contains a special interrogative word. In (24), that interrogative word is the determiner which. Another feature of such questions is that a phrase containing that interrogative word known as the interrogative phrase or wh-phrase appears at the very beginning of the sentence that constitutes the question. In (24), this second requirement is trivially satised as the wh-phrase is appears at the beginning of the sentence by virtue of its status as subject. In other cases, however, this requirement results in a violation of the expected position of the wh-phrase. In (25), for instance, the wh-phrases are objects and yet appear at the beginning of the sentence. (25) a. Which linguist has Max visited? b. On which table has Max put it?
Although the linear position of these objects is not where we would expect them to be, they still behave as if they are within the smallest V; the do so anaphora test behaves in just the same way it always does for these phrases. (26) a. * I know that Max visited some philosopher, but I dont know which linguist he has done so. b. * I know that Max has put it on some table, but I dont know which table Sally has done so.
Here too, then, we have a violation of (14). There have been a number of approaches to these problems. I will not be able to consider them all in these lectures. The rst we will consider is the rst that was proposed. Zellig Harris, and then Noam Chomsky in many articles in the late 1950s and early 1960s, argued that the solution to these violations of (14) involves giving sentences more than one phrase-marker. In addition to the D-structures we have arrived at, there are (sometimes) other phrase markers that are formed from D-structures by way of Transformations. In the cases at hand, the Transformations are movement rules, that preserve everything in the D-structure of the sentence affected except for the position of one phrase. The phrase-markers that are formed by these Transformations are the ones that get pronounced, and so the position that they give to the phrases they move are the ones that determine their linear position. This phrase marker is called a Surface Structure, or S-structure. In the case of constituent questions, the movement rule involved is known as Wh Movement, and a preliminary formulation of it is in (27).
78
Movement
(27) Wh Movement Move a wh-phrase into Specier of CP.
We will see evidence in the following chapter that the position that the whphrase is moved to is the Specier of CP. At this stage we can note that this decision correctly places the wh-phrase at the very beginning of the question. We might also note that this provides an answer to one of the problems we left open at the end of the previous chapter: what is the material that occupies Specier of CP. It appears that the material that ts into the Specier of CP is always material that has been moved from elsewhere. We will see other cases of movement to Specier of CP, by Wh Movement seems to be one that is found in many languages beyond English. This rule would give the D-structure/Sstructure mapping in (29) on the next page for a sentence like (28).
(28)
For the cases that pose a problem for (14), Ross (1967) made an inuential proposal. He proposed a rule sometimes called Heavy NP Shift, which forms from sentences where the argument precedes the non-argument, sentences where that argument has been shifted to the right. His rule can be formulated as (30).
79
3. Positioning Arguments
(29) a.
NP she I s V know IP I VP V CP C C NP she I s V know NP I NP VP Max V ed CP V C visit which linguist C NP Max I ed IP I VP V V visit which linguist NP V I VP I IP
b.
IP
(30) Heavy NP Shift Right adjoin a phrase to the rst VP containing that phrase.
80
Movement
By Adjunction is meant a process that takes two terms and makes them both daughters of a newly created phrase which is identical to and replaces one of the two terms brought together. By convention, the term replaces is the object of the expression x is adjoined to y. In diagram form, adjoins to is (31). (31)
To right adjoin to is to cause the linearization algorithm to force to the right of . To left adjoin to is to cause the linearization algorithm to place to the left of . When the direction of adjunction is left unspecied, it is understood to be leftward, as in (31). Heavy NP Shift would form an S-structure from the problematic (20b) from a D-structure that, if pronounced, would correspond to (20a). This is sketched in (32). (32)
NP Jill I ed V V eat NP IP I VP V PP at noon V V eat NP Jill I ed VP V PP at noon IP I VP NP the rotting kumquats
This sentence, then, has two parses. One that satises the Projection Principle and the Theta Criterion, and the other which doesnt, but is mapped from the rst by Heavy NP Shift. This sentence is a series of phrase markers, then, made up of the two parses shown above. We can see the members of this series as being generated in the following fashion: the rst member is produced by group
81
3. Positioning Arguments
and satises the Projection Principle, the Theta Criterion, the conditions on modier type and the X Skeleton. This is the D-structure. The other parses in the series are produced by the action of transformational rules acting on elements already found in the series. In (32), there is only one other such phrase marker, but we will see cases soon in which transformational rules apply more than once forming series of greater length. We will eventually have to control the method by which transformations produce additional elements, but in the simple cases we will be concerned with right now, there are only two members of the series: the D-structure and another formed by a transformation acting on that D-structure. We speak the nal phrase marker in the series. In the case of NP Shift, this is the parse produced by the transformation. The parse that is spoken is called the S-structure, and the series of parses are called Derivations. Heavy NP Shift will have to include conditions that generate the paradigm of contrasts that we encountered earlier. We will want to ensure that it obligatorily applies in cases where the object is a CP, as in (33). (33)
NP Max I ed VP V V V say NP yesterday IP I VP CP that pigs y
(I will frequently collapse two adjacent phrase markers in a derivation into one parse, using the device of a movement arrow, as in this example.) By contrast, we must nd a way of blocking Heavy NP Shift from applying when the object is an NP and short: a pronoun or bare plural, for instance. For other objects, Heavy NP Shift seems to be an option, with there being preferences determined
82
Movement
by the length of the object relative to the material around it. I dont know the source for these preferences. I speculate that the conditions governing these preferences come from something other than syntax proper.1 The conditions on Heavy NP Shift, then, is as indicated in (34). (34) a. Obligatory Heavy NP Shift: i. (nite) CPs: * Max [said that pigs y] yesterday. Max said yesterday that pigs y. b. Optional Heavy NP Shift: i. full NPs: Max visited the students yesterday. ? Max visited yesterday the students. ii. PPs: Max talked to the students yesterday. Max talked yesterday to the students c. Blocked Heavy NP Shift: i. Pronouns: Max visited him yesterday. * Max visited yesterday him. ii. short NPs: Max visited children yesterday. ?* Max visited yesterday children.
This provides a characterization, then, of the cases in which a phrase seems to be hierarchically in a position different from where its linear position would lead us to expect. This movement transformation hypothesis claims that there is no real deviation from the expected relationship between hierarchical position and linear position in these cases. What makes them exceptional is the existence of a phrase marker at odds with the conditions we have discovered on D-structures. This other phrase marker gives certain phrases a non-D-structure hierarchical position, and its this new position that reects the linear position of the affected phrase.
1 Some plausible candidates are conditions on processing load the longer an object gets the more a conguration triggering the effects of minimal attachment discussed in the rst chapter arise conditions on the prosodic interpretation of syntactic structures, and conditions on how the distinction between old information and new information is signaled.
83
3. Positioning Arguments
This is a deeply mysterious affair. Why should movement transformations exist? We should hope to nd reasons for the existence of these deviations from the normal relationship between hierarchical position and linear order. If the movement transformation approach to these cases is correct, then the mystery becomes the question why certain phrases should be given two positions within a phrase marker. A goal of much current research in syntax is to nd an answer to this mystery. We will have a chance to look at some recent attempts in what follows.
84
Ive restricted the Case Filter to argument NPs because, as well see, adjunct NPs do not seem to be positioned in a way that suggests they are sensitive to Case assignment. Moreover, typically the Case morphology they bear is xed, and not sensitive to Case assigners. The Case lter, then, requires that some parse in a sentences derivation puts every argument NP that sentence contains in a Case marked position. As I noted before, Case marked positions are ones that are close to terms that are responsible for assigning (i.e, determining) the Case. So, all we have to do now is know what those terms are and what close means, and well be able to use the Case lter to distinguish sentences in terms of grammaticality. The object Cases so-called Accusative and Dative and Locative, and a host of others are assigned by particular lexical items. In English there are just three cases: accusative, nominative and genitive. The form of pronouns in English depends on this Case information, as well as the number, person, and, in some cases, gender of the referent of the pronoun.
85
3. Positioning Arguments
1st , sing Nominative I we you you she he it they Accusative me us you you her him it them Genitive my our your your her his its their
1st , plur 2nd , sing 2nd , plur 3rd , sing, fem 3rd , sing, masc 3rd , sing, neut 3rd , plur
Our immediate task, then, is to determine what terms are responsible for assigning these Cases. The genitive appears on NPs that are in the Specier of NP in English, as in (38). (38) a. your hat b. his marbles c. our problems compare: *your left. compare: *I talked to his. compare: *She met our.
We put off for a couple chapters investigating what the item is that assigns Genitive; there are facts about the shape of NPs that we have yet to discover that are relevant. Nominative Case is found on the subjects of nite IPs, as in (39). (39) a. She left. b. We arrived before he left. c. They thought I would stay. compare: *I left she. compare: *I talked to he. compare: *they hat
Finite IPs are distinguished from other phrases by the presence of a nite I0 : tense/agree inection or a modal. Moreover, the nominative Case-marked NP appears in Specier of IP, at least in the examples we have examined so far. (We will encounter soon cases that dont meet this description.) An initial stab at the conditions on nominative Case assignment, then, is (40). (40) Finite I0 assigns Nominative Case to its Specier position. Accusative Case is found on NPs that immediately follow some verbs and prepositions, as in (41). (41) a. We met him. b. They talked to me c. She stood near us. compare: *Him left. compare: *They are happy me. compare: *us hats.
86
Stowell (1981) proposes to derive the ordering of complements by way of the Case Filter and an adjacency condition on accusative Case assignment. Because NPs are subject to the Case lter, when they get a -role from a verb, they will necessarily be positioned adjacent to this verb. By contrast, PPs are not subject to the Case lter and they are free to appear anywhere in the V that contains their -role assigner. This will ensure that when an NP and PP share a V, the NP will come closer to the verb than does the PP. In a language like English, in which the linearization parameters cause the verb to be rst in its V, this will mean that an object NP will precede an object PP. With regard to the relative order of PP and nite CP, Stowell uses Rosss Heavy NP Shift operation, but he suggests that Case plays a role in making Heavy NP Shift obligatory. He speculates that nite CPs, like NPs, must receive Case but that unlike NPs they cannot sit in Case-marked positions at Sstructure. He calls this the Case Resistance Principle. (43) Case Resistance Principle A (nite) CP may not be in a Case marked position at S-structure. The Case Resistance Principle requires nite CPs to be moved by S-structure to some non-Case marked position. NP Shift is capable of doing this, and thus, in cases where a complement CP shows up string nally in a VP, it has satised Stowells injunction against surfacing in Case marked positions by undergoing NP Shift. There is, surprisingly, a certain amount of evidence for this picture. Consider, for example, situations where the nite clause is the subject of the clause. Here too, following an argument from Koster (1978), we see that there is some reason for thinking that it isnt actually in the Nominative Casemarked, Specier of IP position. Kosters argument makes reference to a process that is found in certain question-types in English. Normally, in these question contexts, it is possible to move I0 to the front of a sentence, as in (44). (44) Mary will put the book on the table. Will Mary put the book on the table?
87
3. Positioning Arguments
How precisely this is done is the subject of the next chapter. Whats relevant here is that this process is blocked if it interacts with another process that moves something to the left edge of IP, as in (45). (45) Mary will put the book on the table. On the table, Mary will put the book. These two processes cannot apply to the same sentence, as (46) indicates. (46) a. Mary will put the book on the table b. On the table, Mary will put the book. c. * Will on the table, Mary put the book?
Now, interestingly, for many speakers of English the presence of a nite CP as a subject of a sentence also blocks movement of I0 . (47) a. That Mary has left should be obvious. b. * Should that Mary has left be obvious? c. That Sally sleeps late bothers Mittie. d. * Does that Sally sleeps late bother Mittie?
This would follow if nite CPs are driven from the nominative Case marked Specier of IP, and adjoin to the left of IP in these cases. Stowells method of forcing NP Shift to apply to complement CPs would extend to this scenario as well. CPs start out in the nominative Case-marked position, but are driven from there in order to form an S-structure. This evidence all points to the fact that nite CPs move to either the left or right linear edges of the sentences they are part of. If CPs are prevented from being in Case marked positions in the S-structure, this would get us close to deriving this fact. But is there evidence for the motivation for this movement that Stowell proposes? In particular, is there motivation for the claim that nite CPs, like NPs, require Case? One piece of suggestive evidence comes from the class of verbs that permit both NPs and nite CPs. These are only just those verbs that already exceptionally allow two NPs. These include verbs like promise, tell, and show; they are known as double object verbs. Some examples indicating that double object verbs also take NP and CP are in (48). (48) a. Mary promised me that she would sing. Mary promised me the ring
88
This isnt completely the case, as (49) is a counterexample. (49) a. Mary persuaded Bill that he should go. b. * Mary persuaded Bill the fact
But, so far as I know, (49) is the only counterexample. To the extent that there is a match in the verbs which accept NP CP and those which accept NP NP complements, there are grounds for believing that their surface positions are governed by the same, or similar, principles. And to the extent that the dominant principle is the Case Filter, then there is reason to conclude from these data that CPs are subject to the Case Filter as well. This match between the distribution of NPs and CPs should be completely general if Stowell is correct. Indeed, nite CPs are distributed in sentences much like NPs are. We nd them as complements to verbs, as we have seen, and in the subject position of other nite clauses, but not in the subject position of innitives (as we shall see shortly). These are just the Case-marked positions. But there are several differences in their distribution. In English, nite CPs are probably never found as the complements to a preposition, though of course NPs are. The only potential counterexample comes from temporal prepositions, as in (50). (50) a. I left before Mary arrived. b. * I left before that Mary arrived.
Similarly, both adjectives and nouns can take CP complements, but not NP complements. (51) (52) a. Sean is unhappy that he had to sing. b. * Sean is unhappy that. a. the proof that lemons cure cancer b. * The proof this fact.
If this has a Case Theoretic explanation, then Stowells theory is in trouble. But it could also be that this arises because of some property of c-selection. The jury is still out with respect to Stowells suggestion that Case is responsible for the surface position of CPs.
89
3. Positioning Arguments
Though the evidence suggests that Stowells ideas meet with some success, there are problems too. One which threatens Stowells Adjacency Condition on Case assignment, and its use in xing the order of complements, concerns socalled double object constructions, as in: (53) a. b. Mary showed Bill the picture. Bill baked Sally a cake.
How can the second NP in these examples receive Case? We will have to develop some of the rest of the system that is responsible for linearizing arguments before we can engage this difculty. So, let me ask for your indulgence and postpone examining this case.
90
As indicated by the brackets, it is thought that the strings following let and make form a single phrase which is serving as the complement to these let and make. Why dont we instead believe that these verbs are followed by two complements, the rst of which is them? Because them is not an argument of the verb. We can see from other examples that this isnt the case. In (56), for example, this position can be occupied by an expletive. (56) a. b. I will let [it seem that there is a mistake on the handout]. This will make [it obvious that there is a mistake on the handout].
We discovered in 3.1 that expletive it can only stand in Specier positions. Thus, the positions following let and make in these cases must be Specier positions. Note that the bracketed string in (55) is, indeed, an AP and VP, and not IP. This, we know, because the strings which follows these specier positions are characterized by the rules which generate Vs and As, and not those which characterize Is. As (57) indicates, an I0 is not found in this string. (57) a. * I will let [her should make a cabinet]. b. * I will let [her to make a cabinet]. c. * I will let [her makes a cabinet].
Finally, why might wonder if the bracketed strings in (55) are just A and V. Why dont we think that them is inside the V or A? I cant show you a reason for this in the case of A; but for V we can tell this from the action of the do so phenomenon. Recall that do so replaces Vs; so the contrast in (58) indicates that the NP in (55a) is not in V. (58) a. I will let [her make a cabinet] and I will let [him do so] as well. b. * I will let [her make a cabinet] and I will let [do so] as well.
From these observations we will want to give the examples in (55) the parses in (59).
91
3. Positioning Arguments
(59) a.
V let NP them V make V VP V NP a cabinet
b.
V make
V AP NP them A angry A PP at me
These complements to let and make are sometimes called small clauses, and it is to Stowell, once again, that we owe this way of viewing them.3 If this is correct, we have now discovered what stands in the Specier of AP and VP. These positions can contain a subject argument. A similar conclusion can be reached for PPs, though here the evidence is somewhat less meager. Cases such as (60) plausibly involve small clause PPs. (60) a. I want those kids in the bathtub! b. She saw them in the next room. Both of these examples can have a meaning in which it seems that the verb has just one object. In (60a), the object of want seems to be, roughly, what them in the bathtub now describes. The content of my desire is for those damn kids to be in the tub. Under our present understanding of how -roles are assigned, this requires that (60a) have the parse in (61).
92
those kids
Similarly, among its several meanings, (60b) can describe a situation in which what was seen was them in the room. This reading arises when in the room does not describe where the seeing event took place that reading is manufactured by letting in the room modify see but instead describes the location of the referent of them. To produce such a reading, we want them in the room to be the object of see, and this is achieved by a parse like (62). (62)
V see NP them P in V PP P NP the room
In both of the examples in (60), then, there is reason to believe that the verbs are followed by one PP, whose Specier is lled with the NP argument. Consider now the examples in (63). (63) a. They should make a cabinet. b. They are angry at me. c. They are in the next room.
These sentences contain the same VP, AP and PP that constituted small clauses in our earlier examples. Moreover, the -role borne by the subject in (63) seems to be the same as that borne by the NP in the examples with small clauses. Indeed, the meaning of the sentences in (63) seems to be identical to those of the small clauses in the previous examples, except for that meaning introduced by
93
3. Positioning Arguments
the modal should (in (63a)) and the verb be (in (63b,c)). A number of linguists independently noted that there could well be a small clause within (63), but that this arrangement is obscured by a movement transformation that relocates the subject.4 Indeed, the Case lter based theory about where argument NPs must be predicts that (63a) could have a small clause in it at D-structure. Suppose that the -role that they bears in (63a) is assigned to the same position that it is in our small clause example in (55a), that is to the Specier of VP. Then the Theta Criterion would require the D-structure of (63a) to be (64).
(64)
The subject argument, they, bears nominative Case, and yet in this phrase marker, it is not in the position our system assigns nominative Case to. That position is the Specier of a nite IP, which is the position that they is, in fact, spoken in. Imagine, then, that there is a movement transformation that relocates they into the Specier of IP, as in (65).
4 See Kuroda (1988), Kitagawa (1986), Fukui and Speas (1986) and Koopman and Sportiche (1991) for proposals along these lines that t into the general theoretical assumptions defended here, and Fillmore (1968) and especially McCawley (1970) for early antecedents to the idea.
94
The same derivations could be present in (63b) and (63c) as well. They would be as indicated in (66). (66)
NP they I present V be A angry IP I VP V AP A PP at me NP they I present V be P in IP I VP V PP P NP the next room
The conjecture that the subject arguments of IPs start out inside small clauses and move into the position they are spoken goes by the name of the derived subjects hypothesis. The movement transformation responsible for relocating the subjects is known as Argument Movement, or more commonly: AMovement. (67) Argument Movement Move an XP into Specier position.
95
3. Positioning Arguments
We shall examine some evidence in support of the derived subjects hypothesis in section 3.6 after we see other environments in which A Movement is alleged to occur. But before we turn to those tasks, we need to sharpen the conditions on Case assignment in order for them to produce the desired effects under the derived subjects hypothesis. Our present conditions on Case assignment are (68). (68) Conditions on Case assignment a. Nominative Case is assigned by nite I0 to its Specier position. b. Accusative Case is assigned by verbs and prepositions to an adjacent position. The description of where accusative Case is assigned does not produce the right results when coupled with the derived subjects hypothesis. To see this, consider the D-structures of some of the examples weve looked at. (69)
IP I I should NP they V make VP V NP a cabinet I present V be NP they A angry IP I VP V AP A PP at me
The subjects in these examples cannot be assigned accusative Case; they only appear with nominative. But the statement of accusative Case assignment in (68b) does not ensure this. The subject is adjacent to the verb make in (69a), and in (69b) it is adjacent to the verb be. The case in (69b) contrasts minimally with (70).
96
In (70), make assigns accusative Case to them. Because of the parallel structures of (70) and (69b), the usual conclusion is that the difference in availability of accusative Case must devolve to the difference in the verbs involved. Unlike make, be does not assign accusative Case. It appears that (71) is true. (71) The ability to assign accusative Case is part of the lexical specication of a verb. The response to (69a) needs to be more general. No matter what verb is involved, it is not able to assign accusative Case to its Specier position. The usual answer given to this problem, one that extends to other cases, as we shall see, is based on the structural relationship between the verb and the positions it can assign Case to. A verb is able to assign accusative Case to its sister, as in (69a), or to a position within its sister, as in (70). To capture this range of positions, we dene the following relation.5
5 This relation comes from Reinhart (1976), whose denition is slightly different from the one produced here. (The central difference is that she uses branching node in place of my phrase.) The name c-command is short for constituent command. Reinhart presents her proposal as a modication of the relation command in Langacker (1969). It is close to the notion in construction with that is found in Klima (1964), which Langacker cites as his model.
97
3. Positioning Arguments
(72) c-commands iff a. every phrase dominating also dominates , and b. does not dominate . The c-command relation governs a wide range of syntactic phenomena, and so it will arise in several other contexts in these lectures. The denition in (72) anticipates these later uses, and so goes a bit beyond what is necessary for the Case assignment relation. Only (72a) is needed here. (72b) prevents things from c-commanding themselves or their contents, and neither of these scenarios are relevant for the Case assignment relation. Armed with c-command, we can restrict the condition on accusative Case assignment to (73) and, thereby, prevent verbs from assigning accusative Case to their Specier position. (73) Accusative Case is assigned by some verbs and prepositions to positions that they c-command and are adjacent to. We need one additional change to the condition on accusative Case assignment. This change is made necessary by the contrast in grammaticality of examples like (74). (74) a. * She made him running easy. b. She saw him running.
The complement to saw in (74b) is a kind of non-nite IP called a gerund. These IPs are headed by the inectional morpheme ing that appears on the following verb. The subject of this IP in (74b) receives its accusative Case from see; its S-structure parse is (76) on the facing page. The condition on accusative Case assignment in (73) correctly captures the grammaticality of this example; see c-commands and is adjacent to him. Consider now the contrasting (74a). In this example, the gerund is the subject of a small clause, as indicated in (77) on the next page. The condition on accusative Case assignment in (73) would allow make to assign Case to him in (77): make c-commands, and is adjacent to, him. And yet, the ungrammaticality of this example traces back to the availability of Case for him, as can be appreciated by considering the fact that this example becomes grammatical without him. (75) She made running easy. The ungrammaticality of (74a) should arise because it violates the Case lter, and this indicates that make cannot assign accusative to him.
98
(77)
NP she
IP
A salient difference between (74a) and (74b) is the depth of embedding of the NP to be assigned Case. There appears to be a condition on how far into
99
3. Positioning Arguments
its sister a verb can assign Case. The precise statement of this condition is still under investigation; we will consider some of the proposals in what follows. I will add this condition to the formulation of Case assignment in a way that allows us to modify it as we go along. An initial reformulation, then, is (78). (78) Accusative Case is assigned by some verbs and prepositions to positions that they c-command, are klose to, and are adjacent to.
is klose to iff there is no more than one phrase that dominates but not .
This denition allows a verb to assign Case to its sister and to the immediate daughters of its sister, but prevents it from assigning Case to any position more deeply in its sister. In Lectures on Government and Binding, and much subsequent work, a term government is dened that collapses the ccommand and klose relations. The condition on accusative Case assignment is often formulated as (79). (79) Accusative Case is assigned by some verbs and prepositions to positions that they govern and are adjacent to.
governs iff c-commands and is klose to .
With these changes, then, the system of Case assignment and the Case lter correctly work with the derived subjects hypothesis to ensure that subjects of nite clauses show up with nominative Case in the Specier of IP at S-structure. The existence of small clauses, and the accompanying thesis that subjects of IPs start out in small clauses, requires that the subject -role be assigned to the Specier of the phrase headed by the term that has the -role. The Projection Principle determines where -roles are assigned, and so we should modify this condition to capture where subject -roles are assigned. At present, the Projection Principle requires that words assign their object -roles to positions that they are sisters to. Lets broaden this to: (80) The Projection Principle assigns a -role to iff and are sisters, where is either the X0 or the largest X that projects. The Projection Principle now correctly states where -roles are assigned. This leaves us with the job of determining which -roles get assigned to which positions. The -roles assigned to complements can be (largely) distributed by way of the c-selection. If a verb assigns two object -roles, like, for instance, put, those -roles can be distributed on the basis of the categories
100
Theme Location
b. give: Agent, NP , PP
Theme Goal
The -role that is not associated with a c-selection, the Agent -roles in (81), could be understood as assigned to Specier of V. This -role is often called the external -role. Together with the Projection Principle, this will come close to matching heads with their arguments correctly. Its easy to see that there are some inadequacies to this system, however. It does not produce a awless way of distributing -roles to complements when there is more than one complement of the same morphosyntactic category, as in (82). (82) Jerry talked to Bill about Mary. Jerry gave Sally the books.
Moreover, there is some evidence that the -role assigned externally is not completely determined by the verb alone. Marantz (1984) notes that the specic content of the subjects -role varies as a function not just of the verb, but also of the verb in combination with the material that follows it. This can be seen by considering (83). (83) a. b. c. d. She should make a cabinet. She should make an effort. She should make some noise. She should make nookie.
This doesnt seem to be true of the -role complements bear, however. It isnt possible to nd examples where the specic content of the objects -role varies as a function of the contribution the subject makes. We may need a method of seeing the semantic combination of subject and V that takes into account more than just the meaning of the head verb. Well return to these issues in Chapter 6. In the meanwhile, the method of thinking about -role assignment that (81) represents will serve.
101
3.5.1 Passive
Consider, for instance, the pair of examples in (84). (84) a. He will make [her angry at me]. b. She will be made [angry at me].
The -role that she bears in (84b) is the same one that it bears in (84a), and yet she shows up in a different Case and in a different position in (84b). The VPs are different in these examples. The verb in (84b) is said to be in the passive voice, while the verb in (84a) is in the active voice. Passive verbs in English are embedded under the auxiliary verb be. It is a fairly reliable generalization about the Passive in Indo-European that Passive verbs do not assign Accusative Case, even if the Active verbs they are related to do. English conforms to this generalization. We wont look at Passive in detail, but an informal way of thinking about it is as an operation that derives a passive predicate from an active one by way of the changes described in (85). (85) Passive a. Add passive morphology to the active verb and embed it under be, and b. Rob the verb of its Accusative Case, and c. Rob the verb of its external -role. This takes a garden variety transitive verb and produces a verb which looks intransitive, as indicated in (86).
102
Argument Movement
(86) Jerry admires her. She is admired.
The D-structure representation of the Passive sentence in (86) is (87a) from which the S-structure in (87b) is produced. (87) a.
IP I I pres V be V admired VP V VP V NP she
b.
NP she
IP I I pres V be VP V VP V V admired
Because Passive has robbed admired of its ability to assign Accusative Case, its object will seek out the Nominative Case marked position to satisfy the Case lter. This will cause (87a) to be mapped onto an S-structure in which she is positioned in the Specier of IP. The passive in (84b) has the derivation in (88) on the following page. This mirrors the kind of derivation that the derived subjects hypothesis gives to every nite clause. The subject argument is put into a D-structure position from which its -role derives, but cannot remain there and satisfy the Case lter. Argument Movement kicks in and gives it a position in which it is Case marked. Passive Constructions, then, are ones in which we have independent support for displacement of arguments from their -marked positions. They even manufacture cases which resemble the situation the derived subjects hypothesis claims to be widespread. Theres a confusing aspect to English morphology which clouds how the Passive works in English. English has a morphological rule by which transitive verbs can be converted into adjectives, and these adjectives look just like Passive participles. This process is responsible for created the adjective in (89), for instance.
103
3. Positioning Arguments
(88)
NP she I will V be V made A angry IP I VP V VP V AP A PP at me
(89) the admired student It could be, then, that what we have in (87b) is an adjectival phrase headed by admired rather than a Passive VP. It might have a derivation like that in (90) on the next page. Over a large class of cases, then, we will not be able to tell whether we are looking at a Passive sentence or a be+AP (a so-called copular construction) sentence. In some cases, however, it is possible to tell.6 This is because the morphological process that derives adjectives from verbs operates on the arguments of the verb. In particular, it makes the -role that a verb assigns to the NP it cselects become the external -role of the resulting adjective. For instance, the adjective in (90) assigns to its Specier position the same -role that the verb it was derived from assigns to its complement. This is quite general. We can see this by considering the relation these adjectives have to the nouns they mod6 This point is made in Wasow (1977), where the way of resolving the problem we will adopt is also proposed.
104
Argument Movement
(90)
NP she I pres V be IP I VP V AP A A admired
ify. The pre-nominal adjective in (89) is related to the noun it modies in the same way that the verb it derives from is related to its object. When we gure out how the modication relation works semantically, we will need to derive the fact that the modication relation for adjectives roughly matches the relation these adjectives have to their subjects in the copular construction. We can use this correlation, then, to see which -role is externalized by the adjective formation process. What we nd when we look is that, in fact, its always the direct object -role that gets externalized. In each of the cases in (91), for instance, the modied noun is related to its adjective in the same way that the direct object is to the verb. (91) a. b. c. Sally introduced the man to Sean. the introduced man Sally placed the book on the table. the carefully placed book Sally baked the cake for Sean. the baked cake
In no case is this relation like the one that holds between the verb and its subject argument or its indirect object argument.7
7 For more details about this process, see Levin and Rappaport (1986).
105
3. Positioning Arguments
Because of this restriction on the adjective formation process, we can be certain that the example in (84b) is a Passive and not an adjective. This is because the argument that shows up as the nominative argument in this example is not an argument of the verb at all. It is the subject of the complement of the verb. If made were an adjective in this example, then its the complement itself that would have appeared as subject, as can be conrmed by considering alternations like those in (92). (92) a. b. Someone made this table this (recently) made table.
3.5.2 Raising
There are verbs which behave much like the Passive verb in (84b), but without needing to be Passivized. An example is (93). (93) She seems [angry at me]. Just as in the Passive example, she is the subject of angry at me, not seems. And yet, just as in the Passive, the subject of angry at me shows up in a higher Specier of IP and in the nominative Case assigned to this position. This example seems to require the derivation in (94) on the facing page. Verbs like seem are called raising verbs. They have properties that force an argument in their complement to raise into their subject position. Under the proposals we are examining, one of those properties is that it does not assign accusative Case. Raising verbs, then, are like be in being among those verbs that do not have an accusative Case associated with them. As a result, in situations like (94) the argument of the embedded small clause must Argument Move into a position where it may satisfy the Case lter. In these circumstances too we see cases like those the derived subjects hypothesis claims are common.
106
Quantier Float
(94)
NP she I pres V seem A angry IP I VP V AP A PP at me
always are. The existence of movement Transformations, or some other means of overcoming violations of Contiguity (i.e., resolving mismatches between constituent structure and linear order), makes it understandable why the surface position of the subject of nite clauses is not the Specier of VP. And the Case lter provides a reason why a movement transformation would be invoked in these contexts: it gives an account of why subject arguments remain in Specier of VP sometimes, and move others. The derived subjects hypothesis, then, ts together independently needed pieces movement transformations, the Case lter, and a Projection Principle that produces small clauses into a simple picture of how subject arguments are placed. Nonetheless, it would be encouraging to nd empirical support that directly speaks on behalf of the derived subjects hypothesis. Sportiche (1988) argues that the phenomenon of Quantier Float provides such evidence. Quantier Float arises when a quanticational term is not in the determiner position normally reserved for quantiers, but is instead to the right of NP it is associated with. It can be illustrated by the English and French examples in (95).8
8 Miyagawa (1989) provides a similar analysis for Quantier Float based on Japanese data.
107
3. Positioning Arguments
(95) Les enfants (*tous) ont (tous) vu ce lm (tous). The kids (all) have (all) seen this lm (*all). Sportiche argues that there is a violation of Contiguity in these examples of the sort that movement Transformations are designed to account for. He suggests that the oated quantier and the NP it is related to start out together at Dstructure and then are separated at S-structure by Argument Movement. Note that there are language particular differences in this process that are reected in the differing grammaticality judgments for some of the French and English examples. Well return to these differences in the next chapter. Sportiches argument relies on the observation that the relation between the Floated Quantier and the argument it is related to mimics the relation that Argument movement is designed for. Once this is appreciated, the fact that the oated quantier appears in just those positions the derived subjects hypothesis predicts it should provides the evidence we are looking for. To see the force of his argument, we will rst have to learn some generalizations about Argument Movement. One of those generalizations is that the moved phrase can only be relocated to a c-commanding position. (96) If moves to the position , the must c-command . This is a generalization about all movement Transformations: all clear cases of a term moving to a non-c-commanding position are ungrammatical. The sentence in (97) on the next page, for example, is ungrammatical because the rule of Wh Movement has moved the wh-phrase to a Specier of CP that does not c-command the position the wh-phrase occupies at D-structure.9 Heavy NP Shift also obeys (96), as the ungrammaticality of (98) on page 110 demonstrates. These are a small sample of examples that the movement Transformations weve dened would create ungrammatical S-structures because they do not conform to the generalization in (96). We should want to understand why (96) is true. At present, though, it does appear to be true and so we can conceive of it as a constraint on movement Transformations generally. Sportiche points out that Quantier Float is subject to the parallel constraint in (99).
9 I have placed the CP complement to explain inside the V in (97), but we have now concluded that CPs are Heavy NP Shifted to the right of VPs. I leave out this detail here just to simplify the derivation involved. This example would still violate (96) even if Id given the correct Sstructure parse.
108
Quantier Float
(97) * He explained to whom Sally left.
IP NP he I past V explain PP to whom C I VP V CP C IP Sally left
(99) If is a oated quantier related to the NP , then must c-command . It isnt possible for (100b) to have an interpretation like that which (100a) gets, presumably because it is not possible for Quantier Float to relate these two structures. And because (100b) can have no other interpretation there are no other plural NPs for tous to be construed with the sentence is ungrammatical. (100) a. [Lauteur de tous ces livres] a vu ce lm. the author of all these books has seen this lm
b. * [Lauteur de ces livres] a tous vu ce lm. the author of these books has all seen this lm The ungrammaticality of (100b), then, can be credited to the fact ces livres fails to c-command tous. What the c-command requirement on Quantier Float establishes is that however the semantic connection between a oated quantier and the NP it involves is worked out, it should follow that the NP c-commands the quantier. There are a variety of ways that could be ensured, but because movement Transformations obey this relation, they provide one. Well see what Sportiches
109
3. Positioning Arguments
(98) * That Sally stood every silly boy bothered.
IP CP C that NP Sally I past VP V V stand IP I I past VP NP every silly boy V bother I VP V
suggestion is about the semantic connection between the oated quantier and the NP in a moment. But rst, lets consider how Quantier Float seems to be diagnostic of Argument movement. We have seen in the previous section that the subject argument, the kids, in examples like (101) has moved from a position within the small clause that follows the main verb. (101) a. b. c. The kids seem happy. The kids were made happy. The kids were on the table.
And, as expected on Sportiches proposal, a oated quantier within the small clause can be related to the boys, as in (102).10 (102) a. b. The kids seem both happy. ? The kids were made both happy.
110
Quantier Float
c. The kids were both on the table.
The relationship between the oated quantier and the subject is like that in examples such as (103), then. (103) a. b. She made the kids both happy. She found the kids both on the table.
In (103), unlike (102), the NP is able to remain within the small clause and satisfy the Case lter. Consider now (104). (104) The parents found the kids both on the table. This is grammatical, and has an interpretation in which both is construed with the kids. It doesnt have an interpretation, however, in which both is construed with the parents. Unlike (102), the oated quantier in the small clause of this example cannot be construed with the argument in the higher subject position. The difference between (102) and (104) is that the argument in the higher subject position is -marked inside the small clause in (102), but in (104) it is -marked by the V containing the small clause. In fact, (104) is part of the following generalization about the relationship between a oated quantier and the NP it is construed with. (105) Let be the X that assigns a -role to some NP. A oated quantier construed with that NP cannot be contained in . Because the parents in (104) is -marked by some phrase that contains nd, the oated quantier is necessarily contained within this projection and (105) prevents it from being construed with the parents. Sportiches proposal that Argument Movement relates a oated quantiers position to the argument it is construed with derives (105). The phrases that Argument Movement will apply to must start out in their -marked positions; this is a consequence of the Theta Criterion. Therefore, because Argument movement brings arguments to c-commanding positions only, there will be no way for an argument to be placed within the X that -marks it. Its clear, now, why examples like (95) provide evidence for the derived subjects hypothesis. (95) The kids have all seen this lm. For (95) to conform to (105), the position that the kids gets its -role from must be within the phrase containing have. That is precisely what the derived subjects hypothesis claims.
111
3. Positioning Arguments
Lets consider now how Sportiche proposes to make the semantic connection between the oated quantier and the NP it is construed with. His leading idea is to use whatever semantic rules of interpretation derive a meaning from (106) to give what appears to be the same meaning for (95).
Imagine that the semantic connection between all and the NP it combines with in (106) is established at D-structure. This connection, then, is produced from the representation in (107).
(107)
This D-structure yields the S-structure representation for (106) when Argument movement relocates the entire QP all the kids into the Specier of IP. It yields the S-structure for (95) when Argument movement applies to just the NP part of this QP, leaving the quantier behind, as in (108).
112
Quantier Float
(108)
NP the kids I pres V have Q all QP V seen IP I VP V VP V NP this lm
Theres a popular alternative to Sportiches account which you should keep in mind. Ill introduce it now, and offer a problem for it, but keep it at hand because it may begin to look more appealing when we discover some of the problems for Sportiches analysis. This alternative collapses the conditions on so-called subject oriented (see Jackendoff (1972)) adverbs with those of Qoat. Note that these adverbs have a distribution very much like oated quantiers. (109) The kids (deliberately) have (deliberately) seen (*deliberately) this lm (*?deliberately). To get the interpretation right, we would need a semantics that lets a oated quantier distribute the predicate that follows it over the parts of the subject. This approach to Q-oat has many advocates: Bowers (1993), for example, and Kayne (1975); (Roberts 1987, chapter 3) discusses the semantics. This alternative can be wedded to Sportiches proposal that Argument movement is involved; it can be seen as a different way of producing the semantic composition. If we are to maintain Sportiches suggestion that Argument movement is implicated, we should see the D-structure representation for (95) as being something like (110).
113
3. Positioning Arguments
(110)
IP I I pres V have NP the kids QP Q V VP V VP V V NP this lm
all seen
The similarities in the relationship that Argument movement produces between -marked position and surface position of an argument and that Quantier Float seems to require between a oated quantier and the NP it is construed with recommend maintaining this part of Sportiches proposal. What we might consider altering is how the quantier combines syntactically with the NP it is construed with. Sportiches (best) argument against this alternative is that the distribution of subject-oriented adverbs and oated Q are not always the same cross-linguistically. Theres another kind of argument against this alternative which is found in Giusti (1990b,a) which is worth looking at. Her argument is typological, and based on where oated quantiers can be found related to objects across languages. First, note that oated quantiers related to objects are, by themselves, something of a puzzle for an approach like the alternative sketched above. That account demands that the oated quantier combine with a V, or similar predicate, and steer how that predicate combines semantically with the neighboring NP. The existence of oated quantiers related to objects, then, is by itself an argument for Sportiches imagine of the underlying connection between quantier and NP. What Giusti observes is that the languages which allow quantiers to be related to objects are those for which we have independent evidence that objects can move.
114
Quantier Float
Thus, German/Dutch and Icelandic, but not English or Danish, can have a oated quantier related to an object, as in (111) and (112). (111) a. Der Lehrer hat die Schler alle gelobt. the teacher has the students all praised The teacher has praised all the students. (German) De leraar heeft de kinderen allen geloofd The teacher has the children all praised. The teacher has praised all the students. (Dutch) Stdentarnir lasu greinina allir. students-the read article-the all The students read all the articles. (Icelandic) (Giusti 1990a, (1): 137, (22a) & (25b): 144)
b.
c.
(112)
a. * The teacher has praised the children all. b. * Laereren roste eleverne alle teacher-the praised children-the all The teacher praised all the students. (Danish) (Giusti 1990a, (2c): 137)
She argues that this can be related to the fact that denite objects in Dutch, German and Icelandic can move leftwards, whereas this is not possible for objects in English and Danish. That leftwards movement is possible in these languages is, at least partly, indicated by the fact that they can precede adverbs which are normally found at the left edge of VPs in these languages. We have the contrasts, then, between examples such as (113) and those in (114). (113) a. Der Lehrer hat die Schler ohne Zweifel gelobt. the teacher has the students without a doubt praised. The teacher has without a doubt praised the students. (German) De leraar heeft de kinderen gisteren geloofd. the teacher has the children yesterday praised. The teacher has yesterday praised the students. (Dutch)
b.
115
3. Positioning Arguments
c. Stdentarnir lasu greinina ekki. students-the read articles-the not The students didnt read the articles. (Icelandic) (Giusti 1990a, (2): 136, (22a): 144, (13): 141)
(114)
a. * The teacher has praised the children not. b. * Laereren roste eleverne uden tvivl. teacher-the praised students-the without a doubt. The teacher without a doubt praised the students. (Danish) (Giusti 1990a, (2): 137)
This is just what we would expect on a Argument Movement account of Quantier oat, of course, because on that account movement is what is responsible for separating an NP from the quantier it is construed with. But this is also an argument for Sportiches particular way of putting the quantier together with the NP. On Sportiches proposal, the only way a quantier can appear after an NP it is construed with is by that NPs movement. The alternative account, by contrast, allows the D-structure arrangement of quantier and NP to put the quantier in a position that follows the NP. Its not clear how such an account could capture Giustis correlation. On the other hand, there are examples which Sportiches proposal does not explain. As he notes, his model of Quantier Float should allow quantiers to be related to objects in English in just those cases where the object has undergone Argument movement. Weve seen that one situation where this happens is in the Passive: an object moves into a higher Case-marked position when the verb it is an object of is Passivized. And yet, Quantier Float is not possible in these contexts, as the ungrammaticality of (115) indicates. (115) * The kids were praised all. For some steps towards a solution, see Bokovi c (2001, 2004). Another problem concerns examples like (103). (103) She made the kids both happy. In this example, the kids does not undergo movement. Its D-structure position is also the one where it gets assigned Case. If quantiers follow the NPs they are construed with only when they move, how is it that the quantier manages to follow the NP in this example? We will see a solution to this problem in Chapter
116
Quantier Float
6. Keeping these problems in mind, lets nonetheless adopt Sportiches account of Quantier Float.11 One interesting consequence of Sportiches analysis is that oated quantiers can be seen as indicators of the positions that arguments occupy not only in the D-structure element of the derivation, but in all the non-S-structure parses of the derivation. It gives us, in other words, a useful glimpse at these otherwise invisible parses. We learn from this that phrases can take smaller steps than necessary in their journey from a -marked position to a Case marked position. So in (116), for example, it appears that the children has made an intermediary stop in the Specier of the VP headed by have. (116) The children might all have eaten chocolate. Argument Movement is employed twice, as indicated in (117). (117)
NP the children I might QP Q all V have V eaten IP I VP V VP V NP chocolate
We learn from these sorts of examples that its possible for Argument movement to act on a structure that is itself the product of Argument movement. Such derivations are said to involve successive cyclic movement. The existence of such derivations means that complex interactions of movements are conceivable, and, as a result, the mapping from D-structure to S-structure is
11 For other work on Quantier Float, see Jaeggli (1982), Belletti (1982), Dprez (1989), Doetjes (1995) and Bobaljik (2003).
117
3. Positioning Arguments
potentially quite complex. There are problems that lurk here, but they are for another time.
3.7 Innitives
We have seen that raising verbs select small clauses but do not assign Case to the NP argument within that small clause. As a result, Argument movement brings that NP argument into a Case marked position. In (118), for instance, Sally has moved from within the small clause where its -role is assigned, into its surface, nominative Case marked, position. (118) Sally seems happy. Raising verbs are (some of them) also capable of selecting an IP headed by to, as in (119). (119) Sally seems to be happy In this example too, Sally has moved from a position within the small clause into its surface, nominative Case marked, position. But in this case, there is a question about how the EPP is satised. (The EPP, recall, is the requirement that a Specier of IP be lled with something.) Its clear how the highest IP satises the EPP: its Specier is lled with the subject argument that resides in it at S-structure. But what of the Specier of the IP headed by to? One possibility is that (119) has a successive cyclic derivation in which a representation like that in (120a) on the next page is manufactured. From this representation, A-movement could apply again to form the S-structure we see in (119). We might imagine, then, that (119) has a successive cyclic derivation. If this is correct, we should understand the EPP to be a condition that can be satised by any phrase marker in a derivation. That a successive cyclic derivation of this sort is possible is indicated by (121), whose derivation is indicated in (120b) on the facing page. (121) The girls seem all to like chocolate. We see in this example that Argument movement has moved all the girls into the Specier of the innitive before the NP part is moved into its surface position. The presence of the oated quantier in the Specier of the innitival IP is the tell-tale sign that the subject argument has passed through the Specier of the innitival clause.
118
Innitives
(120) a.
IP I I present V seem NP Sally I to V be VP V IP I VP V AP A A happy
b.
NP the girls
IP
I I
VP V
present
V seem
IP QP Q all
I I to V like VP V
NP chocolate
Innitival clauses come in a variety of types that can be distinguished from those that following raising verbs, and we will look at some of these in this section. The example in (122), for example, though it has a supercial similarity to (119), is different in some ways. (122) She tried [to be happy]
119
3. Positioning Arguments
In (119), the count of -roles and arguments matched, as expected. But in (122), this isnt the case. Both tried and angry assign an external -role, and yet there is only one argument which seems to bear them both: she. This is precisely what the Theta Criterion prohibits. Recall, that the Theta Criterion is designed in this way in an attempt to capture the fact that (123a) doesnt have the meaning that (123b) does. (123) a. b. I showed John. I showed John himself.
So, either we should nd another explanation for the fact that (123) illustrates, or we should nd a way of resolving (122) with the Theta Criterion. Note also that (122) seems to be a violation of the Extension to the Projection Principle. Unless we determine that (122) involves Argument movement of she through the Specier of the innitival IP, as in the raising case, there appears to be nothing that occupies this Specier position at any point in the derivation. Moreover, its not just that (122) seems to counterexemplify these principles, it perversely cant satisfy them. If the Specier of the embedded IP is given content, and in so doing hosts an argument for the -role that angry assigns, the result is ungrammatical. (124) * She tried [him to be angry].
And if a predicate that does not have an external -role to assign is put into these innitives, the result is also ungrammatical. (125) * She tried [ to seem that pigs y].
Innitives with this peculiar mix of properties are called control innitives, and a wide assortment of verbs select innitives of this type. Some are in (126). (126) a. b. c. d. e. f. g. h. i. Sally prefers to eat chocolate. Sally managed to eat chocolate. Sally attempted to eat chocolate. Sally wished to eat chocolate. Sally asked to eat chocolate. Sally proposed to eat chocolate. Sally neglected to eat chocolate. Sally promised to eat chocolate. Sally decided to eat chocolate.
120
Innitives
In all of these examples, there is an argument here it is Sally, the subject of the higher verb that appears to bear the -role assigned to the subject of the innitival clause. We say in these cases that the subject argument controls the innitival clause, or controls the embedded subject. There are instances of control innitives where its the object of the higher verb that is the controller; (127) provides some examples. (127) a. b. c. d. Sally told Jim to eat chocolate. Sally asked Jim to eat chocolate. Sally required Jim to eat chocolate. Sally forced Jim to eat chocolate.
In these examples, it is the object, Jim, not the subject, Sally, that is understood to bear the external -role assigned by eat. From what we have seen about control innitives, we can draw the following conclusions. (128) a. A control innitive must have a subject argument. b. The subject argument of a control innitive cannot be pronounced within the innitive. c. The subject argument of a control innitive can be understood as the same as an argument in the higher clause: its controller.
Our goal is to understand what is responsible for these properties of control innitives. Our hope should be that it is consistent with, maybe even follows from, the forces that are responsible for positioning arguments. Well look at a few ideas in this section that might bring us within range of these desires. Ill start with the question of whether these innitives are CPs or not, and then turn to issues that more transparently bear on (128).
121
3. Positioning Arguments
(129) a. b. c. d. a. b. c. d. Sally told Jim how to eat chocolate. Sally asked when to leave. Sally decided when to leave. Sally wondered what to eat. Sally told Jim whether to eat chocolate. Sally asked whether to leave. Sally decided whether to leave. Sally wondered whether eat chocolate.
(130)
In many, not too distantly related, languages, even the non-question form of a control innitive comes with a complementizer. Icelandic is a particularly clear example of this. It has subject and object control innitives that closely resemble those of English. And yet these control innitives appear with the same complementizer that is found in nite clauses in Icelandic: a. (131) Mara lofai a lesa bkina. Mary promised that read the book Mary promised to read the book. compare: Mara segir a har lesi bkina. Mary says that you have read the book Mary says that you have read the book. We should conclude, then, that a control innitive can be a CP. By contrast, the same kind of evidence regarding raising innitives suggests that they are never CPs. There are no known examples of raising innitives that have the syntax of questions. (132) a. * It seems whether to have left. compare: John seems to have left. b. * It appears when to eat chocolate. compare: John appears to eat chocolate. c. * It is likely who to dance. compare: John is likely to dance.
This could be because innitival clauses are incapable of being CPs, and therefore do not support the CP-dependent syntax of questions. But it could also be because raising verbs (and adjectives, such as likely) do not semantically select
122
Innitives
questions. Whether a verb can take a question as an object depends on the semantics of the individual verb. A verb like believe, for instance, cannot take a question complement, whereas the similar know can. (133) a. * Jill believes whether Sean has left. b. * Jill knows whether Sean has left.
The absence of question complements to raising verbs could, then, simply reect a fact about the semantics of these predicates, and nothing about the syntax of their innitival complements. Indeed, raising predicates do not seem able to combine with question complements even when those complements are nite clauses. (134) a. * It seems whether Joe has left. compare: It seems that Joe has left. b. * It appears when Mara has eaten the chocolate. compare: It appears that Mara has eaten the chocolate. c. * It is likely who will dance. compare: It is likely that Anna will dance.
Finite clauses can always be CPs (it seems), and so we should conclude that the absence of question complements to raising verbs has to do with their meaning. Although we do cannot conclude that raising innitives are prevented from being CPs, we also cannot learn from their inability to be questions whether they are allowed to be CPs. Icelandic, however, provides a clue. Raising innitives in Icelandic cannot include the complementizer a. (135) * Mara hafi virst a hafa vaska upp diskana. Mary had seemed that have washed up the dishes Maria had seemed to have washed up the dishes compare: Mara hafi virst hafa vaska upp diskana. Maria had seemed have washed up the dishes Mary had seemed to wash up the dishes. This would be explained if raising innitives are IPs, but cannot be CPs. In Icelandic, then, we can conclude that one difference between raising and control innitives is their categorial status. Raising innitives are IPs but cannot be CPs, whereas control innitives are CPs. In English, the picture isnt as
123
3. Positioning Arguments
clear. There is no direct evidence that raising innitives cannot be CPs. And though there is evidence that some control innitives are CPs the questions other control innitives do not differ outwardly from raising innitives. English internal evidence supports the conclusion that control innitives can be CPs, but nothing more. If we consider the evidence from Icelandic and English together, however, and keep in mind the criterion of explanatory adequacy, we can nd some help in making decisions about the proper treatment of English innitives. We conclude from Icelandic that the mapping in (136) exists in at least one grammar. (136) raising innitive IP control innitive CP
Assuming that this is not a capricious relationship, we can assume that there are forces, still to be discovered, that produce it. If the mapping in (136) does not materialize in English, then we can conclude that those forces do not operate in the same way in English. When we model these forces, our model should allow them to vary from language to language. Consequently, we must credit the child with the ability to detect that variation as that child is acquiring his or her language. Lets call this hypothesis 1. Hypothesis 2 is that the forces responsible for the existence of (136) in Icelandic are also found in English. On this hypothesis, Icelandic and English should be identical with respect to the categorial status of their innitives. Any differences in the appearance of innitives in these languages will ow from other ways these languages differ. What are the differences between Icelandic and English that would have to be explained on hypothesis 2? As weve seen, Icelandic and English raising innitives look the same; nothing would have to be said under hypothesis 2 about how raising innitives differ in these languages. Control innitives in Icelandic and English also look the same in all ways except one: declarative control innitives can appear with a complementizer in Icelandic but not in English. Hypothesis 2 requires that this difference be made compatible with the mapping in (136). One way of doing that is with (137). (137) The complementizers of innitival clauses in English have no phonetic realization. It seems difcult to avoid a conclusion like (137) under hypothesis 2. Hypothesis 2 is likely to require, then, that words can be silent. It must credit the child with the ability to detect the existence of silent words in a sentence.
124
Innitives
So we have before us two alternatives. English does not conform to the mapping in (136) and the forces that are responsible for (136) can vary in ways that are detectable by children acquiring English. Or, English does conform to the mapping in (136) but has a silent innitival complementizer, and the existence of this silent complementizer is detectable by children learning English. Framing the analysis of English innitives in this way allows us to make decisions based on criteria of language acquisition. If the forces behind (136) turn out to be too distant from the information available to a child, then we can conclude that they cannot vary in detectable ways and were stuck with hypothesis 2. If, by contrast, the existence of words in a sentence cannot be detected without their phonetic manifestation, then were stuck with hypothesis 1. Hypothesis 2 is adopted in Lectures on Government and Binding, and it is, perhaps, the most popular in the subsequent literature. One way of ensuring that the silent complementizer is detectable to the language-acquiring child is to elevate the mapping in (136) to a language universal. If the child can rely on (136), then it can deduce that there is a silent complementizer in the declarative control innitives of English. Lets adopt hypothesis 2 as our working hypothesis as well. Well examine some alternatives in a later chapter. Before leaving the question of whether innitives are CPs or not, we should look at one last kind of innitive. Some predicates that take control innitives as objects can take an innitive which does have an overt complementizer, and in which the subject argument is visible inside the innitive. Some examples of these predicates and the two types of innitives they can combine with, are given in (138). (138) a. Misato wanted to eat durian. Misato wanted for him to eat durian. b. Junko preferred to eat natto. Junko preferred for me to eat natto. c. It is possible to eat poi. It is possible for some to eat poi.
The subject of the innitival clauses appears in the accusative Case, as is clear from the form of the pronouns in (138a) and (138b). The complementizer is for, which might easily be mistaken for the homophonous preposition found in sentences like She wanted something for Susan. We think that the for in (138) is a Complementizer for several reasons. First, it forms a constituent with the entire clause that follows it. This is indicated, among other things, by the behavior of these phrases in the context of cleft constructions.
125
3. Positioning Arguments
(139) a. Its for him to eat chocolate that Sally would prefer. b. * Its to him how to eat chocolate that Sally should explain.
The ungrammaticality of (139b) derives from the fact that there is only room for one phrase between the its and the that of these clefts, and in (139b) two things, a PP and an innitive, have been shoved into this spot. Thus the grammaticality of (139a) would seem to argue that we do not want to parse the boldfaced string as a PP followed by an innitive, but instead as something that makes a single phrase. To the extent that an innitive is an IP, and that the him in (138) is its subject, then one of the few ways of doing this is to let for be a complementizer. Moreover, the for that comes with these innitives has a property that no preposition in English has, but which the complementizer that does have. It is optional. Under a certain set of circumstances, the complementizer that can go unspoken in English, as in (140). (140) a. b. Sally said that he eats chocolate. Sally said he eats chocolate.
This is also possible for the for of those innitives that are objects to verbs, as (141) shows. (141) a. Misato wanted him to eat durian. b. Junko preferred me to eat natto.
Furthermore, the circumstances on the optionality of the complementizer that are mimicked by the optionality of for. In general, the complementizer that can go unspoken only in contexts where the CP it heads is a complement to a verb. That is why that cannot go missing when it heads a CP that has been clefted, as in (142). A parallel constraint on for ellipsis is indicated by (143). (142) (143) a. a. Its that he eats chocolate that Sally said. Its for him to eat chocolate that Sally would prefer. b. * Its he eats chocolate that Sally said. b. * Its him to eat chocolate that Sally would prefer. In general, the constraints on where that can go missing are parallel to those on where for can, suggesting that they are both subject to some process, restricted to complementizers, that allows them to go silent.
126
An instance of object control, like that in (146) below could have the derivation indicated in (147). (146) Shai told Florian to sing.
127
3. Positioning Arguments
(147)
NP Shai I past V tell IP I VP V NP Florian I to IP I VP V V sing
These derivations differ from the ones weve posited up to now for Argument movement in that they move arguments into -marked positions. If this is the correct account of control innitives, then, we should adjust the conditions weve placed on Argument movement to allow these derivations. Doing so would not only give a transparent account of the Control relation, it would also provide an account for how control innitives satisfy the EPP and that part of the Theta Criterion that requires every -role to be assigned to an argument. On this view, the Control relation is just the one established by Argument movement between an argument and its -marked position. This view of the Control relation is offered in Hornstein (1999) and its consequences explored in Hornstein (2000) and Boeckx and Hornstein (2003, 2004, 2006a,b). There are circumstances where control innitives do not involve control, however; in these circumstances the subject argument is not an argument expressed elsewhere. This is the case when, for instance, the verbs taking a control innitive are passivized, as in (148). (148) a. To eat natto shouldnt be tried. b. To be content is often wanted.
128
Innitives
c. To be honest should be promised in a wedding vow. In fact, whenever a control innitive stands in subject position, its understood subject does not have to be an argument found elsewhere; (149) provides some other examples. (149) a. b. c. d. To remind him of Amherst Winters bothers Pius. To eat durian shouldnt be possible. To leave your shoes untied is a danger to those around you. To be the largest prime number is to be very large indeed.
In these examples the subject of the innitive is understood to have a meaning close to that which one gets in sentences like (150), as can be appreciated by comparing this example to (149b).12 (150) For one to eat natto shouldnt be possible. The understood subject of these innitives is said to be arbitrary. In (148) and (149) there is no argument that could be the understood subject of the innitive and so it is easy to see that it must get another meaning. But there are also examples where potential controlling argument is present and yet an arbitrary reading for the understood subject of the innitive is available. That is the case for the examples in (151), for example, which may all have the same arbitrary interpretation that is found in (149). (151) a. Elena discovered that to remind Pius of Amherst Winters bothers him. b. Jeff thought that to eat durian shouldnt be possible. c. Bart remembered that to leave his shoes untied is a danger to those around him. d. Jill knew that to be the largest prime number is to be very large indeed.
When the innitive is a question, it may also sometimes support an arbitrary interpretation for its understood subject, as in (152). (152) a. Jon knew how to eat live shrimp. b. Satoshi can tell when to add vinegar to rice.
12 There is a subtle difference in the meanings of (150) and (149b), but it doesnt seem to me that this difference resides in the meaning of the subjects.
129
3. Positioning Arguments
c. Julie decided where to put the Dicentra. These can get interpretations roughly parallel to those in (153), in which the subject is arbitrary. (153) a. Jon knew how one can eat live shrimp. b. Satoshi can tell when one should add vinegar to rice. c. Julie decided where one should put the Dicentra. The precise conditions under which the arbitrary interpretation are available are imperfectly understood. See Williams (1980, 1989) and Landau (2004) for some ideas. Control innitives with an arbitrary reading pose the same problem for the EPP and Theta Criterion that normal control innitives do. They seem to violate the EPP there is nothing in the Specier of their IP and because the -role normally assigned to the subject has no argument to bear it, they seem to violate the Theta Criterion as well. Moreover, just like normal control innitives, they are ungrammatical if there is no -role for a subject argument, as (154) shows. (154) * To seem that pigs y is worrisome. compare: For it to seem that pigs y is worrisome.
Just as for normal control innitives, then, these facts taken together suggest that there is a subject argument present in these innitives. If such an argument is present, then the EPP and Theta Criterion could be maintained. Moreover, the ungrammaticality of examples like (154) would be explained: if there is a subject argument, then the Theta Criterion requires that there also be a -role assigned to it. In these cases, however, this alleged subject argument cannot be found in some unexpected position. We cannot here nger the controller as that argument because there is no controller. A common reaction, and the one I shall adopt, is that there is a subject argument syntactically present in these examples but that it is silent.13 This argument is indicated by putting PRO in the position our syntax suggests it should be. The representation for (149b), then, would be (155).
13 For an approach that does not countenance a silent subject, see Chierchia (1984).
130
Innitives
(155)
CP C e PRO I to IP I VP V eat durian I shouldnt V be IP I VP V AP A A possible
Ive decided that this innitival clause is a CP with a silent complementizer, represented here with e. Ive also shown PRO moving from the position that the external -role is assigned to, Specier of VP, to the Specier of IP where it satises the EPP. I also show the innitival CP itself moving from Specier of AP into Specier of IP; this claims that the innitival gets an external -role from possible and moves in order to satisfy the EPP. We can think of PRO as being a kind of pronoun (hence its name), maybe along the lines of a silent one. Of course, if the grammar of English includes a silent pronoun, then where it can be must be heavily constrained. Otherwise, sentences such as (156) should be grammatical, with Theta Criterion satised by the presence of PRO in subject position or object of preposition. (156) a. * Ran into the garden. b. * Andre talked to.
The problem of restricting PROs distribution remains unsolved. In fact, the question of where PRO can be is still controversial. There is no controversy over the question of whether PRO can be found in complement positions; it seems clear that it cannot. And it also seems clear that PRO cannot be in subject position of nite clauses, at least not in English. But its not clear yet in which non-nite clauses PRO can be a subject. Perhaps the most popular view presently is that PRO can stand as subject to any control innitive, not just those that get an arbitrary interpretation. On this
131
3. Positioning Arguments
view, the Control it does not reduce to a special instance of Argument Movement, but is instead an interpretation that PRO can receive. The representation for sentences involving Control would not be as indicated in (145) and (147), but would instead include PRO, as in (157) and (158). (157) Shai tried to sing.
IP NP Shai I past V try C e PRO I to I VP V CP C IP I VP V V sing
132
Innitives
(158) Shai told Florian to sing
IP NP Shai I past V tell I VP V NP Florian PRO I to IP I VP V V sing
Unlike the movement-based analysis of Control that (145) and (147) illustrate, the representations in (157) and (158) obey the Theta Criterion as it is now formulated. What we would need to do if these representations are correct is produce a non-movement account of Control, an account that would emerge as part of a more general account of how PRO gets its semantic interpretation. What weve discovered in this section, then, is that there is a silent pronominal argument in English and that this silent argument is prevented from appearing in a variety of positions otherwise normally available to arguments. Keeping in mind that this argument, PRO, cannot be in complement positions, we can partially describe its special distribution with (159). (159) The PRO Restriction PRO can only be in certain Specier positions at S-structure. Which Specier positions PRO can be in is controversial, but it certainly doesnt include the Speciers of nite IP. The PRO Restriction describes something that is peculiar to the placement of the silent argument: PRO. Lets now look at whether the conditions weve
133
3. Positioning Arguments
been developing to capture the placement of the non-silent arguments also hold of PRO.
If Argument movement could move NPs to non-c-commanding positions, then (160) should be able to have the derivation indicated in (161) on the next page. The derivation satises every other condition weve discovered. The EPP is satised for both Specier of IPs. The number of -roles and arguments is the same, so the Theta Criterion is satised. And every argument stands as a sister to the term that assigns it a -role at D-structure, and so the Projection Principle is satised as well. In particular, the argument Jill stands as a sister to the verb shown at D-structure, and there gets the object -role from shown. Because show has been Passivized, however, it does not have an accusative Case to assign to Jill, and so Argument movement must move this NP to a Case marked position. The Specier of the nite IP that follows Jill is such a Case marked position, and so movement to this position should satisfy the Case lter. If this position c-commanded the D-structure position of Jill,
134
Innitives
(161)
NP it I past V be V shown C that NP Jill I present V IP I VP V VP V CP C IP I VP V CP
every other condition would be satised and this sentence should be grammatical. Its the c-command requirement on Argument movement, then, that is responsible for preventing sentences such as (160). This can be appreciated by comparing the grammatical (162), in which the c-command requirement is satised, with (160).
135
3. Positioning Arguments
(162) Jill was shown that it seems that pigs y.
IP NP Jill I past V be V shown C that NP it I present V I VP V VP V CP C IP I VP V CP
If PRO is an argument NP, then we should expect that Argument movement will be responsible for xing its position and, consequently, we should nd a parallel pattern of grammatical and ungrammatical sentences that the successful or unsuccessful application of Argument movement is responsible for. In the case of overt NPs, the success of Argument movement hinges on being able to nd an S-structure that satises the Case lter that can be produced from a grammatical D-structure through an application of Argument movement that satises the c-command requirement. In the case of PRO, the grammatical sen-
136
Innitives
tences should be ones in which PRO can reach the Specier of an appropriate innitival IP from its -marked D-structure position by way of an application of Argument movement that satises the c-command requirement. This match in grammaticality patterns shows up. The ungrammaticality of (163), for instance, parallels the ungrammaticality of (160). (163) * It was shown to seem that pigs y. If the innitival clause in (163) is of the sort in which PRO is permitted, then this example satises every condition except the c-command constraint on Argument movement, as (164) indicates. (164)
NP it I past V be V shown C e NP PRO I to V IP I VP V VP V CP C IP I VP V CP
137
3. Positioning Arguments
That this innitive is of the kind that permits PRO in its Specier position is indicated by (165), which permits an arbitrary reading for the subject of its innitive. (165) Jill was shown how to y. When the c-command requirement on Argument movement is obeyed, then examples parallel to (164) are grammatical, as (166) indicate. (166) To be shown how to y is swell.
IP CP C C e PRO I to V be V shown IP I VP V VP V CP how to y I present I VP be swell
Examples like these, then, suggest that the position of PRO is controlled by the same set of constraints that weve seen determine the position of overt argument NPs. To the extent that this is true, it speaks on behalf of the existence of PRO. If the ungrammaticality of sentences like (160) are due to conditions on where NP arguments may appear in S-structures, and those conditions are mimicked by arguments which are semantically present but not spoken, then
138
Innitives
we should conclude that the syntactic position of those unspoken arguments matters. They must be present in the syntactic representation for that to hold. There is another condition on the Argument movement of overt NPs that we can seek in the Argument movement of PRO. This condition we have not yet encountered; it concerns how the Case lter is satised. It appears that NPs cannot be Argument moved from positions that are Case marked. (167) If Argument movement moves something from position , then cannot be a Case-marked position. This condition is thought to be responsible for the ungrammaticality of (168). (168) * Elena seems to that pigs y. (compare: It seems to Elena that pigs y.)
The argument NP Elena resides in its -marked position, which is the complement position of to, and from there moves into the Case marked Specier of IP position, thereby producing a representation that satises the EPP. The D-structure and S-structure representations meet the well-formedness requirements that they need to meet, and the position that Elena moves into c-commands the position it moves from. Even though it meets all the other requirements on a grammatical sentence, (168) is ungrammatical because it violates (167). The position that Elena resides in at D-structure is assigned accusative Case by to, and (167) prohibits Argument movement from a Casemarked position. Another example whose ungrammaticality relies on (167) is (169). (169) * Elena seems that is happy. The derivation that needs to be blocked here is shown in (170) on the following page. For this sentence to satisfy the EPP, it is necessary for Argument movement to bring Elena into the Specier of the embedded IP. But this position is assigned nominative Case by the nite I, and so movement from this position is prohibited by (167). There is consequently no way to derive (169) that does not violate either the EPP or (167). There are examples parallel to these involving PRO, and they too speak on behalf of the existence of PRO and letting Argument movement play a role in positioning PRO. Parallel to (168) is (171). (171) * To seem to that pigs y is swell. This example would be grammatical if it were possible to Argument move PRO from its D-structure position following to into the Specier of the innitival
139
3. Positioning Arguments
(170)
NP Elena I pres V seem C that I present V be IP I VP V CP C IP I VP V AP A A happy
clause, where it will satisfy the EPP and also meet the condition on where PRO is allowed. To explain its ungrammaticality in a way that relates it to (168), we should credit the ban against Argument movement from Case marked positions. Parallel to (169) is (172). (172) * To seem that is happy is swell. The derivation we should be considering for (172) the one that satises the EPP is (173).
140
Innitives
(173)
CP C C e PRO I to V seem C that I present V be IP I VP V CP C IP I VP V AP A A happy I present IP I VP be swell
Just like (170), this violates the ban against Argument movement from a Case marked position. A nal condition inuencing Argument movement that might be relevant
141
3. Positioning Arguments
for the placement of PRO is the Case lter. If PRO is an argument NP, then we should expect it to obey the Case lter. Is Argument movement invoked to put PRO into Case marked positions at S-structure just as it is to put overt NPs into Case marked positions? Its not easy to determine the answer to this question because of the confound that the PRO Restriction presents. The PRO Restriction describes the fact that PRO seems to occur at S-structure in the Speciers of control innitives and nowhere else. If PRO is also subject to the Case lter, then the narrower distribution that the PRO Restriction allows it will make it difcult to detect. We are reduced, in fact, to the question of whether the positions that the PRO Restriction allows PRO to be in are Case marked positions. We presently have two ways of determining whether a position is a Casemarked one. If an overt NP, bearing Case, is capable of surfacing in some position, then that position in assigned Case. And if Argument movement is prohibited from some position, then through a process of elimination we can determine whether the ban against moving from a Case-marked position is to blame and in this way discover whether a position is Case marked. By the rst criterion, we cannot conclude the the Specier positions in which PRO resides are Case-marked. The Speciers of control innitives cannot contain overt NPs. (174) a. * Her to run seems unlikely. b. * She to sing is swell.
From this, we might conclude that the positions PRO resides in are not Case marked positions. This would explain why overt NPs cannot surface in this position: the Case lter would be violated if so. This is the conclusion reached in Lectures on Government and Binding, and it informs most of the literature on PRO from the 1980s and 1990s. The second criterion gives a somewhat more mixed image. If the movement account of Control is correct, then the ban against moving from a Case-marked position allows us to deduce that the Specier of the control innitive that a controller moves from is not Case marked. So, for instance, we can conclude that the Specier position of the innitival complement to a verb like try is not Case-marked if there is Argument movement of the sort indicated in (175). (175) Hotze tried [CP [IP to eat durian ]]. If the innitive that controllers move from is the same kind that can host PRO in non-control environments, then the simplest assumption would be that the
142
Innitives
Specier of these innitives is not Case-marked when they host PRO as well. So, for instance, if the innitive clause that stands in the subject position of a Passivized try is the same as that which stands as the object of an Active try, as in (176), then the PRO this innitive contains is standing in a non-Case-marked position. (176) [CP [IP PRO to eat durian ]] shouldnt be tried.
On the other hand, if the movement account of Control is incorrect, then we cannot use the prohibition against movement from a Case-marked position to deduce that the subject position of a control innitive is Case-marked. And there are examples whose ungrammaticality could be derived from this prohibition if the subject position of control innitives are Case-marked. One of these is (177). (177) * Winnie was tried [CP [IP to eat durian ]].
If the prohibition against Argument movement from a Case-marked position is not responsible for the ungrammaticality of (177), then its not clear why it should be bad. There is only one -role for the argument Winnie, and so the Theta Criterion is satised, as is the Case lter and the c-command restriction on movement. Indeed, the contrast between (177) and (175) is problematic for the movement account of Control. The only relevant difference between these examples is the number of -roles involved: in (177), Passive has removed try of its external -role, whereas in (175) this -role remains. The movement account of Control relies on an as yet undiscovered force that will allow Argument movement from a control innitive into a -marked position but prevent Argument movement from the same position that is not into a -marked position. A theory of Control that makes it a special reading for PRO, however, can make sense of this contrast with the tools at hand. Let the Specier of a control innitive be a Case-marked position, and everything follows. The Theta Criterion as well as the ban against movement from a Case-marked position will prevent Argument movement from a control innitive into a -marked position. The movement that is illustrated in (175) will be blocked, and instead the subject of the embedded innitival clause will be PRO. The ban against Argument movement from a Case-marked position will continue to prevent Argument movement from a control innitive even when that movement is not to another -marked position, as in (177). On this set of assumptions, then, what makes a control innitive different from, say, the innitival complement
143
3. Positioning Arguments
to a verb like believe is how the Specier of the innitive is Case-marked. The subject position of the innitive that follows believe is accusative Case-marked by believe, and so Argument movement is possible from this position when believe is Passivized and thereby robbed of its accusative Case. (178) a. Bart believed [IP him to be happy ]. b. He was believed [IP to be happy ].
The subject position of the control innitive following try, by contrast, is Case marked by by something other than try and so Argument movement from that position will be blocked in both the Active and Passive versions of try. (179) a. * Bart tried [CP [IP to be happy ]]. b. * Bart was tried [CP [IP to be happy]]. If we adopt this account, the subject position of control innitives is Casemarked and since these are the positions that PRO resides in, we can let PRO be subject to the Case lter. Whats mysterious under this account, though, is why overt NPs cannot stand in the subject position of control innitives. If these positions are Casemarked, then we should expect not just PRO to be able to reside in them; overt NPs should be able to as well. But, as weve seen, this is not possible; examples like (180) are ungrammatical. (180) a. * She to sing is swell. b. * Elena tried her to sing.
A popular solution to this problem is proposed in Chomsky and Lasnik (1993). They suggest that the Specier of a control innitive is assigned a Case that only PRO may bear. English, they speculate, has a fourth Case, one whose morphological manifestation only PRO is capable of achieving. They call this the null Case. This would have the desired effect of blocking overt NPs from the subject position of control innitives. It could also be made to derive the PRO Restriction. If the only place that null Case is assigned is to the Speciers of control IPs, and if we not only restrict null Case to PRO but prevent PRO from bearing any other Case, then the Case lter alone will properly segregate the positions of PRO and overt NPs. Overt NPs will have to surface in a nominative, accusative or genitive Case-marked position, and PRO will have to surface in a position marked with null Case. Martin (1992, 2001) argues that the availability of null Case can be tied to the semantics of the innitival to that comes
144
Innitives
with control innitives. He argues that the to which heads control innitives has an interpretation as a kind of tense, whereas the to that heads the innitives that PRO cannot appear in does not have a tense interpretation. If correct, this would suggest that null Case is assigned under conditions rather like those that nominative Case is assigned. Nominative Case is assigned to the Specier position of a nite I0 ; on Martins view, null Case is assigned to the Specier of another type of tense I0 , the one that appears in control innitives. There are problems with both the idea that there is a special Case reserved for PRO and the view that this special Case is tied to a particular interpretation that distinguishes control innitives from others. Pires (2001, 2006) and Terzi (1992) and Roussou (2001) (among others) have shown that the tensed interpretation Martin claims to correlate with the presence of PRO doesnt. Therefore, the supposed Case that only PRO can receive does not seem to be predictable from other properties of the innitives it is alleged to be present in. Moreover, there is evidence that the subject position of control innitives is capable of being assigned the normal Cases that can be borne by overt NPs. This evidence comes from Icelandic. In Icelandic, oated quantiers can be marked for Case and when they are, they have the same Case that the subjects they are related to do. In (181), for instance, the subject Strkarnir (the boys) is inected for nominative Case, and so is the oated quantier allir (all). (181) Strkarnir komust allir skla. the boys.nom got all.nom to school The boys all managed to get to school. (Sigursson 1991, (6a): 331) The rules of Case assignment differ interestingly from English. Sometimes, the NPs in Speciers of nite IPs can appear in something other than nominative Case. This seems to depend on the verb involved. For instance, the subject argument of the verb vantai (lacked) appears in the accusative Case, and the subject argument of leiddist (bored) shows up in the dative Case. In these situations, a oated quantier related to the subject will inect for the Case that the subject bears. This is illustrated in (182). (182) a. Strkana vantai alla sklann. the boys.acc lacked all.acc in the school The boys were all absent from school. b. Strkunum leiddist llum skla. the boys.dat bored all.dat in school
145
3. Positioning Arguments
The boys were all bored in school. (Sigursson 1991, (6b,c): 331) In Icelandic, then, we can tell what Case a subject NP has by looking at its oated quantier; and subject arguments have a wider range of Case options than they do in English. We can use this feature of Icelandic to determine what Case, if any, PRO bears. What we nd is that PRO seems able to bear whatever Case an overt NP would bear in that position.14 The examples in (183) show this correspondence. (183) Strkarnir vonast til . . . the boys.nom hope for . . . the boys hope . . . a. [CP a PRO komast allir skla ]. [CP that PRO get all.nom to school ] to all manage to get to school. b. [CP a PRO vanta ekki alla sklann ]. [CP that PRO lack not all.acc in school ] to all not be absent from school. c. [CP a PRO leiast ekki llum skla ]. [CP that PRO bore not all in school ] to not all be bored in school. (Sigursson 1991, (8ac): 331) The controller Strkarnir (the boys) is marked with nominative Case in each of these examples. But the quantier oated in the control innitive inects in a way that depends on the verb of the innitival clause. When the verb is the kind that makes its subject appear in the accusative or dative Case, in nite clauses, it makes the oated quantier in the control innitives inect for the accusative or dative Case. If the match between the Case borne by a oated quantier and the Case borne by the subject it is related to holds in innitival clauses as well, then we can conclude from these examples that PRO bears the nominative, accusative or dative Cases assigned to Speciers of nite IP in Icelandic. Using null Case, then, to segregate PRO from overt NPs does not seem attractive. The evidence we have does not suggest that PRO and overt NPs differ with respect to the Cases they can bear. At present, it seems that the PRO
14 But PRO is capable of bearing other Cases as well, and so they do not behave precisely like overt NPs in this regard.
146
Innitives
Restriction, and the ban against overt NPs in the subject position of control innitives, must be sought elsewhere.
These innitival clauses can be CPs, as the presence of the wh-phrase in (184b) shows. In closely related languages, like Icelandic, its possible to show that they are always CPs. Weve entertained the hypothesis that these innitives are always CPs in English as well, and this requires the conclusion that the complementizers of these CPs are phonetically silent. Lets call these innitives: PRO-innitives. The existence of PRO means that there are forces which restrict its occurrence to just the places where we have evidence for it. This seems to be the Specier of the IPs in PRO-innitives. These same positions do not seem to allow overt NPs, as the contrast between (185) and (184) indicates. (185) a. * She to eat durian is ill-advised. b. * I wondered how him to eat durian.
We have entertained the hypothesis that what makes the Specier positions of PRO-innitives special in this way has to do with the Case they are assigned. But so far as I can tell, there is no evidence that these Speciers get assigned a special Case, one that would make it different from the Cases assigned to other Speciers of IPs. I suggest we abandon this hypothesis, therefore. So, putting these parts together, we have: (186) a. There is a silent NP: PRO. b. PRO can get a semantic interpretation like that assigned to one: this is called its arbitrary interpretation.
147
3. Positioning Arguments
c. PRO can only appear in the Speciers of PRO-innitives. d. Overt NPs cannot appear in the Speciers of PRO-innitives. e. PRO-innitives can be CPs. Its possible that they are always CPs. Weve also encountered innitival clauses that do not contain a PRO with an arbitrary reading, but instead have a silent subject that refers to the same thing that an argument spoken elsewhere does; (187) is an example. (187) Andre tried to sing. In these cases, the overt NP controls the understood subject of the innitive. We have two ideas about how to characterize these situations. On the rst view, control is a special instance of Argument movement, and so the syntax of (187) is identical to that posited for (188). (188) Andre seemed to sing.
In both cases, Andre moves from the position sing assigns a -role to. In (188) this is the only -role Andre receives, as the verb seem does not assign an external -role. In (187), by contrast, Andre also receives a -role from try, presumably by moving into the position this -role is assigned to. If this is the correct treatment of Control, then our present formulation of the Theta Criterion will have to be changed so that it allows on argument to bear more than one -role. This account of Control will also have to nd a way of capturing the difference between (187) and (189). (189) * Andre was tried to sing.
And, if the Speciers of these innitives are Case-marked, as the evidence from Icelandic suggests, then (187) would seem to be a violation of the prohibition against Argument movement from Case-marked positions. So, this characterization of Control includes the points in (190). (190) If Control is Argument movement, then: a. PRO-innitives are prevented from arising in contexts where Control is obligatory. b. The Theta Criterion requires changing. c. There is an unknown force responsible for the contrast in: i. Andre tried to sing. ii. * Andre was tried to sing.
148
149
3. Positioning Arguments
there are verbs which assign one external -role, and verbs which assign an external and an internal -role and Accusative Case. These are sometimes called intransitive and transitive verbs, respectively; examples are in (192). (192) a. b. Sally slept. Sally likes kiwis. (intransitive) (transitive)
A special kind of transitive verb are exemplied by believe, consider and make, which assign their Accusative Case to something different than the argument they assign an internal -role to. As weve seen, these verbs can take a clausal complement sometimes these are small clauses and assign their Accusative Case to an NP within this clause. (193) a. She believes [IP him to be unhappy ]. b. She considers [AP him happy ]. c. She made [VP him dance ].
We have also seen verbs that have no external -role but do have an internal -role. One of these we have seen in situations such as: (194) Sally appears [IP to like kiwis ]. Sally seems [AP happy ]. And others we have seen formed by the process of passivization: (195) a. b. Sally was considered [AP unhappy ]. Sally was considered .
There is evidence that some verbs which might otherwise look like intransitive verbs fall into this last class too. For instance, when appear c-selects not a clause, as it does in (194), but an NP, this NP surfaces in the nominative Case marked subject position, as shown in (196). (196) A ghost appeared. Burzio (1986), who produces the rst systematic arguments on behalf of these two classes of single argument verbs, uses the terms unergative and ergative to distinguish them. Others, notably David Perlmutter who is the co-discoverer of this distinction,15 have used the term unaccusative for what Burzio calls ergative verbs. Lets use the term intransitive as a label for either type of singleargument verb, with these two sub-classications.
15 See Perlmutter (1978)
150
There are a wide assortment of syntactic phenomena that are sensitive to the distinction between these two sorts of intransitive verbs. We will encounter a few of them in the lectures that follow. In English, one of the phenomena that conrms the picture that there are intransitives that have a single internal argument as well as those that have a single external argument comes from the adjective formation process we briey discussed in the previous chapter. Recall that this rule creates from a verb an adjective whose external -role is the same as that assigned by the verb to its direct object. This process, then, should only be able to apply to verbs that have a direct object -role, and indeed it is blocked for a large range of intransitive verbs as a consequence. (198) a. b. c. d. * the danced man * the ran dog * the slept woman * the cried child
But there are a small class of intransitives which are able to be adjectives by this process, and these seem to be just the ergative or unaccusatives. (199) a. b. the fallen leaves the recently arrived package
In other languages, there is a wider assortment of phenomena that appear to be sensitive to the unaccusative/unergative distinction in intransitives. Lets consider, then, the space of verb types that we might expect to nd and compare that to what we have found so far.
-roles
(200)
If assigning an external -role, an internal -role and assigning Accusative Case are independent properties that verbs have, then this table expresses all the possible ways in which we might expect these properties to combine. As
151
3. Positioning Arguments
can be seen, there are quite a number of verb types that we havent yet seen, but which we should expect to exist. Are there verbs yet to be discovered that ll these various cells? In some cases, the properties combined make for verbs that are rather difcult to discover. Consider, for instance, a verb that assigns an external -role and Accusative Case, but assigns no internal -role (a verb that would t in the cell in the top row, rst column). It will be very difcult to discover verbs of this sort, even if they should exist, because without a complement there will be nothing to bear the Accusative Case that such a verb would assign. The only way to see such a verb would be in cases where we might nd a non-complement to which, or into which, Accusative Case could be assigned. One candidate, perhaps, for this situation are cases such as (201).16
It is likely that the small clause, herself silly, is not a complement to laugh; it does not, in any case, refer to something that is involved in the event, or action, that laugh denotes. If the Accusative Case on herself comes from laugh and where else could it come from? then laugh is a verb of the sort we are in search of. It should be noted that this analysis of (201) is at odds with some of the rest of the grammar we have developed. If herself silly is not -marked by laugh, then the Projection Principle is going to require it to be a sister to the V that laugh projects, as indicated in (202).
16 My thanks to Angelika Kratzer for suggesting that I use the adjective formation rule as a diagnostic for unaccusatives, and for offering this construction as an example of this class of verb. See Carrier and Randall (1992) and Kratzer (2005) for some discussion of this construction.
152
But in this conguration, laugh will not c-command herself, and this is a requirement on Accusative Case assignment. Indeed, if the c-command requirement on Case assignment is correct and the Projection Principles placement of complements is too, then these will conspire to prevent verbs of the sort we are searching from ever being found. If laugh genuinely is such a verb, then these parts of our grammar will need adjustment. This is work for the future. Consider now verbs that assign neither an external nor an internal -role: the class of verbs that would ll the cells of the bottom row in (200). Do these verbs exist? A candidate are verbs such as rain. (203) It rains.
If the it in this example is not an argument, then here is a verb that assigns no -role. What of the other two categories of missing verb? Are there verbs which support no external -role, but do assign an internal -role and Accusative Case? And are there verbs that assign both external and internal -roles, but no Accusative Case? To date, there are no verbs with these properties that have been discovered in English. At present, then, we can update the table in (200) to (204).
-roles
(204)
153
3. Positioning Arguments
Burzio discovered the two gaps in this paradigm where there appear to be no verbs, and formulated generalizations which describe these absences. (205) Burzios Generalization a. If a verb assigns accusative Case, then it assigns an external role. b. If a verb assigns an external -role (and an internal -role?), then it assigns accusative Case. Why havent we found verbs like these? Burzios generalizations might reect a relationship between Accusative Case and -role assignment for which we should nd a source. The second of these generalizations might be derivable from the conditions we have seen on Argument movement. To see this, consider the syntactic frames that our theory would let this verb be inserted into. One of these is (206). (206) [IP should [VP Smith V Jones ]].
If V assigns these two -roles, but no Case to Jones, then there is no way both of these NPs are going to be able to satisfy the Case Filter. There are more NP arguments than there are Cases. So, if such a verb is to survive, the only environment it will be able to appear in are sentences which have two Case marked positions. These two positions will both have to be Speciers, because these are the only positions reachable by Argument movement. Thus, were looking for contexts like: (207) [IP should V1 [XP [VP Smith V2 Jones]], where both positions are Case marked.
Now the rst part of Burzios Generalization tells us that V1 cannot assign Accusative Case. If it did, then it would also assign an external -role, and thats going to bring the count of things that need Case to one more than there are Case marked positions. As a consequence, the Case marked position inside XP is going to also have to get its Case from some place other than V1 . So far, the only ways we have seen for this to be done are if XP is in a CP: (208) [IP should V1 [CP for [IP to [VP Smith V2 Jones ]]]]. [IP should V1 [CP that [IP I0 [VP Smith V2 Jones ]]]] [IP should V1 [CP e [IP to [VP PRO V2 Jones ]]]]
154
155
3. Positioning Arguments
the connection between -role assignment and Case that Burzios Generalization seems to point to lets clear up this matter about I0 . Surprisingly, what we learn about the expression of inectional morphology in I0 lends a hand to these other questions.
156
4
Verb Placement and Features
One problem with our attempt to characterize the grammatical English sentences has to do with the fact that the heads of sentences are often not words. Many nite clauses are headed by bound morphology, as in (1). (1)
NP she I -s V enjoy IP I VP V NP chocolate
We reached this conclusion because it is morphology of this sort agreement/tense morphology that is in the one-to-one relation with the family of strings that we call IP. What we need to understand is how it is that this bound morpheme manages to nd itself expressed on the verb that follows. There is some evidence that in certain cases of this sort, the verb moves to the position our rules put the inectional morphology. The rule responsible is, therefore, sometimes called Verb Movement. Its classic description is found
But when an auxiliary verb is not inected with tense/subject-agreement morphology it can follow these polarity terms. (3) a. b. c. d. Andre must not have eaten. Andre must not be eating. Andre will too/so have eaten. Andre will too/so be eating.
In these contexts we nd that the polarity item must be found immediately before the auxiliary verb. (4) a. b. c. d. * Andre not must have eaten. * Andre not must be eating. * Andre must have not eaten. * Andre must be not eating.
158
Some English speakers may judge (2c,d) grammatical. For these speakers, I suspect that not is being used differently than it is in situations where not appears between the modal and auxiliary, as in (5). (5) a. b. Andre will not have eaten. Andre will not be eating.
In (5), not negates the entire sentence. The speaker of these sentences means to deny the un-negated version of these sentences, i.e. the sentences in (6). (6) a. b. Andre will have eaten. Andre will be eating.
By contrast, the sentences in (2c) and (2d) are positive assertions. The speaker of these sentences means to assert that Andre must have done something namely not eaten or must be doing something namely not eating. This difference between (2c,d) and (5) can be brought out by considering how they combine with so-called question tags. These tags, illustrated in (7), have the opposite polarity as the clause they are appended to. (7) a. b. Andre will eat, wont he? Andre will not eat, will he?
If the polarity of the tag is the same as the clause its appended to, the result has a rhetorical avor: (8) a. b. Andre will eat, will he? Andre will not eat, wont he?
When not stands between the modal and a following auxiliary, adding a positive question tag gives a question of the sort illustrated by (7); cf. (9). (9) Andre will not have eaten, will he (have)? But when not follows both modal and auxiliary, adding a positive question tag yields the rhetorical sorts of questions in (8); cf. (10). (10) Andre will have not eaten, will he (have)?
159
(13) Andre will have not eaten. These data, then, lead us to conclude that polarity items lie between I0 and VP. And from this we can conclude from the contrast between (2) and (3) that verbs are in I0 when they bear the inection that resides there. There is, in other words, a correlation between a verbs inectional class and its syntactic position. That auxiliary verbs have a different position when they are inected is conrmed by the action of rules that affect VP. These rules seem to necessarily strand the verb inected for agr/tense. This is indicated by the following examples of VP deletion and VP topicalization. (14) a. Sam is eating pickles because Mike is . b. Sam should be eating pickles because Mike should . c. * Sam is eating pickles because Mike . d. I claimed that Mary is eating pickles, and [VP eating pickles] she is. e. I claimed that Mary has to be eating pickles and [VP be eating pickles] she has to. f. * I claimed that Mary is eating pickles, and [is eating pickles], she.
If VP Ellipsis is capable of eliding any VP, then the ungrammaticality of (14c) indicates that the nite form of be is not within a VP. Similarly, (14f) indicates that the movement rule responsible for moving a VP at the beginning of a clause cannot relocate a nite form of be. This would follow if nite forms of be are not within the VP that is being moved in these constructions.
160
(16) a.
V have
V IP I I -en V VP V
b.
V be
V IP I I -ing -en VP V V
c.
I to modal
I VP V V YP
YP
YP
In these parses Ive decided to position polarity items as heads that embed the following VP. Moreover, Ive renamed these items, and the phrase they head, Negation. This renaming is common in the literature, as it is typical that only the negative item among the polarity terms is talked about. The representation in which Neg0 embeds VP is argued for in Pollock (1989), and we will consider his reasons soon. There are other possibilities which are consistent with what
161
b.
V have be
V IP I I I -en -ing VP V YP
This gives a straightforward account of the correlation described above. Verbs stand in the positions determined by the inections they bear because they are moved to the positions that the phrase structure rules assign to these afxes. To force Verb Movement in these contexts, we could posit a requirement that an afx must share an X0 with its stem at S-structure. Lasnik (1981), where this idea is proposed, calls this condition the stray afx lter. (19) Stray Afx Filter An afx must co-exist with its stem under a common X0 to be pronounced. To prevent Verb Movement in contexts where the I0 contains a word, such as a modal, we could strengthen the Stray Afx Filter to (20). (20) The Word Criterion Let be an X0 immediately dominated by X. Everything dominates must form one word.
162
be must
An exception to this scheme is encountered when English main verbs are considered. In these situations, the verbs do not occupy the positions determined by the inections they bear, but remain instead in the location given to main verbs by the phrase structure rules. Examples like (22) are ungrammatical. (22) *Andre likes not/too/so apples.
And the processes that indicate whether a verb is in its VP, i.e., VP ellipsis and VP fronting, indicate that nite main verbs are indeed positioned within the VPs they head. (23) a. * Andre ate apples because Jill . b. * I said that Andre ate apples and [VP ate apples ] Andre.
The process that brings inection together with the verb on which it is expressed must have a different outcome when main verbs are involved. The challenge is to nd a way of conceiving of this process so that both kinds of outcomes are possible. There have been a number of attempts to capture this exibility, none of them entirely successful. One account, found in Chomsky (1957), posits a transformation rule that moves the inection down onto the verbal root when main verbs are involved. A sentence like (24), for example, would enter into the derivation in (25). On
163
this view, then, verbs and their inections always combine by way of a movement rule. The difference between auxiliary and main verbs concerns what moves: the verb, in cases involving an auxiliary, and the inection, in all other cases. A problem with this proposal is that it requires movement transformations to be able to move things to non-c-commanding positions. We have seen that for other movement transformations, this does not seem to be possible. Outside of these contexts where main verbs get inected without moving, the same generalization holds with respect to the rule that moves verbs around. It looks like it is worth preserving the generalization that movement rules do not move things to non-c-commanding positions. For the reason, the proposal in Chomsky (1957) has been largely abandoned. A second account lets the mechanism that brings a main verb together with its inection be a modication of how lexical insertion works. Presently, our formulation of lexical insertion matches lexical items to X0 s, and this prevents one word from spanning more than one terminal position in a phrase marker. Bobaljik (1994, 1995) argues that this should be weakened so that a lexical item can be inserted into a position that does not map directly onto a terminal in a phrase marker. In particular, he proposes to let an inected form of a main verb be inserted into a position occupied by two X0 s, just in case those two X0 s are linearly adjacent. Under this proposal, a representation like (26) below could be directly mapped onto the string Andre likes apples.
164
This proposal has the benet of explaining why nite sentences containing polarity items in English are ungrammatical if they do not contain an auxiliary verb. (27) a. b. c. d. * Andre not likes apples. * Jill not ran down the street. * Winnie so read that encyclopedia. * Bart too ran into the street.
Because polarity terms are positioned between I0 and the verb that follows, it will prevent an inectional morpheme residing in I0 from being adjacent to that verb and block lexical insertion from rendering them as a single inected verb. Indeed, this proposal predicts that a nite clause with no auxiliary verbs in it should not allow any material to stand between I0 and the main verb. The only grammatical parse for an example like (28) on the following page should be one in which the adverb is to the left of I0 , as indicated. This is not an obviously correct prediction, however. Contrasts like those in (29) indicate that completely must be placed between I0 and the following VP. (29) a. Jill will completely nish the apple. b. * Jill completely will nish the apple.
This account, then, has difculties making sense of data like (28). Unlike Chomskys proposal, it does address the problem of why the mechanism that inects main verbs is sensitive to the presence of polarity items; but it doesnt explain how these items differ from other material that appears to be able to stand between the main verb and nite inection.
165
(30)
NP Andre
agr/tns
The nal idea we will consider divorces the mechanism that brings a verb and its inection together from the process that moves verbs around. On this view, the relationship between an I0 that is associated with inectional morphology and the verb that expresses that morphology is taken to be akin to the relationship we have posited between an NP and the term that assigns that NP Case. We can think of I0 as assigning the relevant inectional morphology to the verb that follows, as in (88) on page 195. Chomsky (1995), where this proposal can be found, calls the assignment relationship between an inected verb and the I0 where that inection is determined: AGREE. The precise for-
166
Movement to C0
mulation of AGREE will be taken up later; but it should be clear from (88) that it doesnt match perfectly the condition we arrived at for Case assignment in the previous chapter. While the nite I0 does c-command the verb it assigns tense/agreement inection to in (88), it is not klose to that verb: there is more than one phrase that dominates the verb but not I0 . This would block Case assignment. One desirable consequence of the formulation of AGREE is to make it account for the inability of a polarity item to stand between I0 and a main verb. That would be a natural place for this account to attempt an explanation for this fact. If nite I0 assigns inectional morphology to a main verb that follows, then there are a couple ways of characterizing the different outcome with auxiliary verbs. One possibility is that auxiliary verbs are inected in the same way, and that there is an independent reason that they must move to the I0 that assigns them their inection. Another possibility is that auxiliary verbs are prevented from getting their inectional morphology by way of AGREE, and instead make recourse to Verb Movement. Hopefully, a better understanding of what makes an auxiliary verb different from a main verb will help us understand how to express the difference in how their inection inuences their surface position. We can learn something about this question by considering how these processes arise in other languages. Well look at other languages in the sections that follow, and then return to the question of how to give a picture of the main verb/auxiliary verb distinction in English. But rst, there is one last piece to the picture of English verb placement that we should see.
4.2 Movement to C0
The rules weve looked at that relate verbs to I0 can be embellished with another that plays a role in forming questions. One sort of question involves relocating the nite verb to the beginning of the sentence, as in (31). (31) a. b. Have you eaten pickles? Should you eat pickles?
These are called Yes/No questions. Another sort of question which involves a relocation of the nite verb seeks more specic information and is called a WhQuestion. In these questions, as we saw in the previous chapter, there is also movement of a phrase into Specier of CP. Some examples are in (32). (32) a. Which pickles have you eaten?
167
In both of these types of questions, the contents of I0 has moved to some sentence-initial position. As a consequence of this rules targeting I0 , only material which can stand in I0 can move. Therefore, main verbs in English do not undergo this process, and instead the nearly meaningless auxiliary verb do is used. (33) a. * Eat you pickles? * Which pickles eat you? b. Do you eat pickles? Which pickles do you eat?
Where does I0 move to? The common answer to this question, based on work of the Dutch linguists Jan Koster and Hans den Besten,1 is that it is C0 . The reason for this speculation is that the rule involved here seems to be in complementary distribution with complementizers. That is, its effects are blocked in those situations where we believe that complementizers are present in this position. Thus, there is a distinction between (34a) and (34b). (34) a. Have you eaten? b. * I remember (that) have you eaten. c. Which pickles have you eaten? d. * I remember that which pickles have you eaten.
Instead, Yes/No questions that are embedded are marked with a special complementizer: whether. And wh-questions in embedded contexts involve no evidence of a C0 at all. Under this view, the rule involved in these questions, then, might be formulated as in (35). (35) I-to-C Movement Adjoin I0 to C0 . The Word Criterion will correctly block movement of I0 to C0 in those cases where C0 is lled with a complementizer, assuming that a complementizer and the modal or inected verb in I0 cannot together form one word. Okay, so this is, roughly, a sketch of the rules that go into making up our knowledge of this fragment of English syntax. Weve got three rules, one that
1 See den Besten (1983) and Koster (1975).
168
Movement to C0
moves auxiliary verbs to I0 , another than moves I0 to C0 and a third that inects main verbs. The rst and last are driven by the Stray Afx Filter, or some parallel condition on the view that inection is assigned under AGREE, and the second arises in questions. Both are subject to the Word Criterion, which prevents I0 , C0 or V0 from combining whenever they cannot form a word. Theres one last feature of these rules that we should consider. In all the cases so far examined, the V0 that moves to I0 is always the one that is closest to it. It appears that this is not just an accident of the examples we have chosen, instead it looks like it is a necessary feature of the verb movement rule. Consider, for example, how these rules might combine to apply to a representation like (36) below. Suppose that we move be to -en and form thereby the participle form: been. Imagine, further, that the I0 which determines the present participle, the I0 holding -ing, inects the main verb that follows in whatever way it turns out that main verbs inect. All that would then remain is to provide the tns/agree morphology associated with the highest I0 with a stem. The correct outcome is one in which have moves to this I0 . But what would prevent the participle been, residing in the I0 directly beneath have, from moving into this I0 . The outcome would be (36), and this is clearly not possible.
(37)
Theres not even an inectional form for be in which agreement/tense inection combines with the (perfect) participle. Similarly, consider how our rules might combine to apply to the D-structure representation like (38). In this example, we might imagine that, as before, the I0 associated with -ing inects the following main verb and that be moves into the I0 associated with -en and forms the participle: been. Let verb movement also bring have into the I0 associated with tense/agreement morphology to form the nite verb have. Finally, assume that this CP is a question and so triggers the rule that brings I0 into C0 . We might imagine that the I0 holding been could move into C0 ; but this is not grammatical, as (39) indicates.
169
-agr/present V have
(38)
170
There might well be independent reasons for some of these effects. It might be, for instance, that English morphology does not allow participles to inect for tense and agreement. And yet, all of the blocked outcomes t a generalization that it might be worthwhile crediting for the whole class of cases. Travis (1984) has made just such a proposal. She suggests that there is a constraint on movement rules that relocate X0 s that blocks them from moving past other X0 s. She calls this condition the head movement constraint, and I will formulate it as (40).2 (40) The Head Movement Constraint No X0 may move past a Y0 that c-commands it. As we will see, this feature of the grammar of English verb placement can be assigned to Universal Grammar. The Head Movement Constraint seems to govern instances of X0 Movement in other languages.
From this we can conclude, perhaps, that VPs are head-nal. Further, if there are IPs and a method of combining verbs with the inection associated with
2 She has a different formulation which builds in more context sensitivity and relates it to the notion of government that was described in chapter 3. For the cases that we will look at, (40) will do.
171
This seems to suggest that there is a movement rule which relocates nite verbs into the position immediately following the subject. Using the logic of the correlation argument, we might imagine that the position where nite inection is in German immediately follows the subject, and its into this position that nite verbs are driven in German. But this would miss the fact that the position of the nite verbs differs for embedded and independent clauses. What we want is some way of forcing verbs to move into the post-subject position in root clauses only. This suggests that it is not the nite distinction that is responsible for verbs position in root clauses, but something else. Something that distinguishes root from embedded clauses. Weve already seen a similar difference in the grammar of English: recall that I0 -to-C0 movement is restricted to root clauses in English. Perhaps the verbs are moving through I0 into C0 in cases like (42), then. This would credit German and English with two differences. On the one hand there is the difference in headedness that we see most directly in embedded contexts. And then there is something that allows/forces subjects to move past C0 in embedded clauses. We might imagine that this second force, whatever it is, is like the process that moves wh-phrases in English into Specier of C0 . Thus, the examples in (42) might get a parse like that in (43) on the next page. The subject moves into Specier of IP, to satisfy the EPP, and then into Specier of CP. The nite verb follows the subject because it moves through I0 into C0 , as indicated in (44) below.
172
(44)
NP Hans
CP C C I V hat VP V NP V I C VP V IP I
173
das Buch gekauft
The generalization about German word-order can be described this way: any phrase may be initial, but exactly one must be. German is sometimes described as having Verb Second word-order for this reason. The account weve just sketched of this captures Verb Second by way of a rule that moves the verbs that have moved into nite I0 into C0 , and by moving anything, but something, into the single position that exists to the left of C0 . That the verb in these cases has in fact moved into C0 is further substantiated by cases where Verb Second word-order is found in embedded clauses. Though there is considerable dialectal variation here, in the standard dialects,
174
b.
Despite the similarities in verb movement that German and English have, note that one place where they differ is whether main verbs fall under the scope of verb movement. Unlike English, German main verbs can undergo movement. This is seen by their ability to move into C0 in main clauses, as in (42a). There are a host of mysteries about the grammar of verb placement in German that we will not attempt to solve here. What, for instance, is responsible for forcing movement of something into the Specier of CP in German root clauses. And why isnt the movement of a phrase into Specier of CP allowed in embedded clauses, as in (48)? (48) * Ich sagte [CP das Buch [ da [IP Hans [VP gekauft] hat]]]. Vikner (1995) discusses some of the ideas that have been offered for answers to these questions. There are some things that we might notice about the verb movement processes in German which are similar to the parallel processes in English. For example, the Head Movement Constraint is preserved in the grammar of verb placement in German. Sentences like the following, where haben (have) moves past the modal are ungrammatical in German just as they are in English. (49) * Hans haben das Buch gekauft mu John have the book bought must .
Moreover, we can observe that there is a pattern to all these rules in German and English: the moved term always adjoins to another head. Because this appears to be something that is the same across German and English, it is a candidate language universal. Indeed, over a large range of cases examined across
175
We have already found that this is a feature of the Argument Movement rule it also relocates terms only to a position higher in the phrase marker. Actually, we found that the constraint was more specic than this; it required that the moved term relocated to a c-commanding position. Let us formulate this constraint, shared by both German and English Verb Movement rules, as follows. (52) Upwards Constraint can move to position only if c-commands .
That this narrower condition is required for verb movement is shown by cases like (53), which would be possible if verbs could move to non-c-commanding positions.3
3 This parse does not reect the topicalization that weve seen evidence for recall, there is reason to believe that nite subject clause has been topicalized to IP.
176
This too looks like a good candidate for a universal condition: things only move to c-commanding positions. Lets take a look now at what we nd with respect to verb placement in some of the other Germanic languages. In the Scandinavian languages we nd a situation similar to German, as Vikner (1995) reviews. I will use Danish as a guide; much of what we see for Danish is found in Norwegian and Swedish as well.The same sensitivity to embedding is found in the placement of the nite verb in these languages. As the contrast in (54) indicates, Danish is like German in allowing any constituent, but only one, to precede the nite verb in independent clauses. That is, it shares with German the trait of being Verb Second. (54) Brnen har set denne lm kids-the have seen this lm The kids have seen this lm. b. Denne lm bar brnen set. this lm have kids-the seen The kids have seen this lm. c. * Denne lm brnen har set. this lm kids-the have seen The kids have seen this lm. a.
177
Moreover, as the contrasts in (56) show, the placement of the nite verb relative to negation is sensitive to the embedded/non-embedded context. (56) Brnen har ikke set denne lm. kids-the have not seen this lm The kids havent seen this movie. b. * Brnen ikke har set denne lm kids-the not have seen this lm The kids havent seen this movie. c. Jeg ved at brnen ikke har set denne lm I know that kids-the not have seen this lm I know the kids havent seen this movie. d. * Jeg ved at brnen har ikke set denne lm I know that kids-the have not seen this lm I know the kids havent seen this movie. a.
This indicates that the nite verb has moved out of VP, past negation, into C0 in independent clauses, just as it does in German. It looks, therefore, like Danish has the word-order of English placing verbs before their objects but the syntax of verb movement that German has. Note in particular that main verbs are able to move as well as auxiliary verbs, as we see partly in (55) (look at the root clause word-order). If we were to look further, we would see that Danish (and the remainder of Germanic) also have the range of constraints we have seen on verb movement operations. There are a couple differences, however. One at least potential difference between Danish and German is the existence of V0 -to-I0 . Whereas we cannot easily discern whether such an operation exists in German, it apparently does not in Danish (nor does it in standard Norwegian and Swedish), since when the verb has not relocated into C0 , it remains to the right of negation. (Of course,
178
179
A word about negation in French is needed before the contrast in (60) will be interpretable. Unlike English, sentential negation in French is expressed by way of two words, one (ne) appears to be in a position like that we have put inectional morphemes in English. We might imagine that like the contracted form of not in English (nt) it is contracted onto whatever word has moved into I0 (as in (60a)). The second word (pas) is the one that has a distribution more like that of English not. And, as (60) illustrates, the nite form of have (ha) must precede this part of sentential negation. French differs from English (but is like German) in moving main verbs. This can be seen in French by the fact that they are placed to the left of sentential negation when they are nite, as (61) demonstrates. (61) Jean naime pas Marie. John nelove not Mary John doesnt love Mary. b. * Jean ne pas aime Marie. a.
As the contrast between the position of main verbs in (61a) and (60a) indicates, it is the inectional class of the verb that determines its position. That is, just as in English, there is a correlation between the inectional class of the verb and its syntactic position a correlation that is captured by xing the position of inectional morphemes with the phrase structure rules and driving the verbs to these positions with the verb movement operation.
180
b. * John kisses often Mary. This apparent difference in the function of the Projection Principle can actually be seen as a product of verb movement. Since we already know that main verbs in French but not English move into I0 , it makes sense that the main verbs in French, but not English, should be able to be separated from their objects by all sorts of material, including adverbs. But now consider what happens in French non-nite clauses.4 (63) a. Comprendre peine litalien apres cinq ans dtude understand barely the-Italian after ve years of study dnote un manque de don pour les langues. shows a lack of gift for the languages To barely understand Italian after ve years of study shows a lack of talent for languages. Perdre compltement la tte pour les belles tudiantes cest lose completely the head for the pretty students it is dangereux. dangerous To completely lose your head for pretty students is dangerous. ne pas sembler heureux est une condition pour crire ne not seem happy is a condition for writing des romans novels To not seem happy is a (pre?)condition for writing novels.
b.
(64)
a.
181
b.
They may optionally move past negation. We learn two important things from these facts. First, that the contrast between auxiliary and main verbs that seems to distinguish English from German/Danish is actually not a distinction in languages. Instead, it is a difference which is found within a single language: French. That is, this is not a parameter along which languages vary, then, but rather a parameter along which clausetypes vary. We need to express the distinction between English, and these other languages, in terms of the clause types these languages host, not in terms of the targets for their verb movement rules. So, were looking for something that distinguishes English nite clauses and French non-nite clauses on the one hand from French nite clauses on the other. The second lesson of this paradigm is the main point of Pollocks paper. He suggests that the two different positions that main verbs may occupy across nite and non-nite clauses warrants giving a structure like that in (66) to clauses, where the higher I0 is equated with Tense and the lower one with Agreement. This answers to the correlation that appears to hold for main verbs between whether they bear tense morphology or not: in nite clauses they do, and in non-nite clauses they do not.
182
Pollock also introduces the idea that Neg heads a phrase which stands between the two other Inectional phrases, as indicated in this pares. There are several reasons for wanting to treat negation differently from other adverbials. One is that the syntactic position of negation is comparatively rigid when compared to other adverbs. Another is that only negation blocks the process that inects main verbs in English, as we saw earlier; other adverbs dont. Pollock suggests distinguishing negation from adverbs structurally, and then making reference to this structural difference to capture these ways in which negation behaves uniquely. I wont examine in detail how this idea works, partly because we will eventually go in a slightly different direction than does Pollock. But let us nonetheless adopt at least as an intermediary hypothesis the thesis that negation does head a phrase as shown in (66). It should be noted, however, that this introduces a problem: how is it that Agr0 can move to T0 past negation without violating the Head Movement Constraint. Pollock offers a solution to this problem that I will come to soon. Note that though the difference between main and auxiliary verbs in French non-nite clauses that weve just reviewed speaks on behalf of two head positions to which verbs may move, it doesnt really indicate what the identity of these two head positions might be. While most of Pollocks proposal involves examining how the hypothesis that there are two inectional positions within a sentence can be used to explain the differences in verb position across clause types, he also assigns values to these inectional phrases. But, in fact, it is extremely difcult to tell what the value of these heads is. Pollock decides in favor of giving the higher I0 the value of Tense and the lower one the value of Agree-
183
Note that in each of these cases, the morpheme which encodes tense information precedes that which encodes agreement. Is there a way of determine the syntactic arrangement of inectional terms from their morphological arrangement? Mark Baker has made famous an argument for thinking that the answer to this question is yes. Following proposals by Pieter Muysken and Donna Gerdts, Baker argues that there are correlations between the order that inectional afxes have relative to the verbal stem and the order of the syntactic operations that these afxes encode. Thus, for example, in Chamorro, the passive morpheme necessarily comes closer to the verbal stem than does the subject agreement afx, as (68) shows.6 (68) Para.u.fan - s - in - aolak l famguun gi as tat-n-niha irr.3pS - pl - pass - spank the children obl father.their The children are going to be spanked by their father.
This ordering on afxation mimics the order in which the syntactic operations of Passive and subject agreement take place the passive must bring the underlying object into the subject relation before the agreement process can apply.
5 The argument Im reporting here comes from Belletti (1990). 6 These data come from Gibson (1980). The principle formulated in Baker (1985) has its roots in many earlier works, among which are: Muysken (1979, 1981) and Gerdts (1981).
184
S-structure:
Each metarule brings about the relevant syntactic operation and adds to the verbal root the corresponding afx. Note in particular that this picture correlates syntactic operations with afxes; the correspondences that Baker summarizes with the Mirror Principle only concern the relation between afxal orderings and the orderings of syntactic operations. Indeed, Bakers argument leads to the conclusion that the metarules in (70) necessarily involve afx-syntactic operation pairs. It is the order of afxes that correlates with the relevant syntactic operations, and not some more abstract information, such as morphological features or the like. We shall have an opportunity to revisit this point. Now if Bakers conclusions from the Mirror Principle are imported into the domain of Verb Movement, then the arrangement of morphemes in (67) suggests that the higher of Pollocks positions should be associated with agreement morphology, and the lower with tense morphology. Then, the fact that tense morphology comes closer to the verbal stem will follow from the Head Movement Constraint. So we should adopt, perhaps, a picture like that in (71) below.
185
We can reach this conclusion, perhaps, through a different route. If we compare Icelandic with Danish, we see a difference in verb placement that suggests that only Icelandic has movement of verbs into the I0 that stands above negation. In embedded Icelandic clauses the nite verb must precede negation, as in (72). (72) a. * g spuri af hverju Helgi ekki hefi lesi essa bk I asked whether Helgi not had read this book I asked whether Helgi hadnt read this book. b. g spuri af hverju Helgi hefi ekki lesi essa bk I asked whether Helgi had not read this book I asked whether Helgi hadnt read the book.
But as we have seen, Danish nite verbs cannot precede negation. Another example illustrating this fact is (73). (73) Jeg spurgte hvorfor Peter ikke havde lst den. I asked why Peter not had read it I asked why Peter hadnt read it. b. * Jeg spurgte hvorfor Peter havde ikke lst den. I asked why Peter had not read it I asked why Peter hadnt read it. a.
If we assume that the position of negation is constant across these languages, then this indicates that the verb has moved past this position in Icelandic, but not Danish. Now, interestingly, Icelandic has a full paradigm of subject agreement, but Danish has no subject agreement. Comparing Icelandic with Danish,
186
187
Note that we cannot determine whether nite T0 is strong or weak in French because verbs are always driven up to Agr0 in this context. Turning now to Scandinavian, we have something like: (77) Danish, Norwegian and Swedish: a. Finite: Tense: weak Agreement: not there b. Innitive: havent looked yet (it varies across the group, as it turns out)
188
On this view, then, when T0 moves to Agr0 it doesnt have to skip Neg0 , it can move through this position and pick up the head (if there is one) on the way. In a nite French clause, then, the surface parse might look roughly as in (80) on the next page. As this parse indicates, ne gets carried to the position before
189
V -inf etre
pas by the tensed verb as it moves into Agr0 .8 Note that we have to let something force the bound morphemes, that is, the tense and agreement sufxes, to linearize to the right of the term that has adjoined to them, but force ne to linearize to the left of the term that has adjoined to it.
8 The S-structure in (80) is derived as follows (ignoring the movement of the subject). The main verb, aime, is moved into T0 , where it combines with the tense morpheme. The resulting tensed verb (that is, T0 ) moves and adjoins to ne. the resulting negated verb (that is, Neg0 ) moves into Agr0 , where the agreement morpheme combines with the verb. This surface parse, then, claims that negation ne is closer to the verbal stem than is the agreement morpheme, and this is somewhat surprising. We must also understand how to get the right linear order of ne and the inected verb out of this representation.
190
191
rection. He suggests that we think of main verbs as differing from auxiliary verbs with respect to their ability to assign a -role. In particular, he suggests that we dene main verbs as ones that have a -role to assign, and auxiliary verbs as ones that dont. He then crafts a proposal that would prevent verbs from assigning -role from a position associated with a weak inectional head. And this will essentially prevent main verbs from both moving to and moving through the positions we have assigned the weak value. His idea goes something like this: (83) a. b. c. Assume that movement operations leave a trace of the moved X0 or XP in the position that the term is moved from. Let the Theta Criterion be enforced at S-structure. Let the trace of a verb be capable of assigning the verbs -role only if the verb has not adjoined to a weak X0 .
There are some immediate problems with this idea that Pollock himself addresses. It wouldnt seem able to account for the British (85), nor for the
192
Why doesnt the be in (84a) have the same number of -roles as (84b)? And it sure looks like has in (85) is capable of moving past negation, though it appears equally like it assigns -role. Pollock suggests that the -role assigned in (85) comes not from the verb, but from some hidden term. Something similar might be attempted in distinguishing the cases in (84). But there is another kind of problem, specically for the prediction that movement through a weak X0 is unavailable to main verbs, which seems to me insurmountable. Danish, as we have seen, and the other Mainland Scandinavian languages, show Verb Second word-order; (86) is an example, for instance, where the main verb has relocated to C0 position. (86) Kbte han bogen bought he book-the Did he buy the book?
But we have also just discovered that Danish has weak tense morphology. Thus, if we preserve the Head Movement Constraint, this entails that Danish main verbs are capable of moving into T0 and from there moving into C0 . If Danish T0 is weak, this means, in turn, that moving through a weak X0 is possible for main verbs. For this reason, I will interpret the weak/strong typology to determine only whether a main verb may surface in the relevant X0 . This seems to be the way these values are interpreted most commonly in the literature. It is, in fact, on this basis that I gave the assignments of weak and strong to Agr0 and T0 in English, French and the Scandinavian languages in (74), (75), (77) and (78). In these assignments, then, I have given a weak value to those X0 positions for which there is evidence that main verbs cannot surface, and to those X0 positions where there is evidence that main verbs can surface I have given the strong value. I have also indicated certain positions namely, French and English innitival Agr0 as empty positions. This is my, somewhat idiosyncratic, way of expressing the optionality of movement to these spots. Because they have no morpheme in them, these innitival Agr0 s do not cause the Word Criterion force a verb stem to move into this position. But, because it is present, it is still a possible landing site for X0 movement. Its in this respect, then, that
193
4.5 AGREE
Weve developed a rough description of how main and auxiliary verbs are driven into I0 s that uses the strong/weak distinction. We should now look at how this matter interacts with the way in which inectional heads assign their morphology to the relevant verbs. The conception we have adopted of how verbs inect is that it parallels the kind of relation that we have seen in connection with Case. The tense and agreement heads assign the relevant inection to the verb in congurations like (88).
9 This is going to have certain consequences for the syntax of a sentence in which the verb has moved into C0 . In these cases questions and, in the Germanic languages other than English, root nite clauses we must imagine that there is a strong morpheme in C0 .
194
AGREE
(88)
NP they Agr AgrP Agr TP T T VP V V ate NP nattoo
When auxiliary verbs are involved, the verb assigned inection moves to the heads responsible for that inection, as in (89) on the next page. The syntactic results of inecting the verbs in these two cases is obviously different, and so a reasonable rst question is: could the method of inecting auxiliary and main verbs be different? Could it, for instance, be that main verbs get inected by assignment while auxiliary verbs get inected by moving? This direction looks unlikely. The conditions we have placed on verb movement are mirrored by conditions on the assignment relationship that holds in the case of main verbs. Movement of V0 and I0 can move the term to just ccommanding positions, and never past a c-commanding X0 . And, similarly, the assignment relation always goes from an I0 (or C0 ) to a term that it ccommands, and it never crosses a c-commanding X0 . This is indicated by the fact that the parse in (90) cannot produce the outcome in (91). This would be possible if the Agr and T heads of the root clause were able to assign their inection to the verb that heads the embedded VP. This would be prevented if these heads are prevented from assigning their inection past the closer head: make. This is quite general: cases where an I0 assigns the inection it determines on a head that is embedded below another possible target for its inection do not seem to arise. This looks completely parallel to the Head Movement Constraint, which proscribes movement of a head past a c-commanding head.
195
That inectional heads assign the inection they determine only to verbs they c-command can be appreciated by considering examples like (92) on page 199. The grammatical outcome of this structure is one in which the inectional head determining the sufx ing assigns that inection to the verb it c-commands: ski. What cant happen is for this I0 to assign this inection to remember, a verb it does not c-command, but is otherwise a good candidate for being inected. (92) does not lead to the outcome in (93). (93) * Bart should remembering Jill ski.
In general, examples of this sort are not found. If a verb bears inection that is associated with an I0 , that I0 c-commands the verb. Note that this example confronts us with a question that our move to the Pollockian framework has produced. The root clause in (92) contains a modal, and this is a word that we have mapped onto the position that heads sentences. But we now have two heads, and two consequent phrases, that sentences are made from: which of these two should we map modals onto? I have decided to
196
AGREE
(90)
NP Jerry Agr AgrP Agr TP V T VP V V make NP Sally VP VP V V eat
(91)
put the modal in the Agr position for two reasons. First, this correctly places modals to the left of not. If NegP lies between AgrP and TP, then we do not want modals to be positioned in T0 , as that would place them to the right of negation. And, second, some modals appear to have alternations which are suggestive of tense. The forms in (94) could be seen as present and past tense forms of these modals. (94) a. shall should b. will would While there is never any agreement morphology associated with modals in English, there might be tense morphology. Its not clear to me how to get these inected forms from the representation in (92), but perhaps the morphology in T0 is capable of moving to the modal in such examples.
197
We should endeavor to capture these parallel constraints on the ways that main and auxiliary verbs are inected in English. It is not clear that can be done if the means by which they get their inection is different. If, on the other hand, the process is the same in both venues, then we can place these constraints on that process and in this way explain why they are shared. That appears to be the direction presently taken. The c-command requirement and the Head Movement Constraint are presently formulated as constraints on the movement relation. That is where they were originally discovered and that is how they were initially framed, as a result. But we can now see that there is reason to rethink this decision. Main verbs do not move, and yet the Head Movement Constraint and the c-command condition nonetheless hold between a main verb and the I0 its inection derives from. We want the Head Movement Constraint and the c-command conditions to hold,
198
AGREE
therefore, on the inection process itself. We then want those constraints to be reected in how the inected verbs moves to the I0 that determines their inection. I will sketch a proposal here that puts together some of the ideas in chapters 3 and 4 of Chomsky (1995). It is not too faithful a reproduction of those ideas, however. First, let us formulate the condition under which inectional heads assign verbal inection. This relation is known as AGREE. We begin by expressing the inectional morphology involved in terms of features. Let verbs come with features that will determine how their morphological form is determined, and let Agr0 and T0 also come with those features. This allows us to express the assignment of inectional morphology in terms of matching the relevant features on a verb with the features on Agr0 and T0 . The AGREE relation species under what conditions this matching of features occurs.10
(95) Let X0 and Y0 be terms that carry features, and let the features on X0 have a value and those on Y0 be unvalued. The features on Y0 can get the value of the matching features on X0 just in case Y0 agrees with X0 . X0 and Y0 agree iff: a. X0 c-commands Y0 , and b. there is no Z0 such that X0 c-commands Z0 and Z0 c-commands Y0 .
10 The system I will lay out here is modeled after Adger (2003).
199
The verb in this example has features for agreement and tense which, when they are given values, will let the process of lexical insertion know which lexical item to put into this position. Similarly, the T0 position is occupied by something that has features for agreement and tense, and one of those features is also given a value, here: past. On this view, T0 is something that contains an agreement feature; it may not be obvious why tense should be so equipped. It will be necessary to have T0 be the place from which the agreement feature on the following verb is valued, so there is a theory internal reason for making this claim. It would be good to have an independent criterion upon which to make this claim. Perhaps the fact that the tense/agreement morpheme in English is a portmanteau could be related. Finally, Agr0 is equipped with an agreement feature, and it has the value 3rd , singular. Because Agr0 c-commands T0 , and there is no head between them, the value for the agreement feature on Agr0 can value the agreement feature on T0 . This produces (97).
200
AGREE
(97)
NP they Agr AgrP Agr TP T VP V V eat agr: tense: NP nattoo
Both of the unvalued features on the verb eat can now be valued by the features on T0 . This is because T0 c-commands eat and there is no head between them; they are therefore able to AGREE. This produces (98) on the next page. The values of inectional features are assigned down through the heads, and verbs then express those collected feature values morphologically. We can now model the disruptive effect that NegP has on inecting main verbs in terms of the Head Movement Constraint, which is now expressed as a condition on AGREE. If we assume that the head of NegP does not come with the tense or agreement features, a sentence with NegP would get a representation like (99).
201
(99)
NP they
AgrP Agr Agr NegP AdvP not Neg Neg TP T T tense: past agr: V tense: agr: VP V NP natto
202
AGREE
That Neg0 does not come with agreement or tense features could be linked to the fact that negation does not express those features: English negation does not come inected for tense or agreement. The values on the agreement feature that Agr0 has cannot be given to the unvalued agreement feature on T0 because Neg0 is a head that intervenes. As a consequence, the agreement feature on the verb will also not be valued. The ungrammaticality of (99), then, can be credited to the inability of this system to inect the verb with both tense and agreement when NegP is present. To make explicit the requirement that values of the features on T0 and Agr0 get expressed on a verb, lets adopt (100). (100) The Stray Feature Filter The values of an inectional feature must be morphologically expressed. In English, values for tense and agreement features are expressed on verbs. The ungrammaticality of (99), then, comes from the fact that the value for the agreement feature is not found on a term that can express that value: it is not assigned to a verb, and only verbs in English are equipped with the morphology to express agreement. The condition in (100) is just a restatement of Lasnik (1981)s Stray Afx lter. His condition said that the bound morphology placed in an inectional head could not be expressed without the aid of a verbal stem in that very same inectional head. The Stray Feature Filter does nothing more than translate that into a framework which expresses inectional morphology with features; (100) says that the inectional feature values carried by some inectional head must nd a way of being assigned to a verbal stem. Ive named it the Stray Feature Filter to remind us of this connection. Lets now consider how this system will work with auxiliary verbs. A simple clause will have the structure in (101). The chain of AGREEment relations that we saw in the main verb case will apply here too, producing the representation in (102).
203
(102)
NP they
204
AGREE
This representation satises all of our present conditions and so should lead to a grammatical string. As we have seen, however, this isnt what we want. Auxiliary verbs surface in Agr0 in these contexts. There must be some property that auxiliary verbs have that prevents them from surfacing in their D-structure position when they are inected. I dont know what this property is, but it is possible, nonetheless, to characterize the consequences it has. It requires the auxiliary to adjoin to the inectional heads which are responsible for giving values to the features the auxiliary contains. If we call this mysterious property: , then we can state this requirement with (103). (103) If a lexical item has , then it must be pronounced adjoined to the heads whose features it is in an AGREE relation with. Auxiliary verbs have , and so representations like (102) will not be well-formed S-structures. Instead, the head movement operation must kick in to produce (104). (104)
NP they T V
have
AgrP Agr Agr Agr T agr: 3rd ,plur tense: past TP T VP V IP eaten nattoo
The fact that auxiliary verbs differ from main verbs in needing to move is now coded in the property. We should hope to nd what the property is, and it seems likely that it has something to do with how the inectional morphology is expressed since it is a requirement that makes reference to the features expressed by that morphology. One possibility would be to let auxiliary verbs differ from main verbs in the kinds of features they come with. If, for instance, auxiliary verbs are not equipped with the tense and agreement features, but
205
206
AGREE
inection.12 He proposes the constraint in (105). (105) Earliness Let U be a xed D-structure representation and D = {D , D 1 , . . . , D n } be all the grammatical derivations from U to some representation that satises the conditions on well-formed S-structures. Every D i D is ungrammatical if |D i | is greater than some other |D | in D , where |D | is the number of movement rules in D . What Earliness requires is that derivations from D-structure to S-structure minimize the number of movement operations. If it is possible to satisfy the constraints on S-structure without moving, than movement is blocked. This is precisely what happens in the inection of main verbs. They are able to satisfy the Stray Feature Filter by way of AGREE, and this makes movement of main verbs to T0 or Agr0 superuous. This is a popular account of the failure of main verb movement, but it is not entirely successful. This becomes apparent when the cases in which an auxiliary verb combines with NegP are considered. In such cases, we have the Dstructure representation in (106) on the facing page. As we have seen in the case of main verbs, AGREE is prevented from giving the value that Agr0 has for the agreement feature to either T0 or the verbs beneath T0 . Just as in the case with main verbs, then, this representation does not satisfy the Stray Feature Filter. Instead, as we know, the auxiliary verb moves until it surfaces in the Agr0 position. It seems reasonable to assume that its the ability of the auxiliary verb to move that allows it to get inected. We want movement and AGREE to have different outcomes in this circumstance. The auxiliary is close enough to T0 to get its tense feature valued. If it moves to T0 , and the resulting complex moves to Neg0 , then the representation in (107) will be produced.
12 The proposal comes from the unpublished Pesetsky (1989), which can be found at his website. This is not his formulation of the condition, but I think it expresses his idea.
207
(107)
NP they
AgrP Agr Agr agr: 3rd ,plur AdvP not T V T Neg NegP
agr: tense:
208
AGREE
In this representation, have, and the T0 it has joined with, are both part of Neg0 . Perhaps be virtue of being part of Neg0 , have and T0 are now no longer separated from Agr0 by another head. If so, the agreement features they contain can be valued by Agr0 . This would satisfy the Stray Feature Filter. In order to satisfy the property, head movement would move Neg0 into Agr0 , producing (108) below. This is the desired outcome. (108)
NP they Neg T V T Neg Agr Agr agr: 3rd ,plur AgrP Agr NegP AdvP not Neg TP T VP V IP eaten nattoo
But if this is how auxiliary verbs manage to overcome the disruptive effect that NegP has on getting inected, then Earliness will not be sufcient to prevent main verbs from making use of the same method of getting inected when NegP is present. In fact, the point is more general than the particular proposal Ive made here. Irrespective of details, if its movement that allows auxiliary verbs to get inected when NegP is present, then Earliness will not prevent main verbs from moving when NegP is present. Earliness will prevent main verbs from moving when there is no NegP. But it cannot block derivations that are otherwise legitimate if they are the only ones that give rise to a well-formed S-structure; and that is precisely the case for a derivation which would move a main verb through Neg0 to Agr0 . Something must be added to
209
b.
Consider the representation given to (110a), in (111) on the following page. I have given T0 the property, as this head forces head movement of both auxiliary and main verbs to it. (Recall: we saw that main verbs in French innitival clauses move to T0 and are thereby separated from their NP complements by adverbs.) In general, we can characterize those inectional heads that allow main and auxiliary verbs to move to them, i.e., those heads that are described as being strong, as having the property. Consider now the status of Agr0 in this representation. I suggested in the previous section that this head is not associated with inectional morphology. Such heads, I suggested, could serve as the site of head movement but did not force head movement. For this reason, auxiliary verbs could optionally move to Agr0 in innitival clauses, and when this option is taken, (110b) arises. But this account is incompatible with Earliness, which would block the needless movement required to derive the word order in (110b). Lets consider how we might characterize these cases in a system of inection assignment that includes Earliness.
210
AGREE
(111)
PRO Agr AgrP Agr NegP Neg TP T T V tre tense: ir
T
tense: ir
First: if Agr0 has feature values that need to be assigned, then we should expect the presence of NegP to block assigning those values to heads beneath Neg0 . Therefore, when no verb moves to Agr0 , the present system requires that Agr0 be associated with no verbal morphology. And, as noted, in that case movement to Agr0 will be prevented by Earliness. The sentence in (110a) is therefore consistent with our present system under the assumption that Agr0 has no feature values to assign. But the outcome in (110b) requires a different set of assumptions about the status of Agr0 . In that circumstance, Agr0 must be associated with verbal morphology. Because the inectional morphology in these two scenarios remains the same, lets assume that in the cases where a verb moves to Agr0 , innitival morphology is associated with Agr0 and not T0 . Under the assumption that the property is associated with particular morphologies, we might speculate that when the innitival morphology resides in Agr0 it gives Agr0 the property. A way of characterizing (110b) in the theory of this section gives it the
211
AgrP Agr NegP AdvP pas Neg TP T VP V VP heureux est une condition pour crire des romans
tense: ir
representation in (112) below.13 The property requires that a verb expressing the value the tense feature be adjoined to Agr0 , and this results in the auxiliary verb moving through Neg0 to Agr0 . Since this is the only derivation in which the property gets its requirements satised, Earliness permits it. In these situations, when there is no auxiliary verb, and instead a main verb expresses the value of the tense feature, there will be no grammatical outcome. This is because main verbs, as weve seen, are incapable of moving through Neg0. To characterize optional movement, then, a system which embraces Earliness requires that there be two possible representations: one that forces movement and another in which it is blocked. Under the particulars of the account I am sketching here, that would mean that French speakers who allow the alternation in (110) allow the valued tense feature to be generated in either Agr0
13 Note that this representation does not linearize ne relative to tre correctly. We will continue to abstract away from the issue of how to match a linearized form to the complexes created by Head Movement.
212
Rather than seeing (113b) as being derived by movement from (113a), we could see this alternation as a reection of the ambiguous status of to. Perhaps English speakers allow to to stand in either the T0 or Agr0 positions; this would give the two word-orders in (113). If these methods for characterizing the exible position of certain heads can be conrmed, then we can continue to entertain the hypothesis that Earliness exists, and rely on its consequences to block main verbs from moving in English (non-negated) sentences. In any case, lets assume that verbs inect by way of the AGREE relation, and that the Head Movement Constraint is a constraint on this relation and not movement. As weve seen, in order to spread this constraint on AGREE to cases where head movement arises, we need a method of explaining how head movement is tied to AGREE. Head movement needs to be seen as one outcome of the AGREE relation, and therefore an outcome that inherits the constraints on AGREE. Ive sketched a way of doing that in this section, but its clear that this is not yet a full account. It relies on nding a source for the property, and for understanding what prevents main verbs from moving through Neg0 into higher heads.
213
214
215
216
(Only the Case features are indicated in (121).) The NP bears an unvalued Case feature, a feature that must be valued before the Feature Valuation Requirement will be satised. In its D-structure position, it is too far from Agr0 for AGREE to hold, and so it cannot get its value from Agr0 . If it moves to Specier of TP, however, it will then be klose enough for AGREE to hold, as (122) shows.
217
(122)
AgrP Agr Agr Case: nom nom TP NP Case: nom T Wayne V V run T VP
This representation satises the Feature Valuation Requirement (the former Case Filter), but it does not yet satisfy the EPP, which requires further movement of Wayne to Specier of AgrP. The result is (123) below, which corresponds to the grammatical S-structure outcome for this sentence.
218
This reworking of the conditions under which nominative Case assignment happens claims that nominative Case is assigned to a position lower than where
219
Case: nom
V seem
P Case: acc to
the NP bearing that Case surfaces. There are examples which can be construed as evidence for this claim. In these examples, the nominative Case marked NP surfaces in a position c-commanded by the Case-assigning Agr0 , and the EPP is satised with there. One of these examples is (127). (127) There should be a solution in this chapter. It doesnt seem that there in this example has the same meaning that the locative there has in (128). (128) A solution stands there.
Indeed, the distribution of the there that is found in (127) is what is expected if it is an expletive. Just as in the scenarios in which the expletive it appears, we might conjecture that the EPP (or property) is satised by virtue of there
220
If this is the right interpretation of these examples, then it conrms the hypothesis that nominative Case assignment occurs under c-command, just as accusative Case assignment does. But it presents a host of problems for the particular proposals we have arrived at. If there satises the EPP in this example, then collapsing the EPP and the property becomes difcult. Further, the distance between Agr0 and the NP it Case marks in (129) does not t to our denition of klose: there is clearly more than one phrase that dominates a solution but not Agr0 . To t this analysis to our system would require a different denition of klose, then. Moreover, examples of this type are governed by a set of constraints that remain little understood. The semantic type of the nominative Case marked NP matters, for instance. That argument must be an indenite of a certain type, as the contrasts in (130) illustrate.
221
And the kind of predicate involved also matters. While PPs support this construction, APs and NPs do not. (131) a. * There is a man a nk. compare: A man is a nk. b. * There is a man happy. compare: A man is happy.
VPs support the construction, but for many verbs, only when they are embedded under be. (132) a. There was a woman given a book. b. There was a woman drinking milk. c. * There has a woman given me a book. d. * There has a woman drunk milk. e. * There a woman drank milk. a. b. There arrived a woman. There appeared an elf.
The exception are some (but not all) unaccusative verbs, as in (133). (133)
And in these cases, as can be seen, the nominative Case marked NP remains in its -marked object position.14
14 For some discussion of the constraints on the semantic type of the NP, see Milsark (1974), Sar (1985, 1987), Heim (1987) and Diesing (1992). For discussion of the constraints on the
222
There are languages in which Nominative Case-marked NPs surface in S-structure positions that, like those weve just seen in English, are c-commanded by Agr0 . In these languages, interestingly, there is no expletive like there involved. Spanish, Italian and Greek are such languages. Examples from Spanish and Greek are in (134). (134) a. Leyo Juan el libro. read John the book John read the book. b. Pandreftike o Petros tin Ilektra. married Peter Ilektra Peter married Electra. (Alexiadou and Anagnostopoulou 1998, (3c,d): 492)
We might venture a parse for (134b) like that in (135) on the next page. These examples pose the same problem for our denition of klose that the ones in English did, but in these cases there appears to be no problem for expressing the EPP with the property. That no expletive is required in the Specier of AgrP in these examples suggests that these are languages that have no EPP. Note that in (135), the main verb has moved to Agr0 , thereby putting it in a position that the linearization principles of Greek will cause it to precede the nominative Case marked NP. This is a very common trait of languages that allow their nominative NPs to remain low like Greek that those NPs follow the verb (when they are not moved by other processes like wh-movement). In particular, the language properties in (136) seem to hold. (136) If a language has (136a)(136b), then its nominative NPs will follow the nite verb.
predicates, see Levin (1993). There is a large, and heterogeneous, literature that addresses the problem of assimilating the way there satises the EPP to a more general theory of the EPP. Among many others, see Chomsky (1995), Lasnik (1992, 1995), Moro (1997), Chomsky (1998, 2000, 2001), Bokovi c (2002b,a), Groat (1999, 1995), Hazout (2004), and Epstein, Pires, and Seely (2005). A good introduction to the language variation in expletives is Vikner (1995).
223
a. Allows nominative NPs to surface c-commanded by Agr0 . b. Linearizes VPs and the heads that verbs move to so that they precede their complements c. Has no expletive These languages are said to display subject inversion. Perlmutter (1971) and Rizzi (1982) proposed that subject inversion is always found in languages that dont have the EPP, and are verb initial. One inuential attempt to explain this correlation is in Alexiadou and Anagnostopoulou (1998, 1999). They argue that, in fact, the EPP does hold in these languages, but that it is satised by moving the verb instead of the nominative Case-marked NP. For that to make sense under an interpretation of the EPP that collapses it with the property, we cannot express the EPP as a property of Case. Verbs do not carry morphology that expresses Case features, and so they will not form an AGREE relation with the Case feature associated with Agr0 . If the EPP is the attribute associated with Agr0 s Case feature, then our present formulation of what the attribute requires would not be satised by moving the verb to Agr0 . For the proposal in Alexiadou and Anagnostopoulou (1998) to make sense, we will have to see the attribute as being associated with some feature that both the verb and the nominative Case marked NP express. That feature, as it turns
224
Indeed, all other verbs merely distinguish the 3rd singular forms from all others. But the existence of the paradigm in (137) indicates that both person and number features are present in the agreement morphology of English. We should therefore decompose the agreement feature into number and person features and let AGREE x the values of these features in a way that involves not just Agr0 and the verb, but the nominative Case-marked NP as well. Well take up that task in the next section. The proposal in Alexiadou and Anagnostopoulou (1998), then, requires that it be the agreement feature that has the property responsible for manufacturing EPP effects. Schematically, then, the relevant verb initial languages will have a D-structure representation like that given schematically in (138) below. In one class of language, the requirements of the attribute are satised by Argument moving the NP into Specier of AgrP. This produces the typical word-order in English sentences, illustrated by (139).
225
XP
(139)
NP person: v number: w
AgrP Agr Agr person: v number: w Case: nom T person: v number: w tense: y TP T VP V V person: v number: w tense: y XP
In the other class of language, the verbs movement satises the property on the person and number features, resulting in an S-structure like (140). As can be seen, the correlative properties of high verb and low nominative NP are captured.
226
TP T VP NP person: v number: w V XP
The cases in English involving expletive there seem to form a special case. Like the languages with the syntax in (140), the movement of the nominative Case marked NP is not required by the property. But unlike the languages in which this is generally possible, the movement of verb does not satisfy the property; instead, the presence of there does. We will not examine how there has this ability, nor will we try to unpack the causes for the constraints that govern this word-order in English. With these problems bracketed, however, we can see the there-construction in English as a special instance of the syntax that gives subject inversion elsewhere. As noted above, this means that we will have to abandon our present formulation of klose, the locality condition on feature valuation when phrases are involved. We designed klose so that it would allow accusative Case assignment in (141), but not in (142). (141)
V Case: acc acc believe NP Case: it obvious V AP A A
227
We cannot dene klose in such a way that it legislates against too many phrases between the Case assigner and the NP that gets assigned Case, as we did originally. One direction that some have taken is to make reference to the geometries of the phrase markers. In the good examples, both (143) and (141), the NP is embedded in phrases that are either complements or X0 s. In the bad example, by contrast, the NP is embedded within a phrase that stands in Specier position. A wide array of movement, and other, relations seem to be blocked
228
It appears that there may be something quite general about phrases in Specier positions that makes them incapable of engaging in syntactic relations outside those phrases. Ross (1967), who charted out many such cases, calls these regions islands.15 One hypothesis about the ungrammaticality of (142), then, is that this has nothing to do with proprietary constraints on feature assignment but is the result of a global constraint on establishing relations into Speciers. Let us adopt that idea. This means that we have not found any locality condition on the assignment of features to phrases. We revise AGREE to (145), then. (145) Let X and Y be terms that carry features, and let the features on X have a value and those on Y be unvalued. The features on Y can get the value of the matching features on X just in case Y agrees with X. X and Y agree iff: a. X c-commands Y, and b. If Y is a head, there is no Z0 such that X c-commands Z0 and Z0 c-commands Y0 . Lets now turn to unpack agreement into the person and number features weve seen them consist in, and work out how AGREE can match the values of these features on Agr0 , a verb and an NP.
229
Lets consider now how AGREE would cause the values for the person feature (v in (146)) and the number feature (w in (146)) to be xed. It seems natural to think of these features as coming valued on the NP involved in the relation. These features have a role in the semantic interpretation of the NPs, though they dont obviously have any semantic role on Agr0 or the verb. For this reason, we should take the D-structure representation to be as (147) below, which maps onto the sentence she runs, indicates. Under our present denition of AGREE, values are passed from one item, , to another, , only if c-commands . This means that the person and number values on NP cannot be assigned to Agr0 in (147), though it can assign those features directly to the verb. This denition of AGREE, then, is not compatible with there being person and number features on Agr0 if those features come valued on NPs. But if we do not put person and number features on Agr0 , then we will no longer be able to give these features the attribute and employ the account for the subject inversion typology that we reviewed in the previous section. In fact, there is another reason to think that Agr0 has person and number features associated with it. There is a close connection between the features that gure in agreement in English: person and number and those that are responsible for Case. Features that can enter into agreement relations are called features, and in addition to person and number, they also include gender and, perhaps, word class markers like those that play a role in Bantu
230
languages and elsewhere. It is very common cross-linguistically that the NPs which bear nominative Case also agree in features with the verb. Moreover, the distribution of agreement and nominative Case within, and across, languages is very similar. Within English we see this in the fact that those NPs which bear (overt) nominative Case do so in the very clauses where they trigger (overt) agreement. Nominative Case marked NPs are exclusively found as subjects of nite clauses, and this is where agreement triggering NPs are found as well. This is extremely common: in those languages that have both nominative Case assignment and agreement, it is the nominative Case marked NP that enters into agreement. This correlation is also reected in those languages that have only nominative Case assignment or agreement. Just as in English, the nominative Case, or agreement, targets certain NPs in nite clauses but not in other reduced clauses. This relationship is found with respect to accusative Case as well. There is no agreement associated with accusative Case-marked NPs in English, but in many languages there is. This so-called object agreement has a distribution that looks very much like that which accusative Case-marked NPs have in English: it depends on the choice of verb (transitive verbs trigger it and intransitive verbs dont) and it targets NPs that are roughly in the positions that
231
nomenon, in which case they would only play a role in determining the meanings of the NPs they are in. In such languages, all the features would be valued. The rst clause in (148) therefore restricts the constraint to those languages that have Case marking or agreement. It requires in such languages that there be a head which mediates the values of those features, thereby gluing them together. In English, this head is Agr0 . For these reasons, then, we should leave person and number features associated with Agr0 . As noted above, this means that unless we decide against letting the valued versions of the person and number features come with the NP, we have a problem for our formulation of AGREE. We cannot restrict AGREE so that it only passes values down to c-commanded terms. In (147), repeated below, we need to let the values for person and number on the NP be passed up to Agr0 . To this end, AGREE becomes (149). (149) Let X and Y be terms that carry features, and let the features on X have a value and those on Y be unvalued. The features on X or Y can get the value of the matching features on the other just in case X and Y agree. X and Y agree iff: a. one of X or Y c-commands the other, and b. If Y is a head, there is no Z0 such that X c-commands Z0 and Z0 c-commands Y0 . This version of AGREE still requires that the two terms in an AGREE relation also be in a c-command relation. But it doesnt require that the values be passed from the c-commander to the c-commandee. This will have the desired effect,
16 This claims that if a language has both Case assignment and agreement, only one of the Case assigners will be forced to trigger agreement with the NP it assigns Case to. What we (probably) want is to force every Case assigner to control agreement with the NP it assigns Case to. We want to allow languages like English, in which one Case assigner (nominative) is tied to agreement but another (accusative) isnt, while still tying together the Case assignment and agreement phenomena.
232
then, of letting the values for person and number features on an NP be assigned to an Agr0 that this NP does not c-command. I will allow AGREE to convert (147) to (150) below. The version of AGREE in (149) would also let the values that NP has for the person and number features be xed on T0 and V0 by putting these terms into an AGREE relation with NP. This isnt desired, however. If the NP could directly agree with verbs, and other terms, then our desire to tie agreement to Case would be lost. We would expect that the values for features on a verb could be valued by an NP that is not getting its Case from Agr0 . What we see is something different, however. It is the very NP that bears nominative Case whose features arise on the verb. We must prevent the NP from entering into an AGREE relation with V0 . Its values for person and number must come from Agr0 . A common way of ensuring this is to require that the c-commanding term in an AGREE relation is a head. This would prevent the NP in (149) from entering into an AGREE relation with the verb that follows. We therefore change AGREE once more. (151) If X c-commands Y, then the unvalued features on X or Y can get the value of the matching features on the other just in case X and Y agree.
233
X and Y agree iff: a. X is a head, and b. If Y is a head, there is no Z0 such that X c-commands Z0 and Z0 c-commands Y0 . The c-commanding head that enters into an AGREE relation is often called the probe, and the other term in the AGREE relation is known as the goal. What (151) insists, then, is that the probe be a head.
234
TP T T person: number: tense: past VP NP person: 3rd number: sing Case: she V V person: number: tense: past blort NP person: 3rd number: sing Case: it
Case: nom
The verb blort is one of the kinds that Burzios generalization describes as not occurring: it is a verb that c-selects two NPs but does not come equipped with a valued Case feature. We saw in the previous chapter that a way of understanding why such verbs should not arise is by considering how the D-structures they produce would satisfy the Case lter. Under a certain set of assumptions about how Case is assigned, we discovered that such D-structures could not satisfy the Case lter. With no well-formed S-structure able to express these verbs, their non-existence is explained. One of the assumptions necessary to block (152) is that the NP in object position is not capable of getting nominative Case from Agr0 . At the time we examined this scenario, we were working with a system of Case assignment that required NPs to move into Specier of AgrP to get their Case feature valued. Now that this system has been revised, we should reconsider how (152) is prevented from having a good S-structure associated with it. The denition of AGREE in (151) does not have this desirable consequence. The unvalued Case features on both NPs could get a value from Agr0 . To prevent this, we have two options. We could prevent the Case feature from entering
235
TP T T person: number: tense: past V person: number: tense: past arrive VP V NP person: 3rd number: sing Case: it
Case: nom
We want to make the presence of the NP in Specier of VP responsible for blocking Case assignment to the object NP in (152). One way of doing this is with (154). (154) If X c-commands Y, then the unvalued features on X or Y can get the value of the matching features on the other just in case X and Y agree. X and Y agree iff: a. X is a head, and
236
b. There is no Z such that X c-commands Z and Z c-commands Y, where Z and Y match in projection level. This denition has generalized the Head Movement Constraint clause in AGREE to phrases. It prevents there from being any term between the probe and goal that is of the same projection level as the goal. If the goal is an XP, then there can be no closer NP to the probe; and if the goal is an X0 , there can be no closer X0 to the probe. This will correctly prevent (152) from having a grammatical outcome as it will prevent the object NP from getting a Case value from Agr0 since there is a closer NP, the subject. The locality condition in (154b) is too strong. It prevents any phrase from standing between an NP and the head it gets its Case valued from, and that would prevent (155) below from having a grammatical outcome. But (155), of course, leads to the well-formed S-structure: It very suddenly arrived. Lets weaken the locality condition, then, to (156). (156) If X c-commands Y, then the unvalued features on X or Y can get the value of the matching features on the other just in case X and Y agree. X and Y agree iff: a. X is a head, and
237
4.7 Summary
Here is a brief recap of the elements we have introduced to our grammar in this chapter. (157) a. If a feature, , has , then it must be pronounced with the term whose features it is in an AGREE relation with adjoined to a projection of the head containing . b. If a language has an unvalued Case feature or unvalued features, then there will be some head in which they are all bundled. c. If X c-commands Y, then the unvalued features on X or Y can get the value of the matching features on the other just in case X and Y agree. X and Y agree iff: i. X is a head, and ii. There is no Z such that X c-commands Z and Z c-commands Y, where Z and Y the same kind. iii. X and Y are the same kind if they are both heads, or they are both NPs d. Feature Valuation Requirement (holds of S-structure) The values of a feature must be morphologically expressed and every feature must get a value.
Ive changed the wording of some of these conditions in order to t them together better. For instance, the formulation of the property that I set out
238
Summary
had the property of heads, rather than of features. But in order to embrace the account for subject inversion in Alexiadou and Anagnostopoulou (1998), we found it necessary to see the attribute as property of features. There are other, similar, small rewordings in (157).
239
5
Determiner Phrases and Noun Movement
One of the puzzles we stumbled over in introducing phrase structure rules involved the internal shape of determiner phrases. I noted that the set of strings that constitute DPs is miserably anemic. There are very few examples of nontrivial strings of words that offer themselves as possible determiner phrases. Typically, a determiner phrase appears to be constituted of nothing more than its head. The sorts of examples I offered as candidates for this family of strings were things like the bold-faced material in (1). (1) a. b. c. all but three determiners more than six children two dozen eggs
But there are reasons to think that these examples dont have parallel parses, and that, in fact, none of them t to a DP string in the desired way. Its probable that dozen, in (1c) is an adjective; this can be seen by observing that it can follow other adjectives (something determiners arent capable of doing): (2) an expensive dozen eggs (1a) involves a coordinator, but, which will invoke the sorts of structures we have encountered before with coordinations. (1b) involves what is known as a comparative construction, whose syntax, like that with coordinations, invokes larger joined structures. We wont examine these cases in any detail here, but let me offer as a way of thinking about the syntax of these cases that makes their semantics transparent, something along the lines of (3), where the
Imagine, that is, that these cases involve bringing two full NPs together, and that a process of ellipsis removes the N from the rst NP and, moreover, this N is understood to refer to the same set of individuals that the N in the other NP refers to. If these cases dont involve strings that have the same distribution as determiners, then where are the strings that are determiner phrases? Why are they so hard to nd? This problem can be related to another, worse, problem. Remember that determiner phrases compete with genitive NPs for the Specier of NP position. This is what is responsible for the paradigm in (4). (4) a. Marys lamp b. the lamp c. * the Marys lamp
We adopted a view of NPs that embraced a constraint yet to be found that limited DPs and genitive NPs to their Specier position. When we transited from a model about syntactic form that used Phrase Structure rules to one that involved X Theory and other general principles, we lost a way of expressing this fact. We must now understand how to capture the competition between determiners and genitives for the rst position in NPs. This involves understanding what it is that governs the distribution of genitive NPs and determiners. A problem for capturing the related distributions of determiners and genitives arises in certain cases which look rather like clauses, but which nonetheless have genitive subjects. These are called gerunds, and (5) provides some examples. (5) a. b. c. [Marys loudly singing the song] bothered us. I recalled [Marys xing the car]. [Marys having talked to John] wasnt widely known.
This suggests that these phrases have an organization something like (6).
1 For an examination of cases like (3b), see Hackl (2000).
242
The DP Hypothesis
(6)
NPs Mary ? ?P ? IP I I ing AdvP loudly V sing VP V V NP the song
243
b.
DP D D the NP N N lamp
244
The DP Hypothesis
If this is correct, it would also answer the problem we began with. The reason DPs look so anemic is because theyre considerably larger than we thought. The two considerations Ive just adduced in favor of reanalyzing NPs as DPs with NPs inside them can be found in Abney (1987).2 He gives another, smaller, argument on behalf of this reanalysis that relies on a mystery concerning the expression of Adjective Phrases in English. The mystery is that there is a constraint on Adjective Phrases in English nominals which determines how large they can be depending on whether they precede or follow the noun. As (10) shows, when an AP has nothing but its head in it, it prefers preceding the noun, whereas if it contains material following the A0 , it prefers following the head. (10) (11) a. some angry children b. * some children angry a. * some angry at Bill children b. some children angry at Bill
There is a systematic exception to this, and these are expressions like everyone/everything, someone/something, anyone/anything and no one/nothing. (12) a. someone angry b. something large c. * angry someone d. * large something a. everyone angry b. everything large c. * angry everyone d. * large everything a. no one angry b. nothing large c. * angry no one d. * large nothing
(13)
(14)
Abney3 suggests that an analysis of this exception should not make it accidental that the determiners every, some, any and no and the nouns one and thing
2 Ive changed slightly his discussion of cases like Marys singing the song but the spirit of the argument is his. 3 Who is here following a suggestion of Richard Kaynes, who in turn is building on ideas in Postal (1969).
245
b.
Now Head Movement can combine one with D0 to form the DPs in (12). Further, to the extent that combining one with some/every really involves Head Movement, we have an argument for the reanalysis of NPs into DPs. This is because the Upwards Constraint and the Likes Attracts Likes constraint com-
246
Noun Movement
bine to allow one to adjoin only to a head that c-commands its original position. So, if the some and every parts of someone and everyone are determiners, and the one part is a noun that has Head Moved into these determiners, then it must be that D0 c-commands NP. If these considerations are on the right track, it demands that we change our way of talking about nominals altogether. Everything we once thought to be true of NPs, is now true of DPs instead. So, for instance, the Case Filter is something that xes the position of DPs, not NPs. NPs are now found pretty much only inside DPs and not, as we previously thought, in subject and object positions. From this point forwards, then, let everything that we have credited to NPs hold of DPs instead, and let NPs be selected only by determiners, thereby xing their position within DPs.4 We have also seen, faintly, evidence that nouns move internal to DPs in a way somewhat like the movement of verbs internal to CPs. Indeed, there is a variety of interesting evidence that Noun Movement exists to a larger degree than just that found in the someone and everyone cases mentioned above. Moreover, there is some evidence that this movement relocates a noun to a head associated with inectional morphology, much like the situations we have viewed involving verbs and inectional morphology. In gross terms, then, DPs and IPs have a variety of parallels; it is this parallelism that Abney focuses on in the rst two chapters of his dissertation.5 In this chapter, we will examine a paradigm of facts which focus on the noun movement part of this parallelism.
247
6 In an unpublished talk delivered at the Universit di Venezia in 1990. See his GLOW abstract from the 1992 Lisbon meeting.
248
Noun Movement
In (18a), American serves as a genuine modier, merely attributing to the referent involved that it has the property of being American. But in (18b), American refers to an abstract entity that is constituted of the American people or the American government.7 That this is a function of these adjectives bearing the external -role which opinion and discussion assign is indicated by the fact that this meaning is lost when there is another external -role bearer in the nominal. (19) a. b. Ugandas American opinion of the blockade Moroccos American discussion of trade barriers
Like other external -role bearers, then, this one shows up postnominally in Italian (and the other Romance languages), which can be accounted for if we let these adjectives be underlying placed in Specier of NP, and then make nouns move leftwards past them. There is an alternative method of generating these word orders. Giorgi and Longobardi (1991) suggest that there is a difference in the direction that Speciers can branch in Romance and Germanic, and that this is responsible for the fact that the phrases that appear in these Speciers, i.e., subjects, show up following the noun in Romance but not Germanic. This alternative account predicts that postnominal subjects can follow the complements, and this is generally possible too. Thus the di/de phrases in (16) are actually ambiguous; either of them can have the subject or object reading. Giorgi and Longobardi suggest that this word-order alternation arises by virtue of a rule that moves the object past the right-branching subject position. The Noun Movement account would have to claim that the subject can move rightwards past the object. There are reasons for doubting that the Giorgi and Longobardi account is correct, and this direction has largely been abandoned in the literature. One of these is that, as Valois (1991a) and Bernstein (1993) note, ethnic adjectives cannot follow complements in Romance. (20) *Linvazione dellAustria tedesca the invasion of Austria german (Valois 1991a, p. 374) This can be related to the fact that ethnic adjectives seem unable to move. There is evidence in English for this which comes from the fact that ethnic
7 This quality of ethnic adjectives is rst discussed, to my knowledge, in Kayne (1984, Chapter 7).
249
a. * The Iranian bombardment by Russia took weeks. b. * The Ugandan invasion by Tanzania grinds slowly on. (with an object interpretation for the adjective (roughly Kaynes 1984, (32) and (33), p. 139)
As we shall have occasion to see, there is evidence that the genitives in (21) have undergone A Movement from a position to the right of the noun, where they receive their -role. Ethnic adjectives, apparently, are unable to move from this position. Instead, they are stuck in the position from which they get their role. Thus, the fact that they appear in Romance between the noun and the nouns complements suggests that the underlying position to which the external -role is assigned in nominals is to the left of the complement. This follows if the Specier of NP branches to the left rather than the right. This fact, then, ts the model of Romance nominals which has the noun moving leftwards past the subject. So, now, where are the nouns moving in these cases? One possibility, explored in a variety of places, is that the intermediary position is where number morphology is associated. There is some prima facie reason for thinking that number morphology heads a syntactic phrase: Cross-linguistically this is common, as Dryer (1989) shows. Thus, in Yapese, for example, the plural/singular/dual categories are expressed with separate morphemes. (23) a. b. c. ea rea kaarroo neey sing car this ea gal kaarroo neey dual car this ea pi kaarroo neey plural car this
This at least suggests that Universal Grammar makes projecting a syntactic phrase above Number a possibility. Further, Dryer shows that the relative order of Num0 and Noun correlates with Verb-Object word order. This would be explained, on standard theories of word order typology, if Num0 is in a head complement relation with Nouns. Moreover, Dryer nds that most times there
250
Noun Movement
is a number word, it falls more embedded in the nominal than do determiners, but above adjectives and the noun. He provides examples like the following. (24) a. ha ongo puhae ua art dual box two (Tongan) b. do mamu ragha tree big plural (Kimaghama) c. me-ria rabiri plur-new paddle (Cayuvava) There are exceptions, but this can be said to be the basic order among these terms. If so, the pattern that emerges can be sketched in (25) below. This is (25)
DP D D NumP Num NP N N XP
what the statistical study yields. Of course, if Dryers conclusion that adjectives come between Num0 and N0 is valid for English, then English nouns must combine with this morphology in one of the ways we have seen possible in the verb/inection cases. One possibility is that nouns overtly move to Number, but that this doesnt bring the noun to the left of the possessor in English because possessors are driven into Specier of DP, an even higher position. Or, we might imagine that the noun undergoes covert movement to Number head. So now what we want to determine is: Is there language internal evidence for the picture that Dryer gives us statistically? And, is there evidence that
251
It is also possible to nd prenominal single adjectives in French, as in the following example. (27) a. b. une large valle a large valley une valle large a valley large
But here Bernstein notes that there is a difference in meaning: in (27a), the nominal refers to an individual drawn from the set of things that are large valleys. In (27b), by contrast, a member of a class of valleys which happens to be large is denoted. In Giorgi and Longobardis study of this phenomenon in Italian, they suggest that the prenominal depictive adjective can only get an appositive interpretation, whereas the postnominal one can have either an appositive or restrictive reading. The difference between an appositive and a restrictive reading is subtle. Roughly speaking, appositive modiers contribute their meaning to the expression they are attached to in a way that is reminiscent of conjunction. So, for instance, in (28a) the PP from Duluth stands in the same relation to Mary as it does in (28b). (28) a. Mary, from Duluth, has arrived.
252
Noun Movement
b. Mary has arrived and she is from Duluth.
In an example such as (29), by contrast, from Duluth plays a more direct role in determining the reference of the DP it is attached to. (29) Jill likes women from Duluth. In this case, from Duluth restricts the reference of women to just those that have an attribute that Jill values: being from Duluth. One could not capture the meaning conveyed by (29) with a circumlocution, parallel to (28b), like: (30) Jill likes women, and they are from Duluth. Perhaps it is this sort of difference in meaning that correlates with the prenominal/post-nominal position of adjectives. If so, it doesnt appear to always be the case, however. There are some examples where the alternation between Adj+N and N+Adj order doesnt appear to invoke any meaning difference. Valois (1991b) provides some examples in nominals with a deverbal noun. (31) a. La probable invasion de Jupiter the probable invasion of Jupiter La invasion probable de Jupiter the invasion probable of Jupiter b. La frquente invasion de Jupiter the frequent invasion of Jupiter La invasion frquente de Jupiter the invasion frequent of Jupiter (Valois 1991b, 374) Valois claims that there is no difference in meaning attendant with these word orders. Whats going on here will have to await a better understanding of the syntax-to-semantics mapping of modication. What is the source of the difference between French and English with respect to placement of these single adjectives. Why can they appear after the noun in French but not in English? One possibility would be to blame whatever it is that prohibits bare adjectives from being right-adjoined to the nominal projection as the cause. Maybe this constraint is not present in French? Actually, however, there is evidence that this constraint is also present in French. Weve seen that bare adjectives
253
254
Noun Movement
These plural afxes are not present phonetically even when the context for liaison is provided. However, she notes that in French there is a certain class of suppletive forms where the nouns do show a morphological mark for plurality. Examples are given below. (35) a. b. (36) a. b. (37) a. b. un mal an evil des maux evil(s) un oeil rouge a red eye des yeux rouges red eye(s) quel cheval which horse quels chevaux which horses
In Walloon, however, these forms always appear just in their singular form. (38) a. b. (39) a. b. (40) a. b. on m evil ds m[s] evils on rothch[e]-oy red eye ds rodj[e]-z-oy red eyes [ke: dzva:] which horse [ke: dzva:] which horses
255
In French, the head noun is driven overtly into the Num0 position to satisfy the attribute associated with the feature residing there, and this will bring it
8 But its not that liaison is completely absent in Walloon Bernstein notes that it is still present after plural determiners and pronouns (for this, see Bernstein (1991, note 7, p. 107).
256
Noun Movement
past the adjective. This movement is blocked in Walloon because either there is no NumP, or its head is associated with a feature that has no attribute. Thus the availability of plural forms in French is correlated with its ability to appear before single adjectives. This achieves the desired correlation between presence of number morphology and N+Adj word order, and also supports the idea that number morphology is associated with an inectional category that projects its own phrase. Is it possible to tell whether Walloon has a NumP, or whether it is absent altogether? Bernstein suggests that there are reasons for assuming that Walloon does have NumP and, moreover, there is some reason to believe that it is actually lled with morphology. If this is correct, the crucial difference between Walloon and French is not whether NumP is present or not, but instead how it combines with the noun that follows. Interestingly, Bernstein argues that it combines in a way that we would not have expected from our examination of verbal syntax. She argues that the plural morpheme in Walloon begins in Num0 and attaches to the left edge of the following noun; but, somewhat surprisingly, it shows up orthographically as the nal syllable of an adjective which precedes the noun. Lets briey examine how she arrives at this conclusion. One fact, due to Morin (1986), that leads her in this direction is that liaison between prenominal adjectives and a following noun is absent in Walloon, though present in French. (43) a. b. un gro-z -arbre a big tree une peti-t -enfant a little child (French) (44) a. on gro[s] abe a big tree (Lige Walloon) b. on peti[t] fant a small child (Gondecourt Picard) She suggests that the account offered here would provide an immediate explanation for this, if in Walloon there is a Num0 that lies between the prenominal adjective and the following noun. This intervening category might then be responsible for blocking liaison in Walloon. In French, by contrast, the noun has
257
258
Noun Movement
(45)
DP D D NumP Num AP1 Num Num NP N AP2 N N
the right of the noun, itll have to be restrictive. I dont know if this is a correct outcome. A different way of modeling the N+Adj/Adj+N word order, that still correlates the availability of the N+Adj order with overt Noun Movement to Num0 and would also address the problem we encountered with Bernsteins explanation for the contrast in liaison between Walloon and French, would be to hypothesize an optional projection above NumP. Then we could place adjectives on either side of this optional projection, and let the noun move into its head when it is present. I dont know what this optional phrase is, so I will simply call in Y here. The idea, then, would be to give to DPs the shape in (46).10
10 Relevant to this decision is that ethnic adjectives cant appear prenominally in (standard) Italian or French. (i) (ii) * la tedesca invazione dellAustria the german invasion of Austria quel tedeschissimo comportamento that very German behavior (Valois 1991b, p. 374)
259
Now, as before, let nouns be driven into Num0 to satisfy a property associated with the number feature in Romance. When YP is absent, then nouns will surface to the right of adjectives, both those in AP1 and those in AP2 position. If, as before, we associate these two positions with the two interpretations that these adjectives can get, we will, in this situation, allow prenominal adjectives to have either interpretation. When YP is present, assume that Y0 has a feature with a attribute associated with it, and the Num0 +N0 pair will be driven into Y0 . In that situation, the noun will surface to the left of adjectives in AP2 , thus appearing to the left of adjectives with a restrictive interpretation, and will still remain to the right of adjectives in AP2 , presumably those with a non-restrictive interpretation.
To the extent, then, that ethnic adjectives show us where the D-structure position of external role bearers are in nominals, this fact suggests that these external -role bearers are positioned before objects in Romance nominals.
260
Noun Movement
Consider, by contrast, a language which does not allow Nouns to move into Num0 , presumably Walloon for instance. In these languages, movement into Y0 will be blocked by the Head Movement Constraint. That is, if we could nd something that not only prevented Nouns from surfacing in Num0 , but also prevented them from moving through Num0 , then we would have a way of correlating access to Y0 with access to Num0 . This, in fact, is what Bernstein does. She argues that the plural morpheme in Walloon is placed in Num0 , and this blocks movement of the noun in Walloon. There are two plural morphemes in Walloon, one for feminine nouns and one for masculine nouns. Both are expressed orthographically on the prenominal adjective. The feminine plural morpheme is realized before consonant initial words as an unstressed vowel and before vowel initial words as [Ez]. Examples follow. (47) a. b. Compare: (48) li ner sipne the black thorn les bel[s] feyes the pretty girls ds ner-z -amonnes some black berries
The masculine plural morpheme (-s) shows a similar pattern, though it is phonetically manifest only in contexts of liaison. (49) a. b. ds der[s] tchivs the black hair ds ner-z -ouy the black eyes
She argues against composing the feminine plural marking of a gender morpheme and a number morpheme because this would assign to feminine the sufx -e, and this doesnt show up in singular nominals. (50) li ner sipne the black thorn So she supposes that there is only one morpheme, a plural one, that is to be found here. And, she conjectures that this morpheme is portmanteau with gender, or what she calls a word-marker, following work by Jim Harris. From now
261
And when the nominal that the adjective precedes is absent, this morpheme does not appear. Its missing in copular constructions, for instance, as shown in (52). (52) C ds bl[es]. those are good b. * C ds ble[s]. those are good a.
Second, only one of these morphemes appears when two prenominal adjectives are conjoined. (53) ds bl[es] t boun[s] bisses some nice and good animals This, at least, is the case in one dialect of Walloon (Boulogne Picard). In another, Lige Walloon, it is possible to nd the plural morpheme on all adjectives in the series. (54) ds bl[s] grad[s] djn[s] fy[es] some nice and strong young girls She suggests that in these cases, the adjectives arent actually stacked, but are instead conjoined. She notes that the conjunction is es in Walloon, and therefore homophonous with the plural morpheme. Third, there is phonological evidence that this morpheme is a proclitic on the following noun and not sufxed onto the preceding adjective. First, there is a widespread process of nal obstruent devoicing in Walloon, that Bernstein illustrates with the following pair. (55) a. grander big
262
Noun Movement
b. grande amice [gr at amis] good friend
When adjectives are followed by the plural morpheme, they show obstruent nal devoicing, as the contrast below illustrates. (56) a. * grand[s] fyes big girls b. grant[s] fyes good girls
A second phonological reason for thinking that the plural afx is not part of the preceding adjective is that it is unstressed. She cites Morin who argues that all words in Walloon have stressed nal syllables. Finally, again following Morin, she points to the fact that in Gondecourt Picard, the plural morpheme, s, triggers harmony on the following noun. She follows Morin and adopts the proposition that harmony is restricted to words in Walloon, which leads to the conclusion that s is part of the following noun, not the preceding adjective. This pattern of data all makes sense, Bernstein concludes, if the Walloon plural sufx combines with the following noun not by way of N0 movement, but instead, by procliticizing onto the unmoved, following N0 , as indicated in (57) on the following page. (Understand the material enclosed within [Pr W d ] to form a phonological word.) As noted above, this will explain why Walloon nouns surface always to the right of adjectives, because they will not be able to move through Num0 into Y0 . Still, there are problems with this account which are central enough to suggest that it needs revision. For one thing, it persists in requiring that adjectives be placed higher than NumP, and this runs against the trend Dryer found for languages to place adjectives within NumP. In addition, it credits the availability of a noun movement past an adjective to the properties of Y0 , and only indirectly to the properties of Num0 . But the aim of Bernsteins analysis of the Walloon/French contrast is to make the properties of Num0 responsible for noun movement past adjectives. Unless some intimate link can be made between Y0 and Num0 , the phrase marker in (46) isnt equipped to express a correlation between occupying Num0 and preceding single adjectives. The decision to place adjectives higher than Num0 , and to invent a new Y0 into which nouns can move, responds to the desire to explain the absence of liaison in Walloon between prenominal adjectives and the nouns that follow them. Bernsteins account forces Num0 to intervene between prenominal adjectives and NP. Perhaps we should abandon trying to account for this fact,
263
and let it come from some other idiosyncrasy of Walloon. This will allow us to return to a model of DPs like (58) on the next page. The difference between French and Walloon, as before, consists in whether Num0 holds a free morpheme a clitic as in Walloon, or a number feature that values the number feature on the following noun, as in French. Lets consider how this account of the difference in adjective placement between Walloon and French/Spanish/Catalan might be applied to the similar difference between English and French/Spanish/Catalan. There is no evidence of the sort weve just seen in Walloon that the number morpheme in English is a free morpheme. Lets assume, then, that Num0 in English contains a number feature, as it does in French. Given the tools developed here, perhaps the most straightforward way of modeling the difference between English and Romance, then, would be to give the number feature in French, Catalan and Spanish a property, but not give the number feature that attribute in English. This will force nouns in French, Catalan and Spanish to move to Num0 , thereby bring-
264
Noun Movement
(58)
DP D D NumP Num AP Num Num NP N N
ing them to the left of (some) adjectives. So, English S-structures will arrange DPs as in (59), while in the remaining Romance languages, nouns will surface in Num0 as in (60) below. (59)
DP D D NumP Num AP1 Num Num NP N AP2 N N
265
This gives us a three-way distinction. Walloon nouns have no number feature, English nouns do, but dont move, and French nouns have a number feature and move. The surface position of nouns in English and Walloon, then, is the same but for different reasons. One consequence of forcing nouns in Walloon and English to remain in their underlying position is that they will remain to the right of the Specier of NP. Recall that in Romance, we associated the ability of nouns to surface to the left of Specier of NP with the availability of subjects of nouns to surface post-nominally. For instance, the French example in (16c), repeated below, arises by leaving de chaque peintre tranger (of each foreign painter) in Specier of NP and moving the noun, portrait (portrait), past it to the left. (16c) le portrait de chaque peintre tranger de son enfant the portrait of each painter foreign of his child the picture by each foreign painter of his child If nouns dont move to Num0 in English or Walloon, we would expect these postnominal subjects to be unavailable in both English and Walloon. Weve already seen that this is the case for English. But, interestingly, it doesnt seem to be the case for Walloon.
266
Noun Movement
Walloon does allow the N+Subject word order; Bernstein illustrates this with examples like (61) below.11 This suggests that even in Walloon, there is (61) la fy do moun the daughter of the miller the millers daughter
short noun movement, past the Specier of NP position. If Bernsteins arguments concerning how number morphology is expressed in Walloon is correct, this short noun movement cant be to Num0 . Bernstein suggests that it is instead movement to the position associated with the gender morpheme that Romance nouns so typically end in. She calls this a word marker. A schematic surface phrase marker for a Walloon DP, then, looks something like (62), then. Note that this requires that adjectives are not capable of being within NP. Indeed, Walloon illustrates that the availability of postnominal subjects and postnominal (single) adjectives do not correlate cross-linguistically. It is necessary, therefore, to divorce the processes that yield these two word-orders, and if noun movement is the relevant process, then this means there must be two positions to which nouns can move, with subjects below, and adjectives above, the lower of these. Up to now we have relied on a general theory of modier placement one of whose outcomes is that adjectives should be adjoined to the N that they modify. One thing we learn from this study, then, is that this general theory will have to be changed. But lets leave that for a later occasion. We must also revisit our account for why postnominal subjects dont arise in English. Its no longer sufcient to prevent nouns from moving to Num0 in English. We must also now address the possibility that nouns could move to a position beneath Num0 : the Wm0 position that Bernstein posits Wallon to have, for instance. While it might be that there are no postnominal subjects in English because English nouns dont make even a short move, but it could also be because
11 This is perhaps not the most compelling example as it is difcult to tell whether miller bears the subject relation to daughter. Interestingly, Bernstein claims that Walloon also allows for postnominal adjectives when they are ethnic; in fact, in these situations, the prenominal position is blocked. This also, rather dramatically, supports the conclusion that subjects can be postnominal in Walloon.
267
the other ingredient necessary to getting postnominal subjects is missing from English. Perhaps subjects cannot remain in Specier of NP position. We might characterize this difference between English and Romance in terms of the positions that Case is assigned to within DPs. Let the Specier of DP be assigned Case in both English and Romance, but let only Romance assign Case to Specier of NP. Note that this Case is manifest in what appears to be a preposition di or de, depending on the language. Lets call this Case, the one expressed by a preposition, Nominal Case. On this view, then, the difference in availability of postnominal subjects between English and Romance boils down to the availability of Nominal Case in Specier of NP. Indeed, the subject arguments of DPs uniformly appear with the genitive Case in English, and this is a position, as weve seen, associated Specier of DP. Thus, no matter what its position relative to the noun, the subject of a transitive noun cannot be Case marked with of, as (63) indicates.
268
Noun Movement
(63) a. b. c. d. e. f. * the discussion of Jill of the problem * the discussion of the problem of Jill * the of Jill discussion of the problem * the placement of Mark of the sofa * the placement of the sofa of Mark * the of Mark placement of the sofa
Its not possible to express the subjects of nouns this way even when the nouns do not express their object argument. Leaving the objects unexpressed in the examples in (63), for example, does not improve them. (64) a. b. c. d. * the discussion of Jill * the of Jill discussion * the placement of Mark * the of Mark placement
But its not that Nominal Case is completely absent in English. It is possible for Nominal Case to be found on the arguments of nouns that derive from unaccusative verbs, as in (65). (65) a. b. c. d. the death of her the arrival of it the appearance of Jill the sinking of the ship
With nouns derived from unergative verbs, the situation is somewhat intermediate, as illustrated by (66) below.12 If we interpret these facts as indicating that (66) a. b. c. d. e. ?* the running of her * the talking of him ?? the dancing of Jill ?* the speaking of the woman ?* the sitting of Mark
there is a distinction between the unaccusative nouns and the others that
12 See Grimshaw (1990) for a discussion of these facts and an argument that nouns divide into the unaccusative and unergative classes.
269
13 Restricting attention to just those languages that Case mark subject DPs in Specier of NP, and have the ban on right-adjoining bare adjectives.
270
Noun Movement
(68)
DP Jill
271
Bibliography
Abney, Steven. 1987. The English noun phrase in its sentential aspect. Doctoral Dissertation, Massachusetts Institute of Technology. Adger, David. 2003. Core syntax. Oxford: Oxford University Press. Alexiadou, Artemis, and Elena Anagnostopoulou. 1998. Parametrizing Agr: word order, V-movement and EPP-checking. Natural Language and Linguistic Theory 16:491539. Alexiadou, Artemis, and Elena Anagnostopoulou. 1999. EPP without Spec, IP. In Speciers, ed. David Adger, Bernadette Plunkett, George Tsoulas, and Susan Pintzuk, 93109. Oxford: Oxford University Press. Baker, C. L. 1978. Introduction to generative transformational syntax. Englewood Cliffs, New Jersey: Prentice-Hall. Baker, Mark. 1985. The mirror principle and morphosyntactic explanation. Linguistic Inquiry 16:373416. Baker, Mark C. 2003. Lexical categories. Cambridge: Cambridge University Press. Baltin, Mark. 1982. A landing site theory of movement rules. Linguistic Inquiry 13:138. Belletti, Adriana. 1982. On the anaphoric status of the reciprocal construction in Italian. The Linguistic Review 2:101138. Belletti, Adriana. 1990. Generalized verb movement. Rosenberg & Sellier. Bernstein, Judy. 1991. DPs in French and Walloon: Evidence for parametric variation in nominal head movement. Probus 3:101126.
Bibliography
Bernstein, Judy B. 1993. Topics in the syntax of nominal structure across Romance. Doctoral Dissertation, City University of New York. den Besten, Hans. 1983. On the interaction of root transformations and lexical deletive rules. In On the formal syntax of the Westgermania, ed. W. Abraham. Amsterdam, The Netherlands: John Benjamins Publishing Company. Bobaljik, Jonathan. 1994. What does adjacency do? In The morphology-syntax connection, ed. Heidi Harley and Colin Phillips, 132. MIT. Bobaljik, Jonathan. 1995. Morphosyntax: The syntax of verbal inection. Doctoral Dissertation, Massachusetts Institute of Technology, Cambridge. Bobaljik, Jonathan. 2003. Floating quantiers: Handle with care. In The second Glot International state-of-the-article book, ed. Lisa L.-S. Cheng and Rint Sybesma, 107148. Mouton de Gruyter. Bod, Rens, Jennifer Hay, and Stefanie Jannedy. 2003. Probabilistic linguistics. Cambridge, Massachusetts: MIT Press. Boeckx, Cedric, and Norbert Hornstein. 2003. Reply to Control is not movement. Linguistic Inquiry 34:269280. Boeckx, Cedric, and Norbert Hornstein. 2004. Movement under control. Linguistic Inquiry 35:431452. Boeckx, Cedric, and Norbert Hornstein. 2006a. Control in Icelandic and theories of control. Linguistic Inquiry 37:591606. Boeckx, Cedric, and Norbert Hornstein. 2006b. The virtues of control as movement. Syntax 9:118130. Boersma, Paul, and Bruce Hayes. 2001. Empirical tests of the gradual learning algorithm. Linguistic Inquiry 32:4586. Bokovi c, eljko. 2001. Floating quantiers and theta-role assignment. In North East Linguistic Society, ed. Minjoo Kim and Uri Strauss, 5978. Georgetown University: GLSA. Bokovi c, eljko. 2002a. A-movement and the EPP. Syntax 5:167218.
274
Bokovi c, eljko. 2002b. Expletives dont move. In Proceedings of North East Linguistic Society 32, ed. Masako Hirotani, volume 1, 2140. Bokovi c, eljko. 2004. Be careful where you oat your quantiers. Natural Language and Linguistic Theory 22:681742. Bowers, John. 1993. The syntax of predication. Linguistic Inquiry 24:591656. Bresnan, Joan. 1972. Theory of complementation in English syntax. Doctoral Dissertation, Massachusetts Institute of Technology. Burzio, Luigi. 1986. Italian syntax. Reidel Publishers. Carrier, Jill, and Janet H. Randall. 1992. The argument structure and syntactic structure of resultatives. Linguistic Inquiry 23:173234. Chierchia, Gennaro. 1984. Topics in the syntax and semantics of innitives and gerunds. Doctoral Dissertation, University of Massachusetts, Amherst. Chomsky, Noam. 1957. Syntactic structures. Mouton. Chomsky, Noam. 1965. Aspects of the theory of syntax. Cambridge, Massachusetts: M.I.T. Press. Chomsky, Noam. 1970. Remarks on nominalization. In Readings in English transformational grammar, ed. J. Jacobs and P. Rosenbaum, 184221. Waltham, Massachusetts: Ginn. Chomsky, Noam. 1981. Lectures on government and binding . Dordrecht, The Netherlands: Foris Publications. Chomsky, Noam. 1985. The logical structure of linguistic theory. Chicago: University of Chicago Press. Chomsky, Noam. 1986a. Barriers. Cambridge, Massachusetts: MIT Press. Chomsky, Noam. 1986b. Knowledge of language. New York, New York: Praeger Publishers. Chomsky, Noam. 1995. The minimalist program. Cambridge, Massachusetts: MIT Press.
275
Bibliography
Chomsky, Noam. 1998. Some observations on economy in generative grammar. In Is the best good enough?, ed. Pilar Barbosa, Danny Fox, Paul Hagstrom, Martha McGinnis, and David Pesetsky, 115128. Cambridge, Massachusetts: MIT Press. Chomsky, Noam. 2000. Minimalist inquiries: The framework. In Step by step: Essays on minimalist syntax in honor of Howard Lasnik, ed. Roger Martin, David Michaels, and Juan Uriagereka, 89156. MIT Press. Chomsky, Noam. 2001. Derivation by phase. In Ken Hale: A life in linguistics, ed. Michael Kenstowicz, 152. Cambridge, Massachusetts: MIT Press. Chomsky, Noam, and Morris Halle. 1968. The sound pattern of English. New York: Haper & Row, Publishers. Chomsky, Noam, and Howard Lasnik. 1993. Principles and parameter theory. In Syntax: an international handbook of contemporary research, ed. Arnim von Stechow, W. Sternefeld, and T. Vennemann, 506569. Berlin: Walter de Gruyter. Chomsky, Noam, and George Miller. 1963. Finitary models of language users. In Handbook of Mathematical Psychology 2, ed. Luce, Bush, and Galanter, volume Handbook of Mathematical Psychology 2, 419491. New York, New York: Wiley and Sons. Dprez, Vivian. 1989. On the typology of syntactic positions and the nature of chains. Doctoral Dissertation, MIT, Cambridge, Massachusetts. Diesing, Molly. 1992. Indenites. Cambridge, Massachusetts: MIT Press. Doetjes, Jenny. 1995. Quantication at a distance and iteration. In Proceedings of the North East Linguistic Society 25, ed. Jill Beckman, 111126. University of Pennsylvania: Graduate Linguistic Student Association. Dryer, Matthew S. 1989. Plural words. Linguistics 27:865895. Dryer, Matthew S. 1992. The Greenbergian word order correlations. Language 68:81138. Emonds, Joseph E. 1976. A transformational approach to English syntax. New York, New York: Academic Press.
276
Epstein, Samuel David, Acrisio Pires, and T. Daniel Seely. 2005. EPP in T: More controversial subjects. Syntax 8:6580. Fillmore, Charles. 1968. The case for case. In Universals in linguistic theory, ed. Emmon Bach and Richard Harms, 190. New York, New York: Holt, Rinehart and Winston. Fox, Danny. 2000. Economy and semantic interpretation. Cambridge, Massachusetts: MIT Press. Frazier, Lyn. 1978. On comprehending sentences: Syntactic parsing strategies. Doctoral Dissertation, University of Connecticut, Storrs. Fukui, Naoki, and Margaret Speas. 1986. Speciers and projection. In MIT working papers in linguistics, ed. Naoki Fukui, Tova R. Rapoport, and Elizabeth Sagey, volume 8, 128172. Department of Linguistics and Philosophy. Gerdts, Donna. 1981. Object and absolutive in Halkomelem Salish. Doctoral Dissertation, University of California, San Diego. Gibson, Edward. 1998. Linguistic complexity: Locality of syntactic dependencies. Cognition 68:176. Gibson, Jean. 1980. Clause union in Chamorro and in universal grammar. Doctoral Dissertation, University of California, San Diego. Giorgi, Alessandra, and Giuseppe Longobardi. 1991. phrases. Cambridge: Cambridge University Press. The syntax of noun
Giusti, Giuliana. 1990a. Floating quantiers in Germanic. In Grammar in progress, ed. Joan Mascar and Marina Nespor, 137146. Dordrecht, The Netherlands: Foris Publications. Giusti, Giuliana. 1990b. Floating quantiers, scrambling and congurationality. Linguistic Inquiry 21:633641. Godfrey, J., E. Holliman, and J. McDaniel. 1992. switchboard: telephone speech corpus for research and development. In Proceedings of ICASSP 92, 517520. San Francisco.
277
Bibliography
Greenberg, Joseph H. 1963. Some universals of grammar with particular reference to the order of meaningful elements. In Universals of language, ed. Joseph Greenberg. Cambridge, Massachusetts: MIT Press. Grimshaw, Jane. 1979. Complement selection and the lexicon. Linguistic Inquiry 10:279326. Grimshaw, Jane. 1990. Argument structure. Cambridge, Massachusetts: MIT Press. Groat, Erich. 1999. Raising the case of expletives. In Working minimalism, ed. Samuel Epstein and Norbert Hornstein, 2743. Cambridge, Massachusetts: MIT Press. Groat, Erich M. 1995. English expletives: A minimalist approach. Linguistic Inquiry 26:354365. Gruber, Jeffrey S. 1965. Studies in lexical relations. Doctoral Dissertation, Massachusetts Institute of Technology, Cambridge, Massachusetts. Hackl, Martin. 2000. Comparative quantiers. Doctoral Dissertation, Massachusetts Institute of Technology. Harris, Zellig. 1946. From morpheme to utterance. Language 22:161183. Hazout, Ilan. 2004. The syntax of existential constructions. Linguistic Inquiry 35:393430. Heim, Irene. 1987. Where does the deniteness restriction apply evidence from the deniteness of variables. In The representation of (in)deniteness(14), ed. Eric J. Reuland and Alice G. B. ter Meulen, 2142. Cambridge, Massachusetts: MIT Press. Hornstein, Norbert. 1999. Movement and control. Linguistic Inquiry 30:6996. Hornstein, Norbert. 2000. Control in GB and minimalism. In The rst glot international state-of-the-article book, ed. Lisa L.-S. Cheng and Rint Sybesma, 2745. New York: Mouton de Gruyter. Huang, C.-T. James. 1982. Logical relations in Chinese and the theory of grammar. Doctoral Dissertation, Massachusetts Institute of Technology.
278
von Humboldt, Wilhelm. 1836. ber die Verschiedenheit des Menschlichen Sprachbaues und Ihren Einuss auf die Geistige Entwickelung des Menschengeschlechts. Berlin: Royal Academy of Sciences of Berlin. Jackendoff, Ray. 1972. Semantic interpretation in generative grammar. Cambridge, Massachusetts: MIT Press. Jackendoff, Ray. 1977. X syntax. Cambridge, Massachusetts: MIT Press. Jaeggli, Osvaldo. 1982. Topics in Romance syntax. Dordrecht Holland: Foris Publications. Kayne, Richard S. 1975. French syntax: the transformational cycle. Cambridge, Massachusetts: MIT Press. Kayne, Richard S. 1983. Connectedness. Linguistic Inquiry 14:223249. Kayne, Richard S. 1984. Connectedness and binary branching . Dordrecht, Holland: Foris Publications. Kimball, J. 1973. Seven principles of surface structure parsing in natural language. Cognition 2:1547. Kitagawa, Yoshihisa. 1986. Subjects in Japanese and English. Doctoral Dissertation, University of Massachusetts, Amherst. Klima, Edward S. 1964. Negation in English. In The structure of language: Readings in the philosophy of language, ed. Jerry A. Fodor and Jerrold J. Katz, 246323. Prentice Hall. Koopman, Hilda, and Dominique Sportiche. 1991. The position of subjects. Lingua 85:211258. Koster, Jan. 1975. Dutch as an SOV language. Linguistic Analysis 1:111136. Koster, Jan. 1978. Why subject sentences dont exist. In Recent transformational studies in European languages, ed. Samuel Jay Keyser. Cambridge, Massachusetts: MIT Press. Kratzer, Angelika. 2005. Building resultatives. In Event arguments in syntax, semantics, and discourse, ed. Claudia Maienborn and Angelika WllsteinLeisten, 177212. Tbingen: Niemeyer.
279
Bibliography
Kuroda, S.-Y. 1988. Whether we agree or not. Linguisticae Investigationes 12:1 47. Landau, Idan. 2004. The scale of niteness and the calculus of control. Natural Language and Linguistic Theory 22:811877. Langacker, Ronald W. 1969. On pronominalization and the chain of command. In Modern studies in English, ed. David A. Reibel and Sandford A. Schane, 160186. Prentice Hall. Lasnik, Howard. 1981. Restricting the theory of transformations. In Explanation in linguistics, ed. Norbert Hornstein and David Lightfoot. London: Longmans. Lasnik, Howard. 1992. Case and expletives: Notes toward a parametric account. Linguistic Inquiry 23:381406. Lasnik, Howard. 1995. Case and expletives revisited: On greed and other human failings. Linguistic Inquiry 26:615634. Lasnik, Howard, and J. Kupin. 1977. A restrictive theory of transformational grammar. Theoretical Linguistics 4:173196. Lasnik, Howard, and Mamoru Saito. 1984. On the nature of proper government. Linguistic Inquiry 15:235290. Levin, Beth. 1993. English verb classes and alternations. Chicago: University of Chicago Press. Levin, Beth, and Malka Rappaport. 1986. The formation of adjectival passives. Linguistic Inquiry 17:623661. Marantz, Alec. 1984. On the nature of grammatical relations. Cambridge, Massachusetts: MIT Press. Marcus, Mitchell, Beatrice Santorini, and May Ann Marcinkiewicz. 1993. Building a large annotated corpus of English: The Penn treebank. In Computational Linguistics 19, 313330. Martin, Roger. 1992. On the distribution and case features of PRO. Doctoral Dissertation, University of Connecticut, Storrs, University of Connecticut, Storrs.
280
Martin, Roger. 2001. Null case and the distribution of PRO. Linguistic Inquiry 32:141166. McCawley, James D. 1970. English as a VSO language. Language 46:286299. Milsark, Gary. 1974. Existential sentences in English. Doctoral Dissertation, Massachusetts Institute of Technology, Cambridge, Massachusetts. Miyagawa, Shigeru. 1989. Structure and case marking in Japanese. San Diego: Academic Press. Moro, Andrea. 1997. The raising of predicates : predicative noun phrases and the theory of clause structure. Cambridge ; New York ;: Cambridge University Press. Muysken, Pieter. 1979. Quechua causatives and Logical Form: A case study in markedness. In Theory of markedness in generative grammar, ed. Adriana Belletti, Luigi Brandi, and Luigi Rizzi. Scuola Normale Superiore, Pisa. Muysken, Pieter. 1981. Quechua word structure. In Binding and ltering , ed. Frank Heny. Cambridge, Massachusetts: MIT Press. Partee, Barbara Hall, Alice G. B. ter Meulen, and Robert Eugene Wall. 1990. Mathematical methods in linguistics. Dordrecht ; Boston: Kluwer Academic. Perlmutter, David. 1971. Deep and surface structure constraints in syntax. New York: Holt, Rinehart and Winston. Perlmutter, David. 1978. Impersonal passives and the unaccusative hypothesis. In Berkeley Linguistic Society, volume IV, 157189. University of California, Berkeley. Pesetsky, David. 1982. Paths and categories. Doctoral Dissertation, Massachusetts Institute of Technology. Pesetsky, David. 1989. The earliness principle. Paper presented at GLOW. Pires, Acrisio. 2001. Clausal and TP-defective gerunds: control without tense. In North East Linguistic Society, ed. Minjoo Kim and Uri Strauss, 390406. Georgetown University: GLSA.
281
Bibliography
Pires, Acrisio. 2006. The Minimalist syntax of Defective Domains. Amsterdam/Philadelphia: John Benjamins Publishing Company. Pollock, Jean-Yves. 1989. Verb movement, UG and the structure of IP. Linguistic Inquiry 20:365424. Postal, Paul M. 1969. On so-called pronouns in English. In Modern studies in English, ed. D. Reibel and Sandford Schane, 201244. Englewood Cliffs, New Jersey: Prentice-Hall. Reinhart, Tanya. 1976. The syntactic domain of anaphora. Doctoral Dissertation, Massachusetts Institute of Technology. Rizzi, Luigi. 1982. Issues in Italian syntax. Dordrecht, Holland: Foris Publications. Rizzi, Luigi. 1990. Relativized minimality. Cambridge, Massachusetts: MIT Press. Roberts, Craige. 1987. Modal subordination, anaphora and distributivity. Doctoral Dissertation, University of Massachusetts, Amherst. Rochemont, Michael, and Peter W. Culicover. 1990. English focus constructions and the theory of grammar. Cambridge: Cambridge University Press. Ross, John. 1967. Constraints on variables in syntax. Doctoral Dissertation, Massachusetts Institute of Technology. Roussou, Anna. 2001. Control and raising in and out of subjunctive complements. In Comparative syntax of Balkan languages, ed. Mara Luisa Rivero and Angela Ralli, 74104. Oxford: Oxford University Press. Sar, Kenneth. 1985. Syntactic chains. Cambridge: Cambridge University Press. Sar, Kenneth J. 1987. What explains the deniteness effect? In The representation of (in)deniteness(14), ed. Eric J. Reuland and Alice G. B. ter Meulen, 7197. Cambridge, Massachusetts: MIT Press. Saito, Mamoru. 1985. Some asymmetries in Japanese and their theoretical implications. Doctoral Dissertation, Massachusetts Institute of Technology.
282
Saito, Mamoru, and Naoki Fukui. 1998. Order in phrase structure and movement. Linguistic Inquiry 29:439474. Sapir, Edward. 1921. Language: an introduction to the study of speech. New York: Harcourt, Brace and Co. Sigursson, Halldr rmann. 1991. Icelandic case-marked PRO and the licensing of lexical arguments. Natural Language and Linguistic Theory 9:327363. Sportiche, Dominique. 1988. A theory of oating quantiers and its corollaries for constituent structure. Linguistic Inquiry 19:425449. Stechow, Arnim von. 1996. Against LF pied-piping. Natural Language Semantics 4:57110. Stowell, Tim. 1978. What was there before there was there. In Proceedings of the Fourteenth Regional Meeting of the Chicago Linguistic Society, ed. D. Farkas, volume Proceedings of the Fourteenth Regional Meeting of the Chicago Linguistic Society. Chicago, Illinois: The University of Chicago. Stowell, Timothy. 1981. Origins of phrase structure. Doctoral Dissertation, Massachusetts Institute of Technology. Terzi, Arhonto. 1992. PRO and obviation in modern Greek subjunctives. In The Proceedings of the Tenth West Coast Conference on Formal Linguistics, ed. Dawn Bates, 471482. Stanford, California: Center for the Study of Language and Information. Travis, Lisa. 1984. Parameters and effects of word order variation. Doctoral Dissertation, Massachusetts Institute of Technology. Valois, Daniel. 1991a. The internal syntax of DP. Doctoral Dissertation, University of California, Los Angeles. Valois, Daniel. 1991b. The internal syntax of DP and Adjective placement in French and English. In The Proceedings of the North Eastern Linguistics Society, ed. Tim Sherer, volume The Proceedings of the North Eastern Linguistics Society, 367382. Universite du Quebec a Montreal: The Graduate Students Linguistic Association, The University of Massachusetts-Amherst. Vikner, Sten. 1995. Verb movement and expletive subjects in the Germanic languages. Oxford: Oxford University Press.
283
Bibliography
Wasow, Tom. 1977. Transformations and the lexicon. In Formal syntax, ed. Peter Culicover, Tom Wasow, and Adrian Akmajian, 327360. New York, New York: Academic Press. Williams, Edwin. 1980. Predication. Linguistic Inquiry 11:203238. Williams, Edwin. 1989. The anaphoric nature of theta roles. Linguistic Inquiry 20:425456.
284