Abstract
Chomsky 2013 argues that D of an external argument in Spec TP is in principle as close to C as T is. Assuming that “inversion depends upon locality independent of category,” T and D should therefore compete with each other as candidates for raising to C in English questions, yet only T so raises. Chomsky takes this to indicate that the external argument is in its base position, Spec, vP, when C is merged. Our paper argues that this approach cannot generalize to account for why only V+v and not D of an external argument can raise to T in V-v-to-T languages. It also has major difficulties accounting for a well-known asymmetry: T raises to C only in English non-subject questions. We conclude that head-movement is sensitive to categorial and other features of lexical items, contra the claims of Chomsky 2013.
1 The problem
A central property of grammatical processes is that they are structure dependent. Chomsky 2013 (henceforth PoP) illustrates this well-established fact with Yes/No question formation in English. [1] In (1)a, fronted can must relate to the matrix verb swim and cannot link to the linearly closer fly. [2] Thus (1)a has the paraphrase (1)b, not (1)c:
Can eagles that fly swim?
Is it the case that eagles that fly can swim?
#Is it the case that eagles that swim can fly?
This follows if grammatical operations like T-to-C must be structure dependent, based on hierarchical relations rather than linear proximity.
Taking this assumption as a starting point, PoP asks two related questions: (a) why must syntactic operations exploit hierarchical conceptions of proximity and never linear conceptions? [3] and (b) assuming that “inversion depends on locality independent of category” (PoP: 43), why does T-to-C move T rather than (a subpart of) the expression in Spec TP?
PoP’s answer for (a) is that the objects of grammatical manipulation are only hierarchically specified. They have no linear order, the latter arising from the mapping to the sensory and motor systems (S&M) at spell out (SO). Thus, until an object has been transferred to S&M, phrase markers are unordered and so operations that transform them (including T-to-C I-merge) cannot refer to such order. The idea, both simple and elegant, is that grammars cannot use absent information. [4]
PoP gives a variant of this answer to (b) as well. Here’s the proposal: Were (2) the structural input for T-to-C, [5] then we would expect that either of D or T could move to C as they are equidistant from C. [6]
[C [TP [D NP] [T’ T vP]]]
PoP’s proposed solution to the locality puzzle in (2) is the following: The reason that only T moves to C is that D (and the phrase that contains it) is in its vP-internal base position when C is Merged. The structural input to T-to-C movement is not (2) but (3), on the assumption that the external argument only raises after C-T Feature Inheritance (Chomsky 2007, 2008; Richards 2007). Thus at the crucial point there is nothing as close to C as T, and that’s why T alone can move. [7]
Chomsky’s claim: only T can raise to C because the C-T relationship is established when EA is still in situ
[C [TP T… [vP [D NP] v…]]]
Summing up, T and not D raises to C because D is not there to raise. As in the solution to problem (a), the key assumption is that the derivation cannot exploit absent information.
In what follows we concentrate on PoP’s approach to (b). We argue that when a fuller range of head movement operations are considered, PoP’s conclusion, viz. that D does not count because it is not there to be moved (I-merged), turns out to be inadequate in a very important way: the crucial ambiguity regarding movement should arise in any configuration of the form in (4), where α and β correspond to XPs or intermediate projections in the traditional X’ schema and H, X, and Y are heads. [8] But PoP’s solution is not applicable in most such cases. One consequence of this state of affairs is that, when combined with the VP Internal Subject Hypothesis (VPISH), PoP assumptions falsely predict that raising D to T from an external argument should always be a licit alternative to V-to-T movement.
X and Y are equidistant from H

The second half of our paper addresses T-to-C movement in Wh-questions. It is well-known that local subject Wh-questions disallow T-to-C while all other direct Wh-questions require it (see (5)a versus (5)b,c). [9]
Which boys (*did) eat the pizza
Which pizza *(did) the boys eat
Which boys1 *(did) you say t1 ate the pizza
We argue in §3 that, like the facts of V-to-T movement, this asymmetry shows that factors other than locality are involved in motivating and constraining head-movement. This is true in each of several possible approaches to the asymmetry. Pesetsky and Torrego 2001 argue that T and a subject Wh-phrase are indeed equidistant from the local C. Both have features relevant to C, but the subject has more of them; its movement hence blocks T-to-C. A very different approach in terms of affix-hopping relies crucially on categorical features of potential hosts. Yet a third possibility, building on a PoP proposal that subject WH surface in Spec, TP, refers crucially to Q features in accounting for the asymmetry. We show that each potential account is at odds with the spirit of PoP’s purely locality-based approach to (1). The results of our investigation of T-to-C are thus more compatible with a revised approach to head-movement sketched out in Chomsky 2015. In Chomsky 2015, the subject raises to Spec, TP before C merges. The featural relation of inheritance is involved in determining that only V raises to v*, and T to C.
This paper is structured as follows. §2 explores V-to-T movement. §3 addresses the distribution of T-to-C in Wh-questions and how to account for it. §4 critiques some related aspects of Feature Inheritance in PoP. §5 concludes.
2 V-to-T: the equidistance problem
2.1 Introduction
As noted above, PoP’s proposed solution to (b), i.e. to why there is a T-to-C relation but not a D-to-C relation, exploits the fact that in this case, the configuration [H [XP, YP]] arises when XP raises from a lower position after H is Merged. The solution therefore cannot be extended to many familiar cases of head-movement. We begin in §2.2 by illustrating the problem with respect to V-to-T movement across an external argument. §2.3 briefly considers whether it is relevant that in SVO languages, the subject raises out of vP: assuming the tail of a chain is invisible, this might yield a potential remedy in terms of the relative timing of EA and V-raising. §2.4 shows that V-to-T is possible in low-subject VSO languages and D-to-T is not an available alternative; hence EA raising is not the key to this asymmetry. §2.5 adds that V-raising across negation is also erroneously ruled out by Chomsky’s solution to (b).
2.2 V-raising across EA
Consider the structural relations between T and material in vP. By the logic of PoP, V-v and D, the immediate daughters to vP and DP respectively, are equally close to T. Therefore the puzzle PoP raised for T-to-C movement arises here as well. Why is it that in (6) there can be a V-v-to-T relation but not a D-to-T relation? In fact, D and V-v should be equally available for a relation to T, since both are equally close to it. Minimal search should obtain an ambiguous result. Yet V-v can raise to T in familiar languages, and D cannot. [10]
Minimal search should find D and v equidistant from T
... [T T [α... DP... vP]]
PoP’s argument that C-to-T is evidence of a C-T relation prior to EA-raising is weakened by the recurrence of the same phenomenon at this lower point in the clause. A parallel account of V-v-to-T movement would have to claim that a relation existed between T and v prior to Merge of the EA. However, in this case, there is no apparent lower position to shunt the external argument to in order to finesse the problem. Thus, if VPISH is correct, PoP’s proposed solution to (b) above is too narrow to account for the regularities of V-v-to-T movement. [11],[12]
2.3 A timing account?
Suppose the external argument raised to Spec, TP before V-v-to-T movement. Its unpronounced copy in vP would be invisible to T under PoP assumptions, hence solving the problem we presented in §2.2.
EA raises making v unambiguously closest to T
[TP EA T [vP <EA> [v VP]]
V-v subsequently raises to T
[TP EA v+T [vP <EA> v VP]]
This timing provides a potential explanation for why it is true that only V-v, and not D, can raise to T in languages with verb raising. But it gives rise to other difficulties. For one thing, it appears to conflict with PoP’s account of T-to-C. Recall that PoP explains the fact that only T (and not D of spec T) raises to C in questions by assuming that the C-T relation is established BEFORE the EA raises from Spec, vP to spec TP. Crucially, PoP assumes that D of the subject cannot raise to C because the subject is “not there” (see (3), repeated below).
PoP’s claim: only T can raise to C because the C-T relationship is established when EA is still in situ
[C [TP T... [vP [D NP] v...]]]
To maintain PoP’s timing solution to the original puzzle while at the same time adopting the timing solution we just suggested for (6), we would have to assume that T-to-C and EA-raising both precede V-v-to-T. This is highly stipulative, suspiciously counter-cyclic, and incompatible with the fact that V-v-to-T appears to feed T-to-C movement in many languages. We illustrate with Standard Norwegian in (8)a (from Taraldsen 1986: 8, who adopts a V-to-C analysis; the representation in (8)b is a slight update). [13]
| Hvilket | spørtsmål | skjønte | Jens ikke? |
| Which | question | understood | Jens not |
‘Which question did Jens understand?’
| [CP Hvilket | spørtsmål | skjønte+T+C | [TP Jens [T’. | ..ikke...]]] |
| which | question | understood | Jens | not |
Another problem arises for any appeal to timing in relation to these issues. Recall that PoP adopts both the VPISH and the Feature Inheritance hypothesis of Chomsky (2007, 2008) and Richards 2008. Under FI, only phase heads have edge and Agree features, and hence after operations in the vP phase are completed, there are no further operations until C is merged. Chomsky 2008: 151 writes, “It follows that the edge and Agree properties of P[hase] H[ead] apply in parallel: EF raises XP… to Spec, PH, while Agree values all uninterpretable features and may or may not raise XP to form an A-chain…. the edge and Agree features of the probe apply in either order, or simultaneously…”
This approach raises questions about whether it is even possible to claim that the C-T relation necessarily precedes EA raising, as PoP does; or to couch a potential account of V-v-to-T in terms of timing. Any timing solution is incompatible with simultaneity, since simultaneity expressly denies timing.
Summing up, we have explored the possibility of a timing solution to the puzzle of V-v-to-T across EA in Spec, vP. We have argued that such a solution is incompatible with PoP’s approach to T-to-C. Lastly we have argued that, in fact, any timing solution is incompatible with the simultaneity of application of operations internal to a phase proposed in Chomsky (2007, 2008).
We turn in §2.4 to a final piece of evidence against such a timing approach, namely that it is founded on the assumption that competition between EA and V-v-to-T are avoided only if EA raises to Spec, TP. §2.4 shows that V-v-to-T is the only option in various cases where EA does NOT raise to Spec, TP.
2.4 Low subject VSO languages
VSO clauses in languages such as Standard Arabic, Middle Egyptian, and Xhosa provide evidence that V-v-to-T is possible across subjects that remain low. [14] In such cases no timing approach could explain why D-raising from EA does not compete with V-v-to-T raising.
The morpho-syntax of VSO varies considerably across languages, and there are accordingly a number of approaches to it. In Standard Arabic, where only SVO is accompanied by full subject agreement, Mohammad (2000) argues that a sentence like (9) is the product of V-to-v-to-T across the in situ subject (see also Fassi Fehri 1993, Melebari and Seely 2012, and see Kramer 2009 for an analysis of Middle Egyptian along the same lines). By the logic of PoP’s approach to (1), in a VSO language with this derivation, D of the subject should be equidistant from T and the crucial ambiguity should arise. [15]
| Tuhibbu | kull-u | ʔumm-in | ʔibn-a-ha |
| loves.3SG.FEM | every-NOM | mother-GEN | son-ACC-her |
‘Every mother loves her son’
Subparts of EA and V+v are equally close to T, but only V+v raises
[TP... T [vP [every mother] [v’ love+v [VP <love> her son]]]]

Minimal search fails to single out V+v alone
Carstens and Mletshe (2013) provide examples of VSO in Xhosa embedded clauses with overt complementizers. They argue that in a clause with default (= Class 17) subject agreement like (10)a, the subject remains very low in the structure. The verb raises across it and adjoins to a middle-field inflection below Tense, identified in the Bantu linguistics literature as Mood (see (10)b); (10) adapted from Carstens and Mletshe (2013). [16] The subject cannot surface higher as (10)c,d illustrate. The locations of C and T morphemes make it particularly unlikely that EA has raised to Spec, TP in the licit example (as it would have to for the proposal explored in §2.2 to be applicable). Had it so raised, the subject would be expected to surface between okokuba – ‘that’ and the future auxiliary be, contrary to fact.
| ... okokuba | ku-be | ku-fund-a | wena | i-si-Xhosa |
| that | 17SA-FUT | 17SA-study-MOOD | you | 7-7-Xhosa |
| ‘...that you will study Xhosa’ | ||||
[lit: that will study you Xhosa]
[CP that [TP FUT [MoodP study+MOOD ... [αyou ... <study+v> Xhosa ...]]]]
*...okokuba wena ku-be ku-fund-a i-si-Xhosa
*...okokuba ku-be wena ku-fund-a i-si-Xhosa
We conclude that verb-raising is possible in Xhosa across an EA that remains low. As (11) illustrates, this means that prior to V+v-raising, D of the EA is at least as close to Mood as V+v is. But the contrast in (12) shows that D cannot raise instead of V. [17]
EA and V+v are equally close to Mood.
[MoodP... Mood ... [α [EA] [study+v [VP ...]]]]
D cannot move instead of V:
| .... okokuba | ku-be | ku-fund-a | lo |
| that | 17SA-FUT | 17SA-study-MOOD | 1this |
| mntwana <kufund> | i-si-Xhosa | ||
| 1child | 7-7-Xhosa | ||
‘that this child will study Xhosa’
| *... okokuba | ku-be | lo(-a) <lo> | m-ntwana |
| that | 17SA-FUT | 1this(-MOOD) | 1-1child |
| ku-fund(-a) | i-si-Xhosa | ||
| 17SA-study(-MOOD) | 7-7-Xhosa | ||
‘that this child will study Xhosa’
2.5 V-raising across intervening negation
A second class of problems for PoP (and the strategy sketched out in §2.3) lies in one of the standard diagnostics for V-v-to-T movement. Following Pollock 1989, we take the presence of negation between the surface position of V and its object to indicate that V has raised out of VP.
| Je | n’ | aime | pas | les | fraises |
| I | ne | like | not | DET | strawberries |
‘I don’t like strawberries’
[TP SU... V+v+T [NegP Neg [vP <SU> <v>...]]]
Recall PoP’s proposal that only hierarchical relations (not category) are involved in the calculation of closeness relevant to T-to-C movement. If we extend this idea to V-v-to-T movement, then it cannot be relevant whether the raised item is a verb, and Neg itself ought to be a candidate for movement to T, contrary to fact. If we assume that the subject does not pass through a Spec, NegP en route to Spec TP, then Neg should be the only candidate, since it is most local to T (see (14)a). Neg’s structural relation to T parallels that of T to C in (3), repeated below. If we assume instead that the subject occupies Spec, NegP at a point before verb-raising, then minimal search should yield an ambiguous result. Neg and D in the subject should compete for raising to T in (14)b, just as PoP argues would be true of T-to-C across a subject in Spec, TP in the hypothetical (2). In neither case is raising of the verb expected to cross Negation.
Minimal search should find and raise Neg to T, not v
T [NegP Neg [vP EA v...]]
If EA raises to Spec, Neg, minimal search should yield ambiguous results
T [NegP EA Neg [vP <EA> v...]]
Chomsky’s claim: only T can raise to C because the C-T relationship is
established when EA is still in situ
[C [TP T... [vP [D NP] v...]]]
3 Wh-questions: the subject/non-subject asymmetry
3.1 The problem
Consider now a second case of English T-to-C movement not addressed in PoP. T-to-C occurs in Wh-questions (WHQ) as well as Yes/No questions. Subject and non-subject Wh-questions display a well-known asymmetry: [18]
Which boys (*did) eat the pizza
Which pizza *(did) the boys eat
Which boys1 *(did) you say t1 ate the pizza
As (15) demonstrates, T-to-C is obligatory in all direct English WHQ except local subject questions like (15)a, where it is forbidden. (16) sketches out the relevant structure before Wh- movement for the three examples prior to C-to-T Feature Inheritance (Chomsky 2007, 2008). Shading indicates areas to which Transfer has applied; following Chomsky 2000 we assume the VP complement of v* Transfers before Wh moves to Spec, CP. This forces all but local subject WHs to move to the edge of the local vP phase edge, as is commonly assumed. Local subjects are externally Merged to the vP phase edge.
C [TP T [vP which boys [v [VP eat the pizza]]
C [TP T [vP which pizza [vP the boys [v [VP eat <which pizza>]]]]
C [TP T [vP which boys [vP you [v [VP say...]]]]
The problem for deriving (15)a-c should be evident: there is no difference between subject and non-subject WHQ in terms of the relation between T and C or in terms of distance between C and WH, and hence no clear basis on which to predict when T-to-C movement applies.
3.2 Pesetsky and Torrego’s (2001) solution
There is a simple way around this problem. Suppose with Pesetsky and Torrego 2001, and contra Chomsky 2007, 2008, that subject WH movement in English proceeds to Spec, CP via Spec TP, rather than directly from Spec, vP. [19] Representations of the two cases are sketched in (17) (relative locations of T and the Wh-phrase being crucial).
C [TP which boys [T [vP <which boys> [v [VP eat the pizza]]]]]
C [TP the boys T... [vP which pizza [<the boys> v [VP eat <which pizza>]]]]
In (17)b, T is clearly closer to C than the Wh-phrase is, in marked contrast to (17)a. This assumption allows a potential account of the suppression of T-to-C movement in subject questions like (15)a on the basis of locality, much in the spirit of the PoP attempt to explain why C rather than D raises in (3). That is, in (17)b T raises to C since T is unambiguously close(est) unlike in (17)a. But in a departure from PoP assumptions, reference to features is crucial to ensure that a WH subject prevents T-to-C in (17) while a non-Wh-subject like that in (3) or (15)c does not, assuming subject and T are equidistant from C. Following Pesetsky & Torrego’s account, both have features relevant to C, but a Wh-subject has more of them, making a derivation that raises the WH-subject more economical than a derivation that raises both. [20]
This analysis is incompatible with PoP assumptions in several ways. First, as we saw in §2, equidistant candidates for raising should yield completely free alternatives; hence it would be expected that T and (subparts of) the Wh-subject would be equally able to raise to C(P) in a subject question. Second, PoP proposes that Wh-subjects never raise to surface in local Spec, CP, in contrast with Pesetsky & Torrego’s analysis. Hence either some PoP assumptions must be abandoned or some alternative account of this asymmetry must be found.
3.3 An affix-hopping alternative
Two anonymous reviewers suggest an alternative in terms of affix-hopping that we think it is instructive to explore. Consider again (16), a PoP-style derivation under which C merges before the subject raises to Spec, TP. Assume that T raises to C in all questions including the subject question in (16)a. After syntactic movements and transfer have applied, the results will be (18).
C [TP T [vP which boys [v’ eat+v [VP <eat> the pizza]]]]
C [TP T [vP which pizza [vP the boys [v [VP <eat> <which pizza>]]]]]
[CP Which boys T+C [TP <which boys> <T> [vP eat+v [VP...]]]]
[CP Which pizza T+C [TP the boys <T> [vP eat+v [VP...]]]]
The reviewers suggest that affix-hopping, sensitive only to pronounced material, applies successfully in (18)a (see (19)a). In (18)b, the overt subject the boys blocks it, yielding the necessity for do-support in C, as shown in (19)b (see Chomsky 1957, Lasnik 1995, Lasnik et al. 2000, Halle and Marantz 1993 among many others).
[CP Which boys __ [TP <which boys> __ [vP eat+v+PST [VP...]]]]
[CP Which pizza do+T+C [TP the boys __ [vP eat+v [...]]]]
While we see this as a viable alternative to Pesetsky & Torrego’s approach, its compatibility with PoP assumptions seems equally questionable. PoP’s claim regarding T to C is that “inversion depends on locality independent of category,” and it assumes that D of a subject is in principle as likely a candidate as T is for raising to C. The question that arises in connection with (19)b then is why affix hopping should not be equally a matter of pure locality, independent of category? But if this were the case, there would seem to be no reason for D of the subject in (19)b not to be a potential target for affix-hopping.
[CP Which pizza __ [TP the+PST boys __ [vP eat+v [VP...]]]]
If, on the other hand, the affix-hopping operation is recognized to be categorically sensitive, the solution is viable. [21]
Summing up, the subject/non-subject asymmetry for T-to-C movement can be captured under the assumption that a wh-subject in Spec, TP suppresses it a la Pesetsky & Torrego, or by hypthesizing that affix-hopping applies across the subject trace but refuses to attach to D. Under either approach, reference to categorical and other features is critical.
3.4 A speculation re: the treatment of subject questions in PoP
3.4.1 Subject questions in PoP
PoP proposes that subject and non-subject questions differ in a way that is potentially relevant to the asymmetry of concern: while non-subject WH move to Spec, CP, Wh-subjects surface in Spec TP of local subject questions in PoP. As two anonymous reviewers note, there are some interesting arguments against this conclusion in the syntactic literature including McCloskey 2000, Agbayani 2000, Ginzburg and Sag 2000, and Pesetsky and Torrego 2001); but some of these are neutralized by the approach in PoP, under which TP of a subject question inherits all the properties of an interrogative CP. For the sake of argument, we assume the proposal is viable and explore its potential consequences for T-to-C movement.
As noted in §1 and §2.3, PoP follows Chomsky 2007, 2008 and Richards 2007 in assuming that T obtains features from C. In addition to the unvalued phi-features these works discuss, PoP adds a proposal that the Q feature of an interrogative C is among those that T inherits on the basis that “features of an LI cannot move independently of the feature bundle to which they belong” (PoP:47). Copies of all C’s features including Q are therefore inherited by T in a bundle. Crucially, PoP argues that Feature Inheritance must be construed as copying “leaving Q in its original position for selection and labeling” (PoP:47, note 47). We illustrate in (20):
Feature Inheritance in PoP:
C[Q, uPhi...] T→ b. C[Q1, uPhi1...] T[Q2, uPhi2...]
Phrases obtain labels in PoP by means of a feature-seeking algorithm that applies at the phase level. In the configuration [α XP, YP], α can obtain a label if XP and YP share a “prominent feature.” In subject questions, the copy of Q on T agrees with the Wh-subject and the constituent formerly known as TP is labeled QP (see (21)a). [22] In contrast, it is the Q feature on C that agrees with a non-subject WH, labeling CP as QP (see (21)b). In this case, sharing of prominent features between the raised EA and T labels their containing phrase PhiP. Once these labelings have taken place, no further movement is possible for the XPs involved because under PoP assumptions, criterial freezing accompanies successful sharing of a prominent feature (see discussion in PoP: 47, citing Rizzi personal communication; see also Rizzi 2013, Rizzi and Shlonsky 2007). Hence the subject WH cannot raise to Spec, CP in (21)a.
C[Q1, uPhi1...] [QP [How many mechanics] [T[Q2,
uPhi2...] fixed the cars?]][QP [How many cars] C[Q1, uPhi1...] [PhiP [the mechanics] T[Q2,
uPhi2...]...]]
PoP does not address the question of why T must raise to C in the circumstances where this is required. The facts are puzzling on PoP assumptions. Feature Inheritance ensures that C and T’s features are largely identical. T therefore has nothing that C does not also have apart from their differing categories, and PoP assumptions rule out reference to category. Furthermore, C needs only its own Q-feature in order to interact with a non-subject WH-phrase for labeling in a case like (21)b, or to participate in selection.
In prior treatments including that of Pesetsky & Torrego mentioned above, C has an additional feature that requires valuation in a local relation with a matching feature of T or the subject. But in the PoP system, there is no comparable motivation for T-to-C movement in non-subject questions. It is thus somewhat mysterious that T should have to raise. If we suppose that T-to-C simply happens freely in a move-alpha sort of way, with locality the only relevant issue, it is not clear why it should be barred in (21)a. Crucially, C is present in the derivation to supply T’s features. The impossibility of T-to-C movement in subject questions therefore cannot be attributed to C being absent.
3.4.2 A speculation about multiple Qs
It seems to us in principle possible that PoP’s duplication of the Q-feature might yield a novel account of the subject/non-subject asymmetry for T-to-C in Wh-questions. Our proposal relies on one crucial assumption: retention of Qs at both CP and TP levels yields a deviant result. Where Q does not pair with a wh-phrase, it yields a Yes/No question (see Epstein et al. 2015). Therefore a structure like (22)a,b present gibberish combinations of Yes/No and wh-questions.
*[QP1 C[Q1] [QP2 [How many mechanics] T[Q2] fixed the cars?]]
*[QP1 How many cars C[Q1] [QP2 the mechanics T[Q2] fix <wh...>?]]
To rectify this problem, in a subject question like (22)a, C bearing Q1 would lower to T (see (23)). [23] In Yes/No questions, and in a non-subject question like (22)b, where the interrogative operator moves to Spec, CP, T must raise to place its copy of Q in C (see (24)). [24]
<C[Q1, uPhi1...]> [QP [How many mechanics] [C[Q1, uPhi1...]+T[Q2, uPhi2...]...]]

[QP [How many cars] T[Q2, uPhi2...]+C[Q1, uPhi1...] [PhiP SU <T[Q2, uPhi2...]>...]]

This is the only interpretation of the PoP system that we have been able to find which might capture the distribution of T-to-C in questions.
Like Pesetsky & Torrego’s approach to T-to-C movement and like the affix-hopping possibility, the hypothesis we have sketched out here ties head-movement to features of the moving element and its landing site. Absent Q features, T-to-C (and its inversion, C lowering to T) do not happen at all.
In sum, there are several promising options for capturing the distribution of T-to-C in matrix questions: raising the subject via Spec, TP; affix-hopping from C down to V+v in subject questions, or relocating an extraneous duplicate Q to the projection containing interrogative material. [25] It lies outside this paper’s scope to choose between them. What matters for our purposes is that all the possibilities take head-movement to be driven and constrained by factors other than pure locality, as we also argued to be the case for V-v-to-T. A natural move is to extend our conclusions to the question of why T and not D raises to C in questions, contrary to PoP’s approach.
4 Unsolved mysteries
PoP’s proposals about Feature Inheritance raise some important conceptual questions. While Feature Inheritance is not head movement, it is an important sub-case of head-head relations relevant to the material in this paper. We therefore offer a few observations on its implications.
Under the interpretation of Feature Inheritance as copying, it is not clear why the unvalued phi-features left on C do not cause the derivation to crash. PoP’s footnote 47 suggests that these phi-features may be “deleted or given a phonetic form (as in West Flemish) hence [are] invisible at the next phase.”
Chomsky (2001, 2007, 2008) argues that transferring features before valuation is “too early” because unvalued features cause crash; and that transferring features after valuation is “too late” because they cannot be distinguished from intrinsically valued interpretable features and will therefore also cause a crash (see also Epstein et al. 2010 for discussion). The upshot is that unvalued features must obtain values and be immediately transferred. They cannot be retained on a phase head, either valued or unvalued. Chomsky’s two suggestions – that C’s uPhi can be deleted unvalued, or given phonetic form – are not consistent with these prior positions on the status of uPhi, but PoP contains no explanation as to how these inconsistencies are to be resolved. [26]
5 Concluding remarks
PoP’s explanatory achievements fall short of its goals, as revealed when one compares its outcomes with those of traditional accounts. In more conventional analyses of head raising, selection for category and other features work in concert with hierarchical locality considerations to dictate what moves. Thus, for example, V moves to T because of some specific requirements of T that V meets; D fails to have the relevant features or properties and hence is not a potential mover in these cases (the same is true for Neg). Similarly, in cases of WHQs, features of T and C (and under Pesetsky and Torrego’s account, the subject) determine when movement does and does not apply. The features of relevance are ad hoc, however. PoP eschews such devices and tries to provide a more principled account. Sadly, the account appears to be incompatible with broad classes of phenomena and with standard assumptions about clausal architecture and the Merge location of subjects. Moreover, it is at least questionable whether one wants too principled an account for these cases. T-to-C movement and verb-raising are parametric options. It seems to us that features (including reference to category) are a reasonable way of distinguishing these grammar-specific options. If so, then PoP’s ambitions are misdirected (and Chomsky’s 2015 retreat from PoP’s pure locality approach is well-warranted). Sometimes ad hoc is just what we need. [27]
In addition to approaching T-to-C in terms of locality only, PoP presents some new proposals regarding Feature Inheritance as copying, and labeling as the driving force for XP-movement. While these have many interesting implications, we have pointed out that the approach leaves uPhi features on phase heads, raising some unresolved questions connected with the functioning and the rationale for Feature Inheritance.
[Correction added after online publication 8 January 2016: “5 Concluding remarks” section has been inserted before the paragraph “PoP’s explanatory achievements...” and the second sentence of the last paragraph “While these have many ..... workable.” has been deleted]
Acknowledgments
Thanks to Samuel Epstein and Hisatsugu Kitahara for sharing insights into PoP; to the University of Michigan for organizing the 2013 Linguistics Society of America’s summer institute where the ideas for this paper originated; to the University of Missouri’s South African Education Program for funding Xhosa research reported in §2.4; to Brandon Fry for eloquently articulating in EKS’s LSA class that things other than locality should be considered in the analysis of T-to-C, and for helpful comments on an earlier draft; and, finally, to two anonymous reviewers for ideas, comments and suggestions that improved the final version.
References
Agbayani, Brian. 2000. Wh-subjects in English and the vacuous movement hypothesis. Linguistic Inquiry 31. 703–713.10.1162/002438900554523Search in Google Scholar
Brandi, Luciana & Patricia Cordin. 1989. Two Italian dialects and the null subject parameter. In Osvaldo Jaggli & Kenneth Safir (eds.), The null subject parameter, 111–142. Dordrecht: Kluwer.10.1007/978-94-009-2540-3_4Search in Google Scholar
Bošković, Željko. 2015. On the timing of labeling: deducing Comp-trace effects, the Subject Condition, the Adjunct Condition, and tucking in from labeling. Ms., University of Connecticut.10.1515/tlr-2015-0013Search in Google Scholar
Carstens, Vicki & Loyiso Mletshe. 2015. Radical defectivity: implications of Xhosa expletive constructions. Linguistic Inquiry 46(2).187–242.10.1162/LING_a_00180Search in Google Scholar
Chomsky, Noam. 1957. Syntactic structures. Paris: Mouton.10.1515/9783112316009Search in Google Scholar
Chomsky, Noam. 1968. Language and Mind. New York: Harcourt, Brace, Jovanovitch.10.1037/e400082009-004Search in Google Scholar
Chomsky, Noam. 2001. Derivation by phase. In M. Kenstowicz (ed.), Ken Hale: a life in language, 1–52. Cambridge: MIT Press.Search in Google Scholar
Chomsky, Noam. 2007. Approaching UG from below. In U. Sauerland & H.-M. Gartner (eds.), Interfaces +Recursion = Language? 1–29. Berlin: Mouton de Gruyter..10.1515/9783110207552-001Search in Google Scholar
Chomsky, Noam. 2008. On Phases. In Robert Freidin, Carlos P. Otero & Maria Luisa Zubizarreta (eds.), Foundational issues in linguistic theory, 133–166. Cambridge, MA: MIT Press.Search in Google Scholar
Chomsky, Noam. 2013. Problems of projection. Lingua 130.33–49.10.1075/la.223.01choSearch in Google Scholar
Chomsky, Noam. 2015. Problems of Projection: Extensions. In Elisa Di Domenico, Cornelia Hamann & Simona Matteini (eds.), Structures, strategies and beyond – studies in honour of Adriana Belletti, 3–16. Amsterdam/Philadelphia: John Benjamins.10.1075/la.223.01choSearch in Google Scholar
Epstein, Sam, Hisa Kitahara, & Daniel Seely. 2010. Uninterpretable features: what are they and what do they do? In Michael Putnam (ed.), Exploring crash-proof grammars, 125–142. Amsterdam: John Benjamins.10.1075/lfab.3.07epsSearch in Google Scholar
Epstein, S. D., H. Kitahara & T. D. Seely. 2012. Structure building that can’t be!. In M. Uribe-etxebarria & V. Valmala (eds.), Ways of structure building, 253–270. Oxford: Oxford University Press.10.1093/acprof:oso/9780199644933.003.0011Search in Google Scholar
Epstein, Samuel, H.Kitahara & T. D.Seely. 2015. ‘What do you wonder’ is not syntactic. In Samuel Epstein, H.Kitahara & T. D.Seely (eds.), Explorations in maximizing syntactic, inimization, 222–239. Routledge: Routledge Leading Linguists Series.10.4324/9781315722764Search in Google Scholar
Erlewine, Michael. to appear. Anti-locality and optimality in Kaqchikel Agent Focus. Natural Language and Linguistic Theory.10.1007/s11049-015-9310-zSearch in Google Scholar
Fassi Fehri, Abdelkader. 1993. Issues in the structure of Arabic clauses and words. Dordrecht: Kluwer Publications.10.1007/978-94-017-1986-5Search in Google Scholar
Ginzburg, Jonathan & Ivan A. Sag. 2000. Interrogative Investigations. Palo Alto: CSLI Publications.Search in Google Scholar
Halle, Morris & Alec Marantz. 1993. Distributed Morphology and the pieces of inflection. In K. Hale & S. J. Keyser (eds.), The view from Building 20: Essays in linguistics in honor of Sylvain Bromberger, 1–52. Cambridge, MA: MIT Press.Search in Google Scholar
Kandybowicz, Jason. 2008. On the incompatibility of Feature Inheritance and the PIC. Ms., Swarthmore College.Search in Google Scholar
Kramer, Ruth. 2009. VSO and SVO word order in Middle Egyptian. In Charles G. Häberl (ed.), Afroasiatic studies in memory of Robert Hetzron: Proceedings of the 35th annual North American conference on Asiatic linguistics, 92–147. Cambridge: Cambridge Scholars Press.Search in Google Scholar
Lasnik, Howard. 1995b. Verbal morphology: Syntactic Structures meets the Minimalist Program. In H. Campos & P. Kempchinsky (eds.), Evolution and revolution in linguistic theory: Essays in honor of Carlos Otero, 251–275. Washington, DC: Georgetown University Press.Search in Google Scholar
Lasnik, Howard (with Marcela Depiante and Arthur Stepanov). 2000. Syntactic structures revisited. Cambridge: MIT Press.10.7551/mitpress/6592.001.0001Search in Google Scholar
McCloskey, James. 2005. Predicates and heads in Irish clausal syntax. In Andrew Carnie, Heidi Harley & Sheila Ann Dooley (eds.), Verb first: on the syntax of verb-initial languages, 155–174. Amsterdam: John Benjamins.10.1075/la.73.12mccSearch in Google Scholar
McCloskey, James. 2000. Quantifier float and Wh-movement in an Irish English. Linguistic Inquiry 31(1). 57–84.10.1162/002438900554299Search in Google Scholar
McCloskey, James. 1996. On the scope of verb movement in Irish. Natural Language and Linguistic Theory 14. 47–104.10.1007/BF00133403Search in Google Scholar
McCloskey, James. 2011. The shape of Irish clauses. In Andrew Carnie (ed.) Formal approaches to Celtic linguistics, 143–178. Newcastle upon Tyne: Cambridge Scholars Publishing.Search in Google Scholar
McCloskey, James. 2012. Irish existentials in context. To appear in Syntax.Search in Google Scholar
Melebari, Ala’a & T. Daniel Seely. 2012. Agreement without ‘exception’ in Standard Arabic? Presentation at the Illinois Symposium on Semitic Linguistics, University of Illinois, Urbana-Champaign.Search in Google Scholar
Mohammad, Mohammad A. 2000. Word order, agreement, and pronominalization in Standard and Palestinian Arabic. Amsterdam and Philadelphia: John Benjamins.10.1075/cilt.181Search in Google Scholar
Ouali, Hamid. 2008. On C-to-T ɸ-feature transfer: the nature of agreement and anti-agreement in Berber. In: Rl D’Alessandro, G. H. Hrafnbjargarson & S. Fischer (eds.), Agreement Restrictions, 115–18. Berlin: Mouton de Gruyter.10.1515/9783110207835.159Search in Google Scholar
Pesetsky, David & Esther Torrego. 2001. T-to-C movement: Causes and consequences. In M. Kenstowicz (ed.), Ken Hale: A life in language, 355–426. Cambridge: MIT Press.Search in Google Scholar
Piatelli-Palmarini, M. (ed.), 1980. Language and learning: the debate between Jean Piaget and Noam Chomsky. Cambridge, MA: Harvard University Press.Search in Google Scholar
Pollock 1989. Verb movement, Universal Grammar and the structure of IP. Linguistic Inquiry 20(3). 365–424.Search in Google Scholar
Richards, M. 2007. On feature inheritance: an argument from the phase impenetrability condition. Linguistic Inquiry 38. 563–572.10.1162/ling.2007.38.3.563Search in Google Scholar
Rizzi, Luigi. 2013. Cartography, criteria, and labeling. Ms, Università di Sienna.Search in Google Scholar
Rizzi, Luigi & Ur Shlonsky. 2007. Strategies for subject extraction. In U. Sauerland & H.-M. Gartner (eds.), Interfaces + recursion = language?; Chomsky’s minimalism and the view from syntax-semantics. Studies in Generative Grammar 89, 115–160. Berlin: Mouton de Gruyter.Search in Google Scholar
Roberts, Ian. 2010. Agreement and head-movement: Clitics, incorporation, and defective goals. Cambridge: MIT Press.10.7551/mitpress/9780262014304.001.0001Search in Google Scholar
Schneider-Zioga, Patricia. 2007. Anti-agreement, anti-locality, and minimality. Natural language & Linguistic Theory 25.403–446.10.1007/s11049-006-9014-5Search in Google Scholar
Taraldsen, Knut Tarald. 1986. On verb-second and the functional content of syntactic categories. In Haider, Hubert & Martin Prinzhorn (eds.), Verb second phenomena in the Germanic languages, 7–25. Dordrecht: Foris Publications.Search in Google Scholar
©2016 by De Gruyter Mouton
Articles in the same Issue
- Frontmatter
- In Memoriam of Yoshiyuki Shibata
- Introduction
- On the timing of labeling: Deducing Comp-trace effects, the Subject Condition, the Adjunct Condition, and tucking in from labeling
- Head-head relations in Problems of projection
- Phase cancellation by external pair-merge of heads
- Labeling, maximality and the head – phrase distinction
- (A) Case for labeling: labeling in languages without ɸ-feature agreement
- Labeling through Spell-Out
Articles in the same Issue
- Frontmatter
- In Memoriam of Yoshiyuki Shibata
- Introduction
- On the timing of labeling: Deducing Comp-trace effects, the Subject Condition, the Adjunct Condition, and tucking in from labeling
- Head-head relations in Problems of projection
- Phase cancellation by external pair-merge of heads
- Labeling, maximality and the head – phrase distinction
- (A) Case for labeling: labeling in languages without ɸ-feature agreement
- Labeling through Spell-Out