Toba Batak Extraction Restriction @cite{erlewine-2018} #
Theory-neutral empirical data on the extraction restriction in Toba Batak: only the pivot can undergo Ā-movement (wh-movement, relativization, clefting).
Key Empirical Findings #
- Pivot-only extraction: Only the pivot argument can be Ā-extracted. All other arguments are trapped (§2).
- Voice determines pivot: Actor Voice → agent is pivot; Object Voice → patient is pivot (§2.1).
- Voice symmetry: AV makes the agent extractable but not the patient; OV makes the patient extractable but not the agent.
- Long-distance extraction: The extracted element must be the pivot of the most deeply embedded clause, and intermediate clauses show extraction morphology (§4).
Cross-Linguistic Context #
Toba Batak's extraction restriction contrasts with:
- Tagalog/Seediq (Austronesian): Philippine-style voice/Case determines extraction via agreement morphology — a voice-as-case system. @cite{erlewine-2018} argues TB is NOT this type.
- Mam (Mayan): Extraction is unrestricted, but oblique extraction triggers a dedicated morpheme (=(y)a') as an Agree reflex on Voice⁰.
- English: No extraction restriction; gap strategy.
TB and Tagalog are both Austronesian and both use voice morphology, but the mechanism differs: Tagalog has voice-as-Case, while TB has predicate fronting + nominal licensing (DPs in Spec,CP lack a Case licensor). Mam (Mayan) shows a third pattern: unrestricted extraction with morphological reflexes of Agree.
AV + patient (non-pivot): ungrammatical.
AV + oblique (non-pivot): ungrammatical.
OV + agent (non-pivot): ungrammatical.
OV + oblique (non-pivot): ungrammatical.
AV + adjunct: grammatical (adjuncts don't need Case).
OV + adjunct: grammatical (adjuncts don't need Case).
For DP arguments: extraction is grammatical iff the extracted element is the pivot for the given voice (§2).
For non-DP adjuncts: extraction is always grammatical regardless of voice, because adjuncts don't need Case licensing (§4.3).
Voice symmetry: AV and OV are mirror images — each makes exactly one argument extractable (the pivot) and blocks all others.
TB uses structural restriction, not dedicated morpheme or voice alternation in the Tagalog sense.
TB distinguishes extracted positions via voice (which role is pivot).
Extraction data → ExtractionProfile → AH → Relativization #
Connects three independent data sources through the AH bridge:
- Individual extraction datums (from @cite{erlewine-2018})
- The ExtractionProfile summary (markedPositions)
- RelClauseMarkers (from @cite{keenan-comrie-1977})
If any link is wrong — e.g., listing .directObject as extractable when
relativization markers don't cover DO — the chain breaks.
Note: the extraction data uses ArgumentRole (agent, patient) while the
ExtractionProfile and AH use ExtractionTarget (subject, directObject).
Voice promotion means the patient's default position is DO, but its
surface extraction position is always subject (the pivot). The chain
must go through the ExtractionProfile, which encodes this fact.
Link 1→2: The ExtractionProfile's marked positions are exactly the
positions for which some voice makes a DP argument grammatically
extractable. Only .subject qualifies (the pivot position).
Link 2→3: Every extractable position (ExtractionTarget) maps to an
AH position that is covered by some Toba Batak relativization
marker. Would have failed with the old [.subject, .directObject]
because .directObject maps to AHPosition.directObject, which
no marker covers.
Full chain (all three links as a single conjunction):
(1) every grammatical DP extraction extracts the pivot,
(2) the pivot position (subject) is in markedPositions, and
(3) every marked position maps to a relativizable AH position.
Connects @cite{erlewine-2018}'s extraction data through the
ExtractionProfile to @cite{keenan-comrie-1977}'s markers.