Extraction Morphology Interface #
@cite{elkins-torrence-brown-2026} @cite{erlewine-2018} @cite{mccloskey-2002} @cite{erlewine-2016}
Theory-neutral interface for cross-linguistic extraction morphology — how languages morphologically mark that a constituent has undergone Ā-movement (wh-movement, relativization, focus fronting, etc.).
Languages vary dramatically in whether and how they track extraction:
- English: no overt marking (gap strategy)
- Austronesian (Tagalog, Malagasy): voice alternation marks which argument has been extracted
- Mayan (Mam, K'iche'): dedicated morphemes on verbal complex (Mam =(y)a', K'iche' wi)
- Celtic (Irish): complementizer changes form (aL vs. aN)
- Chamorro: agreement morphology tracks extracted position
This interface parametrizes these strategies without committing to a particular syntactic theory of how extraction works.
How a language morphologically marks extraction (Ā-movement).
This is a descriptive typology of the surface strategy; different syntactic theories will derive these differently.
- none : ExtractionMarkingStrategy
No overt marking of extraction. The extracted position is a silent gap. E.g., English "What did you buy __?"
- voiceAlternation : ExtractionMarkingStrategy
Voice alternation: the verbal voice morphology changes to mark which argument has been extracted. E.g., Tagalog Actor/Patient/Locative voice.
- dedicatedMorpheme : ExtractionMarkingStrategy
A dedicated morpheme appears on the verbal complex when extraction occurs. E.g., Mam =(y)a' on Voice⁰/Dir⁰, K'iche' wi, Irish complementizer aL.
- agreementTracking : ExtractionMarkingStrategy
Agreement morphology on the verb tracks the extracted position. E.g., Chamorro wh-agreement.
- complementizerChange : ExtractionMarkingStrategy
The complementizer changes form depending on whether extraction has occurred through its clause. E.g., Irish aL (direct) vs. aN (indirect).
- structuralRestriction : ExtractionMarkingStrategy
Extraction is structurally restricted to a designated position (the "pivot"), not by surface morphology but by clause-structural factors such as predicate fronting + anti-locality. Voice morphology determines which argument occupies the pivot, but the restriction itself is structural. E.g., Toba Batak.
- agentFocusAlternation : ExtractionMarkingStrategy
Clause-local extraction of a specific argument role (typically agent/ergative) triggers an alternation in clause structure — a "repair" that avoids a locality crash. The canonical case is Kaqchikel Agent Focus: clause-local agent extraction crashes the normal transitive because movement from Spec,TP to Spec,CP violates Spec-to-Spec Anti-Locality (SSAL), so the grammar selects an intransitive-like AF structure with distinct verbal morphology (-Vn, no Set A). Long-distance agent extraction does NOT trigger AF — the repair is locality-sensitive.
Instances For
Equations
- Interfaces.instBEqExtractionMarkingStrategy.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
The grammatical position from which extraction occurs.
This intersects with the @cite{keenan-comrie-1977} Accessibility Hierarchy
(see FillerGap/Typology.lean), but is defined independently because
extraction morphology may make finer distinctions than relativization.
- subject : ExtractionTarget
Subject (ergative/nominative) extraction
- directObject : ExtractionTarget
Direct object (accusative/absolutive) extraction
- indirectObject : ExtractionTarget
Indirect object (dative/applicative) extraction
- oblique : ExtractionTarget
Oblique (instrumental, locative, etc.) extraction
- possessor : ExtractionTarget
Possessor extraction
Instances For
Equations
- Interfaces.instBEqExtractionTarget.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Equations
Equations
- One or more equations did not get rendered due to their size.
Instances For
The thematic category of an argument being extracted: agent (external argument), patient (internal argument), or oblique.
Coarser than ThetaRole (which distinguishes agent/experiencer/
causer, patient/theme, goal/source/instrument). Used when the
relevant distinction is which macro-role is extracted, not fine-
grained thematic relations or structural positions.
Complements ExtractionTarget (structural position): ArgumentRole
identifies what is extracted; ExtractionTarget identifies where
it was extracted. The two coincide in simple active clauses
(agent = subject, patient = object) but diverge under voice
alternation (in OV, the patient becomes the subject).
- agent : ArgumentRole
- patient : ArgumentRole
- oblique : ArgumentRole
Instances For
Equations
Equations
- Interfaces.instBEqArgumentRole.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Default structural position for a given argument role (active voice).
Equations
Instances For
What is being extracted: a DP argument (which has a thematic role and needs Case licensing) or a non-DP adjunct (which has no thematic role and is Case-exempt).
This distinction drives the DP/non-DP extraction asymmetry: in predicate-fronting languages like Toba Batak, only DP extraction is restricted to the pivot; adjuncts extract freely.
- dpArg : ArgumentRole → Extractee
- adjunct : Extractee
Instances For
Equations
- Interfaces.instDecidableEqExtractee.decEq (Interfaces.Extractee.dpArg a) (Interfaces.Extractee.dpArg b) = if h : a = b then h ▸ isTrue ⋯ else isFalse ⋯
- Interfaces.instDecidableEqExtractee.decEq (Interfaces.Extractee.dpArg a) Interfaces.Extractee.adjunct = isFalse ⋯
- Interfaces.instDecidableEqExtractee.decEq Interfaces.Extractee.adjunct (Interfaces.Extractee.dpArg a) = isFalse ⋯
- Interfaces.instDecidableEqExtractee.decEq Interfaces.Extractee.adjunct Interfaces.Extractee.adjunct = isTrue ⋯
Instances For
Equations
Instances For
Equations
Equations
- Interfaces.instReprExtractee = { reprPrec := Interfaces.instReprExtractee.repr }
Equations
- One or more equations did not get rendered due to their size.
Instances For
A language's extraction morphology profile: what strategy it uses, which positions are marked, and whether the marking distinguishes between different extracted positions.
Follows the RelativizationProfile pattern from FillerGap/Typology.lean.
- language : String
Language name
- strategy : ExtractionMarkingStrategy
Primary extraction-marking strategy
- markedPositions : List ExtractionTarget
Which extraction targets trigger overt marking. Empty for
nonestrategy languages. - distinguishesPosition : Bool
Does the marking distinguish which position was extracted? E.g., Tagalog voice distinguishes subject/object/oblique; Mam =(y)a' marks only oblique.
- notes : String
Notes
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Does this profile mark extraction from a given position?
Equations
- p.marks t = p.markedPositions.any fun (x : Interfaces.ExtractionTarget) => x == t
Instances For
Does this profile use any overt extraction marking?