Documentation

Linglib.Phenomena.Alignment.Typology

Morphosyntactic Alignment Typology (WALS Chapters 98--100) #

@cite{comrie-1978} @cite{comrie-2013} @cite{dixon-1994} @cite{dryer-haspelmath-2013} @cite{dixon-1972} @cite{haspelmath-2021}

Formalizes three chapters from the World Atlas of Language Structures (WALS) covering morphosyntactic alignment -- the way languages mark the core grammatical relations S (sole argument of intransitive), A (agent of transitive), and P (patient of transitive):

Alignment Types #

Five alignment patterns recur across all three domains:

Key Generalizations #

Accusative alignment dominates cross-linguistically in all three domains. However, the three domains frequently diverge within a single language -- the phenomenon of split ergativity. @cite{dixon-1994} established the generalization that ergative case marking is more common for full NPs than for pronouns: many "ergative" languages are ergative for NPs but accusative for pronouns (e.g., Dyirbal, many Australian languages). Tripartite and active systems are typologically rare.

Morphosyntactic alignment type for case marking or verbal person marking.

These five categories classify how a language groups the three core grammatical relations S, A, P:

  • S = sole argument of an intransitive verb ("The man fell")
  • A = agent-like argument of a transitive verb ("The man hit the dog")
  • P = patient-like argument of a transitive verb ("The man hit the dog")

The alignment determines which relations receive the same morphological treatment (case marking or agreement pattern).

  • neutral : AlignmentType

    Neutral (S = A = P): no morphological distinction among the three relations. Common in isolating languages (e.g., Mandarin, Thai).

  • accusative : AlignmentType

    Accusative (S = A ≠ P): subject and agent grouped as "nominative", patient distinctly marked as "accusative". The most common pattern worldwide (e.g., English, Latin, Russian, Japanese).

  • ergative : AlignmentType

    Ergative (S = P ≠ A): absolutive groups S and P, while A receives distinct "ergative" marking (e.g., Basque, Dyirbal NPs, Hindi perfective).

  • tripartite : AlignmentType

    Tripartite (S ≠ A ≠ P): all three relations distinctly marked. Extremely rare (e.g., Nez Perce pronouns, some Australian languages).

  • active : AlignmentType

    Active / split-S: intransitive S is split -- agent-like S patterns with A, patient-like S patterns with P. The split is typically lexically or semantically determined (e.g., Georgian, Guarani).

Instances For
    Equations
    • One or more equations did not get rendered due to their size.
    Instances For

      Whether this alignment groups S with A (nominative-accusative pattern).

      Equations
      Instances For

        Whether this alignment groups S with P (absolutive-ergative pattern).

        Equations
        Instances For

          A language's alignment profile, combining classifications from all three WALS alignment chapters.

          This structure records a single language's position in each of the three typological dimensions: case marking on full NPs (Ch 98), case marking on pronouns (Ch 99), and verbal person marking (Ch 100). Languages frequently have different alignment types in different domains -- this is the phenomenon of split ergativity.

          • name : String

            Language name

          • iso639 : String

            ISO 639-3 code

          • npAlignment : AlignmentType

            Ch 98: Alignment of case marking of full NPs

          • pronAlignment : AlignmentType

            Ch 99: Alignment of case marking of pronouns

          • verbAlignment : AlignmentType

            Ch 100: Alignment of verbal person marking

          • notes : String

            Notes on the alignment system

          Instances For
            Equations
            • One or more equations did not get rendered due to their size.
            Instances For
              Equations
              • One or more equations did not get rendered due to their size.
              Instances For
                Equations
                Instances For

                  Whether NP and pronoun alignment match (no split ergativity in case).

                  Equations
                  Instances For

                    Whether the language shows the classic NP-ergative / pronoun-accusative split (Dixon's generalization).

                    Equations
                    Instances For

                      Whether all three domains have the same alignment.

                      Equations
                      Instances For

                        Whether the language has any ergative alignment in any domain.

                        Equations
                        • One or more equations did not get rendered due to their size.
                        Instances For

                          English: accusative case marking on pronouns (I/me, he/him), no case on full NPs (neutral), and accusative verb agreement (verb agrees with subject = S/A grouping).

                          Equations
                          • One or more equations did not get rendered due to their size.
                          Instances For

                            Hindi-Urdu: split-ergative. Ergative case marking (-ne on A) in perfective aspect for both NPs and pronouns. Accusative alignment in imperfective. WALS codes the dominant pattern as ergative for NPs, accusative for pronouns (the pronoun forms are less consistently ergative). Verb agreement is neutral (agrees with unmarked argument).

                            Equations
                            • One or more equations did not get rendered due to their size.
                            Instances For

                              Basque: consistently ergative across all three domains. Ergative case (-k) marks A on both NPs and pronouns. Verb agreement cross-references S/P (absolutive) and A (ergative) with distinct slots -- ergative alignment in agreement.

                              Equations
                              • One or more equations did not get rendered due to their size.
                              Instances For

                                Dyirbal (Pama-Nyungan, Australia): the textbook case of split ergativity. Full NPs have ergative case marking, but 1st/2nd person pronouns follow accusative alignment. Verb has no person agreement (neutral).

                                Equations
                                • One or more equations did not get rendered due to their size.
                                Instances For

                                  Georgian: active (split-S) alignment. The verb agreement system shows an active pattern: some intransitive S arguments pattern like A, others like P, depending on the verb class (unergative vs. unaccusative). Case marking is also active in some series.

                                  Equations
                                  • One or more equations did not get rendered due to their size.
                                  Instances For

                                    Tagalog: neutral case marking on NPs and pronouns (the Philippine voice system does not straightforwardly map to accusative/ergative alignment). Verbal person marking is also neutral in the WALS sense.

                                    Equations
                                    • One or more equations did not get rendered due to their size.
                                    Instances For

                                      Japanese: accusative case marking on NPs (ga nom, o acc) and pronouns. No verb person agreement (neutral).

                                      Equations
                                      • One or more equations did not get rendered due to their size.
                                      Instances For

                                        Latin: accusative case marking on both NPs and pronouns (nominative vs. accusative). Verb agreement is accusative (agrees with S/A).

                                        Equations
                                        • One or more equations did not get rendered due to their size.
                                        Instances For

                                          Russian: accusative alignment across all three domains. Nominative marks S/A, accusative marks P (with animacy-conditioned syncretism). Verb agrees with S/A.

                                          Equations
                                          • One or more equations did not get rendered due to their size.
                                          Instances For

                                            Mandarin Chinese: neutral alignment in all three domains. No morphological case marking, no verb person agreement. Grammatical relations encoded by word order.

                                            Equations
                                            • One or more equations did not get rendered due to their size.
                                            Instances For

                                              Turkish: accusative case marking on NPs (with differential object marking) and pronouns. Verb agreement is accusative (agrees with S/A).

                                              Equations
                                              • One or more equations did not get rendered due to their size.
                                              Instances For

                                                Tongan (Austronesian): ergative case marking on NPs. Pronouns are also ergative. No verb person agreement (neutral).

                                                Equations
                                                • One or more equations did not get rendered due to their size.
                                                Instances For

                                                  Guarani (Tupian): active (split-S) case marking and verb agreement. The set of prefixes used on the verb depends on whether the S argument is agent-like or patient-like.

                                                  Equations
                                                  • One or more equations did not get rendered due to their size.
                                                  Instances For

                                                    Samoan (Austronesian): ergative case marking on full NPs (ergative particle e before A). Pronouns have accusative alignment. Verb agreement is neutral.

                                                    Equations
                                                    • One or more equations did not get rendered due to their size.
                                                    Instances For

                                                      German: accusative case marking on NPs and pronouns (nom/acc distinction). Verb agreement is accusative (agrees with S/A).

                                                      Equations
                                                      • One or more equations did not get rendered due to their size.
                                                      Instances For

                                                        Swahili (Bantu): neutral case marking (no morphological case on NPs or pronouns). Verb agreement is accusative (subject prefixes cross-reference S/A).

                                                        Equations
                                                        • One or more equations did not get rendered due to their size.
                                                        Instances For

                                                          Tibetan (Central, Lhasa): ergative case marking on NPs and pronouns (ergative particles -gis and -kyis on A). Verb is neutral (no person agreement).

                                                          Equations
                                                          • One or more equations did not get rendered due to their size.
                                                          Instances For

                                                            Nez Perce (Sahaptian): tripartite case marking on NPs (nominative for S, ergative for A, accusative for P -- all three distinctly marked). Pronouns are also tripartite. Verb agreement is accusative.

                                                            Equations
                                                            • One or more equations did not get rendered due to their size.
                                                            Instances For

                                                              Finnish: accusative case marking on NPs and pronouns (nominative for S/A, partitive/accusative for P). Verb agreement is accusative (agrees with S/A).

                                                              Equations
                                                              • One or more equations did not get rendered due to their size.
                                                              Instances For

                                                                Warlpiri (Pama-Nyungan, Australia): ergative case marking on full NPs (ergative -ngku and -rlu on A). Pronouns have accusative alignment (nom and acc). Verb agreement is neutral (no person marking on verb in the standard analysis).

                                                                Equations
                                                                • One or more equations did not get rendered due to their size.
                                                                Instances For

                                                                  Dargwa (Tanti; Nakh-Dagestanian): consistently ergative across all three domains. Ergative -li marks A on both full NPs and pronouns (@cite{sumbatova-2021} §4.4.3). Gender agreement on the verb is controlled by the absolutive argument (ergative alignment). Person agreement follows a hierarchical pattern (1,2 > 3; abs > erg) but the primary dimension is ergative. No tense-conditioned or animacy-conditioned split.

                                                                  Equations
                                                                  • One or more equations did not get rendered due to their size.
                                                                  Instances For

                                                                    Yukatek Maya (Yucatecan Mayan): aspect-conditioned split-intransitive system. In perfective clauses (completive/subjunctive status), S is cross-referenced by set-B markers (ergative pattern). In imperfective clauses (incompletive status), S is cross-referenced by set-A markers (accusative pattern). Transitive marking is not affected by the split.

                                                                    @cite{bohnemeyer-2004}: the split is not conditioned by lexical verb class (not split-S or fluid-S) but by viewpoint aspect, making it a typologically rare case of aspect-conditioned split intransitivity. WALS codes Yukatek verbal person marking as "split" (F100A).

                                                                    Equations
                                                                    • One or more equations did not get rendered due to their size.
                                                                    Instances For

                                                                      All alignment profiles in our sample.

                                                                      Equations
                                                                      • One or more equations did not get rendered due to their size.
                                                                      Instances For

                                                                        All ISO 639-3 codes are non-empty.

                                                                        All ISO 639-3 codes are exactly 3 characters (standard length).

                                                                        No duplicate ISO codes (each language appears once).

                                                                        Spot-checks that each language has the expected alignment values.

                                                                        Generalization 1: Accusative is the most common alignment for pronouns. #

                                                                        Across WALS Ch 99, accusative (83) outnumbers all other pronoun alignment types. In our sample, accusative pronoun marking is likewise the most frequent pattern.

                                                                        Generalization 2: Ergative NP marking is more common than ergative #

                                                                        pronoun marking (Dixon's generalization).

                                                                        @cite{dixon-1994} established that in the animacy/nominal hierarchy, ergative case marking is more likely to appear on full NPs than on pronouns. In our sample, more languages have ergative NP alignment than ergative pronoun alignment.

                                                                        Generalization 3: Split ergativity -- ergative NPs with accusative #

                                                                        pronouns.

                                                                        Many "ergative" languages show a split: NPs are ergative but pronouns are accusative. This pattern is attested multiple times in our sample (Dyirbal, Hindi-Urdu, Samoan, Warlpiri).

                                                                        Generalization 4: No language has accusative NPs with ergative pronouns. #

                                                                        The reverse of Dixon's split (accusative NPs, ergative pronouns) is predicted not to occur. Our sample confirms this: whenever pronouns are ergative, NPs are at least ergative too.

                                                                        Generalization 5: Tripartite alignment is extremely rare. #

                                                                        In WALS Ch 98, only 2 out of 190 languages have tripartite NP alignment. In our sample, only Nez Perce is tripartite for NPs.

                                                                        Generalization 6: Active alignment is rare for case marking. #

                                                                        Active (split-S) systems are uncommon for case marking. In our sample, only Georgian shows active case marking on NPs and pronouns.

                                                                        Generalization 6a: Aspect-conditioned split intransitivity. #

                                                                        Yukatek Maya and Georgian both show active (split-S) verbal person marking. In both languages, the split is conditioned by viewpoint aspect: perfective triggers ergative-like marking, imperfective triggers accusative-like marking. @cite{bohnemeyer-2004} argues this reduces to a single linking-by-viewpoint mechanism projected from the causal chain of subevents in event structure.

                                                                        Generalization 7: Languages with ergative NP marking tend to have #

                                                                        ergative or neutral verbal person marking.

                                                                        When NP case is ergative, the verb agreement is typically either ergative (like Basque) or neutral (like Dyirbal, Tibetan). Accusative verb agreement with ergative NP case is uncommon.

                                                                        Generalization 8: All five alignment types are attested for NPs. #

                                                                        Our sample covers every WALS alignment category for full NPs.

                                                                        Generalization 9: All five alignment types are attested for pronouns. #

                                                                        Generalization 10: Four of the five alignment types are attested for #

                                                                        verbal person marking. Tripartite verb agreement is exceedingly rare cross-linguistically (WALS Ch 100 lists only 5 out of 378 languages) and is not represented in our sample.

                                                                        Generalization 11: Neutral NP alignment implies neutral or accusative #

                                                                        pronoun alignment.

                                                                        Languages without case marking on NPs (neutral) either also lack case marking on pronouns (neutral) or have accusative pronoun case (like English, where case survives only on pronouns). They never have ergative pronoun case without NP case.

                                                                        Generalization 12: Fully uniform alignment is common. #

                                                                        Many languages have the same alignment type across all three domains. This includes languages like Basque (uniformly ergative), Mandarin (uniformly neutral), Latin/Russian/Turkish (uniformly accusative), and Georgian (uniformly active).

                                                                        Generalization 13: Accusative verb agreement is the dominant pattern #

                                                                        among languages that have verb agreement.

                                                                        Among languages with non-neutral verb alignment, accusative agreement (verb agrees with S/A, i.e., subject agreement) is the most common.

                                                                        Number of languages with ergative NP alignment.

                                                                        Equations
                                                                        • One or more equations did not get rendered due to their size.
                                                                        Instances For

                                                                          Number of languages with ergative pronoun alignment.

                                                                          Equations
                                                                          • One or more equations did not get rendered due to their size.
                                                                          Instances For

                                                                            Languages with no case on NPs should not have ergative NP alignment.

                                                                            In our sample, every language with accusative NP case also has accusative pronoun case. (Accusative alignment does not split across the NP/pronoun divide the way ergative does.)

                                                                            No language has tripartite NP alignment without also having tripartite pronoun alignment (in our sample).

                                                                            Silverstein's Hierarchy #

                                                                            @cite{silverstein-1976} predicts that ergative marking targets the less prominent end of the animacy/definiteness scale. More prominent NPs (pronouns, 1st/2nd person) get accusative treatment; less prominent NPs (full NPs, 3rd person, inanimate) get ergative treatment.

                                                                            The silverstein function encodes this as a threshold-based predicate over prominence values, and its monotonicity property captures the implicational nature of the hierarchy: if a less-prominent NP gets accusative treatment, all more-prominent NPs do too.

                                                                            The dyirbalSplit instantiates Core.SplitErgativity for Dyirbal's animacy-conditioned split, connecting the split mechanism to the dyirbal alignment profile above.

                                                                            Silverstein's hierarchy: NPs at or above the prominence threshold get accusative alignment; those below get ergative.

                                                                            Equations
                                                                            Instances For
                                                                              theorem Phenomena.Alignment.Typology.silverstein_monotone (threshold p₁ p₂ : Nat) (h_ge : p₁ p₂) (h_acc : silverstein threshold p₂ = Core.AlignmentFamily.accusative) :

                                                                              Silverstein is monotone: if prominence p₁ ≥ p₂ and p₂ gets accusative, then p₁ gets accusative. (Higher prominence → more likely accusative.)

                                                                              Silverstein predicts Dixon's generalization: with threshold 1, full NPs (prominence 0) get ergative, pronouns (prominence 1) get accusative — exactly the NP-ergative / pronoun-accusative split.

                                                                              Dyirbal-style split by animacy: human/animate → accusative, inanimate → ergative (@cite{dixon-1972}, @cite{blake-1994} Ch. 4).

                                                                              Equations
                                                                              Instances For

                                                                                The Dyirbal split matches the Dyirbal alignment profile: inanimate NPs get ergative alignment (matching dyirbal.npAlignment).

                                                                                Ditransitive alignment classifies how a language codes R (recipient) and T (theme) relative to the monotransitive roles A and P.

                                                                                The terminology follows @cite{haspelmath-2005}:
                                                                                - **Indirective**: T = P, R distinctly marked (like accusative for
                                                                                  monotransitives: the "subject-like" argument is unmarked)
                                                                                - **Secundative**: R = P, T distinctly marked (like ergative for
                                                                                  monotransitives: the "object-like" argument is unmarked)
                                                                                - **Neutral**: R = T = P (no distinction among objects)
                                                                                - **Tripartite**: R ≠ T ≠ P (all three distinctly marked)
                                                                                
                                                                                Haspelmath's key observation: indirective is to ditransitives what
                                                                                accusative is to monotransitives; secundative is to ditransitives
                                                                                what ergative is to monotransitives. 
                                                                                

                                                                                Ditransitive alignment type. Classifies how R (recipient) and T (theme) are coded relative to the monotransitive P.

                                                                                • neutral : DitransitiveAlignment

                                                                                  Neutral (R = T = P): no distinction among non-agent arguments

                                                                                • indirective : DitransitiveAlignment

                                                                                  Indirective (T = P ≠ R): R is distinctly marked, T patterns with P. Analogous to accusative alignment. E.g., English "give the book TO Mary".

                                                                                • secundative : DitransitiveAlignment

                                                                                  Secundative (R = P ≠ T): T is distinctly marked, R patterns with P. Analogous to ergative alignment. E.g., many Bantu applicatives.

                                                                                • tripartite : DitransitiveAlignment

                                                                                  Tripartite (R ≠ T ≠ P): all three roles distinctly marked

                                                                                Instances For
                                                                                  Equations
                                                                                  • One or more equations did not get rendered due to their size.
                                                                                  Instances For

                                                                                    A language's ditransitive alignment profile.

                                                                                    Instances For
                                                                                      Equations
                                                                                      • One or more equations did not get rendered due to their size.
                                                                                      Instances For
                                                                                        Equations
                                                                                        • One or more equations did not get rendered due to their size.
                                                                                        Instances For

                                                                                          English: indirective (T = P, R marked with "to"). "give the book to Mary" vs. "give Mary the book" (double-object alternation, but prepositional dative is the indirective base).

                                                                                          Equations
                                                                                          • One or more equations did not get rendered due to their size.
                                                                                          Instances For

                                                                                            German: indirective (T = ACC, R = DAT).

                                                                                            Equations
                                                                                            • One or more equations did not get rendered due to their size.
                                                                                            Instances For

                                                                                              Latin: indirective (T = ACC, R = DAT).

                                                                                              Equations
                                                                                              • One or more equations did not get rendered due to their size.
                                                                                              Instances For

                                                                                                Japanese: indirective (T = o, R = ni).

                                                                                                Equations
                                                                                                • One or more equations did not get rendered due to their size.
                                                                                                Instances For

                                                                                                  Swahili: secundative (R = P, T distinctly marked via applicative).

                                                                                                  Equations
                                                                                                  • One or more equations did not get rendered due to their size.
                                                                                                  Instances For

                                                                                                    Mandarin: neutral (R and T both unmarked, distinguished by order).

                                                                                                    Equations
                                                                                                    • One or more equations did not get rendered due to their size.
                                                                                                    Instances For
                                                                                                      Equations
                                                                                                      • One or more equations did not get rendered due to their size.
                                                                                                      Instances For

                                                                                                        Indirective is more common than secundative (parallel to accusative being more common than ergative for monotransitives).

                                                                                                        Bridge: Fragment grammatical descriptions ↔ Typology classifications #

                                                                                                        These theorems verify that the inline AlignmentProfile entries are consistent with the grammatical facts described in each language's Fragment directory. The Fragment describes what the language does (e.g., agent gets ERG, patient gets ABS); the Typology classifies that pattern (e.g., ergative NP alignment). Bridge theorems ensure the classification faithfully reflects the grammar.

                                                                                                        Languages without Fragment directories rely on WALS grounding (above) instead.

                                                                                                        Dargwa: Fragment says A=ERG, S/P=ABS → Typology says ergative NP alignment. The classification follows from the case facts.

                                                                                                        Dargwa: Fragment says alignment family is ergative → Typology profile is consistently ergative.