Documentation

Linglib.Phenomena.Agreement.NounCategorization

Noun Categorization and Agreement Typology #

@cite{aikhenvald-2000} @cite{chierchia-1998} @cite{dixon-1982} @cite{greenberg-1972}

Cross-linguistic typology of noun categorization systems, following @cite{aikhenvald-2000}. The central diagnostic is agreement: noun class/gender systems (French) require it; classifier systems (Mandarin, Japanese) lack it. This is @cite{dixon-1982}'s definitional divide.

This file provides the cross-linguistic context for the English-specific agreement data in Agreement.Basic, Agreement.DetNoun, and Agreement.Case: why do some languages use agreement-based noun categorization while others use classifiers?

Part I — Typology #

Three languages from three families mapped to NounCategorizationSystem:

French (Indo-European): Noun class / gender (2 classes). Agreement. [-arg, +pred].
Mandarin (Sino-Tibetan): Numeral classifiers (~100+). No agreement. [+arg, -pred].
Japanese (Japonic): Numeral classifiers (josūshi). No agreement. [+arg, -pred].

System descriptions are derived from Fragment data (single source of truth).

Part II — Universals #

Aikhenvald's empirical generalizations (Chapters 11, 15): agreement diagnostics, semantic parameter universals, inventory size constraints, @cite{greenberg-1972} classifier–number complementarity.

Thread map #

Core infrastructure: Core.NounCategorization — ClassifierType, SemanticParameter, NounCategorizationSystem
Classifier lexicons: Fragments.Mandarin.Classifiers, Fragments.Japanese.Classifiers
Noun entries: Fragments.{Mandarin,Japanese,French}.Nouns
Chierchia bridge: Semantics.Lexical.Noun.Kind.Chierchia1998

def Phenomena.Agreement.NounCategorization.french :

Core.NounCategorization.NounCategorizationSystem

French noun categorization: 2-class gender system (masc/fem). Agreement on determiners, adjectives, and past participles. Aikhenvald type: noun class.

Equations

One or more equations did not get rendered due to their size.

Instances For

def Phenomena.Agreement.NounCategorization.italian :

Core.NounCategorization.NounCategorizationSystem

Italian noun categorization: 2-class gender system (masc/fem). Like French, [-arg, +pred]. Agreement on determiners (il/la, un/una), adjectives (‑o/‑a), and past participles (‑o/‑a). Richer article allomorphy than French (il/lo/la, i/gli/le).

Equations

One or more equations did not get rendered due to their size.

Instances For

def Phenomena.Agreement.NounCategorization.mandarin :

Core.NounCategorization.NounCategorizationSystem

Mandarin noun categorization: numeral classifier system. Large inventory, semantically motivated, no agreement. Aikhenvald type: numeral classifier.

Equations

One or more equations did not get rendered due to their size.

Instances For

def Phenomena.Agreement.NounCategorization.japanese :

Core.NounCategorization.NounCategorizationSystem

Japanese noun categorization: numeral classifier system (josūshi). Similar to Mandarin but with native Japanese default counter (つ). Aikhenvald type: numeral classifier.

Equations

One or more equations did not get rendered due to their size.

Instances For

theorem Phenomena.Agreement.NounCategorization.french_is_noun_class :

Core.NounCategorization.isNounClassType french.classifierType = true

French is a noun-class system.

theorem Phenomena.Agreement.NounCategorization.mandarin_is_classifier :

Core.NounCategorization.isClassifierType mandarin.classifierType = true

Mandarin is a classifier system (not noun class).

theorem Phenomena.Agreement.NounCategorization.japanese_is_classifier :

Core.NounCategorization.isClassifierType japanese.classifierType = true

Japanese is a classifier system (not noun class).

theorem Phenomena.Agreement.NounCategorization.agreement_divides_types :

french.hasAgreement = true ∧ mandarin.hasAgreement = false ∧ japanese.hasAgreement = false

French has agreement; Mandarin and Japanese do not (Table 15.1).

theorem Phenomena.Agreement.NounCategorization.mandarin_inventory_from_fragment :

mandarin.inventorySize = 11

Mandarin inventory is derived from the classifier lexicon.

theorem Phenomena.Agreement.NounCategorization.japanese_inventory_from_fragment :

japanese.inventorySize = 9

Japanese inventory is derived from the classifier lexicon.

theorem Phenomena.Agreement.NounCategorization.classifier_systems_have_default :

mandarin.hasUnmarkedDefault = true ∧ japanese.hasUnmarkedDefault = true

Both classifier systems have a default (Mandarin 个, Japanese つ).

def Phenomena.Agreement.NounCategorization.allSystems :

List Core.NounCategorization.NounCategorizationSystem

Equations

One or more equations did not get rendered due to their size.

Instances For

theorem Phenomena.Agreement.NounCategorization.all_obligatory :

(allSystems.all fun (x : Core.NounCategorization.NounCategorizationSystem) => x.isObligatory) = true

All three systems are obligatory (not optional).

theorem Phenomena.Agreement.NounCategorization.all_have_default :

(allSystems.all fun (x : Core.NounCategorization.NounCategorizationSystem) => x.hasUnmarkedDefault) = true

All three systems have an unmarked default.

Numeral classifier languages have no agreement; noun class languages have agreement (Aikhenvald Table 15.1).

theorem Phenomena.Agreement.NounCategorization.classifier_semantic_nounclass_mixed :

mandarin.assignment = Core.NounCategorization.AssignmentPrinciple.semantic ∧ japanese.assignment = Core.NounCategorization.AssignmentPrinciple.semantic ∧ french.assignment = Core.NounCategorization.AssignmentPrinciple.mixed

Numeral classifier systems have purely semantic assignment; noun class systems have mixed assignment (Aikhenvald Table 15.2).

theorem Phenomena.Agreement.NounCategorization.bare_np_tracks_arg :

Fragments.Mandarin.Nouns.bareNPLicensed = true ∧ Fragments.Japanese.Nouns.bareNPLicensed = true ∧ Fragments.French.Nouns.barePluralLicensed = false

Bare NPs are licensed in [+arg] languages, not in [-arg] languages. This connects Fragment-level bare NP facts to the typological parameter.

theorem Phenomena.Agreement.NounCategorization.blocking_tracks_mapping :

Fragments.Mandarin.Nouns.mandarinBlocking.iotaBlocked = false ∧ Fragments.Japanese.Nouns.japaneseBlocking.iotaBlocked = false ∧ Fragments.French.Nouns.frenchBlocking.iotaBlocked = true

Blocking principle: [+arg, -pred] languages have no articles to block covert type shifts. [-arg, +pred] languages block ι and ∃.

axiom Phenomena.Agreement.NounCategorization.noun_class_requires_agreement (sys : Core.NounCategorization.NounCategorizationSystem) :

Core.NounCategorization.isNounClassType sys.classifierType = true → sys.hasAgreement = true

U1 (Aikhenvald Table 15.1): Noun class / gender systems require agreement. This is definitional — agreement is what makes a noun class system a "class" rather than a "classifier" (@cite{dixon-1982}, Table 1.2).

axiom Phenomena.Agreement.NounCategorization.numeral_classifier_no_agreement (sys : Core.NounCategorization.NounCategorizationSystem) :

sys.classifierType = Core.NounCategorization.ClassifierType.numeralClassifier → sys.hasAgreement = false

U2 (Aikhenvald Table 15.1): Numeral classifier systems lack agreement. Classifiers are independent morphemes, not agreement markers. Witnessed by Mandarin and Japanese in our typology.

axiom Phenomena.Agreement.NounCategorization.classifier_assignment_semantic (sys : Core.NounCategorization.NounCategorizationSystem) :

Core.NounCategorization.isClassifierType sys.classifierType = true → sys.assignment = Core.NounCategorization.AssignmentPrinciple.semantic ∨ sys.assignment = Core.NounCategorization.AssignmentPrinciple.mixed

U3 (Aikhenvald §11.1.1): Classifier selection is always at least partly semantic. There are no purely phonological or purely morphological classifier systems (unlike noun class, which can be morphological).

theorem Phenomena.Agreement.NounCategorization.assignment_difference :

french.assignment = Core.NounCategorization.AssignmentPrinciple.mixed ∧ mandarin.assignment = Core.NounCategorization.AssignmentPrinciple.semantic ∧ japanese.assignment = Core.NounCategorization.AssignmentPrinciple.semantic

U4 (Aikhenvald Table 15.2): Noun class assignment may be mixed (semantic core + morphological overlay), while classifier systems are purely semantic. Witnessed by French (mixed) vs Mandarin (semantic).

axiom Phenomena.Agreement.NounCategorization.animacy_universal (sys : Core.NounCategorization.NounCategorizationSystem) :

(sys.preferredSemantics.any fun (x : Core.NounCategorization.SemanticParameter) => x == Core.NounCategorization.SemanticParameter.animacy) = true ∨ (sys.preferredSemantics.any fun (x : Core.NounCategorization.SemanticParameter) => x == Core.NounCategorization.SemanticParameter.humanness) = true

U5 (Aikhenvald §11.1.1): Animacy (animate vs. inanimate or human vs. non-human) is a semantic parameter found in EVERY type of noun categorization device. This is the universal semantic "core."

theorem Phenomena.Agreement.NounCategorization.classifiers_prefer_physical :

(mandarin.preferredSemantics.any fun (x : Core.NounCategorization.SemanticParameter) => x == Core.NounCategorization.SemanticParameter.shape) = true ∧ (japanese.preferredSemantics.any fun (x : Core.NounCategorization.SemanticParameter) => x == Core.NounCategorization.SemanticParameter.shape) = true

U6 (Aikhenvald §11.1.1): Physical properties (shape, size) are the preferred semantic parameters for numeral classifiers, while animacy is the core for noun classes.

axiom Phenomena.Agreement.NounCategorization.classifier_semantic_hierarchy (sys : Core.NounCategorization.NounCategorizationSystem) :

Core.NounCategorization.isClassifierType sys.classifierType = true → ((sys.preferredSemantics.any fun (x : Core.NounCategorization.SemanticParameter) => x == Core.NounCategorization.SemanticParameter.shape) = true → (sys.preferredSemantics.any fun (x : Core.NounCategorization.SemanticParameter) => x == Core.NounCategorization.SemanticParameter.animacy) = true) ∧ ((sys.preferredSemantics.any fun (x : Core.NounCategorization.SemanticParameter) => x == Core.NounCategorization.SemanticParameter.function) = true → (sys.preferredSemantics.any fun (x : Core.NounCategorization.SemanticParameter) => x == Core.NounCategorization.SemanticParameter.shape) = true)

U7 (Aikhenvald §11.2.3): In numeral classifier systems, animacy outranks shape, which outranks function. Formalized as an implicational universal: if a system uses shape, it also uses animacy; if function, also shape. TODO: prove from attested systems once typology is extended.

theorem Phenomena.Agreement.NounCategorization.animacy_in_both_classifier_systems :

(mandarin.preferredSemantics.any fun (x : Core.NounCategorization.SemanticParameter) => x == Core.NounCategorization.SemanticParameter.animacy) = true ∧ (japanese.preferredSemantics.any fun (x : Core.NounCategorization.SemanticParameter) => x == Core.NounCategorization.SemanticParameter.animacy) = true

Animacy is attested in both Mandarin and Japanese classifiers. Derived from the classifier lexicons (witnessed by 只 zhī and 匹 hiki).

axiom Phenomena.Agreement.NounCategorization.noun_class_small_inventory (sys : Core.NounCategorization.NounCategorizationSystem) :

Core.NounCategorization.isNounClassType sys.classifierType = true → sys.inventorySize ≤ 20

U8 (Aikhenvald Table 15.1): Noun class systems have small inventories (2–20 classes), while classifier systems have large inventories (typically 20–200+).

theorem Phenomena.Agreement.NounCategorization.french_smaller_than_classifiers :

french.inventorySize < mandarin.inventorySize ∧ french.inventorySize < japanese.inventorySize

U9 (Aikhenvald §1.5): Classifier systems have larger inventories than noun class systems. Open (extendable) vs. closed.

theorem Phenomena.Agreement.NounCategorization.numeral_classifiers_scope_numeralNP :

(mandarin.scopes.any fun (x : Core.NounCategorization.CategorizationScope) => x == Core.NounCategorization.CategorizationScope.numeralNP) = true ∧ (japanese.scopes.any fun (x : Core.NounCategorization.CategorizationScope) => x == Core.NounCategorization.CategorizationScope.numeralNP) = true

U10 (Aikhenvald Table 15.1): Numeral classifiers operate inside numeral/quantifier NPs.

theorem Phenomena.Agreement.NounCategorization.noun_class_scope_agreement :

(french.scopes.any fun (x : Core.NounCategorization.CategorizationScope) => x == Core.NounCategorization.CategorizationScope.headModifierNP) = true ∧ (french.scopes.any fun (x : Core.NounCategorization.CategorizationScope) => x == Core.NounCategorization.CategorizationScope.predicateArgument) = true

U11 (Aikhenvald Table 15.1): Noun classes operate inside head-modifier NPs and predicate-argument structures (agreement).

inductive Phenomena.Agreement.NounCategorization.GrammaticalCategory :

Table 10.17 interaction matrix (simplified): Which grammatical categories interact with which classifier types.

Key patterns:

Noun classes interact with definiteness, number, case, tense/aspect
Numeral classifiers interact with number, definiteness
Verbal classifiers interact with tense/aspect

Instances For

instance Phenomena.Agreement.NounCategorization.instDecidableEqGrammaticalCategory :

DecidableEq GrammaticalCategory

Equations

Phenomena.Agreement.NounCategorization.instDecidableEqGrammaticalCategory x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯

instance Phenomena.Agreement.NounCategorization.instReprGrammaticalCategory :

Repr GrammaticalCategory

Equations

Phenomena.Agreement.NounCategorization.instReprGrammaticalCategory = { reprPrec := Phenomena.Agreement.NounCategorization.instReprGrammaticalCategory.repr }

def Phenomena.Agreement.NounCategorization.instReprGrammaticalCategory.repr :

GrammaticalCategory → ℕ → Std.Format

Equations

One or more equations did not get rendered due to their size.

Instances For

instance Phenomena.Agreement.NounCategorization.instBEqGrammaticalCategory :

BEq GrammaticalCategory

Equations

Phenomena.Agreement.NounCategorization.instBEqGrammaticalCategory = { beq := Phenomena.Agreement.NounCategorization.instBEqGrammaticalCategory.beq }

def Phenomena.Agreement.NounCategorization.instBEqGrammaticalCategory.beq :

GrammaticalCategory → GrammaticalCategory → Bool

Equations

Phenomena.Agreement.NounCategorization.instBEqGrammaticalCategory.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)

Instances For

def Phenomena.Agreement.NounCategorization.interacts :

Core.NounCategorization.ClassifierType → GrammaticalCategory → Bool

Whether a classifier type typically interacts with a grammatical category (Aikhenvald Table 10.17).

Equations

Instances For

theorem Phenomena.Agreement.NounCategorization.noun_class_more_interactions :

have cats := [GrammaticalCategory.definiteness , GrammaticalCategory.number , GrammaticalCategory.case_, GrammaticalCategory.tenseAspect , GrammaticalCategory.possession ]; have ncInteractions := List.filter (interacts Core.NounCategorization.ClassifierType.nounClass) cats; have clInteractions := List.filter (interacts Core.NounCategorization.ClassifierType.numeralClassifier) cats; ncInteractions.length > clInteractions.length

Noun classes interact with more grammatical categories than numeral classifiers (Table 10.17). This reflects their deeper grammaticalization.

theorem Phenomena.Agreement.NounCategorization.mandarin_no_blocking :

Fragments.Mandarin.Nouns.mandarinBlocking.iotaBlocked = false ∧ Fragments.Mandarin.Nouns.mandarinBlocking.existsBlocked = false ∧ Fragments.Mandarin.Nouns.mandarinBlocking.downBlocked = false

No type-shift blocking in Mandarin.

theorem Phenomena.Agreement.NounCategorization.japanese_no_blocking :

Fragments.Japanese.Nouns.japaneseBlocking.iotaBlocked = false ∧ Fragments.Japanese.Nouns.japaneseBlocking.existsBlocked = false ∧ Fragments.Japanese.Nouns.japaneseBlocking.downBlocked = false

No type-shift blocking in Japanese.

theorem Phenomena.Agreement.NounCategorization.default_classifiers_exist :

Fragments.Mandarin.Classifiers.defaultClassifier.isDefault = true ∧ Fragments.Japanese.Classifiers.defaultClassifier.isDefault = true

U12: Every numeral classifier system has a semantically bleached default classifier that can substitute for any specific classifier (Aikhenvald §4.2). The default is the "elsewhere" case.

Witnessed by: Mandarin 个 gè, Japanese つ tsu.

theorem Phenomena.Agreement.NounCategorization.specific_classifiers_motivated :

((List.filter (fun (x : Core.NounCategorization.ClassifierEntry) => !x.isDefault) Fragments.Mandarin.Classifiers.allClassifiers).all fun (x : Core.NounCategorization.ClassifierEntry) => decide (x.semantics.length > 0)) = true ∧ ((List.filter (fun (x : Core.NounCategorization.ClassifierEntry) => !x.isDefault) Fragments.Japanese.Classifiers.allClassifiers).all fun (x : Core.NounCategorization.ClassifierEntry) => decide (x.semantics.length > 0)) = true

Non-default classifiers always carry at least one semantic parameter. The default is the only semantically empty classifier.