Noun Categorization and Agreement Typology #
@cite{aikhenvald-2000} @cite{chierchia-1998} @cite{dixon-1982} @cite{greenberg-1972}
Cross-linguistic typology of noun categorization systems, following @cite{aikhenvald-2000}. The central diagnostic is agreement: noun class/gender systems (French) require it; classifier systems (Mandarin, Japanese) lack it. This is @cite{dixon-1982}'s definitional divide.
This file provides the cross-linguistic context for the English-specific
agreement data in Agreement.Basic, Agreement.DetNoun, and Agreement.Case:
why do some languages use agreement-based noun categorization while others
use classifiers?
Part I — Typology #
Three languages from three families mapped to NounCategorizationSystem:
- French (Indo-European): Noun class / gender (2 classes). Agreement. [-arg, +pred].
- Mandarin (Sino-Tibetan): Numeral classifiers (~100+). No agreement. [+arg, -pred].
- Japanese (Japonic): Numeral classifiers (josūshi). No agreement. [+arg, -pred].
System descriptions are derived from Fragment data (single source of truth).
Part II — Universals #
Aikhenvald's empirical generalizations (Chapters 11, 15): agreement diagnostics, semantic parameter universals, inventory size constraints, @cite{greenberg-1972} classifier–number complementarity.
Thread map #
- Core infrastructure:
Core.NounCategorization—ClassifierType,SemanticParameter,NounCategorizationSystem - Classifier lexicons:
Fragments.Mandarin.Classifiers,Fragments.Japanese.Classifiers - Noun entries:
Fragments.{Mandarin,Japanese,French}.Nouns - Chierchia bridge:
Semantics.Lexical.Noun.Kind.Chierchia1998
French noun categorization: 2-class gender system (masc/fem). Agreement on determiners, adjectives, and past participles. Aikhenvald type: noun class.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Italian noun categorization: 2-class gender system (masc/fem). Like French, [-arg, +pred]. Agreement on determiners (il/la, un/una), adjectives (‑o/‑a), and past participles (‑o/‑a). Richer article allomorphy than French (il/lo/la, i/gli/le).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Mandarin noun categorization: numeral classifier system. Large inventory, semantically motivated, no agreement. Aikhenvald type: numeral classifier.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Japanese noun categorization: numeral classifier system (josūshi). Similar to Mandarin but with native Japanese default counter (つ). Aikhenvald type: numeral classifier.
Equations
- One or more equations did not get rendered due to their size.
Instances For
French is a noun-class system.
Mandarin is a classifier system (not noun class).
Japanese is a classifier system (not noun class).
French has agreement; Mandarin and Japanese do not (Table 15.1).
Mandarin inventory is derived from the classifier lexicon.
Japanese inventory is derived from the classifier lexicon.
Both classifier systems have a default (Mandarin 个, Japanese つ).
Equations
- One or more equations did not get rendered due to their size.
Instances For
All three systems are obligatory (not optional).
All three systems have an unmarked default.
Numeral classifier languages have no agreement; noun class languages have agreement (Aikhenvald Table 15.1).
Numeral classifier systems have purely semantic assignment; noun class systems have mixed assignment (Aikhenvald Table 15.2).
Bare NPs are licensed in [+arg] languages, not in [-arg] languages. This connects Fragment-level bare NP facts to the typological parameter.
Blocking principle: [+arg, -pred] languages have no articles to block covert type shifts. [-arg, +pred] languages block ι and ∃.
U1 (Aikhenvald Table 15.1): Noun class / gender systems require agreement. This is definitional — agreement is what makes a noun class system a "class" rather than a "classifier" (@cite{dixon-1982}, Table 1.2).
U2 (Aikhenvald Table 15.1): Numeral classifier systems lack agreement. Classifiers are independent morphemes, not agreement markers. Witnessed by Mandarin and Japanese in our typology.
U3 (Aikhenvald §11.1.1): Classifier selection is always at least partly semantic. There are no purely phonological or purely morphological classifier systems (unlike noun class, which can be morphological).
U4 (Aikhenvald Table 15.2): Noun class assignment may be mixed (semantic core + morphological overlay), while classifier systems are purely semantic. Witnessed by French (mixed) vs Mandarin (semantic).
U5 (Aikhenvald §11.1.1): Animacy (animate vs. inanimate or human vs. non-human) is a semantic parameter found in EVERY type of noun categorization device. This is the universal semantic "core."
U6 (Aikhenvald §11.1.1): Physical properties (shape, size) are the preferred semantic parameters for numeral classifiers, while animacy is the core for noun classes.
U7 (Aikhenvald §11.2.3): In numeral classifier systems, animacy outranks shape, which outranks function. Formalized as an implicational universal: if a system uses shape, it also uses animacy; if function, also shape. TODO: prove from attested systems once typology is extended.
Animacy is attested in both Mandarin and Japanese classifiers. Derived from the classifier lexicons (witnessed by 只 zhī and 匹 hiki).
U8 (Aikhenvald Table 15.1): Noun class systems have small inventories (2–20 classes), while classifier systems have large inventories (typically 20–200+).
U9 (Aikhenvald §1.5): Classifier systems have larger inventories than noun class systems. Open (extendable) vs. closed.
U10 (Aikhenvald Table 15.1): Numeral classifiers operate inside numeral/quantifier NPs.
U11 (Aikhenvald Table 15.1): Noun classes operate inside head-modifier NPs and predicate-argument structures (agreement).
Table 10.17 interaction matrix (simplified): Which grammatical categories interact with which classifier types.
Key patterns:
- Noun classes interact with definiteness, number, case, tense/aspect
- Numeral classifiers interact with number, definiteness
- Verbal classifiers interact with tense/aspect
- definiteness : GrammaticalCategory
- number : GrammaticalCategory
- case_ : GrammaticalCategory
- tenseAspect : GrammaticalCategory
- possession : GrammaticalCategory
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Whether a classifier type typically interacts with a grammatical category (Aikhenvald Table 10.17).
Equations
- Phenomena.Agreement.NounCategorization.interacts Core.NounCategorization.ClassifierType.nounClass Phenomena.Agreement.NounCategorization.GrammaticalCategory.definiteness = true
- Phenomena.Agreement.NounCategorization.interacts Core.NounCategorization.ClassifierType.nounClass Phenomena.Agreement.NounCategorization.GrammaticalCategory.number = true
- Phenomena.Agreement.NounCategorization.interacts Core.NounCategorization.ClassifierType.nounClass Phenomena.Agreement.NounCategorization.GrammaticalCategory.case_ = true
- Phenomena.Agreement.NounCategorization.interacts Core.NounCategorization.ClassifierType.nounClass Phenomena.Agreement.NounCategorization.GrammaticalCategory.tenseAspect = true
- Phenomena.Agreement.NounCategorization.interacts Core.NounCategorization.ClassifierType.nounClass Phenomena.Agreement.NounCategorization.GrammaticalCategory.possession = true
- Phenomena.Agreement.NounCategorization.interacts Core.NounCategorization.ClassifierType.numeralClassifier Phenomena.Agreement.NounCategorization.GrammaticalCategory.definiteness = true
- Phenomena.Agreement.NounCategorization.interacts Core.NounCategorization.ClassifierType.numeralClassifier Phenomena.Agreement.NounCategorization.GrammaticalCategory.number = true
- Phenomena.Agreement.NounCategorization.interacts Core.NounCategorization.ClassifierType.numeralClassifier Phenomena.Agreement.NounCategorization.GrammaticalCategory.possession = false
- Phenomena.Agreement.NounCategorization.interacts Core.NounCategorization.ClassifierType.numeralClassifier Phenomena.Agreement.NounCategorization.GrammaticalCategory.case_ = false
- Phenomena.Agreement.NounCategorization.interacts Core.NounCategorization.ClassifierType.numeralClassifier Phenomena.Agreement.NounCategorization.GrammaticalCategory.tenseAspect = false
- Phenomena.Agreement.NounCategorization.interacts Core.NounCategorization.ClassifierType.verbalClassifier Phenomena.Agreement.NounCategorization.GrammaticalCategory.tenseAspect = true
- Phenomena.Agreement.NounCategorization.interacts Core.NounCategorization.ClassifierType.verbalClassifier Phenomena.Agreement.NounCategorization.GrammaticalCategory.number = true
- Phenomena.Agreement.NounCategorization.interacts Core.NounCategorization.ClassifierType.relationalClassifier Phenomena.Agreement.NounCategorization.GrammaticalCategory.possession = true
- Phenomena.Agreement.NounCategorization.interacts Core.NounCategorization.ClassifierType.possessedClassifier Phenomena.Agreement.NounCategorization.GrammaticalCategory.possession = true
- Phenomena.Agreement.NounCategorization.interacts x✝¹ x✝ = false
Instances For
Noun classes interact with more grammatical categories than numeral classifiers (Table 10.17). This reflects their deeper grammaticalization.
No type-shift blocking in Mandarin.
No type-shift blocking in Japanese.
U12: Every numeral classifier system has a semantically bleached default classifier that can substitute for any specific classifier (Aikhenvald §4.2). The default is the "elsewhere" case.
Witnessed by: Mandarin 个 gè, Japanese つ tsu.
Non-default classifiers always carry at least one semantic parameter. The default is the only semantically empty classifier.