Classifier Typology #
@cite{aikhenvald-2000} @cite{chierchia-1998} @cite{dixon-1982} @cite{greenberg-1972}
Cross-linguistic typology of noun categorization systems, following @cite{aikhenvald-2000} "Classifiers: A Typology of Noun Categorization Devices."
NounCategorizationSystem #
@cite{aikhenvald-2000}'s 7-property (A–G) per-language system description.
The vocabulary types it depends on (ClassifierType, SemanticParameter,
ClassifierEntry, etc.) live in Core.Lexical.NounCategorization as
settled descriptive infrastructure; this file provides the framework for
assembling them into per-language system descriptions.
Part I — Per-Language Data #
Four languages from three families:
- French (Indo-European): Noun class / gender (2 classes). Agreement.
- Italian (Indo-European): Noun class / gender (2 classes). Agreement.
- Mandarin (Sino-Tibetan): Numeral classifiers (~100+). No agreement.
- Japanese (Japonic): Numeral classifiers (josūshi). No agreement.
System descriptions are derived from Fragment data (single source of truth).
Part II — Universals #
@cite{aikhenvald-2000}'s empirical generalizations (Chapters 11, 15): agreement diagnostics, semantic parameter universals, inventory size constraints, @cite{greenberg-1972} classifier–number complementarity.
Thread map #
- Core infrastructure:
Core.NounCategorization—ClassifierType,SemanticParameter,ClassifierEntry - Classifier lexicons:
Fragments.Mandarin.Classifiers,Fragments.Japanese.Classifiers - Noun entries:
Fragments.{Mandarin,Japanese,French,Italian}.Nouns - WALS 55A:
Core.WALS.F55A— 400-language numeral classifier survey (imported inPhenomena.Numerals.Typology) - Chierchia bridge:
Theories.Semantics.Lexical.Noun.Kind.Chierchia1998
A noun categorization system in a language.
Captures @cite{aikhenvald-2000}'s 7 definitional properties (A–G from §1.5):
(A) morphosyntactic locus → scopes
(B) scope/domain → classifierType + scopes
(C) assignment principles → assignment
(D) surface realization → realizations
(E) agreement → hasAgreement
(F) markedness → hasUnmarkedDefault
(G) grammaticalization → isObligatory
- language : String
Language name
- family : String
Language family
- classifierType : Core.NounCategorization.ClassifierType
Aikhenvald classifier type
Morphosyntactic scopes this system operates in (A, B)
- assignment : Core.NounCategorization.AssignmentPrinciple
How nouns are assigned to classes/classifiers (C)
- realizations : List Core.NounCategorization.SurfaceRealization
Morphological realization types used (D)
- hasAgreement : Bool
Does the system involve agreement? (E) — definitional for noun classes
- inventorySize : ℕ
Inventory size (number of classes or classifiers)
- isObligatory : Bool
Is realization obligatory or optional? (G)
- hasUnmarkedDefault : Bool
Is there a formally/functionally unmarked default? (F)
- preferredSemantics : List Core.NounCategorization.SemanticParameter
Preferred semantic parameters (§11.2, Table 11.13)
- source : String
Citation
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
@cite{dixon-1982}'s noun-class vs. classifier divide (Table 1.2). Noun classes: small, closed, grammaticalized, agreement. Classifiers: large, open, lexical, no agreement.
Equations
Instances For
All non-noun-class types are "classifier" types in the broad sense.
Equations
Instances For
French noun categorization: 2-class gender system (masc/fem). Agreement on determiners, adjectives, and past participles. Aikhenvald type: noun class.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Italian noun categorization: 2-class gender system (masc/fem). Like French, [-arg, +pred]. Agreement on determiners (il/la, un/una), adjectives (‑o/‑a), and past participles (‑o/‑a). Richer article allomorphy than French (il/lo/la, i/gli/le).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Mandarin noun categorization: numeral classifier system. Large inventory, semantically motivated, no agreement. Aikhenvald type: numeral classifier.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Japanese noun categorization: numeral classifier system (josūshi). Similar to Mandarin but with native Japanese default counter (つ). Aikhenvald type: numeral classifier.
Equations
- One or more equations did not get rendered due to their size.
Instances For
French is a noun-class system.
Mandarin is a classifier system (not noun class).
Japanese is a classifier system (not noun class).
French has agreement; Mandarin and Japanese do not (Table 15.1).
Mandarin inventory is derived from the classifier lexicon.
Japanese inventory is derived from the classifier lexicon.
Both classifier systems have a default (Mandarin 个, Japanese つ).
Equations
- One or more equations did not get rendered due to their size.
Instances For
All four systems are obligatory (not optional).
All four systems have an unmarked default.
Numeral classifier languages have no agreement; noun class languages have agreement (Aikhenvald Table 15.1).
Numeral classifier systems have purely semantic assignment; noun class systems have mixed assignment (Aikhenvald Table 15.2).
Bare NPs are licensed in [+arg] languages, not in [-arg] languages. This connects Fragment-level bare NP facts to the typological parameter.
Blocking principle: [+arg, -pred] languages have no articles to block covert type shifts. [-arg, +pred] languages block ι and ∃.
U1 (Aikhenvald Table 15.1): Noun class / gender systems require agreement. This is definitional — agreement is what makes a noun class system a "class" rather than a "classifier" (@cite{dixon-1982}, Table 1.2).
U2 (Aikhenvald Table 15.1): Numeral classifier systems lack agreement. Classifiers are independent morphemes, not agreement markers. Witnessed by Mandarin and Japanese in our typology.
U3 (Aikhenvald §11.1.1): Classifier selection is always at least partly semantic. There are no purely phonological or purely morphological classifier systems (unlike noun class, which can be morphological).
U4 (Aikhenvald Table 15.2): Noun class assignment may be mixed (semantic core + morphological overlay), while classifier systems are purely semantic. Witnessed by French (mixed) vs Mandarin (semantic).
U5 (Aikhenvald §11.1.1): Animacy (animate vs. inanimate or human vs. non-human) is a semantic parameter found in EVERY type of noun categorization device. This is the universal semantic "core."
U6 (Aikhenvald §11.1.1): Physical properties (shape, size) are the preferred semantic parameters for numeral classifiers, while animacy is the core for noun classes.
U7 (Aikhenvald §11.2.3): In numeral classifier systems, animacy outranks shape, which outranks function. Formalized as an implicational universal: if a system uses shape, it also uses animacy; if function, also shape. TODO: prove from attested systems once typology is extended.
Animacy is attested in both Mandarin and Japanese classifiers. Derived from the classifier lexicons (witnessed by 只 zhī and 匹 hiki).
U8 (Aikhenvald Table 15.1): Noun class systems have small inventories (2–20 classes), while classifier systems have large inventories (typically 20–200+).
U9 (Aikhenvald §1.5): Classifier systems have larger inventories than noun class systems. Open (extendable) vs. closed.
U10 (Aikhenvald Table 15.1): Numeral classifiers operate inside numeral/quantifier NPs.
U11 (Aikhenvald Table 15.1): Noun classes operate inside head-modifier NPs and predicate-argument structures (agreement).
Table 10.17 interaction matrix (simplified): Which grammatical categories interact with which classifier types.
Key patterns:
- Noun classes interact with definiteness, number, case, tense/aspect
- Numeral classifiers interact with number, definiteness
- Verbal classifiers interact with tense/aspect
- definiteness : GrammaticalCategory
- number : GrammaticalCategory
- case_ : GrammaticalCategory
- tenseAspect : GrammaticalCategory
- possession : GrammaticalCategory
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Whether a classifier type typically interacts with a grammatical category (Aikhenvald Table 10.17).
Equations
- Phenomena.Classifiers.Typology.interacts Core.NounCategorization.ClassifierType.nounClass Phenomena.Classifiers.Typology.GrammaticalCategory.definiteness = true
- Phenomena.Classifiers.Typology.interacts Core.NounCategorization.ClassifierType.nounClass Phenomena.Classifiers.Typology.GrammaticalCategory.number = true
- Phenomena.Classifiers.Typology.interacts Core.NounCategorization.ClassifierType.nounClass Phenomena.Classifiers.Typology.GrammaticalCategory.case_ = true
- Phenomena.Classifiers.Typology.interacts Core.NounCategorization.ClassifierType.nounClass Phenomena.Classifiers.Typology.GrammaticalCategory.tenseAspect = true
- Phenomena.Classifiers.Typology.interacts Core.NounCategorization.ClassifierType.nounClass Phenomena.Classifiers.Typology.GrammaticalCategory.possession = true
- Phenomena.Classifiers.Typology.interacts Core.NounCategorization.ClassifierType.numeralClassifier Phenomena.Classifiers.Typology.GrammaticalCategory.definiteness = true
- Phenomena.Classifiers.Typology.interacts Core.NounCategorization.ClassifierType.numeralClassifier Phenomena.Classifiers.Typology.GrammaticalCategory.number = true
- Phenomena.Classifiers.Typology.interacts Core.NounCategorization.ClassifierType.numeralClassifier Phenomena.Classifiers.Typology.GrammaticalCategory.possession = false
- Phenomena.Classifiers.Typology.interacts Core.NounCategorization.ClassifierType.numeralClassifier Phenomena.Classifiers.Typology.GrammaticalCategory.case_ = false
- Phenomena.Classifiers.Typology.interacts Core.NounCategorization.ClassifierType.numeralClassifier Phenomena.Classifiers.Typology.GrammaticalCategory.tenseAspect = false
- Phenomena.Classifiers.Typology.interacts Core.NounCategorization.ClassifierType.verbalClassifier Phenomena.Classifiers.Typology.GrammaticalCategory.tenseAspect = true
- Phenomena.Classifiers.Typology.interacts Core.NounCategorization.ClassifierType.verbalClassifier Phenomena.Classifiers.Typology.GrammaticalCategory.number = true
- Phenomena.Classifiers.Typology.interacts Core.NounCategorization.ClassifierType.relationalClassifier Phenomena.Classifiers.Typology.GrammaticalCategory.possession = true
- Phenomena.Classifiers.Typology.interacts Core.NounCategorization.ClassifierType.possessedClassifier Phenomena.Classifiers.Typology.GrammaticalCategory.possession = true
- Phenomena.Classifiers.Typology.interacts x✝¹ x✝ = false
Instances For
Noun classes interact with more grammatical categories than numeral classifiers (Table 10.17). This reflects their deeper grammaticalization.
U12: Every numeral classifier system has a semantically bleached default classifier that can substitute for any specific classifier (Aikhenvald §4.2). The default is the "elsewhere" case.
Witnessed by: Mandarin 个 gè, Japanese つ tsu.
Non-default classifiers always carry at least one semantic parameter. The default is the only semantically empty classifier.
Whether a language uses numeral classifiers (WALS Ch 55).
Numeral classifiers are morphemes that co-occur with a numeral when it modifies a noun (e.g., Mandarin san ge ren 'three CL person'). The key distinction is between obligatory classifiers (required whenever a numeral modifies a noun) and optional classifiers (available but not required).
- absent : ClassifierStatus
- optional : ClassifierStatus
- obligatory : ClassifierStatus
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Instances For
Actual WALS Ch 55 counts.
Equations
- Phenomena.Classifiers.Typology.ch55Distribution = { absent := 260, optional := 62, obligatory := 78 }
Instances For
Convert WALS 55A numeral classifier values to our ClassifierStatus type. The mapping is one-to-one: absent, optional, obligatory.
Equations
- Phenomena.Classifiers.Typology.fromWALS55A Core.WALS.F55A.NumeralClassifiers.absent = Phenomena.Classifiers.Typology.ClassifierStatus.absent
- Phenomena.Classifiers.Typology.fromWALS55A Core.WALS.F55A.NumeralClassifiers.optional = Phenomena.Classifiers.Typology.ClassifierStatus.optional
- Phenomena.Classifiers.Typology.fromWALS55A Core.WALS.F55A.NumeralClassifiers.obligatory = Phenomena.Classifiers.Typology.ClassifierStatus.obligatory
Instances For
Languages without numeral classifiers are the global majority (WALS Ch 55). 260 out of 400 sampled languages lack classifiers entirely.
Languages without classifiers constitute over half the sample.
Obligatory classifiers are more common than optional ones globally. This is somewhat counterintuitive — it suggests that once a language develops classifiers, they tend to become grammaticalized as obligatory.