Copula and Predication Typology (WALS Chapters 117--120) #
@cite{hengeveld-1992} @cite{pustet-2003} @cite{stassen-2013} @cite{haspelmath-2001}
Cross-linguistic data on predication strategies and copulas from four WALS chapters, all authored by Leon @cite{stassen-2013}. These chapters address a cluster of related questions: how languages express predication when the predicate is an adjective, a noun phrase, or a locative, and whether a copular element is needed in each case.
The key insight from Stassen's typological work is that predication is not uniform across predicate types. Many languages use fundamentally different strategies for adjectival predication ("The book is red"), nominal predication ("She is a doctor"), and locational predication ("The cat is on the mat"). The copula, where it exists, is best understood not as a single phenomenon but as a family of strategies that languages deploy differently across predicate types.
Ch 117: Predicative Possession (N = 240) #
How languages express predicative possession ("I have a book"). Five values:
- Locational (48/240): possession expressed as location ("a book is at me").
- Genitive (22/240): possession expressed with genitive ("my book exists").
- Topic (48/240): possession expressed as topic construction.
- Conjunctional (59/240): possession expressed with conjunctional strategy.
- 'Have' (63/240): a dedicated 'have' verb is used.
Ch 118: Predicative Adjectives (N = 386) #
How languages express adjectival predication. Three values:
- Verbal encoding: adjectives behave like verbs (take verbal morphology, no copula). This is the most common strategy worldwide (151/386 = 39.1%).
- Nonverbal encoding: adjectives require a copula or other non-verbal strategy (132/386 = 34.2%).
- Mixed: some adjectives are verbal, others require a copula (103/386 = 26.7%).
Ch 119: Nominal and Locational Predication (N = 386) #
Whether a language uses the same or different strategy for nominal predication ("is a doctor") and locational predication ("is in the room"). Two values:
- Different: different copulas or strategies for NOM vs LOC (269/386 = 69.7%).
- Identical: same copula/strategy for both (117/386 = 30.3%).
The majority pattern is to differentiate: even English, which uses "be" for both, is cross-linguistically unusual in using a single copula.
Ch 120: Zero Copula for Predicate Nominals (N = 386) #
Whether the copula can be absent in nominal predication. Two values:
- Impossible: copula is always required (211/386 = 54.7%).
- Possible: zero copula is allowed in some or all contexts (175/386 = 45.3%).
WALS Ch 117: How a language expresses adjectival predication ("The book is red").
The three-way distinction reflects the categorial status of adjectives in the language. In "verbal" languages, adjectives inflect like verbs and need no copula. In "non-verbal" languages, adjectives are a distinct category requiring a copula. "Mixed" languages have adjectives that split between verbal and non-verbal behavior.
- verbal : PredAdjStrategy
Adjectives behave like verbs: they take verbal morphology (tense, aspect, negation) and occur as predicates without a copula. Example: Mandarin
shu hong'book red' = 'The book is red', wherehongcan take aspect markers directly. - mixed : PredAdjStrategy
Some adjectives are verbal (typically core/frequent ones), others require a copula (typically peripheral/borrowed ones). Example: Japanese has verbal adjectives (i-adjectives:
takai'is expensive') and non-verbal adjectives (na-adjectives:shizuka-da'quiet-COP'). - nonVerbal : PredAdjStrategy
Adjectives are categorially distinct from verbs and require a copula or other linking element for predication. Example: English
The book is redrequires the copulais.
Instances For
Equations
- Phenomena.Copulas.Typology.instBEqPredAdjStrategy.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
WALS Ch 118: How a language expresses nominal predication ("She is a doctor").
The binary distinction captures whether the language uses a verbal copula (a word that inflects like a verb and carries tense/agreement) or a non-verbal strategy (juxtaposition, pronominal copula, or invariant particle).
- verbal : PredNounStrategy
Nominal predication uses a verbal copula that inflects for tense, agreement, etc. Example: English
She is a doctor, whereisis a fully inflecting verb (am/is/are/was/were). - nonVerbal : PredNounStrategy
Nominal predication uses juxtaposition or a non-verbal element (particle, pronoun, etc.), not a copula verb. Example: Russian present tense
Ona vrach'She doctor' = 'She is a doctor' (no copula in present tense).
Instances For
Equations
- Phenomena.Copulas.Typology.instBEqPredNounStrategy.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
WALS Ch 119: Whether a language uses the same or different strategy for nominal predication ("She is a doctor") and locational predication ("The cat is on the mat").
Many languages use different verbs: e.g., Spanish ser (nominal) vs
estar (locational), or have a copula for one but not the other. The
"different" value is the majority pattern cross-linguistically.
- identical : NomLocStrategy
The same copula or strategy is used for both nominal and locational predication. Example: English
She is a doctor/The cat is on the mat-- both usebe. - different : NomLocStrategy
Different copulas or strategies are used for nominal vs locational predication. Example: Spanish
ser(nominal:Es doctora) vsestar(locational:Esta en la casa).
Instances For
Equations
- Phenomena.Copulas.Typology.instBEqNomLocStrategy.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
WALS Ch 120: Whether the copula can be absent in nominal predication.
Zero copula is typically conditioned by tense (present only) or person (3rd person only). "Widespread" means zero copula is the default, unrestricted strategy.
- impossible : ZeroCopulaStatus
Zero copula is impossible: the copula is always required in nominal predication, regardless of tense, person, or other factors. Example: English
*She doctoris ungrammatical. - restricted : ZeroCopulaStatus
Zero copula is possible in restricted contexts: typically present tense, or 3rd person, or in certain clause types. The copula appears in other environments. Example: Russian allows zero copula in present tense (
Ona vrach'She doctor') but requires it in past tense (Ona byla vrach'She was doctor'). - widespread : ZeroCopulaStatus
Zero copula is the normal, widespread, or default strategy. The copula is absent in most environments. Example: Tagalog
Doktor siya'Doctor she' = 'She is a doctor'.
Instances For
Equations
- Phenomena.Copulas.Typology.instBEqZeroCopulaStatus.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Morphosyntactic type of copula, when present.
This supplements the WALS classification with a finer-grained typology of copular elements, following @cite{pustet-2003} and @cite{hengeveld-1992}.
- verbalCopula : CopulaType
Fully inflecting verbal copula: takes tense, agreement, negation like other verbs. Example: English
be(am/is/are/was/were). - pronominalCopula : CopulaType
Pronominal copula: a pronoun-like element that agrees with the subject. Example: Hebrew
hu/hi/hem/hen(he/she/they.m/they.f) used as a copula in present tense. - particle : CopulaType
Invariant particle: a non-inflecting element linking subject and predicate. Example: Swahili
ni(affirmative copula, invariant). - zero : CopulaType
No copula: predication by juxtaposition.
Instances For
Equations
- Phenomena.Copulas.Typology.instBEqCopulaType.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
A single row in a WALS frequency table: a category label and its count.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Instances For
Sum of counts in a WALS table.
Equations
- Phenomena.Copulas.Typology.WALSCount.totalOf cs = List.foldl (fun (acc : Nat) (c : Phenomena.Copulas.Typology.WALSCount) => acc + c.count) 0 cs
Instances For
Chapter 117 distribution: predicative possession (N = 240).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Chapter 118 distribution: predicative adjectives (N = 386).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Chapter 119 distribution: nominal and locational predication (N = 386).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Chapter 120 distribution: zero copula for predicate nominals (N = 386).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Ch 117 total: 240 languages.
Ch 118 total: 386 languages.
Ch 119 total: 386 languages.
Ch 120 total: 386 languages.
Chapters 118, 119, 120 use the same 386-language sample. Chapter 117 uses a smaller 240-language sample.
A language's copula and predication profile across WALS Chapters 117--120.
- language : String
Language name.
- iso : String
ISO 639-3 code.
- family : String
Language family.
- predAdj : PredAdjStrategy
Ch 117: How adjectival predication is expressed.
- predNoun : PredNounStrategy
Ch 118: How nominal predication is expressed.
- nomLoc : NomLocStrategy
Ch 119: Whether nominal and locational predication use the same or different strategy.
- zeroCopula : ZeroCopulaStatus
Ch 120: Whether zero copula is possible in nominal predication.
- copulaType : CopulaType
Primary copula type in the language (supplementary).
Illustrative copula form(s).
- notes : String
Notes on the predication system.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
English (Indo-European, Germanic).
Adjectives are non-verbal: require copula be ("The book is red").
Nominal predication uses the verbal copula be ("She is a doctor").
Same copula be for both nominal and locational predication.
Zero copula is impossible: *She doctor, *Book red.
Equations
- One or more equations did not get rendered due to their size.
Instances For
French (Indo-European, Romance).
Adjectives are non-verbal: Le livre est rouge ('The book is red').
Nominal predication uses verbal copula etre: Elle est medecin.
Same copula etre for nominal and locational predication.
Zero copula is impossible.
Equations
- One or more equations did not get rendered due to their size.
Instances For
German (Indo-European, Germanic).
Adjectives are non-verbal: Das Buch ist rot ('The book is red').
Nominal predication uses verbal copula sein: Sie ist Arztin.
Same copula for nominal and locational predication.
Zero copula is impossible.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Spanish (Indo-European, Romance).
Adjectives are non-verbal: El libro es rojo ('The book is red').
Nominal predication uses verbal copula ser: Ella es doctora.
DIFFERENT copulas for nominal vs locational: ser (permanent/nominal)
vs estar (temporary/locational).
Zero copula is impossible.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Russian (Indo-European, Slavic).
Adjectives are non-verbal in present tense (zero copula: Kniga krasnaja
'Book red'), verbal copula byt' in past (Kniga byla krasnaja).
Nominal predication uses zero copula in present: Ona vrach ('She doctor').
Same strategy for nominal and locational (both use zero copula in present).
Zero copula is RESTRICTED to present tense: past and future require byt'.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Arabic (Standard) (Afro-Asiatic, Semitic).
Adjectives are non-verbal: al-kitab ahmar 'the-book red'.
Nominal predication is non-verbal in present: hiya tabiba 'she doctor'.
Different strategies for nominal vs locational (locational uses fi
'in' with possible copula).
Zero copula is RESTRICTED: present tense allows zero, past requires kaana.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Hebrew (Afro-Asiatic, Semitic).
Adjectives are non-verbal: ha-sefer adom 'the-book red'.
Nominal predication uses a pronominal copula in present: hi rofaa
or hi hi rofaa (she COP.3F doctor). Past/future use haya.
Different strategies for nominal vs locational.
Zero copula is RESTRICTED: present allows zero/pronominal, past requires
haya.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Hungarian (Uralic).
Adjectives are non-verbal: A konyv piros 'The book red'.
Nominal predication: zero copula in 3rd person present (O orvos
'She doctor'), verbal copula van in other persons/tenses.
Same strategy for nominal and locational.
Zero copula is RESTRICTED to 3rd person present tense.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Japanese (Japonic).
Adjectives are MIXED: i-adjectives are verbal (takai 'is.expensive',
takes tense: takakatta 'was.expensive'), but na-adjectives require
copula da (shizuka-da 'quiet-COP').
Nominal predication uses copula da/desu: Kanojo wa isha da 'She TOP
doctor COP'.
Different strategies for nominal and locational (locational uses ni iru/aru).
Zero copula is impossible in standard forms (copula always required for
nominals).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Korean (Koreanic).
Adjectives are verbal: chaek-i ppalgah-da 'book-NOM red-DECL' with
verbal inflection on the adjective. No copula needed.
Nominal predication uses copula -i-da: geunyeo-neun uisa-i-da
'she-TOP doctor-COP-DECL'.
Different strategies for nominal vs locational (locational uses iss-da
'exist-DECL').
Zero copula is impossible for nominals.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Mandarin Chinese (Sino-Tibetan).
Adjectives are verbal: shu hong 'book red' = 'The book is red'.
Adjectives can take aspect markers (le, guo) and negation (bu) directly.
Nominal predication uses copula shi: ta shi yisheng 'she COP doctor'.
Different strategies for nominal vs locational (locational uses zai 'at').
Zero copula is impossible for nominal predication (shi is required).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Turkish (Turkic).
Adjectives are verbal: Kitap kirmizi 'Book red' = 'The book is red'.
Adjective takes verbal suffixes for tense.
Nominal predication uses a copular suffix: O doktor-dur 'S/he doctor-COP'.
Different strategies for nominal vs locational (locational uses
postposition + var/yok).
Zero copula is RESTRICTED: 3rd person present often has zero copula
(O doktor 'S/he doctor'), but past requires -di/-ydi.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Swahili (Niger-Congo, Bantu).
Adjectives are verbal: -zuri 'good' takes verbal agreement prefixes
(m-zuri 'good (cl.1)'). Predication: Kitabu ni kizuri uses ni.
Nominal predication uses particle ni: Yeye ni daktari 'S/he COP doctor'.
Different strategies for nominal vs locational (locational uses -ko
locative suffixes and different verbal forms).
Zero copula is WIDESPREAD in some constructions: Yeye daktari is
acceptable in many contexts.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Hindi-Urdu (Indo-European, Indo-Aryan).
Adjectives are non-verbal: kitab lal hai 'book red COP.PRS'.
Nominal predication uses copula hona: vo daktar hai 'she doctor COP.PRS'.
Same copula hona for nominal and locational predication.
Zero copula is impossible in standard Hindi-Urdu.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Tagalog (Austronesian).
Adjectives are verbal: maganda ang babae 'beautiful ANG woman'.
Nominal predication is non-verbal: doktor siya 'doctor she'.
Same strategy for nominal and locational.
Zero copula is WIDESPREAD: doktor siya with no overt copula is the
normal strategy.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Irish (Indo-European, Celtic).
Adjectives are non-verbal: Ta an leabhar dearg 'COP the book red'.
Nominal predication uses copula is: Is dochtúir í 'COP doctor she'.
DIFFERENT copulas: is (classificatory/identificational) vs ta
(attributive/locational).
Zero copula is impossible.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Finnish (Uralic).
Adjectives are non-verbal: Kirja on punainen 'Book COP red'.
Nominal predication uses verbal copula olla: Han on laakari 'S/he
COP doctor'.
Same copula olla for nominal and locational predication.
Zero copula is impossible.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Quechua (Quechuan).
Adjectives are non-verbal: Wasi hatun-mi 'House big-EVID' with copular
suffix.
Nominal predication uses copular suffixes: Pay yachachiq-mi 'S/he
teacher-EVID' (copula as suffix -mi/-n).
Same strategy for nominal and locational predication.
Zero copula is RESTRICTED: 3rd person present often has zero or reduced
copula, but other persons/tenses require overt copula ka-.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Yoruba (Niger-Congo, Atlantic-Congo).
Adjectives are verbal: Iwe na tobi 'book the big' (adjective inflects
verbally).
Nominal predication uses non-verbal strategy (juxtaposition or pronoun):
O je dokita 'She be doctor' or O dokita ni 'She doctor COP'.
Different strategies for nominal vs locational.
Zero copula is WIDESPREAD for nominal predication.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Thai (Tai-Kadai).
Adjectives are verbal: Nangsue daeng 'Book red' = 'The book is red',
where the adjective takes verbal markers directly.
Nominal predication uses verbal copula pen: Khao pen mor 'S/he COP
doctor'.
Different strategies for nominal vs locational (locational uses yuu 'be.at').
Zero copula is impossible for nominal predication.
Equations
- One or more equations did not get rendered due to their size.
Instances For
All language profiles in the sample.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Does a language have verbal adjectives?
Equations
Instances For
Does a language have a verbal copula for nominal predication?
Instances For
Does a language allow zero copula (restricted or widespread)?
Equations
Instances For
Does a language split nominal and locational predication?
Equations
Instances For
Does a language require a copula in all contexts?
Equations
Instances For
Count of languages in a list matching a predicate.
Equations
- Phenomena.Copulas.Typology.countBy langs pred = (List.filter pred langs).length
Instances For
Number of languages in our sample.
Ch 117 distribution in our sample.
Ch 118 distribution in our sample.
Ch 119 distribution in our sample.
Ch 120 distribution in our sample.
Ch 118: Verbal encoding is the most common strategy for predicative adjectives, exceeding both mixed and nonverbal encoding.
Ch 118: Verbal adjectives account for roughly 39% of the sample (151/386).
Ch 118: Nonverbal encoding (132) is less common than verbal encoding (151) for predicative adjectives, but the three values are fairly balanced across the 386-language sample.
Ch 118: No single predicative adjective strategy is a strict majority.
Ch 119: Using different strategies for nominal and locational predication is more common than using the same strategy. This means that languages typically distinguish "is a doctor" from "is in the room" with different grammatical means.
Ch 119: The split pattern accounts for a clear majority.
Ch 120: "Impossible" (copula always required) is the majority value.
Ch 120: "Impossible" accounts for more than half the sample.
In our sample, languages with verbal adjectives (Ch 117) tend NOT to also have verbal copulas for nouns -- they often use a different strategy for nominal predication. This is because the verbal adjective strategy eliminates the need for a copula with adjectives but does not necessarily provide a copula for nouns.
Count of languages with verbal adjectives AND non-verbal noun predication.
In our sample, languages with restricted zero copula (Ch 120) allow zero copula in present tense but require an overt copula in past tense. Russian, Arabic, Hungarian, and Turkish all exhibit this pattern: present tense allows copula dropping but past requires an overt form.
All restricted-zero-copula languages in our sample have overt past-tense copula forms.
Languages that use non-verbal noun predication (Ch 118) tend to allow zero copula (Ch 120). In our sample, every language with non-verbal noun predication allows zero copula (restricted or widespread).
Languages with verbal adjectives tend to split nominal and locational predication (Ch 119 = different). In our sample, the majority of verbal-adjective languages have different nom-loc strategies.
Languages where the copula is always required (Ch 120 = impossible) are more likely to use the SAME copula for nominal and locational predication (Ch 119 = identical) than languages that allow zero copula.
Among copula-requiring languages in our sample, at least half use the same strategy for both predication types.
Western European languages (English, French, German) share a common copula profile: non-verbal adjectives, verbal nominal copula, and no zero copula. This is the "Standard Average European" pattern.
Equations
Instances For
Cross-chapter interaction: languages with verbal adjectives (Ch 117) and verbal noun predication (Ch 118) are the only languages in our sample that can have verbal predication for BOTH adjectives and nouns with different copula forms.
In our sample, no language has mixed adjectives (Ch 117) AND non-verbal noun predication (Ch 118). Mixed adjective languages tend to have verbal copulas for nouns.
Languages with widespread zero copula (Ch 120) always allow non-verbal nominal predication (Ch 118). This is logically expected: if zero copula is the norm, nominal predication IS non-verbal.
Stassen's Implicational Universal: if a language has non-verbal encoding for nouns (Ch 118), it allows some form of zero copula (Ch 120 = restricted or widespread). In our sample this holds without exception.
Converse of Stassen's implication: if a language has zero copula impossible (Ch 120), then it uses verbal nominal predication (Ch 118). This is the contrapositive of the above.
A split predication language uses different strategies for at least two of: adjectival, nominal, and locational predication.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Count of languages in our sample with some form of split predication.
Spanish and Irish exemplify the clearest nom-loc splits: both have two distinct copulas that partition predicate types.
Mandarin exemplifies a three-way split: adjectives are verbal (no
copula), nouns require shi, locations use zai. Three different
strategies for three predicate types.
Languages that split nominal and locational predication (Ch 119) often use existential verbs for locational predication. In our sample, several nom-loc-split languages have distinct existential/locational forms in their copula inventory.
East and Southeast Asian languages (Japanese, Korean, Mandarin, Thai) all have verbal adjectives. This is an areal feature of the region.
Equations
Instances For
Semitic languages (Arabic, Hebrew) share a copula profile: non-verbal adjectives, non-verbal nominal predication, and restricted zero copula (present tense only).
Equations
Instances For
East Asian languages all split nominal and locational predication.
Distribution of copula types in our sample.
Verbal copulas are the most common copula type in our sample.
F117A: 240 languages (predicative possession).
F118A: 386 languages (predicative adjectives).
F119A: 386 languages (nominal and locational predication).
F120A: 386 languages (zero copula for predicate nominals).
F118A, F119A, F120A all use the same 386-language sample.
F118A distribution: predicative adjectives.
F119A distribution: nominal and locational predication.
F120A distribution: zero copula for predicate nominals.
F117A distribution: predicative possession.