Documentation

Linglib.Phenomena.Copulas.Typology

Copula and Predication Typology (WALS Chapters 117--120) #

@cite{hengeveld-1992} @cite{pustet-2003} @cite{stassen-2013} @cite{haspelmath-2001}

Cross-linguistic data on predication strategies and copulas from four WALS chapters, all authored by Leon @cite{stassen-2013}. These chapters address a cluster of related questions: how languages express predication when the predicate is an adjective, a noun phrase, or a locative, and whether a copular element is needed in each case.

The key insight from Stassen's typological work is that predication is not uniform across predicate types. Many languages use fundamentally different strategies for adjectival predication ("The book is red"), nominal predication ("She is a doctor"), and locational predication ("The cat is on the mat"). The copula, where it exists, is best understood not as a single phenomenon but as a family of strategies that languages deploy differently across predicate types.

Ch 117: Predicative Possession (N = 240) #

How languages express predicative possession ("I have a book"). Five values:

Ch 118: Predicative Adjectives (N = 386) #

How languages express adjectival predication. Three values:

Ch 119: Nominal and Locational Predication (N = 386) #

Whether a language uses the same or different strategy for nominal predication ("is a doctor") and locational predication ("is in the room"). Two values:

The majority pattern is to differentiate: even English, which uses "be" for both, is cross-linguistically unusual in using a single copula.

Ch 120: Zero Copula for Predicate Nominals (N = 386) #

Whether the copula can be absent in nominal predication. Two values:

WALS Ch 117: How a language expresses adjectival predication ("The book is red").

The three-way distinction reflects the categorial status of adjectives in the language. In "verbal" languages, adjectives inflect like verbs and need no copula. In "non-verbal" languages, adjectives are a distinct category requiring a copula. "Mixed" languages have adjectives that split between verbal and non-verbal behavior.

  • verbal : PredAdjStrategy

    Adjectives behave like verbs: they take verbal morphology (tense, aspect, negation) and occur as predicates without a copula. Example: Mandarin shu hong 'book red' = 'The book is red', where hong can take aspect markers directly.

  • mixed : PredAdjStrategy

    Some adjectives are verbal (typically core/frequent ones), others require a copula (typically peripheral/borrowed ones). Example: Japanese has verbal adjectives (i-adjectives: takai 'is expensive') and non-verbal adjectives (na-adjectives: shizuka-da 'quiet-COP').

  • nonVerbal : PredAdjStrategy

    Adjectives are categorially distinct from verbs and require a copula or other linking element for predication. Example: English The book is red requires the copula is.

Instances For
    Equations
    • One or more equations did not get rendered due to their size.
    Instances For

      WALS Ch 118: How a language expresses nominal predication ("She is a doctor").

      The binary distinction captures whether the language uses a verbal copula (a word that inflects like a verb and carries tense/agreement) or a non-verbal strategy (juxtaposition, pronominal copula, or invariant particle).

      • verbal : PredNounStrategy

        Nominal predication uses a verbal copula that inflects for tense, agreement, etc. Example: English She is a doctor, where is is a fully inflecting verb (am/is/are/was/were).

      • nonVerbal : PredNounStrategy

        Nominal predication uses juxtaposition or a non-verbal element (particle, pronoun, etc.), not a copula verb. Example: Russian present tense Ona vrach 'She doctor' = 'She is a doctor' (no copula in present tense).

      Instances For
        Equations
        • One or more equations did not get rendered due to their size.
        Instances For

          WALS Ch 119: Whether a language uses the same or different strategy for nominal predication ("She is a doctor") and locational predication ("The cat is on the mat").

          Many languages use different verbs: e.g., Spanish ser (nominal) vs estar (locational), or have a copula for one but not the other. The "different" value is the majority pattern cross-linguistically.

          • identical : NomLocStrategy

            The same copula or strategy is used for both nominal and locational predication. Example: English She is a doctor / The cat is on the mat -- both use be.

          • different : NomLocStrategy

            Different copulas or strategies are used for nominal vs locational predication. Example: Spanish ser (nominal: Es doctora) vs estar (locational: Esta en la casa).

          Instances For
            Equations
            • One or more equations did not get rendered due to their size.
            Instances For

              WALS Ch 120: Whether the copula can be absent in nominal predication.

              Zero copula is typically conditioned by tense (present only) or person (3rd person only). "Widespread" means zero copula is the default, unrestricted strategy.

              • impossible : ZeroCopulaStatus

                Zero copula is impossible: the copula is always required in nominal predication, regardless of tense, person, or other factors. Example: English *She doctor is ungrammatical.

              • restricted : ZeroCopulaStatus

                Zero copula is possible in restricted contexts: typically present tense, or 3rd person, or in certain clause types. The copula appears in other environments. Example: Russian allows zero copula in present tense (Ona vrach 'She doctor') but requires it in past tense (Ona byla vrach 'She was doctor').

              • widespread : ZeroCopulaStatus

                Zero copula is the normal, widespread, or default strategy. The copula is absent in most environments. Example: Tagalog Doktor siya 'Doctor she' = 'She is a doctor'.

              Instances For
                Equations
                • One or more equations did not get rendered due to their size.
                Instances For

                  Morphosyntactic type of copula, when present.

                  This supplements the WALS classification with a finer-grained typology of copular elements, following @cite{pustet-2003} and @cite{hengeveld-1992}.

                  • verbalCopula : CopulaType

                    Fully inflecting verbal copula: takes tense, agreement, negation like other verbs. Example: English be (am/is/are/was/were).

                  • pronominalCopula : CopulaType

                    Pronominal copula: a pronoun-like element that agrees with the subject. Example: Hebrew hu/hi/hem/hen (he/she/they.m/they.f) used as a copula in present tense.

                  • particle : CopulaType

                    Invariant particle: a non-inflecting element linking subject and predicate. Example: Swahili ni (affirmative copula, invariant).

                  • zero : CopulaType

                    No copula: predication by juxtaposition.

                  Instances For
                    Equations
                    • One or more equations did not get rendered due to their size.
                    Instances For

                      A single row in a WALS frequency table: a category label and its count.

                      Instances For
                        Equations
                        • One or more equations did not get rendered due to their size.
                        Instances For
                          Equations
                          • One or more equations did not get rendered due to their size.
                          Instances For
                            Equations
                            Instances For

                              Chapter 117 distribution: predicative possession (N = 240).

                              Equations
                              • One or more equations did not get rendered due to their size.
                              Instances For

                                Chapter 118 distribution: predicative adjectives (N = 386).

                                Equations
                                • One or more equations did not get rendered due to their size.
                                Instances For

                                  Chapter 119 distribution: nominal and locational predication (N = 386).

                                  Equations
                                  • One or more equations did not get rendered due to their size.
                                  Instances For

                                    Chapter 120 distribution: zero copula for predicate nominals (N = 386).

                                    Equations
                                    • One or more equations did not get rendered due to their size.
                                    Instances For

                                      Chapters 118, 119, 120 use the same 386-language sample. Chapter 117 uses a smaller 240-language sample.

                                      A language's copula and predication profile across WALS Chapters 117--120.

                                      • language : String

                                        Language name.

                                      • iso : String

                                        ISO 639-3 code.

                                      • family : String

                                        Language family.

                                      • predAdj : PredAdjStrategy

                                        Ch 117: How adjectival predication is expressed.

                                      • predNoun : PredNounStrategy

                                        Ch 118: How nominal predication is expressed.

                                      • Ch 119: Whether nominal and locational predication use the same or different strategy.

                                      • zeroCopula : ZeroCopulaStatus

                                        Ch 120: Whether zero copula is possible in nominal predication.

                                      • copulaType : CopulaType

                                        Primary copula type in the language (supplementary).

                                      • copulaForms : List String

                                        Illustrative copula form(s).

                                      • notes : String

                                        Notes on the predication system.

                                      Instances For
                                        Equations
                                        • One or more equations did not get rendered due to their size.
                                        Instances For

                                          English (Indo-European, Germanic). Adjectives are non-verbal: require copula be ("The book is red"). Nominal predication uses the verbal copula be ("She is a doctor"). Same copula be for both nominal and locational predication. Zero copula is impossible: *She doctor, *Book red.

                                          Equations
                                          • One or more equations did not get rendered due to their size.
                                          Instances For

                                            French (Indo-European, Romance). Adjectives are non-verbal: Le livre est rouge ('The book is red'). Nominal predication uses verbal copula etre: Elle est medecin. Same copula etre for nominal and locational predication. Zero copula is impossible.

                                            Equations
                                            • One or more equations did not get rendered due to their size.
                                            Instances For

                                              German (Indo-European, Germanic). Adjectives are non-verbal: Das Buch ist rot ('The book is red'). Nominal predication uses verbal copula sein: Sie ist Arztin. Same copula for nominal and locational predication. Zero copula is impossible.

                                              Equations
                                              • One or more equations did not get rendered due to their size.
                                              Instances For

                                                Spanish (Indo-European, Romance). Adjectives are non-verbal: El libro es rojo ('The book is red'). Nominal predication uses verbal copula ser: Ella es doctora. DIFFERENT copulas for nominal vs locational: ser (permanent/nominal) vs estar (temporary/locational). Zero copula is impossible.

                                                Equations
                                                • One or more equations did not get rendered due to their size.
                                                Instances For

                                                  Russian (Indo-European, Slavic). Adjectives are non-verbal in present tense (zero copula: Kniga krasnaja 'Book red'), verbal copula byt' in past (Kniga byla krasnaja). Nominal predication uses zero copula in present: Ona vrach ('She doctor'). Same strategy for nominal and locational (both use zero copula in present). Zero copula is RESTRICTED to present tense: past and future require byt'.

                                                  Equations
                                                  • One or more equations did not get rendered due to their size.
                                                  Instances For

                                                    Arabic (Standard) (Afro-Asiatic, Semitic). Adjectives are non-verbal: al-kitab ahmar 'the-book red'. Nominal predication is non-verbal in present: hiya tabiba 'she doctor'. Different strategies for nominal vs locational (locational uses fi 'in' with possible copula). Zero copula is RESTRICTED: present tense allows zero, past requires kaana.

                                                    Equations
                                                    • One or more equations did not get rendered due to their size.
                                                    Instances For

                                                      Hebrew (Afro-Asiatic, Semitic). Adjectives are non-verbal: ha-sefer adom 'the-book red'. Nominal predication uses a pronominal copula in present: hi rofaa or hi hi rofaa (she COP.3F doctor). Past/future use haya. Different strategies for nominal vs locational. Zero copula is RESTRICTED: present allows zero/pronominal, past requires haya.

                                                      Equations
                                                      • One or more equations did not get rendered due to their size.
                                                      Instances For

                                                        Hungarian (Uralic). Adjectives are non-verbal: A konyv piros 'The book red'. Nominal predication: zero copula in 3rd person present (O orvos 'She doctor'), verbal copula van in other persons/tenses. Same strategy for nominal and locational. Zero copula is RESTRICTED to 3rd person present tense.

                                                        Equations
                                                        • One or more equations did not get rendered due to their size.
                                                        Instances For

                                                          Japanese (Japonic). Adjectives are MIXED: i-adjectives are verbal (takai 'is.expensive', takes tense: takakatta 'was.expensive'), but na-adjectives require copula da (shizuka-da 'quiet-COP'). Nominal predication uses copula da/desu: Kanojo wa isha da 'She TOP doctor COP'. Different strategies for nominal and locational (locational uses ni iru/aru). Zero copula is impossible in standard forms (copula always required for nominals).

                                                          Equations
                                                          • One or more equations did not get rendered due to their size.
                                                          Instances For

                                                            Korean (Koreanic). Adjectives are verbal: chaek-i ppalgah-da 'book-NOM red-DECL' with verbal inflection on the adjective. No copula needed. Nominal predication uses copula -i-da: geunyeo-neun uisa-i-da 'she-TOP doctor-COP-DECL'. Different strategies for nominal vs locational (locational uses iss-da 'exist-DECL'). Zero copula is impossible for nominals.

                                                            Equations
                                                            • One or more equations did not get rendered due to their size.
                                                            Instances For

                                                              Mandarin Chinese (Sino-Tibetan). Adjectives are verbal: shu hong 'book red' = 'The book is red'. Adjectives can take aspect markers (le, guo) and negation (bu) directly. Nominal predication uses copula shi: ta shi yisheng 'she COP doctor'. Different strategies for nominal vs locational (locational uses zai 'at'). Zero copula is impossible for nominal predication (shi is required).

                                                              Equations
                                                              • One or more equations did not get rendered due to their size.
                                                              Instances For

                                                                Turkish (Turkic). Adjectives are verbal: Kitap kirmizi 'Book red' = 'The book is red'. Adjective takes verbal suffixes for tense. Nominal predication uses a copular suffix: O doktor-dur 'S/he doctor-COP'. Different strategies for nominal vs locational (locational uses postposition + var/yok). Zero copula is RESTRICTED: 3rd person present often has zero copula (O doktor 'S/he doctor'), but past requires -di/-ydi.

                                                                Equations
                                                                • One or more equations did not get rendered due to their size.
                                                                Instances For

                                                                  Swahili (Niger-Congo, Bantu). Adjectives are verbal: -zuri 'good' takes verbal agreement prefixes (m-zuri 'good (cl.1)'). Predication: Kitabu ni kizuri uses ni. Nominal predication uses particle ni: Yeye ni daktari 'S/he COP doctor'. Different strategies for nominal vs locational (locational uses -ko locative suffixes and different verbal forms). Zero copula is WIDESPREAD in some constructions: Yeye daktari is acceptable in many contexts.

                                                                  Equations
                                                                  • One or more equations did not get rendered due to their size.
                                                                  Instances For

                                                                    Hindi-Urdu (Indo-European, Indo-Aryan). Adjectives are non-verbal: kitab lal hai 'book red COP.PRS'. Nominal predication uses copula hona: vo daktar hai 'she doctor COP.PRS'. Same copula hona for nominal and locational predication. Zero copula is impossible in standard Hindi-Urdu.

                                                                    Equations
                                                                    • One or more equations did not get rendered due to their size.
                                                                    Instances For

                                                                      Tagalog (Austronesian). Adjectives are verbal: maganda ang babae 'beautiful ANG woman'. Nominal predication is non-verbal: doktor siya 'doctor she'. Same strategy for nominal and locational. Zero copula is WIDESPREAD: doktor siya with no overt copula is the normal strategy.

                                                                      Equations
                                                                      • One or more equations did not get rendered due to their size.
                                                                      Instances For

                                                                        Irish (Indo-European, Celtic). Adjectives are non-verbal: Ta an leabhar dearg 'COP the book red'. Nominal predication uses copula is: Is dochtúir í 'COP doctor she'. DIFFERENT copulas: is (classificatory/identificational) vs ta (attributive/locational). Zero copula is impossible.

                                                                        Equations
                                                                        • One or more equations did not get rendered due to their size.
                                                                        Instances For

                                                                          Finnish (Uralic). Adjectives are non-verbal: Kirja on punainen 'Book COP red'. Nominal predication uses verbal copula olla: Han on laakari 'S/he COP doctor'. Same copula olla for nominal and locational predication. Zero copula is impossible.

                                                                          Equations
                                                                          • One or more equations did not get rendered due to their size.
                                                                          Instances For

                                                                            Quechua (Quechuan). Adjectives are non-verbal: Wasi hatun-mi 'House big-EVID' with copular suffix. Nominal predication uses copular suffixes: Pay yachachiq-mi 'S/he teacher-EVID' (copula as suffix -mi/-n). Same strategy for nominal and locational predication. Zero copula is RESTRICTED: 3rd person present often has zero or reduced copula, but other persons/tenses require overt copula ka-.

                                                                            Equations
                                                                            • One or more equations did not get rendered due to their size.
                                                                            Instances For

                                                                              Yoruba (Niger-Congo, Atlantic-Congo). Adjectives are verbal: Iwe na tobi 'book the big' (adjective inflects verbally). Nominal predication uses non-verbal strategy (juxtaposition or pronoun): O je dokita 'She be doctor' or O dokita ni 'She doctor COP'. Different strategies for nominal vs locational. Zero copula is WIDESPREAD for nominal predication.

                                                                              Equations
                                                                              • One or more equations did not get rendered due to their size.
                                                                              Instances For

                                                                                Thai (Tai-Kadai). Adjectives are verbal: Nangsue daeng 'Book red' = 'The book is red', where the adjective takes verbal markers directly. Nominal predication uses verbal copula pen: Khao pen mor 'S/he COP doctor'. Different strategies for nominal vs locational (locational uses yuu 'be.at'). Zero copula is impossible for nominal predication.

                                                                                Equations
                                                                                • One or more equations did not get rendered due to their size.
                                                                                Instances For

                                                                                  All language profiles in the sample.

                                                                                  Equations
                                                                                  • One or more equations did not get rendered due to their size.
                                                                                  Instances For

                                                                                    Does a language split nominal and locational predication?

                                                                                    Equations
                                                                                    Instances For

                                                                                      Count of languages in a list matching a predicate.

                                                                                      Equations
                                                                                      Instances For

                                                                                        Number of languages in our sample.

                                                                                        Ch 119: Using different strategies for nominal and locational predication is more common than using the same strategy. This means that languages typically distinguish "is a doctor" from "is in the room" with different grammatical means.

                                                                                        In our sample, languages with verbal adjectives (Ch 117) tend NOT to also have verbal copulas for nouns -- they often use a different strategy for nominal predication. This is because the verbal adjective strategy eliminates the need for a copula with adjectives but does not necessarily provide a copula for nouns.

                                                                                        Count of languages with verbal adjectives AND non-verbal noun predication.

                                                                                        In our sample, languages with restricted zero copula (Ch 120) allow zero copula in present tense but require an overt copula in past tense. Russian, Arabic, Hungarian, and Turkish all exhibit this pattern: present tense allows copula dropping but past requires an overt form.

                                                                                        All restricted-zero-copula languages in our sample have overt past-tense copula forms.

                                                                                        Languages that use non-verbal noun predication (Ch 118) tend to allow zero copula (Ch 120). In our sample, every language with non-verbal noun predication allows zero copula (restricted or widespread).

                                                                                        theorem Phenomena.Copulas.Typology.verbal_adj_tends_nomloc_split :
                                                                                        have verbalAdj := List.filter (fun (x : CopulaProfile) => x.hasVerbalAdj) allLanguages; have splitNomLoc := List.filter (fun (x : CopulaProfile) => x.hasNomLocSplit) verbalAdj; splitNomLoc.length > verbalAdj.length / 2

                                                                                        Languages with verbal adjectives tend to split nominal and locational predication (Ch 119 = different). In our sample, the majority of verbal-adjective languages have different nom-loc strategies.

                                                                                        Languages where the copula is always required (Ch 120 = impossible) are more likely to use the SAME copula for nominal and locational predication (Ch 119 = identical) than languages that allow zero copula.

                                                                                        Among copula-requiring languages in our sample, at least half use the same strategy for both predication types.

                                                                                        Western European languages (English, French, German) share a common copula profile: non-verbal adjectives, verbal nominal copula, and no zero copula. This is the "Standard Average European" pattern.

                                                                                        Equations
                                                                                        Instances For

                                                                                          Cross-chapter interaction: languages with verbal adjectives (Ch 117) and verbal noun predication (Ch 118) are the only languages in our sample that can have verbal predication for BOTH adjectives and nouns with different copula forms.

                                                                                          In our sample, no language has mixed adjectives (Ch 117) AND non-verbal noun predication (Ch 118). Mixed adjective languages tend to have verbal copulas for nouns.

                                                                                          Languages with widespread zero copula (Ch 120) always allow non-verbal nominal predication (Ch 118). This is logically expected: if zero copula is the norm, nominal predication IS non-verbal.

                                                                                          Stassen's Implicational Universal: if a language has non-verbal encoding for nouns (Ch 118), it allows some form of zero copula (Ch 120 = restricted or widespread). In our sample this holds without exception.

                                                                                          Converse of Stassen's implication: if a language has zero copula impossible (Ch 120), then it uses verbal nominal predication (Ch 118). This is the contrapositive of the above.

                                                                                          A split predication language uses different strategies for at least two of: adjectival, nominal, and locational predication.

                                                                                          Equations
                                                                                          • One or more equations did not get rendered due to their size.
                                                                                          Instances For

                                                                                            Count of languages in our sample with some form of split predication.

                                                                                            Spanish and Irish exemplify the clearest nom-loc splits: both have two distinct copulas that partition predicate types.

                                                                                            Mandarin exemplifies a three-way split: adjectives are verbal (no copula), nouns require shi, locations use zai. Three different strategies for three predicate types.

                                                                                            Languages that split nominal and locational predication (Ch 119) often use existential verbs for locational predication. In our sample, several nom-loc-split languages have distinct existential/locational forms in their copula inventory.

                                                                                            East and Southeast Asian languages (Japanese, Korean, Mandarin, Thai) all have verbal adjectives. This is an areal feature of the region.

                                                                                            Equations
                                                                                            Instances For

                                                                                              Semitic languages (Arabic, Hebrew) share a copula profile: non-verbal adjectives, non-verbal nominal predication, and restricted zero copula (present tense only).

                                                                                              Equations
                                                                                              Instances For

                                                                                                East Asian languages all split nominal and locational predication.

                                                                                                F117A: 240 languages (predicative possession).

                                                                                                F118A: 386 languages (predicative adjectives).

                                                                                                F119A: 386 languages (nominal and locational predication).

                                                                                                F120A: 386 languages (zero copula for predicate nominals).