Form-Frequency Correspondence @cite{haspelmath-2021} #
@cite{haspelmath-2021}'s deeper explanation of argument-coding splits: the Role-Reference Association Universal (Universal 1) itself reduces to a general cognitive tendency for frequent expressions to be short.
The key chain of explanation:
Frequency asymmetry: Some role-reference combinations are more frequent than others (e.g., "I saw him" is more frequent than "He saw me"; animate agents are more frequent than inanimate agents).
Form-frequency correspondence: More frequent expressions tend to get shorter forms. This is a diachronic process — frequent forms erode phonologically and resist analogical extension of explicit markers.
Coding asymmetry: The "usual" role-reference associations (which ARE the frequent ones) therefore get shorter (often zero) coding, while the "unusual" associations get longer (overt) coding.
This explains WHY the monotonicity patterns in DOM/DSM/differential indexing exist: they track frequency gradients on the prominence scales.
Relative coding length of an argument expression.
Haspelmath's claim is about relative length, not absolute morpheme counts. Zero marking is shorter than overt marking; shorter overt markers are shorter than longer ones.
- zero : CodingLength
Zero coding (no overt marker)
- short : CodingLength
Short overt coding (e.g., clitic, monosyllabic affix)
- long : CodingLength
Long overt coding (e.g., full adposition, bisyllabic affix)
Instances For
Equations
- Core.FormFrequency.instBEqCodingLength.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Equations
Equations
- One or more equations did not get rendered due to their size.
Instances For
Numeric rank: zero (0) < short (1) < long (2).
Equations
Instances For
The form-frequency correspondence predicate: a coding system respects the form-frequency correspondence if more frequent role-reference combinations receive shorter (or equal) coding.
freq returns relative frequency (higher = more frequent).
coding returns the coding length for that combination.
The predicate holds when: freq(x) > freq(y) → coding(x) ≤ coding(y).
Equations
Instances For
The form-frequency correspondence applied to argument coding: usual role-reference associations (high frequency) should get shorter coding than unusual ones (low frequency).
This is the core claim of @cite{haspelmath-2021}: differential marking patterns are explained by frequency asymmetries, not by grammatical function per se.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Haspelmath's bridge claim: for P/T arguments, prominence correlates positively with unusualness (and thus with coding length). That is, the frequency of a P with a given prominence is inversely related to prominence rank.
For A/R arguments, the relationship is reversed: frequency is directly related to prominence rank.
This function returns a frequency proxy based on prominence rank and role. It captures the claim that A-prominence is positively correlated with frequency while P-prominence is negatively correlated.
Equations
- Core.FormFrequency.frequencyProxy Core.Prominence.ArgumentRole.A a d = Core.Prominence.prominenceRank a d
- Core.FormFrequency.frequencyProxy Core.Prominence.ArgumentRole.R a d = Core.Prominence.prominenceRank a d
- Core.FormFrequency.frequencyProxy Core.Prominence.ArgumentRole.P a d = 6 - Core.Prominence.prominenceRank a d
- Core.FormFrequency.frequencyProxy Core.Prominence.ArgumentRole.T a d = 6 - Core.Prominence.prominenceRank a d
- Core.FormFrequency.frequencyProxy Core.Prominence.ArgumentRole.S a d = 3
Instances For
The frequency proxy predicts that usual associations are more frequent.
Verb voice direction for direct/inverse systems. Direct forms mark downstream (usual) scenarios; inverse forms mark upstream (unusual) scenarios with morphologically more complex marking.
- direct : VoiceDirection
- inverse : VoiceDirection
Instances For
Equations
- Core.FormFrequency.instBEqVoiceDirection.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Ditransitive frame alternation. Double-object construction is shorter; prepositional dative is longer.
- doubleObject : DitransitiveFrame
- prepositionalDative : DitransitiveFrame
Instances For
Equations
- Core.FormFrequency.instBEqDitransitiveFrame.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Double-object is short coding; prepositional dative is long coding.
Equations
Instances For
Scenario-level form-frequency correspondence: higher frequency-class scenarios should get shorter-or-equal coding.
Equations
- One or more equations did not get rendered due to their size.