@cite{futrell-gibson-2020}: Crosslinguistic Dependency Length Data
@cite{futrell-gibson-2020} @cite{zaslavsky-hu-levy-2020}
Empirical data from Table 2 of @cite{futrell-gibson-2020} "Dependency locality as an explanatory principle for word order", Language 96(2):371–412.
53 languages from Universal Dependencies corpora, measuring:
- Proportion of head-final dependencies
- Mean dependency length at sentence lengths 10, 15, 20
All values are scaled integers to avoid Float (permille for proportions, × 100 for dependency lengths).
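To make the scaling convention concrete, here is a minimal Lean sketch that renders scaled naturals back into readable decimals. The helper names `permilleToPercent` and `scaled100ToDecimal` are hypothetical and are not part of the documented module.

```lean
/-- Render a permille value as a percentage with one decimal place. -/
def permilleToPercent (n : Nat) : String :=
  s!"{n / 10}.{n % 10}%"

/-- Render a value scaled by 100 with two decimal places. -/
def scaled100ToDecimal (n : Nat) : String :=
  let frac := n % 100
  -- Pad the fractional part so that 205 renders as "2.05", not "2.5".
  let fracStr := if frac < 10 then s!"0{frac}" else toString frac
  s!"{n / 100}.{fracStr}"

#eval permilleToPercent 890    -- "89.0%"
#eval scaled100ToDecimal 245   -- "2.45"
#eval scaled100ToDecimal 205   -- "2.05"
```

Keeping the stored values as `Nat` means comparisons between languages stay decidable by plain computation; conversion to decimals is only needed for display.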
Key Empirical Finding
Head-final languages (Japanese, Korean, Turkish, Hindi) have systematically higher mean dependency lengths than head-initial languages (Arabic, Indonesian, Romanian) when sentence length is held fixed. This is predicted by dependency length minimization (DLM) theory: head-final order combined with right-branching structures creates longer dependencies.
Crosslinguistic dependency length data for a single language.
Values are scaled integers:
- propHeadFinal: × 1000 (permille), e.g., 890 = 89.0% head-final
- depLengthAt10/15/20: × 100, e.g., 245 = 2.45 mean dep length
- name : String
- isoCode : String
- family : String
- propHeadFinal : Nat
- depLengthAt10 : Nat
- depLengthAt15 : Nat
- depLengthAt20 : Nat
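The field list above corresponds to a structure declaration along the following lines. This is a sketch reconstructed from the rendered fields: the `Sketch` namespace, the `Repr` instance, and the `exampleLang` entry are assumptions, and the numbers in `exampleLang` are illustrative placeholders, not values from Table 2.

```lean
namespace Sketch

/-- Per-language record; all numeric fields are scaled naturals. -/
structure LanguageDLM where
  name : String
  isoCode : String
  family : String
  propHeadFinal : Nat   -- permille: 890 means 89.0% head-final
  depLengthAt10 : Nat   -- × 100: 245 means a mean dependency length of 2.45
  depLengthAt15 : Nat   -- × 100
  depLengthAt20 : Nat   -- × 100
  deriving Repr, BEq    -- `Repr` is an assumption made for this sketch

/-- Placeholder entry; the numbers are illustrative, NOT taken from Table 2. -/
def exampleLang : LanguageDLM :=
  { name := "Example", isoCode := "xx", family := "Examplean",
    propHeadFinal := 890, depLengthAt10 := 245,
    depLengthAt15 := 280, depLengthAt20 := 310 }

#eval exampleLang.propHeadFinal    -- 890
#eval exampleLang == exampleLang   -- true

end Sketch
```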
Equations
- Phenomena.WordOrder.DependencyLength.FutrellEtAl2020.instBEqLanguageDLM.beq x✝¹ x✝ = false
A language is predominantly head-final if more than 50% of its dependencies are head-final (propHeadFinal > 500 permille).
Equations
- l.isHeadFinal = decide (l.propHeadFinal > 500)
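A language is predominantly head-initial if at most 50% of its dependencies are head-final (propHeadFinal ≤ 500 permille), so a language at exactly 50% counts as head-initial.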
Equations
- l.isHeadInitial = decide (l.propHeadFinal ≤ 500)
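The two predicates above can be sketched as follows. `Lang` is a trimmed, hypothetical stand-in for the documented structure that keeps only the field the predicates read, and the test values are chosen purely to probe the 500-permille boundary.

```lean
namespace Sketch

/-- Trimmed stand-in for the documented structure (only the field used here). -/
structure Lang where
  name : String
  propHeadFinal : Nat  -- permille

def Lang.isHeadFinal (l : Lang) : Bool := decide (l.propHeadFinal > 500)
def Lang.isHeadInitial (l : Lang) : Bool := decide (l.propHeadFinal ≤ 500)

-- Hypothetical values on either side of the boundary.
#eval ({ name := "A", propHeadFinal := 501 } : Lang).isHeadFinal    -- true
#eval ({ name := "B", propHeadFinal := 500 } : Lang).isHeadFinal    -- false
#eval ({ name := "B", propHeadFinal := 500 } : Lang).isHeadInitial  -- true

end Sketch
```

Because one predicate uses `>` and the other `≤` against the same threshold, every language satisfies exactly one of them.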
Arabic (Afro-Asiatic, head-initial, VSO/SVO)
Basque (isolate, head-final, SOV)
Bulgarian (Indo-European, head-initial, SVO)
Chinese (Sino-Tibetan, mixed)
Czech (Indo-European, head-initial, SVO)
Danish (Indo-European, head-initial, SVO)
Dutch (Indo-European, mixed V2)
English (Indo-European, head-initial, SVO)
Estonian (Uralic, mixed)
Finnish (Uralic, head-final, SVO)
French (Indo-European, head-initial, SVO)
German (Indo-European, mixed V2/SOV)
Greek (Indo-European, head-initial, SVO)
Hebrew (Afro-Asiatic, head-initial, SVO)
Hindi (Indo-European, head-final, SOV)
Hungarian (Uralic, head-final)
Indonesian (Austronesian, head-initial, SVO)
Italian (Indo-European, head-initial, SVO)
Japanese (Japonic, head-final, SOV)
Korean (Koreanic, head-final, SOV)
Latin (Indo-European, head-final, SOV)
Norwegian (Indo-European, head-initial, SVO)
Persian (Indo-European, head-final, SOV)
Polish (Indo-European, head-initial, SVO)
Portuguese (Indo-European, head-initial, SVO)
Romanian (Indo-European, head-initial, SVO)
Russian (Indo-European, mixed)
Spanish (Indo-European, head-initial, SVO)
Swedish (Indo-European, head-initial, SVO)
Tamil (Dravidian, head-final, SOV)
Turkish (Turkic, head-final, SOV)
Urdu (Indo-European, head-final, SOV)
Vietnamese (Austroasiatic, head-initial, SVO)
Representative subset of 32 languages from Table 2.
Head-final languages in the dataset.
Head-initial languages in the dataset.
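Since the rendered equations for the two subsets are not shown, the following is only a sketch of one plausible construction: filtering the full list by the two predicates. The names `toyLanguages`, `toyHeadFinal`, and `toyHeadInitial` are hypothetical, and the entries are placeholders rather than values from Table 2.

```lean
namespace Sketch

structure Lang where
  name : String
  propHeadFinal : Nat  -- permille

def Lang.isHeadFinal (l : Lang) : Bool := decide (l.propHeadFinal > 500)
def Lang.isHeadInitial (l : Lang) : Bool := decide (l.propHeadFinal ≤ 500)

/-- Toy dataset; names and numbers are placeholders, NOT values from Table 2. -/
def toyLanguages : List Lang :=
  [ { name := "A", propHeadFinal := 880 },
    { name := "B", propHeadFinal := 310 },
    { name := "C", propHeadFinal := 500 } ]

def toyHeadFinal : List Lang := toyLanguages.filter (fun l => l.isHeadFinal)
def toyHeadInitial : List Lang := toyLanguages.filter (fun l => l.isHeadInitial)

#eval toyHeadFinal.map (fun l => l.name)    -- ["A"]
#eval toyHeadInitial.map (fun l => l.name)  -- ["B", "C"]

end Sketch
```

Deriving both subsets from a single list keeps them in sync with the data: adding a language to the list automatically places it in exactly one subset.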
Mean dependency length at sentence length 10 for the head-final subset.
Mean dependency length at sentence length 10 for the head-initial subset.
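A subset mean over scaled naturals can be sketched as below. This assumes an integer mean with truncating division, which may differ from how the documented definitions are actually computed. `meanScaled` is a hypothetical helper, and the input lists are placeholders, not values from Table 2.

```lean
namespace Sketch

/-- Integer mean of scaled values; returns 0 for an empty list.
    `Nat` division truncates, so the result is rounded down. -/
def meanScaled (xs : List Nat) : Nat :=
  if xs.isEmpty then 0 else xs.foldl (· + ·) 0 / xs.length

-- Placeholder depLengthAt10 values (× 100), NOT taken from Table 2.
def toyHeadFinalAt10 : List Nat := [250, 240, 260]
def toyHeadInitialAt10 : List Nat := [220, 215, 230]

#eval meanScaled toyHeadFinalAt10    -- 250  (750 / 3)
#eval meanScaled toyHeadInitialAt10  -- 221  (665 / 3, rounded down)

end Sketch
```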
Head-final languages have higher mean dep length at sentence length 10.
This is the core empirical finding: head-final languages systematically exhibit longer dependencies, consistent with DLM theory's prediction that consistently head-final order creates longer dependencies when combined with right-branching structure.
The same pattern holds at sentence length 20: head-final languages have a higher mean dependency length than head-initial languages.
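Comparisons like the two above can be stated over concrete data and discharged by computation. The sketch below does this on a toy dataset; every name is hypothetical and the numbers are placeholders, not values from Table 2. It compares means by cross-multiplication, which is an assumption of this sketch rather than the documented method.

```lean
namespace Sketch

structure Lang where
  name : String
  propHeadFinal : Nat   -- permille
  depLengthAt10 : Nat   -- × 100
  depLengthAt20 : Nat   -- × 100

/-- Toy data; placeholders, NOT values from Table 2. -/
def toyLanguages : List Lang :=
  [ { name := "A", propHeadFinal := 880, depLengthAt10 := 250, depLengthAt20 := 330 },
    { name := "B", propHeadFinal := 790, depLengthAt10 := 240, depLengthAt20 := 310 },
    { name := "C", propHeadFinal := 310, depLengthAt10 := 220, depLengthAt20 := 270 },
    { name := "D", propHeadFinal := 280, depLengthAt10 := 215, depLengthAt20 := 260 } ]

def headFinal : List Lang :=
  toyLanguages.filter (fun l => decide (l.propHeadFinal > 500))
def headInitial : List Lang :=
  toyLanguages.filter (fun l => decide (l.propHeadFinal ≤ 500))

/-- Sum of one numeric field over a list of languages. -/
def total (f : Lang → Nat) (ls : List Lang) : Nat :=
  (ls.map f).foldl (· + ·) 0

/-- mean(initial) < mean(final), cross-multiplied to avoid truncating division:
    sumI / nI < sumF / nF  ⟺  sumI * nF < sumF * nI  (for nonempty subsets). -/
def finalLonger (f : Lang → Nat) : Bool :=
  decide (total f headInitial * headFinal.length
            < total f headFinal * headInitial.length)

theorem toy_headFinal_longer_at10 : finalLonger Lang.depLengthAt10 = true := by
  decide

theorem toy_headFinal_longer_at20 : finalLonger Lang.depLengthAt20 = true := by
  decide

end Sketch
```

Cross-multiplying keeps the comparison exact in `Nat`; comparing two truncated integer means could hide a small real difference.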
Japanese has the highest dep length at length 20 among all languages.
Indonesian has the lowest dep length at length 10 among all languages.
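Extremum claims of this kind can be phrased as a bounded check over the whole list. The following sketch uses hypothetical entries `langA`, `langB`, `langC` with placeholder numbers (not values from Table 2), and `List.all` as one possible way to state the claims.

```lean
namespace Sketch

structure Lang where
  name : String
  depLengthAt10 : Nat  -- × 100
  depLengthAt20 : Nat  -- × 100

-- Placeholder entries, NOT values from Table 2.
def langA : Lang := { name := "A", depLengthAt10 := 250, depLengthAt20 := 330 }
def langB : Lang := { name := "B", depLengthAt10 := 230, depLengthAt20 := 290 }
def langC : Lang := { name := "C", depLengthAt10 := 210, depLengthAt20 := 260 }

def toyLanguages : List Lang := [langA, langB, langC]

/-- `langA` attains the maximum at sentence length 20. -/
theorem langA_highest_at20 :
    toyLanguages.all (fun l => decide (l.depLengthAt20 ≤ langA.depLengthAt20)) = true := by
  decide

/-- `langC` attains the minimum at sentence length 10. -/
theorem langC_lowest_at10 :
    toyLanguages.all (fun l => decide (langC.depLengthAt10 ≤ l.depLengthAt10)) = true := by
  decide

end Sketch
```

With the non-strict `≤`, these statements allow ties; a strict version would need to exclude the extremal language itself from the comparison.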
The head-finality gap increases with sentence length: the difference in mean dep length between head-final and head-initial languages is larger at length 20 than at length 10.
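The growing-gap claim can be sketched without subtraction. Over the integers, (F20 - I20) > (F10 - I10) is equivalent to F10 + I20 < F20 + I10, and the rearranged form avoids truncated `Nat` subtraction. The constants below are hypothetical subset means with placeholder values, not values from Table 2.

```lean
namespace Sketch

-- Placeholder subset means (× 100), NOT values from Table 2.
def meanFinalAt10 : Nat := 250
def meanInitialAt10 : Nat := 221
def meanFinalAt20 : Nat := 320
def meanInitialAt20 : Nat := 265

/-- The gap at length 20 exceeds the gap at length 10, stated as
    F10 + I20 < F20 + I10 so that `Nat` subtraction never appears. -/
theorem toy_gap_grows :
    meanFinalAt10 + meanInitialAt20 < meanFinalAt20 + meanInitialAt10 := by
  decide

end Sketch
```

Here the toy gaps are 29 at length 10 and 55 at length 20, so the rearranged inequality 515 < 541 holds.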