Bach, Brown & Marslen-Wilson (1986) #
@cite{bach-brown-marslen-wilson-1986}
Crossed and Nested Dependencies in German and Dutch: A Psycholinguistic Study. Language and Cognitive Processes, 1(4), 249–262.
Core Finding #
Dutch crossed verb-cluster dependencies (NP₁ NP₂ NP₃ V₁ V₂ V₃) are easier to process than German nested dependencies (NP₁ NP₂ NP₃ V₃ V₂ V₁) at two or more levels of embedding, in both comprehensibility ratings and comprehension accuracy. At one level of embedding (Level 2), German/Participle does not differ from Dutch, though German/Infinitive shows a significant baseline disadvantage across all levels. This confirms @cite{evers-1975}'s intuition that crossed dependencies are easier, with the first controlled experimental evidence.
Incremental Integration Model #
The paper argues qualitatively that crossed dependencies allow incremental
top-down integration while nested dependencies force bottom-up accumulation of
floating propositions. We formalize this via totalIntegrationCost: the
cumulative count of NPs awaiting matrix-connected integration during verb-cluster
processing. This metric is our formalization, not the paper's — they argue
informally about when partial interpretations become available.
The cost ratio nested/crossed is exactly 2 for all n, but the absolute difference n(n−1)/2 grows quadratically — consistent with the finding that the processing difference is undetectable at n=2 (gap = 1) but large at n=3 (gap = 3).
Dependency Length Invariance #
Crossed and nested patterns have identical total NP-verb dependency length (n²). This means the Bach et al. finding cannot be explained by dependency length minimization alone — the advantage of crossed dependencies is about when information becomes available for matrix integration, not about dependency distance.
Formal–Processing Dissociation #
Crossed dependencies require mildly context-sensitive power (@cite{shieber-1985}, @cite{bresnan-etal-1982}) while nested dependencies are context-free, yet crossed is psycholinguistically easier. This refutes models where parsing difficulty tracks the Chomsky hierarchy and provides evidence against push-down-store models of human parsing (@cite{evers-1975}).
NP-verb bindings connected to the matrix verb after k of n verbs heard.
Crossed (Dutch): matrix verb (V₁) arrives first → k bindings top-down. Nested (German): innermost verb first → 0 until matrix (last) → then n.
This counts only matrix-rooted integration. German listeners do build partial bottom-up structure (e.g., NP₃→V₃ after the first verb), but that proposition floats without a matrix root to attach to.
Equations
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.integratedBindings Phenomena.WordOrder.CrossSerial.DependencyPattern.crossSerial x✝¹ x✝ = min x✝ x✝¹
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.integratedBindings Phenomena.WordOrder.CrossSerial.DependencyPattern.nested x✝¹ x✝ = if x✝ ≥ x✝¹ then x✝¹ else 0
Instances For
NPs awaiting matrix-connected integration at verb position k.
Equations
Instances For
Cumulative unintegrated NPs across verb positions 1..n.
Crossed: (n−1) + (n−2) + ··· + 0 = n(n−1)/2 Nested: n + n + ··· + n + 0 = n(n−1)
Equations
Instances For
Level 2 (n=2): minimal gap (1 vs 2).
Level 3 (n=3): gap widens (3 vs 6).
Level 4 (n=4): gap widens further (6 vs 12).
The absolute cost gap grows with embedding depth.
Crossed is strictly cheaper for n ≥ 2.
Proof by element-wise comparison via Finset.sum_lt_sum: at each verb
position k ∈ {1,…,n}, unintegratedAt .crossSerial ≤ unintegratedAt .nested,
with strict inequality at k = 1 (the first verb heard).
Absolute string distance between NP_i (1-indexed) and its verb.
In a string NP₁...NPₙ V?₁...V?ₙ, NP_i is at absolute position i. Crossed: V_i is the i-th verb → position n + i → distance = n. Nested: V_{n+1−i} is the (n+1−i)-th verb → position n + (n+1−i) → distance = 2(n−i) + 1.
Equations
Instances For
Total NP-verb dependency length across all n pairs.
Equations
Instances For
Crossed and nested have identical total dependency length.
Crossed: all distances = n → total = n × n = n². Nested: distances are 2n−1, 2n−3, ..., 3, 1 → total = Σ(2k+1) = n². The Bach et al. finding is therefore NOT about dependency distance.
General case: both patterns yield total distance n².
Crossed dependencies are formally harder (mildly context-sensitive) but psycholinguistically easier — formal complexity ≠ processing complexity.
Two independent arguments against PDA parsing:
- Dutch is comprehensible at Level 2 despite requiring MCS power (a PDA cannot handle crossed deps at any depth)
- Dutch is easier than German at Level 3+ (a PDA predicts nested should be easier or equal)
Integration cost difference is NOT explained by dependency length.
Language group. German was tested with two verb-form versions (infinitive and past participle) due to normative disagreement among informants.
Instances For
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Test sentence comprehensibility ratings × 100 (Table 1). Original scale: 1 = easy, 9 = hard. Levels 1–4 indexed 0–3.
Equations
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 0 = 114
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 1 = 234
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 2 = 542
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 3 = 766
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 0 = 109
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 1 = 277
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 2 = 616
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 3 = 826
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 0 = 124
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 1 = 263
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 2 = 581
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 3 = 766
Instances For
Paraphrase sentence ratings × 100 (Table 1, Levels 2–4 indexed 0–2). Paraphrases express the same propositions using right-branching structure, controlling for propositional complexity.
Equations
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.paraRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 0 = 211
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.paraRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 1 = 406
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.paraRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 2 = 594
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.paraRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 0 = 202
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.paraRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 1 = 413
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.paraRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 2 = 589
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.paraRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 0 = 236
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.paraRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 1 = 385
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.paraRating Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 2 = 562
Instances For
Comprehension accuracy × 100 for Test sentences (Table 3). Questions tested whether each subject NP was correctly associated with its predicate verb phrase. Levels 2–3 indexed 0–1.
Equations
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testComprehension Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 0 = 168
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testComprehension Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 1 = 117
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testComprehension Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 0 = 173
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testComprehension Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 1 = 89
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testComprehension Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 0 = 158
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.testComprehension Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 1 = 79
Instances For
Comprehension accuracy × 100 by NP position at Level 3, Test (Table 4). NP1 = matrix subject (highest clause), NP3 = most deeply embedded.
Equations
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.comprehensionByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 0 = 117
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.comprehensionByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 1 = 83
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.comprehensionByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 2 = 150
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.comprehensionByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 0 = 88
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.comprehensionByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 1 = 67
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.comprehensionByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 2 = 112
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.comprehensionByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 0 = 102
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.comprehensionByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 1 = 38
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.comprehensionByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 2 = 97
Instances For
Test−Paraphrase error rate difference × 100 by NP at Level 3 (Table 5). Higher = more syntactic disruption (Test harder relative to Paraphrase).
Equations
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.errorDiffByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 0 = 11
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.errorDiffByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 1 = 59
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.errorDiffByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.dutch 2 = 0
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.errorDiffByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 0 = 32
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.errorDiffByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 1 = 91
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.errorDiffByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanInf 2 = 41
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.errorDiffByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 0 = 31
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.errorDiffByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 1 = 67
- Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.errorDiffByNP Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986.LangGroup.germanPart 2 = 36
Instances For
At Level 2, German/Participle does not differ from Dutch (spread = 29). German/Infinitive is slightly worse throughout (spread = 43). The paper reports a significant overall Ger/Inf disadvantage but no difference for Ger/Part vs Dutch at Level 2.
At Level 3, Dutch rates Test sentences as easier than both German groups.
At Level 3, Dutch comprehension accuracy exceeds both German groups.
The syntactic complexity effect (Test − Paraphrase) grows faster for both German groups than Dutch from Level 2 to Level 3, paralleling the model's prediction that the integration cost gap grows with depth.
NP2 (middle NP) is hardest for all three groups (Table 4, Test). This is an interference effect: NP2 is distinguished by neither primacy (NP1) nor recency (NP3), making it hardest to retrieve regardless of the dependency pattern.
Dutch advantage is largest for NP3 (most deeply embedded clause).
Dutch shows ZERO Test−Para error for NP3 (errorDiffByNP .dutch 2 = 0), while both German groups show substantial error (41, 36). The paper explains: in Dutch, NP3's verb (V₃) arrives last and integrates into an already-built matrix structure. In German, NP3's verb (V₃) arrives first — the proposition is immediately parseable but floats without a matrix root, so the information decays before it can be used.
The model predicts crossed < nested, the data confirms it, and dependency length cannot explain the difference.