Documentation

Linglib.Phenomena.WordOrder.Studies.BachBrownMarslenWilson1986

Bach, Brown & Marslen-Wilson (1986) #

@cite{bach-brown-marslen-wilson-1986}

Crossed and Nested Dependencies in German and Dutch: A Psycholinguistic Study. Language and Cognitive Processes, 1(4), 249–262.

Core Finding #

Dutch crossed verb-cluster dependencies (NP₁ NP₂ NP₃ V₁ V₂ V₃) are easier to process than German nested dependencies (NP₁ NP₂ NP₃ V₃ V₂ V₁) at two or more levels of embedding, in both comprehensibility ratings and comprehension accuracy. At one level of embedding (Level 2), German/Participle does not differ from Dutch, though German/Infinitive shows a significant baseline disadvantage across all levels. This confirms @cite{evers-1975}'s intuition that crossed dependencies are easier, with the first controlled experimental evidence.

Incremental Integration Model #

The paper argues qualitatively that crossed dependencies allow incremental top-down integration while nested dependencies force bottom-up accumulation of floating propositions. We formalize this via totalIntegrationCost: the cumulative count of NPs awaiting matrix-connected integration during verb-cluster processing. This metric is our formalization, not the paper's — they argue informally about when partial interpretations become available.

The cost ratio nested/crossed is exactly 2 for all n, but the absolute difference n(n−1)/2 grows quadratically — consistent with the finding that the processing difference is undetectable at n=2 (gap = 1) but large at n=3 (gap = 3).

Dependency Length Invariance #

Crossed and nested patterns have identical total NP-verb dependency length (n²). This means the Bach et al. finding cannot be explained by dependency length minimization alone — the advantage of crossed dependencies is about when information becomes available for matrix integration, not about dependency distance.

Formal–Processing Dissociation #

Crossed dependencies require mildly context-sensitive power (@cite{shieber-1985}, @cite{bresnan-etal-1982}) while nested dependencies are context-free, yet crossed is psycholinguistically easier. This refutes models where parsing difficulty tracks the Chomsky hierarchy and provides evidence against push-down-store models of human parsing (@cite{evers-1975}).

NP-verb bindings connected to the matrix verb after k of n verbs heard.

Crossed (Dutch): matrix verb (V₁) arrives first → k bindings top-down. Nested (German): innermost verb first → 0 until matrix (last) → then n.

This counts only matrix-rooted integration. German listeners do build partial bottom-up structure (e.g., NP₃→V₃ after the first verb), but that proposition floats without a matrix root to attach to.

Equations
Instances For

    Cumulative unintegrated NPs across verb positions 1..n.

    Crossed: (n−1) + (n−2) + ··· + 0 = n(n−1)/2 Nested: n + n + ··· + n + 0 = n(n−1)

    Equations
    Instances For

      Crossed is strictly cheaper for n ≥ 2.

      Proof by element-wise comparison via Finset.sum_lt_sum: at each verb position k ∈ {1,…,n}, unintegratedAt .crossSerial ≤ unintegratedAt .nested, with strict inequality at k = 1 (the first verb heard).

      Absolute string distance between NP_i (1-indexed) and its verb.

      In a string NP₁...NPₙ V?₁...V?ₙ, NP_i is at absolute position i. Crossed: V_i is the i-th verb → position n + i → distance = n. Nested: V_{n+1−i} is the (n+1−i)-th verb → position n + (n+1−i) → distance = 2(n−i) + 1.

      Equations
      Instances For

        Crossed and nested have identical total dependency length.

        Crossed: all distances = n → total = n × n = n². Nested: distances are 2n−1, 2n−3, ..., 3, 1 → total = Σ(2k+1) = n². The Bach et al. finding is therefore NOT about dependency distance.

        Crossed dependencies are formally harder (mildly context-sensitive) but psycholinguistically easier — formal complexity ≠ processing complexity.

        Two independent arguments against PDA parsing:

        1. Dutch is comprehensible at Level 2 despite requiring MCS power (a PDA cannot handle crossed deps at any depth)
        2. Dutch is easier than German at Level 3+ (a PDA predicts nested should be easier or equal)

        Language group. German was tested with two verb-form versions (infinitive and past participle) due to normative disagreement among informants.

        Instances For
          Equations
          • One or more equations did not get rendered due to their size.
          Instances For

            Test sentence comprehensibility ratings × 100 (Table 1). Original scale: 1 = easy, 9 = hard. Levels 1–4 indexed 0–3.

            Equations
            Instances For

              Paraphrase sentence ratings × 100 (Table 1, Levels 2–4 indexed 0–2). Paraphrases express the same propositions using right-branching structure, controlling for propositional complexity.

              Equations
              Instances For

                Comprehension accuracy × 100 by NP position at Level 3, Test (Table 4). NP1 = matrix subject (highest clause), NP3 = most deeply embedded.

                Equations
                Instances For

                  Test−Paraphrase error rate difference × 100 by NP at Level 3 (Table 5). Higher = more syntactic disruption (Test harder relative to Paraphrase).

                  Equations
                  Instances For

                    At Level 2, German/Participle does not differ from Dutch (spread = 29). German/Infinitive is slightly worse throughout (spread = 43). The paper reports a significant overall Ger/Inf disadvantage but no difference for Ger/Part vs Dutch at Level 2.

                    The syntactic complexity effect (Test − Paraphrase) grows faster for both German groups than Dutch from Level 2 to Level 3, paralleling the model's prediction that the integration cost gap grows with depth.

                    Dutch advantage is largest for NP3 (most deeply embedded clause).

                    Dutch shows ZERO Test−Para error for NP3 (errorDiffByNP .dutch 2 = 0), while both German groups show substantial error (41, 36). The paper explains: in Dutch, NP3's verb (V₃) arrives last and integrates into an already-built matrix structure. In German, NP3's verb (V₃) arrives first — the proposition is immediately parseable but floats without a matrix root, so the information decays before it can be used.