Documentation

Linglib.Phenomena.Reference.Studies.GilesEtAl2026

@cite{giles-etal-2026} #

Search Efficiency Drives Reference Production Across Modalities, But Colour Is Special. Open Mind: Discoveries in Cognitive Science 10, 236–260.

Core Argument #

Overinformativeness is communicatively efficient: speakers use redundant modifiers to help listeners search for the referent. The rate of overinformativeness tracks the search efficiency gained by adding the modifier — the interaction of discriminability (ease of perceptual search along the redundant attribute) and sufficiency (difficulty of search using the sufficient attribute alone).

Experiments #

Key Findings #

#FindingEvidenceβ95% CI
1Search efficiency: S-Low/R-High > S-High/R-LowExp 1−1.09[−1.35, −0.83]
2Search efficiency: S-Low/R-High > BaselineExp 1−0.94[−1.20, −0.68]
3Colour > material (cross-modal)Exp 1−1.43[−1.65, −1.20]
4Colour HF > orientationExp 2−0.97[−1.20, −0.75]
5Colour LF ≈ Colour HF (frequency doesn't explain)Exp 2−0.20[−0.44, 0.03]

Theoretical Implications #

The noise discrimination model (RSA.Noise) correctly predicts Finding 1–2 (discriminability drives overinformativeness) and Finding 3 (colour > material from gap ordering). But it incorrectly predicts that colour and orientation should be overinformed equally (since both have discrimination ≈ 0.98). Finding 4 falsifies this: colour has a residual privilege beyond discriminability.

Verified Data #

Regression coefficients verified against Tables 1 and 2 of the paper.

A regression coefficient from a Bayesian mixed-effects logistic model with 95% credible interval.

Instances For
    Equations
    • One or more equations did not get rendered due to their size.
    Instances For

      Whether the 95% CI excludes zero (evidence of a reliable effect).

      Equations
      Instances For

        Intercept (Table 1).

        Equations
        Instances For

          Material-Redundant vs Colour-Redundant (Table 1). Negative β: colour is overinformed MORE than material even with equalized discriminability via psychophysical staircases.

          Equations
          Instances For

            Baseline vs S-Low/R-High (Table 1). Negative β: overinformativeness is LOWER at baseline (both attributes high-discriminability) than when the sufficient attribute is hard.

            Equations
            Instances For

              S-High/R-Low vs S-Low/R-High (Table 1). Negative β: overinformativeness is LOWER when the sufficient attribute is already search-efficient.

              Equations
              Instances For

                Intercept (Table 2).

                Equations
                Instances For

                  Low-frequency colour terms (Table 2, sum contrasts). Small negative β relative to grand mean; CI includes zero → frequency does NOT explain colour's disproportionate use.

                  Equations
                  Instances For

                    Orientation-redundant (Table 2, sum contrasts). Large negative β relative to grand mean; CI excludes zero → colour is overinformed SIGNIFICANTLY MORE than orientation.

                    Equations
                    Instances For

                      The search efficiency ordering: S-Low/R-High > S-High/R-Low. When the sufficient attribute is hard to search and the redundant attribute facilitates search, speakers overinform more.

                      The search efficiency ordering: S-Low/R-High > Baseline. Speakers don't just mention all high-discriminability attributes; they selectively overinform to help difficult searches.

                      Colour is overinformed more than material (Exp 1), extending @cite{kursat-degen-2021}'s finding cross-modally.

                      @cite{kursat-degen-2021} showed colour > material with visual stimuli. This study confirms the asymmetry persists when material is presented in the auditory modality via impact sounds, with discriminability equalized via psychophysical staircases.

                      Converging evidence with @cite{kursat-degen-2021}: both studies find colour used redundantly more than material. The present study adds cross-modal generalisation.

                      The noise model's discrimination ordering (colour > size > material) is consistent with the search efficiency results of Exp 1: higher discrimination → more overinformativeness when redundant.

                      The noise model predicts colour = orientation (both discrimination 0.98), but the data shows colour >> orientation (β = −0.97). This is the central dissociation: search efficiency (noise discrimination) is necessary but not sufficient to explain overinformativeness patterns. Colour has a residual privilege.

                      Possible explanations (General Discussion):

                      1. Colour categories are optimised for perceptual communication (@cite{regier-etal-2007}), making colour inherently more search-efficient than orientation across naturalistic contexts.
                      2. Speakers learn from experience that colour is a reliable referential strategy and deploy it even when its search efficiency advantage is controlled away.

                      Word frequency does not explain the colour privilege: low-frequency colour terms (teal, jade) produce overinformativeness rates indistinguishable from high-frequency terms (green, blue).

                      The cs-RSA model (@cite{degen-etal-2020}) explains redundant modification via noisy perception. This study provides perceptual grounding for the noise parameters: discriminability measured via psychophysical staircases maps to the noise gap.

                      The cs-RSA prediction — that higher noise gap produces more overinformativeness — is confirmed for the discriminability × sufficiency interaction (Exp 1 display type effects).

                      Both this study and @cite{engelhardt-etal-2006} demonstrate that speakers routinely over-describe. The search efficiency view reinterprets these violations of Gricean Q2 as communicatively efficient: the "extra" information facilitates listener search.

                      theorem Phenomena.Reference.Studies.GilesEtAl2026.overmodification_iff_positive_gap (sM sMM cM cMM : ) (hsM : 0 < sM) (hsMM : 0 < sMM) (hcM : 0 < cM) (hcMM : 0 < cMM) :
                      sM * cM / (sMM * cM + sMM * cMM + sM * cM) > sM / (sM + 2 * sMM) cM > cMM

                      Algebraic biconditional: In a cs-RSA scene with one target and two distractors (cf. @cite{degen-etal-2020} §2), L0 prefers the overmodified form iff the redundant modifier's noise gap is positive.

                      Scene structure: size is sufficient (only target is small), color is redundant (one distractor shares the target's color).

                      • L0(target | sufficient) = sM / (sM + 2·sMM)
                      • L0(target | sufficient+redundant) = sM·cM / (sMM·cM + sMM·cMM + sM·cM)

                      Proved algebraically over free variables — a general property of the Product of Experts architecture, not a finite data check.

                      theorem Phenomena.Reference.Studies.GilesEtAl2026.dprime_iff_overmodification (sM sMM cM cMM : ) (hsM : 0 < sM) (hsMM : 0 < sMM) (hcM : 0 < cM) (hcMM : 0 < cMM) (hcM_lt1 : cM < 1) (hcMM_lt1 : cMM < 1) :
                      0 < Core.dPrimeFromRates cM cMM sM * cM / (sMM * cM + sMM * cMM + sM * cM) > sM / (sM + 2 * sMM)

                      d' predicts overmodification: The SDT sensitivity d' for the redundant feature is positive iff the cs-RSA L0 prefers the overmodified form. This unifies three levels of linglib:

                      • Psychophysics (Core.SDTModel): d' measures perceptual sensitivity
                      • Noise channel (RSA.Noise): match/mismatch are hit/false-alarm rates
                      • Pragmatics (cs-RSA PoE): L0 posterior determines speaker choice

                      The match/mismatch noise parameters ARE the observer's hit rate and false alarm rate for feature verification. Positive d' means the observer can discriminate match from mismatch above chance — exactly when the redundant modifier carries useful information through the noise channel.

                      @cite{giles-etal-2026} provide the perceptual grounding: discriminability measured via psychophysical staircases (d') maps to the noise parameters that drive overinformativeness in reference production.

                      Instantiation: for the standard @cite{degen-etal-2020} noise parameters (color match = 0.99, mismatch = 0.01), the redundant color modifier's d' is positive, so L0 prefers "small blue" over "small."

                      This connects the concrete cs-RSA demonstration to the general dprime_iff_overmodification theorem.

                      theorem Phenomena.Reference.Studies.GilesEtAl2026.overmod_monotone_in_likelihood_ratio (sM sMM cM₁ cMM₁ cM₂ cMM₂ : ) (hsM : 0 < sM) (hsMM : 0 < sMM) (hcM₁ : 0 < cM₁) (hcMM₁ : 0 < cMM₁) (hcM₂ : 0 < cM₂) (hcMM₂ : 0 < cMM₂) :
                      sM * cM₁ / (sMM * cM₁ + sMM * cMM₁ + sM * cM₁) > sM * cM₂ / (sMM * cM₂ + sMM * cMM₂ + sM * cM₂) cM₁ * cMM₂ > cM₂ * cMM₁

                      Monotonicity in likelihood ratio: For two redundant features with noise channels (cM₁, cMM₁) and (cM₂, cMM₂), the first produces a higher L0 posterior from overmodification iff its likelihood ratio cM₁/cMM₁ exceeds cM₂/cMM₂.

                      The likelihood ratio — not the noise gap (cM − cMM) or d' alone — is the quantity that determines the strength of overmodification. Two features with equal d' but different likelihood ratios produce different overmodification rates.

                      theorem Phenomena.Reference.Studies.GilesEtAl2026.one_param_ratio_is_param_ordering (x₁ x₂ : ) (_hx₁ : 0 < x₁) (_hx₁' : x₁ < 1) (_hx₂ : 0 < x₂) (_hx₂' : x₂ < 1) :
                      x₁ * (1 - x₂) > x₂ * (1 - x₁) x₁ > x₂

                      For the one-parameter noise family (x, 1−x) used in BDA fitting, the likelihood ratio ordering reduces to the parameter ordering: x₁ > x₂ iff x₁·(1−x₂) > x₂·(1−x₁). Combined with probit monotonicity, this gives: higher d' → stronger overmodification.

                      This is the one-parameter specialization where d', noise gap, and likelihood ratio are all monotonically related — the only regime where "higher d'" unambiguously predicts "more overmodification."

                      The d'/likelihood-ratio model's monotonicity is correct within a feature (higher d' → more overmod, §14) but incomplete across features: two features with equal d' can have different overmod rates.

                      @cite{giles-etal-2026} propose two accounts for the residual colour privilege:

                      1. Category optimality: Colour naming systems are near-optimal partitions of perceptual space (@cite{regier-etal-2007}, @cite{zaslavsky-etal-2019}). Colour categories maximise discriminability across natural contexts, making colour inherently more search-efficient than orientation even when within-trial d' is equalized.

                      2. Learned strategy: Speakers learn from experience that colour is a reliable referential cue and deploy it as a default strategy even when its perceptual advantage is controlled away.

                      Both accounts locate the symmetry-breaking outside the single-trial noise channel — in the ecological statistics of feature reliability across contexts.

                      The cs-RSA Product of Experts architecture — φ(u, o) = ∏ features, noiseChannel_f(u, o) — is the UNIQUE factoring consistent with @cite{luce-1959}'s dimension independence axiom, as proven by Core.multidimensional_decomposition in Psychophysics.lean.

                      The argument chain:

                      1. Dimension independence (@cite{luce-1959} §2.C): the ratio v(a[d↦s])/v(a) depends only on dimension d and the old/new values
                      2. Decomposition theorem: under independence, v(a) = C · ∏ scale_d(a_d)
                      3. cs-RSA instantiation: scale_color(match) = colorMatch = 0.99, scale_color(mismatch) = colorMismatch = 0.01, etc.
                      4. @cite{giles-etal-2026} ground the scale parameters in d' measured via psychophysical staircases

                      @cite{degen-etal-2020} already proves the factoring holds for the concrete φ (φ_product_of_experts). This bridge connects to the ABSTRACT infrastructure that shows the factoring is forced by independence — not an ad hoc modelling choice.

                      Equations
                      • One or more equations did not get rendered due to their size.
                      Instances For

                        The cs-RSA PoE φ function, expressed as a multidim_luce model. The score for each world is ∏ d, scale_d(stimulus(w)(d)), which equals sizeParam × colorParam — exactly the Product of Experts.

                        Equations
                        Instances For