Documentation

Linglib.Phenomena.Imprecision.Studies.LassiterGoodman2017

@cite{lassiter-goodman-2017} #

Adjectival vagueness in a Bayesian model of interpretation. Synthese 194:3801–3836.

Innovation #

Standard RSA models fix the literal meaning of each utterance. Threshold RSA introduces a free semantic variable — the threshold θ — that the pragmatic listener L1 jointly infers alongside the world state:

P_L1(s, θ | u) ∝ P_S1(u | s, θ) · P(s) · P(θ)

This yields three key predictions (§4.3–4.4):

  1. Information transmission: hearing "tall"/"short" shifts the height posterior above/below the prior mean despite vague semantics
  2. Pragmatic sweet spot: the threshold posterior peaks at an intermediate value, not at extremes — low θ makes "tall" uninformative (high cost, low information gain); high θ makes it implausible
  3. Context sensitivity: shifting the reference class prior (e.g., from the general population to basketball players) shifts both the height and threshold posteriors (§4.4, Figure 7)

Semantics (§4.1) #

Scalar adjectives have a free threshold variable (Eqs. 22–23):

RSAConfig Mapping #

The prior enters at both L0 (baked into meaning) and L1 (worldPrior), matching §4.2 where P_{L0}(s) = P_{L1}(s).

Verified Predictions #

  1. Hearing "tall" shifts height posterior upward (§4.4, Figure 5)
  2. Hearing "short" shifts height posterior downward (§4.4, Figure 6)
  3. Threshold posterior peaks at intermediate θ given "tall" (§4.4)
  4. Basketball prior shifts L1("tall") toward taller heights (§4.4, Figure 7)
@[reducible, inline]

Discretized height values (h0–h10).

The paper uses a continuous normal distribution over heights; we discretize to 11 values (Degree 10).

Equations
Instances For

    Threshold values (θ0–θ9).

    The threshold θ determines the cutoff: x is tall iff height(x) > θ (Eq. 21).

    The speaker can say "tall," "short," or stay silent.

    Instances For
      Equations
      • One or more equations did not get rendered due to their size.
      Instances For
        Equations
        • One or more equations did not get rendered due to their size.

        ⟦tall⟧(θ)(x) = 1 iff height(x) > θ (@cite{kennedy-2007}, positive form).

        Equations
        Instances For

          ⟦short⟧(θ)(x) = 1 iff height(x) < θ (@cite{kennedy-2007}, negative form).

          Equations
          Instances For

            Height prior: discretized normal distribution centered at h5.

            The paper assumes a continuous normal P(s) over heights. We approximate with unnormalized weights [1,2,5,10,15,20,15,10,5,2,1] peaked at h5.

            Equations
            • One or more equations did not get rendered due to their size.
            Instances For

              Threshold prior: uniform over all thresholds (Section 4.2).

              "P(V) is thus uniform for all possible combinations of values for the elements of V."

              Equations
              Instances For

                Basketball player height prior: peak shifted to h7 (§4.4, Figure 7).

                The paper uses "two input priors with different means" to demonstrate context sensitivity. We shift the same bell shape rightward by 2 steps, truncating the left tail at zero.

                Equations
                • One or more equations did not get rendered due to their size.
                Instances For

                  Utterance cost function (Eq. 23).

                  C(u) = 2/3 × length(u) in words. C("Sam is tall") = C("Sam is short") = 2/3 × 3 = 2. C(∅) = 0 (null utterance is free).

                  Equations
                  Instances For

                    Fitted cost value from Section 4.4: C = 2/3 × 3 = 2 for content words.

                    Equations
                    Instances For

                      Basketball height prior as ℝ.

                      Equations
                      Instances For
                        noncomputable def RSA.LassiterGoodman2017.beliefScore :
                        (UtteranceHeight)ThresholdHeightUtterance

                        S1 belief-based score with utterance costs (Eq. 23):

                        S1(u|s,V) ∝ exp(α · (log P_{L0}(s|u,V) − C(u)))

                        Gated on l0 u w = 0 because Lean's log 0 = 0, which would make exp(α · (0 − C)) positive for false utterances.

                        Equations
                        Instances For
                          theorem RSA.LassiterGoodman2017.beliefScore_nonneg (l0 : UtteranceHeight) (α : ) (l : Threshold) (w : Height) (u : Utterance) :
                          (∀ (u' : Utterance) (w' : Height), 0 l0 u' w')0 < α0 beliefScore l0 α l w u
                          @[reducible]
                          noncomputable def RSA.LassiterGoodman2017.mkThresholdCfg (prior : Height) (hp : ∀ (h : Height), 0 prior h) :

                          Parametric RSAConfig for threshold models.

                          Decouples the reference class prior from model structure so that defaultCfg and basketballCfg share the same architecture.

                          Both L0 and L1 use the same prior (§4.2):

                          • L0: P_{L0}(s|u,V) ∝ P(s) · ⟦u⟧_V(s)
                          • L1: P_{L1}(s,V|u) ∝ P_{S1}(u|s,V) · P(s) · P(V)

                          α = 4 (§4.4).

                          Equations
                          • One or more equations did not get rendered due to their size.
                          Instances For
                            @[reducible]

                            Basketball config: basketball player prior (peak at h7). Tests context sensitivity (§4.4, Figure 7).

                            Equations
                            Instances For

                              Hearing "tall" shifts height posterior upward #

                              The pragmatic listener L1, upon hearing "tall," infers that the speaker's height is above average. The prior peaks at h5; L1("tall") shifts probability mass toward higher heights (Figure 5, left panel).

                              Hearing "short" shifts height posterior downward #

                              Mirror image: "short" shifts probability toward lower heights (Section 4.3, Figure 6).

                              Pragmatic sweet spot for thresholds #

                              Given "tall," the listener infers a threshold that balances informativity and plausibility (Figure 5, right panel). Very low thresholds (θ ≈ 0) make "tall" uninformative (everything is tall), so the cost of speaking outweighs the information gain. Very high thresholds (θ ≈ 9) make "tall" implausible (almost nothing is tall). The posterior peaks at an intermediate θ.

                              This sweet spot requires utterance costs (Section 4.4: α=4, C(tall)=2); without costs, L1_latent monotonically prefers lower thresholds.

                              Basketball context shifts height inference #

                              When the reference class prior shifts right (basketball players: peak at h7 vs general population: peak at h5), L1 hearing "tall" assigns more probability to taller heights (Figure 7). At h10, the basketball prior is 5× the general prior (5 vs 1), which dominates the normalization penalty.