@cite{qing-franke-2015} #
@cite{frank-goodman-2012} @cite{grice-1975} @cite{dale-reiter-1995}
"Variations on a Bayesian Theme: Comparing Bayesian Models of Referential Reasoning"
Paradigm #
Three objects varying on two dimensions (color × shape) in a reference game: {green_square, green_circle, blue_circle}. Speaker produces a single feature word; listener identifies the target object.
Utterances: {square, circle, green, blue}
The Decomposition #
The paper decomposes Bayesian reference games along 3 orthogonal dimensions, yielding a family of models that includes @cite{frank-goodman-2012} as one instance:
Speaker Belief (y ∈ {U, S}): What does L0 assume? #
- Uniform (U): L0 treats all referents equally: U(t|m) = ⟦m⟧(t) / |⟦m⟧| (Eq. 1)
- Salience (S): L0 weights by perceptual salience: S(t|m) = S(t) · ⟦m⟧(t) / Σ_t' S(t') · ⟦m⟧(t')
This enters the RSAConfig via meaning: uniform uses constant 1 for true worlds;
salience uses S(w) for true worlds.
Speaker Goal (x ∈ {a, b}): What does the speaker optimize? #
- Belief-oriented (b): maximize log-probability of correct belief σ_b(m|t) ∝ exp(λ_S · (log y(t|m) - Cost(m))) (Eq. 10)
- Action-oriented (a): maximize probability of correct action σ_a(m|t) ∝ exp(λ_S · (y(t|m) - Cost(m))) (Eq. 9)
This enters via s1Score: belief-oriented uses log L0; action-oriented uses raw L0.
Listener Action: How does the listener choose? #
- Belief-oriented (b): standard Bayesian update ρ_b(t|m) ∝ v(t) · σ(m|t) (Eq. 15)
- Action-oriented (a): softmax over Bayesian posterior ρ_a(t|m) ∝ exp(α_L · ρ_b(t|m)) (Eq. 14)
The belief-oriented listener IS RSAConfig.L1. The action-oriented listener is
a composable extension defined as softmax ∘ L1.
Speaker Models (4 variants) #
| Model | Goal | Belief | S1 Score |
|---|---|---|---|
| σ_bU | belief | uniform | exp(λ · (log U(t|m) - C(m))) |
| σ_aU | action | uniform | exp(λ · (U(t|m) - C(m))) |
| σ_bS | belief | salience | exp(λ · (log S(t|m) - C(m))) |
| σ_aS | action | salience | exp(λ · (S(t|m) - C(m))) |
σ_bU is standard RSA with utterance costs.
Key Findings #
Speaker data (Table 1, N=144 per target): σ_bU and σ_aU best explain production data (Table 3). Salience in the speaker does NOT help. Cost preference exists (c > 0).
Listener data (Table 2, N=180 per utterance): Salience-prior models dominate in model comparison (Table 4). Best overall: ρ_aS(σ_aU) with informed-correlated hyperprior.
Salience reversal: Uniform and salience priors make opposite L1 predictions for ambiguous utterances. For "circle", human data matches the salience direction (blue_circle: 117/180 = 65%). For "green", human data matches the pragmatic direction (green_circle: 115/180 = 64%), NOT salience.
Qualitative Findings #
| # | Finding | Type | Config |
|---|---|---|---|
| 1 | speaker_prefers_unique_shape | S1: "square" > "green" for green_sq | σ_bU |
| 2 | speaker_prefers_unique_color | S1: "blue" > "circle" for blue_circ | σ_bU |
| 3 | cost_breaks_symmetry | S1: "circle" > "green" for green_circ | σ_bU |
| 4 | no_cost_symmetry | ¬(S1 "circle" > "green" for green_circ) | σ_bU, no cost |
| 5 | salience_reversal_circle | uniform vs salience L1 flip for "circle" | σ_bU |
| 6 | salience_reversal_green | uniform vs salience L1 flip for "green" | σ_bU |
The 6 qualitative findings from @cite{qing-franke-2015}.
- speaker_prefers_unique_shape : Finding
For green_square targets, speakers prefer the unique shape word "square" over the shared color word "green". Evidence: 135/144 trials (Table 1).
- speaker_prefers_unique_color : Finding
For blue_circle targets, speakers prefer the unique color word "blue" over the shared shape word "circle". Evidence: 119/144 trials (Table 1).
- cost_breaks_symmetry : Finding
For green_circle targets, cost breaks the symmetry between the two ambiguous words: S1 prefers "circle" (noun, cost=0) over "green" (adjective, cost=1/2). Evidence: 81/144 chose "circle" (Table 1).
- no_cost_symmetry : Finding
Without cost, the two ambiguous words for green_circle are symmetric: neither dominates the other.
- salience_reversal_circle : Finding
Salience reversal for "circle": uniform L1 predicts green_circ > blue_circ, but salience L1 predicts blue_circ > green_circ. Human data matches the salience direction: 117/180 chose blue_circle (Table 2).
- salience_reversal_green : Finding
Salience reversal for "green": uniform L1 predicts green_circ > green_sq, but salience L1 predicts green_sq > green_circ. Human data matches the pragmatic direction: 115/180 chose green_circle (Table 2). The model predictions are correct; human data here follows pragmatics, not salience.
Instances For
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
The three objects in the reference game context (Table 1).
green_square: unique shape, shared color blue_circle: unique color, shared shape green_circle: both features shared
Instances For
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Boolean semantics: ⟦utterance⟧(object).
- "square" applies to green_square only (unique shape)
- "circle" applies to blue_circle and green_circle (shared shape)
- "green" applies to green_square and green_circle (shared color)
- "blue" applies to blue_circle only (unique color)
Equations
- Phenomena.Reference.Studies.QingFranke2015.Utterance.square.appliesTo Phenomena.Reference.Studies.QingFranke2015.Object.green_square = true
- Phenomena.Reference.Studies.QingFranke2015.Utterance.circle.appliesTo Phenomena.Reference.Studies.QingFranke2015.Object.blue_circle = true
- Phenomena.Reference.Studies.QingFranke2015.Utterance.circle.appliesTo Phenomena.Reference.Studies.QingFranke2015.Object.green_circle = true
- Phenomena.Reference.Studies.QingFranke2015.Utterance.green.appliesTo Phenomena.Reference.Studies.QingFranke2015.Object.green_square = true
- Phenomena.Reference.Studies.QingFranke2015.Utterance.green.appliesTo Phenomena.Reference.Studies.QingFranke2015.Object.green_circle = true
- Phenomena.Reference.Studies.QingFranke2015.Utterance.blue.appliesTo Phenomena.Reference.Studies.QingFranke2015.Object.blue_circle = true
- x✝¹.appliesTo x✝ = false
Instances For
Adjective cost: shape words (nouns) cost 0, color words (adjectives) cost c. From @cite{qing-franke-2015} Eq. 11: Cost(m) = c if m is an adjective, 0 otherwise.
Equations
- Phenomena.Reference.Studies.QingFranke2015.adjCost c Phenomena.Reference.Studies.QingFranke2015.Utterance.square = 0
- Phenomena.Reference.Studies.QingFranke2015.adjCost c Phenomena.Reference.Studies.QingFranke2015.Utterance.circle = 0
- Phenomena.Reference.Studies.QingFranke2015.adjCost c Phenomena.Reference.Studies.QingFranke2015.Utterance.green = c
- Phenomena.Reference.Studies.QingFranke2015.adjCost c Phenomena.Reference.Studies.QingFranke2015.Utterance.blue = c
Instances For
Salience data from Table 2, salience condition (N = 240).
Blue circle (139) is most salient; green circle (30) least.
Equations
- Phenomena.Reference.Studies.QingFranke2015.saliencePrior Phenomena.Reference.Studies.QingFranke2015.Object.green_square = 71
- Phenomena.Reference.Studies.QingFranke2015.saliencePrior Phenomena.Reference.Studies.QingFranke2015.Object.blue_circle = 139
- Phenomena.Reference.Studies.QingFranke2015.saliencePrior Phenomena.Reference.Studies.QingFranke2015.Object.green_circle = 30
Instances For
Belief-oriented S1 score [Eq. 10]: σ_b(m|t) ∝ exp(λ · (log y(t|m) - Cost(m)))
The speaker maximizes log-probability of correct belief at L0. Gated by
if-then-else because log 0 = 0 in Lean (unlike WebPPL where log 0 = -∞
naturally zeros out false utterances).
Equations
Instances For
Action-oriented S1 score [Eq. 9]: σ_a(m|t) ∝ exp(λ · (y(t|m) - Cost(m)))
The speaker maximizes the raw probability that the listener picks the correct referent, rather than log-probability. Unlike beliefGoalScore, this gives nonzero score even for false utterances (exp(-λ·C) > 0 when y=0). The paper notes (Footnote 13) that model comparison restricts to truthful predictions; here the model comparison theorems (§13) only compare utterances that are true of the target object, so no gating is needed.
Equations
- Phenomena.Reference.Studies.QingFranke2015.actionGoalScore cost l0 α x✝ w u = Real.exp (α * (l0 u w - cost u))
Instances For
Uniform prior: all objects equally weighted.
Equations
Instances For
Parametric Q&F RSAConfig constructor.
Decouples the three orthogonal dimensions:
speakerPrior: belief dimension — baked intomeaningas L0's prior weight (uniform = 1 for all; salience = S(w))s1+s1_nn: goal dimension — beliefGoalScore or actionGoalScorelistenerPrior: listener's world prior at L1 (independent of speaker's belief)
The speaker's belief about L0 and the listener's prior vary independently — the speaker may assume uniform L0 while the listener uses salience, or vice versa. This decoupling is a key feature of the RSAConfig API.
Equations
- One or more equations did not get rendered due to their size.
Instances For
σ_bU: Belief-oriented speaker, uniform L0. This IS standard RSA with utterance costs. S1 score = exp(λ · (log U(t|m) - Cost(m))).
Equations
- One or more equations did not get rendered due to their size.
Instances For
σ_aU: Action-oriented speaker, uniform L0. S1 score = exp(λ · (U(t|m) - Cost(m))).
Equations
- One or more equations did not get rendered due to their size.
Instances For
σ_bS: Belief-oriented speaker, salience-weighted L0. The speaker assumes L0 weights worlds by perceptual salience: L0(t|m) ∝ S(t) · ⟦m⟧(t). S1 score = exp(λ · (log S(t|m) - Cost(m))).
Equations
- One or more equations did not get rendered due to their size.
Instances For
σ_aS: Action-oriented speaker, salience-weighted L0. Same salience-weighted L0 as σ_bS, but S1 score = exp(λ · (S(t|m) - Cost(m))).
Equations
- One or more equations did not get rendered due to their size.
Instances For
σ_bU with zero cost, uniform listener prior. Used for finding 4 (no cost → no symmetry breaking).
Equations
- One or more equations did not get rendered due to their size.
Instances For
σ_bU with adjective cost = 1/2, uniform listener prior. Standard RSA with cost asymmetry. Used for speaker predictions (findings 1–3) and the uniform half of salience reversal (findings 5–6).
Equations
- One or more equations did not get rendered due to their size.
Instances For
σ_bU with adjective cost = 1/2, salience-weighted listener prior. Shares the same S1 policy as costCfg (same speaker model σ_bU) but produces different L1 posteriors because L1's Bayesian inversion uses a salience-weighted world prior [Eq. 15 with v = S]. Used for findings 5–6 (salience half).
Equations
- One or more equations did not get rendered due to their size.
Instances For
"square" uniquely identifies green_square.
"blue" uniquely identifies blue_circle.
"circle" is ambiguous between blue_circle and green_circle.
"green" is ambiguous between green_square and green_circle.
Finding 1: For green_square, S1 prefers "square" (unique, cost=0) over "green" (ambiguous, cost=1/2). Both informativity and cost favor "square". Evidence: 135/144 speakers chose "square" (Table 1).
Finding 2: For blue_circle, S1 prefers "blue" (unique, cost=1/2) over "circle" (ambiguous, cost=0). Informativity dominates cost. Evidence: 119/144 speakers chose "blue" (Table 1).
Finding 3: For green_circle, cost breaks the tie between the two ambiguous words: S1 prefers "circle" (cost=0) over "green" (cost=1/2). Both are equally informative (each applies to 2 objects), so cost is the tiebreaker. Evidence: 81/144 chose "circle", 63/144 chose "green" (Table 1; not statistically significant: χ²=2.25, p=0.13).
Finding 4: Without cost, the symmetry is unbroken — neither ambiguous word dominates for green_circle. Both "circle" and "green" apply to exactly 2 objects, so L0 assigns equal probability, and S1 assigns equal weight.
The paper's central contribution: perceptual salience reverses L1 predictions.
Under uniform prior (costCfg), pragmatic reasoning dominates: the listener infers that ambiguous words signal the object without a unique alternative. Under salience prior (salienceCfg), salience overrides pragmatics: salient objects dominate even though pragmatics would favor the other.
The listener prior v ∈ {U, S} enters at L1 via worldPrior, independent of the
speaker's belief y ∈ {U, S} which enters at L0 via meaning. Both costCfg and
salienceCfg use σ_bU (speaker belief = uniform), but differ in the listener's
prior — exactly the comparison the paper runs (Table 4 rows for ρ_bU vs ρ_bS).
Uniform L1 for "circle": green_circle > blue_circle. Pragmatic reasoning: a speaker wanting blue_circle would say "blue" (unique), so saying "circle" signals green_circle.
Salience L1 for "circle": blue_circle > green_circle. Salience (139 vs 30) overrides pragmatic narrowing. Matches human data (Table 2: 117/180 = 65% chose blue_circle).
Finding 5: Salience reversal for "circle". Uniform and salience priors make opposite L1 predictions, and human data matches the salience direction.
Uniform L1 for "green": green_circle > green_square. Pragmatic reasoning: a speaker wanting green_square would say "square" (unique), so saying "green" signals green_circle.
Salience L1 for "green": green_square > green_circle. Salience (71 vs 30) overrides pragmatic narrowing in the model. However, human data goes in the opposite (pragmatic) direction: Table 2 shows 115/180 = 64% chose green_circle.
Finding 6: Salience reversal for "green". Uniform and salience priors make opposite L1 predictions. Human data matches the pragmatic (uniform) direction: 115/180 chose green_circle (Table 2).
The paper compares 4 speaker models along the speaker-goal × speaker-belief dimensions (Table 3). Only σ_bU correctly predicts all three speaker preferences. Each alternative fails on at least one observation:
| Model | green_sq (sq > gr) | blue_circ (bl > ci) | green_circ (ci > gr) | Score |
|---|---|---|---|---|
| σ_bU | ✓ | ✓ | ✓ | 3/3 |
| σ_aU | ✓ | = (tie) | ✓ | 2/3 |
| σ_bS | ✓ | ✗ (ci > bl) | ✗ (gr > ci) | 1/3 |
| σ_aS | ✓ | ✗ (ci > bl) | ✓ | 2/3 |
The discriminating observation is blue_circle: only σ_bU predicts "blue" > "circle". σ_aU ties (equal S1 scores), while σ_bS and σ_aS reverse the preference.
Salience in the speaker is harmful: it inflates L0's posterior for blue_circle given "circle" (since blue_circle has the highest salience, 139 vs 30), which raises S1's score for "circle" enough to match or exceed "blue".
σ_aU with adjective cost = 1/2. Action-oriented speaker, uniform L0. S1 score = exp(λ · (U(t|m) - Cost(m))). No exp/log cancellation.
Equations
- One or more equations did not get rendered due to their size.
Instances For
σ_bS with adjective cost = 1/2. Belief-oriented speaker, salience-weighted L0. L0 weights worlds by perceptual salience; S1 score = exp(λ · (log S(t|m) - Cost(m))).
Equations
- One or more equations did not get rendered due to their size.
Instances For
σ_aS with adjective cost = 1/2. Action-oriented speaker, salience-weighted L0.
Equations
- One or more equations did not get rendered due to their size.
Instances For
σ_aU agrees: "square" > "green" for green_square.
σ_aU agrees: "circle" > "green" for green_circle (cost breaks symmetry).
σ_aU fails: "blue" and "circle" are tied for blue_circle. This is the key prediction that distinguishes σ_aU from σ_bU. Under belief-oriented scoring (σ_bU), the log transform amplifies the informativity advantage of "blue" (L0 = 1) over "circle" (L0 = 1/2); under action-oriented scoring (σ_aU), the raw difference is exactly offset by cost.
σ_bS agrees: "square" > "green" for green_square. The unique shape word wins regardless of speaker belief, since "square" applies to only one object while "green" is ambiguous.
σ_bS reverses blue_circle: predicts "circle" > "blue". With salience-weighted L0, blue_circle has the highest salience (139), so L0(blue_circ|"circle") = 139/169 ≈ 0.82, making "circle" quite informative. Combined with zero cost for "circle" vs cost 1/2 for "blue", the pragmatic advantage of "blue" is overcome.
σ_bS reverses green_circle: predicts "green" > "circle". With salience weights, L0(green_circ|"green") = 30/101 ≈ 0.30 and L0(green_circ|"circle") = 30/169 ≈ 0.18. "green" has higher L0 posterior for green_circ, and the log transform in belief-oriented scoring amplifies this advantage enough to overcome its cost disadvantage.
σ_aS agrees: "square" > "green" for green_square.
σ_aS reverses blue_circle: predicts "circle" > "blue". Same mechanism as σ_bS: salience inflates L0(blue_circ|"circle") = 139/169. Under action-oriented scoring, this raw probability advantage (plus zero cost) overcomes "blue"'s informativity.
σ_aS agrees: "circle" > "green" for green_circle. Unlike σ_bS, the action-oriented scoring doesn't amplify the L0 advantage of "green" via log, so cost wins for green_circle.
σ_bU uniquely predicts "blue" > "circle" for blue_circle.
The blue_circle observation is the decisive test: 119/144 speakers chose "blue" over "circle" (Table 1). σ_bU is the only model that predicts this:
- σ_bU: blue > circle (correct) — log transform amplifies informativity
- σ_aU: blue = circle (tie) — raw probability and cost exactly cancel
- σ_bS: circle > blue (reversal) — salience makes "circle" informative
- σ_aS: circle > blue (reversal) — same salience effect
This is Q&F's main speaker-side finding: standard RSA (belief-oriented, uniform L0) best explains production data.
Map each empirical finding to the RSA model prediction that accounts for it.
Equations
- One or more equations did not get rendered due to their size.
Instances For
The RSA model accounts for all 6 qualitative findings from @cite{qing-franke-2015}.
The speaker ranking under both belief-oriented and action-oriented scoring
is completely independent of λ (the rationality parameter). Since exp
is strictly monotone and multiplication by λ > 0 preserves strict order:
exp(λ · a) > exp(λ · b) ⟺ a > b (for λ > 0)
Consequence: The qualitative predictions (findings 1–4) hold for ALL λ > 0.
The paper's strong rejection of λ = 1 (§5) affects only the magnitude of
preferences (softmax temperature), not their direction. The existing
rsa_predict proofs at α = 1 establish log(L0) − cost orderings that hold
at every positive α.
Cost thresholds (the exact c ranges where each finding holds):
- Finding 1 (sq > gr for green_sq): all c ≥ 0 (log 1 − 0 > log ½ − c)
- Finding 2 (bl > ci for blue_circ): c < ln 2 ≈ 0.693 (see §16)
- Finding 3 (ci > gr for green_circ): all c > 0 (log ½ − 0 > log ½ − c)
- Finding 4 (ci = gr, no cost): equality, independent of everything
Belief-oriented score ranking reduces to log L0 minus cost, independent of λ.
For two truthful utterances (l0 ≠ 0 for both), the beliefGoalScore comparison
is equivalent to comparing log L0 − cost, which has no λ dependence.
Combined with RationalAction.policy_gt_of_score_gt, this means all
qualitative belief-oriented S1 predictions are λ-independent.
Action-oriented score ranking reduces to L0 minus cost, independent of λ.
Same λ-independence as belief-oriented, but comparing raw L0 rather than log L0. The difference between σ_a and σ_b is not λ-sensitivity (both are λ-independent) but how they transform L0: log compresses the informativity scale, amplifying small differences in L0 posterior.
The σ_aU tie at c = 1/2 (§13a) is the exact boundary: σ_aU predicts "blue" > "circle" for blue_circle iff c < 1/2. Action-oriented scoring uses raw L0: 1 − c > 1/2 − 0 ⟺ c < 1/2.
Belief-oriented scoring (σ_bU) uses log L0, giving a wider threshold of c < ln 2 ≈ 0.693: log 1 − c > log(1/2) − 0 ⟺ c < ln 2.
Figure 4 shows that the posterior over c for σ_bU peaks around c ≈ 0.15, well below the ln 2 ≈ 0.693 threshold. So the MAP cost estimate is consistent with σ_bU correctly predicting blue > circle.
σ_aU cost threshold for blue_circle: "blue" > "circle" ⟺ c < 1/2.
Given L0(blue_circ|"blue") = 1 and L0(blue_circ|"circle") = 1/2,
the action-oriented comparison reduces to 1 − c > 1/2. The existing tie
σ_aU_blue_circ_tie is the corollary at c = 1/2.
σ_bU cost threshold for blue_circle: "blue" > "circle" ⟺ c < ln 2.
Belief-oriented scoring uses log, so the threshold is wider than σ_aU's (ln 2 ≈ 0.693 > 0.5). This explains why σ_bU accommodates cost better than σ_aU: the log transform amplifies informativity differences, leaving more room for cost before the ranking flips.
Production and comprehension counts from the experiment (N = 1032 total: 432 speakers, 600 listeners). These connect model predictions to empirical observations.
Speaker production data from Table 1 (N = 144 per target object).
- green_square: 135 "square", 9 "green" (93.8% unique shape)
- blue_circle: 119 "blue", 25 "circle" (82.6% unique color)
- green_circle: 81 "circle", 63 "green" (56.3% preferred noun; n.s.)
Equations
- Phenomena.Reference.Studies.QingFranke2015.speakerData Phenomena.Reference.Studies.QingFranke2015.Object.green_square Phenomena.Reference.Studies.QingFranke2015.Utterance.square = 135
- Phenomena.Reference.Studies.QingFranke2015.speakerData Phenomena.Reference.Studies.QingFranke2015.Object.green_square Phenomena.Reference.Studies.QingFranke2015.Utterance.green = 9
- Phenomena.Reference.Studies.QingFranke2015.speakerData Phenomena.Reference.Studies.QingFranke2015.Object.blue_circle Phenomena.Reference.Studies.QingFranke2015.Utterance.blue = 119
- Phenomena.Reference.Studies.QingFranke2015.speakerData Phenomena.Reference.Studies.QingFranke2015.Object.blue_circle Phenomena.Reference.Studies.QingFranke2015.Utterance.circle = 25
- Phenomena.Reference.Studies.QingFranke2015.speakerData Phenomena.Reference.Studies.QingFranke2015.Object.green_circle Phenomena.Reference.Studies.QingFranke2015.Utterance.circle = 81
- Phenomena.Reference.Studies.QingFranke2015.speakerData Phenomena.Reference.Studies.QingFranke2015.Object.green_circle Phenomena.Reference.Studies.QingFranke2015.Utterance.green = 63
- Phenomena.Reference.Studies.QingFranke2015.speakerData x✝¹ x✝ = 0
Instances For
Listener comprehension data from Table 2 (N = 180 per ambiguous utterance).
- "circle": 117 blue_circle, 62 green_circle, 1 green_square (65% salience)
- "green": 65 green_square, 115 green_circle (64% pragmatic direction)
Equations
- Phenomena.Reference.Studies.QingFranke2015.listenerData Phenomena.Reference.Studies.QingFranke2015.Utterance.circle Phenomena.Reference.Studies.QingFranke2015.Object.blue_circle = 117
- Phenomena.Reference.Studies.QingFranke2015.listenerData Phenomena.Reference.Studies.QingFranke2015.Utterance.circle Phenomena.Reference.Studies.QingFranke2015.Object.green_circle = 62
- Phenomena.Reference.Studies.QingFranke2015.listenerData Phenomena.Reference.Studies.QingFranke2015.Utterance.circle Phenomena.Reference.Studies.QingFranke2015.Object.green_square = 1
- Phenomena.Reference.Studies.QingFranke2015.listenerData Phenomena.Reference.Studies.QingFranke2015.Utterance.green Phenomena.Reference.Studies.QingFranke2015.Object.green_square = 65
- Phenomena.Reference.Studies.QingFranke2015.listenerData Phenomena.Reference.Studies.QingFranke2015.Utterance.green Phenomena.Reference.Studies.QingFranke2015.Object.green_circle = 115
- Phenomena.Reference.Studies.QingFranke2015.listenerData x✝¹ x✝ = 0
Instances For
Speaker data sums to N = 144 per target.
Listener data sums to N = 180 per ambiguous utterance.
Speaker majority choice agrees with σ_bU S1 ranking (findings 1–3).
For "circle", listener majority matches the salience L1 direction (finding 5): blue_circle (117) > green_circle (62).
For "green", listener majority matches the pragmatic/uniform L1 direction, NOT the salience direction: green_circle (115) > green_square (65). The paper notes this explicitly (p. 212).
σ_bU with zero cost IS @cite{frank-goodman-2012}'s model. FG2012 defines:
s1Score l0 α _ w u := if l0 u w = 0 then 0 else exp(α * log(l0 u w))
which equals beliefGoalScore (fun _ => 0) since x − 0 = x (sub_zero).
FG2012 uses multiple reference game contexts across 7 conditions; Q&F's experiment (§4) focuses on one configuration: {green_square, green_circle, blue_circle}. The scoring rule, compositional pattern, and RSAConfig structure are identical — Q&F's contribution is decomposing along the cost, goal, and salience dimensions.
Zero-cost belief-oriented scoring equals FG2012's scoring rule.
beliefGoalScore (fun _ => 0) reduces to if l0 = 0 then 0 else exp(α * log l0)
by sub_zero. This is the formal statement that σ_bU generalizes FG2012
by adding utterance cost — setting cost to zero recovers the original model.
Summary: What this file tests about the RSAConfig API #
Pluggable s1Score:
beliefGoalScoreandactionGoalScoreare different scoring functions, plugged into the same RSAConfig structure vias1Score. The API is agnostic to what the speaker optimizes.Meaning encodes speaker belief: The speaker's assumption about L0 (uniform vs salience-weighted) is captured by the
meaningfunction. No separate "speaker belief" field is needed.Cost inside s1Score: Utterance cost is part of the speaker's score function, not a separate RSAConfig field. This is clean because cost IS part of what the speaker optimizes.
Independent listener prior:
worldPrior(the listener's prior at L1) is independent ofmeaning(which determines L0). This allows the speaker's belief about L0 and the listener's prior to vary independently, exactly as Q&F require.Composable extensions: The action-oriented listener is defined as
softmax ∘ L1, extending the API without modifying RSAConfig. The softmax properties (positivity, sum-to-one, monotonicity) transfer directly from Core.RationalAction.Model comparison: All 4 speaker models instantiate the same RSAConfig API with different
s1Scoreandmeaningchoices.rsa_predicthandles all comparisons including score-equality ties (σ_aU, zeroCost). The capstone theorem shows σ_bU uniquely predicts the blue_circle observation.λ-independence (§15):
beliefGoal_gt_iffandactionGoal_gt_iffprove that speaker rankings depend only onlog L0 − cost(orL0 − cost), not on the rationality parameter λ. The paper's rejection of λ = 1 affects only quantitative fit, not qualitative predictions.Cost thresholds (§16):
σ_aU_blue_circ_threshold(c < 1/2) andσ_bU_blue_circ_threshold(c < ln 2) give the exact cost boundaries for each model's blue_circle prediction. The log transform in σ_bU widens the viable cost range.Raw data (§17):
speakerDataandlistenerDataformalize Tables 1–2.speakerData_matches_modelverifies that speaker majority choices match σ_bU predictions.listenerData_circle_matches_salienceconfirms "circle" follows salience;listenerData_green_matches_pragmaticconfirms "green" follows pragmatics — a richer pattern than uniform salience dominance.FG2012 bridge (§18):
zeroCost_beliefGoal_eqproves that belief-oriented scoring at zero cost recovers @cite{frank-goodman-2012}'s scoring rule.
The cost dimension in @cite{qing-franke-2015}'s S1 score decomposition IS @cite{grice-1975}'s Q2 sub-maxim (brevity):
σ_b(m|t) ∝ exp(λ · (log L0(t|m) − Cost(m)))
╰── Q1 ──╯ ╰── Q2 ──╯
This connects three frameworks:
- @cite{grice-1975}: Q1 (be informative) and Q2 (be brief) are independent
- @cite{dale-reiter-1995}: No-Brevity = Q2 not enforced (strength = 0)
- @cite{qing-franke-2015}: Cost = 0 ↔ no Q2 pressure ↔ No Brevity
The zero-cost ↔ cost comparison (no_cost_symmetry vs cost_breaks_symmetry)
directly demonstrates Q2's role: it is the tiebreaker when Q1 (informativity)
is equal across alternatives. For green_circle, both "circle" and "green"
have the same L0 informativity (each applies to 2 objects), so Q1 cannot
distinguish them. Only Q2 (cost) breaks the tie.
Q&F's cost dimension IS Grice's Q2 sub-maxim. Without cost (No Brevity regime), ambiguous words with equal informativity are symmetric — Q1 alone cannot break the tie. With cost (Q2 active), the cheaper word wins. This maps onto @cite{dale-reiter-1995}'s No-Brevity interpretation (strength = 0), the weakest Q2.