Linglib.Theories.Pragmatics.RSA.Implementations.HawkinsGweonGoodman2021

Each feature denotes a characteristic function from entities (Objects) to truth values. These are the basic building blocks for compositional utterance semantics.

Equations

HawkinsGweonGoodman2021.MontaguGrounding.shapePred targetShape o = (o.features.shape == targetShape)

Instances For

source

def HawkinsGweonGoodman2021.MontaguGrounding.colorPred (targetColor : ℕ) :

Object → Bool

Equations

HawkinsGweonGoodman2021.MontaguGrounding.colorPred targetColor o = (o.features.color == targetColor)

Instances For

source

def HawkinsGweonGoodman2021.MontaguGrounding.texturePred (targetTexture : ℕ) :

Object → Bool

Equations

HawkinsGweonGoodman2021.MontaguGrounding.texturePred targetTexture o = (o.features.texture == targetTexture)

Instances For

source

def HawkinsGweonGoodman2021.MontaguGrounding.compositionalDenotation (u : Utterance) (targetFeatures : ObjectFeatures) :

Object → Bool

Compositionally derived utterance denotation.

An utterance mentions some subset of {shape, color, texture}. The denotation is the conjunction of all mentioned feature predicates, using predMod from Semantics.Montague.Modification:

⟦blue checked square⟧ = predMod (predMod ⟦blue⟧ ⟦checked⟧) ⟦square⟧ = λx. blue(x) ∧ checked(x) ∧ square(x)
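As a rough illustration of the derivation above, the sketch below models predicate modification as pointwise conjunction of Bool-valued predicates. Every name here (`Features`, `Obj`, `predMod'`, `shapeP`, `colorP`, `textureP`, `blueCheckedSquare`) is a hypothetical stand-in. Linglib's actual `predMod` signature is not shown on this page, so its shape is an assumption.

```lean
-- Hypothetical stand-ins for ObjectFeatures and Object; not Linglib's definitions.
structure Features where
  shape : Nat
  color : Nat
  texture : Nat

structure Obj where
  features : Features

/-- Assumed shape of predicate modification: pointwise conjunction. -/
def predMod' (p q : Obj → Bool) : Obj → Bool := fun x => p x && q x

-- Feature predicates in the style of shapePred / colorPred / texturePred.
def shapeP (s : Nat) : Obj → Bool := fun o => o.features.shape == s
def colorP (c : Nat) : Obj → Bool := fun o => o.features.color == c
def textureP (t : Nat) : Obj → Bool := fun o => o.features.texture == t

/-- ⟦blue checked square⟧ built by two applications of predMod'. -/
def blueCheckedSquare (tf : Features) : Obj → Bool :=
  predMod' (predMod' (colorP tf.color) (textureP tf.texture)) (shapeP tf.shape)

-- The composed denotation unfolds to the three-way conjunction.
example (tf : Features) (o : Obj) :
    blueCheckedSquare tf o =
      (o.features.color == tf.color && o.features.texture == tf.texture
        && o.features.shape == tf.shape) := rfl
```

Under these assumptions the equality holds by unfolding definitions alone, which is the sense in which the direct conjunction and the compositional derivation coincide.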

Equations

One or more equations did not get rendered due to their size.

Instances For

source

def HawkinsGweonGoodman2021.MontaguGrounding.directDenotation (u : Utterance) (targetFeatures : ObjectFeatures) :

Object → Bool

Direct (ad-hoc) utterance denotation from Part 2

Equations

One or more equations did not get rendered due to their size.

Instances For

source

theorem HawkinsGweonGoodman2021.MontaguGrounding.grounding_compositional_equals_direct (u : Utterance) (tf : ObjectFeatures) (o : Object) :

compositionalDenotation u tf o = directDenotation u tf o

Grounding theorem: Direct denotation equals compositional derivation.

The ad-hoc semantics in utteranceApplies are exactly what we get from applying predicate modification (from Semantics.Montague.Modification) to individual feature predicates.

source

theorem HawkinsGweonGoodman2021.MontaguGrounding.utteranceApplies_grounded (u : Utterance) (tf : ObjectFeatures) (o : Object) :

utteranceApplies u tf o = compositionalDenotation u tf o

Grounding theorem: utteranceApplies = compositional denotation

source

theorem HawkinsGweonGoodman2021.MontaguGrounding.rsa_meaning_compositional (u : Utterance) (tf : ObjectFeatures) (o : Object) :

utteranceApplies u tf o = true ↔ compositionalDenotation u tf o = true

The RSA meaning function φ is grounded in compositional semantics
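The biconditional is a direct consequence of the pointwise equality stated in utteranceApplies_grounded. A generic sketch of that step, with `f` and `g` as hypothetical stand-ins for `utteranceApplies u tf` and `compositionalDenotation u tf` (this is not the proof used in the library, which is not shown here):

```lean
import Mathlib

-- If two Bool-valued functions agree pointwise, they are true at the same points.
example {α : Type} (f g : α → Bool) (h : ∀ x, f x = g x) (x : α) :
    f x = true ↔ g x = true := by
  simp [h]
```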

source

def HawkinsGweonGoodman2021.utteranceDenotation (u : Utterance) (targetFeatures : ObjectFeatures) :

Object → Bool

Equations

HawkinsGweonGoodman2021.utteranceDenotation = HawkinsGweonGoodman2021.MontaguGrounding.directDenotation

Instances For

source

theorem HawkinsGweonGoodman2021.semantics_grounded (u : Utterance) (target : ObjectFeatures) (o : Object) :

utteranceApplies u target o = utteranceDenotation u target o

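Grounding: utteranceApplies equals utteranceDenotation, which is defined as directDenotation and therefore coincides with the compositional denotation by grounding_compositional_equals_direct.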

Asymmetric Case via Unified API

The full perspective-taking case (w_S = 1) maps to RSAConfig with latent variables:

Latent = Speaker's visual access (which objects they can see)
World = Full context (visible objects + hidden object features)
speakerCredence = P(world | speaker's visual access)

The mixture model (w_S ∈ (0,1)) and resource-rational optimization (finding w*) are implementation-specific extensions that sit outside the unified API.

source

structure HawkinsGweonGoodman2021.WorldState :

Type

World state: visible objects + one hidden object behind occlusion

visible : List Object
hidden : ObjectFeatures
target : Object

Instances For

source

instance HawkinsGweonGoodman2021.instDecidableEqWorldState :

DecidableEq WorldState

Equations

HawkinsGweonGoodman2021.instDecidableEqWorldState = HawkinsGweonGoodman2021.instDecidableEqWorldState.decEq

source

def HawkinsGweonGoodman2021.instDecidableEqWorldState.decEq (x✝ x✝¹ : WorldState) :

Decidable (x✝ = x✝¹)

Equations

One or more equations did not get rendered due to their size.

Instances For

source

def HawkinsGweonGoodman2021.instBEqWorldState.beq :

WorldState → WorldState → Bool

Equations

HawkinsGweonGoodman2021.instBEqWorldState.beq { visible := a, hidden := a_1, target := a_2 } { visible := b, hidden := b_1, target := b_2 } = (a == b && (a_1 == b_1 && a_2 == b_2))
HawkinsGweonGoodman2021.instBEqWorldState.beq x✝¹ x✝ = false

Instances For

source

instance HawkinsGweonGoodman2021.instBEqWorldState :

BEq WorldState

Equations

HawkinsGweonGoodman2021.instBEqWorldState = { beq := HawkinsGweonGoodman2021.instBEqWorldState.beq }

source

instance HawkinsGweonGoodman2021.instReprWorldState :

Repr WorldState

Equations

HawkinsGweonGoodman2021.instReprWorldState = { reprPrec := HawkinsGweonGoodman2021.instReprWorldState.repr }

source

def HawkinsGweonGoodman2021.instReprWorldState.repr :

WorldState → ℕ → Std.Format

Equations

One or more equations did not get rendered due to their size.

Instances For

source

structure HawkinsGweonGoodman2021.VisualAccess :

Type

Speaker's visual access: what objects they can see

visibleObjects : List Object
targetObject : Object

Instances For

source

def HawkinsGweonGoodman2021.instDecidableEqVisualAccess.decEq (x✝ x✝¹ : VisualAccess) :

Decidable (x✝ = x✝¹)

Equations

One or more equations did not get rendered due to their size.

Instances For

source

instance HawkinsGweonGoodman2021.instDecidableEqVisualAccess :

DecidableEq VisualAccess

Equations

HawkinsGweonGoodman2021.instDecidableEqVisualAccess = HawkinsGweonGoodman2021.instDecidableEqVisualAccess.decEq

source

instance HawkinsGweonGoodman2021.instBEqVisualAccess :

BEq VisualAccess

Equations

HawkinsGweonGoodman2021.instBEqVisualAccess = { beq := HawkinsGweonGoodman2021.instBEqVisualAccess.beq }

source

def HawkinsGweonGoodman2021.instBEqVisualAccess.beq :

VisualAccess → VisualAccess → Bool

Equations

HawkinsGweonGoodman2021.instBEqVisualAccess.beq { visibleObjects := a, targetObject := a_1 } { visibleObjects := b, targetObject := b_1 } = (a == b && a_1 == b_1)
HawkinsGweonGoodman2021.instBEqVisualAccess.beq x✝¹ x✝ = false

Instances For

source

def HawkinsGweonGoodman2021.instReprVisualAccess.repr :

VisualAccess → ℕ → Std.Format

Equations

One or more equations did not get rendered due to their size.

Instances For

source

instance HawkinsGweonGoodman2021.instReprVisualAccess :

Repr VisualAccess

Equations

HawkinsGweonGoodman2021.instReprVisualAccess = { reprPrec := HawkinsGweonGoodman2021.instReprVisualAccess.repr }

source

def HawkinsGweonGoodman2021.allWorldStates (visible : List Object) (target : Object) :

List WorldState

All world states: each possible hidden object configuration

Equations

One or more equations did not get rendered due to their size.

Instances For

source

def HawkinsGweonGoodman2021.visualAccessCredence (access : VisualAccess) (world : WorldState) :

ℚ

Speaker credence: uniform over hidden objects given visual access.

P(world | access) = 1/64 if world.visible and world.target match the speaker's visual access, else 0. This encodes that the speaker knows what is visible but is uncertain about the hidden object.

Equations

HawkinsGweonGoodman2021.visualAccessCredence access world = if (world.visible == access.visibleObjects && world.target == access.targetObject) = true then 1 / 64 else 0
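The behaviour of this equation can be checked on a toy version. In the sketch below, objects and hidden features are collapsed to plain `Nat` values, and `ToyAccess`, `ToyWorld`, and `toyCredence` are hypothetical names introduced only for illustration. Only the branching logic mirrors the equation above.

```lean
import Mathlib

-- Hypothetical simplification: objects and hidden features are plain Nats.
structure ToyAccess where
  visibleObjects : List Nat
  targetObject : Nat

structure ToyWorld where
  visible : List Nat
  hidden : Nat
  target : Nat

/-- Same branching as visualAccessCredence: 1/64 on a match, 0 otherwise. -/
def toyCredence (a : ToyAccess) (w : ToyWorld) : ℚ :=
  if w.visible == a.visibleObjects && w.target == a.targetObject then 1 / 64 else 0

-- Visible objects and target match, so the value is 1/64 whatever `hidden` is.
#eval toyCredence ⟨[1, 2], 1⟩ ⟨[1, 2], 7, 1⟩

-- Visible objects differ, so the value is 0.
#eval toyCredence ⟨[1, 2], 1⟩ ⟨[3], 7, 1⟩
```

Note that the credence never inspects the `hidden` field, which is what makes it uniform over hidden configurations.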

Instances For

source

def HawkinsGweonGoodman2021.worldMeaning (u : Utterance) (world : WorldState) :

ℚ

Literal meaning: utterance applies to target in this world context

Equations

HawkinsGweonGoodman2021.worldMeaning u world = HawkinsGweonGoodman2021.literalListenerProb u world.target.features world.target ({ features := world.hidden, visible := false } :: world.visible)

Instances For

source

theorem HawkinsGweonGoodman2021.unified_worldMeaning_grounded (u : Utterance) (world : WorldState) :

worldMeaning u world = literalListenerProb u world.target.features world.target ({ features := world.hidden, visible := false } :: world.visible)

Grounding: The unified API's worldMeaning computes the same listener probability as our manual literalListenerProb for each world configuration.

source

theorem HawkinsGweonGoodman2021.unified_credence_matches_prior :

visualAccessCredence { visibleObjects := exampleVisible, targetObject := exampleTarget } { visible := exampleVisible, hidden := { shape := 0, color := 0, texture := 0 }, target := exampleTarget } = 1 / 64

Grounding: Speaker credence in unified API marginalizes uniformly over hidden objects, matching the manual uniformHiddenPrior.

Mixture Model (Implementation-Specific)

The mixture model w_S · U_asym + (1-w_S) · U_ego and resource-rational optimization for finding optimal w* are handled in Parts 6-8 above.

These are implementation-specific extensions that:

Blend two reasoning modes (asymmetric vs egocentric)
Find optimal effort allocation via cost-benefit analysis

The unified API handles the asymmetric case directly; the mixture and meta-cognitive choice of w* sit outside the core RSA loop.
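A minimal sketch of the mixture formula w_S · U_asym + (1-w_S) · U_ego, showing that its endpoints recover the two pure reasoning modes. The name `mixture` and its argument order are assumptions made for illustration. The library's own definition is not shown on this page.

```lean
import Mathlib

/-- Hypothetical helper: convex combination of asymmetric and egocentric utility. -/
def mixture (wS uAsym uEgo : ℚ) : ℚ := wS * uAsym + (1 - wS) * uEgo

-- w_S = 1 recovers the fully asymmetric (perspective-taking) utility.
example (a e : ℚ) : mixture 1 a e = a := by simp [mixture]

-- w_S = 0 recovers the purely egocentric utility.
example (a e : ℚ) : mixture 0 a e = e := by simp [mixture]
```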

source

structure HawkinsGweonGoodman2021.ListenerBeliefs :

Type

Listener's belief about speaker's weight after observing utterances

wS_expectation : ℚ
observations : ℕ

Instances For

source

def HawkinsGweonGoodman2021.instReprListenerBeliefs.repr :

ListenerBeliefs → ℕ → Std.Format

Equations

One or more equations did not get rendered due to their size.

Instances For

source

instance HawkinsGweonGoodman2021.instReprListenerBeliefs :

Repr ListenerBeliefs

Equations

HawkinsGweonGoodman2021.instReprListenerBeliefs = { reprPrec := HawkinsGweonGoodman2021.instReprListenerBeliefs.repr }

source

def HawkinsGweonGoodman2021.initialBeliefs :

ListenerBeliefs

Initial uniform belief about speaker's weight

Equations

HawkinsGweonGoodman2021.initialBeliefs = { wS_expectation := 1 / 2, observations := 0 }

Instances For

source

def HawkinsGweonGoodman2021.updateBeliefs (beliefs : ListenerBeliefs) (shortUtterance : Bool) :

ListenerBeliefs

Update beliefs after observing whether the speaker used a short utterance. If the speaker consistently uses minimal descriptions, the listener infers a low w_S.

Equations

One or more equations did not get rendered due to their size.

Instances For

source

def HawkinsGweonGoodman2021.beliefsAfterShortUtterances :

ListenerBeliefs

After seeing short utterances, listener expects lower w_S

Equations

One or more equations did not get rendered due to their size.

Instances For

source

theorem HawkinsGweonGoodman2021.listener_infers_low_wS_from_short_utterances :

beliefsAfterShortUtterances.wS_expectation < initialBeliefs.wS_expectation

source

def HawkinsGweonGoodman2021.optimalListenerWeight (speakerWS beta : ℚ) :

ℚ

Resource-rational listener response: increase own perspective-taking when speaker is under-informative.

Equations

HawkinsGweonGoodman2021.optimalListenerWeight speakerWS beta = 1 ⊓ (0 ⊔ (1 - speakerWS + beta))

Instances For

source

theorem HawkinsGweonGoodman2021.listener_compensates_for_low_speaker_effort :

optimalListenerWeight (3 / 10) (2 / 10) > optimalListenerWeight (7 / 10) (2 / 10)

Listener increases effort when speaker decreases theirs
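The inequality can be checked by hand from the equation for optimalListenerWeight, which clamps 1 - speakerWS + beta to the interval [0, 1]. The sketch below restates that clamp with `min` and `max` (which agree with ⊓ and ⊔ on ℚ). `clampWeight` is a hypothetical name used only here.

```lean
import Mathlib

/-- Hypothetical restatement of the clamp in optimalListenerWeight. -/
def clampWeight (speakerWS beta : ℚ) : ℚ :=
  min 1 (max 0 (1 - speakerWS + beta))

-- Low speaker effort: 1 - 3/10 + 2/10 = 9/10, already inside [0, 1].
#eval clampWeight (3 / 10) (2 / 10)

-- High speaker effort: 1 - 7/10 + 2/10 = 1/2, already inside [0, 1].
#eval clampWeight (7 / 10) (2 / 10)
```

Since 9/10 > 1/2, the listener's weight is higher when the speaker's weight is lower, as the theorem states for these two inputs.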

Key Predictions from Paper (Section 2.4.1)

The paper identifies four key qualitative predictions, which we verify as theorems:

speakersHedgeUnknowns: Speakers increase informativity with occlusions
divisionDependsOnPartner: Optimal effort depends on expected partner effort
listenersAdaptOverTime: Listeners update beliefs about speaker from observations
intermediateWeightsOptimal: Partial perspective-taking when cost > 0

source

theorem HawkinsGweonGoodman2021.paper_prediction_1_speakers_hedge :

asymmetricInformativity fullDescription exampleTarget exampleVisible uniformHiddenPrior > asymmetricInformativity shapeOnly exampleTarget exampleVisible uniformHiddenPrior

Paper Prediction 1: Speakers hedge against known unknowns.

From the paper: "speakers will anticipate possible confusion from the listener's perspective, and produce additional information beyond what would be necessary from their own viewpoint."

Verified by: asymmetric informativity favors more specific utterances.

source

def HawkinsGweonGoodman2021.utilityAt_wL0 :

ℚ

Paper Prediction 2: Division of labor depends on partner's expected effort.

From the paper: "The effort one participant ought to exert depends on how much effort they expect others to exert."

Verified by: at different listener weights, speaker utility differs. This shows speaker decisions depend on beliefs about listener.

Equations

One or more equations did not get rendered due to their size.

Instances For

source

def HawkinsGweonGoodman2021.utilityAt_wL1 :

ℚ

Equations

HawkinsGweonGoodman2021.utilityAt_wL1 = HawkinsGweonGoodman2021.egocentricUtility HawkinsGweonGoodman2021.shapeOnly HawkinsGweonGoodman2021.exampleTarget HawkinsGweonGoodman2021.exampleVisible

Instances For

source

theorem HawkinsGweonGoodman2021.paper_prediction_2_division_depends_on_partner :

utilityAt_wL0 ≠ utilityAt_wL1

source

theorem HawkinsGweonGoodman2021.paper_prediction_3_listeners_adapt :

beliefsAfterShortUtterances.wS_expectation < initialBeliefs.wS_expectation

Paper Prediction 3: Listeners adapt over time.

From the paper: "listeners used violations to adaptively make fewer errors over time" (z = 2.6, p < 0.01)

Verified by: beliefs about speaker weight decrease when observing short utterances.

source

def HawkinsGweonGoodman2021.rrUtility (wS beta : ℚ) :

ℚ

Resource-rational utility at a given perspective weight and cost coefficient

Equations

One or more equations did not get rendered due to their size.

Instances For

source

def HawkinsGweonGoodman2021.rrUtility_at_0 :

ℚ

Paper Prediction 4: Intermediate weights are optimal when β > 0.

From the paper (Figure 2): "Above a certain β (i.e., if perspective-taking is sufficiently effortful), an intermediate weighting of perspective-taking is boundedly optimal."

At β = 0.2: w*_S = 0.36, w*_L = 0.51

Note: At β = 0, egocentric reasoning may have higher raw informativity, since it does not average over hidden distractors. At β > 0, the cost term creates a trade-off in which intermediate weights become optimal. The key insight is that speakers engaged in perspective-taking should choose MORE INFORMATIVE utterances (like fullDescription) rather than shapeOnly; that is where the benefit comes from.

Equations

HawkinsGweonGoodman2021.rrUtility_at_0 = HawkinsGweonGoodman2021.rrUtility 0 (1 / 2)

Instances For

source

def HawkinsGweonGoodman2021.rrUtility_at_half :

ℚ

Equations

HawkinsGweonGoodman2021.rrUtility_at_half = HawkinsGweonGoodman2021.rrUtility (1 / 2) (1 / 2)

Instances For

source

def HawkinsGweonGoodman2021.rrUtility_at_1 :

ℚ

Equations

HawkinsGweonGoodman2021.rrUtility_at_1 = HawkinsGweonGoodman2021.rrUtility 1 (1 / 2)

Instances For

source

theorem HawkinsGweonGoodman2021.high_cost_penalizes_full_perspective_taking :

rrUtility_at_1 < rrUtility_at_0

source

theorem HawkinsGweonGoodman2021.paper_prediction_4_intermediate_weights_optimal :

rrUtility_at_1 < rrUtility_at_0 ∧ rrUtility (1 / 4) (1 / 2) > rrUtility_at_1

Paper Prediction 4 (continued): Intermediate weights optimal.

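When cost is moderate, the optimal weight is strictly between 0 and 1. This matches Figure 2 of the paper, where w*_S ≈ 0.36 at β = 0.2. The theorem itself checks a narrower fact at β = 1/2: full perspective-taking (w_S = 1) yields lower utility than both w_S = 0 and w_S = 1/4.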

Empirical Findings from Paper

Experiment 1 (Speaker Production, N=83 dyads)

Occlusion effect: +1.3 words, t(120.3) = 8.8, p < .001
Distractor effect: +0.6 words, t(206) = 5.7, p < .001

Experiment 2 (Listener Comprehension, N=116 dyads)

Scripted: 51% critical errors
Unscripted: 20% critical errors
χ²(1) = 43, p < .001

source

def HawkinsGweonGoodman2021.empirical_scripted_error_rate :

ℚ

Equations

HawkinsGweonGoodman2021.empirical_scripted_error_rate = 51 / 100

Instances For

source

def HawkinsGweonGoodman2021.empirical_unscripted_error_rate :

ℚ

Equations

HawkinsGweonGoodman2021.empirical_unscripted_error_rate = 20 / 100

Instances For

source

theorem HawkinsGweonGoodman2021.model_predicts_informativity_reduces_errors :

asymmetricInformativity fullDescription exampleTarget exampleVisible uniformHiddenPrior > asymmetricInformativity shapeOnly exampleTarget exampleVisible uniformHiddenPrior

Model correctly predicts that more informative speakers lead to fewer errors

source

def HawkinsGweonGoodman2021.informativity_error_correlation :

ℚ

Informativity-error correlation from paper: ρ = -0.81

Equations

HawkinsGweonGoodman2021.informativity_error_correlation = -81 / 100

Instances For

Model Summary

Key model predictions verified as theorems:

more_specific_higher_asymmetric_informativity: More specific utterances have higher informativity when considering hidden objects
asymmetry_increases_specificity_gain: The asymmetric model predicts LARGER informativity gain from specificity than egocentric
full_description_preferred_at_wS1: At full perspective-taking, more specific utterances maximize listener success
shape_only_sufficient_at_wS0: At pure egocentric, minimal description is equally informative (target unique in shape)
listener_infers_low_wS_from_short_utterances: Listeners infer speaker's low effort from under-informative utterances
listener_compensates_for_low_speaker_effort: Optimal listener effort increases when speaker effort is low
semantics_grounded: Utterance semantics grounded in compositional (Montague) denotations