Iterated Best Response: Core Definitions #
Game-theoretic pragmatics following @cite{franke-2011} §6-8.
Defines interpretation games (InterpGame), speaker/hearer strategies,
literal listener (L₀), best response operators, IBR iteration (ibrN),
fixed points, and expected gain (expectedGain).
Interpretation game (Franke §6): states are equivalence classes over alternative truth patterns.
- State : Type
Type of states (equivalence classes of worlds)
- Message : Type
Type of messages (alternative utterances)
Semantic meaning: is message m true at state s?
Prior probability over states
Fintype instances
- stateDecEq : DecidableEq self.State
- messageDecEq : DecidableEq self.Message
Instances For
States where message m is true
Instances For
Messages true at state s
Instances For
Informativity of a message (reciprocal of true states, as ratio)
Equations
- G.informativity m = if (G.trueStates m).card = 0 then 0 else 1 / ↑(G.trueStates m).card
Instances For
A hearer strategy: P(state | message)
Instances For
A speaker strategy: P(message | state)
Instances For
Uniform distribution over states where m is true
Equations
- One or more equations did not get rendered due to their size.
Instances For
Support of hearer's response to message m
Instances For
Support of speaker's choice at state s
Instances For
Max utility among true messages at state s (0 if no true messages).
Equations
- RSA.IBR.SpeakerStrategy.maxUtility G H s = Finset.fold max 0 (fun (m' : G.Message) => H.respond m' s) (G.trueMessages s)
Instances For
Optimal messages at state s: true messages achieving max utility.
This is the set-level best response (Franke eq. 76): the speaker at state t uses messages in R_k^{-1}(t) that minimize |R_k(m)|, which corresponds to maximizing H(m|t) in the probabilistic rendering.
All probability-level reasoning should go through this set.
Equations
- RSA.IBR.SpeakerStrategy.optimalMessages G H s = {m ∈ G.trueMessages s | H.respond m s = RSA.IBR.SpeakerStrategy.maxUtility G H s}
Instances For
Best response speaker: uniform distribution over optimal messages (eq. 76).
Equations
- One or more equations did not get rendered due to their size.
Instances For
bestResponse gives 1/k to optimal messages, 0 to others.
Best response speaker always gives non-negative probabilities.
Best response speaker gives zero probability to false messages.
bestResponse gives positive probability iff m is optimal (and optimal set is nonempty).
Best response speaker probabilities sum to at most 1 at any state.
Best response speaker gives at most probability 1 to any message.
L₀: Literal listener (Franke Def. 22)
Equations
Instances For
Speaker update: Best response to hearer strategy.
S_{n+1}(m | s) = argmax_m L_n(s | m)
Uniform over optimal messages.
Equations
Instances For
Hearer best response: argmax of posterior probability (Franke eq. 77/120).
The hearer observes m, forms posterior μ(t|m) ∝ S(t,m) · P(t), and picks the state(s) with maximum posterior probability. Uniform over ties. For surprise messages (∀ t, S(t,m) · P(t) = 0), falls back to literal interpretation per the TCP assumption.
Equations
- One or more equations did not get rendered due to their size.
Instances For
hearerBR always gives non-negative response values.
One full IBR iteration step: speaker best-responds, hearer argmax-responds.
Equations
- RSA.IBR.ibrStep G H = RSA.IBR.hearerBR G (RSA.IBR.speakerUpdate G H)
Instances For
IBR reasoning for n iterations starting from L₀
Equations
- RSA.IBR.ibrN G 0 = RSA.IBR.L0 G
- RSA.IBR.ibrN G n_2.succ = RSA.IBR.ibrStep G (RSA.IBR.ibrN G n_2)
Instances For
S₁: First pragmatic speaker
Equations
- RSA.IBR.S1 G = RSA.IBR.speakerUpdate G (RSA.IBR.L0 G)
Instances For
L₂: First pragmatic listener (argmax response to S₁)
Equations
- RSA.IBR.L2 G = RSA.IBR.hearerBR G (RSA.IBR.S1 G)
Instances For
Check if hearer strategy is a fixed point of IBR
Equations
- RSA.IBR.isIBRFixedPoint G H = ∀ (m : G.Message) (s : G.State), H.respond m s = (RSA.IBR.ibrStep G H).respond m s
Instances For
At a fixed point with non-negative priors, H.respond is non-negative.
This follows from the fact that H = ibrStep G H, and ibrStep uses hearerBR which gives non-negative values (0 or 1/k).
The pragmatic interpretation: support of the IBR fixed point listener
Equations
- RSA.IBR.pragmaticSupport G H m = H.support m
Instances For
EG(S, R) = Σ_t Pr(t) × Σ_m S(t,m) × R(m,t): expected communication success.