Documentation

Linglib.Theories.Pragmatics.IBR.Core

Iterated Best Response: Core Definitions #

Game-theoretic pragmatics following @cite{franke-2011} §6-8.

Defines interpretation games (InterpGame), speaker/hearer strategies, literal listener (L₀), best response operators, IBR iteration (ibrN), fixed points, and expected gain (expectedGain).

Interpretation game (Franke §6): states are equivalence classes over alternative truth patterns.

Instances For

    States where message m is true

    Equations
    Instances For
      @[simp]

      Messages true at state s

      Equations
      Instances For

        Informativity of a message (reciprocal of true states, as ratio)

        Equations
        Instances For

          A hearer strategy: P(state | message)

          Instances For

            A speaker strategy: P(message | state)

            Instances For

              Uniform distribution over states where m is true

              Equations
              • One or more equations did not get rendered due to their size.
              Instances For

                Support of hearer's response to message m

                Equations
                Instances For

                  Support of speaker's choice at state s

                  Equations
                  Instances For

                    Max utility among true messages at state s (0 if no true messages).

                    Equations
                    Instances For

                      Optimal messages at state s: true messages achieving max utility.

                      This is the set-level best response (Franke eq. 76): the speaker at state t uses messages in R_k^{-1}(t) that minimize |R_k(m)|, which corresponds to maximizing H(m|t) in the probabilistic rendering.

                      All probability-level reasoning should go through this set.

                      Equations
                      Instances For

                        Best response speaker: uniform distribution over optimal messages (eq. 76).

                        Equations
                        • One or more equations did not get rendered due to their size.
                        Instances For

                          bestResponse gives 1/k to optimal messages, 0 to others.

                          Best response speaker always gives non-negative probabilities.

                          theorem RSA.IBR.SpeakerStrategy.bestResponse_false_zero (G : InterpGame) (H : HearerStrategy G) (s : G.State) (m : G.Message) (hFalse : G.meaning m s = false) :
                          (bestResponse G H).choose s m = 0

                          Best response speaker gives zero probability to false messages.

                          bestResponse gives positive probability iff m is optimal (and optimal set is nonempty).

                          Best response speaker probabilities sum to at most 1 at any state.

                          Best response speaker gives at most probability 1 to any message.

                          L₀: Literal listener (Franke Def. 22)

                          Equations
                          Instances For

                            Speaker update: Best response to hearer strategy.

                            S_{n+1}(m | s) = argmax_m L_n(s | m)

                            Uniform over optimal messages.

                            Equations
                            Instances For

                              Hearer best response: argmax of posterior probability (Franke eq. 77/120).

                              The hearer observes m, forms posterior μ(t|m) ∝ S(t,m) · P(t), and picks the state(s) with maximum posterior probability. Uniform over ties. For surprise messages (∀ t, S(t,m) · P(t) = 0), falls back to literal interpretation per the TCP assumption.

                              Equations
                              • One or more equations did not get rendered due to their size.
                              Instances For
                                theorem RSA.IBR.hearerBR_respond_nonneg (G : InterpGame) (S : SpeakerStrategy G) (m : G.Message) (s : G.State) :
                                (hearerBR G S).respond m s 0

                                hearerBR always gives non-negative response values.

                                One full IBR iteration step: speaker best-responds, hearer argmax-responds.

                                Equations
                                Instances For

                                  IBR reasoning for n iterations starting from L₀

                                  Equations
                                  Instances For

                                    S₁: First pragmatic speaker

                                    Equations
                                    Instances For

                                      L₂: First pragmatic listener (argmax response to S₁)

                                      Equations
                                      Instances For

                                        Check if hearer strategy is a fixed point of IBR

                                        Equations
                                        Instances For
                                          theorem RSA.IBR.fp_respond_nonneg (G : InterpGame) (H : HearerStrategy G) (hFP : isIBRFixedPoint G H) (_hPriorNonneg : ∀ (s : G.State), G.prior s 0) (m : G.Message) (s : G.State) :
                                          H.respond m s 0

                                          At a fixed point with non-negative priors, H.respond is non-negative.

                                          This follows from the fact that H = ibrStep G H, and ibrStep uses hearerBR which gives non-negative values (0 or 1/k).

                                          The pragmatic interpretation: support of the IBR fixed point listener

                                          Equations
                                          Instances For
                                            noncomputable def RSA.IBR.expectedGain (G : InterpGame) (S : SpeakerStrategy G) (H : HearerStrategy G) :

                                            EG(S, R) = Σ_t Pr(t) × Σ_m S(t,m) × R(m,t): expected communication success.

                                            Equations
                                            Instances For