Documentation

Linglib.Theories.Pragmatics.RSA.Implementations.EgreEtAl2023

Instances For
    Equations
    • One or more equations did not get rendered due to their size.
    Instances For
      Equations
      • One or more equations did not get rendered due to their size.

      Tolerance y: "around n" with tolerance y means x ∈ [n-y, n+y].

      Instances For
        Equations
        • One or more equations did not get rendered due to their size.
        Instances For
          Equations
          • One or more equations did not get rendered due to their size.
          Equations
          • One or more equations did not get rendered due to their size.
          Instances For
            Equations
            • One or more equations did not get rendered due to their size.
            Instances For

              ⟦around n⟧(y)(x) = 1 iff |n - x| ≤ y

              Equations
              Instances For

                BIR weight: Σ_{y ≥ |n-x|} P(y) under uniform P(y) on {0,...,n}. Section 3.2.2, p.1085: y ranges over {0,...,n}, not the full value domain.

                Equations
                Instances For

                  Closed form (Section 3.2.2): P(x=k | around n) = (n - |n-k| + 1) / (n+1)²

                  Equations
                  Instances For

                    L0 for "between a b" = uniform over [a,b].

                    Equations
                    • One or more equations did not get rendered due to their size.
                    Instances For

                      L0 for "exactly n" = point mass at n.

                      Equations
                      • One or more equations did not get rendered due to their size.
                      Instances For
                        Equations
                        • One or more equations did not get rendered due to their size.
                        Instances For

                          WIR: L(x=k | around n) = Σ_i P(x=k | x ∈ [n-i,n+i]) × P(y=i). Tolerance y ranges over {0,...,n} (Section 3.2.2).

                          Equations
                          • One or more equations did not get rendered due to their size.
                          Instances For

                            Ratio Inequality: posterior concentrates more on center than prior. Under uniform prior, reduces to P(v3|around3) / P(v1|around3) > 1.

                            BIR joint marginalizes to favor large tolerances (more states compatible). With y ∈ {0,...,3}, y3 has 7 compatible values while y0 has 1.

                            Adjacent values have similar BIR probabilities (each step ≥ 50%).

                            Speaker's peaked private distribution centered on observed value.

                            Equations
                            • One or more equations did not get rendered due to their size.
                            Instances For

                              Speaker utility: U(m, o) = -D_KL(P_o || L⁰_m).

                              Equations
                              Instances For

                                For a speaker who observed 3, "around 3" has better utility than "between 0 6" (same support, but flat). This is the paper's key result: peaked shape yields lower KL from peaked belief.

                                def RSA.EgreEtAl2023.SameSupport {α : Type} (d₁ d₂ : α) :

                                Same support: P(w|o₁) > 0 ↔ P(w|o₂) > 0.

                                Equations
                                Instances For
                                  def RSA.EgreEtAl2023.RespectsQuality {W I : Type} (m_true : IWBool) (obs : W) (i : I) :

                                  Quality: ∀ w, P(w|o) > 0 → ⟦m⟧ⁱ(w) = 1.

                                  Equations
                                  Instances For
                                    def RSA.EgreEtAl2023.RespectsWeakQuality {W I : Type} (m_true : IWBool) (obs : W) :

                                    Weak Quality: ∃ i, Quality(m, o, i).

                                    Equations
                                    Instances For
                                      theorem RSA.EgreEtAl2023.quality_preserved_by_same_support {W I : Type} (m_true : IWBool) (d₁ d₂ : W) (i : I) (h_same : SameSupport d₁ d₂) :
                                      RespectsQuality m_true d₁ i RespectsQuality m_true d₂ i

                                      (A-1a) Quality preserved under same support.

                                      theorem RSA.EgreEtAl2023.weak_quality_preserved_by_same_support {W I : Type} (m_true : IWBool) (d₁ d₂ : W) (h_same : SameSupport d₁ d₂) :

                                      (A-1b) Weak Quality preserved under same support.

                                      def RSA.EgreEtAl2023.softMaxScore (utilities : List ) (k : ) (alpha : ) :

                                      SoftMax(x_k, x, λ) = exp(λx_k) / Σ_j exp(λx_j).

                                      Equations
                                      • One or more equations did not get rendered due to their size.
                                      Instances For
                                        Equations
                                        Instances For
                                          def RSA.EgreEtAl2023.utilityDifferenceConstant {W : Type} [BEq W] (support : List W) (d₁ d₂ : W) :

                                          K(o₁,o₂): utility difference constant, independent of m and i (Core Lemma A-6).

                                          Equations
                                          • One or more equations did not get rendered due to their size.
                                          Instances For
                                            def RSA.EgreEtAl2023.U1 {W M I : Type} [BEq W] (l0 : MIW) (obs : W) (m : M) (i : I) (worlds : List W) :

                                            U¹(m, o, i) = Σ_w P(w|o) · log L⁰(w | m, i) — speaker utility at level 1. This is the KL-based utility: higher when L⁰ matches the observation.

                                            Equations
                                            • One or more equations did not get rendered due to their size.
                                            Instances For
                                              def RSA.EgreEtAl2023.S1_score {W M I : Type} [BEq W] [BEq M] (l0 : MIW) (obs : W) (messages : List M) (i : I) (worlds : List W) (alpha : ) (m : M) :

                                              S¹(m | o, i) = SoftMax over U¹ utilities across messages.

                                              Equations
                                              • One or more equations did not get rendered due to their size.
                                              Instances For
                                                theorem RSA.EgreEtAl2023.no_quality_implies_S1_zero {W M I : Type} [BEq W] [BEq M] (l0 : MIW) (obs : W) (_messages : List M) (i : I) (worlds : List W) (_alpha : ) (m : M) (h_nq : ∀ (w : W), obs w > 0l0 m i w = 0) :
                                                U1 l0 obs m i worlds = 0
                                                theorem RSA.EgreEtAl2023.core_lemma_A6 {W M I : Type} [Fintype W] (f : W) (c : MI) (d₁ d₂ : W) (h_sum : w : W, d₁ w = w : W, d₂ w) (m₁ m₂ : M) (i₁ i₂ : I) :
                                                w : W, d₂ w * (f w + c m₁ i₁) - w : W, d₁ w * (f w + c m₁ i₁) = w : W, d₂ w * (f w + c m₂ i₂) - w : W, d₁ w * (f w + c m₂ i₂)

                                                (A-6) Core Lemma over ℝ: the utility difference U(m,d₂,i) - U(m,d₁,i) is constant across all messages m and interpretations i, provided Σd₁ = Σd₂.

                                                Under Quality, log L⁰(w|m,i) = f(w) + c(m,i) where f(w) = log prior(w) and c(m,i) = −log Z(m,i). Since f doesn't depend on m,i and Σd₁ = Σd₂, the c(m,i) term cancels in the difference, making K independent of m and i.

                                                theorem RSA.EgreEtAl2023.same_support_implies_equal_S1 {M : Type} [Fintype M] (u₁ u₂ : M) (α : ) (h_shift : ∃ (K : ), ∀ (m : M), u₂ m = u₁ m + K) :
                                                Core.softmax u₂ α = Core.softmax u₁ α

                                                (A-7) Same support → S¹ equal over ℝ: when utility vectors differ by a constant, softmax is invariant by Core.softmax_add_const.

                                                By A-6, U¹(·, d₂, i) = U¹(·, d₁, i) + K for some constant K. By A-5 (translation invariance), softmax(u + K, α) = softmax(u, α).

                                                theorem RSA.EgreEtAl2023.lu_limitation {M : Type} [Fintype M] (u₁ u₂ : M) (α : ) (h_shift : ∃ (K : ), ∀ (m : M), u₂ m = u₁ m + K) :
                                                Core.softmax u₂ α = Core.softmax u₁ α

                                                (A-8) LU Limitation over ℝ: same support → Sⁿ(m|o₁) = Sⁿ(m|o₂) for all n ≥ 1. At level 1, this is a direct corollary of A-7. The paper's full inductive argument (higher recursion depths) follows the same pattern: each Lⁿ is built from Sⁿ⁻¹ which are equal by inductive hypothesis, so Uⁿ differs by a constant, so Sⁿ is equal by softmax translation invariance.

                                                def RSA.EgreEtAl2023.U_std (l0_scores obs : Value) :

                                                C.1: Standard utility U_std(m,o) = Σ_w P(w|o) · log(Σ_{o'} L(w,o')). Under standard utility, U_std differs for same-support observations because the marginal Σ_{o'} L(w,o') washes out observation-specific shape.

                                                Equations
                                                • One or more equations did not get rendered due to their size.
                                                Instances For
                                                  def RSA.EgreEtAl2023.U_bergen (l0_scores obs : Value) :

                                                  C.2: Bergen utility U_bergen(m,o) = Σ_w P(w|o) · log L(w|o). Under Bergen utility, the observation enters both the weight and the listener posterior, so same-support observations yield different utilities (the peaked observation gets higher utility from a peaked L0).

                                                  Equations
                                                  • One or more equations did not get rendered due to their size.
                                                  Instances For

                                                    Peaked observation has better utility from triangular L0 than flat does. This is because the peaked observation puts more weight on center values where L0 also has higher probability — better KL alignment.

                                                    Both observations get the SAME utility under a uniform L0 (from "between"). This demonstrates the LU limitation: uniform L0 cannot distinguish shapes.

                                                    Equations
                                                    Instances For

                                                      BIR weight = marginalization of aroundMeaning over valid tolerances y ≤ n.

                                                      BIR posterior matches closed-form for each value (n=3).