Documentation

Linglib.Comparisons.RSANeoGricean

RSA prediction: L1 probability for "not all" > L1 probability for "all".

Equations
Instances For

    Directional Agreement Theorem

    Both theories agree that "some" favors the "not all" interpretation.

    • NeoGricean: categorically derives the implicature
    • RSA: assigns higher probability to "not all" world

    The RSA → IBR → EXH Limit #

    As RSA rationality parameter α → ∞:

    This is now proved via the chain:

    1. rsa_to_ibr_limit (Franke2011.lean): RSA S1 → IBR S1 as α → ∞

      • RSA S1 = softmax(log-informativity, α)
      • Uses tendsto_softmax_infty_at_max from Softmax.Limits
      • Softmax concentrates on the unique maximum
    2. ibr_equals_exhMW (in progress): IBR fixed point = exhMW

      • IBR eliminates non-minimal states
      • Converges to minimal-worlds exhaustification
    3. Under closure (Spector Thm 9): exhMW = exhIE

      • When alternatives are closed under conjunction
      • Innocent Exclusion = Minimal Worlds

    The Key Insight #

    Both RSA and NeoGricean implement the same rational principle: "Maximize informativity subject to truth"

    As α → ∞, RSA's softmax becomes EXH's argmax.

    The rationality parameter α controls how "sharp" RSA predictions are.

    • α = 0: uniform (speaker is random)
    • α = 1: softmax (standard RSA)
    • α → ∞: argmax (categorical)
    Instances For
      theorem Comparisons.RSANeoGricean.rsa_speaker_to_ibr (G : RSA.IBR.InterpGame) [Nonempty G.Message] (s : G.State) (m : G.Message) (hTrue : G.meaning m s = true) (hUnique : ∀ (m' : G.Message), m' mG.meaning m' s = trueG.informativity m > G.informativity m') (hInfPos : 0 < G.informativity m) :

      The Limit Theorem (@cite{franke-2011}, formalized):

      As α → ∞, RSA S1 probability concentrates on the IBR-optimal message.

      This is rsa_to_ibr_limit from Franke2011.lean, re-exported here for convenience.

      The full limit chain: RSA → IBR → exhMW → exhIE (under closure).

      This structure captures the complete picture of how RSA relates to exhaustive interpretation in the high-rationality limit.

      Instances For

        LimitAgreement (Removed) #

        The LimitAgreement structure that paired an RSAScenario with a StandardRecipeResult has been removed. The old RSAScenario type has been replaced by RSAConfig. Agreement between RSA and NeoGricean is demonstrated through the limit theorem and structural comparisons below.

        Both theories induce an ordering on interpretations.

        • NeoGricean: implicature present > implicature absent (for UE contexts)
        • RSA: higher probability > lower probability

        Ordinal agreement means these orderings are compatible.

        Instances For

          For "some" in UE context:

          • NeoGricean prefers "some but not all" interpretation
          • RSA gives "some but not all" world higher probability
          Equations
          Instances For

            Both theories predict reduced implicatures in DE contexts.

            • NeoGricean: Scale reversal blocks the implicature
            • RSA: Global interpretation has higher probability

            This is a key empirical prediction where both theories converge.

            • scalarItem : String

              The scalar item

            • neoGricean_blocks : Bool

              NeoGricean predicts blocking

            • rsa_prefers_global : Prop

              RSA prefers global (no local SI) — a Prop, not a stipulated Bool

            • agreement : self.neoGricean_blocks = trueself.rsa_prefers_global

              Agreement: NeoGricean blocking implies RSA global preference

            Instances For

              For "some" under "no one":

              • NeoGricean: Scale reversal blocks "not all"
              • RSA: Global interpretation preferred (proven in @cite{potts-etal-2016})

              The rsa_prefers_global field is now a genuine Prop — the L1 inequality from the Potts model — not a stipulated Bool. The agreement field dispatches to the proved de_blocking_NNN_vs_NNA theorem.

              Equations
              • One or more equations did not get rendered due to their size.
              Instances For

                Comparison of what each theory requires as input.

                • needsAlternatives : Bool

                  Does the theory need explicit alternatives?

                • needsPriors : Bool

                  Does the theory need prior probabilities?

                • needsCosts : Bool

                  Does the theory need cost functions?

                • needsRecursionDepth : Bool

                  Does the theory need recursion depth parameter?

                • categoricalOutput : Bool

                  Does the theory produce categorical outputs?

                Instances For
                  Equations
                  Instances For
                    Equations
                    Instances For

                      Conditions under which RSA = NeoGricean (conjectural).

                      • uniformPriors : Bool

                        Priors are uniform

                      • zeroCosts : Bool

                        Costs are zero

                      • highRationality : Bool

                        Rationality is "high" (ideally infinite)

                      • matchingAlternatives : Bool

                        Alternative sets match

                      Instances For

                        Standard conditions for equivalence.

                        Equations
                        Instances For

                          Information-Theoretic Connection to NeoGricean #

                          @cite{zaslavsky-hu-levy-2020} show that RSA optimizes: G_α = H_S(U|M) + α · E[V_L]

                          As α → ∞:

                          This is exactly the NeoGricean "say the strongest true thing" principle!

                          The NeoGricean Limit #

                          In information-theoretic terms, NeoGricean emerges when:

                          1. α → ∞ (pure informativity, no compression cost)
                          2. Speaker chooses argmax_u log L0(m|u) = argmax_u informativity(u)
                          3. This is the Gricean maxim of quantity

                          Entropy Contribution #

                          At finite α, the speaker balances:

                          As α → ∞, compression cost → 0, so only informativity matters. This recovers NeoGricean categorical predictions.

                          The NeoGricean limit can be characterized information-theoretically: at α → ∞, the speaker ignores compression and maximizes informativity.

                          Equations
                          Instances For

                            What This Module Establishes #

                            Proven #

                            1. Limit theorem: RSA S1 → IBR S1 as α → ∞ (rsa_to_ibr_limit)
                            2. Directional agreement: Both predict "some" → "not all"
                            3. Ordinal agreement: Both rank interpretations the same way
                            4. DE blocking agreement: Both predict blocking under negation
                            5. Structural comparison: RSA is more expressive (probabilities, priors)

                            In Progress #

                            1. IBR = exhMW: Full proof that IBR fixed point equals exhMW
                            2. Closure connection: Under conjunction closure, exhMW = exhIE (Spector Thm 9)

                            Information-Theoretic Perspective #

                            1. G_α objective: RSA optimizes H_S + α·E[V_L]
                            2. NeoGricean as limit: As α → ∞, H_S contribution vanishes
                            3. Categorical = pure informativity: argmax replaces softmax
                            4. Phase transition at α = 1: Rate-distortion optimum

                            The Limit Chain #

                            RSA S1 (softmax) ──α→∞──> IBR S1 (argmax) ────> exhMW ──closure──> exhIE
                                 ↑ ↑ ↑ ↑
                              PROVED PROVED (TODO) (Spector)
                              rsa_to_ibr_limit (trivial) ibr_equals_exhMW Theorem 9
                            

                            Future Work #

                            1. Complete ibr_equals_exhMW proof
                            2. Prove specific equivalence instances
                            3. Characterize exactly when predictions diverge
                            4. Connect to experimental data on implicature rates
                            5. Explore suboptimality for α < 1 (utility non-monotonicity)