Speaker Discrimination: JND/Trace on S1 #
@cite{luce-1959}
Applies the just-noticeable-difference (JND) and trace infrastructure from
@cite{luce-1959} — formalized in ChoiceApproximations.lean — to S1's
utterance choice in RSA.
Key idea #
S1 is a RationalAction W U: given world w and latent l, S1 assigns
each utterance u a score, which determines a Luce choice rule. The
score function S1agent.score w : U → ℝ is therefore a ratio scale on
utterances, and the JND/trace machinery applies directly:
- S1_jndL: utterance
u₁is discriminably preferred overu₂at thresholdπ— the speaker reliably choosesu₁overu₂ - S1_jndI: utterances
u₁andu₂are pragmatically indistinguishable — the speaker cannot reliably discriminate between them - S1_traceGe: the trace ordering on utterances, which recovers the latent preference ranking
Strict positivity #
The JND/trace theorems in ChoiceApproximations.lean require strictly
positive scale values (0 < v a), matching Luce's assumption that every
alternative has positive value. This is stronger than score_nonneg (0 ≤).
The positivity hypothesis hv_pos is carried explicitly — it holds whenever
S1's score function is strictly positive (e.g., for exponential scoring rules
like exp(α · u), but not necessarily for rpow(L0, α) when some L0 values
are zero).
Properties (zero sorry in §3) #
All semiorder and trace properties follow by direct application of the
corresponding theorems in ChoiceApproximations.lean:
- L-transitivity, I-symmetry, I-reflexivity
- Trace ↔ score ordering
L1 invariance (sorry'd) #
If S1 scores match everywhere for two utterances, L1 posteriors are identical. This is stated but sorry'd — the proof requires showing that equal S1 policies yield equal L1agent scores.
S1's score function viewed as a ratio scale on utterances.
Given a latent value l and world w, this is the function U → ℝ
that the JND/trace machinery operates on.
Instances For
S1-discriminable preference: utterance u₁ is discriminably preferred
over u₂ by the pragmatic speaker at threshold thr.
This means S1 reliably chooses u₁ over u₂ in a binary forced choice:
P(u₁, {u₁, u₂}) > thr.
Instances For
S1-indistinguishable utterances: the pragmatic speaker cannot reliably
discriminate between u₁ and u₂ at threshold thr.
This defines a pragmatic tolerance band: utterances within it are interchangeable from S1's perspective.
Instances For
S1 trace ordering on utterances: u₁ ≥_T u₂ iff
P(u₁, z) ≥ P(u₂, z) for all utterances z.
Equations
- cfg.S1_traceGe l w u₁ u₂ = Core.traceGe (cfg.S1_scale l w) u₁ u₂
Instances For
L-transitivity for S1 preferences: if S1 discriminably prefers u₁
over u₂ and u₂ over u₃, then S1 discriminably prefers u₁ over u₃.
Requires strict positivity of S1 scores (Luce's ratio scale assumption).
I-symmetry for S1 indistinguishability: if u₁ and u₂ are
indistinguishable, so are u₂ and u₁.
I-reflexivity for S1: every utterance is indistinguishable from itself.
Trace ↔ score ordering: u₁ ≥_T u₂ iff S1.score(w, u₂) ≤ S1.score(w, u₁).
If S1 scores match everywhere for two utterances, L1 posteriors are identical.
When S1agent.score w u₁ = S1agent.score w u₂ for all latent values and
worlds, the two utterances are indistinguishable at L1: the pragmatic
listener assigns them the same posterior over worlds.