Documentation

Linglib.Core.Agent.UtilityTheory

Luce's Utility Decomposition Theory (Chapter 3) @cite{luce-1959} #

@cite{luce-1959} extends the choice axiom from simple alternatives to gambles — uncertain prospects of the form "get outcome a if event ρ occurs, otherwise get outcome b." The key result is a decomposition theorem: the subjective value of a gamble factors multiplicatively into a component that depends only on the outcomes and a component that depends only on the event.

Architecture #

This module formalizes:

Gambles (§3.A): An outcome aρb means "receive a if event ρ, else b."
Decomposition Axiom (Axiom 2): When outcomes are fixed, choice between gambles depends only on the events.
Monotonicity Axiom (Axiom 3): Preferred outcome + preferred event → preferred gamble.
Luce ratio scales (§2a): The Luce choice axiom applied to event and gamble choice functions, providing ratio-scale representations Q(ρ,σ) = v(ρ)/(v(ρ)+v(σ)).
Three equivalence classes (Theorem 12): Under the Luce choice axiom, events classified as neutral relative to a reference are indifferent: Q(ρ,σ) = ½.
Scale decomposition (§3.D): v(aρb) = w(a,b) · φ(ρ) — gamble value factors into outcome value × event weight.
RSA bridge: RSA's additive utility structure utility = informativity - cost follows from Luce's decomposition in log-space.

structure Core.Event :

An event in a decision problem. Events are the uncertain conditions under which outcomes are determined.

id : ℕ

Instances For

instance Core.instDecidableEqEvent :

DecidableEq Event

Equations

Core.instDecidableEqEvent = Core.instDecidableEqEvent.decEq

def Core.instDecidableEqEvent.decEq (x✝ x✝¹ : Event) :

Decidable (x✝ = x✝¹)

Equations

Core.instDecidableEqEvent.decEq { id := a } { id := b } = if h : a = b then h ▸ isTrue ⋯ else isFalse ⋯

Instances For

instance Core.instReprEvent :

Equations

Core.instReprEvent = { reprPrec := Core.instReprEvent.repr }

def Core.instReprEvent.repr :

Event → ℕ → Std.Format

Equations

Core.instReprEvent.repr x✝ prec✝ = Std.Format.bracket "{ " (Std.Format.nil ++ Std.Format.text "id" ++ Std.Format.text " := " ++ (Std.Format.nest 6 (repr x✝.id)).group) " }"

Instances For

structure Core.Gamble (Outcome : Type u_1) :

Type u_1

A gamble aρb: receive outcome a if event ρ occurs, else outcome b. (@cite{luce-1959}, §3.A)

win : Outcome
Outcome if event occurs
event : Event
The conditioning event
lose : Outcome
Outcome if event does not occur

Instances For

structure Core.GambleChoiceFn (Outcome : Type u_2) :

Type u_2

A gamble choice function assigns choice probabilities to pairs of gambles. P(g₁, g₂) is the probability of choosing gamble g₁ over g₂. (@cite{luce-1959}, §3.A)

prob : Gamble Outcome → Gamble Outcome → ℝ
Binary choice probability: P(g₁ preferred over g₂)
prob_nonneg (g₁ g₂ : Gamble Outcome) : 0 ≤ self.prob g₁ g₂
Probabilities are in [0,1]
prob_le_one (g₁ g₂ : Gamble Outcome) : self.prob g₁ g₂ ≤ 1
prob_complement (g₁ g₂ : Gamble Outcome) : self.prob g₁ g₂ + self.prob g₂ g₁ = 1
Binary complement: P(g₁, g₂) + P(g₂, g₁) = 1

Instances For

structure Core.OutcomeChoiceFn (Outcome : Type u_2) :

Type u_2

A simple outcome choice function: P(a, b) = probability of choosing a over b. Used for the outcome-only component of decomposition.

prob : Outcome → Outcome → ℝ
prob_nonneg (a b : Outcome) : 0 ≤ self.prob a b
prob_le_one (a b : Outcome) : self.prob a b ≤ 1
prob_complement (a b : Outcome) : self.prob a b + self.prob b a = 1

Instances For

structure Core.EventChoiceFn :

An event choice function: Q(ρ, σ) = probability of preferring event ρ over σ. (Extracted when outcomes are held fixed.)

prob : Event → Event → ℝ
prob_nonneg (ρ σ : Event) : 0 ≤ self.prob ρ σ
prob_le_one (ρ σ : Event) : self.prob ρ σ ≤ 1
prob_complement (ρ σ : Event) : self.prob ρ σ + self.prob σ ρ = 1

Instances For

structure Core.DecompositionAxiom {Outcome : Type u_1} (P : GambleChoiceFn Outcome) :

Decomposition Axiom (@cite{luce-1959}, Axiom 2): When comparing gambles with fixed outcomes (same a, same b), the choice probability depends only on the events.

P(aρb, aσb) = Q(ρ, σ) for some event choice function Q.

eventChoice : EventChoiceFn
The extracted event choice function
decomp (a b : Outcome) (ρ σ : Event) : P.prob { win := a, event := ρ, lose := b } { win := a, event := σ, lose := b } = self.eventChoice.prob ρ σ
Fixed outcomes ⟹ choice depends only on events

Instances For

structure Core.MonotonicityAxiom {Outcome : Type u_1} (P : GambleChoiceFn Outcome) (outcomeChoice : OutcomeChoiceFn Outcome) (eventChoice : EventChoiceFn) :

Monotonicity Axiom (@cite{luce-1959}, Axiom 3): If outcome a is preferred to b (P(a,b) ≥ ½) and event ρ is preferred to σ (Q(ρ,σ) ≥ ½), then gamble aρb is preferred to bσa.

This captures the intuition that a better outcome under a more favorable event should be preferred to a worse outcome under a less favorable event.

mono (a b : Outcome) (ρ σ : Event) : outcomeChoice.prob a b ≥ 1 / 2 → eventChoice.prob ρ σ ≥ 1 / 2 → P.prob { win := a, event := ρ, lose := b } { win := b, event := σ, lose := a } ≥ 1 / 2

Instances For

structure Core.EventLuceScale (Q : EventChoiceFn) :

A Luce ratio scale for an event choice function: Q(ρ,σ) = v(ρ)/(v(ρ)+v(σ)) for some positive scoring function v. This is the event-level analog of the Luce choice rule from RationalAction.

v : Event → ℝ
v_pos (ρ : Event) : 0 < self.v ρ
luce_rule (ρ σ : Event) : Q.prob ρ σ = self.v ρ / (self.v ρ + self.v σ)

Instances For

structure Core.GambleLuceScale {Outcome : Type u_1} (P : GambleChoiceFn Outcome) :

Type u_1

A Luce ratio scale for a gamble choice function: P(g₁,g₂) = v(g₁)/(v(g₁)+v(g₂)) for some positive scoring function v. This is the gamble-level Luce choice axiom (@cite{luce-1959}, Chapter 1 applied to gamble alternatives).

v : Gamble Outcome → ℝ
v_pos (g : Gamble Outcome) : 0 < self.v g
luce_rule (g₁ g₂ : Gamble Outcome) : P.prob g₁ g₂ = self.v g₁ / (self.v g₁ + self.v g₂)

Instances For

inductive Core.EventClass :

An event ρ is favorable relative to an outcome preference P(a,b) ≥ ½ if Q(ρ, σ₀) > ½ for the neutral event σ₀. Intuitively: ρ makes the preferred outcome more likely.

favorable : EventClass
neutral : EventClass
unfavorable : EventClass

Instances For

instance Core.instDecidableEqEventClass :

DecidableEq EventClass

Equations

Core.instDecidableEqEventClass x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯

instance Core.instReprEventClass :

Repr EventClass

Equations

Core.instReprEventClass = { reprPrec := Core.instReprEventClass.repr }

def Core.instReprEventClass.repr :

EventClass → ℕ → Std.Format

Equations

Core.instReprEventClass.repr Core.EventClass.favorable prec✝ = Repr.addAppParen (Std.Format.nest (if prec✝ ≥ 1024 then 1 else 2) (Std.Format.text "Core.EventClass.favorable")).group prec✝
Core.instReprEventClass.repr Core.EventClass.neutral prec✝ = Repr.addAppParen (Std.Format.nest (if prec✝ ≥ 1024 then 1 else 2) (Std.Format.text "Core.EventClass.neutral")).group prec✝
Core.instReprEventClass.repr Core.EventClass.unfavorable prec✝ = Repr.addAppParen (Std.Format.nest (if prec✝ ≥ 1024 then 1 else 2) (Std.Format.text "Core.EventClass.unfavorable")).group prec✝

Instances For

noncomputable def Core.classifyEvent (Q : EventChoiceFn) (ref ρ : Event) :

Classify an event based on its choice probability against a reference event. (@cite{luce-1959}, §3.C)

Equations

Core.classifyEvent Q ref ρ = if Q.prob ρ ref > 1 / 2 then Core.EventClass.favorable else if Q.prob ρ ref < 1 / 2 then Core.EventClass.unfavorable else Core.EventClass.neutral

Instances For

theorem Core.threeClasses (Q : EventChoiceFn) (hScale : EventLuceScale Q) (ref ρ σ : Event) (hρ : classifyEvent Q ref ρ = EventClass.neutral) (hσ : classifyEvent Q ref σ = EventClass.neutral) :

Q.prob ρ σ = 1 / 2

Neutral class indifference (@cite{luce-1959}, Theorem 12): Under the Luce choice axiom, events classified as neutral relative to a reference event are indifferent to each other: Q(ρ, σ) = ½.

If v(ρ) = v(ref) (neutral) and v(σ) = v(ref) (neutral), then v(ρ) = v(σ), hence Q(ρ,σ) = v(ρ)/(v(ρ)+v(σ)) = ½.

Note: For favorable and unfavorable events, same-class membership does NOT imply indifference — events within these classes can have different v-values and thus Q(ρ,σ) ≠ ½. See favorable_over_unfavorable for the between-class ordering.

theorem Core.favorable_over_unfavorable (Q : EventChoiceFn) (hScale : EventLuceScale Q) (ref ρ σ : Event) (hρ : classifyEvent Q ref ρ = EventClass.favorable) (hσ : classifyEvent Q ref σ = EventClass.unfavorable) :

Q.prob ρ σ > 1 / 2

Between-class ordering: favorable events are preferred over unfavorable events. If v(ρ) > v(ref) (favorable) and v(σ) < v(ref) (unfavorable), then v(ρ) > v(σ), hence Q(ρ,σ) > ½.

theorem Core.favorable_over_neutral (Q : EventChoiceFn) (hScale : EventLuceScale Q) (ref ρ σ : Event) (hρ : classifyEvent Q ref ρ = EventClass.favorable) (hσ : classifyEvent Q ref σ = EventClass.neutral) :

Q.prob ρ σ > 1 / 2

Between-class ordering: favorable events are preferred over neutral events.

structure Core.ScaleDecomposition (Outcome : Type u_2) :

Type u_2

A gamble value function that factors into outcome value × event weight. (@cite{luce-1959}, §3.D)

v(aρb) = w(a,b) · φ(ρ) where:

w(a,b) depends only on the outcomes
φ(ρ) depends only on the event

outcomeValue : Outcome → Outcome → ℝ
Outcome value component
eventWeight : Event → ℝ
Event weight component
outcomeValue_nonneg (a b : Outcome) : 0 ≤ self.outcomeValue a b
Non-negativity
eventWeight_nonneg (ρ : Event) : 0 ≤ self.eventWeight ρ

Instances For

noncomputable def Core.ScaleDecomposition.gambleValue {Outcome : Type u_1} (sd : ScaleDecomposition Outcome) (g : Gamble Outcome) :

The gamble value under a scale decomposition.

Equations

sd.gambleValue g = sd.outcomeValue g.win g.lose * sd.eventWeight g.event

Instances For

theorem Core.scaleDecomposition {Outcome : Type u_1} (P : GambleChoiceFn Outcome) (hDecomp : DecompositionAxiom P) (hLuce : GambleLuceScale P) (ρ₀ : Event) (a₀ b₀ : Outcome) :

∃ (sd : ScaleDecomposition Outcome), ∀ (g₁ g₂ : Gamble Outcome), sd.gambleValue g₁ + sd.gambleValue g₂ > 0 → P.prob g₁ g₂ = sd.gambleValue g₁ / (sd.gambleValue g₁ + sd.gambleValue g₂)

Scale decomposition theorem (@cite{luce-1959}, §3.D): Under the Luce choice axiom and the decomposition axiom, the choice probability for gambles can be represented as a Luce choice rule with scores that factor as v(aρb) = w(a,b) · φ(ρ).

The construction: fix a reference event ρ₀ and reference outcomes a₀, b₀.

w(a,b) := v(a,ρ₀,b) (gamble value with reference event)
φ(ρ) := v(a₀,ρ,b₀) / v(a₀,ρ₀,b₀) (event weight normalized by reference)

The decomposition axiom ensures that v(g)/v(g.win,ρ₀,g.lose) depends only on the event, so v(g) = w(g.win,g.lose) · φ(g.event).

RSA Bridge #

Luce's decomposition theorem justifies the additive structure of RSA utility.

In RSA, the speaker's utility is: utility(u, w) = log P(w|u) - cost(u)

This additive structure in log-space corresponds to multiplicative structure in ratio-scale space: score(u, w) = exp(α · utility) = exp(α · log P(w|u)) · exp(-α · cost(u)) = informativity^α · exp(-α · cost)

Luce's decomposition v(aρb) = w(a,b) · φ(ρ) provides the axiomatic foundation:

The "outcome" dimension corresponds to the communicative goal (informativity)
The "event" dimension corresponds to the production process (cost)
The multiplicative factoring is derived from IIA + monotonicity, not stipulated

structure Core.RSAUtilityDecomposition :

RSA utility as a Luce decomposition instance.

The speaker's score exp(α · (log P(w|u) - cost(u))) factors as:

outcome component: P(w|u)^α (informativity)
event component: exp(-α · cost(u)) (production ease)

This factoring means the Luce choice axiom guarantees that informativity and cost contribute independently to the speaker's choice, which is a substantive empirical prediction (not a modeling convenience).

α : ℝ
Rationality parameter
informativity : ℝ
Informativity: P(w|u) for each utterance-world pair
cost : ℝ
Cost of utterance
informativity_nonneg : 0 ≤ self.informativity
Informativity is a probability
informativity_le_one : self.informativity ≤ 1

Instances For

noncomputable def Core.RSAUtilityDecomposition.score (d : RSAUtilityDecomposition) :

RSA speaker score factors multiplicatively, following Luce decomposition.

Equations

d.score = d.informativity ^ d.α * Real.exp (-d.α * d.cost)

Instances For

theorem Core.rsa_utility_decomposition (d : RSAUtilityDecomposition) (hinfo_pos : 0 < d.informativity) :

Real.log d.score = d.α * Real.log d.informativity - d.α * d.cost

The RSA utility decomposition: log(score) = α · log(informativity) - α · cost. This additive structure in log-space is what Luce's Chapter 3 derives from the decomposition axioms.