Gumbel-Luce Equivalence (McFadden's Theorem) @cite{mcfadden-1974} #
The exact algebraic connection between Gumbel noise in a Random Utility Model (RUM) and the Luce choice rule (softmax). @cite{luce-1959}
Random Utility Models #
A Random Utility Model (RUM) assigns each alternative i a random utility:
`Uᵢ = uᵢ + εᵢ`
where uᵢ is a deterministic component and εᵢ is random noise.
The choice probability is P(i chosen) = P(Uᵢ = max_j Uⱼ).
Two noise distributions yield closed-form choice probabilities:
- Gaussian noise →
Φ(Thurstone Case V, §2.D of @cite{luce-1959}) - Gumbel noise → softmax (@cite{mcfadden-1974})
McFadden's Theorem #
If the εᵢ are i.i.d. Gumbel(0, β), then:
`P(i chosen) = exp(uᵢ/β) / Σⱼ exp(uⱼ/β) = softmax(u, 1/β)ᵢ`
This is exact — no approximation involved. The proof reduces to the
Laplace integral ∫₀^∞ exp(-S·t) dt = 1/S after a change of variables.
Thurstone–Luce Bridge #
Combined with the Thurstone result (Thurstone.lean), this gives the precise
relationship between the two models:
- Gaussian noise → normal CDF (Thurstone, exact)
- Gumbel noise → logistic/softmax (Luce, exact by McFadden)
Φ ≈ logistic(numerical approximation, max error ~0.01)
The two models are both RUMs, differing only in noise distribution.
The logistic approximation to the normal CDF (Thurstone.lean §4) is what
connects them, not a foundational assumption.
Main Results #
integral_exp_neg_mul_Ioi_zero: Laplace integral∫₀^∞ exp(-S·t) dt = 1/SmcfaddenIntegral_eq_softmax: McFadden's theorem (algebraic core)mcfaddenIntegral_binary: Binary case reduces to logistic functiongumbelCDF,gumbelPDF: Gumbel distribution definitions
The Gumbel CDF with location 0 and scale β:
F(x; β) = exp(-exp(-x/β))
The Type I extreme value distribution. Used as the noise distribution in the Luce/McFadden random utility model.
Instances For
The Laplace integral: ∫₀^∞ exp(-S·t) dt = 1/S for S > 0.
This is the core computation in McFadden's theorem. After the change of
variables t = exp(-x/β) in the Gumbel max-probability integral, the
problem reduces to this exponential integral.
Follows directly from Mathlib's integral_exp_mul_Ioi with a = -S
and c = 0.
The McFadden integral for alternative i with scale β > 0:
Iᵢ = exp(uᵢ/β) · ∫₀^∞ exp(-S·t) dt
where S = Σⱼ exp(uⱼ/β) is the partition function.
After the change of variables t = exp(-x/β) in the original
Gumbel density integral, the max-probability P(Uᵢ = max_j Uⱼ) takes
this form. See gumbelMaxProb_is_mcfaddenIntegral.
Equations
Instances For
McFadden's Theorem (algebraic core) — Lemma 1 of @cite{mcfadden-1974}: The McFadden integral equals softmax.
For any utilities u₁, ..., uₙ and scale β > 0:
exp(uᵢ/β) · ∫₀^∞ exp(-S·t) dt = exp(uᵢ/β) / S = softmax(u, 1/β)ᵢ
where S = Σⱼ exp(uⱼ/β).
This is the algebraic content of McFadden's Lemma 1 (p. 111), generalized
from unit scale (β = 1) to arbitrary β > 0. McFadden assumes i.i.d. Weibull
(Gnedenko extreme value) noise with CDF P(ε ≤ ε) = exp(-e^{-ε}) —
eq. (13) — and derives the softmax selection probability P_i = e^{V_i} / Σ e^{V_j}
— eq. (12). The probabilistic interpretation is formalized separately in
gumbelMaxProb_is_mcfaddenIntegral.
Binary McFadden = Logistic: For two alternatives, the McFadden integral reduces to the logistic function.
mcfaddenIntegral([u₁, u₂], β)(0) = logistic((u₁ - u₂) / β)
This is the exact Gumbel-Luce result for binary choice: if ε₁, ε₂ are
i.i.d. Gumbel(0, β), then P(u₁+ε₁ > u₂+ε₂) = logistic((u₁-u₂)/β).
Compare with Thurstone Case V (Thurstone.lean):
P(u₁+η₁ > u₂+η₂) = Φ((u₁-u₂)/(σ√2)) for Gaussian noise η.
Construct a RationalAction from a Gumbel RUM.
The score function is exp(uᵢ/β), which gives Luce's choice rule.
This is the exact optimal policy under i.i.d. Gumbel noise
(by McFadden's theorem), not an approximation.
Equations
Instances For
Gumbel RUM = McFadden Integral — the measure-theoretic content of Lemma 1 in @cite{mcfadden-1974} (p. 111).
If ε₁, ..., εₙ are i.i.d. Gumbel(0, β), then the probability that
alternative i has the maximum random utility equals the McFadden integral:
P(uᵢ + εᵢ = max_j(uⱼ + εⱼ)) = mcfaddenIntegral u β i
McFadden's proof (p. 111) with Vᵢ = v(s, xᵢ):
- From eq. (13),
F_i(ε + V_i - V_1, ..., ε + V_i - V_J)= exp(-ε) · ∏ⱼ exp(-exp(-ε - V_i + V_j))= exp(-ε) · exp(-exp(-ε) · Σⱼ exp(V_j - V_i)) - Substituting in eq. (3) and integrating yields eq. (12):
P_i = e^{V_i} / Σⱼ e^{V_j}
The proof uses the antiderivative G(x) = (C/S) · exp(-S · exp(-x/β)) where
C = exp(uᵢ/β) and S = Σⱼ exp(uⱼ/β). By FTC on ℝ
(integral_of_hasDerivAt_of_tendsto), ∫ G' = C/S = mcfaddenIntegral.
McFadden's Lemma 2 (Uniqueness) @cite{mcfadden-1974} #
McFadden's Lemma 2 shows that the Gumbel distribution is the only
i.i.d. noise distribution yielding the softmax (Luce) choice rule. The key
step is: if the max-CDF G satisfies the functional equation
`G(x - c) = G(x) ^ exp(c)` for all `x, c ∈ ℝ`
then G must be a Gumbel CDF: G(t) = exp(-α · exp(-t)) where
α = -log(G(0)) > 0.
The proof sets x = 0 and c = -t in the functional equation, giving
G(t) = G(0)^{exp(-t)}, then rewrites G(0)^y = exp(log(G(0)) · y)
using rpow_def_of_pos.
If a function G satisfies the functional equation
G(x - c) = G(x) ^ exp(c) for all x, c ∈ ℝ,
and 0 < G(0) < 1, then G is a Gumbel CDF:
G(t) = exp(-α · exp(-t)) where α = -log(G(0)) > 0.
This is the core of McFadden's Lemma 2: the functional equation characterizing the max-CDF uniquely determines the Gumbel family.