
Linglib.Phenomena.Generics.Studies.TesslerGoodman2019

@cite{tessler-goodman-2019}: The Language of Generalization #

Formalizes @cite{tessler-goodman-2019}, building on the uncertain-threshold semantics of @cite{lassiter-goodman-2017}.

Psychological Review, 126(3), 395–436.

Core Insight #

Generics ("Robins lay eggs") use the SAME uncertain threshold semantics as gradable adjectives. The scale is prevalence rather than height/degree:

⟦gen⟧(p, θ) = 1 if prevalence p > threshold θ

This IS positiveMeaning from Semantics.Degree — the generic meaning is grounded in scalar adjective semantics by construction, not by bridge theorem.

Model #

Interpretation model (L0, Eq. 1): L(p, θ | u) ∝ δ_{⟦u⟧(p,θ)} · P(θ) · P(p)

Endorsement model (S1, Eq. 3): S(u | p) ∝ (∫_θ L(p, θ | u) dθ)^λ

The threshold θ is marginalized BEFORE exponentiation (matching the paper). With N discrete thresholds, the marginalized L0 is: L0(p | generic) ∝ P(p) · |{θ : p > θ}| = P(p) · p.toNat

This analytical marginalization eliminates the latent variable entirely, so the RSAConfig has Latent = Unit. S1 then exponentiates the marginalized L0, exactly matching the paper's endorsement model.
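The marginalized model is small enough to sketch directly. Below is a Python sketch of this structure (exact rational arithmetic; it mirrors the equations above, not the Lean code itself), using an assumed uniform prior for illustration:

```python
from fractions import Fraction as F

BINS = 21      # prevalence bins 0%, 5%, ..., 100%
N_THETA = 20   # thresholds at bins 0..19, so |{θ : p > θ}| = p's bin index

def tc(utt, k):
    """Threshold count: how many θ make the utterance true at bin k."""
    return k if utt == "generic" else N_THETA   # "silent" is true under every θ

def l0(prior, k, utt):
    """Marginalized L0 (Eq. 1 with θ summed out), normalized over bins."""
    z = sum(prior[w] * tc(utt, w) for w in range(BINS))
    return prior[k] * tc(utt, k) / z

def s1(prior, k, lam=1):
    """Endorsement model (Eq. 3): exponentiate the marginalized L0 by λ."""
    scores = {u: l0(prior, k, u) ** lam for u in ("generic", "silent")}
    z = sum(scores.values())
    return {u: s / z for u, s in scores.items()}

uniform = [F(1, BINS)] * BINS
# Under a uniform prior E[k] = 10 (the 50% bin), so the generic wins
# exactly above 50% prevalence:
assert s1(uniform, 15)["generic"] > F(1, 2)   # 75% prevalence: endorsed
assert s1(uniform, 10)["generic"] == F(1, 2)  # 50%: exactly borderline
assert s1(uniform, 5)["generic"] < F(1, 2)    # 25%: not endorsed
```

With a uniform prior the cutoff sits at the mean bin, a special case of the analytical endorsement condition derived later in this file.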

Parameters #

All parameters from the paper's code (analysis/model-simulations.Rmd, exampleParameters list, GitHub: mhtess/genlang-paper):

| Property | Stable Beta | φ (mix) | Ref. prev. | Paper endorse. |
|----------------|-------------|---------|------------|------|
| bark | Beta(5,1) | 0.4 | 95% | 0.88 |
| hasSpots | Beta(5,1) | 0.7 | 10% | 0.02 |
| dontEatPeople | Beta(10,1)* | 1.0 | 80% | 0.41 |
| laysEggs | Beta(10,10) | 0.2 | 50% | 0.95 |
| isFemale | Beta(10,10) | 1.0 | 50% | 0.50 |
| carriesMalaria | Beta(1,30) | 0.1 | 10% | 0.97 |

*Paper uses Beta(50,1); we use Beta(10,1) for tractable arithmetic (avoids k^49 terms). Both give the same qualitative prediction.

Prior Model #

Prevalence priors are mixtures of two Beta distributions (Figure 2): P(p) = φ · Beta_stable(p) / Z_s + (1-φ) · Beta_null(p) / Z_n

where φ is the probability a category has the stable causal mechanism, Beta_stable varies per property, and Beta_null = Beta(1,50) for all properties (representing categories lacking the property mechanism).

Each component is NORMALIZED before mixing (matching the WebPPL code, which uses categorical to normalize each component independently). We achieve this without ℚ division by computing: P(p) ∝ φ · BW_s(p) · Z_n + (1-φ) · BW_n(p) · Z_s
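The division-free mixing trick can be checked numerically. A Python sketch with exact rationals (`beta_w` and `mixture_prior` are illustrative helpers, not the Lean definitions; the bark parameters Beta(5,1), φ = 0.4 are from the table above):

```python
from fractions import Fraction as F

BINS = 21

def beta_w(a, b, k):
    """Unnormalized Beta(a,b) weight at bin k (x = k/20)."""
    x = F(k, 20)
    return x ** (a - 1) * (1 - x) ** (b - 1)

def mixture_prior(phi, a_s, b_s, a_n=1, b_n=50):
    """Division-free mixture: cross-multiply each component by the other's
    total weight, P(k) ∝ φ·BW_s(k)·Z_n + (1-φ)·BW_n(k)·Z_s."""
    z_s = sum(beta_w(a_s, b_s, k) for k in range(BINS))
    z_n = sum(beta_w(a_n, b_n, k) for k in range(BINS))
    return [phi * beta_w(a_s, b_s, k) * z_n + (1 - phi) * beta_w(a_n, b_n, k) * z_s
            for k in range(BINS)]

# "bark": stable Beta(5,1), φ = 0.4
bark = mixture_prior(F(2, 5), 5, 1)

# The unnormalized prior sums to Z_s · Z_n, so dividing by that total recovers
# exactly the component-wise normalized mixture:
z_s = sum(beta_w(5, 1, k) for k in range(BINS))
z_n = sum(beta_w(1, 50, k) for k in range(BINS))
for k in range(BINS):
    assert bark[k] / (z_s * z_n) == F(2, 5) * beta_w(5, 1, k) / z_s \
        + F(3, 5) * beta_w(1, 50, k) / z_n
```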

Verified Predictions #

| # | Finding | Prior | p_ref | Theorem |
|---|---------|-------|-------|---------|
| 1 | "Dogs bark" endorsed | bark | 95% | bark_endorsed |
| 2 | "Kangaroos have spots" NOT endorsed | hasSpots | 10% | spots_not_endorsed |
| 3 | "Sharks don't eat people" NOT endorsed | dontEatPeople | 80% | dontEatPeople_not_endorsed |
| 4 | "Robins lay eggs" endorsed despite 50% | laysEggs | 50% | laysEggs_endorsed |
| 5 | "Robins are female" borderline at 50% | isFemale | 50% | isFemale_borderline |
| 6 | "Mosquitos carry malaria" endorsed at 10% | carriesMalaria | 10% | malaria_endorsed |
| 7 | Max prevalence satisfies all thresholds | — | — | generic_top_true |
| 8 | Zero prevalence fails all thresholds | — | — | generic_zero_false |
| 9 | Only rareWeak endorsed at 20% | all four causal | 20% | causal_20pct_pattern |
| 10 | 3/4 causal conditions endorsed at 70% | all four causal | 70% | causal_70pct_pattern |
| 11 | Endorsement ⟺ referent bin exceeds E[k ∣ prior] | — | — | endorsement_iff_exceeds_expected |
@[reducible, inline]

Discretized prevalence: 0%, 5%, ..., 100% (21 values). Structurally identical to @cite{lassiter-goodman-2017}'s Height.

    @[reducible, inline]

    Threshold values θ₀–θ₁₉ (20 values).


      Prevalence at p% (bins at 5% increments, so p must be a multiple of 5). Uses a macro so the division is computed at elaboration time.


        Threshold at t% (bins at 5% increments, so t must be a multiple of 5). Uses a macro so the division is computed at elaboration time.


          Generic vs null utterance. The endorsement model decides between producing the generalization and staying silent.


              ⟦gen⟧(p, θ) = p > θ.

              This IS positiveMeaning from Semantics.Degree — the generic meaning function is literally the positive scalar adjective meaning applied to the prevalence scale. Grounded by construction.


                Mixture-of-Betas infrastructure #

                The paper models prevalence priors as mixtures of two Beta distributions: a stable component (property-specific) and a null component (Beta(1,50), representing categories without the causal mechanism).

                Each component is normalized before mixing (matching the WebPPL code where categorical normalizes each component independently). We achieve this without ℚ division by computing:

                P(k) ∝ φ · BW_stable(k) · Z_null + (1-φ) · BW_null(k) · Z_stable
                

                This is proportional to the correctly normalized mixture since: P(k) = Z_n · Z_s · [φ · BW_s(k)/Z_s + (1-φ) · BW_n(k)/Z_n]

                Unnormalized Beta(a,b) weight at bin k ∈ {0,...,20}. Proportional to Beta(a,b) PDF at x = k/20.


                  Normalized mixture-of-Betas prevalence prior, discretized to 21 bins.

                  • Stable component: Beta(as, bs) with mixture weight φ
                  • Null component: Beta(na, nb) with mixture weight (1-φ)

                  Each component is normalized before mixing by cross-multiplying with the other component's total weight:

                  P(k) ∝ φ · BW_stable(k) · Z_null + (1-φ) · BW_null(k) · Z_stable
                  

                  This avoids ℚ division while preserving the correct mixture ratio.


                    "Bark" prior: bimodal at 0 and ~90% (Figure 2, column 1). Stable Beta(5,1), φ = 0.4.


                      "Have spots" prior: bimodal at 0 and ~90% (Figure 2, column 2). Stable Beta(5,1), φ = 0.7. Higher φ than bark — more animal categories can have spots than bark.


                        "Don't eat people" prior: near-unimodal at ~90% (Figure 2, column 3). Stable Beta(10,1), φ = 1.0. Paper uses Beta(50,1); we use Beta(10,1) for tractable arithmetic (avoids k^49 terms). Both predict NOT endorsed at 80%.


                          "Lays eggs" prior: bimodal at 0 and ~50% (Figure 2, column 4). Stable Beta(10,10), φ = 0.2. Most animal categories don't have egg-layers (peak at 0); among those that do, only females lay eggs (~50% prevalence).


                            "Is female" prior: unimodal at ~50% (Figure 2, column 5). Stable Beta(10,10), φ = 1.0. Almost all animal categories have ~50% female members.


                              "Carries malaria" prior: extreme low prevalence (Figure 2, column 6). Stable Beta(1,30), φ = 0.1. Very few animal categories carry diseases (90% null component). Among those that do, prevalence is very low (Beta(1,30) peaked near 0).


                                Cast a ℚ-valued prior to ℝ.

                                  @[reducible]

                                  Parametric RSAConfig for threshold-based generic endorsement.

                                  The threshold θ is marginalized analytically into the meaning function: meaning(u, p) = P(p) · |{θ : ⟦u⟧(p,θ) = true}|

                                  This matches the paper's endorsement model structure (Eq. 3): S(u | p) ∝ (∫_θ L(p, θ | u) dθ)^λ

                                  The marginalization happens BEFORE exponentiation (matching the paper), not after (as would happen with θ as a latent variable in RSAConfig). With Latent = Unit, S1 scores the marginalized L0 directly.

                                  The paper uses α = 2 (experimental fit: 2.47), but the binary comparison S1(generic) > S1(silent) is α-invariant for any α > 0, since rpow preserves order. We use α = 1 for tractable interval arithmetic.
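A quick numerical check of the α-invariance claim (the two scores are illustrative, not values from the Lean development):

```python
# With positive scores, rpow preserves order: x^α > y^α ⟺ x > y for any α > 0.
# So the binary endorsement comparison is the same at α = 1, 2, or 2.47.
x, y = 0.07, 0.05   # assumed L0 scores for generic and silent
for alpha in (0.5, 1.0, 2.0, 2.47):
    assert (x ** alpha > y ** alpha) == (x > y)
```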

                                    @[reducible]

                                    "Bark" config: peaked high prior (Figure 2, column 1).

                                      @[reducible]

                                      "Have spots" config: peaked high prior (Figure 2, column 2).

                                        @[reducible]

                                        "Don't eat people" config: peaked very high prior (Figure 2, column 3).

                                          @[reducible]

                                          "Lays eggs" config: bimodal prior (Figure 2, column 4).

                                            @[reducible]

                                            "Is female" config: unimodal prior at 50% (Figure 2, column 5).

                                              @[reducible]

                                              "Carries malaria" config: extreme low prior (Figure 2, column 6).


                                                Prevalence 100% satisfies the generic for all thresholds.

                                                Generic meaning at prevalence 0% is false for all thresholds.

                                                The bimodal "lays eggs" prior peaks at zero prevalence.

                                                Endorsement model (Eq. 3) #

                                                The paper's key predictions are endorsement rates: given referent prevalence p for a kind k, does the speaker produce the generic?

                                                S(u | p) ∝ (∫_θ L(p,θ|u) dθ)^λ
                                                

                                                Endorsement > 50% ⟺ S1(generic | p) > S1(silent | p).

                                                The binary comparison is equivalent to tc(p) > E[tc | prior], i.e., the referent prevalence (in threshold-count units) exceeds the prior expected prevalence. This is the paper's central insight: the SAME prevalence can produce different endorsement rates depending on the prior (Figure 2).

                                                "Dogs bark" endorsed at 95% prevalence (Table 1: 95%; Figure 2, column 1: 0.88).

                                                "Robins lay eggs" endorsed at 50% prevalence (Figure 2, column 4: 0.95). Despite only 50% prevalence, the bimodal prior (peaked at 0 and 50%) makes the generic highly informative — it rules out the absent component.

                                                "Mosquitos carry malaria" endorsed at 10% prevalence (Figure 2, column 6: 0.97). The prior expects near-zero prevalence, so even low prevalence is highly informative. This is the model's explanation of "striking property" generics: rare properties have low prior expectations.

                                                "Kangaroos have spots" NOT endorsed at 10% prevalence (Figure 2, column 2: 0.02). Even though the prior has a null component, φ = 0.7 means 70% of the prior mass comes from the stable Beta(5,1) peaked near 100%. At 10% prevalence, the generic is uninformative relative to this high-prevalence expectation.

                                                "Sharks don't eat people" NOT endorsed at 80% prevalence (Figure 2, column 3: 0.41). Even though 80% is high in absolute terms, the prior (φ=1, Beta(10,1)) concentrates nearly all mass above 80%. The generic is uninformative because the listener already expects very high prevalence.

                                                "Robins are female" borderline at 50% prevalence (Figure 2, column 5: 0.50). The unimodal prior peaks at 50% with φ = 1.0, so the prior expected prevalence is exactly 50%. At the referent prevalence of 50%, the generic is exactly as informative as silence — endorsement is 0.5.

                                                Analytical endorsement condition #

                                                The paper's central analytical result (Appendix A) is that the endorsement comparison reduces to a cue validity test:

                                                S1(generic | p) > S1(silent | p) ⟺ p.toNat > E[k | prior]
                                                

                                                i.e., the referent prevalence bin exceeds the prior expected bin.

                                                Proof sketch: S1(u|p) ∝ rpow(L0(p|u), α). Since rpow is monotone for α > 0, the comparison reduces to L0(p|generic) > L0(p|silent). Expanding:

                                                L0(p|u) = meaning(u,p) / Z_u = prior(p) · tc(u,p) / Z_u
                                                

                                                For the generic, Z_gen = Σ_w prior(w) · w.toNat; for silence, Z_sil = 20 · Z_prior. Dividing by prior(p) > 0 and cross-multiplying:

                                                p.toNat / Z_gen > 20 / Z_sil ⟺ p.toNat > Z_gen / Z_prior = E[k | prior]
                                                

                                                Expected prevalence bin under a prior: E[k | prior] = Σ_k k·P(k) / Σ_k P(k).
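The equivalence can be checked exhaustively for a concrete prior. A Python sketch (exact rationals; `mix`, `l0`, and `endorsed` are illustrative helpers, not the Lean definitions) verifying the condition bin-by-bin for the bark prior:

```python
from fractions import Fraction as F

BINS, N_THETA = 21, 20

def beta_w(a, b, k):
    """Unnormalized Beta(a,b) weight at bin k (x = k/20)."""
    x = F(k, 20)
    return x ** (a - 1) * (1 - x) ** (b - 1)

def mix(phi, a_s, b_s):
    """Mixture prior with Beta(1,50) null, normalized by cross-multiplication."""
    z_s = sum(beta_w(a_s, b_s, k) for k in range(BINS))
    z_n = sum(beta_w(1, 50, k) for k in range(BINS))
    return [phi * beta_w(a_s, b_s, k) * z_n + (1 - phi) * beta_w(1, 50, k) * z_s
            for k in range(BINS)]

def l0(prior, k, tc):
    """Normalized marginalized L0; tc(w) = utterance's threshold count at w."""
    z = sum(prior[w] * tc(w) for w in range(BINS))
    return prior[k] * tc(k) / z

def endorsed(prior, k):
    """Direct comparison: L0(k | generic) > L0(k | silent)."""
    return l0(prior, k, lambda w: w) > l0(prior, k, lambda w: N_THETA)

def expected_bin(prior):
    return sum(k * p for k, p in enumerate(prior)) / sum(prior)

bark = mix(F(2, 5), 5, 1)
for k in range(BINS):   # the direct and analytical conditions agree on every bin
    assert endorsed(bark, k) == (k > expected_bin(bark))
```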

theorem Phenomena.Generics.Studies.TesslerGoodman2019.endorsement_iff_exceeds_expected (prior : Prevalence → ℚ) (hp : ∀ (p : Prevalence), 0 ≤ prior p) (p : Prevalence) (hp_pos : 0 < prior p) (hZ : 0 < ∑ w : Prevalence, prior w) :

                                                  The endorsement condition reduces to a cue validity comparison: a generic is endorsed iff the referent prevalence bin exceeds the prior expected bin. This is the paper's central analytical result (Appendix A).

                                                  Proof: S1 policy comparison reduces to S1 score comparison (same denominator at world p), which equals L0 policy (rpow with α=1). The L0 policy comparison cross-multiplies to p.toNat × Σ prior > Σ prior × toNat, i.e., p.toNat > E[k|prior].

                                                  The classic prevalence asymmetry is EXPLAINED by the endorsement model: same prevalence (50%), different prior shapes → different S1 endorsement rates.

                                                  "Robins lay eggs" (true, ~50% prevalence) vs "Robins are female" (odd, ~50% prevalence). @cite{leslie-2008} documents the empirical observation; @cite{tessler-goodman-2019} derives the asymmetry from prior shape differences.

                                                  laysEggs_endorsed and isFemale_borderline (above) derive the predictions.
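The asymmetry can be reproduced numerically from the table's parameters. A Python sketch with exact rationals (helper names are illustrative, not the Lean definitions): the Beta(10,10) stable component is symmetric about bin 10, so with φ = 1.0 the prior expectation lands exactly on the referent bin, while φ = 0.2 pulls it far below:

```python
from fractions import Fraction as F

BINS = 21

def beta_w(a, b, k):
    """Unnormalized Beta(a,b) weight at bin k (x = k/20)."""
    x = F(k, 20)
    return x ** (a - 1) * (1 - x) ** (b - 1)

def mix(phi, a_s, b_s):
    """Stable Beta(a_s,b_s) with weight φ, null Beta(1,50) with weight 1-φ."""
    z_s = sum(beta_w(a_s, b_s, k) for k in range(BINS))
    z_n = sum(beta_w(1, 50, k) for k in range(BINS))
    return [phi * beta_w(a_s, b_s, k) * z_n + (1 - phi) * beta_w(1, 50, k) * z_s
            for k in range(BINS)]

def expected_bin(prior):
    return sum(k * p for k, p in enumerate(prior)) / sum(prior)

lays_eggs = mix(F(1, 5), 10, 10)   # φ = 0.2: bimodal at 0 and 50%
is_female = mix(F(1, 1), 10, 10)   # φ = 1.0: unimodal at 50%

# Same referent prevalence (bin 10 = 50%), opposite verdicts:
assert expected_bin(lays_eggs) < 10   # "lays eggs": bin exceeds expectation → endorsed
assert expected_bin(is_female) == 10  # "is female": exactly borderline
```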

                                                  As α → ∞, the endorsement model sharpens to a categorical decision: endorsed generics get probability 1, non-endorsed get probability 0.

                                                  By `rpow_luce_eq_softmax` (Core), every rpow-based Luce choice rule IS
                                                  softmax over log scores. The endorsement model inherits all softmax
                                                  limit theorems for free. 
                                                  

                                                  L0 score for utterance u at prevalence p (unnormalized).

theorem Phenomena.Generics.Studies.TesslerGoodman2019.endorsement_eq_softmax (prior : Prevalence → ℝ) (p : Prevalence) (α : ℝ) (hl0 : ∀ (u : Utterance), 0 < l0Score prior u p) :
l0Score prior Utterance.generic p ^ α / ∑ u : Utterance, l0Score prior u p ^ α = Core.softmax (fun (u : Utterance) => Real.log (l0Score prior u p)) α Utterance.generic

                                                    The endorsement rate equals softmax over log-L0 scores. Immediate from rpow_luce_eq_softmax: the endorsement model IS softmax.

                                                    When l0_gen > l0_sil (endorsed generic), the endorsement rate → 1 as α → ∞. Direct corollary of Softmax.tendsto_softmax_infty_at_max.

                                                    When l0_gen < l0_sil (non-endorsed generic), the endorsement rate → 0.
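The Luce/softmax identity and the α → ∞ sharpening can be illustrated numerically (floating point; the L0 scores are assumed, and `luce`/`softmax_first` are sketches, not the Core definitions):

```python
import math

def luce(scores, alpha):
    """rpow-based Luce rule: probability of the first option, s₀^α / Σ sᵢ^α."""
    powered = [s ** alpha for s in scores]
    return powered[0] / sum(powered)

def softmax_first(scores, alpha):
    """Softmax over log-scores at rationality α (max-subtraction for stability)."""
    logs = [alpha * math.log(s) for s in scores]
    m = max(logs)
    exps = [math.exp(v - m) for v in logs]
    return exps[0] / sum(exps)

l0_gen, l0_sil = 0.07, 0.05   # illustrative L0 scores (assumed values)

# the rpow Luce rule IS softmax over log scores, at every rationality:
for alpha in (1.0, 2.0, 10.0):
    assert math.isclose(luce([l0_gen, l0_sil], alpha),
                        softmax_first([l0_gen, l0_sil], alpha))

# sharpening: since l0_gen > l0_sil here, endorsement tends to 1 as α grows
assert softmax_first([l0_gen, l0_sil], 200.0) > 0.999
```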

                                                    Case Study 2: Habitual Language #

                                                    @cite{tessler-goodman-2019} (Case Study 2) extend the generic endorsement model to habituals. The key insight: habituals ("John runs") use the same threshold semantics as generics ("Birds fly"), with Prevalence now interpreted as frequency of activity across occasions rather than proportion of a kind with a property.

                                                    Paper's actual prior model (Eq. 4): The paper uses a log-normal + delta mixture:

                                                    φ ~ Beta(γ, ξ)
                                                    ln(frequency) ~ Gaussian(μ, σ)  with probability φ
                                                    frequency = 0.01               with probability (1 - φ)
                                                    

                                                    The Beta parameters (γ, ξ) and Gaussian parameters (μ, σ) are fit to empirical frequency estimates from participants. We approximate the fitted priors with Beta mixtures that capture the qualitative predictions:

                                                    The paper reports a model fit of r²(93) = 0.894 on habitual endorsement data.

                                                    See also: Semantics.Lexical.Verb.Habituals.hab_reduces_to_threshold for the formal bridge from the traditional HAB operator to threshold semantics, completing the pipeline: HAB → threshold → uncertain threshold → RSA endorsement.

                                                    Frequency prior for "runs": moderate expectation. Approximates the paper's fitted log-normal prior with a Beta(5,3) mixture. The paper fits (γ, ξ, μ, σ) to participant frequency estimates; the exact fitted values are in analysis/model-simulations.Rmd.


                                                      Frequency prior for "climbs mountains": rare activity. Approximates the paper's fitted log-normal prior with a Beta(2,6) mixture.


                                                        Frequency prior for "drinks coffee": high-frequency activity. Approximates the paper's fitted log-normal prior with a Beta(7,2) mixture.


                                                                "John runs" endorsed at 75% frequency (moderate freq exceeds moderate prior).

                                                                Habitual prior asymmetry: at the same 25% frequency, "climbs mountains" is endorsed but "drinks coffee" is not — paralleling the generic prevalence asymmetry.
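A Python sketch of this asymmetry under the stated Beta parameters. The mixture weight φ is not given in this file, so φ = 1/2 is an assumption here; with it, the prior-expected frequency bins fall on opposite sides of the 25% referent bin:

```python
from fractions import Fraction as F

BINS = 21

def beta_w(a, b, k):
    """Unnormalized Beta(a,b) weight at frequency bin k (x = k/20)."""
    x = F(k, 20)
    return x ** (a - 1) * (1 - x) ** (b - 1)

def mix(phi, a_s, b_s):
    """Stable component mixed with a Beta(1,50) null, division-free."""
    z_s = sum(beta_w(a_s, b_s, k) for k in range(BINS))
    z_n = sum(beta_w(1, 50, k) for k in range(BINS))
    return [phi * beta_w(a_s, b_s, k) * z_n + (1 - phi) * beta_w(1, 50, k) * z_s
            for k in range(BINS)]

def expected_bin(prior):
    return sum(k * p for k, p in enumerate(prior)) / sum(prior)

PHI = F(1, 2)                 # ASSUMED mixture weight (not stated in this file)
runs   = mix(PHI, 5, 3)
climbs = mix(PHI, 2, 6)
drinks = mix(PHI, 7, 2)

assert expected_bin(runs) < 15    # "John runs" endorsed at 75% (bin 15)
assert expected_bin(climbs) < 5   # "climbs mountains" endorsed at 25% (bin 5)
assert expected_bin(drinks) > 5   # "drinks coffee" not endorsed at 25%
```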

                                                                Case Study 3: Causal Language #

                                                                @cite{tessler-goodman-2019} (Case Study 3, Experiments 3A–3B) extend the model to causal generics ("Herb X makes cheebas sleepy"). Here Prevalence is reinterpreted as the causal rate — the proportion of cases where the cause produces the effect.

                                                                Experimental design: In Experiment 3A, participants see "previous experimental results" (a table of substances tested on 100 subjects) that follow one of four distributions, manipulated between subjects:

                                                                In Experiment 3B, participants see one of two referent causal rates (20% or 70%) and judge whether the causal generalization holds ("Herb C makes cheebas sleepy").

                                                                We model the four conditions as different prevalence priors, varying the mixture weight φ (common → high φ, rare → low φ) and the stable Beta parameters (strong → high-mean Beta, weak → low-mean Beta). These are approximations of the empirically elicited priors from Experiment 3A, not exact replications.

                                                                The paper reports a model fit of r²(8) = 0.835 on causal endorsement data (Figure 11B).

                                                                Prior for common-strong cause: most categories have the mechanism (φ=0.75), and the mechanism is highly effective (Beta(10,1) peaked near 100%).


                                                                  Prior for common-weak cause: most categories have the mechanism (φ=0.75), but the mechanism is weakly effective (Beta(2,8) peaked near 20%).


                                                                    Prior for rare-strong cause: few categories have the mechanism (φ=0.25), but when present it is highly effective (Beta(10,1)).


                                                                      Prior for rare-weak cause: few categories have the mechanism (φ=0.25), and the mechanism is weakly effective (Beta(2,8)).


                                                                                Rare-weak cause endorsed at 20% causal rate: low prior expectation makes even 20% informative.

                                                                                Common-strong cause NOT endorsed at 20% causal rate: high prior expectation (peaked near 100%) makes 20% uninformative.

                                                                                Common-strong cause NOT endorsed at 50% causal rate: high prior (Beta(10,1), φ=0.75) puts expected rate near 70%, so 50% is uninformative. Note: the paper tests at 20% and 70%. At 70%, the comparison is borderline (E[k|prior] ≈ 14 ≈ bin(70%)), matching the paper's ~50% endorsement rate at referent prevalence 0.7 for common-strong (Figure 11B).

                                                                                Rare-strong cause NOT endorsed at 20% causal rate (Figure 11B: ~35% endorsement). Despite fewer competing causes than common-strong, the prior still concentrates enough mass above 20% (via Beta(10,1)) to make 20% uninformative.

                                                                                Common-weak cause endorsed at 70% causal rate (Figure 11B: ~75% endorsement). With Beta(2,8) peaked near 20%, a referent rate of 70% far exceeds the prior expectation.

                                                                                Causal prior asymmetry (Experiment 3B): at 20% referent rate, only rare-weak is endorsed; the other three conditions are not. This matches the paper's Figure 11B (left panel).

                                                                                At 70% referent rate, all conditions except common-strong are endorsed (Figure 11B). Common-strong is borderline (~50% endorsement in the paper), matching our model's E[k|prior] ≈ bin(70%).

                                                                                Cue Validity and Endorsement #

                                                                                @cite{tessler-goodman-2019} (pp. 29-30, Appendix A) show that endorsement in the infinite-rationality limit reduces to a cue validity comparison:

                                                                                endorsed ⟺ prevalence(f, k_ref) > E_prior[prevalence]
                                                                                         ⟺ cue_validity(f, k_ref) > 1
                                                                                

                                                                                where cue_validity(f, k) = prevalence(f, k) / E[prevalence].

                                                                                This connects the RSA model to the classical notion from @cite{rosch-mervis-1975}: a feature is diagnostic of a category exactly when the feature is more prevalent in that category than expected across categories — i.e., when cue validity > 1.

                                                                                In mkGenericCfg, the endorsement condition S1(generic | p_ref) > S1(silent | p_ref) reduces to p_ref.toNat > E[k | prior] after L0 normalization cancels the common factor. This is exactly the cue validity condition when the expected bin E[k | prior] serves as the denominator.

def Phenomena.Generics.Studies.TesslerGoodman2019.cueValidity (referentPrevalence expectedPrior : ℚ) : ℚ

                                                                                Cue validity: ratio of referent prevalence to expected prevalence under the prior.

theorem Phenomena.Generics.Studies.TesslerGoodman2019.endorsed_iff_cue_validity_gt_one (referentPrev expectedPrior : ℚ) (hE : 0 < expectedPrior) :
expectedPrior < referentPrev ↔ 1 < cueValidity referentPrev expectedPrior

                                                                                  A generic is endorsed (prevalence exceeds prior expectation) iff cue validity > 1.
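A minimal numeric illustration (the expected-prevalence values 0.02 and 0.90 are assumed, chosen to mirror the malaria and sharks cases above; `cue_validity` is a sketch of the definition, not the Lean code):

```python
def cue_validity(referent_prev, expected_prev):
    """prevalence(f, k) / E[prevalence]: diagnosticity of feature f for kind k."""
    return referent_prev / expected_prev

assert cue_validity(0.10, 0.02) > 1   # 10% prevalence vs ~2% expectation: endorsed
assert cue_validity(0.80, 0.90) < 1   # 80% prevalence vs ~90% expectation: not endorsed
```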

                                                                                  Unified Architecture #

                                                                                  All three domains — generics, habituals, and causal language — are instances of mkGenericCfg with different prevalence priors. The threshold semantics, RSA inference, and endorsement mechanism are shared; only the prior varies.

                                                                                  This unification is structural (by construction), not proven post hoc. The integration pipeline is:

                                                                                  1. Traditional operator (GEN/HAB) reduces to threshold semantics (CovertQuantifier.reduces_to_threshold, Habituals.hab_reduces_to_threshold)
                                                                                  2. Threshold semantics with uncertain threshold → marginalized L0
                                                                                  3. RSA endorsement (mkGenericCfg) decides between generic and silence
                                                                                  4. Endorsement ≈ cue validity (endorsed_iff_cue_validity_gt_one)
theorem Phenomena.Generics.Studies.TesslerGoodman2019.unification :
(∃ (pr : Prevalence → ℚ) (hp : ∀ (p : Prevalence), 0 ≤ pr p), barkCfg = mkGenericCfg pr hp) ∧ (∃ (pr : Prevalence → ℚ) (hp : ∀ (p : Prevalence), 0 ≤ pr p), runsCfg = mkGenericCfg pr hp) ∧ ∃ (pr : Prevalence → ℚ) (hp : ∀ (p : Prevalence), 0 ≤ pr p), rareWeakCfg = mkGenericCfg pr hp

                                                                                  All three case studies use mkGenericCfg — the prior is the only free parameter.