Zaslavsky, Kemp, Tishby & Regier (2019) #
@cite{zaslavsky-etal-2019}
Color Naming Reflects Both Perceptual Structure and Communicative Need. Topics in Cognitive Science 11(1), 207–219.
Core Contributions #
@cite{zaslavsky-etal-2019} adjudicate between two explanations of cross-linguistic color naming patterns: perceptual structure (the geometry of CIELAB space) and communicative need (how often colors must be communicated). Their key finding is that both matter.
Perceptual structure partly explains the warm–cool asymmetry. K-means clustering on CIELAB coordinates produces artificial naming systems that already show lower expected surprisal S(c) for warm colors — without any communicative pressure.
Communicative need contributes beyond perceptual structure. The salience-weighted prior (from natural image statistics) exhibits a linear −log p(c) vs S(c) relationship predicted by the CAP theorem, while the perceptually-derived KM-CAP prior does not.
The CAP theorem links need and precision. At a capacity-achieving prior, −log p(c) = S(c) + log Z. This information-theoretic identity is the paper's central theoretical contribution, formalized in Core.ChannelCapacity.cap_linear.
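As a numerical sanity check of the identity (a sketch, not the Lean formalization), one can compute a capacity-achieving prior for a small channel via Blahut-Arimoto and verify that −log p(c) − S(c) is constant across colors; the channel values below are illustrative, not WCS data. In this computation the additive constant comes out as the channel capacity C (in nats).

```python
import numpy as np

# Illustrative naming channel p(w|c): rows are colors c, columns are words w.
Q = np.array([[0.80, 0.10, 0.10],
              [0.10, 0.70, 0.20],
              [0.10, 0.20, 0.70]])

def blahut_arimoto(Q, iters=2000):
    """Iterate toward the capacity-achieving prior p(c) for channel Q."""
    p = np.full(Q.shape[0], 1.0 / Q.shape[0])
    for _ in range(iters):
        qbar = p @ Q                                # marginal over words
        D = (Q * np.log(Q / qbar)).sum(axis=1)      # KL(p(.|c) || qbar)
        p = p * np.exp(D)
        p /= p.sum()
    return p

p = blahut_arimoto(Q)
qbar = p @ Q
C = p @ (Q * np.log(Q / qbar)).sum(axis=1)          # channel capacity (nats)

# Expected surprisal S(c) under the listener posterior p(c|w).
posterior = (p[:, None] * Q) / qbar
S = -(Q * np.log(posterior)).sum(axis=1)

# CAP identity: -log p(c) = S(c) + C, uniformly in c.
residual = np.max(np.abs(-np.log(p) - (S + C)))
```

For a non-capacity-achieving prior the difference −log p(c) − S(c) varies with c, which is exactly what the paper's linearity test probes.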
Integration #
- Theory layer: Core.ChannelCapacity (NamingChannel, CAP, cap_linear). The RSA connection: a NamingChannel is an RSA literal speaker S₀, and the posterior is the literal listener L₀.
The 80 WCS color chips analyzed by @cite{zaslavsky-etal-2019}. These are the standard Munsell chips from the World Color Survey, excluding achromatic chips. Each chip has coordinates in CIELAB perceptual color space.
Temperature classification: warm vs cool. The warm–cool asymmetry in communicative precision is the paper's central empirical finding. Warm colors (reds, yellows) have lower S(c) than cool colors (blues, greens) across languages.
- warm : Temperature
- cool : Temperature
The paper's main empirical finding: across languages, warm colors have lower expected surprisal (= higher communicative precision) than cool colors, regardless of prior choice.
We state this as a property of a naming channel and temperature classification rather than as a concrete computation (which would require the full WCS dataset).
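The stated property can be illustrated on a toy channel (hypothetical numbers, not the WCS dataset): "warm" chips get sharper word distributions than "cool" chips, which yields lower expected surprisal S(c) for the warm class.

```python
import numpy as np

# Toy 4-chip, 4-word channel: chips 0-1 play the role of "warm",
# chips 2-3 of "cool"; cool chips are confusable between two words.
Q = np.array([
    [0.85, 0.05, 0.05, 0.05],   # warm
    [0.05, 0.85, 0.05, 0.05],   # warm
    [0.05, 0.05, 0.60, 0.30],   # cool
    [0.05, 0.05, 0.30, 0.60],   # cool
])
warm, cool = [0, 1], [2, 3]

p = np.full(4, 0.25)                        # uniform prior over chips
qbar = p @ Q                                # marginal over words
posterior = (p[:, None] * Q) / qbar         # listener p(c|w)
S = -(Q * np.log(posterior)).sum(axis=1)    # expected surprisal per chip

asymmetry = S[warm].mean() < S[cool].mean() # the asymmetry, in toy form
```

The real finding is of course about the 80 WCS chips across languages; this only shows what the abstract property asserts of a channel plus a temperature classification.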
CIELAB coordinates for a WCS chip. L* = lightness, a* = red-green, b* = yellow-blue. Euclidean distance in CIELAB approximates perceptual dissimilarity.
The irregular distribution of the 80 WCS chips in CIELAB reveals perceptual asymmetries between warm and cool colors that partly explain the communicative precision asymmetry.
Perceptual distance between two colors in CIELAB (Euclidean).
A perceptually-derived naming system: k-means clustering on CIELAB assigns each chip to the nearest centroid, creating a hard partition. The paper shows these systems also exhibit warm–cool asymmetry in S(c), demonstrating that perceptual structure alone partially accounts for the effect.
- k : ℕ — Number of clusters (= number of color terms in the language).
- assignment — Cluster assignment for each chip.
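A minimal sketch of such a system, assuming hypothetical CIELAB coordinates and plain Lloyd's-algorithm k-means (the paper's exact clustering setup may differ):

```python
import numpy as np

# Hypothetical (L*, a*, b*) coordinates for a handful of chips;
# the study itself uses the 80 WCS chips.
chips = np.array([
    [60.0,  55.0,  40.0],   # red-ish (warm)
    [65.0,  45.0,  50.0],
    [85.0,  -5.0,  80.0],   # yellow-ish (warm)
    [80.0,   5.0,  75.0],
    [50.0, -40.0,  30.0],   # green-ish (cool)
    [55.0, -45.0,  25.0],
    [40.0,  10.0, -45.0],   # blue-ish (cool)
    [45.0,  15.0, -40.0],
])

def kmeans(points, init, iters=20):
    """Lloyd's algorithm; returns the hard assignment chip -> cluster."""
    centroids = init.astype(float)
    for _ in range(iters):
        # Euclidean distance in CIELAB ~ perceptual dissimilarity
        d = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        assignment = d.argmin(axis=1)
        for j in range(len(centroids)):
            members = points[assignment == j]
            if len(members):
                centroids[j] = members.mean(axis=0)
    return assignment

# Deterministic init: one seed per color family (k = 4 "terms").
assignment = kmeans(chips, init=chips[::2])
```

The hard assignment is exactly the data needed to build the deterministic naming channel described next.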
Convert a hard k-means partition to a NamingChannel. A hard partition assigns p(w|c) = 1 if w = assignment(c), else 0. This is a deterministic channel (zero conditional entropy).
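The construction can be mirrored numerically (a sketch, not the Lean definition): a hard assignment becomes a 0/1 row-stochastic matrix, and its conditional entropy H(W|C) vanishes.

```python
import numpy as np

def to_channel(assignment, k):
    """Hard partition -> deterministic naming channel p(w|c):
    p(w|c) = 1 iff w = assignment[c], else 0."""
    Q = np.zeros((len(assignment), k))
    Q[np.arange(len(assignment)), assignment] = 1.0
    return Q

Q = to_channel([0, 0, 1, 2, 1], k=3)

rows_sum_to_one = np.allclose(Q.sum(axis=1), 1.0)

# H(W|C) = -sum_{c,w} p(w|c) log p(w|c) (uniform weight over c omitted);
# every nonzero entry is 1.0, so the sum is exactly zero.
mask = Q > 0
H_cond = -(Q[mask] * np.log(Q[mask])).sum()
```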
The paper infers a universal need distribution by averaging per-language capacity-achieving priors (eq. 7): p̄(c) = 1/L Σ_l p_l(c), where each p_l is the CAP for language l's naming system p_l(w|c), found via Blahut-Arimoto.
Crucially, averaging CAPs does NOT in general preserve the CAP condition (footnote 4 of @cite{zaslavsky-etal-2019}): each p_l satisfies IsCAP for its own channel, but the averaged p̄ need not be a CAP for any single channel. The paper's key empirical finding is a dissociation:
- WCS-CAP (averaged from actual WCS+ languages): empirically approximates a CAP — −log p̄(c) vs S̄(c) is approximately linear.
- KM-CAP (averaged from k-means systems): does NOT approximate a CAP (r = 0.32) — suggesting real naming systems encode communicative structure beyond perceptual clustering.
- Salience-weighted prior (from natural image statistics, @cite{gibson-etal-2017}): exhibits both the linear CAP relation AND the warm–cool asymmetry — evidence for communicative need beyond perceptual structure.
Average a collection of per-language priors to obtain a universal need distribution (eq. 7 of @cite{zaslavsky-etal-2019}).
Equations
- Phenomena.LexicalTypology.Studies.ZaslavskyEtAl2019.averageCAP priors c = (∑ l : Fin L, priors l c) / ↑L
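Footnote 4's caveat can be checked numerically (toy channels with assumed values): average the Blahut-Arimoto CAPs of two "languages" and measure how far the average is from satisfying the CAP condition for either channel.

```python
import numpy as np

def blahut_arimoto(Q, iters=2000):
    """Capacity-achieving prior for channel Q = p(w|c) (rows: colors)."""
    p = np.full(Q.shape[0], 1.0 / Q.shape[0])
    for _ in range(iters):
        qbar = p @ Q
        D = (Q * np.log(Q / qbar)).sum(axis=1)
        p = p * np.exp(D)
        p /= p.sum()
    return p

def cap_gap(Q, p):
    """Spread of D(p(.|c) || p(.)) over colors; zero iff p is a CAP
    for Q (given full support)."""
    qbar = p @ Q
    D = (Q * np.log(Q / qbar)).sum(axis=1)
    return float(D.max() - D.min())

# Two toy "languages" naming the same three colors (illustrative values).
Q1 = np.array([[0.80, 0.10, 0.10],
               [0.10, 0.70, 0.20],
               [0.10, 0.20, 0.70]])
Q2 = np.array([[0.60, 0.30, 0.10],
               [0.20, 0.60, 0.20],
               [0.05, 0.15, 0.80]])

p1, p2 = blahut_arimoto(Q1), blahut_arimoto(Q2)
pbar = (p1 + p2) / 2           # eq. 7: average of per-language CAPs

gap_own = cap_gap(Q1, p1)      # ~0: p1 is (numerically) a CAP for Q1
gap_avg = cap_gap(Q1, pbar)    # clearly positive: the average is not
```

This is only the negative half of the story; the paper's point is that the empirical WCS-CAP average nevertheless behaves approximately like a CAP, while the k-means average does not.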
Any TRUE capacity-achieving prior exhibits the linear relation −log p(c) = S(c) + log Z (eq. 6 of @cite{zaslavsky-etal-2019}). This applies to each per-language CAP p_l found via Blahut-Arimoto.

However, the paper tests averaged priors (see averageCAP), not individual ones. The empirical finding that WCS-CAP approximately satisfies this relation despite averaging is evidence that the CAP condition is robust across languages. KM-CAP's failure to satisfy it (r = 0.32) shows that perceptual structure alone does not yield the same robustness.
A naming channel p(w|c) is exactly an RSA literal speaker S₀ evaluated at each world c. The posterior p(c|w) is the RSA literal listener L₀. Channel capacity channelCapacity nc = max_{p(c)} I(W;C) is the maximum informativity achievable under any world prior.
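The capacity claim can be spot-checked numerically (toy channel, not the Lean channelCapacity definition): the mutual information achieved by the Blahut-Arimoto prior upper-bounds I(W;C) for any other sampled prior.

```python
import numpy as np

def mutual_info(Q, p):
    """I(W;C) in nats for prior p over colors and channel Q = p(w|c)."""
    qbar = p @ Q
    return float((p[:, None] * Q * np.log(Q / qbar)).sum())

def blahut_arimoto(Q, iters=2000):
    """Iterate toward the capacity-achieving prior."""
    p = np.full(Q.shape[0], 1.0 / Q.shape[0])
    for _ in range(iters):
        qbar = p @ Q
        D = (Q * np.log(Q / qbar)).sum(axis=1)
        p = p * np.exp(D)
        p /= p.sum()
    return p

# Illustrative strictly positive naming channel (rows: colors, cols: words).
Q = np.array([[0.80, 0.10, 0.10],
              [0.10, 0.70, 0.20],
              [0.10, 0.20, 0.70]])

p_cap = blahut_arimoto(Q)
C = mutual_info(Q, p_cap)   # ~ channel capacity

# No randomly sampled prior should exceed the capacity-achieving one.
rng = np.random.default_rng(0)
beats = [mutual_info(Q, q) > C + 1e-6 for q in rng.dirichlet(np.ones(3), 200)]
```

Dirichlet sampling is a crude search, but maximality holds for every prior, so any sample exceeding C would indicate a bug rather than a counterexample.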
The paper shows that natural color naming systems operate near capacity: the salience-weighted prior exhibits the linear CAP relation with high correlation. This means color naming systems are approximately information-theoretically optimal — a prediction that RSA makes for any rational communication system.
The key difference from standard RSA: this paper analyzes the prior p(c), not the speaker/listener strategies. RSA typically takes the prior as given and derives speaker/listener behavior. The CAP framework goes one level up: it asks what prior would make the entire system optimally informative, and shows that natural priors approximate this optimum.
This "prior optimization" perspective connects to @cite{zaslavsky-hu-levy-2020}'s rate-distortion view of RSA, where the rationality parameter α trades off compression rate against distortion.