@cite{waldon-degen-2021} — Continuous-Incremental RSA (CI-RSA) #

@cite{cohn-gordon-goodman-potts-2019} @cite{degen-etal-2020}

Waldon, B. & Degen, J. (2021). Modeling cross-linguistic production of referring expressions. Proceedings of the Society for Computation in Linguistics (SCiL) 4, 206–215.

The Model #

CI-RSA synthesizes two RSA extensions:

Incremental RSA (@cite{cohn-gordon-goodman-potts-2019}): Word-by-word production via the chain rule S1(u|r) = ∏ₖ S1(wₖ | [w₁,...,wₖ₋₁], r)
Continuous semantics (@cite{degen-etal-2020}): Noisy adjective reliability L^C(r, i) = v^i if i true of r, else 1 - v^i

The incremental meaning function averages continuous semantics over grammatical completions of the current prefix:

X^C(c, i, r) = Σ_{u ⊒ c+i} ⟦u⟧^C(r) / |{u : u ⊒ c+i}|

The utterance set is scene-filtered: only utterances Boolean-true of at least one scene member are included (Figure 1).

Formalization #

This builds on RSAConfig's sequential infrastructure (following @cite{cohn-gordon-goodman-potts-2019}), adding:

Continuous (ℚ-valued) meaning instead of Boolean extension-counting
rpow-based s1Score with α = 7
Scene-parameterized configs for cross-condition comparisons

The three predictions are trajectory probability comparisons across different RSAConfig instances (language × scene).

Predictions #

#	Prediction	Status
1	English color/size asymmetry: SS > CS	`rsa_predict`
2	Cross-linguistic: English SS > Spanish SS	`rsa_predict`
3	Spanish flip: CS > SS for redundant size	`rsa_predict`
4	Overall: English total > Spanish total	`rsa_predict`

Connections #

Noise theory: lexContinuousQ instantiates the unified noise channel from RSA.Core.Noise. See lexContinuous_as_noiseChannel.
Incremental RSA: Extends @cite{cohn-gordon-goodman-potts-2019} with continuous semantics and cross-linguistic word order variation.

source

inductive Phenomena.Reference.Studies.WaldonDegen2021.Word :

Type

Words available to the incremental speaker: two color adjectives, two size adjectives, a noun ("pin"), and an explicit stop token. The stop token models the speaker's choice to end the utterance; without it, postnominal word orders lack a way to represent the stopping decision after the noun (cf. English where "pin" naturally terminates utterances).

blue : Word
red : Word
big : Word
small : Word
pin : Word
stop : Word

Instances For

source

instance Phenomena.Reference.Studies.WaldonDegen2021.instDecidableEqWord :

DecidableEq Word

Equations

Phenomena.Reference.Studies.WaldonDegen2021.instDecidableEqWord x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯

source

instance Phenomena.Reference.Studies.WaldonDegen2021.instFintypeWord :

Fintype Word

Equations

One or more equations did not get rendered due to their size.

source

instance Phenomena.Reference.Studies.WaldonDegen2021.instBEqWord :

BEq Word

Equations

Phenomena.Reference.Studies.WaldonDegen2021.instBEqWord = { beq := Phenomena.Reference.Studies.WaldonDegen2021.instBEqWord.beq }

source

def Phenomena.Reference.Studies.WaldonDegen2021.instBEqWord.beq :

Word → Word → Bool

Equations

Phenomena.Reference.Studies.WaldonDegen2021.instBEqWord.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)

Instances For

source

instance Phenomena.Reference.Studies.WaldonDegen2021.instReprWord :

Repr Word

Equations

Phenomena.Reference.Studies.WaldonDegen2021.instReprWord = { reprPrec := Phenomena.Reference.Studies.WaldonDegen2021.instReprWord.repr }

source

def Phenomena.Reference.Studies.WaldonDegen2021.instReprWord.repr :

Word → ℕ → Std.Format

Equations

One or more equations did not get rendered due to their size.

Instances For

source

inductive Phenomena.Reference.Studies.WaldonDegen2021.Referent :

Type

Referents in the 2×2 reference game: big/small × blue/red.

bigBlue : Referent
bigRed : Referent
smallBlue : Referent
smallRed : Referent

Instances For

source

instance Phenomena.Reference.Studies.WaldonDegen2021.instDecidableEqReferent :

DecidableEq Referent

Equations

Phenomena.Reference.Studies.WaldonDegen2021.instDecidableEqReferent x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯

source

instance Phenomena.Reference.Studies.WaldonDegen2021.instFintypeReferent :

Fintype Referent

Equations

One or more equations did not get rendered due to their size.

source

def Phenomena.Reference.Studies.WaldonDegen2021.instReprReferent.repr :

Referent → ℕ → Std.Format

Equations

One or more equations did not get rendered due to their size.

Instances For

source

instance Phenomena.Reference.Studies.WaldonDegen2021.instReprReferent :

Repr Referent

Equations

Phenomena.Reference.Studies.WaldonDegen2021.instReprReferent = { reprPrec := Phenomena.Reference.Studies.WaldonDegen2021.instReprReferent.repr }

source

def Phenomena.Reference.Studies.WaldonDegen2021.wordApplies :

Word → Referent → Bool

Whether a word is veridically true of a referent.

Equations

Instances For

source

def Phenomena.Reference.Studies.WaldonDegen2021.semanticValueQ :

Word → ℚ

Semantic reliability values v^i. Color adjectives are more reliable than size adjectives: v^color = 19/20 (0.95), v^size = 4/5 (0.8).

Equations

Instances For

source

def Phenomena.Reference.Studies.WaldonDegen2021.lexContinuousQ (r : Referent) (w : Word) :

ℚ

Continuous lexical interpretation L^C(r, i). Returns v^i if true, (1 - v^i) if false.

Equations

One or more equations did not get rendered due to their size.

Instances For

source

def Phenomena.Reference.Studies.WaldonDegen2021.uttContinuousQ (r : Referent) (u : List Word) :

ℚ

Continuous utterance meaning ⟦u⟧^C(r) = ∏_{w ∈ u} L^C(r, w).

Equations

One or more equations did not get rendered due to their size.

Instances For

source

def Phenomena.Reference.Studies.WaldonDegen2021.uttBoolTrue (u : List Word) (r : Referent) :

Bool

Boolean utterance truth: conjunction of word applicability.

Equations

Phenomena.Reference.Studies.WaldonDegen2021.uttBoolTrue u r = u.all fun (w : Phenomena.Reference.Studies.WaldonDegen2021.Word) => Phenomena.Reference.Studies.WaldonDegen2021.wordApplies w r

Instances For

source

def Phenomena.Reference.Studies.WaldonDegen2021.allUttsEng :

List (List Word)

All grammatical English (prenominal) utterances, each terminated by .stop. In English the noun always comes last before stop, so "pin" naturally precedes the stopping decision.

Equations

One or more equations did not get rendered due to their size.

Instances For

source

def Phenomena.Reference.Studies.WaldonDegen2021.allUttsSpn :

List (List Word)

All grammatical Spanish (postnominal) utterances, each terminated by .stop. The stop token is critical here: after [pin, blue], the S1 chooses between .stop (2-word non-redundant) and .small (continuing to the 3-word redundant utterance). Without .stop, the model forces continuation whenever valid extensions exist.

Equations

One or more equations did not get rendered due to their size.

Instances For

source

def Phenomena.Reference.Studies.WaldonDegen2021.sceneFilter (utts : List (List Word)) (scene : Referent → Bool) :

List (List Word)

Scene-filtered utterances: only those Boolean-true of at least one scene member (Figure 1). This yields 7 utterances per scene.

Equations

One or more equations did not get rendered due to their size.

Instances For

source

def Phenomena.Reference.Studies.WaldonDegen2021.wordCostQ :

Word → ℚ

Per-word production cost (Section 4): each adjective incurs cost 0.1. Pin and stop have zero cost (noun and utterance boundary).

Equations

Instances For

source

def Phenomena.Reference.Studies.WaldonDegen2021.continuousMeaningQ (utts : List (List Word)) (scene : Referent → Bool) (pfx : List Word) (r : Referent) :

ℚ

Incremental continuous meaning: average continuous semantics over all grammatical completions of prefix.

X^C(c, i, r) = Σ_{u ⊒ c+i} ⟦u⟧^C(r) / |{u : u ⊒ c+i}|

Equations

One or more equations did not get rendered due to their size.

Instances For

source

noncomputable def Phenomena.Reference.Studies.WaldonDegen2021.continuousMeaning (utts : List (List Word)) (scene : Referent → Bool) (pfx : List Word) (r : Referent) :

ℝ

Real-valued continuous meaning (for RSAConfig).

Equations

Phenomena.Reference.Studies.WaldonDegen2021.continuousMeaning utts scene pfx r = ↑(Phenomena.Reference.Studies.WaldonDegen2021.continuousMeaningQ utts scene pfx r)

Instances For

source

def Phenomena.Reference.Studies.WaldonDegen2021.ssScene :

Referent → Bool

Size-sufficient scene: {big_blue, big_red, small_blue}. Target small_blue is uniquely identified by size alone.

Equations

Instances For

source

def Phenomena.Reference.Studies.WaldonDegen2021.csScene :

Referent → Bool

Color-sufficient scene: {small_red, big_red, small_blue}. Target small_blue is uniquely identified by color alone.

Equations

Instances For

source

noncomputable def Phenomena.Reference.Studies.WaldonDegen2021.mkCIRSA (utts : List (List Word)) (scene : Referent → Bool) :

RSA.RSAConfig Word Referent

CI-RSA configuration parameterized by utterance set and scene.

L0 uses extension-based continuous meaning, returning 0 for referents outside the scene
S1 uses rpow-based scoring with α = 7 and per-word cost C(i)
S1(i|c,r) ∝ L0(r|c,i)^α · exp(−α · C(i)) (Section 4)

Note: v^color = 0.95 here, matching the paper's fitted values. This differs from the @cite{degen-etal-2020} value of v^color = 0.99 used in RSA.Core.Noise, because the two papers fit different experimental datasets.

Equations

One or more equations did not get rendered due to their size.

Instances For

source

noncomputable def Phenomena.Reference.Studies.WaldonDegen2021.englishSS :

RSA.RSAConfig Word Referent

English (prenominal) CI-RSA in size-sufficient scene.

Equations

One or more equations did not get rendered due to their size.

Instances For

source

noncomputable def Phenomena.Reference.Studies.WaldonDegen2021.englishCS :

RSA.RSAConfig Word Referent

English (prenominal) CI-RSA in color-sufficient scene.

Equations

One or more equations did not get rendered due to their size.

Instances For

source

noncomputable def Phenomena.Reference.Studies.WaldonDegen2021.spanishSS :

RSA.RSAConfig Word Referent

Spanish (postnominal) CI-RSA in size-sufficient scene.

Equations

One or more equations did not get rendered due to their size.

Instances For

source

noncomputable def Phenomena.Reference.Studies.WaldonDegen2021.spanishCS :

RSA.RSAConfig Word Referent

Spanish (postnominal) CI-RSA in color-sufficient scene.

Equations

One or more equations did not get rendered due to their size.

Instances For

source

theorem Phenomena.Reference.Studies.WaldonDegen2021.ss_eng_has_7_utts :

(sceneFilter allUttsEng ssScene).length = 7

source

theorem Phenomena.Reference.Studies.WaldonDegen2021.cs_eng_has_7_utts :

(sceneFilter allUttsEng csScene).length = 7

source

theorem Phenomena.Reference.Studies.WaldonDegen2021.ss_spn_has_7_utts :

(sceneFilter allUttsSpn ssScene).length = 7

source

theorem Phenomena.Reference.Studies.WaldonDegen2021.cs_spn_has_7_utts :

(sceneFilter allUttsSpn csScene).length = 7

source

theorem Phenomena.Reference.Studies.WaldonDegen2021.color_more_reliable_than_size :

semanticValueQ Word.blue > semanticValueQ Word.big ∧ semanticValueQ Word.red > semanticValueQ Word.small

Color adjectives have higher reliability than size adjectives. This asymmetry drives the redundant modification predictions.

source

theorem Phenomena.Reference.Studies.WaldonDegen2021.semantic_values_positive (w : Word) :

semanticValueQ w > 0

All semantic values are positive (required for valid probability).

source

theorem Phenomena.Reference.Studies.WaldonDegen2021.lexContinuous_as_noiseChannel (r : Referent) (w : Word) :

lexContinuousQ r w = RSA.Noise.noiseChannel (semanticValueQ w) (1 - semanticValueQ w) (if wordApplies w r = true then 1 else 0)

lexContinuousQ is an instance of the unified noise channel from RSA.Core.Noise. The continuous lexical semantics L^C(r, i) is exactly the noise channel with onMatch = v^i, onMismatch = 1 - v^i, b = 1 if item i is true of referent r, 0 otherwise.

This connects @cite{waldon-degen-2021} to the @cite{degen-etal-2020} parameterization where mismatch = 1 - match.

Documentation

Linglib.Phenomena.Reference.Studies.WaldonDegen2021

@cite{waldon-degen-2021} — Continuous-Incremental RSA (CI-RSA) #

The Model #

Formalization #

Predictions #

Connections #