Weak Evidence Effect: Empirical Data #
@cite{barnett-griffiths-hawkins-2022}
Experimental data from @cite{barnett-griffiths-hawkins-2022} on how weak positive evidence can backfire when listeners expect speakers to have persuasive goals.
The Stick Contest #
Two contestants each want to convince a judge that the average of 5 hidden sticks (1"–9") is longer (or shorter) than the midpoint (5"). Each reveals one stick. Participants play both the speaker role (expectation phase) and judge role (listener judgment phase).
Key Findings #
- 67% of participants expected speakers to show the strongest evidence
- For this group, weak evidence backfired (m = 34.7 on 0–100 scale, vs 50 midpoint)
- The speaker-dependent RSA model outperforms anchor-and-adjust alternatives
Listener type inferred from speaker expectation phase
- pragmatic : ListenerType
- literal : ListenerType
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Evidence strength conditions (distance from midpoint 5")
- weak : EvidenceStrength
- moderate : EvidenceStrength
- strong : EvidenceStrength
- strongest : EvidenceStrength
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
The actual experimental parameters
Equations
- Phenomena.ScalarImplicatures.WeakEvidenceEffect.design = { nSticks := 5, minLength := 1, maxLength := 9, midpoint := 5, nParticipants := 723 }
Instances For
Proportion expecting strongest evidence (pragmatic listeners)
Equations
Instances For
Proportion expecting weaker evidence (literal listeners)
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- Phenomena.ScalarImplicatures.WeakEvidenceEffect.interactionEffect = { tStatistic := 52 / 10, df := 718, pLessThan := 1 / 1000 }
Instances For
Behavioral result for a listener group
- listenerType : ListenerType
- nParticipants : ℕ
- meanSlider : ℚ
- ci95Lower : ℚ
- ci95Upper : ℚ
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Pragmatic group: weak evidence backfires (mean below 50)
Equations
- One or more equations did not get rendered due to their size.
Instances For
Literal group: no weak evidence effect (mean at 50)
Equations
- One or more equations did not get rendered due to their size.
Instances For
Pragmatic group shows backfire: mean significantly below 50 (midpoint)
Literal group shows no backfire: mean at midpoint
The two groups differ in the predicted direction
Model families compared
- anchorAdjust : ModelFamily
- minAcceptable : ModelFamily
- rsaPragmatic : ModelFamily
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Model variant (how individual differences are handled)
- homogeneous : ModelVariant
- heterogeneous : ModelVariant
- speakerDependent : ModelVariant
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Model comparison result from Table 1
- family : ModelFamily
- variant : ModelVariant
- logLikelihood : ℚ
- waic : ℚ
- waicSE : ℚ
- psisLoo : ℚ
- psisLooSE : ℚ
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Table 1 data
Equations
- One or more equations did not get rendered due to their size.
Instances For
The RSA speaker-dependent model has the best (highest) log-likelihood
The RSA speaker-dependent model has the best (lowest) WAIC
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
β > 0 provides strong support for non-zero persuasive bias
Pragmatic group is best explained by J1 (pragmatic listener model)
Literal group is best explained by J0 (literal listener model)