Bidirectional Optimality Theory #
@cite{blutner-2000} @cite{horn-1984} @cite{atlas-levinson-1981}
Bidirectional OT integrates production and comprehension optimality into a single framework. The key insight is that the Gricean maxims decompose into two opposing optimization pressures:
Q-principle (production optimality): for a given meaning τ, the optimal form A is the one that produces ⟨A, τ⟩ most harmonically. Blocking mechanism: suppresses interpretations that can be expressed more economically by an alternative form. Corresponds to the first part of Grice's Quantity maxim ("be as informative as required") and the force of diversification / auditor's economy (@cite{atlas-levinson-1981}, @cite{horn-1984}).
I-principle (comprehension optimality): for a given form A, the optimal meaning τ is the one that produces ⟨A, τ⟩ most harmonically. Interpretation mechanism: selects the most coherent/stereotypical reading. Corresponds to the second part of Grice's Quantity maxim ("do not make your contribution more informative than required"), Relation, Manner, and the force of unification / speaker's economy (@cite{atlas-levinson-1981}, @cite{horn-1984}'s R-principle).
These correspond to the speaker/hearer optimization layers in RSA (@cite{frank-goodman-2012}) and IBR (@cite{franke-2011}).
Strong vs Weak BiOT #
@cite{blutner-2000} defines two versions:
Strong (eq. 9): Q and I are independent — ⟨A, τ⟩ is optimal iff it survives BOTH Q and I evaluated against the full candidate set. Solution concept: Nash Equilibrium.
Weak (eq. 14): Q and I mutually constrain each other — competition under Q is restricted to I-surviving pairs, and vice versa. Solution concept: greatest fixed point of the blocking operator.
The weak version derives @cite{horn-1984}'s division of pragmatic labour: unmarked forms pair with unmarked (stereotypical) meanings, and marked forms pair with marked (non-stereotypical) meanings. The strong version incorrectly predicts that marked forms are blocked in ALL their interpretations.
Connection to RSA and IBR #
All three frameworks — BiOT, IBR, RSA — perform alternating speaker/hearer optimization. They differ in the optimization criterion:
| Framework | Comparison | Solution |
|---|---|---|
| Strong BiOT | lexicographic (OT) | Nash equilibrium |
| Weak BiOT | lexicographic (OT) | greatest fixed point |
| IBR | argmax (set-level) | iterated best response |
| RSA | softmax (probabilistic) | Bayesian posterior |
The relationship: RSA α→∞ approximates IBR, and IBR fixed points
correspond to weak-BiOT super-optimal pairs for scalar games
(see IBR/ScalarGames.lean).
Q-principle (production optimality, @cite{blutner-2000} eq. 9(Q)): ⟨A, τ⟩ satisfies Q iff no other pair with the same meaning τ but a different form A' is strictly more harmonic.
Operationally: the speaker chose A because it is the best way to express τ. If a better form A' existed, A would be blocked. This is the blocking mechanism — it suppresses interpretations that can be triggered more economically by an alternative form (@cite{horn-1984}'s Q-based implicature).
Equations
Instances For
I-principle (comprehension optimality, @cite{blutner-2000} eq. 9(I)): ⟨A, τ⟩ satisfies I iff no other pair with the same form A but a different meaning τ' is strictly more harmonic.
Operationally: the hearer chose τ because it is the best interpretation of A. If a better meaning τ' existed, τ would be dispreferred. This is the interpretation mechanism — it selects the most coherent/stereotypical reading (@cite{horn-1984}'s R-based inference).
Equations
Instances For
Strong-optimal = satisfies BOTH Q and I against the full candidate set.
This is equivalent to strongOptimal in ConstraintEvaluation.
@cite{horn-1984}'s division of pragmatic labour: "unmarked forms tend to be used for unmarked situations and marked forms for marked situations." This emerges naturally from weak BiOT but NOT from strong BiOT. The example below formalizes @cite{blutner-2000}'s tableau (15).
Classic examples:
- kill (unmarked) / cause to die (marked): kill ↔ stereotypical
causation (direct), cause to die ↔ non-stereotypical (indirect)
- pork (unmarked) / pig-meat (marked, blocked): pork ↔ meat reading,
pig ↔ animal reading — *pig-meat is totally blocked because pork
exists
- him (unmarked) / himself (marked): himself ↔ coreferential,
him ↔ disjoint reference
Forms in Horn's division example.
Instances For
Equations
- Theories.Pragmatics.Bidirectional.instBEqHornForm.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Meanings in Horn's division example.
- stereotypical : HornMeaning
- nonStereotypical : HornMeaning
Instances For
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
All form-meaning pairs (forms are semantically equivalent).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Constraint profile: [F-violations, C-violations]. F penalizes marked forms; C penalizes non-stereotypical meanings.
Equations
- Theories.Pragmatics.Bidirectional.hornProfile (Theories.Pragmatics.Bidirectional.HornForm.unmarked, Theories.Pragmatics.Bidirectional.HornMeaning.stereotypical) = [0, 0]
- Theories.Pragmatics.Bidirectional.hornProfile (Theories.Pragmatics.Bidirectional.HornForm.unmarked, Theories.Pragmatics.Bidirectional.HornMeaning.nonStereotypical) = [0, 1]
- Theories.Pragmatics.Bidirectional.hornProfile (Theories.Pragmatics.Bidirectional.HornForm.marked, Theories.Pragmatics.Bidirectional.HornMeaning.stereotypical) = [1, 0]
- Theories.Pragmatics.Bidirectional.hornProfile (Theories.Pragmatics.Bidirectional.HornForm.marked, Theories.Pragmatics.Bidirectional.HornMeaning.nonStereotypical) = [1, 1]
Instances For
Weak BiOT derives Horn's division: unmarked ↔ stereotypical, marked ↔ non-stereotypical. Both pairs survive.
Strong BiOT blocks the marked form entirely — only the unmarked pair survives. This is empirically wrong: marked forms DO get used (for marked meanings).
The weak version admits strictly MORE pairs than the strong version for this case. This is the key empirical argument for weak BiOT.
When an unmarked form has a SPECIALIZED meaning (not semantically equivalent to the marked form), the marked form can be totally blocked — it loses ALL its interpretations.
Example: pork (= pig-meat) blocks *pig in the meat reading.
Unlike Horn's division, here the unmarked form doesn't compete
for the non-stereotypical meaning because Gen doesn't pair them.
Forms for total blocking: a listed (lexicalized) form and a derived (productive) form.
Instances For
Equations
- Theories.Pragmatics.Bidirectional.instBEqLexForm.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- Theories.Pragmatics.Bidirectional.instBEqLexMeaning.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Gen for total blocking: the listed form only covers the specialized meaning; the derived form covers both meanings.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Profile: listed form is less marked (0 F-violations); specialized meaning is less marked (0 C-violations).
Equations
- Theories.Pragmatics.Bidirectional.totalBlockProfile (Theories.Pragmatics.Bidirectional.LexForm.listed, Theories.Pragmatics.Bidirectional.LexMeaning.specialized) = [0, 0]
- Theories.Pragmatics.Bidirectional.totalBlockProfile (Theories.Pragmatics.Bidirectional.LexForm.derived, Theories.Pragmatics.Bidirectional.LexMeaning.specialized) = [1, 0]
- Theories.Pragmatics.Bidirectional.totalBlockProfile (Theories.Pragmatics.Bidirectional.LexForm.derived, Theories.Pragmatics.Bidirectional.LexMeaning.general) = [1, 1]
- Theories.Pragmatics.Bidirectional.totalBlockProfile (Theories.Pragmatics.Bidirectional.LexForm.listed, Theories.Pragmatics.Bidirectional.LexMeaning.general) = [0, 1]
Instances For
Total blocking: the derived form loses the specialized meaning (blocked by the listed form) but retains the general meaning. Result: listed ↔ specialized, derived ↔ general.
Strong BiOT over-blocks: the derived form loses ALL interpretations because (.derived, .specialized) is I-better than (.derived, .general) under the same form. This makes strong BiOT too aggressive for partial blocking — only the listed form survives.
Weak BiOT correctly handles partial blocking — the derived form keeps the general meaning because its I-competitor (.derived, .specialized) was removed by Q-blocking from (.listed, .specialized).
@cite{blutner-2000} p. 12: "It is simple to prove that a pair which is optimal (strong bidirection), is super-optimal (weak bidirection) as well." Strong-optimal is a subset of super-optimal.
Horn's division demonstrates strong ⊂ weak (strict inclusion): the marked pair survives weak but not strong BiOT.
The iterative-removal algorithm (iterativeSuperoptimal) agrees with
strong BiOT for Horn's division. This shows that iterative removal
over-removes: it behaves like the strong version (eq. 9), not
the weak version (eq. 14).
The iterative-removal algorithm agrees with strong BiOT for total blocking as well.