Documentation

Linglib.Core.Agent.Emotion

Emotion as Post-Inference Appraisal @cite{houlihan-kleiman-weiner-hewitt-tenenbaum-saxe-2023} #

Houlihan, Kleiman-Weiner, @cite{houlihan-kleiman-weiner-hewitt-tenenbaum-saxe-2023} show that emotions are not primitive mental states — they are computed from more basic cognitive variables via a three-layer architecture:

Inverse Planning (= BToM): observe action → infer beliefs, desires, percepts
Computed Appraisals: from inferred mental states, compute appraisal variables across multiple utility domains (monetary, affiliation, social equity)
Emotion Concepts: each emotion is a specific qualitative pattern (β weights) over the shared appraisal space — a linear readout with logistic transform

The key architectural claims:

Appraisals are post-inference: computed FROM existing BToM posteriors without adding new latent variables to the generative model.
The same appraisal variables are computed for all emotions; different emotions are different readout patterns (β vectors) over the shared appraisal space.
All 20 emotion concepts are retrospective: they presuppose an observed outcome. Prospective emotions (hope, fear, dread) require different computations over uncertain future outcomes.

Appraisal Dimensions #

The paper computes appraisals along four types × two perspectives:

Type	Definition	BToM Component
AU	U(outcome \| preferences)	Desire marginal
PE	outcome − E[outcome \| beliefs]	Belief expectation
CFa	U(agent's alternative) − U(actual)	Planning model
CFo	U(opponent's alternative) − U(actual)	Planning model

Perspective	Meaning
Base	Direct outcomes: monetary + social utility
Reputational	How the agent's choices appear to others

The paper's full model decomposes base utility into three domains (monetary, affiliation, social equity), yielding an 18-dimensional appraisal space. Our qualitative profiles collapse these base domains into a single "base" perspective (4 types × 2 perspectives = 8 dimensions), which suffices to uniquely characterize all 20 emotion concepts.

inductive Core.Agent.Emotion.AppraisalType :

The four appraisal computation types from @cite{houlihan-kleiman-weiner-hewitt-tenenbaum-saxe-2023}.

Each type is a different way of evaluating an outcome relative to the agent's inferred mental states:

AU: value of actual outcome given preferences
PE: actual outcome minus expected outcome given beliefs
CFa: utility of agent's alternative action minus actual
CFo: utility of opponent's alternative action minus actual

achievedUtility : AppraisalType
predictionError : AppraisalType
counterfactualAgent : AppraisalType
counterfactualOpponent : AppraisalType

Instances For

instance Core.Agent.Emotion.instDecidableEqAppraisalType :

DecidableEq AppraisalType

Equations

Core.Agent.Emotion.instDecidableEqAppraisalType x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯

def Core.Agent.Emotion.instBEqAppraisalType.beq :

AppraisalType → AppraisalType → Bool

Equations

Core.Agent.Emotion.instBEqAppraisalType.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)

Instances For

instance Core.Agent.Emotion.instBEqAppraisalType :

BEq AppraisalType

Equations

Core.Agent.Emotion.instBEqAppraisalType = { beq := Core.Agent.Emotion.instBEqAppraisalType.beq }

def Core.Agent.Emotion.instReprAppraisalType.repr :

AppraisalType → ℕ → Std.Format

Equations

One or more equations did not get rendered due to their size.

Instances For

instance Core.Agent.Emotion.instReprAppraisalType :

Repr AppraisalType

Equations

Core.Agent.Emotion.instReprAppraisalType = { reprPrec := Core.Agent.Emotion.instReprAppraisalType.repr }

inductive Core.Agent.Emotion.AppraisalPerspective :

Perspective for appraisal computation.

The paper's full model has three base utility domains (monetary, affiliation, social equity) plus reputational utility. We collapse the three base domains into a single "base" perspective, capturing the key structural distinction: base appraisals evaluate direct outcomes, reputational appraisals evaluate how the agent's choices appear to others.

base : AppraisalPerspective
reputational : AppraisalPerspective

Instances For

instance Core.Agent.Emotion.instDecidableEqAppraisalPerspective :

DecidableEq AppraisalPerspective

Equations

Core.Agent.Emotion.instDecidableEqAppraisalPerspective x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯

def Core.Agent.Emotion.instBEqAppraisalPerspective.beq :

AppraisalPerspective → AppraisalPerspective → Bool

Equations

Core.Agent.Emotion.instBEqAppraisalPerspective.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)

Instances For

instance Core.Agent.Emotion.instBEqAppraisalPerspective :

BEq AppraisalPerspective

Equations

Core.Agent.Emotion.instBEqAppraisalPerspective = { beq := Core.Agent.Emotion.instBEqAppraisalPerspective.beq }

def Core.Agent.Emotion.instReprAppraisalPerspective.repr :

AppraisalPerspective → ℕ → Std.Format

Equations

One or more equations did not get rendered due to their size.

Instances For

instance Core.Agent.Emotion.instReprAppraisalPerspective :

Repr AppraisalPerspective

Equations

Core.Agent.Emotion.instReprAppraisalPerspective = { reprPrec := Core.Agent.Emotion.instReprAppraisalPerspective.repr }

inductive Core.Agent.Emotion.AppraisalSign :

Qualitative sign of an appraisal dimension in an emotion profile.

Rather than modeling continuous β weights, we capture the qualitative structure: whether high values of a given appraisal dimension increase the emotion (+), decrease it (−), or are irrelevant (·). Abstracted from the learned transformation in Houlihan et al.'s Fig. 4.

positive : AppraisalSign
negative : AppraisalSign
irrelevant : AppraisalSign

Instances For

instance Core.Agent.Emotion.instDecidableEqAppraisalSign :

DecidableEq AppraisalSign

Equations

Core.Agent.Emotion.instDecidableEqAppraisalSign x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯

def Core.Agent.Emotion.instBEqAppraisalSign.beq :

AppraisalSign → AppraisalSign → Bool

Equations

Core.Agent.Emotion.instBEqAppraisalSign.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)

Instances For

instance Core.Agent.Emotion.instBEqAppraisalSign :

BEq AppraisalSign

Equations

Core.Agent.Emotion.instBEqAppraisalSign = { beq := Core.Agent.Emotion.instBEqAppraisalSign.beq }

def Core.Agent.Emotion.instReprAppraisalSign.repr :

AppraisalSign → ℕ → Std.Format

Equations

One or more equations did not get rendered due to their size.

Instances For

instance Core.Agent.Emotion.instReprAppraisalSign :

Repr AppraisalSign

Equations

Core.Agent.Emotion.instReprAppraisalSign = { reprPrec := Core.Agent.Emotion.instReprAppraisalSign.repr }

inductive Core.Agent.Emotion.TemporalOrientation :

Temporal orientation of an emotion.

Retrospective emotions presuppose an observed outcome and evaluate it against the agent's inferred mental states. Prospective emotions (hope, fear, dread) concern uncertain future outcomes and require expected utility computations — a fundamentally different appraisal architecture not covered by the current model.

All 20 emotion concepts in Houlihan et al. are retrospective.

retrospective : TemporalOrientation
prospective : TemporalOrientation

Instances For

instance Core.Agent.Emotion.instDecidableEqTemporalOrientation :

DecidableEq TemporalOrientation

Equations

Core.Agent.Emotion.instDecidableEqTemporalOrientation x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯

instance Core.Agent.Emotion.instBEqTemporalOrientation :

BEq TemporalOrientation

Equations

Core.Agent.Emotion.instBEqTemporalOrientation = { beq := Core.Agent.Emotion.instBEqTemporalOrientation.beq }

def Core.Agent.Emotion.instBEqTemporalOrientation.beq :

TemporalOrientation → TemporalOrientation → Bool

Equations

Core.Agent.Emotion.instBEqTemporalOrientation.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)

Instances For

def Core.Agent.Emotion.instReprTemporalOrientation.repr :

TemporalOrientation → ℕ → Std.Format

Equations

One or more equations did not get rendered due to their size.

Instances For

instance Core.Agent.Emotion.instReprTemporalOrientation :

Repr TemporalOrientation

Equations

Core.Agent.Emotion.instReprTemporalOrientation = { reprPrec := Core.Agent.Emotion.instReprTemporalOrientation.repr }

def Core.Agent.Emotion.achievedUtility {F : Type u_1} [CommSemiring F] {A : Type u_2} {P : Type u_3} {B : Type u_4} {D : Type u_5} {S : Type u_6} {M : Type u_7} {W : Type u_8} [Fintype P] [Fintype B] [Fintype D] [Fintype S] [Fintype M] [Fintype W] (model : BToM.BToMModel F A P B D S M W) (utility : W → D → F) (action : A) (world : W) :

F

Achieved Utility from BToM desire marginals.

AU(a, w) = Σ_d P(d | a) · U(w, d)

The desire marginal P(d | a) is the observer's posterior over what the agent wanted, given the observed action. Weighting by utility U(w, d) yields the expected value of the actual outcome under the inferred desires.

This is the core post-inference property: AU is a function of the BToM posterior, not a primitive input to the model.

Equations

Core.Agent.Emotion.achievedUtility model utility action world = ∑ d : D, model.desireMarginal action d * utility world d

Instances For

def Core.Agent.Emotion.predictionError {F : Type u_1} [CommSemiring F] {A : Type u_2} {P : Type u_3} {B : Type u_4} {D : Type u_5} {S : Type u_6} {M : Type u_7} {W : Type u_8} [Fintype P] [Fintype B] [Fintype D] [Fintype S] [Fintype M] [Fintype W] [AddCommGroup F] (model : BToM.BToMModel F A P B D S M W) (outcome : W → F) (beliefPrediction : B → F) (action : A) (world : W) :

F

Prediction Error from BToM belief expectation.

PE(a, w) = outcome(w) − E_B[prediction(b) | a]

The belief expectation is the observer's posterior-weighted average of what the agent believed would happen. The difference from the actual outcome measures surprise.

Equations

Core.Agent.Emotion.predictionError model outcome beliefPrediction action world = outcome world - model.beliefExpectation beliefPrediction action

Instances For

def Core.Agent.Emotion.counterfactualAppraisal {F : Type u_1} {A : Type u_2} {W : Type u_8} (utility : W → A → F) [AddCommGroup F] (actualAction altAction : A) (world : W) :

F

Counterfactual Appraisal: utility difference from alternative action.

CF(a_actual, a_alt, w) = U(w, a_alt) − U(w, a_actual)

Serves as both CFa (agent counterfactual) and CFo (opponent counterfactual) depending on which agent's action space is being varied:

CFa: altAction ranges over the focal agent's alternative actions
CFo: altAction ranges over the opponent's alternative actions

Positive values mean the alternative would have been better.

Equations

Core.Agent.Emotion.counterfactualAppraisal utility actualAction altAction world = utility world altAction - utility world actualAction

Instances For

theorem Core.Agent.Emotion.au_from_btom_marginals {F : Type u_1} [CommSemiring F] {A : Type u_2} {P : Type u_3} {B : Type u_4} {D : Type u_5} {S : Type u_6} {M : Type u_7} {W : Type u_8} [Fintype P] [Fintype B] [Fintype D] [Fintype S] [Fintype M] [Fintype W] (model : BToM.BToMModel F A P B D S M W) (utility : W → D → F) (action : A) (world : W) :

achievedUtility model utility action world = ∑ d : D, model.desireMarginal action d * utility world d

AU is a desire-marginal weighted sum — structurally post-inference.

theorem Core.Agent.Emotion.pe_uses_belief_expectation {F : Type u_1} [CommSemiring F] {A : Type u_2} {P : Type u_3} {B : Type u_4} {D : Type u_5} {S : Type u_6} {M : Type u_7} {W : Type u_8} [Fintype P] [Fintype B] [Fintype D] [Fintype S] [Fintype M] [Fintype W] [AddCommGroup F] (model : BToM.BToMModel F A P B D S M W) (outcome : W → F) (beliefPrediction : B → F) (action : A) (world : W) :

predictionError model outcome beliefPrediction action world = outcome world - model.beliefExpectation beliefPrediction action

PE uses beliefExpectation — structurally post-inference.

structure Core.Agent.Emotion.PerspectiveWeights :

Appraisal weights for a single appraisal type, split by perspective.

base : AppraisalSign
reputational : AppraisalSign

Instances For

instance Core.Agent.Emotion.instDecidableEqPerspectiveWeights :

DecidableEq PerspectiveWeights

Equations

Core.Agent.Emotion.instDecidableEqPerspectiveWeights = Core.Agent.Emotion.instDecidableEqPerspectiveWeights.decEq

def Core.Agent.Emotion.instDecidableEqPerspectiveWeights.decEq (x✝ x✝¹ : PerspectiveWeights) :

Decidable (x✝ = x✝¹)

Equations

One or more equations did not get rendered due to their size.

Instances For

instance Core.Agent.Emotion.instBEqPerspectiveWeights :

BEq PerspectiveWeights

Equations

Core.Agent.Emotion.instBEqPerspectiveWeights = { beq := Core.Agent.Emotion.instBEqPerspectiveWeights.beq }

def Core.Agent.Emotion.instBEqPerspectiveWeights.beq :

PerspectiveWeights → PerspectiveWeights → Bool

Equations

Core.Agent.Emotion.instBEqPerspectiveWeights.beq { base := a, reputational := a_1 } { base := b, reputational := b_1 } = (a == b && a_1 == b_1)
Core.Agent.Emotion.instBEqPerspectiveWeights.beq x✝¹ x✝ = false

Instances For

instance Core.Agent.Emotion.instReprPerspectiveWeights :

Repr PerspectiveWeights

Equations

Core.Agent.Emotion.instReprPerspectiveWeights = { reprPrec := Core.Agent.Emotion.instReprPerspectiveWeights.repr }

def Core.Agent.Emotion.instReprPerspectiveWeights.repr :

PerspectiveWeights → ℕ → Std.Format

Equations

One or more equations did not get rendered due to their size.

Instances For

structure Core.Agent.Emotion.AppraisalWeights :

The 4 × 2 qualitative appraisal weight matrix for an emotion concept.

Each of the four appraisal types has a base and reputational weight, yielding 8 qualitative dimensions. This is our abstraction of the paper's full 18-dimensional learned β weight matrix (Fig. 4), collapsing monetary + affiliation + social equity into "base."

Instances For

instance Core.Agent.Emotion.instDecidableEqAppraisalWeights :

DecidableEq AppraisalWeights

Equations

Core.Agent.Emotion.instDecidableEqAppraisalWeights = Core.Agent.Emotion.instDecidableEqAppraisalWeights.decEq

def Core.Agent.Emotion.instDecidableEqAppraisalWeights.decEq (x✝ x✝¹ : AppraisalWeights) :

Decidable (x✝ = x✝¹)

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.instBEqAppraisalWeights.beq :

AppraisalWeights → AppraisalWeights → Bool

Equations

One or more equations did not get rendered due to their size.
Core.Agent.Emotion.instBEqAppraisalWeights.beq x✝¹ x✝ = false

Instances For

instance Core.Agent.Emotion.instBEqAppraisalWeights :

BEq AppraisalWeights

Equations

Core.Agent.Emotion.instBEqAppraisalWeights = { beq := Core.Agent.Emotion.instBEqAppraisalWeights.beq }

instance Core.Agent.Emotion.instReprAppraisalWeights :

Repr AppraisalWeights

Equations

Core.Agent.Emotion.instReprAppraisalWeights = { reprPrec := Core.Agent.Emotion.instReprAppraisalWeights.repr }

def Core.Agent.Emotion.instReprAppraisalWeights.repr :

AppraisalWeights → ℕ → Std.Format

Equations

One or more equations did not get rendered due to their size.

Instances For

structure Core.Agent.Emotion.EmotionProfile :

An emotion concept as a qualitative pattern over the appraisal space.

The paper's central claim: each emotion is a specific readout pattern (β weight vector) over the shared set of computed appraisals. Different emotions = different patterns over the SAME underlying variables. The readout is applied identically for all emotions (eq. 8.2):

P(e | ψ) ∝ exp(β_e · ψ + b_e)

name : String
weights : AppraisalWeights
orientation : TemporalOrientation

Instances For

def Core.Agent.Emotion.instDecidableEqEmotionProfile.decEq (x✝ x✝¹ : EmotionProfile) :

Decidable (x✝ = x✝¹)

Equations

One or more equations did not get rendered due to their size.

Instances For

instance Core.Agent.Emotion.instDecidableEqEmotionProfile :

DecidableEq EmotionProfile

Equations

Core.Agent.Emotion.instDecidableEqEmotionProfile = Core.Agent.Emotion.instDecidableEqEmotionProfile.decEq

instance Core.Agent.Emotion.instBEqEmotionProfile :

BEq EmotionProfile

Equations

Core.Agent.Emotion.instBEqEmotionProfile = { beq := Core.Agent.Emotion.instBEqEmotionProfile.beq }

def Core.Agent.Emotion.instBEqEmotionProfile.beq :

EmotionProfile → EmotionProfile → Bool

Equations

Core.Agent.Emotion.instBEqEmotionProfile.beq { name := a, weights := a_1, orientation := a_2 } { name := b, weights := b_1, orientation := b_2 } = (a == b && (a_1 == b_1 && a_2 == b_2))
Core.Agent.Emotion.instBEqEmotionProfile.beq x✝¹ x✝ = false

Instances For

instance Core.Agent.Emotion.instReprEmotionProfile :

Repr EmotionProfile

Equations

Core.Agent.Emotion.instReprEmotionProfile = { reprPrec := Core.Agent.Emotion.instReprEmotionProfile.repr }

def Core.Agent.Emotion.instReprEmotionProfile.repr :

EmotionProfile → ℕ → Std.Format

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.AppraisalWeights.getSign (w : AppraisalWeights) (t : AppraisalType) (p : AppraisalPerspective) :

Look up the qualitative sign for a specific appraisal type and perspective.

Equations

Instances For

def Core.Agent.Emotion.EmotionProfile.isPurelyReputational (e : EmotionProfile) :

All base (non-reputational) dimensions are irrelevant.

Equations

One or more equations did not get rendered due to their size.

Instances For

Qualitative profiles abstracted from the learned β weights in Houlihan et al.'s Fig. 4. Convention: .positive = β > 0 (high appraisal values increase emotion), .negative = β < 0, .irrelevant = β ≈ 0.

Each definition specifies: ⟨name, ⟨au, pe, cfa, cfo⟩, orientation⟩ where each of au, pe, cfa, cfo is ⟨base, reputational⟩.

Emotion	AU_b	AU_r	PE_b	PE_r	CFa_b	CFa_r	CFo_b	CFo_r
joy	+	·	+	·	·	·	·	·
surprise	·	·	+	·	·	·	·	·
pride	+	+	·	·	+	·	·	·
gratitude	+	·	+	·	·	·	−	·
relief	+	·	+	·	−	·	·	·
amusement	+	·	+	·	·	·	+	·
disappointment	−	·	−	·	+	·	·	·
annoyance	−	·	−	·	·	·	·	·
fury	−	−	−	·	·	·	·	·
embarrassment	·	−	·	−	·	−	·	·
regret	−	·	·	·	−	·	·	·
guilt	·	−	·	·	·	−	·	·
disgust	−	−	·	·	·	·	·	·
devastation	−	·	−	·	·	·	−	·
contempt	·	−	·	·	·	·	·	·
respect	·	+	·	·	·	·	·	·
envy	−	·	·	·	·	·	+	·
sympathy	·	·	·	−	·	·	·	−
confusion	·	·	+	+	·	·	·	·
excitement	+	·	+	+	·	·	·	·

def Core.Agent.Emotion.joy :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.surprise :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.pride :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.gratitude :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.relief :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.amusement :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.disappointment :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.annoyance :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.fury :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.embarrassment :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.regret :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.guilt :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.disgust :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.devastation :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.contempt :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.respect :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.envy :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.sympathy :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.confusion :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.excitement :

Equations

One or more equations did not get rendered due to their size.

Instances For

def Core.Agent.Emotion.allEmotions :

List EmotionProfile

All 20 emotion concepts from @cite{houlihan-kleiman-weiner-hewitt-tenenbaum-saxe-2023}.

Equations

One or more equations did not get rendered due to their size.

Instances For

theorem Core.Agent.Emotion.twenty_emotions :

allEmotions.length = 20

theorem Core.Agent.Emotion.appraisal_patterns_distinguishable :

List.Pairwise (fun (x1 x2 : AppraisalWeights) => x1 ≠ x2) (List.map (fun (x : EmotionProfile) => x.weights) allEmotions)

All 20 emotions have distinct appraisal weight patterns. This is the paper's central empirical finding (Fig. 4): "the learned appraisal structure is unique for each emotion."

theorem Core.Agent.Emotion.all_retrospective :

(allEmotions.all fun (x : EmotionProfile) => x.orientation == TemporalOrientation.retrospective) = true

All 20 emotions in the model are retrospective (post-outcome). The paper explicitly excludes prospective emotions: "we target retrospective emotions... and did not include prospective emotions that concern uncertain future events (e.g. hope, fear)" (p. 22).

theorem Core.Agent.Emotion.embarrassment_purely_reputational :

embarrassment.isPurelyReputational = true

Embarrassment is purely reputational: all base dimensions are irrelevant.