Tagalog Nasal Substitution Data @cite{zuraw-2010}
Empirical data for Tagalog nasal substitution (@cite{zuraw-2010}), the running case study in @cite{magri-2025}'s analysis of constraint interaction in probabilistic phonology.
The process
When a nasal-final prefix (maŋ- or paŋ-) is concatenated with an obstruent-initial stem, the nasal and the obstruent may coalesce into a single consonant retaining the nasality of the former and the place of the latter (@cite{zuraw-2010}):
- maŋ + bigáj → mamigáj 'to distribute' (nasal substitution)
- paŋ + tabój → pantabój 'to goad' (place assimilation, no substitution)
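The two outcomes can be sketched in Lean as a small function on the prefix–stem junction. This is only an illustration under stated assumptions: `Place`, `nasalAt`, and `junction` are hypothetical names that do not come from the library, and segments are modelled as plain strings.

```lean
namespace ProcessSketch

/-- Place of articulation of the stem-initial obstruent (hypothetical type). -/
inductive Place where
  | labial | coronal | velar

/-- The nasal homorganic with a given place. -/
def nasalAt : Place → String
  | .labial => "m"
  | .coronal => "n"
  | .velar => "ŋ"

/-- Surface realisation of the prefix-final nasal plus the stem-initial obstruent.
With `substitute = true` the two segments coalesce into one nasal;
with `substitute = false` the nasal assimilates in place and the obstruent survives. -/
def junction (place : Place) (obstruent : String) (substitute : Bool) : String :=
  if substitute then nasalAt place else nasalAt place ++ obstruent

#eval junction .labial "b" true    -- "m"  (maŋ+bigáj → mamigáj)
#eval junction .coronal "t" false  -- "nt" (paŋ+tabój → pantabój)

end ProcessSketch
```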
Data organization
- § 1: Stem consonants — the six stem-initial obstruents from @cite{zuraw-2010}'s factorial typology
- § 2: Dictionary rates — text-verified substitution rates from the Tagalog dictionary study
- § 3–7: 2×2 square — @cite{magri-2025}'s arrangement of four underlying concatenations for probabilistic analysis
Equations
- Fragments.Tagalog.Phonology.instBEqStemC.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Whether nasal substitution applies.
Equations
- Fragments.Tagalog.Phonology.instBEqSubSt.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
A candidate is a stem consonant paired with a substitution decision.
Dictionary substitution rate for voiceless labial p (253/263 ≈ 96.2%). Text-verified from @cite{zuraw-2010}'s discussion of the Tagalog dictionary study.
Equations
- Fragments.Tagalog.Phonology.dictRate_p = 253 / 263
Dictionary substitution rate for voiced labial b (177/277 ≈ 63.9%). Text-verified from @cite{zuraw-2010}'s discussion of the Tagalog dictionary study.
Equations
- Fragments.Tagalog.Phonology.dictRate_b = 177 / 277
Voicing effect in dictionary data (labial place): voiceless p has a higher substitution rate than voiced b.
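The comparison reduces to integer arithmetic: because both denominators are positive, 253/263 > 177/277 holds exactly when 253 · 277 > 177 · 263. A minimal standalone check, which deliberately does not use the library's `dictRate_p` or `dictRate_b` and is not claimed to be how the library proves the result:

```lean
-- Cross-multiplied form of 177/277 < 253/263 (denominators are positive).
example : 177 * 263 < 253 * 277 := by decide

-- The two products, for reference.
#eval (177 * 263, 253 * 277)  -- (46551, 70081)
```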
The four underlying concatenations from @cite{magri-2025}'s 2×2 square arrangement. These cross two prefixes (maŋ-, paŋ-) with two of the six stem consonants (b, k).
- mang_b : NasalSubInput
- mang_k : NasalSubInput
- pang_b : NasalSubInput
- pang_k : NasalSubInput
Equations
- Fragments.Tagalog.Phonology.instBEqNasalSubInput.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
The two surface variants for each underlying form.
- yes : NasalSubOutput
YES: nasal substitution applies — nasal and obstruent coalesce.
- no : NasalSubOutput
NO: nasal substitution does not apply — place assimilation only.
Equations
- Fragments.Tagalog.Phonology.instBEqNasalSubOutput.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Input–output pair for constraint evaluation.
The 2×2 square of underlying forms: prefix × stem-initial obstruent.
C₁ = *NC: one violation for every nasal–obstruent sequence. Violated by NO (place assimilation preserves the NC sequence).
C₂ = *NC̥: one violation for nasal followed by voiceless obstruent. Violated by NO only for voiceless stems (k).
Equations
- Fragments.Tagalog.Phonology.starNCvoiceless (Fragments.Tagalog.Phonology.NasalSubInput.mang_k, Fragments.Tagalog.Phonology.NasalSubOutput.no) = 1
- Fragments.Tagalog.Phonology.starNCvoiceless (Fragments.Tagalog.Phonology.NasalSubInput.pang_k, Fragments.Tagalog.Phonology.NasalSubOutput.no) = 1
- Fragments.Tagalog.Phonology.starNCvoiceless x✝ = 0
C₃ = *[stemŋ]: one violation when the stem starts with a velar nasal. Violated by YES for k-initial stems (the coalesced ŋ is velar).
Equations
- Fragments.Tagalog.Phonology.starStemVelar (Fragments.Tagalog.Phonology.NasalSubInput.mang_k, Fragments.Tagalog.Phonology.NasalSubOutput.yes) = 1
- Fragments.Tagalog.Phonology.starStemVelar (Fragments.Tagalog.Phonology.NasalSubInput.pang_k, Fragments.Tagalog.Phonology.NasalSubOutput.yes) = 1
- Fragments.Tagalog.Phonology.starStemVelar x✝ = 0
C₄ = *[stemŋ/n]: one violation when the stem starts with a velar or coronal nasal. In the b vs k square, this coincides with C₃ (bilabial m is neither velar nor coronal).
Equations
- Fragments.Tagalog.Phonology.starStemVelarCoronal (Fragments.Tagalog.Phonology.NasalSubInput.mang_k, Fragments.Tagalog.Phonology.NasalSubOutput.yes) = 1
- Fragments.Tagalog.Phonology.starStemVelarCoronal (Fragments.Tagalog.Phonology.NasalSubInput.pang_k, Fragments.Tagalog.Phonology.NasalSubOutput.yes) = 1
- Fragments.Tagalog.Phonology.starStemVelarCoronal x✝ = 0
C₅ = UNIFORMITY(maŋ): one violation when the maŋ- prefix coalesces with the stem-initial obstruent. Only relevant for maŋ- forms.
Equations
- Fragments.Tagalog.Phonology.unifMang (Fragments.Tagalog.Phonology.NasalSubInput.mang_b, Fragments.Tagalog.Phonology.NasalSubOutput.yes) = 1
- Fragments.Tagalog.Phonology.unifMang (Fragments.Tagalog.Phonology.NasalSubInput.mang_k, Fragments.Tagalog.Phonology.NasalSubOutput.yes) = 1
- Fragments.Tagalog.Phonology.unifMang x✝ = 0
C₆ = UNIFORMITY(paŋ): one violation when the paŋ- prefix coalesces. Only relevant for paŋ- forms.
Equations
- Fragments.Tagalog.Phonology.unifPang (Fragments.Tagalog.Phonology.NasalSubInput.pang_b, Fragments.Tagalog.Phonology.NasalSubOutput.yes) = 1
- Fragments.Tagalog.Phonology.unifPang (Fragments.Tagalog.Phonology.NasalSubInput.pang_k, Fragments.Tagalog.Phonology.NasalSubOutput.yes) = 1
- Fragments.Tagalog.Phonology.unifPang x✝ = 0
The six constraints as a Fin 6-indexed family.
Equations
- Fragments.Tagalog.Phonology.constraints ⟨0, isLt⟩ = Fragments.Tagalog.Phonology.starNC
- Fragments.Tagalog.Phonology.constraints ⟨1, isLt⟩ = Fragments.Tagalog.Phonology.starNCvoiceless
- Fragments.Tagalog.Phonology.constraints ⟨2, isLt⟩ = Fragments.Tagalog.Phonology.starStemVelar
- Fragments.Tagalog.Phonology.constraints ⟨3, isLt⟩ = Fragments.Tagalog.Phonology.starStemVelarCoronal
- Fragments.Tagalog.Phonology.constraints ⟨4, isLt⟩ = Fragments.Tagalog.Phonology.unifMang
- Fragments.Tagalog.Phonology.constraints ⟨5, isLt⟩ = Fragments.Tagalog.Phonology.unifPang
Violation difference Δₖ(x) = Cₖ(x, NO) − Cₖ(x, YES) for each underlying form x and constraint k. Positive Δ favors YES.
Equations
- Fragments.Tagalog.Phonology.violDiffProfile ⟨0, isLt⟩ x✝ = 1
- Fragments.Tagalog.Phonology.violDiffProfile ⟨1, isLt⟩ Fragments.Tagalog.Phonology.NasalSubInput.mang_k = 1
- Fragments.Tagalog.Phonology.violDiffProfile ⟨1, isLt⟩ Fragments.Tagalog.Phonology.NasalSubInput.pang_k = 1
- Fragments.Tagalog.Phonology.violDiffProfile ⟨1, isLt⟩ x✝ = 0
- Fragments.Tagalog.Phonology.violDiffProfile ⟨2, isLt⟩ Fragments.Tagalog.Phonology.NasalSubInput.mang_k = -1
- Fragments.Tagalog.Phonology.violDiffProfile ⟨2, isLt⟩ Fragments.Tagalog.Phonology.NasalSubInput.pang_k = -1
- Fragments.Tagalog.Phonology.violDiffProfile ⟨2, isLt⟩ x✝ = 0
- Fragments.Tagalog.Phonology.violDiffProfile ⟨3, isLt⟩ Fragments.Tagalog.Phonology.NasalSubInput.mang_k = -1
- Fragments.Tagalog.Phonology.violDiffProfile ⟨3, isLt⟩ Fragments.Tagalog.Phonology.NasalSubInput.pang_k = -1
- Fragments.Tagalog.Phonology.violDiffProfile ⟨3, isLt⟩ x✝ = 0
- Fragments.Tagalog.Phonology.violDiffProfile ⟨4, isLt⟩ Fragments.Tagalog.Phonology.NasalSubInput.mang_b = -1
- Fragments.Tagalog.Phonology.violDiffProfile ⟨4, isLt⟩ Fragments.Tagalog.Phonology.NasalSubInput.mang_k = -1
- Fragments.Tagalog.Phonology.violDiffProfile ⟨4, isLt⟩ x✝ = 0
- Fragments.Tagalog.Phonology.violDiffProfile ⟨5, isLt⟩ Fragments.Tagalog.Phonology.NasalSubInput.pang_b = -1
- Fragments.Tagalog.Phonology.violDiffProfile ⟨5, isLt⟩ Fragments.Tagalog.Phonology.NasalSubInput.pang_k = -1
- Fragments.Tagalog.Phonology.violDiffProfile ⟨5, isLt⟩ x✝ = 0
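The listed values can be recomputed from the definition Δₖ(x) = Cₖ(x, NO) − Cₖ(x, YES). The sketch below does this for two of the constraints, using local copies of their listed equations. The types `Input` and `Output` and the helper `violDiff` are hypothetical stand-ins declared here so the block is self-contained; the subtraction is carried out in `Int` because the difference can be negative.

```lean
namespace ViolDiffSketch

inductive Input where
  | mang_b | mang_k | pang_b | pang_k

inductive Output where
  | yes | no

/-- C₂ = *NC̥, copied from the equations listed for `starNCvoiceless`. -/
def starNCvoiceless : Input × Output → Nat
  | (.mang_k, .no) => 1
  | (.pang_k, .no) => 1
  | _ => 0

/-- C₅ = UNIFORMITY(maŋ), copied from the equations listed for `unifMang`. -/
def unifMang : Input × Output → Nat
  | (.mang_b, .yes) => 1
  | (.mang_k, .yes) => 1
  | _ => 0

/-- Δ(x) = C(x, NO) − C(x, YES), as an integer. -/
def violDiff (c : Input × Output → Nat) (x : Input) : Int :=
  Int.ofNat (c (x, .no)) - Int.ofNat (c (x, .yes))

-- These agree with the listed profile entries for indices 1 and 4.
example : violDiff starNCvoiceless .mang_k = 1 := by decide
example : violDiff starNCvoiceless .mang_b = 0 := by decide
example : violDiff unifMang .mang_b = -1 := by decide
example : violDiff unifMang .pang_b = 0 := by decide

end ViolDiffSketch
```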
Empirical rates of nasal substitution from @cite{zuraw-2010} type frequencies, arranged per @cite{magri-2025}'s 2×2 square (@cite{zuraw-hayes-2017}). The four cells correspond to the two extreme prefixes (maŋ-other = highest rate, paŋ-res = lowest) crossed with /b/ (voiced) and /k/ (voiceless).
Equations
- Fragments.Tagalog.Phonology.nasalSubRate Fragments.Tagalog.Phonology.NasalSubInput.mang_b = 916 / 1000
- Fragments.Tagalog.Phonology.nasalSubRate Fragments.Tagalog.Phonology.NasalSubInput.mang_k = 993 / 1000
- Fragments.Tagalog.Phonology.nasalSubRate Fragments.Tagalog.Phonology.NasalSubInput.pang_b = 434 / 1000
- Fragments.Tagalog.Phonology.nasalSubRate Fragments.Tagalog.Phonology.NasalSubInput.pang_k = 909 / 1000
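Since all four rates share the denominator 1000, their ordering can be read off the numerators. The following sketch checks the row and column orderings on a local copy of the numerators; `Input` and `ratePerMille` are hypothetical names introduced for this illustration only.

```lean
namespace RateSketch

inductive Input where
  | mang_b | mang_k | pang_b | pang_k

/-- Numerators of the listed rates (common denominator 1000). -/
def ratePerMille : Input → Nat
  | .mang_b => 916
  | .mang_k => 993
  | .pang_b => 434
  | .pang_k => 909

-- Within each prefix, voiceless /k/ substitutes more often than voiced /b/.
example : ratePerMille .mang_b < ratePerMille .mang_k := by decide
example : ratePerMille .pang_b < ratePerMille .pang_k := by decide

-- Within each stem consonant, maŋ- substitutes more often than paŋ-.
example : ratePerMille .pang_b < ratePerMille .mang_b := by decide
example : ratePerMille .pang_k < ratePerMille .mang_k := by decide

end RateSketch
```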
The violation differences cast to ℝ, for use with me_predicts_hz.
Violation difference independence: the violation differences Δₖ satisfy ViolDiffIndependence on the nasal substitution square.
- C₁–C₄ (markedness): Δₖ is the same for /maŋ+X/ and /paŋ+X/ (insensitive to prefix = row)
- C₅–C₆ (faithfulness): Δₖ is the same for /X+b/ and /X+k/ (insensitive to stem = column)
This is a data-level property of the constraint violation profiles, used by both @cite{zuraw-hayes-2017} and @cite{magri-2025}.
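The two bullet points can be checked directly on the listed violation differences. The sketch below is not the library's `ViolDiffIndependence` statement, whose exact form is not shown here; it assumes the reading given in the bullets, and it replaces the `Fin 6` index with a hypothetical enumeration `K` (`c1` to `c6`) to keep the pattern matching simple.

```lean
namespace IndependenceSketch

inductive Input where
  | mang_b | mang_k | pang_b | pang_k

/-- Hypothetical stand-in for the `Fin 6` index: C₁ to C₆. -/
inductive K where
  | c1 | c2 | c3 | c4 | c5 | c6

/-- Local copy of the equations listed for `violDiffProfile`. -/
def violDiff : K → Input → Int
  | .c1, _ => 1
  | .c2, .mang_k => 1
  | .c2, .pang_k => 1
  | .c2, _ => 0
  | .c3, .mang_k => -1
  | .c3, .pang_k => -1
  | .c3, _ => 0
  | .c4, .mang_k => -1
  | .c4, .pang_k => -1
  | .c4, _ => 0
  | .c5, .mang_b => -1
  | .c5, .mang_k => -1
  | .c5, _ => 0
  | .c6, .pang_b => -1
  | .c6, .pang_k => -1
  | .c6, _ => 0

/-- Markedness reading: Δ is the same for /maŋ+X/ and /paŋ+X/. -/
def prefixInsensitive (k : K) : Bool :=
  violDiff k .mang_b == violDiff k .pang_b &&
  violDiff k .mang_k == violDiff k .pang_k

/-- Faithfulness reading: Δ is the same for /X+b/ and /X+k/. -/
def stemInsensitive (k : K) : Bool :=
  violDiff k .mang_b == violDiff k .mang_k &&
  violDiff k .pang_b == violDiff k .pang_k

example : [K.c1, K.c2, K.c3, K.c4].all prefixInsensitive = true := by decide
example : [K.c5, K.c6].all stemInsensitive = true := by decide

end IndependenceSketch
```

Using an explicit list with `List.all` keeps the check decidable without a finiteness instance for `K`, so the sketch does not depend on Mathlib.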