Documentation

Linglib.Phenomena.Morphology.Studies.AckermanMalouf2013

@cite{ackerman-malouf-2013}: Bridge Theorems @cite{ackerman-malouf-2013} #

@cite{carstairs-mccarthy-2010}

Verification theorems connecting the cross-linguistic typological data to the LCEC predictions. Each theorem proves that a language's reported I-complexity falls below the LCEC threshold.

Structure #

Each language's reported I-complexity is below the 1-bit threshold. These are "per-datum verification theorems" in linglib's sense: changing a language's avgCondEntropy breaks exactly the corresponding theorem.

The LCEC's key prediction: E-complexity and I-complexity are dissociated. A language can have enormous E-complexity but low I-complexity.

Mazatec's I-complexity is still below 1 bit despite 109 classes.

Kwerba has minimal E-complexity (2 classes) but its I-complexity is not the lowest — German (7 classes) has lower I-complexity. This shows E-complexity doesn't predict I-complexity in either direction.

Spanish has only 3 classes but 57 cells — yet its I-complexity is the lowest in the sample (0.003 bits). More cells with fewer classes means more implicative structure.

The Mazatec case study (§4 of the paper) demonstrates that the observed I-complexity is far below what random assignment of inflection-class patterns would produce.

Mazatec's observed I-complexity is far below the random baseline. Observed: 0.709 bits. Random permutation baseline: ~5.25 bits. The observed value is less than 14% of the random baseline.

The ratio of observed to random I-complexity is less than 1/7. (0.709 / 5.25 ≈ 0.135, i.e., ~13.5% of random)

Mazatec has nonzero I-complexity: it violates @cite{carstairs-mccarthy-2010}'s synonymy avoidance but satisfies the LCEC. This witnesses that the LCEC is strictly weaker than synonymy avoidance.