Word Reuse and Combination Support Efficient Communication #
@cite{xu-etal-2024}
Xu, A., Kemp, C., Frermann, L., & Xu, Y. (2024). Word reuse and combination support efficient communication of emerging concepts. PNAS 121(46), e2406971121.
Empirical Contributions #
Using WordNet data from English, French, and Finnish (1900–2000):
- Both reuse items and compounds sit near the Pareto frontier of communicative efficiency (Fig. 2)
- Attested encodings are more efficient than random and near-synonym baselines (Fig. 3)
- Literal items (hyponymic reuse, endocentric compounds) are more efficient than nonliteral counterparts (§Item-Level Variation)
- Reuse items tend shorter; compounds tend more informative (§Strategy Comparison)
Connection to Polysemy #
Word reuse IS polysemy generation: when "mouse" acquires the sense
"computer peripheral", the word becomes polysemous. This study provides
an information-theoretic explanation for WHY productive polysemy exists —
it is communicatively efficient. This bridges Phenomena.Polysemy.Data
(synchronic copredication judgments) to a diachronic functional account.
English reuse items from Table 1.
Equations
- One or more equations did not get rendered due to their size.
Instances For
English compounds from Table 1.
Equations
- One or more equations did not get rendered due to their size.
Instances For
French reuse items.
Equations
- One or more equations did not get rendered due to their size.
Instances For
French compounds.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Reuse items are shorter on average than compounds. @cite{xu-etal-2024} §Strategy Comparison: holds across all three languages and all time intervals.
Both strategies include literal and nonliteral items.
Word reuse creates polysemy: the reused word acquires a new sense alongside its existing one. This connects the diachronic process of lexicalization to the synchronic phenomenon of polysemy.
The copredication data in Phenomena.Polysemy.Data captures the
synchronic consequence of reuse (multiple aspects coexist);
this paper explains the diachronic cause (efficiency pressure).
Equations
- One or more equations did not get rendered due to their size.
Instances For
All reuse items in the English data generate polysemy.