Recursive numeral systems are highly regular and easy to process.
Paper i proceeding, 2026

Much recent work has shown how cross-linguistic variation is constrained by competing pressures from efficient communication. However, little attention has been paid to the role of the systematicity of forms (regularity), a key property of natural language. Here, we demonstrate the importance of regularity in explaining the shape of linguistic systems by looking at recursive numeral systems. Previous work has argued that these systems optimise the trade-off between lexicon size and average morphosyntatic complexity (Denic and Szymanik,2024). However, showing that only natural-language-like systems optimise this trade-off has proven elusive, and existing solutions rely on ad-hoc constraints to rule out unnatural systems (Yang and Regier, 2025). Drawing on the Minimum Description Length (MDL) approach, we argue that recursive numeral systems are better viewed as efficient with regard to their regularity and processing complexity. We show that our MDL-based measures of regularity and processing complexity better capture the key differences between attested, natural systems and theoretically possible ones, including “optimal” recursive numeral systems from previous work, and that the ad-hoc constraints naturally follow from regularity. Our approach highlights the need to incorporate regularity across sets of forms in studies attempting to measure efficiency in language.

Författare

Ponrawee Prasertsom

University of Edinburgh

Andrea Silvi

Chalmers, Data- och informationsteknik, Data Science och AI

Jennifer Culbertson

University of Edinburgh

Devdatt Dubhashi

Chalmers, Data- och informationsteknik, Data Science och AI

Moa Johansson

Chalmers, Data- och informationsteknik, Data Science och AI

Kenny Smith

University of Edinburgh

Association for Computational Linguistics. European Chapter . Proceedings of the Conference.

1525-2450 (ISSN)

Vol. Volume 1: Long Papers 4873-4885

19th Conference of the European Chapter of the Association for Computational Linguistics
Rabat, Morocco,

Ämneskategorier (SSIF 2025)

Jämförande språkvetenskap och allmän lingvistik

DOI

10.18653/v1/2026.eacl-long.226

Mer information

Senast uppdaterat

2026-05-29