Frequency-magnitude relation of numeral words based on search-engine results

Abstract:

We googled numeral words from 28 languages and investigated several approaches to describing the resulting data. In all 28 languages, the observed dependence of frequency on magnitude is fitted better by a power law than by an exponential law. The result can be used to distinguish grammatically correct from incorrect numerals based on predicted search-result counts.
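
The comparison between the two laws can be illustrated with a minimal sketch. It assumes that frequency is measured as a hit count per numeral and that both candidate laws are fitted by ordinary least squares in log space; the abstract does not state the fitting procedure actually used, so this is only one plausible way to do it. The function names `linear_fit` and `compare_laws` are hypothetical, and the counts are synthetic values generated from a power law, not data from the study.

```python
import math


def linear_fit(xs, ys):
    """Ordinary least squares for y = intercept + slope * x.

    Returns (intercept, slope, residual_sum_of_squares).
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    rss = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    return intercept, slope, rss


def compare_laws(magnitudes, counts):
    """Fit a power law and an exponential law to (magnitude, count) pairs.

    Both fits are done on log(count), so the residuals are comparable:
      power law:   log f = log a + k * log n
      exponential: log f = log a + c * n
    """
    log_counts = [math.log(c) for c in counts]
    _, k, rss_power = linear_fit([math.log(m) for m in magnitudes], log_counts)
    _, c, rss_exp = linear_fit(list(magnitudes), log_counts)
    return {
        "power_exponent": k,
        "exp_rate": c,
        "rss_power": rss_power,
        "rss_exp": rss_exp,
        "better": "power" if rss_power < rss_exp else "exponential",
    }


# Synthetic illustration only: counts generated from f = 1e9 * n^(-1.5).
magnitudes = [1, 2, 5, 10, 20, 50, 100, 200, 500, 1000]
counts = [1e9 * m ** -1.5 for m in magnitudes]
result = compare_laws(magnitudes, counts)
print(result["better"], round(result["power_exponent"], 3))
```

Fitting in log space keeps the sketch dependency-free and puts both models on the same residual scale. With real search counts, a nonlinear fit or a likelihood-based comparison might be preferable.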


Year: 2025
In session: Computational linguistics and LLM-related systems
Pages: 51 to 60