ScholarGate
Assistant
Process / pipelineQuantitative historical linguistics

Lexicostatistics

Lexicostatistics is a quantitative method in historical linguistics that gauges how closely two or more languages are genealogically related by measuring the percentage of cognates they share within a fixed list of basic, culture-neutral vocabulary — classically Morris Swadesh's 100- or 200-word list. By converting word comparisons into similarity percentages, it produces a matrix of pairwise scores from which subgroupings within a language family can be inferred. It is the statistical core that underlies glottochronology, but on its own it makes no claim about absolute dates — it speaks only to degree of relatedness.

Open in MethodMindSoonApply, compare, get guidance
Tools & resources
Download slides
Learn & explore
VideoSoon

Read the full method

Members only

Sign in with a free account to read this section.

Sign in

Method map

The neighbourhood of related methods — select a node to explore.

Sources

  1. Swadesh, M. (1952). Lexico-statistic dating of prehistoric ethnic contacts. Proceedings of the American Philosophical Society, 96(4), 452–463. link
  2. Campbell, L. (2013). Historical Linguistics: An Introduction (3rd ed.). Edinburgh University Press. ISBN: 9780748675593

How to cite this page

ScholarGate. (2026, June 22). Lexicostatistics. ScholarGate. https://scholargate.app/en/linguistics/lexicostatistics

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side

Referenced by

ScholarGateLexicostatistics (Lexicostatistics). Retrieved 2026-06-24 from https://scholargate.app/en/linguistics/lexicostatistics · Dataset: https://doi.org/10.5281/zenodo.20539026