Type-Token Ratio
The type-token ratio (TTR) is the oldest and most widely used measure of lexical diversity: the number of distinct word types in a text divided by the total number of word tokens. A text in which few words repeat yields a TTR near 1, while a text that recycles a small vocabulary yields a TTR near 0. Despite its intuitive appeal and trivial computation, the raw ratio is severely confounded by text length, which has motivated a long line of length-correcting transformations and, ultimately, the more robust indices that have largely superseded it for serious comparison.
阅读完整方法
使用免费账户登录即可阅读本节。
方法图谱
相关方法的邻域——选择一个节点以展开探索。
来源
- Johnson, W. (1944). Studies in language behavior: A program of research. Psychological Monographs, 56(2), 1–15. DOI: 10.1037/h0093508 ↗
- Malvern, D., Richards, B., Chipere, N., & Durán, P. (2004). Lexical Diversity and Language Development: Quantification and Assessment. Palgrave Macmillan. ISBN: 9781403902313
- McCarthy, P. M., & Jarvis, S. (2010). MTLD, vocd-D, and HD-D: A validation study of sophisticated approaches to lexical diversity assessment. Behavior Research Methods, 42(2), 381–392. DOI: 10.3758/BRM.42.2.381 ↗
如何引用本页
ScholarGate. (2026, June 22). Type-Token Ratio (TTR) for Lexical Diversity. ScholarGate. https://scholargate.app/zh/linguistics/type-token-ratio
选用哪种方法?
将本方法与其最相近的同类并置,并排研读——本馆将书籍铺陈于案上,取舍则由您定夺。
- 词汇丰富度文本挖掘↔ 比较
- Measure of Textual Lexical Diversity (MTLD)语言学↔ 比较
- N-gram Analysis语言学↔ 比较
- vocd-D (D Measure)语言学↔ 比较