vocd-D (D Measure)
vocd-D, also called the D measure, is a length-robust index of lexical diversity developed by David Malvern and Brian Richards. Instead of reporting a single type-token ratio (TTR), it characterizes how a text's TTR falls as sample size grows and fits that empirical curve to a one-parameter probabilistic model; the fitted parameter D is the diversity score, with higher D meaning richer vocabulary. HD-D, introduced by McCarthy and Jarvis, is the mathematically exact, sampling-free counterpart: it computes the same underlying quantity directly from the hypergeometric distribution rather than from random samples.
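The two ideas above can be sketched in a few lines of Python. This is a minimal illustration, not the reference vocd implementation, and it rests on several assumptions. It uses the curve TTR = (D/N)(sqrt(1 + 2N/D) - 1) usually given for the D model, the 35 to 50 token sample sizes and the 42-token HD-D sample size that are commonly reported, and a plain grid search over D in place of the original fitting routine. It also fits D to the exact hypergeometric expectation instead of to averaged random samples, so its values will not match the sampling-based tool exactly. The names `expected_ttr`, `hdd`, `model_ttr` and `fit_d` are hypothetical.

```python
from collections import Counter
from math import comb, sqrt


def expected_ttr(tokens, n):
    """Exact expected TTR of an n-token sample drawn without replacement."""
    total = len(tokens)
    if not 0 < n <= total:
        raise ValueError("sample size must be between 1 and len(tokens)")
    denom = comb(total, n)
    # A type with frequency f is absent from the sample with probability
    # C(total - f, n) / C(total, n); summing the presence probabilities
    # gives the expected number of distinct types in the sample.
    expected_types = sum(
        1 - comb(total - freq, n) / denom
        for freq in Counter(tokens).values()
    )
    return expected_types / n


def hdd(tokens, sample_size=42):
    """HD-D style score: expected TTR at a fixed sample size (42 assumed)."""
    return expected_ttr(tokens, sample_size)


def model_ttr(d, n):
    """TTR predicted by the one-parameter D model for n tokens (assumed form)."""
    return (d / n) * (sqrt(1 + 2 * n / d) - 1)


def fit_d(tokens, sizes=range(35, 51)):
    """Least-squares D over the given sample sizes, found by grid search."""
    targets = [(n, expected_ttr(tokens, n)) for n in sizes]

    def loss(d):
        return sum((model_ttr(d, n) - ttr) ** 2 for n, ttr in targets)

    # Assumption: D lies in [0.1, 300.0]; the grid step of 0.1 sets precision.
    candidates = [k / 10 for k in range(1, 3001)]
    return min(candidates, key=loss)


if __name__ == "__main__":
    repetitive = ("the cat sat on the mat and the dog sat on the log " * 10).split()
    varied = [f"w{i % 60}" for i in range(120)]
    print("HD-D:", hdd(repetitive), hdd(varied))
    print("D:   ", fit_d(repetitive), fit_d(varied))
```

Both functions need at least as many tokens as the largest sample size (50 by default for `fit_d`). A text whose every token is unique pushes D toward the upper edge of the assumed grid, so a result of 300.0 should be read as "off the scale" and not as a fitted value.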
Sources
- Malvern, D., Richards, B., Chipere, N., & Durán, P. (2004). Lexical Diversity and Language Development: Quantification and Assessment. Palgrave Macmillan. ISBN: 9781403902313
- McCarthy, P. M., & Jarvis, S. (2007). vocd: A theoretical and empirical evaluation. Language Testing, 24(4), 459–488. DOI: 10.1177/0265532207080767
- McCarthy, P. M., & Jarvis, S. (2010). MTLD, vocd-D, and HD-D: A validation study of sophisticated approaches to lexical diversity assessment. Behavior Research Methods, 42(2), 381–392. DOI: 10.3758/BRM.42.2.381
How to cite this page
ScholarGate. (2026, June 22). vocd-D / HD-D Measure of Lexical Diversity. ScholarGate. https://scholargate.app/zh/linguistics/vocd-lexical-diversity
Which method to choose?
Closely related methods to compare with this one:
- Lexical Richness (text mining)
- Measure of Textual Lexical Diversity (MTLD) (linguistics)
- N-gram Analysis (linguistics)
- Type-Token Ratio (linguistics)