方法对比
并排查看您选择的方法;存在差异的行会高亮显示。
| 格洛托年代学× | 词形分析× | |
|---|---|---|
| 领域≠ | 语言学 | 文本挖掘 |
| 方法族 | Process / pipeline | Process / pipeline |
| 起源年份≠ | 1950 | 1980 |
| 提出者≠ | Morris Swadesh | M.F. Porter (Porter stemmer) |
| 类型≠ | Empirical process pipeline | Text-normalisation preprocessing task |
| 开创性文献≠ | Swadesh, M. (1950). Salish internal relationships. International Journal of American Linguistics, 16(3), 157-167. DOI ↗ | Porter, M.F. (1980). An Algorithm for Suffix Stripping. Program, 14(3), 130-137. DOI ↗ |
| 别名≠ | Lexicostatistics, Glottochronological Dating | stemming, lemmatization, Morfolojik Analiz ve Kök Bulma |
| 相关≠ | 2 | 4 |
| 摘要≠ | Glottochronology, or lexicostatistics, is a quantitative method in historical linguistics that estimates the time of divergence between related languages based on the proportion of shared cognates in their basic vocabularies. Developed by Morris Swadesh in 1950, the method assumes that core vocabulary items change at a relatively constant rate over time, allowing linguists to calculate a 'time depth'—how long ago two languages shared a common ancestor. Though controversial due to its restrictive assumptions, glottochronology provides rough temporal estimates when archaeological or written records are unavailable. | Morphological analysis splits words into their stems and affixes so that different surface forms of the same word can be treated as one. It covers two complementary approaches — rule-based stemming, such as the Porter (1980) and Snowball algorithms, and dictionary-aware lemmatization — and is a critical text-normalisation step for agglutinative languages such as Turkish and Arabic. |
| ScholarGate数据集 ↗ |
|
|