Process / pipeline
词形分析 — 词干提取与词形还原
词形分析将词语分解为其词干和词缀,以便将同一词语的不同表面形式视为一个。它涵盖两种互补的方法——基于规则的词干提取,如 Porter (1980) 和 Snowball 算法,以及考虑词典的词形还原——并且是土耳其语和阿拉伯语等黏着语的关键文本规范化步骤。
阅读完整方法
仅限会员
登录使用免费账户登录即可阅读本节。
Method map
The neighbourhood of related methods — select a node to explore.
来源
- Porter, M.F. (1980). An Algorithm for Suffix Stripping. Program, 14(3), 130-137. DOI: 10.1108/eb046814 ↗
- Schmid, H. (1994). Probabilistic Part-of-Speech Tagging Using Decision Trees. Proceedings of the International Conference on New Methods in Language Processing (NEMLAP). link ↗
如何引用本页
ScholarGate. (2026, June 1). Morphological Analysis and Stemming. ScholarGate. https://scholargate.app/zh/text-mining/morphological-analysis
Which method?
Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.
Compare side by side →