Morphological Analysis — Stemming and Lemmatization

Morphological analysis splits words into their stems and affixes so that different surface forms of the same word can be treated as one. It covers two complementary approaches: rule-based stemming, such as the Porter (1980) and Snowball algorithms, which strips affixes by pattern and may return a stem that is not itself a word, and dictionary-aware lemmatization, which maps each form to its dictionary headword. It is a critical text-normalisation step for morphologically rich languages such as Turkish (agglutinative) and Arabic (root-and-pattern).
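
The contrast between the two approaches can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the Porter or Snowball algorithm: the suffix rules are a small subset loosely modelled on Porter's plural and past-tense steps, and the names `SUFFIX_RULES`, `LEMMA_LEXICON`, `toy_stem` and `toy_lemmatize` are hypothetical, introduced here only for the example.

```python
# Toy sketch only: a handful of suffix rules, NOT the full Porter (1980) algorithm.
# The first rule whose suffix matches wins, so longer suffixes are listed first.
SUFFIX_RULES = [
    ("sses", "ss"),  # caresses -> caress
    ("ies", "i"),    # studies  -> studi (a stem need not be a real word)
    ("ss", "ss"),    # class    -> class (protect a double s from the next rule)
    ("s", ""),       # cats     -> cat
    ("ing", ""),     # walking  -> walk
    ("ed", ""),      # walked   -> walk
]

# Hypothetical miniature lexicon; a real lemmatizer relies on a full dictionary
# and usually on part-of-speech information as well.
LEMMA_LEXICON = {
    "went": "go",
    "mice": "mouse",
    "studies": "study",
    "walking": "walk",
}


def toy_stem(word: str) -> str:
    """Rule-based stemming: rewrite the first matching suffix."""
    word = word.lower()
    for suffix, replacement in SUFFIX_RULES:
        # Require at least three characters to remain before the suffix,
        # so that short words such as "sing" are left untouched.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: len(word) - len(suffix)] + replacement
    return word


def toy_lemmatize(word: str) -> str:
    """Dictionary-aware lemmatization: look the form up, else return it unchanged."""
    word = word.lower()
    return LEMMA_LEXICON.get(word, word)


if __name__ == "__main__":
    for form in ["walking", "studies", "went", "mice"]:
        print(form, toy_stem(form), toy_lemmatize(form))
```

In this sketch the stemmer turns "studies" into the non-word "studi" and leaves the irregular form "went" unchanged, while the lexicon lookup returns "study" and "go". That is the usual trade-off: stemming needs no dictionary but is blind to irregular forms, whereas lemmatization is only as good as its lexicon.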

Sources

  1. Porter, M.F. (1980). An Algorithm for Suffix Stripping. Program, 14(3), 130-137. DOI: 10.1108/eb046814
  2. Schmid, H. (1994). Probabilistic Part-of-Speech Tagging Using Decision Trees. Proceedings of the International Conference on New Methods in Language Processing (NEMLAP).

ScholarGate. Morphological Analysis (Morphological Analysis and Stemming). Retrieved 2026-06-04 from https://scholargate.app/en/text-mining/morphological-analysis