Σύγκριση μεθόδων

Εξετάστε τις επιλεγμένες μεθόδους δίπλα-δίπλα· οι γραμμές που διαφέρουν επισημαίνονται.

	Επισήμανση Μέρους του Λόγου (Part-of-Speech Tagging - POS Tagging)×	Μορφολογική Ανάλυση ×
Πεδίο	Εξόρυξη Κειμένου	Εξόρυξη Κειμένου
Οικογένεια	Process / pipeline	Process / pipeline
Έτος προέλευσης≠	—	1980
Δημιουργός≠	—	M.F. Porter (Porter stemmer)
Τύπος≠	NLP sequence-labelling task	Text-normalisation preprocessing task
Θεμελιώδης πηγή≠	Ratnaparkhi, A. (1996). A Maximum Entropy Model for Part-Of-Speech Tagging. EMNLP. link ↗	Porter, M.F. (1980). An Algorithm for Suffix Stripping. Program, 14(3), 130-137. DOI ↗
Εναλλακτικές ονομασίες	part-of-speech tagging, grammatical tagging, Sözcük Türü Etiketleme (POS Tagging)	stemming, lemmatization, Morfolojik Analiz ve Kök Bulma
Συναφείς≠	3	4
Σύνοψη≠	Part-of-speech tagging assigns a grammatical category label — noun, verb, adjective, and so on — to every word in a text. It is a foundational natural-language-processing task, formalised as a statistical model by Ratnaparkhi (1996) and packaged into widely used toolkits such as Stanford CoreNLP (Manning et al., 2014), and it serves as a preliminary step for syntactic analysis and information extraction.	Morphological analysis splits words into their stems and affixes so that different surface forms of the same word can be treated as one. It covers two complementary approaches — rule-based stemming, such as the Porter (1980) and Snowball algorithms, and dictionary-aware lemmatization — and is a critical text-normalisation step for agglutinative languages such as Turkish and Arabic.
ScholarGateΣύνολο δεδομένων ↗	v1 2 Πηγές PUBLISHED	v1 2 Πηγές PUBLISHED

Μετάβαση στην αναζήτηση → Λήψη διαφανειών