
Part-of-Speech Tagging

Part-of-speech (POS) tagging is the task of assigning each token in a text its grammatical category, drawn from a fixed tagset: coarse classes such as noun, verb, adjective, and preposition, and finer distinctions such as past-tense verb or comparative adjective. Because the same word form can belong to different categories depending on context ("book" is a verb in "book a flight" but a noun in "read a book"), tagging is fundamentally a disambiguation problem that is resolved using contextual evidence. It is one of the oldest and most foundational tasks in natural language processing and corpus linguistics, and it supplies the grammatical layer on which concordancing, parsing, register analysis, and information extraction depend. Modern taggers, built on statistical sequence models or neural networks trained on annotated corpora, reach token-level accuracies of roughly 97% or higher on standard English benchmarks such as the Penn Treebank (Marcus et al., 1993).
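To make the disambiguation idea concrete, here is a minimal sketch of one classic statistical sequence model: a bigram hidden Markov model decoded with the Viterbi algorithm. Everything in it is an assumption made for illustration. The four-sentence training corpus, the coarse tag labels (`PRON`, `VERB`, `DET`, `NOUN`), the add-one smoothing, and the function names `train`, `viterbi`, and `tag` are hypothetical and are not taken from any particular tagger or treebank.

```python
import math
from collections import Counter, defaultdict

# Toy hand-tagged corpus (hypothetical; coarse tags, not a real treebank).
TRAIN = [
    [("i", "PRON"), ("read", "VERB"), ("the", "DET"), ("book", "NOUN")],
    [("book", "VERB"), ("a", "DET"), ("flight", "NOUN")],
    [("we", "PRON"), ("book", "VERB"), ("the", "DET"), ("room", "NOUN")],
    [("the", "DET"), ("book", "NOUN"), ("fell", "VERB")],
]
START = "<s>"  # pseudo-tag marking the beginning of a sentence


def train(corpus):
    """Count tag-to-tag transitions and tag-to-word emissions."""
    trans = defaultdict(Counter)  # trans[prev_tag][tag] = count
    emit = defaultdict(Counter)   # emit[tag][word] = count
    vocab = set()
    for sent in corpus:
        prev = START
        for word, tag in sent:
            trans[prev][tag] += 1
            emit[tag][word] += 1
            vocab.add(word)
            prev = tag
    tags = sorted(emit)
    # One extra vocabulary slot reserves probability mass for unseen words.
    return trans, emit, tags, len(vocab) + 1


def log_prob(counter, key, n_outcomes):
    """Add-one smoothed log-probability of `key` under `counter`."""
    return math.log((counter[key] + 1) / (sum(counter.values()) + n_outcomes))


def viterbi(words, model):
    """Return the most probable tag sequence for `words`."""
    trans, emit, tags, vocab_size = model
    if not words:
        return []
    # best[i][t] = (log-prob of the best path ending in tag t at position i,
    #               the previous tag on that path)
    best = [{}]
    for t in tags:
        score = (log_prob(trans[START], t, len(tags))
                 + log_prob(emit[t], words[0], vocab_size))
        best[0][t] = (score, None)
    for i in range(1, len(words)):
        best.append({})
        for t in tags:
            score, prev = max(
                (best[i - 1][p][0] + log_prob(trans[p], t, len(tags)), p)
                for p in tags
            )
            best[i][t] = (score + log_prob(emit[t], words[i], vocab_size), prev)
    # Follow the back-pointers from the best final tag.
    last = max(tags, key=lambda t: best[-1][t][0])
    path = [last]
    for i in range(len(words) - 1, 0, -1):
        last = best[i][last][1]
        path.append(last)
    return path[::-1]


MODEL = train(TRAIN)


def tag(sentence):
    """Lowercase, split on whitespace, and pair each token with its tag."""
    words = sentence.lower().split()
    return list(zip(words, viterbi(words, MODEL)))


if __name__ == "__main__":
    print(tag("book a flight"))
    print(tag("read a book"))
```

With this toy model, "book" is tagged `VERB` in "book a flight" and `NOUN` in "read a book". The word's own emission counts are identical for both tags, so the decision comes entirely from context: sentence-initial position favors a verb here, while a preceding determiner favors a noun. A real tagger would be trained on a large annotated corpus with a standard tagset and would handle unknown words far more carefully than add-one smoothing does.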


Sources

Manning, C. D., Raghavan, P., & Schütze, H. (2008). Introduction to Information Retrieval. Cambridge University Press. ISBN: 9780521865715
Jurafsky, D., & Martin, J. H. (2023). Speech and Language Processing (3rd ed. draft). Stanford University.
Marcus, M. P., Marcinkiewicz, M. A., & Santorini, B. (1993). Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2), 313–330.

How to cite this page

ScholarGate. (2026, June 22). Part-of-Speech Tagging in Corpus and Computational Linguistics. ScholarGate. https://scholargate.app/zh/linguistics/part-of-speech-tagging
