Part-of-Speech Tagging
Part-of-speech (POS) tagging is the task of assigning each word (token) in a text its grammatical category — noun, verb, adjective, preposition, and finer distinctions such as past-tense verb or comparative adjective — drawn from a fixed tagset. Because the same word form can belong to different categories depending on context ("book a flight" versus "read a book"), tagging is fundamentally a disambiguation problem solved with contextual evidence. It is one of the oldest and most foundational tasks in natural language processing and corpus linguistics, supplying the grammatical layer on which concordancing, parsing, register analysis, and information extraction all depend. Modern taggers reach accuracies well above 97% on standard English benchmarks, using statistical sequence models or neural networks trained on annotated corpora.
방법 전문 읽기
무료 계정으로 로그인하면 이 섹션을 읽을 수 있습니다.
방법 지도
관련 방법들로 이루어진 인접 영역 — 노드를 선택해 살펴보세요.
출처
- Manning, C. D., Raghavan, P., & Schütze, H. (2008). Introduction to Information Retrieval. Cambridge University Press. ISBN: 9780521865715
- Jurafsky, D., & Martin, J. H. (2023). Speech and Language Processing (3rd ed. draft). Stanford University. link ↗
- Marcus, M. P., Marcinkiewicz, M. A., & Santorini, B. (1993). Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2), 313–330. link ↗
이 페이지 인용 방법
ScholarGate. (2026, June 22). Part-of-Speech Tagging in Corpus and Computational Linguistics. ScholarGate. https://scholargate.app/ko/linguistics/part-of-speech-tagging
어떤 방법일까요?
이 방법을 가장 가까운 동류의 방법들과 나란히 놓고 비교해 보세요 — 라이브러리는 책을 펼쳐 놓을 뿐, 선택은 여러분의 몫입니다.
- 연어 분석텍스트 마이닝↔ 비교
- Corpus Concordance Analysis언어학↔ 비교
- Multidimensional Register Analysis언어학↔ 비교
- N-gram Analysis언어학↔ 비교