ScholarGate
Trợ lý

So sánh phương pháp

Xem các phương pháp đã chọn cạnh nhau; những hàng khác biệt được làm nổi bật.

Collostructional Analysis×N-gram Analysis×
Lĩnh vựcNgôn ngữ họcNgôn ngữ học
HọProcess / pipelineProcess / pipeline
Năm ra đời20031999
Người khởi xướngAnatol Stefanowitsch & Stefan Th. GriesCorpus linguists (Douglas Biber; lexical bundles tradition)
LoạiStatistical association analysis of lexemes and grammatical constructionsFrequency analysis of contiguous word sequences
Công trình gốcStefanowitsch, A., & Gries, S. T. (2003). Collostructions: Investigating the interaction of words and constructions. International Journal of Corpus Linguistics, 8(2), 209–243. DOI ↗Biber, D., Johansson, S., Leech, G., Conrad, S., & Finegan, E. (1999). Longman Grammar of Spoken and Written English. Longman. ISBN: 9780582237254
Tên gọi khácCollexeme Analysis, Distinctive Collexeme Analysis, Co-varying Collexeme AnalysisLexical Bundle Analysis, Cluster Analysis (corpus linguistics), Contiguous Sequence Analysis
Liên quan44
Tóm tắtCollostructional analysis is a family of corpus-based methods, introduced by Anatol Stefanowitsch and Stefan Th. Gries in 2003, that quantify the mutual attraction or repulsion between specific words (lexemes) and the grammatical constructions they occur in. Rooted in construction grammar, it treats a construction — such as the ditransitive "V NP NP" or the "into-causative" — as a meaningful unit and asks which words are statistically drawn to it or kept from it. The core technique, simple collexeme analysis, cross-tabulates how often a lexeme appears in the construction against how often each appears elsewhere, and measures the strength of association, conventionally with a Fisher–Yates exact test. Two extensions handle near-synonymous constructions (distinctive collexeme analysis) and the joint behavior of two slots within one construction (co-varying collexeme analysis), making the method a rigorous quantitative window onto the lexis–grammar interface.N-gram analysis is a corpus-linguistic technique that extracts and ranks every contiguous sequence of n words (or characters) in a corpus, exposing the recurrent multi-word units — two-word bigrams, three-word trigrams, and longer 'lexical bundles' — that make up a register or text type. By counting how often each sequence recurs, it reveals the prefabricated, formulaic backbone of language that single-word frequency lists cannot capture.
ScholarGateBộ dữ liệu
  1. v1
  2. 2 Nguồn tài liệu
  3. PUBLISHED
  1. v1
  2. 3 Nguồn tài liệu
  3. PUBLISHED

Đến trang tìm kiếm Tải xuống bản trình chiếu

ScholarGateSo sánh phương pháp: Collostructional Analysis · N-gram Analysis. Truy cập ngày 2026-06-24 từ https://scholargate.app/vi/compare