ScholarGate

Compare methods

Review your selected methods side by side; rows that differ are highlighted.

Methods compared: Collostructional Analysis, Keyness Analysis

Field
  Collostructional Analysis: Linguistics
  Keyness Analysis: Linguistics

Family
  Collostructional Analysis: Process / pipeline
  Keyness Analysis: Process / pipeline

Year of origin
  Collostructional Analysis: 2003
  Keyness Analysis: 1997

Originator
  Collostructional Analysis: Anatol Stefanowitsch & Stefan Th. Gries
  Keyness Analysis: Mike Scott

Type
  Collostructional Analysis: Statistical association analysis of lexemes and grammatical constructions
  Keyness Analysis: Corpus comparison of relative word frequencies

Seminal source
  Collostructional Analysis: Stefanowitsch, A., & Gries, S. T. (2003). Collostructions: Investigating the interaction of words and constructions. International Journal of Corpus Linguistics, 8(2), 209–243.
  Keyness Analysis: Scott, M. (1997). PC analysis of key words — and key key words. System, 25(2), 233–245.

Aliases
  Collostructional Analysis: Collexeme Analysis, Distinctive Collexeme Analysis, Co-varying Collexeme Analysis
  Keyness Analysis: Keyword Analysis, Corpus Keyness, Keyness Statistics

Related
  Collostructional Analysis: 4
  Keyness Analysis: 3

Summary
  Collostructional Analysis: Collostructional analysis is a family of corpus-based methods, introduced by Anatol Stefanowitsch and Stefan Th. Gries in 2003, that quantify the mutual attraction or repulsion between specific words (lexemes) and the grammatical constructions they occur in. Rooted in construction grammar, it treats a construction (such as the ditransitive "V NP NP" or the "into-causative") as a meaningful unit and asks which words are statistically attracted to it or repelled by it. The core technique, simple collexeme analysis, cross-tabulates how often a lexeme occurs in the construction against how often the lexeme and the construction each occur without the other, and measures the strength of association, conventionally with a Fisher–Yates exact test. Two extensions cover near-synonymous constructions (distinctive collexeme analysis) and the joint behavior of two slots within one construction (co-varying collexeme analysis), making the method a rigorous quantitative window onto the lexis–grammar interface.
  Keyness Analysis: Keyness analysis identifies the words that are unusually frequent (or infrequent) in a target corpus relative to a reference corpus, using statistical tests to measure how unexpected each word's frequency is. Introduced by Mike Scott in 1997, it answers the question "what is this text or collection distinctively about?" and is a central technique in corpus linguistics and corpus-assisted discourse analysis for surfacing the salient vocabulary of a genre, period, author, or social group.

ScholarGate Dataset
  Collostructional Analysis: v1, 2 Sources, PUBLISHED
  Keyness Analysis: v1, 3 Sources, PUBLISHED
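
To make the two summaries more concrete, here is a minimal Python sketch of the arithmetic behind each method. It is an illustration under stated assumptions, not the procedure of any particular tool. For collostructional analysis it computes a one-tailed Fisher exact p-value on the 2x2 lexeme-by-construction table. For keyness analysis it computes a log-likelihood (G2) score; the summary only says "statistical tests", so the choice of log-likelihood is an assumption here, as is the convention of signing the score by direction. The function names and all counts in the usage lines are hypothetical.

```python
from math import comb, log


def fisher_attraction_p(a: int, b: int, c: int, d: int) -> float:
    """One-tailed Fisher exact p-value for attraction in a 2x2 table.

    a: lexeme inside the construction
    b: lexeme outside the construction
    c: other lexemes inside the construction
    d: other lexemes outside the construction
    """
    n = a + b + c + d
    lexeme_total = a + b
    construction_total = a + c
    # Probability of observing a or more co-occurrences under independence
    # (upper tail of the hypergeometric distribution).
    upper = min(lexeme_total, construction_total)
    favourable = sum(
        comb(lexeme_total, k) * comb(n - lexeme_total, construction_total - k)
        for k in range(a, upper + 1)
    )
    return favourable / comb(n, construction_total)


def log_likelihood_keyness(f_target: int, n_target: int,
                           f_ref: int, n_ref: int) -> float:
    """Signed log-likelihood (G2) for one word across two corpora.

    f_*: frequency of the word, n_*: total tokens in that corpus.
    Positive means relatively more frequent in the target corpus
    (the sign is an illustrative convention, not part of G2 itself).
    """
    pooled_rate = (f_target + f_ref) / (n_target + n_ref)
    expected_target = n_target * pooled_rate
    expected_ref = n_ref * pooled_rate
    g2 = 0.0
    for observed, expected in ((f_target, expected_target), (f_ref, expected_ref)):
        if observed > 0:  # a zero count contributes nothing to the sum
            g2 += observed * log(observed / expected)
    g2 *= 2
    overused = f_target / n_target >= f_ref / n_ref
    return g2 if overused else -g2


# Hypothetical counts, for illustration only.
p_value = fisher_attraction_p(a=30, b=70, c=970, d=98930)
keyness = log_likelihood_keyness(f_target=50, n_target=1000, f_ref=10, n_ref=1000)
print(f"collexeme p-value: {p_value:.3g}, keyness G2: {keyness:.2f}")
```

A smaller p-value indicates stronger attraction between lexeme and construction; a larger absolute G2 indicates a more distinctive word. In practice a library routine such as SciPy's fisher_exact would normally replace the hand-rolled sum, which is kept here only so the sketch stays free of dependencies.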


ScholarGate. Compare methods: Collostructional Analysis · Keyness Analysis. Retrieved 2026-06-24 from https://scholargate.app/en/compare