Method Comparison

View your selected methods side by side; rows that differ are highlighted (marked ≠).

	N-gram Analysis	Keyness Analysis
Field	Linguistics	Linguistics
Method family	Process / pipeline	Process / pipeline
Year of origin ≠	1999	1997
Proposed by ≠	Corpus linguists (Douglas Biber; lexical bundles tradition)	Mike Scott
Type ≠	Frequency analysis of contiguous word sequences	Corpus comparison of relative word frequencies
Seminal work ≠	Biber, D., Johansson, S., Leech, G., Conrad, S., & Finegan, E. (1999). Longman Grammar of Spoken and Written English. Longman. ISBN: 9780582237254	Scott, M. (1997). PC analysis of key words — and key key words. System, 25(2), 233–245.
Also known as	Lexical Bundle Analysis, Cluster Analysis (corpus linguistics), Contiguous Sequence Analysis	Keyword Analysis, Corpus Keyness, Keyness Statistics
Related ≠	4	3
Summary ≠	N-gram analysis is a corpus-linguistic technique that extracts and ranks every contiguous sequence of n words (or characters) in a corpus, exposing the recurrent multi-word units (two-word bigrams, three-word trigrams, and longer 'lexical bundles') that make up a register or text type. By counting how often each sequence recurs, it reveals the prefabricated, formulaic backbone of language that single-word frequency lists cannot capture.	Keyness analysis identifies the words that are characteristically frequent (or infrequent) in a target corpus relative to a reference corpus, using statistical tests to measure how unexpected each word's frequency is. Introduced by Mike Scott in 1997, it answers the question 'what is this text or collection distinctively about?' and is a central technique in corpus linguistics and corpus-assisted discourse analysis for surfacing the salient vocabulary of a genre, period, author, or social group.
ScholarGate dataset	v1, 3 sources, PUBLISHED	v1, 3 sources, PUBLISHED
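
To make the two summaries concrete, here is a minimal Python sketch of both methods. It is illustrative only: the table does not say which statistical test keyness analysis should use, so the two-cell log-likelihood (G2) statistic below is an assumption, chosen as one commonly used option. The function names (`ngram_counts`, `log_likelihood`, `keyness`) are hypothetical, and tokenization is left to the caller.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count every contiguous sequence of n tokens (n-gram analysis)."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def log_likelihood(freq_target, size_target, freq_ref, size_ref):
    """Two-cell log-likelihood (G2) for one word in two corpora.

    Assumption: G2 stands in for the unspecified "statistical tests".
    Expected counts assume the word is equally likely in both corpora.
    """
    total = freq_target + freq_ref
    combined_size = size_target + size_ref
    expected_target = size_target * total / combined_size
    expected_ref = size_ref * total / combined_size
    g2 = 0.0
    for observed, expected in ((freq_target, expected_target),
                               (freq_ref, expected_ref)):
        if observed > 0:  # a zero count contributes nothing to the sum
            g2 += observed * math.log(observed / expected)
    return 2 * g2


def keyness(target_tokens, ref_tokens):
    """Rank words by signed G2: positive means relatively more frequent
    in the target corpus, negative means relatively less frequent."""
    if not target_tokens or not ref_tokens:
        raise ValueError("both corpora must contain at least one token")
    target, ref = Counter(target_tokens), Counter(ref_tokens)
    n_target, n_ref = len(target_tokens), len(ref_tokens)
    scores = {}
    for word in set(target) | set(ref):
        g2 = log_likelihood(target[word], n_target, ref[word], n_ref)
        overused = target[word] / n_target >= ref[word] / n_ref
        scores[word] = g2 if overused else -g2
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)
```

Calling `ngram_counts(tokens, 3).most_common(20)` would list the twenty most frequent trigrams, and the first entries returned by `keyness(target_tokens, ref_tokens)` are the candidate key words of the target corpus. A real study would normally also apply a minimum-frequency cutoff and a significance threshold, which this sketch omits.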
