ScholarGate
Assistent
Process / pipelineCorpus linguistics

N-gram Analysis

N-gram analysis is a corpus-linguistic technique that extracts and ranks every contiguous sequence of n words (or characters) in a corpus, exposing the recurrent multi-word units — two-word bigrams, three-word trigrams, and longer 'lexical bundles' — that make up a register or text type. By counting how often each sequence recurs, it reveals the prefabricated, formulaic backbone of language that single-word frequency lists cannot capture.

Åbn i MethodMindSnartAnvend, sammenlign, få vejledning
Værktøjer og ressourcer
Hent slides
Lær og udforsk
VideoSnart

Læs hele metoden

Kun for medlemmer

Log ind med en gratis konto for at læse dette afsnit.

Log ind

Metodekort

Nabolaget af beslægtede metoder — vælg en knude for at udforske.

Kilder

  1. Biber, D., Johansson, S., Leech, G., Conrad, S., & Finegan, E. (1999). Longman Grammar of Spoken and Written English. Longman. ISBN: 9780582237254
  2. O'Keeffe, A., & McCarthy, M. (Eds.). (2010). The Routledge Handbook of Corpus Linguistics. Routledge. ISBN: 9780415464895
  3. Anthony, L. (2004). AntConc: A learner and classroom friendly, multi-platform corpus analysis toolkit. In Proceedings of IWLeL 2004: An Interactive Workshop on Language e-Learning (pp. 7–13). Waseda University. link

Sådan citerer du denne side

ScholarGate. (2026, June 22). N-gram Frequency Analysis in Corpus Linguistics. ScholarGate. https://scholargate.app/da/linguistics/n-gram-analysis

Hvilken metode?

Stil denne metode ved siden af dens nærmeste slægtninge, og læs dem side om side — biblioteket lægger bøgerne på bordet; valget er dit.

Sammenlign side om side

Refereret af

ScholarGateN-gram Analysis (N-gram Frequency Analysis in Corpus Linguistics). Hentet 2026-06-24 fra https://scholargate.app/da/linguistics/n-gram-analysis · Datasæt: https://doi.org/10.5281/zenodo.20539026