Process / pipelineEmpirical Linguistics
Corpus Linguistics
Corpus Linguistics is the study of language based on large, representative collections of texts (corpora) processed by computer. Pioneered by John Sinclair and others, the method uses statistical analysis, concordancing, and computational tools to examine patterns of actual language use. Corpus linguistics has transformed our understanding of English and other languages, revealing frequency patterns, collocation preferences, and register variation that were previously hidden. It serves theoretical linguistics, applied language teaching, and natural language processing.
Open in MethodMindSoonVideoSoon
Read the full method
Members only
Sign inSign in with a free account to read this section.
Sources
- Sinclair, J. M. (1991). Corpus, Concordance, Collocation. Oxford: Oxford University Press. DOI: 10.1093/acprof:oso/9780194371726.001.0001 ↗
- McEnery, T., & Hardie, A. (2012). Corpus Linguistics: Method, Theory and Practice. Cambridge: Cambridge University Press. DOI: 10.1017/CBO9780511981395 ↗
- Biber, D., Conrad, S., & Reppen, R. (2006). Corpus Linguistics: Investigating Language Structure and Use. Cambridge: Cambridge University Press. DOI: 10.1017/CBO9780511618529 ↗