Módszerek összehasonlítása
Tekintse át a kiválasztott módszereket egymás mellett; az eltérő sorok kiemelve jelennek meg.
| Keyword-in-Context (KWIC) Analysis× | Kollokációelemzés× | |
|---|---|---|
| Tudományterület≠ | Nyelvtudomány | Szövegbányászat |
| Módszercsalád | Process / pipeline | Process / pipeline |
| Keletkezés éve≠ | 1960 | 1990 |
| Megalkotó≠ | H. P. Luhn (information retrieval); adopted in corpus linguistics by John Sinclair | Church & Hanks |
| Típus≠ | Indexing and display technique aligning a keyword with its surrounding co-text | Statistical text-mining technique |
| Alapmű≠ | Luhn, H. P. (1960). Key word-in-context index for technical literature (KWIC index). American Documentation, 11(4), 288–295. DOI ↗ | Church, K.W. & Hanks, P. (1990). Word Association Norms, Mutual Information, and Lexicography. Computational Linguistics, 16(1), 22-29. link ↗ |
| Alternatív nevek | KWIC Index, Key Word in Context, Concordance Line Display | word association, collocation extraction, Birliktelik Analizi (Collocation Analysis) |
| Kapcsolódó≠ | 4 | 3 |
| Összefoglaló≠ | Keyword-in-context (KWIC) analysis is the indexing and display technique that presents every occurrence of a chosen keyword aligned in a fixed central column, flanked by a set span of the words that precede and follow it. Invented by H. P. Luhn in 1960 to index technical literature, the KWIC format became the standard way to read a concordance: by stacking instances of the keyword so they line up vertically, it lets an analyst scan the surrounding co-text for recurrent neighbors and patterns. It is the specific display layer underlying broader corpus concordance work, valued because alignment turns a list of scattered occurrences into a visually legible pattern. Today KWIC views are the default output of every corpus-analysis tool and the entry point for studying collocation, colligation, and meaning in context. | Collocation analysis is a statistical text-mining technique that identifies word pairs or expressions that frequently occur together, using association measures rather than chance co-occurrence. Introduced in the lexicography work of Church and Hanks (1990), it is used for terminology extraction and language analysis, surfacing the multi-word units that carry meaning in a corpus. |
| ScholarGateAdatkészlet ↗ |
|
|