Sammenlign metoder
Gjennomgå de valgte metodene side om side; rader som avviker, er uthevet.
| Glottochronology× | Morfologisk analyse× | |
|---|---|---|
| Fagfelt≠ | Lingvistikk | Tekstutvinning |
| Familie | Process / pipeline | Process / pipeline |
| Opprinnelsesår≠ | 1950 | 1980 |
| Opphavsperson≠ | Morris Swadesh | M.F. Porter (Porter stemmer) |
| Type≠ | Empirical process pipeline | Text-normalisation preprocessing task |
| Opprinnelig kilde≠ | Swadesh, M. (1950). Salish internal relationships. International Journal of American Linguistics, 16(3), 157-167. DOI ↗ | Porter, M.F. (1980). An Algorithm for Suffix Stripping. Program, 14(3), 130-137. DOI ↗ |
| Alias≠ | Lexicostatistics, Glottochronological Dating | stemming, lemmatization, Morfolojik Analiz ve Kök Bulma |
| Relaterte≠ | 2 | 4 |
| Sammendrag≠ | Glottochronology, or lexicostatistics, is a quantitative method in historical linguistics that estimates the time of divergence between related languages based on the proportion of shared cognates in their basic vocabularies. Developed by Morris Swadesh in 1950, the method assumes that core vocabulary items change at a relatively constant rate over time, allowing linguists to calculate a 'time depth'—how long ago two languages shared a common ancestor. Though controversial due to its restrictive assumptions, glottochronology provides rough temporal estimates when archaeological or written records are unavailable. | Morphological analysis splits words into their stems and affixes so that different surface forms of the same word can be treated as one. It covers two complementary approaches — rule-based stemming, such as the Porter (1980) and Snowball algorithms, and dictionary-aware lemmatization — and is a critical text-normalisation step for agglutinative languages such as Turkish and Arabic. |
| ScholarGateDatasett ↗ |
|
|