Compare methods
View the selected methods side by side; rows that differ are highlighted (marked ≠).
| | Spelling and grammar checking | N-gram language model |
|---|---|---|
| Field | Text mining | Text mining |
| Family | Process / pipeline | Process / pipeline |
| Year of origin ≠ | 2003 | n/a |
| Creator ≠ | Daniel Naber (rule-based checker); Peter Norvig (statistical spelling correction) | n/a |
| Type ≠ | Text-mining preprocessing / quality-assessment task | Statistical language model |
| Original source ≠ | Naber, D. (2003). A Rule-Based Style and Grammar Checker. Diploma Thesis. | Jurafsky, D. & Martin, J.H. (2023). Speech and Language Processing, 3rd ed. |
| Other names ≠ | spell checking, grammar checking, text proofing, Yazım ve Dilbilgisi Denetimi | n-gram model, statistical language model, N-gram Dil Modeli |
| Related | 4 | 4 |
| Summary ≠ | Spelling and grammar checking is a text-mining task that detects spelling mistakes and grammatical errors in text and proposes corrections. Building on Naber's rule-based style and grammar checker (2003) and Norvig's statistical spelling corrector (2009), it is used for data-quality assessment and text normalisation before further analysis. | An n-gram language model is a statistical model that predicts the probability of the next word by looking only at the previous n−1 words. Described in detail by Jurafsky and Martin (Speech and Language Processing), it provides foundational infrastructure for text generation, spelling correction, and speech recognition. |
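
The n-gram summary in the table says the model predicts the next word from only the previous n−1 words. A minimal sketch of that idea for n = 2 (a bigram model) is shown below, using plain maximum-likelihood counts with no smoothing. The function names `train_bigram_model` and `next_word_probability`, the `<s>`/`</s>` padding tokens, and the toy corpus are illustrative assumptions, not taken from Jurafsky and Martin or from any particular library.

```python
from collections import Counter


def train_bigram_model(sentences):
    """Count contexts and bigrams over tokenised sentences (n = 2)."""
    context_counts = Counter()  # how often each word appears as the previous word
    bigram_counts = Counter()   # how often each (previous, next) pair appears
    for tokens in sentences:
        # Hypothetical padding tokens marking sentence start and end.
        padded = ["<s>"] + tokens + ["</s>"]
        for prev, word in zip(padded, padded[1:]):
            context_counts[prev] += 1
            bigram_counts[(prev, word)] += 1
    return context_counts, bigram_counts


def next_word_probability(model, prev, word):
    """Maximum-likelihood estimate of P(word | prev); 0.0 for unseen contexts."""
    context_counts, bigram_counts = model
    if context_counts[prev] == 0:
        return 0.0
    return bigram_counts[(prev, word)] / context_counts[prev]


# Toy corpus, invented for illustration only.
corpus = [
    ["the", "cat", "sat"],
    ["the", "cat", "ran"],
    ["the", "dog", "sat"],
]
model = train_bigram_model(corpus)

# "the" is followed by "cat" in 2 of its 3 occurrences.
print(next_word_probability(model, "the", "cat"))
# "cat" is followed by "sat" in 1 of its 2 occurrences.
print(next_word_probability(model, "cat", "sat"))
```

Unsmoothed counts assign zero probability to any word pair not seen in training, so a practical model would normally add some form of smoothing or backoff. Scores of this kind are one way a spelling corrector could rank candidate corrections in context, which is the link between the two methods compared above.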