方法对比
并排查看您选择的方法;存在差异的行会高亮显示。
| 拼写和语法检查× | N-gram语言模型× | |
|---|---|---|
| 领域 | 文本挖掘 | 文本挖掘 |
| 方法族 | Process / pipeline | Process / pipeline |
| 起源年份≠ | 2003 | — |
| 提出者≠ | Daniel Naber (rule-based checker); Peter Norvig (statistical spelling correction) | — |
| 类型≠ | Text-mining preprocessing / quality-assessment task | Statistical language model |
| 开创性文献≠ | Naber, D. (2003). A Rule-Based Style and Grammar Checker. Diploma Thesis. link ↗ | Jurafsky, D. & Martin, J.H. (2023). Speech and Language Processing, 3rd ed. link ↗ |
| 别名≠ | spell checking, grammar checking, text proofing, Yazım ve Dilbilgisi Denetimi | n-gram model, statistical language model, N-gram Dil Modeli |
| 相关 | 4 | 4 |
| 摘要≠ | Spelling and grammar checking is a text-mining task that detects spelling mistakes and grammatical errors in text and proposes corrections. Building on Naber's rule-based style and grammar checker (2003) and Norvig's statistical spelling corrector (2009), it is used for data-quality assessment and text normalisation before further analysis. | An n-gram language model is a statistical model that predicts the probability of the next word by looking only at the previous n−1 words. Described in detail by Jurafsky and Martin (Speech and Language Processing), it provides foundational infrastructure for text generation, spelling correction, and speech recognition. |
| ScholarGate数据集 ↗ |
|
|