Módszerek összehasonlítása
Tekintse át a kiválasztott módszereket egymás mellett; az eltérő sorok kiemelve jelennek meg.
| Visszakereséssel kiegészített generálás (RAG)× | BERT finomhangolás× | |
|---|---|---|
| Tudományterület≠ | Szövegbányászat | Mélytanulás |
| Módszercsalád≠ | Process / pipeline | Machine learning |
| Keletkezés éve≠ | 2020 | 2019 |
| Megalkotó≠ | Lewis, Patrick et al. (Meta AI / Facebook AI Research) | Devlin, J. et al. |
| Típus≠ | Hybrid retrieval + generation pipeline | Transfer learning (fine-tuning a pre-trained transformer) |
| Alapmű≠ | Lewis, P. et al. (2020). Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. Advances in Neural Information Processing Systems (NeurIPS), 33, 9459-9474. DOI ↗ | Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. NAACL. DOI ↗ |
| Alternatív nevek | RAG, retrieval-augmented LLM, grounded generation, Erişim Destekli Metin Üretimi (RAG) | BERT İnce Ayar (Fine-Tuning), BERT ince ayar, fine-tuning BERT, transfer learning with BERT |
| Kapcsolódó≠ | 7 | 5 |
| Összefoglaló≠ | Retrieval-Augmented Generation (RAG) is a natural-language-processing pipeline introduced by Lewis et al. in 2020 that strengthens a large language model (LLM) with evidence fetched at inference time from an external knowledge base. Instead of relying solely on what a model memorised during training, RAG first retrieves the most relevant passages from a document index and then hands those passages to the LLM as context, grounding the generated answer in verifiable, up-to-date information. The approach reduces hallucination and allows domain-specific or time-sensitive knowledge to be injected without retraining the model. | BERT fine-tuning, building on the BERT model introduced by Devlin and colleagues in 2019, re-trains a pre-trained BERT model on a small labelled dataset for a target task such as classification, named-entity recognition, or question answering. Through transfer learning it reaches high performance even with relatively little task-specific data. |
| ScholarGateAdatkészlet ↗ |
|
|