Porovnat metody
Prohlédněte si vybrané metody vedle sebe; řádky, které se liší, jsou zvýrazněny.
| Přenosové učení s rozpoznáváním pojmenovaných entit× | Vektorové reprezentace vět× | |
|---|---|---|
| Obor | Hluboké učení | Hluboké učení |
| Rodina | Machine learning | Machine learning |
| Rok vzniku≠ | 2010 / 2019 | 2015–2019 |
| Tvůrce≠ | Pan & Yang (transfer learning); Devlin et al. (BERT-based NER fine-tuning) | Kiros et al. (Skip-Thought, 2015); Reimers & Gurevych (Sentence-BERT, 2019) |
| Typ≠ | Supervised sequence labeling via pretrained encoder fine-tuning | Representation learning / embedding |
| Původní zdroj≠ | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT 2019 (pp. 4171–4186). Association for Computational Linguistics. DOI ↗ | Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP), 3980–3990. DOI ↗ |
| Další názvy | TL-NER, Fine-Tuned NER, Pretrained Model NER, BERT NER | sentence vectors, sentence representations, SBERT, semantic sentence encoding |
| Příbuzné≠ | 5 | 4 |
| Shrnutí≠ | Transfer Learning with Named Entity Recognition (NER) adapts a large pretrained language model — such as BERT, RoBERTa, or a domain-specific encoder — to the task of identifying and classifying named entities (persons, locations, organizations, dates, etc.) in text. By reusing rich linguistic representations learned from massive corpora, this approach requires only modest labeled NER data while achieving state-of-the-art span detection and classification accuracy. | Sentence Embeddings convert a sentence or short text into a single fixed-length dense vector that captures its semantic meaning. These vectors allow downstream tasks — semantic similarity, clustering, retrieval, and classification — to operate on numerical representations instead of raw text, making them one of the most versatile building blocks in modern NLP pipelines. |
| ScholarGateDatová sada ↗ |
|
|