Salīdzināt metodes
Apskatiet izvēlētās metodes blakus; rindas, kas atšķiras, ir izceltas.
| Pārneses apmācība ar nosaukto entītiju atpazīšanu× | Ievietojumi teikumiem× | |
|---|---|---|
| Nozare | Dziļā mācīšanās | Dziļā mācīšanās |
| Saime | Machine learning | Machine learning |
| Izcelsmes gads≠ | 2010 / 2019 | 2015–2019 |
| Autors≠ | Pan & Yang (transfer learning); Devlin et al. (BERT-based NER fine-tuning) | Kiros et al. (Skip-Thought, 2015); Reimers & Gurevych (Sentence-BERT, 2019) |
| Tips≠ | Supervised sequence labeling via pretrained encoder fine-tuning | Representation learning / embedding |
| Pirmavots≠ | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT 2019 (pp. 4171–4186). Association for Computational Linguistics. DOI ↗ | Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP), 3980–3990. DOI ↗ |
| Citi nosaukumi | TL-NER, Fine-Tuned NER, Pretrained Model NER, BERT NER | sentence vectors, sentence representations, SBERT, semantic sentence encoding |
| Saistītās≠ | 5 | 4 |
| Kopsavilkums≠ | Transfer Learning with Named Entity Recognition (NER) adapts a large pretrained language model — such as BERT, RoBERTa, or a domain-specific encoder — to the task of identifying and classifying named entities (persons, locations, organizations, dates, etc.) in text. By reusing rich linguistic representations learned from massive corpora, this approach requires only modest labeled NER data while achieving state-of-the-art span detection and classification accuracy. | Sentence Embeddings convert a sentence or short text into a single fixed-length dense vector that captures its semantic meaning. These vectors allow downstream tasks — semantic similarity, clustering, retrieval, and classification — to operate on numerical representations instead of raw text, making them one of the most versatile building blocks in modern NLP pipelines. |
| ScholarGateDatu kopa ↗ |
|
|