Сравнение методов
Просматривайте выбранные методы рядом; строки с различиями подсвечены.
| Трансферное обучение для распознавания именованных сущностей× | Векторные представления предложений× | |
|---|---|---|
| Область | Глубокое обучение | Глубокое обучение |
| Семейство | Machine learning | Machine learning |
| Год появления≠ | 2010 / 2019 | 2015–2019 |
| Автор метода≠ | Pan & Yang (transfer learning); Devlin et al. (BERT-based NER fine-tuning) | Kiros et al. (Skip-Thought, 2015); Reimers & Gurevych (Sentence-BERT, 2019) |
| Тип≠ | Supervised sequence labeling via pretrained encoder fine-tuning | Representation learning / embedding |
| Основополагающий источник≠ | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT 2019 (pp. 4171–4186). Association for Computational Linguistics. DOI ↗ | Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP), 3980–3990. DOI ↗ |
| Другие названия | TL-NER, Fine-Tuned NER, Pretrained Model NER, BERT NER | sentence vectors, sentence representations, SBERT, semantic sentence encoding |
| Связанные≠ | 5 | 4 |
| Сводка≠ | Transfer Learning with Named Entity Recognition (NER) adapts a large pretrained language model — such as BERT, RoBERTa, or a domain-specific encoder — to the task of identifying and classifying named entities (persons, locations, organizations, dates, etc.) in text. By reusing rich linguistic representations learned from massive corpora, this approach requires only modest labeled NER data while achieving state-of-the-art span detection and classification accuracy. | Sentence Embeddings convert a sentence or short text into a single fixed-length dense vector that captures its semantic meaning. These vectors allow downstream tasks — semantic similarity, clustering, retrieval, and classification — to operate on numerical representations instead of raw text, making them one of the most versatile building blocks in modern NLP pipelines. |
| ScholarGateНабор данных ↗ |
|
|