Vertaile menetelmiä
Tarkastele valitsemiasi menetelmiä rinnakkain; eroavat rivit korostetaan.
| Siirto-oppiminen nimettyjen entiteettien tunnistuksessa (NER)× | Lauseupotukset× | |
|---|---|---|
| Tieteenala | Syväoppiminen | Syväoppiminen |
| Menetelmäperhe | Machine learning | Machine learning |
| Syntyvuosi≠ | 2010 / 2019 | 2015–2019 |
| Kehittäjä≠ | Pan & Yang (transfer learning); Devlin et al. (BERT-based NER fine-tuning) | Kiros et al. (Skip-Thought, 2015); Reimers & Gurevych (Sentence-BERT, 2019) |
| Tyyppi≠ | Supervised sequence labeling via pretrained encoder fine-tuning | Representation learning / embedding |
| Alkuperäislähde≠ | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT 2019 (pp. 4171–4186). Association for Computational Linguistics. DOI ↗ | Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP), 3980–3990. DOI ↗ |
| Rinnakkaisnimet | TL-NER, Fine-Tuned NER, Pretrained Model NER, BERT NER | sentence vectors, sentence representations, SBERT, semantic sentence encoding |
| Liittyvät≠ | 5 | 4 |
| Tiivistelmä≠ | Transfer Learning with Named Entity Recognition (NER) adapts a large pretrained language model — such as BERT, RoBERTa, or a domain-specific encoder — to the task of identifying and classifying named entities (persons, locations, organizations, dates, etc.) in text. By reusing rich linguistic representations learned from massive corpora, this approach requires only modest labeled NER data while achieving state-of-the-art span detection and classification accuracy. | Sentence Embeddings convert a sentence or short text into a single fixed-length dense vector that captures its semantic meaning. These vectors allow downstream tasks — semantic similarity, clustering, retrieval, and classification — to operate on numerical representations instead of raw text, making them one of the most versatile building blocks in modern NLP pipelines. |
| ScholarGateAineisto ↗ |
|
|