Compare methods
View the selected methods side by side; rows that differ are marked with ≠.
| | Domain-specific Word2Vec | Domain-adapted sentence embeddings |
|---|---|---|
| Field | Deep learning | Deep learning |
| Family | Machine learning | Machine learning |
| Year of origin ≠ | 2013–2016 | 2019–2020 |
| Creator ≠ | Mikolov, T. et al. (Word2Vec); domain adaptation practice emerged in the NLP community ~2014–2016 | Reimers, N. & Gurevych, I. (Sentence-BERT); Gururangan et al. (domain-adaptive pretraining) |
| Type ≠ | Domain-adapted word embedding model | Domain-adaptive representation learning |
| Original source ≠ | Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of ICLR Workshop. | Reimers, N. & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of EMNLP-IJCNLP 2019, pp. 3982–3992. |
| Alternative names | domain-specific Word2Vec, domain-adapted word embeddings, domain Word2Vec, specialized Word2Vec | domain-adapted sentence transformers, domain-specific sentence embeddings, target-domain sentence representations, DAPT sentence embeddings |
| Related ≠ | 5 | 6 |
| Summary ≠ | Domain-adaptive Word2Vec trains or fine-tunes Word2Vec embeddings on a domain-specific text corpus so that word vectors capture the specialized vocabulary, semantic relationships, and jargon of a target field (such as clinical medicine, legal text, financial reports, or scientific literature) rather than reflecting general-purpose web or news language. | Domain-adaptive sentence embeddings extend general-purpose sentence encoders, such as Sentence-BERT, by continuing their training on domain-specific text. The result is a fixed-length vector representation that captures both general language understanding and the vocabulary, style, and semantic nuances of the target domain, which can improve downstream NLP tasks such as semantic search, clustering, and classification. |