Usporedite metode
Pregledajte odabrane metode jednu uz drugu; retci koji se razlikuju su istaknuti.
| Domenski prilagođeni Word2Vec× | Domenski prilagođene ugradnje rečenica× | |
|---|---|---|
| Područje | Duboko učenje | Duboko učenje |
| Obitelj | Machine learning | Machine learning |
| Godina nastanka≠ | 2013–2016 | 2019–2020 |
| Tvorac≠ | Mikolov, T. et al. (Word2Vec); domain adaptation practice emerged in NLP community ~2014–2016 | Reimers, N. & Gurevych, I. (Sentence-BERT); Gururangan et al. (domain-adaptive pretraining) |
| Vrsta≠ | Domain-adapted word embedding model | Domain-adaptive representation learning |
| Temeljni izvor≠ | Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of ICLR Workshop. link ↗ | Reimers, N. & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of EMNLP-IJCNLP 2019, pp. 3982–3992. DOI ↗ |
| Drugi nazivi | domain-specific Word2Vec, domain-adapted word embeddings, domain Word2Vec, specialized Word2Vec | domain-adapted sentence transformers, domain-specific sentence embeddings, target-domain sentence representations, DAPT sentence embeddings |
| Srodne≠ | 5 | 6 |
| Sažetak≠ | Domain-adaptive Word2Vec trains or fine-tunes Word2Vec embeddings on a domain-specific text corpus so that word vectors capture the specialized vocabulary, semantic relationships, and jargon of a target field — such as clinical medicine, legal text, financial reports, or scientific literature — rather than reflecting general-purpose web or news language. | Domain-adaptive sentence embeddings extend general-purpose sentence encoders — such as Sentence-BERT — by continuing their training on domain-specific text. The result is a fixed-length vector representation that captures both universal language understanding and the vocabulary, style, and semantic nuances of the target domain, improving downstream NLP tasks such as semantic search, clustering, and classification. |
| ScholarGateSkup podataka ↗ |
|
|