So sánh phương pháp
Xem các phương pháp đã chọn cạnh nhau; những hàng khác biệt được làm nổi bật.
| Word2Vec thích ứng miền× | Nhúng câu thích ứng miền× | |
|---|---|---|
| Lĩnh vực | Học sâu | Học sâu |
| Họ | Machine learning | Machine learning |
| Năm ra đời≠ | 2013–2016 | 2019–2020 |
| Người khởi xướng≠ | Mikolov, T. et al. (Word2Vec); domain adaptation practice emerged in NLP community ~2014–2016 | Reimers, N. & Gurevych, I. (Sentence-BERT); Gururangan et al. (domain-adaptive pretraining) |
| Loại≠ | Domain-adapted word embedding model | Domain-adaptive representation learning |
| Công trình gốc≠ | Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of ICLR Workshop. link ↗ | Reimers, N. & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of EMNLP-IJCNLP 2019, pp. 3982–3992. DOI ↗ |
| Tên gọi khác | domain-specific Word2Vec, domain-adapted word embeddings, domain Word2Vec, specialized Word2Vec | domain-adapted sentence transformers, domain-specific sentence embeddings, target-domain sentence representations, DAPT sentence embeddings |
| Liên quan≠ | 5 | 6 |
| Tóm tắt≠ | Domain-adaptive Word2Vec trains or fine-tunes Word2Vec embeddings on a domain-specific text corpus so that word vectors capture the specialized vocabulary, semantic relationships, and jargon of a target field — such as clinical medicine, legal text, financial reports, or scientific literature — rather than reflecting general-purpose web or news language. | Domain-adaptive sentence embeddings extend general-purpose sentence encoders — such as Sentence-BERT — by continuing their training on domain-specific text. The result is a fixed-length vector representation that captures both universal language understanding and the vocabulary, style, and semantic nuances of the target domain, improving downstream NLP tasks such as semantic search, clustering, and classification. |
| ScholarGateBộ dữ liệu ↗ |
|
|