So sánh phương pháp
Xem các phương pháp đã chọn cạnh nhau; những hàng khác biệt được làm nổi bật.
| Doc2Vec thích ứng miền× | Nhúng câu thích ứng miền× | |
|---|---|---|
| Lĩnh vực | Học sâu | Học sâu |
| Họ | Machine learning | Machine learning |
| Năm ra đời≠ | 2014 (Doc2Vec); domain-adaptive application mid-2010s onward | 2019–2020 |
| Người khởi xướng≠ | Le & Mikolov (Doc2Vec); domain adaptation literature (Blitzer, Daumé III, and others) | Reimers, N. & Gurevych, I. (Sentence-BERT); Gururangan et al. (domain-adaptive pretraining) |
| Loại≠ | Unsupervised / domain-adaptive document embedding | Domain-adaptive representation learning |
| Công trình gốc≠ | Le, Q. V., & Mikolov, T. (2014). Distributed representations of sentences and documents. Proceedings of the 31st International Conference on Machine Learning (ICML 2014), PMLR 32(2), 1188–1196. link ↗ | Reimers, N. & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of EMNLP-IJCNLP 2019, pp. 3982–3992. DOI ↗ |
| Tên gọi khác | domain-adapted Doc2Vec, cross-domain paragraph vector, domain-adaptive PV-DM, domain-adaptive PV-DBOW | domain-adapted sentence transformers, domain-specific sentence embeddings, target-domain sentence representations, DAPT sentence embeddings |
| Liên quan≠ | 5 | 6 |
| Tóm tắt≠ | Domain-adaptive Doc2Vec adapts the Paragraph Vector (Doc2Vec) framework so that document embeddings learned on a source domain transfer effectively to a target domain. By aligning the representation space across domains during or after training, the model produces embeddings that are informative on both, enabling cross-domain classification, sentiment analysis, and retrieval with limited target-domain labels. | Domain-adaptive sentence embeddings extend general-purpose sentence encoders — such as Sentence-BERT — by continuing their training on domain-specific text. The result is a fixed-length vector representation that captures both universal language understanding and the vocabulary, style, and semantic nuances of the target domain, improving downstream NLP tasks such as semantic search, clustering, and classification. |
| ScholarGateBộ dữ liệu ↗ |
|
|