قارن الطرق
راجع الطرق التي اخترتها جنبًا إلى جنب؛ الصفوف المختلفة مميَّزة.
| Doc2Vec التكيفي مع النطاق× | Doc2Vec المُحسَّن بدقة× | |
|---|---|---|
| المجال | التعلم العميق | التعلم العميق |
| العائلة | Machine learning | Machine learning |
| سنة النشأة≠ | 2014 (Doc2Vec); domain-adaptive application mid-2010s onward | 2014 (base); fine-tuning practice ca. 2015 |
| صاحب الطريقة≠ | Le & Mikolov (Doc2Vec); domain adaptation literature (Blitzer, Daumé III, and others) | Le, Q. V. & Mikolov, T. (Doc2Vec base); fine-tuning practice adopted by the NLP community ca. 2015–2017 |
| النوع≠ | Unsupervised / domain-adaptive document embedding | Representation learning / transfer learning |
| المصدر التأسيسي | Le, Q. V., & Mikolov, T. (2014). Distributed representations of sentences and documents. Proceedings of the 31st International Conference on Machine Learning (ICML 2014), PMLR 32(2), 1188–1196. link ↗ | Le, Q. V., & Mikolov, T. (2014). Distributed Representations of Sentences and Documents. Proceedings of the 31st International Conference on Machine Learning (ICML 2014), PMLR 32(2), 1188–1196. link ↗ |
| الأسماء البديلة | domain-adapted Doc2Vec, cross-domain paragraph vector, domain-adaptive PV-DM, domain-adaptive PV-DBOW | fine-tuned Paragraph Vector, domain-adapted Doc2Vec, PV fine-tuning, Doc2Vec transfer learning |
| ذات صلة | 5 | 5 |
| الملخص≠ | Domain-adaptive Doc2Vec adapts the Paragraph Vector (Doc2Vec) framework so that document embeddings learned on a source domain transfer effectively to a target domain. By aligning the representation space across domains during or after training, the model produces embeddings that are informative on both, enabling cross-domain classification, sentiment analysis, and retrieval with limited target-domain labels. | Fine-Tuned Doc2Vec adapts a pre-trained Paragraph Vector (Doc2Vec) model by continuing its training on a target corpus, producing document embeddings that capture both the general language knowledge of the original training and the vocabulary and style of the new domain. It is used for text classification, semantic similarity, and clustering when labeled data are scarce but unlabeled domain text is available. |
| ScholarGateمجموعة البيانات ↗ |
|
|