مقایسهٔ روشها
روشهای انتخابی خود را کنار هم مرور کنید؛ ردیفهای متفاوت برجسته شدهاند.
| یادگیری انتقالی با Word2Vec× | یادگیری انتقالی مبتنی بر طبقهبندی با BERT× | |
|---|---|---|
| حوزه | یادگیری عمیق | یادگیری عمیق |
| خانواده | Machine learning | Machine learning |
| سال پیدایش≠ | 2013-2014 | 2019 (BERT); transfer learning paradigm established circa 2010 |
| پدیدآور≠ | Mikolov, T. et al. (Word2Vec); transfer-learning application popularised by Kim, Y. | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (BERT); Pan, S. J. & Yang, Q. (transfer learning survey) |
| نوع≠ | Transfer learning / embedding initialization | Pre-trained transformer fine-tuned for classification |
| منبع بنیادین≠ | Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013). Distributed representations of words and phrases and their compositionality. Advances in Neural Information Processing Systems (NIPS), 26, 3111-3119. link ↗ | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT 2019, 4171–4186. Association for Computational Linguistics. DOI ↗ |
| نامهای دیگر | Word2Vec transfer learning, pre-trained Word2Vec embeddings, Word2Vec embedding initialization, Word2Vec fine-tuning | BERT fine-tuning for classification, BERT transfer learning classifier, pre-trained BERT classifier, BERT downstream classification |
| مرتبط≠ | 5 | 4 |
| خلاصه≠ | Transfer Learning with Word2Vec uses word embeddings pre-trained on large text corpora via the Skip-gram or CBOW objectives introduced by Mikolov et al. (2013) to initialize the embedding layer of a downstream NLP model. This approach transfers distributional semantic knowledge to tasks where labeled data is scarce, consistently outperforming random initialization. | Transfer Learning with BERT-based Classification adapts a large transformer language model, pre-trained on massive text corpora, to a target classification task by fine-tuning its weights on labeled examples. The pre-trained representations encode rich syntactic and semantic knowledge, enabling high accuracy even when the labeled dataset is small. |
| ScholarGateمجموعهداده ↗ |
|
|