เปรียบเทียบวิธี
ดูวิธีที่เลือกเทียบกันแบบเคียงข้าง แถวที่ต่างกันจะถูกเน้นไว้
| Fine-Tuned Doc2Vec× | การจำแนกประเภทโดยใช้ BERT× | |
|---|---|---|
| สาขาวิชา | การเรียนรู้เชิงลึก | การเรียนรู้เชิงลึก |
| ตระกูล | Machine learning | Machine learning |
| ปีกำเนิด≠ | 2014 (base); fine-tuning practice ca. 2015 | 2019 |
| ผู้ริเริ่ม≠ | Le, Q. V. & Mikolov, T. (Doc2Vec base); fine-tuning practice adopted by the NLP community ca. 2015–2017 | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (Google AI Language) |
| ประเภท≠ | Representation learning / transfer learning | Pre-trained language model with fine-tuning |
| แหล่งต้นตำรับ≠ | Le, Q. V., & Mikolov, T. (2014). Distributed Representations of Sentences and Documents. Proceedings of the 31st International Conference on Machine Learning (ICML 2014), PMLR 32(2), 1188–1196. link ↗ | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT 2019 (pp. 4171–4186). Association for Computational Linguistics. DOI ↗ |
| ชื่อเรียกอื่น | fine-tuned Paragraph Vector, domain-adapted Doc2Vec, PV fine-tuning, Doc2Vec transfer learning | BERT classifier, BERT fine-tuning for classification, BERT text classification, BERT-CLS |
| ที่เกี่ยวข้อง≠ | 5 | 4 |
| สรุป≠ | Fine-Tuned Doc2Vec adapts a pre-trained Paragraph Vector (Doc2Vec) model by continuing its training on a target corpus, producing document embeddings that capture both the general language knowledge of the original training and the vocabulary and style of the new domain. It is used for text classification, semantic similarity, and clustering when labeled data are scarce but unlabeled domain text is available. | BERT-based Classification fine-tunes Google's Bidirectional Encoder Representations from Transformers model on a labelled text dataset, replacing the generic pre-trained head with a task-specific classification layer. It exploits deep bidirectional context from hundreds of millions of pre-trained parameters to deliver state-of-the-art accuracy on short- and medium-length text classification tasks with relatively modest amounts of labelled data. |
| ScholarGateชุดข้อมูล ↗ |
|
|