השוואת שיטות
סקרו את השיטות שבחרתם זו לצד זו; שורות שבהן יש הבדל מודגשות.
| שיפור הטמעות משפטים מותאמות תחום× | סיווג מבוסס BERT× | |
|---|---|---|
| תחום | למידה עמוקה | למידה עמוקה |
| משפחה | Machine learning | Machine learning |
| שנת המקור≠ | 2019–2020 | 2019 |
| הוגה השיטה≠ | Reimers, N. & Gurevych, I. (Sentence-BERT); Gururangan et al. (domain-adaptive pretraining) | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (Google AI Language) |
| סוג≠ | Domain-adaptive representation learning | Pre-trained language model with fine-tuning |
| מקור מכונן≠ | Reimers, N. & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of EMNLP-IJCNLP 2019, pp. 3982–3992. DOI ↗ | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT 2019 (pp. 4171–4186). Association for Computational Linguistics. DOI ↗ |
| כינויים | domain-adapted sentence transformers, domain-specific sentence embeddings, target-domain sentence representations, DAPT sentence embeddings | BERT classifier, BERT fine-tuning for classification, BERT text classification, BERT-CLS |
| קשורות≠ | 6 | 4 |
| תקציר≠ | Domain-adaptive sentence embeddings extend general-purpose sentence encoders — such as Sentence-BERT — by continuing their training on domain-specific text. The result is a fixed-length vector representation that captures both universal language understanding and the vocabulary, style, and semantic nuances of the target domain, improving downstream NLP tasks such as semantic search, clustering, and classification. | BERT-based Classification fine-tunes Google's Bidirectional Encoder Representations from Transformers model on a labelled text dataset, replacing the generic pre-trained head with a task-specific classification layer. It exploits deep bidirectional context from hundreds of millions of pre-trained parameters to deliver state-of-the-art accuracy on short- and medium-length text classification tasks with relatively modest amounts of labelled data. |
| ScholarGateמערך נתונים ↗ |
|
|