So sánh phương pháp
Xem các phương pháp đã chọn cạnh nhau; những hàng khác biệt được làm nổi bật.
| Nhúng câu thích ứng miền× | Phân loại dựa trên RoBERTa× | |
|---|---|---|
| Lĩnh vực | Học sâu | Học sâu |
| Họ | Machine learning | Machine learning |
| Năm ra đời≠ | 2019–2020 | 2019 |
| Người khởi xướng≠ | Reimers, N. & Gurevych, I. (Sentence-BERT); Gururangan et al. (domain-adaptive pretraining) | Liu, Y. et al. (Facebook AI Research / University of Washington) |
| Loại≠ | Domain-adaptive representation learning | Pre-trained transformer fine-tuned for sequence classification |
| Công trình gốc≠ | Reimers, N. & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of EMNLP-IJCNLP 2019, pp. 3982–3992. DOI ↗ | Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., Levy, O., Lewis, M., Zettlemoyer, L., & Stoyanov, V. (2019). RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv preprint arXiv:1907.11692. link ↗ |
| Tên gọi khác | domain-adapted sentence transformers, domain-specific sentence embeddings, target-domain sentence representations, DAPT sentence embeddings | RoBERTa classifier, RoBERTa text classification, Robustly Optimized BERT Classification, RoBERTa fine-tuning for classification |
| Liên quan≠ | 6 | 5 |
| Tóm tắt≠ | Domain-adaptive sentence embeddings extend general-purpose sentence encoders — such as Sentence-BERT — by continuing their training on domain-specific text. The result is a fixed-length vector representation that captures both universal language understanding and the vocabulary, style, and semantic nuances of the target domain, improving downstream NLP tasks such as semantic search, clustering, and classification. | RoBERTa-based Classification applies the RoBERTa pre-trained transformer — trained more robustly than BERT with dynamic masking and larger batches — to text categorisation tasks by adding a lightweight classification head on top of the [CLS] token representation and fine-tuning the entire model on labelled examples. It consistently matches or outperforms BERT on standard NLP benchmarks. |
| ScholarGateBộ dữ liệu ↗ |
|
|