Bandingkan metode
Tinjau metode pilihan Anda berdampingan; baris yang berbeda akan disorot.
| Embedding Kalimat Swadaya-Terawasi× | Klasifikasi Berbasis BERT Self-Supervised× | |
|---|---|---|
| Bidang | Pembelajaran Mendalam | Pembelajaran Mendalam |
| Keluarga | Machine learning | Machine learning |
| Tahun asal≠ | 2019–2021 | 2019 |
| Pencetus≠ | Gao, T., Yao, X., & Chen, D. (SimCSE); Reimers, N. & Gurevych, I. (Sentence-BERT) | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (Google AI Language) |
| Tipe≠ | Self-supervised representation learning | Pretrain-then-fine-tune transformer model |
| Sumber perintis≠ | Gao, T., Yao, X., & Chen, D. (2021). SimCSE: Simple Contrastive Learning of Sentence Embeddings. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), 6894–6910. DOI ↗ | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT 2019, 4171–4186. Association for Computational Linguistics. DOI ↗ |
| Alias | self-supervised sentence representation learning, contrastive sentence embeddings, SimCSE, unsupervised sentence encoders | BERT fine-tuning for classification, BERT text classifier, self-supervised transformer classification, masked LM pretraining with classification head |
| Terkait≠ | 5 | 0 |
| Ringkasan≠ | Self-supervised sentence embeddings train a neural encoder to map sentences into a dense vector space without requiring manually labeled pairs. By constructing positive examples automatically — for instance by passing the same sentence through dropout twice — and using contrastive objectives, the model learns semantically rich representations that transfer well to similarity, retrieval, and classification tasks. | Self-supervised BERT-based classification uses Google's Bidirectional Encoder Representations from Transformers (BERT), pretrained on massive unlabelled text via masked-language modelling, and fine-tunes it on labelled examples to assign text into categories. It consistently achieves state-of-the-art accuracy on sentiment analysis, topic classification, intent detection, and similar NLP tasks even with limited labelled data. |
| ScholarGateSet data ↗ |
|
|