Módszerek összehasonlítása
Tekintse át a kiválasztott módszereket egymás mellett; az eltérő sorok kiemelve jelennek meg.
| Önfelügyelt mondatbeágyazások× | Önfelügyelt BERT-alapú osztályozás× | |
|---|---|---|
| Tudományterület | Mélytanulás | Mélytanulás |
| Módszercsalád | Machine learning | Machine learning |
| Keletkezés éve≠ | 2019–2021 | 2019 |
| Megalkotó≠ | Gao, T., Yao, X., & Chen, D. (SimCSE); Reimers, N. & Gurevych, I. (Sentence-BERT) | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (Google AI Language) |
| Típus≠ | Self-supervised representation learning | Pretrain-then-fine-tune transformer model |
| Alapmű≠ | Gao, T., Yao, X., & Chen, D. (2021). SimCSE: Simple Contrastive Learning of Sentence Embeddings. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), 6894–6910. DOI ↗ | Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT 2019, 4171–4186. Association for Computational Linguistics. DOI ↗ |
| Alternatív nevek | self-supervised sentence representation learning, contrastive sentence embeddings, SimCSE, unsupervised sentence encoders | BERT fine-tuning for classification, BERT text classifier, self-supervised transformer classification, masked LM pretraining with classification head |
| Kapcsolódó≠ | 5 | 0 |
| Összefoglaló≠ | Self-supervised sentence embeddings train a neural encoder to map sentences into a dense vector space without requiring manually labeled pairs. By constructing positive examples automatically — for instance by passing the same sentence through dropout twice — and using contrastive objectives, the model learns semantically rich representations that transfer well to similarity, retrieval, and classification tasks. | Self-supervised BERT-based classification uses Google's Bidirectional Encoder Representations from Transformers (BERT), pretrained on massive unlabelled text via masked-language modelling, and fine-tunes it on labelled examples to assign text into categories. It consistently achieves state-of-the-art accuracy on sentiment analysis, topic classification, intent detection, and similar NLP tasks even with limited labelled data. |
| ScholarGateAdatkészlet ↗ |
|
|