Comparer des méthodes
Examinez les méthodes sélectionnées côte à côte ; les lignes qui diffèrent sont mises en évidence.
| Plongements de phrases semi-supervisés× | Plongements de phrases auto-supervisés× | |
|---|---|---|
| Domaine | Apprentissage profond | Apprentissage profond |
| Famille | Machine learning | Machine learning |
| Année d'origine | 2019–2021 | 2019–2021 |
| Auteur d'origine≠ | Gao, T.; Reimers, N. et al. (multiple contributors) | Gao, T., Yao, X., & Chen, D. (SimCSE); Reimers, N. & Gurevych, I. (Sentence-BERT) |
| Type≠ | Semi-supervised representation learning | Self-supervised representation learning |
| Source fondatrice≠ | Gao, T., Yao, X., & Chen, D. (2021). SimCSE: Simple Contrastive Learning of Sentence Embeddings. In Proceedings of EMNLP 2021 (pp. 6894–6910). Association for Computational Linguistics. DOI ↗ | Gao, T., Yao, X., & Chen, D. (2021). SimCSE: Simple Contrastive Learning of Sentence Embeddings. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), 6894–6910. DOI ↗ |
| Alias | Semi-supervised SimCSE, Self-training sentence encoders, Pseudo-labeled sentence representation learning, SSL sentence embeddings | self-supervised sentence representation learning, contrastive sentence embeddings, SimCSE, unsupervised sentence encoders |
| Apparentées | 5 | 5 |
| Résumé≠ | Semi-supervised sentence embeddings combine a small set of labeled sentence pairs with large quantities of unlabeled text to train dense vector representations of sentences. By exploiting abundant unlabeled data through contrastive objectives or pseudo-labeling, these models produce high-quality embeddings for semantic similarity, retrieval, and classification even when annotated data is scarce. | Self-supervised sentence embeddings train a neural encoder to map sentences into a dense vector space without requiring manually labeled pairs. By constructing positive examples automatically — for instance by passing the same sentence through dropout twice — and using contrastive objectives, the model learns semantically rich representations that transfer well to similarity, retrieval, and classification tasks. |
| ScholarGateJeu de données ↗ |
|
|