Comparer des méthodes
Examinez les méthodes sélectionnées côte à côte ; les lignes qui diffèrent sont mises en évidence.
| Embeddings de phrases adaptatifs au domaine× | Plongements de phrases× | |
|---|---|---|
| Domaine | Apprentissage profond | Apprentissage profond |
| Famille | Machine learning | Machine learning |
| Année d'origine≠ | 2019–2020 | 2015–2019 |
| Auteur d'origine≠ | Reimers, N. & Gurevych, I. (Sentence-BERT); Gururangan et al. (domain-adaptive pretraining) | Kiros et al. (Skip-Thought, 2015); Reimers & Gurevych (Sentence-BERT, 2019) |
| Type≠ | Domain-adaptive representation learning | Representation learning / embedding |
| Source fondatrice≠ | Reimers, N. & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of EMNLP-IJCNLP 2019, pp. 3982–3992. DOI ↗ | Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP), 3980–3990. DOI ↗ |
| Alias | domain-adapted sentence transformers, domain-specific sentence embeddings, target-domain sentence representations, DAPT sentence embeddings | sentence vectors, sentence representations, SBERT, semantic sentence encoding |
| Apparentées≠ | 6 | 4 |
| Résumé≠ | Domain-adaptive sentence embeddings extend general-purpose sentence encoders — such as Sentence-BERT — by continuing their training on domain-specific text. The result is a fixed-length vector representation that captures both universal language understanding and the vocabulary, style, and semantic nuances of the target domain, improving downstream NLP tasks such as semantic search, clustering, and classification. | Sentence Embeddings convert a sentence or short text into a single fixed-length dense vector that captures its semantic meaning. These vectors allow downstream tasks — semantic similarity, clustering, retrieval, and classification — to operate on numerical representations instead of raw text, making them one of the most versatile building blocks in modern NLP pipelines. |
| ScholarGateJeu de données ↗ |
|
|