Σύγκριση μεθόδων
Εξετάστε τις επιλεγμένες μεθόδους δίπλα-δίπλα· οι γραμμές που διαφέρουν επισημαίνονται.
| Βελτιστοποιημένο Doc2Vec× | Λεπτομερώς Προσαρμοσμένες Ενσωματώσεις Προτάσεων× | |
|---|---|---|
| Πεδίο | Βαθιά Μάθηση | Βαθιά Μάθηση |
| Οικογένεια | Machine learning | Machine learning |
| Έτος προέλευσης≠ | 2014 (base); fine-tuning practice ca. 2015 | 2019 |
| Δημιουργός≠ | Le, Q. V. & Mikolov, T. (Doc2Vec base); fine-tuning practice adopted by the NLP community ca. 2015–2017 | Reimers, N. & Gurevych, I. |
| Τύπος≠ | Representation learning / transfer learning | Supervised / contrastive fine-tuning of pre-trained sentence encoders |
| Θεμελιώδης πηγή≠ | Le, Q. V., & Mikolov, T. (2014). Distributed Representations of Sentences and Documents. Proceedings of the 31st International Conference on Machine Learning (ICML 2014), PMLR 32(2), 1188–1196. link ↗ | Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP), 3982–3992. DOI ↗ |
| Εναλλακτικές ονομασίες | fine-tuned Paragraph Vector, domain-adapted Doc2Vec, PV fine-tuning, Doc2Vec transfer learning | SBERT fine-tuning, sentence transformer fine-tuning, domain-adapted sentence embeddings, fine-tuned sentence encoders |
| Συναφείς | 5 | 5 |
| Σύνοψη≠ | Fine-Tuned Doc2Vec adapts a pre-trained Paragraph Vector (Doc2Vec) model by continuing its training on a target corpus, producing document embeddings that capture both the general language knowledge of the original training and the vocabulary and style of the new domain. It is used for text classification, semantic similarity, and clustering when labeled data are scarce but unlabeled domain text is available. | Fine-Tuned Sentence Embeddings adapt a general-purpose pre-trained sentence encoder — such as Sentence-BERT — to a specific domain or task by continuing training on labeled or paired text data from that domain. The resulting embeddings capture domain-specific semantic structure far better than off-the-shelf vectors, improving downstream tasks such as semantic similarity, clustering, classification, and retrieval. |
| ScholarGateΣύνολο δεδομένων ↗ |
|
|