Comparar métodos
Examine os métodos selecionados lado a lado; as linhas que diferem ficam destacadas.
| Embeddings de Sentenças Adaptados ao Domínio× | Aprendizagem por Transferência com Embeddings de Sentenças× | |
|---|---|---|
| Área | Aprendizado profundo | Aprendizado profundo |
| Família | Machine learning | Machine learning |
| Ano de origem≠ | 2019–2020 | 2017–2019 |
| Autor original≠ | Reimers, N. & Gurevych, I. (Sentence-BERT); Gururangan et al. (domain-adaptive pretraining) | Reimers, N. & Gurevych, I. (SBERT); Conneau, A. et al. (InferSent) |
| Tipo≠ | Domain-adaptive representation learning | Transfer learning / sentence representation |
| Fonte seminal≠ | Reimers, N. & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of EMNLP-IJCNLP 2019, pp. 3982–3992. DOI ↗ | Reimers, N. & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP), 3982–3992. link ↗ |
| Outros nomes | domain-adapted sentence transformers, domain-specific sentence embeddings, target-domain sentence representations, DAPT sentence embeddings | sentence embedding transfer learning, pre-trained sentence encoder fine-tuning, SBERT transfer learning, sentence representation transfer |
| Relacionados≠ | 6 | 5 |
| Resumo≠ | Domain-adaptive sentence embeddings extend general-purpose sentence encoders — such as Sentence-BERT — by continuing their training on domain-specific text. The result is a fixed-length vector representation that captures both universal language understanding and the vocabulary, style, and semantic nuances of the target domain, improving downstream NLP tasks such as semantic search, clustering, and classification. | Transfer Learning with Sentence Embeddings takes a large pre-trained encoder — such as Sentence-BERT or the Universal Sentence Encoder — that already encodes general language knowledge into fixed-length vectors, and adapts it to a new task or domain with little additional labelled data. The pre-trained representations give a head start that often outperforms task-specific models trained from scratch on modest corpora. |
| ScholarGateConjunto de dados ↗ |
|
|