ScholarGate
Assistent

Sammenlign metoder

Gennemgå dine valgte metoder side om side; rækker, der afviger, er fremhævet.

Fler-sproglig Doc2Vec×Flersprogede sætningsindlejringer×
FagområdeDyb læringDyb læring
FamilieMachine learningMachine learning
Oprindelsesår2014–20162019–2022
OphavspersonLe, Q. & Mikolov, T. (Doc2Vec); multilingual extension by communityReimers, N. & Gurevych, I.; Feng, F. et al. (Google)
TypeDistributed document embedding (unsupervised / self-supervised)Cross-lingual representation learning
Oprindelig kildeLe, Q., & Mikolov, T. (2014). Distributed representations of sentences and documents. In Proceedings of the 31st International Conference on Machine Learning (ICML), PMLR 32(2), 1188–1196. link ↗Reimers, N. & Gurevych, I. (2020). Making Monolingual Sentence Embeddings Multilingual using Knowledge Distillation. Proceedings of EMNLP 2020, 4512–4525. link ↗
Aliassermultilingual paragraph vector, cross-lingual Doc2Vec, multilingual PV-DM, multilingual PV-DBOWmultilingual sentence representations, cross-lingual sentence embeddings, mSE, multilingual semantic embeddings
Relaterede45
ResuméMultilingual Doc2Vec extends the Paragraph Vector framework of Le and Mikolov (2014) to two or more languages, training document-level embeddings in a shared or aligned vector space so that semantically similar documents — regardless of their language — end up close together. It enables cross-lingual document retrieval, classification, and clustering without requiring parallel corpora or translation.Multilingual sentence embeddings map sentences from many languages into a single shared vector space so that semantically equivalent sentences — regardless of language — land close together. Models such as LaBSE, multilingual Sentence-BERT, and mUSE have made it practical to compare, retrieve, and classify text across 50 to 100+ languages without translating anything first.
ScholarGateDatasæt
  1. v1
  2. 2 Kilder
  3. PUBLISHED
  1. v1
  2. 2 Kilder
  3. PUBLISHED

Gå til søgning Hent slides

ScholarGateSammenlign metoder: Multilingual Doc2Vec · Multilingual Sentence Embeddings. Hentet 2026-06-17 fra https://scholargate.app/da/compare