Machine learningDeep learning / NLP / CV

Multilingual Doc2Vec

Multilingual Doc2Vec extends the Paragraph Vector framework of Le and Mikolov (2014) to two or more languages, training document-level embeddings in a shared or aligned vector space so that semantically similar documents — regardless of their language — end up close together. It enables cross-lingual document retrieval, classification, and clustering without requiring parallel corpora or translation.

MethodMind'de açSoonVideoSoon

Tam yöntemi oku

Members only

Sign in with a free account to read this section.

Sign in

Sources

  1. Le, Q., & Mikolov, T. (2014). Distributed representations of sentences and documents. In Proceedings of the 31st International Conference on Machine Learning (ICML), PMLR 32(2), 1188–1196. link
  2. Multilingualism. Wikipedia. link

Related methods

ScholarGateMultilingual Doc2Vec (Multilingual Paragraph Vector (Doc2Vec) Model). Retrieved 2026-06-04 from https://scholargate.app/tr/deep-learning/multilingual-doc2vec