Machine learningDeep learning / NLP / CV
Multilingual Doc2Vec
Multilingual Doc2Vec extends the Paragraph Vector framework of Le and Mikolov (2014) to two or more languages, training document-level embeddings in a shared or aligned vector space so that semantically similar documents — regardless of their language — end up close together. It enables cross-lingual document retrieval, classification, and clustering without requiring parallel corpora or translation.
Open in MethodMindSoonVideoSoon
Read the full method
Members only
Sign inSign in with a free account to read this section.