Process / pipeline
Cross-lingual Text Analysis — Multilingual Representation
Cross-lingual text analysis lets you compare and analyse texts written in different languages within a shared vector space. Building on multilingual representation learning surveyed by Conneau et al. (2020) and Pires et al. (2019), it maps documents from several languages into one common embedding space so multilingual corpora can be studied together.
Open in MethodMindSoonVideoSoon
Read the full method
Members only
Sign inSign in with a free account to read this section.
Sources
- Conneau, A. et al. (2020). Unsupervised Cross-lingual Representation Learning at Scale. Proceedings of ACL. DOI: 10.18653/v1/2020.acl-main.747 ↗
- Pires, T., Schlinger, E. & Garrette, D. (2019). How Multilingual is Multilingual BERT? Proceedings of ACL. DOI: 10.18653/v1/P19-1493 ↗