Machine learningDeep learning / NLP / CV

Multilingual RoBERTa-based Classification

Multilingual RoBERTa-based classification uses XLM-RoBERTa — a transformer pretrained on 100+ languages via masked language modeling — and fine-tunes it on labeled text to assign categories across multiple languages. By sharing a single model across languages, it enables robust cross-lingual and zero-shot text classification without needing separate per-language classifiers.

MethodMind'de açSoonVideoSoon

Tam yöntemi oku

Members only

Sign in with a free account to read this section.

Sign in

Sources

  1. Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzman, F., Grave, E., Ott, M., Zettlemoyer, L., & Stoyanov, V. (2020). Unsupervised Cross-lingual Representation Learning at Scale. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), pp. 8440–8451. DOI: 10.18653/v1/2020.acl-main.747
  2. Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., Levy, O., Lewis, M., Zettlemoyer, L., & Stoyanov, V. (2019). RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv preprint arXiv:1907.11692. link

Related methods

Referenced by

ScholarGateMultilingual RoBERTa-based Classification (Multilingual RoBERTa-based Text Classification (XLM-RoBERTa)). Retrieved 2026-06-04 from https://scholargate.app/tr/deep-learning/multilingual-roberta-based-classification