Multilingual Text Summarization
Multilingual text summarization applies pre-trained multilingual encoder-decoder models, such as mT5 or mBART, to generate concise summaries of documents written in many languages. The summary is either in the same language as the source document (monolingual) or in a different language (cross-lingual). Fine-tuning these models on a multilingual summarization benchmark such as XL-Sum, which covers 44 languages, lets a single model serve dozens of languages.
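As a rough illustration of the monolingual case, the sketch below loads a fine-tuned multilingual encoder-decoder with the Hugging Face `transformers` library and generates a summary. The checkpoint name `csebuetnlp/mT5_multilingual_XLSum`, the beam-search settings, and the length limits are assumptions chosen for the example rather than values taken from this page; any seq2seq checkpoint fine-tuned for summarization should slot in the same way.

```python
import re

# Assumption: a publicly shared mT5 checkpoint fine-tuned on XL-Sum.
# Replace it with whichever fine-tuned multilingual model you actually use.
MODEL_NAME = "csebuetnlp/mT5_multilingual_XLSum"


def normalize_whitespace(text: str) -> str:
    """Collapse newlines and runs of whitespace into single spaces."""
    return re.sub(r"\s+", " ", text.strip())


def summarize(
    text: str,
    model_name: str = MODEL_NAME,
    max_input_tokens: int = 512,    # assumed input truncation length
    max_summary_tokens: int = 84,   # assumed cap on summary length
) -> str:
    """Summarize `text` in its own language with a multilingual seq2seq model."""
    # Imported lazily so the helper above can be used without the heavy dependencies.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

    # Tokenize the cleaned document, truncating inputs that exceed the limit.
    inputs = tokenizer(
        normalize_whitespace(text),
        return_tensors="pt",
        truncation=True,
        max_length=max_input_tokens,
    )

    # Beam search with an n-gram repetition penalty; these settings are illustrative.
    output_ids = model.generate(
        **inputs,
        max_length=max_summary_tokens,
        num_beams=4,
        no_repeat_ngram_size=2,
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `summarize(article_text)` on a document in one of the languages the checkpoint was fine-tuned on should return a short summary in that language. The first call downloads the model weights, and the sketch assumes `transformers`, `torch`, and `sentencepiece` are installed.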
Sources
- Xue, L., Constant, N., Roberts, A., Kale, M., Al-Rfou, R., Siddhant, A., Barua, A., & Raffel, C. (2021). mT5: A Massively Multilingual Pre-Trained Text-to-Text Transformer. Proceedings of NAACL-HLT 2021, pp. 483–498. Association for Computational Linguistics.
- Hasan, T., Bhattacharjee, A., Islam, M. S., Mubasshir, K., Li, Y.-F., Kang, Y.-B., Rahman, M. S., & Shahriyar, R. (2021). XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages. Findings of ACL-IJCNLP 2021, pp. 4693–4703. Association for Computational Linguistics.