Machine learningDeep learning / NLP / CV

Multilingual Image Classification

Multilingual image classification trains visual models to recognise and label images when class names, supervision signals, or evaluation benchmarks span multiple languages. Enabled by multilingual vision-language models such as CLIP, it allows a single model to classify images using prompts or labels in any supported language, facilitating cross-cultural and cross-lingual deployment of computer vision systems.

MethodMind'de açSoonVideoSoon

Tam yöntemi oku

Members only

Sign in with a free account to read this section.

Sign in

Sources

  1. Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., ... & Sutskever, I. (2021). Learning transferable visual models from natural language supervision. In Proceedings of the 38th International Conference on Machine Learning (ICML), pp. 8748–8763. PMLR. link
  2. Image classification. Wikipedia. link

Related methods

ScholarGateMultilingual Image Classification (Multilingual Image Classification (Cross-Lingual Vision Model)). Retrieved 2026-06-04 from https://scholargate.app/tr/deep-learning/multilingual-image-classification