ScholarGate
助手
Machine learningDeep learning / NLP / CV

多语言图像分类

多语言图像分类训练视觉模型,使其在类别名称、监督信号或评估基准涵盖多种语言时,能够识别和标注图像。通过多语言视觉-语言模型(如CLIP)实现,它允许单个模型使用任何支持语言的提示或标签对图像进行分类,从而促进计算机视觉系统在跨文化和跨语言环境中的部署。

在 MethodMind 中打开即将推出视频即将推出Download slides

阅读完整方法

仅限会员

使用免费账户登录即可阅读本节。

登录

Method map

The neighbourhood of related methods — select a node to explore.

来源

  1. Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., ... & Sutskever, I. (2021). Learning transferable visual models from natural language supervision. In Proceedings of the 38th International Conference on Machine Learning (ICML), pp. 8748–8763. PMLR. link
  2. Image classification. Wikipedia. link

如何引用本页

ScholarGate. (2026, June 3). Multilingual Image Classification (Cross-Lingual Vision Model). ScholarGate. https://scholargate.app/zh/deep-learning/multilingual-image-classification

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side
ScholarGateMultilingual Image Classification (Multilingual Image Classification (Cross-Lingual Vision Model)). 于 2026-06-15 检索自 https://scholargate.app/zh/deep-learning/multilingual-image-classification · 数据集: https://doi.org/10.5281/zenodo.20539026