ScholarGate

Compare methods

Review the selected methods side by side; rows that differ are highlighted.

| Attribute | Multilingual image classification | Image classification |
|---|---|---|
| Field | Deep learning | Deep learning |
| Family | Machine learning | Machine learning |
| Year of origin | 2020s | 2012 (deep CNN era); conceptual roots 1989 (LeCun) |
| Original author | Community / Radford et al. (CLIP, 2021) as key enabler | Krizhevsky, A.; Sutskever, I.; Hinton, G. E. |
| Type | Cross-lingual supervised image classification | Supervised classification task |
| Seminal source | Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., ... & Sutskever, I. (2021). Learning transferable visual models from natural language supervision. In Proceedings of the 38th International Conference on Machine Learning (ICML), pp. 8748–8763. PMLR. | Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems (NeurIPS), 25, 1097–1105. |
| Aliases | Cross-lingual image classification, Multilingual visual recognition, Cross-cultural image classification, Multilingual vision-language classification | Visual classification, image recognition, CNN-based classification, visual categorization |
| Related | 5 | 5 |
| Summary | Multilingual image classification trains visual models to recognise and label images when class names, supervision signals, or evaluation benchmarks span multiple languages. Enabled by vision-language models such as CLIP and their multilingual variants, it allows a single model to classify images using prompts or labels in any supported language, which eases cross-cultural and cross-lingual deployment of computer vision systems. | Image classification is the task of assigning a single semantic label to an entire image from a fixed set of categories. Modern approaches rely on deep convolutional neural networks (CNNs) or Vision Transformers (ViTs) trained end-to-end on large labeled datasets such as ImageNet, reaching accuracy that matches or exceeds estimated human performance on some benchmarks and underpinning applications from medical imaging to autonomous vehicles. |
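The multilingual summary describes classifying an image by comparing it against label prompts written in any supported language. A minimal sketch of that scoring step follows, assuming the image and the label prompts have already been embedded into a shared vector space by some vision-language model. The three-dimensional vectors, the label sets, the `classify` helper, and the temperature value of 0.07 are all hypothetical, chosen only for illustration; they are not taken from ScholarGate or from any particular model.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def softmax(scores, temperature=0.07):
    """Turn similarity scores into probabilities (temperature is an assumed value)."""
    scaled = [s / temperature for s in scores]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def classify(image_embedding, label_embeddings):
    """Return the best-matching label and the probability of every label."""
    labels = list(label_embeddings)
    scores = [cosine(image_embedding, label_embeddings[name]) for name in labels]
    probs = softmax(scores)
    best = max(range(len(labels)), key=lambda i: probs[i])
    return labels[best], dict(zip(labels, probs))


# Hypothetical toy embeddings standing in for real model outputs.
IMAGE = [0.9, 0.1, 0.2]  # pretend this is an embedded photo of a dog
LABELS_EN = {"dog": [1.0, 0.0, 0.1], "cat": [0.0, 1.0, 0.1]}
LABELS_CA = {"gos": [0.95, 0.05, 0.15], "gat": [0.05, 0.9, 0.2]}

label_en, probs_en = classify(IMAGE, LABELS_EN)  # label_en == "dog"
label_ca, probs_ca = classify(IMAGE, LABELS_CA)  # label_ca == "gos"
```

The same image vector is scored against English and Catalan label sets without retraining, which is the property the summary attributes to the multilingual setting. A fixed-category classifier of the kind described in the second column would instead output scores over one predefined label set.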
ScholarGate dataset
Both entries: v1, 2 sources, PUBLISHED.


ScholarGate. Compare methods: Multilingual Image Classification · Image Classification. Retrieved 2026-06-15 from https://scholargate.app/ca/compare