ScholarGate
Assistent

Sammenlign metoder

Gennemgå dine valgte metoder side om side; rækker, der afviger, er fremhævet.

Tosproget Billedklassifikation×Billedklassifikation×
FagområdeDyb læringDyb læring
FamilieMachine learningMachine learning
Oprindelsesår2020s2012 (deep CNN era); conceptual roots 1989 (LeCun)
OphavspersonCommunity / Radford et al. (CLIP, 2021) as key enablerKrizhevsky, A.; Sutskever, I.; Hinton, G. E.
TypeCross-lingual supervised image classificationSupervised classification task
Oprindelig kildeRadford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., ... & Sutskever, I. (2021). Learning transferable visual models from natural language supervision. In Proceedings of the 38th International Conference on Machine Learning (ICML), pp. 8748–8763. PMLR. link ↗Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems (NeurIPS), 25, 1097–1105. link ↗
AliasserCross-lingual image classification, Multilingual visual recognition, Cross-cultural image classification, Multilingual vision-language classificationvisual classification, image recognition, CNN-based classification, visual categorization
Relaterede55
ResuméMultilingual image classification trains visual models to recognise and label images when class names, supervision signals, or evaluation benchmarks span multiple languages. Enabled by multilingual vision-language models such as CLIP, it allows a single model to classify images using prompts or labels in any supported language, facilitating cross-cultural and cross-lingual deployment of computer vision systems.Image classification is the task of assigning a single semantic label to an entire image from a fixed set of categories. Modern approaches rely on deep convolutional neural networks (CNNs) or Vision Transformers (ViTs) trained end-to-end on large labeled datasets such as ImageNet, achieving superhuman accuracy on many benchmarks and underpinning applications from medical imaging to autonomous vehicles.
ScholarGateDatasæt
  1. v1
  2. 2 Kilder
  3. PUBLISHED
  1. v1
  2. 2 Kilder
  3. PUBLISHED

Gå til søgning Hent slides

ScholarGateSammenlign metoder: Multilingual Image Classification · Image Classification. Hentet 2026-06-15 fra https://scholargate.app/da/compare