ScholarGate
Βοηθός

Σύγκριση μεθόδων

Εξετάστε τις επιλεγμένες μεθόδους δίπλα-δίπλα· οι γραμμές που διαφέρουν επισημαίνονται.

Πολυγλωσσική Ταξινόμηση Εικόνων×Κατηγοριοποίηση Εικόνων×
ΠεδίοΒαθιά ΜάθησηΒαθιά Μάθηση
ΟικογένειαMachine learningMachine learning
Έτος προέλευσης2020s2012 (deep CNN era); conceptual roots 1989 (LeCun)
ΔημιουργόςCommunity / Radford et al. (CLIP, 2021) as key enablerKrizhevsky, A.; Sutskever, I.; Hinton, G. E.
ΤύποςCross-lingual supervised image classificationSupervised classification task
Θεμελιώδης πηγήRadford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., ... & Sutskever, I. (2021). Learning transferable visual models from natural language supervision. In Proceedings of the 38th International Conference on Machine Learning (ICML), pp. 8748–8763. PMLR. link ↗Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems (NeurIPS), 25, 1097–1105. link ↗
Εναλλακτικές ονομασίεςCross-lingual image classification, Multilingual visual recognition, Cross-cultural image classification, Multilingual vision-language classificationvisual classification, image recognition, CNN-based classification, visual categorization
Συναφείς55
ΣύνοψηMultilingual image classification trains visual models to recognise and label images when class names, supervision signals, or evaluation benchmarks span multiple languages. Enabled by multilingual vision-language models such as CLIP, it allows a single model to classify images using prompts or labels in any supported language, facilitating cross-cultural and cross-lingual deployment of computer vision systems.Image classification is the task of assigning a single semantic label to an entire image from a fixed set of categories. Modern approaches rely on deep convolutional neural networks (CNNs) or Vision Transformers (ViTs) trained end-to-end on large labeled datasets such as ImageNet, achieving superhuman accuracy on many benchmarks and underpinning applications from medical imaging to autonomous vehicles.
ScholarGateΣύνολο δεδομένων
  1. v1
  2. 2 Πηγές
  3. PUBLISHED
  1. v1
  2. 2 Πηγές
  3. PUBLISHED

Μετάβαση στην αναζήτηση Λήψη διαφανειών

ScholarGateΣύγκριση μεθόδων: Multilingual Image Classification · Image Classification. Ανακτήθηκε στις 2026-06-15 από https://scholargate.app/el/compare