ScholarGate
Assistent
Machine learningDeep learning / NLP / CV

Tosproget Billedklassifikation

Tosproget billedklassifikation træner visuelle modeller til at genkende og mærke billeder, når klassenavne, overvågningssignaler eller evalueringsbenchmarks spænder over flere sprog. Muliggjort af tosprogede vision-sprogmodeller som CLIP, tillader det en enkelt model at klassificere billeder ved hjælp af prompts eller etiketter på ethvert understøttet sprog, hvilket letter tværkulturel og tværsproglig implementering af computersynssystemer.

Åbn i MethodMindSnartVideoSnartDownload slides

Læs hele metoden

Kun for medlemmer

Log ind med en gratis konto for at læse dette afsnit.

Log ind

Method map

The neighbourhood of related methods — select a node to explore.

Kilder

  1. Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., ... & Sutskever, I. (2021). Learning transferable visual models from natural language supervision. In Proceedings of the 38th International Conference on Machine Learning (ICML), pp. 8748–8763. PMLR. link
  2. Image classification. Wikipedia. link

Sådan citerer du denne side

ScholarGate. (2026, June 3). Multilingual Image Classification (Cross-Lingual Vision Model). ScholarGate. https://scholargate.app/da/deep-learning/multilingual-image-classification

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side
ScholarGateMultilingual Image Classification (Multilingual Image Classification (Cross-Lingual Vision Model)). Hentet 2026-06-15 fra https://scholargate.app/da/deep-learning/multilingual-image-classification · Datasæt: https://doi.org/10.5281/zenodo.20539026