ScholarGate
Msaidizi
Machine learningDeep learning / NLP / CV

Vision Transformer Iliyobadilishwa

Vision Transformer Iliyobadilishwa (Fine-Tuned ViT) hubadilisha modeli kubwa ya awali iliyofunzwa ya ViT — ambayo hugawanya picha katika vipande vya ukubwa sawa na huchakata kupitia tabaka za kujijali — kwa kazi mpya ya uainishaji au utambuzi wa picha kwa kutumia seti ndogo ya data yenye lebo. Inafikia usahihi wa hali ya juu katika taswira kompyuta kwa kutumia uwakilishi tajiri uliojifunzwa wakati wa mafunzo ya awali kwa kiwango kikubwa.

Fungua katika MethodMindHivi karibuniVideoHivi karibuniPakua slaidi

Soma mbinu kamili

Kwa wanachama pekee

Ingia kwa akaunti ya bure ili kusoma sehemu hii.

Ingia

Ramani ya mbinu

Jirani ya mbinu zinazohusiana — chagua nodi ili kuchunguza.

+4 zaidi

Vyanzo

  1. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., & Houlsby, N. (2021). An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In International Conference on Learning Representations (ICLR 2021). link
  2. Zhai, X., Kolesnikov, A., Houlsby, N., & Beyer, L. (2022). Scaling Vision Transformers. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022), pp. 12104-12113. link

Jinsi ya kunukuu ukurasa huu

ScholarGate. (2026, June 3). Fine-Tuned Vision Transformer (ViT with Task-Specific Adaptation). ScholarGate. https://scholargate.app/sw/deep-learning/fine-tuned-vision-transformer

Mbinu ipi?

Weka mbinu hii kando ya jamaa zake wa karibu na uzisome bega kwa bega — maktaba huweka vitabu mezani; uamuzi ni wako.

Linganisha bega kwa bega

Imerejelewa na

ScholarGateFine-Tuned Vision Transformer (Fine-Tuned Vision Transformer (ViT with Task-Specific Adaptation)). Imepatikana 2026-06-15 kutoka https://scholargate.app/sw/deep-learning/fine-tuned-vision-transformer · Seti ya data: https://doi.org/10.5281/zenodo.20539026