ScholarGate
Msaidizi
Machine learningDeep learning / NLP / CV

Kigeuzi cha Taswira Kinachosimamiwa Kidogo

Kigeuzi cha Taswira Kinachosimamiwa Kidogo (WS-ViT) hufunza Kigeuzi cha Taswira kwa data ya picha isiyo na ufafanuzi sahihi wa kiwango cha pikseli, badala yake hutumia usimamizi wa bei nafuu na wenye kelele zaidi kama vile vitambulisho vya darasa la kiwango cha picha, visanduku vya mipaka, au maandishi yaliyochukuliwa kutoka mtandaoni. Utaratibu wa umakini binafsi wa kimataifa wa kigeuzi huifanya iwe na uwezo maalum wa kubainisha vitu na kujifunza sifa bainifu kutoka kwa lebo hizi zisizokamilika.

Fungua katika MethodMindHivi karibuniVideoHivi karibuniDownload slides

Soma mbinu kamili

Kwa wanachama pekee

Ingia kwa akaunti ya bure ili kusoma sehemu hii.

Ingia

Method map

The neighbourhood of related methods — select a node to explore.

Vyanzo

  1. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., & Houlsby, N. (2021). An image is worth 16x16 words: Transformers for image recognition at scale. In International Conference on Learning Representations (ICLR). link
  2. Zhou, Z.-H. (2022). A brief introduction to weakly supervised learning. National Science Review, 5(1), 44–53. DOI: 10.1093/nsr/nwx106

Jinsi ya kunukuu ukurasa huu

ScholarGate. (2026, June 3). Weakly Supervised Vision Transformer (WS-ViT). ScholarGate. https://scholargate.app/sw/deep-learning/weakly-supervised-vision-transformer

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side
ScholarGateWeakly supervised vision transformer (Weakly Supervised Vision Transformer (WS-ViT)). Imepatikana 2026-06-15 kutoka https://scholargate.app/sw/deep-learning/weakly-supervised-vision-transformer · Seti ya data: https://doi.org/10.5281/zenodo.20539026