Vision Transformer
The Vision Transformer (ViT), introduced by Dosovitskiy et al. in 2021, splits an image into fixed-size patches, treats those patches as a sequence, and applies the Transformer self-attention mechanism to image classification. Given enough training data, it can match or exceed convolutional neural networks (CNNs).
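The two ideas in the summary above, turning an image into a sequence of patches and letting every patch attend to every other patch, can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the published model: the function names `patchify` and `self_attention` are hypothetical, the attention uses identity projections in place of learned query, key and value matrices, and the learned patch embedding, class token, position embeddings and classification head of a full ViT are all left out.

```python
import math


def patchify(image, patch):
    """Split an H x W x C image (nested lists) into flattened, non-overlapping patches.

    Returns (H // patch) * (W // patch) vectors, each of length
    patch * patch * C, in row-major patch order.
    """
    height, width = len(image), len(image[0])
    if height % patch or width % patch:
        raise ValueError("image sides must be divisible by the patch size")
    sequence = []
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            vector = []
            for row in range(top, top + patch):
                for col in range(left, left + patch):
                    vector.extend(image[row][col])  # all channels of one pixel
            sequence.append(vector)
    return sequence


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Single-head scaled dot-product self-attention.

    Assumption for illustration: queries, keys and values are the tokens
    themselves; a trained model would apply learned projections first.
    """
    dim = len(tokens[0])
    output = []
    for query in tokens:
        # Similarity of this token to every token in the sequence.
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in tokens
        ]
        weights = softmax(scores)
        # Each output token is a weighted average of all tokens.
        mixed = [
            sum(w * value[i] for w, value in zip(weights, tokens))
            for i in range(dim)
        ]
        output.append(mixed)
    return output


# Toy 4 x 4 RGB image whose pixel values encode their position.
toy_image = [[[r / 4, c / 4, 0.5] for c in range(4)] for r in range(4)]
patches = patchify(toy_image, patch=2)  # 4 patches, 12 values each
attended = self_attention(patches)      # same shape, each row mixes all patches
```

With a patch size of 2, the 4 x 4 toy image becomes a sequence of four 12-dimensional tokens. Because attention weights are computed between all pairs of patches, every output token can draw on information from the whole image in a single layer, which is the main contrast with the local receptive field of a convolution.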
Sources
How to cite this page
ScholarGate. (2026, June 1). Vision Transformer (ViT). ScholarGate. https://scholargate.app/sw/deep-learning/vision-transformer
Which method?
Closely related methods to compare with the Vision Transformer:
- Diffusion Model (Deep Learning)
- Generative Adversarial Network (GAN) (Deep Learning)
- Random Forest (Machine Learning)
- Support Vector Machine (Classification) (Machine Learning)
- Variational Autoencoder (Deep Learning)