Transformer wa Maono unaojifundisha
Transformer wa Maono unaojifundisha (SSL-ViT) hutumia malengo ya mafunzo ya awali yanayojifundisha — kama vile utabiri wa kiraka kilichofichwa (MAE) au kujifundisha bila lebo (DINO) — kwa usanifu wa Transformer wa Maono, kuwezesha uwakilishi wenye nguvu wa kuona kujifunzwa kutoka kwa makusanyo makubwa ya picha zisizo na lebo kabla ya marekebisho yoyote maalum ya kazi.
Soma mbinu kamili
Ingia kwa akaunti ya bure ili kusoma sehemu hii.
Method map
The neighbourhood of related methods — select a node to explore.
Vyanzo
- Caron, M., Touvron, H., Misra, I., Jegou, H., Mairal, J., Bojanowski, P., & Joulin, A. (2021). Emerging Properties in Self-Supervised Vision Transformers. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 9650–9660. link ↗
- He, K., Chen, X., Xie, S., Li, Y., Dollar, P., & Girshick, R. (2022). Masked Autoencoders Are Scalable Vision Learners. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 16000–16009. link ↗
Jinsi ya kunukuu ukurasa huu
ScholarGate. (2026, June 3). Self-supervised Vision Transformer (SSL-ViT). ScholarGate. https://scholargate.app/sw/deep-learning/self-supervised-vision-transformer
Which method?
Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.
- Vision Transformer IliyobadilishwaUjifunzaji wa Kina↔ compare
- Transformer wa Maono wa MultimodalUjifunzaji wa Kina↔ compare
- Self-supervised convolutional neural networkUjifunzaji wa Kina↔ compare
- Transformer wa MaonoUjifunzaji wa Kina↔ compare
Imerejelewa na
Umeona tatizo kwenye ukurasa huu? Ripoti au pendekeza marekebisho →