ScholarGate
Asisten

Bandingkan metode

Tinjau metode pilihan Anda berdampingan; baris yang berbeda akan disorot.

Vision Transformer Supervisi Lemah×Pembelajaran Mandiri Terawasi×
BidangPembelajaran MendalamPembelajaran Mesin
KeluargaMachine learningMachine learning
Tahun asal2021–20222018–2020
PencetusDosovitskiy et al. (ViT); weak supervision paradigm from Zhou and othersLeCun, Y. and community (formalized ~2018–2020)
TipeSelf-attention image model with weakly supervised trainingRepresentation learning paradigm
Sumber perintisDosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., & Houlsby, N. (2021). An image is worth 16x16 words: Transformers for image recognition at scale. In International Conference on Learning Representations (ICLR). link ↗LeCun, Y. & Misra, I. (2022). Self-supervised learning: The dark matter of intelligence. Meta AI Blog. https://ai.facebook.com/blog/self-supervised-learning-the-dark-matter-of-intelligence/ link ↗
AliasWS-ViT, weakly supervised ViT, weak supervision with vision transformer, ViT with weak labelsSSL, self-supervised pre-training, pretext-task learning, unsupervised representation learning
Terkait43
RingkasanWeakly Supervised Vision Transformer (WS-ViT) trains a Vision Transformer on image data that lacks precise pixel-level annotations, instead using cheaper, noisier supervision such as image-level class tags, bounding boxes, or web-scraped text. The global self-attention mechanism of the transformer makes it especially capable of localising objects and learning discriminative features from these incomplete labels.Self-supervised learning (SSL) is a machine-learning paradigm that generates its own supervisory signal directly from unlabeled data by defining an auxiliary pretext task — such as predicting masked words, rotating images, or contrasting augmented views — and uses the learned representations as a powerful starting point for downstream tasks with minimal labeled examples.
ScholarGateSet data
  1. v1
  2. 2 Sumber
  3. PUBLISHED
  1. v1
  2. 2 Sumber
  3. PUBLISHED

Ke halaman pencarian Unduh salindia

ScholarGateBandingkan metode: Weakly supervised vision transformer · Self-supervised Learning. Diakses 2026-06-15 dari https://scholargate.app/id/compare