ScholarGate
Assistent

Sammenlign metoder

Gjennomgå de valgte metodene side om side; rader som avviker, er uthevet.

Svakt veiledet Vision Transformer×Selv-supervisert læring×
FagfeltDyp læringMaskinlæring
FamilieMachine learningMachine learning
Opprinnelsesår2021–20222018–2020
OpphavspersonDosovitskiy et al. (ViT); weak supervision paradigm from Zhou and othersLeCun, Y. and community (formalized ~2018–2020)
TypeSelf-attention image model with weakly supervised trainingRepresentation learning paradigm
Opprinnelig kildeDosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., & Houlsby, N. (2021). An image is worth 16x16 words: Transformers for image recognition at scale. In International Conference on Learning Representations (ICLR). link ↗LeCun, Y. & Misra, I. (2022). Self-supervised learning: The dark matter of intelligence. Meta AI Blog. https://ai.facebook.com/blog/self-supervised-learning-the-dark-matter-of-intelligence/ link ↗
AliasWS-ViT, weakly supervised ViT, weak supervision with vision transformer, ViT with weak labelsSSL, self-supervised pre-training, pretext-task learning, unsupervised representation learning
Relaterte43
SammendragWeakly Supervised Vision Transformer (WS-ViT) trains a Vision Transformer on image data that lacks precise pixel-level annotations, instead using cheaper, noisier supervision such as image-level class tags, bounding boxes, or web-scraped text. The global self-attention mechanism of the transformer makes it especially capable of localising objects and learning discriminative features from these incomplete labels.Self-supervised learning (SSL) is a machine-learning paradigm that generates its own supervisory signal directly from unlabeled data by defining an auxiliary pretext task — such as predicting masked words, rotating images, or contrasting augmented views — and uses the learned representations as a powerful starting point for downstream tasks with minimal labeled examples.
ScholarGateDatasett
  1. v1
  2. 2 Kilder
  3. PUBLISHED
  1. v1
  2. 2 Kilder
  3. PUBLISHED

Gå til søk Last ned lysbilder

ScholarGateSammenlign metoder: Weakly supervised vision transformer · Self-supervised Learning. Hentet 2026-06-15 fra https://scholargate.app/no/compare