ScholarGate
Assistant

Comparer des méthodes

Examinez les méthodes sélectionnées côte à côte ; les lignes qui diffèrent sont mises en évidence.

Vision Transformer faiblement supervisé×Apprentissage auto-supervisé×
DomaineApprentissage profondApprentissage automatique
FamilleMachine learningMachine learning
Année d'origine2021–20222018–2020
Auteur d'origineDosovitskiy et al. (ViT); weak supervision paradigm from Zhou and othersLeCun, Y. and community (formalized ~2018–2020)
TypeSelf-attention image model with weakly supervised trainingRepresentation learning paradigm
Source fondatriceDosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., & Houlsby, N. (2021). An image is worth 16x16 words: Transformers for image recognition at scale. In International Conference on Learning Representations (ICLR). link ↗LeCun, Y. & Misra, I. (2022). Self-supervised learning: The dark matter of intelligence. Meta AI Blog. https://ai.facebook.com/blog/self-supervised-learning-the-dark-matter-of-intelligence/ link ↗
AliasWS-ViT, weakly supervised ViT, weak supervision with vision transformer, ViT with weak labelsSSL, self-supervised pre-training, pretext-task learning, unsupervised representation learning
Apparentées43
RésuméWeakly Supervised Vision Transformer (WS-ViT) trains a Vision Transformer on image data that lacks precise pixel-level annotations, instead using cheaper, noisier supervision such as image-level class tags, bounding boxes, or web-scraped text. The global self-attention mechanism of the transformer makes it especially capable of localising objects and learning discriminative features from these incomplete labels.Self-supervised learning (SSL) is a machine-learning paradigm that generates its own supervisory signal directly from unlabeled data by defining an auxiliary pretext task — such as predicting masked words, rotating images, or contrasting augmented views — and uses the learned representations as a powerful starting point for downstream tasks with minimal labeled examples.
ScholarGateJeu de données
  1. v1
  2. 2 Sources
  3. PUBLISHED
  1. v1
  2. 2 Sources
  3. PUBLISHED

Aller à la recherche Télécharger les diapositives

ScholarGateComparer des méthodes: Weakly supervised vision transformer · Self-supervised Learning. Consulté le 2026-06-15 sur https://scholargate.app/fr/compare