ScholarGate
Asistente

Comparar métodos

Revisa los métodos seleccionados uno junto a otro; las filas que difieren aparecen resaltadas.

Vision Transformer con Supervisión Débil×Aprendizaje autosupervisado×
CampoAprendizaje profundoAprendizaje automático
FamiliaMachine learningMachine learning
Año de origen2021–20222018–2020
Autor originalDosovitskiy et al. (ViT); weak supervision paradigm from Zhou and othersLeCun, Y. and community (formalized ~2018–2020)
TipoSelf-attention image model with weakly supervised trainingRepresentation learning paradigm
Fuente seminalDosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., & Houlsby, N. (2021). An image is worth 16x16 words: Transformers for image recognition at scale. In International Conference on Learning Representations (ICLR). link ↗LeCun, Y. & Misra, I. (2022). Self-supervised learning: The dark matter of intelligence. Meta AI Blog. https://ai.facebook.com/blog/self-supervised-learning-the-dark-matter-of-intelligence/ link ↗
AliasWS-ViT, weakly supervised ViT, weak supervision with vision transformer, ViT with weak labelsSSL, self-supervised pre-training, pretext-task learning, unsupervised representation learning
Relacionados43
ResumenWeakly Supervised Vision Transformer (WS-ViT) trains a Vision Transformer on image data that lacks precise pixel-level annotations, instead using cheaper, noisier supervision such as image-level class tags, bounding boxes, or web-scraped text. The global self-attention mechanism of the transformer makes it especially capable of localising objects and learning discriminative features from these incomplete labels.Self-supervised learning (SSL) is a machine-learning paradigm that generates its own supervisory signal directly from unlabeled data by defining an auxiliary pretext task — such as predicting masked words, rotating images, or contrasting augmented views — and uses the learned representations as a powerful starting point for downstream tasks with minimal labeled examples.
ScholarGateConjunto de datos
  1. v1
  2. 2 Fuentes
  3. PUBLISHED
  1. v1
  2. 2 Fuentes
  3. PUBLISHED

Ir a la búsqueda Descargar diapositivas

ScholarGateComparar métodos: Weakly supervised vision transformer · Self-supervised Learning. Recuperado el 2026-06-15 de https://scholargate.app/es/compare