ScholarGate
Βοηθός

Σύγκριση μεθόδων

Εξετάστε τις επιλεγμένες μεθόδους δίπλα-δίπλα· οι γραμμές που διαφέρουν επισημαίνονται.

Ασθενώς επιβλεπόμενος Μετασχηματιστής Όρασης×Αυτο-εποπτευόμενη Μάθηση×
ΠεδίοΒαθιά ΜάθησηΜηχανική Μάθηση
ΟικογένειαMachine learningMachine learning
Έτος προέλευσης2021–20222018–2020
ΔημιουργόςDosovitskiy et al. (ViT); weak supervision paradigm from Zhou and othersLeCun, Y. and community (formalized ~2018–2020)
ΤύποςSelf-attention image model with weakly supervised trainingRepresentation learning paradigm
Θεμελιώδης πηγήDosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., & Houlsby, N. (2021). An image is worth 16x16 words: Transformers for image recognition at scale. In International Conference on Learning Representations (ICLR). link ↗LeCun, Y. & Misra, I. (2022). Self-supervised learning: The dark matter of intelligence. Meta AI Blog. https://ai.facebook.com/blog/self-supervised-learning-the-dark-matter-of-intelligence/ link ↗
Εναλλακτικές ονομασίεςWS-ViT, weakly supervised ViT, weak supervision with vision transformer, ViT with weak labelsSSL, self-supervised pre-training, pretext-task learning, unsupervised representation learning
Συναφείς43
ΣύνοψηWeakly Supervised Vision Transformer (WS-ViT) trains a Vision Transformer on image data that lacks precise pixel-level annotations, instead using cheaper, noisier supervision such as image-level class tags, bounding boxes, or web-scraped text. The global self-attention mechanism of the transformer makes it especially capable of localising objects and learning discriminative features from these incomplete labels.Self-supervised learning (SSL) is a machine-learning paradigm that generates its own supervisory signal directly from unlabeled data by defining an auxiliary pretext task — such as predicting masked words, rotating images, or contrasting augmented views — and uses the learned representations as a powerful starting point for downstream tasks with minimal labeled examples.
ScholarGateΣύνολο δεδομένων
  1. v1
  2. 2 Πηγές
  3. PUBLISHED
  1. v1
  2. 2 Πηγές
  3. PUBLISHED

Μετάβαση στην αναζήτηση Λήψη διαφανειών

ScholarGateΣύγκριση μεθόδων: Weakly supervised vision transformer · Self-supervised Learning. Ανακτήθηκε στις 2026-06-15 από https://scholargate.app/el/compare