Porównaj metody

Przeglądaj wybrane metody obok siebie; wiersze, które się różnią, są wyróżnione.

	Vision Transformer ×	Maszyna wektorów nośnych (klasyfikacja)×
Dziedzina≠	Uczenie głębokie	Uczenie maszynowe
Rodzina	Machine learning	Machine learning
Rok powstania≠	2021	1995
Twórca≠	Dosovitskiy, A. et al.	Cortes, C. & Vapnik, V.
Typ≠	Transformer architecture for images (self-attention over patches)	Maximum-margin classifier (kernel method)
Źródło pierwotne≠	Dosovitskiy, A. et al. (2021). An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. ICLR. link ↗	Cortes, C. & Vapnik, V. (1995). Support-Vector Networks. Machine Learning, 20, 273–297. DOI ↗
Inne nazwy	Görsel Transformer (ViT), görsel transformer, ViT, patch transformer for images	Destek Vektör Makinesi (SVM — Sınıflandırma), support-vector network, SVM classifier, maximum-margin classifier
Pokrewne	5	5
Podsumowanie≠	The Vision Transformer (ViT), introduced by Dosovitskiy and colleagues in 2021, splits an image into fixed-size patches, treats those patches as a sequence, and applies the Transformer self-attention mechanism to image classification. Given enough training data, it surpasses convolutional neural networks (CNNs).	The Support Vector Machine, introduced by Corinna Cortes and Vladimir Vapnik in 1995, is a classifier that finds the optimal separating hyperplane between classes in a high-dimensional space. It chooses the boundary that leaves the widest possible margin to the nearest training points, which makes its decisions robust on new data.
ScholarGateZbiór danych ↗	v1 2 Źródła PUBLISHED	v1 1 Źródła PUBLISHED

Przejdź do wyszukiwania → Pobierz slajdy