ScholarGate
Асистент

Порівняння методів

Переглядайте обрані методи поруч; рядки з відмінностями підсвічено.

Мережі Колмогорова-Арнольда×Mamba (модель на основі простору станів)×Масковані автокодувальники×Трансформер для комп'ютерного зору×
ГалузьГлибоке навчанняГлибоке навчанняГлибоке навчанняГлибоке навчання
РодинаMachine learningMachine learningMachine learningMachine learning
Рік появи2024202320212021
Автор методуZiming LiuAlbert GuKaiming HeDosovitskiy, A. et al.
ТипNeural network architectureNeural network architectureNeural network architectureTransformer architecture for images (self-attention over patches)
Основоположне джерелоLiu, Z., Wang, Y., Vaidya, S., Ruehle, F., Halverson, J., Soljačić, M., Hou, T. Y., & Tegmark, M. (2024). KAN: Kolmogorov-Arnold Networks. arXiv preprint arXiv:2404.19756. link ↗Gu, A., & Dao, C. (2023). Mamba: Linear-time sequence modeling with selective state spaces. arXiv preprint arXiv:2312.08956. link ↗He, K., Chen, X., Xie, S., Li, Y., Dollár, P., & Girshick, R. (2022). Masked autoencoders are scalable vision learners. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 16000-16009). DOI ↗Dosovitskiy, A. et al. (2021). An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. ICLR. link ↗
Інші назвиKAN, Kolmogorov-ArnoldMamba, State space models, Selective state spaceMAE, Vision MAEGörsel Transformer (ViT), görsel transformer, ViT, patch transformer for images
Пов'язані4445
ПідсумокKolmogorov-Arnold Networks (KAN) is a neural network architecture introduced by Liu et al. in 2024 that replaces linear transformations with learned univariate functions on edges. Inspired by the Kolmogorov-Arnold representation theorem, KAN achieves superior function approximation with fewer parameters than traditional MLPs, offering potential efficiency gains and improved interpretability.Mamba is a sequence model architecture introduced by Gu and Dao in 2023 that achieves linear-time complexity while maintaining strong performance on language modeling tasks. By combining state space models with input-dependent selectivity, Mamba addresses the quadratic complexity of transformers while preserving modeling power.Masked Autoencoders (MAE) is a self-supervised learning approach introduced by He et al. in 2021 that masks random patches of an image and trains a model to reconstruct the missing content. Adapting the masked language modeling paradigm from NLP to vision, MAE learns rich visual representations by solving a challenging reconstruction task without requiring labels.The Vision Transformer (ViT), introduced by Dosovitskiy and colleagues in 2021, splits an image into fixed-size patches, treats those patches as a sequence, and applies the Transformer self-attention mechanism to image classification. Given enough training data, it surpasses convolutional neural networks (CNNs).
ScholarGateНабір даних
  1. v1
  2. 1 Джерела
  3. PUBLISHED
  1. v1
  2. 1 Джерела
  3. PUBLISHED
  1. v1
  2. 1 Джерела
  3. PUBLISHED
  1. v1
  2. 2 Джерела
  3. PUBLISHED

Перейти до пошуку Завантажити слайди

ScholarGateПорівняння методів: Kolmogorov-Arnold Networks · Mamba (State Space Model) · Masked Autoencoders · Vision Transformer. Отримано 2026-06-20 з https://scholargate.app/uk/compare