ScholarGate
עוזר

השוואת שיטות

סקרו את השיטות שבחרתם זו לצד זו; שורות שבהן יש הבדל מודגשות.

רשתות קולמוגורוב-ארנולד×Mamba (מודל מרחב מצב)×מקודדים אוטומטיים ממוסכים×טרנספורמר ראייה×
תחוםלמידה עמוקהלמידה עמוקהלמידה עמוקהלמידה עמוקה
משפחהMachine learningMachine learningMachine learningMachine learning
שנת המקור2024202320212021
הוגה השיטהZiming LiuAlbert GuKaiming HeDosovitskiy, A. et al.
סוגNeural network architectureNeural network architectureNeural network architectureTransformer architecture for images (self-attention over patches)
מקור מכונןLiu, Z., Wang, Y., Vaidya, S., Ruehle, F., Halverson, J., Soljačić, M., Hou, T. Y., & Tegmark, M. (2024). KAN: Kolmogorov-Arnold Networks. arXiv preprint arXiv:2404.19756. link ↗Gu, A., & Dao, C. (2023). Mamba: Linear-time sequence modeling with selective state spaces. arXiv preprint arXiv:2312.08956. link ↗He, K., Chen, X., Xie, S., Li, Y., Dollár, P., & Girshick, R. (2022). Masked autoencoders are scalable vision learners. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 16000-16009). DOI ↗Dosovitskiy, A. et al. (2021). An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. ICLR. link ↗
כינוייםKAN, Kolmogorov-ArnoldMamba, State space models, Selective state spaceMAE, Vision MAEGörsel Transformer (ViT), görsel transformer, ViT, patch transformer for images
קשורות4445
תקצירKolmogorov-Arnold Networks (KAN) is a neural network architecture introduced by Liu et al. in 2024 that replaces linear transformations with learned univariate functions on edges. Inspired by the Kolmogorov-Arnold representation theorem, KAN achieves superior function approximation with fewer parameters than traditional MLPs, offering potential efficiency gains and improved interpretability.Mamba is a sequence model architecture introduced by Gu and Dao in 2023 that achieves linear-time complexity while maintaining strong performance on language modeling tasks. By combining state space models with input-dependent selectivity, Mamba addresses the quadratic complexity of transformers while preserving modeling power.Masked Autoencoders (MAE) is a self-supervised learning approach introduced by He et al. in 2021 that masks random patches of an image and trains a model to reconstruct the missing content. Adapting the masked language modeling paradigm from NLP to vision, MAE learns rich visual representations by solving a challenging reconstruction task without requiring labels.The Vision Transformer (ViT), introduced by Dosovitskiy and colleagues in 2021, splits an image into fixed-size patches, treats those patches as a sequence, and applies the Transformer self-attention mechanism to image classification. Given enough training data, it surpasses convolutional neural networks (CNNs).
ScholarGateמערך נתונים
  1. v1
  2. 1 מקורות
  3. PUBLISHED
  1. v1
  2. 1 מקורות
  3. PUBLISHED
  1. v1
  2. 1 מקורות
  3. PUBLISHED
  1. v1
  2. 2 מקורות
  3. PUBLISHED

מעבר לחיפוש הורדת מצגת

ScholarGateהשוואת שיטות: Kolmogorov-Arnold Networks · Mamba (State Space Model) · Masked Autoencoders · Vision Transformer. אוחזר בתאריך 2026-06-20 מתוך https://scholargate.app/he/compare