ScholarGate
Assistent

Sammenlign metoder

Gennemgå dine valgte metoder side om side; rækker, der afviger, er fremhævet.

LoRA og PEFT×Vision Transformer×
FagområdeDyb læringDyb læring
FamilieMachine learningMachine learning
Oprindelsesår20222021
OphavspersonHu, E. J. et al.; Lester, B. et al.Dosovitskiy, A. et al.
TypeParameter-efficient fine-tuning of large pretrained modelsTransformer architecture for images (self-attention over patches)
Oprindelig kildeHu, E. J. et al. (2022). LoRA: Low-Rank Adaptation of Large Language Models. ICLR. link ↗Dosovitskiy, A. et al. (2021). An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. ICLR. link ↗
AliasserLoRA ve PEFT — Parametre Verimli İnce Ayar, Low-Rank Adaptation, parameter-efficient fine-tuning, prefix tuningGörsel Transformer (ViT), görsel transformer, ViT, patch transformer for images
Relaterede55
ResuméLoRA (Low-Rank Adaptation), introduced by Hu et al. in 2022, and the broader family of parameter-efficient fine-tuning (PEFT) methods adapt large pretrained language models to new tasks by training only a small number of extra parameters instead of every weight in the model. This makes fine-tuning possible with far less GPU memory and compute while leaving the original model largely untouched.The Vision Transformer (ViT), introduced by Dosovitskiy and colleagues in 2021, splits an image into fixed-size patches, treats those patches as a sequence, and applies the Transformer self-attention mechanism to image classification. Given enough training data, it surpasses convolutional neural networks (CNNs).
ScholarGateDatasæt
  1. v1
  2. 2 Kilder
  3. PUBLISHED
  1. v1
  2. 2 Kilder
  3. PUBLISHED

Gå til søgning Hent slides

ScholarGateSammenlign metoder: LoRA and PEFT · Vision Transformer. Hentet 2026-06-17 fra https://scholargate.app/da/compare