Confronta i metodi

Esamina i metodi selezionati fianco a fianco; le righe che differiscono sono evidenziate.

	LoRA e PEFT ×	Vision Transformer ×
Campo	Apprendimento profondo	Apprendimento profondo
Famiglia	Machine learning	Machine learning
Anno di origine≠	2022	2021
Ideatore≠	Hu, E. J. et al.; Lester, B. et al.	Dosovitskiy, A. et al.
Tipo≠	Parameter-efficient fine-tuning of large pretrained models	Transformer architecture for images (self-attention over patches)
Fonte seminale≠	Hu, E. J. et al. (2022). LoRA: Low-Rank Adaptation of Large Language Models. ICLR. link ↗	Dosovitskiy, A. et al. (2021). An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. ICLR. link ↗
Alias≠	LoRA ve PEFT — Parametre Verimli İnce Ayar, Low-Rank Adaptation, parameter-efficient fine-tuning, prefix tuning	Görsel Transformer (ViT), görsel transformer, ViT, patch transformer for images
Correlati	5	5
Sintesi≠	LoRA (Low-Rank Adaptation), introduced by Hu et al. in 2022, and the broader family of parameter-efficient fine-tuning (PEFT) methods adapt large pretrained language models to new tasks by training only a small number of extra parameters instead of every weight in the model. This makes fine-tuning possible with far less GPU memory and compute while leaving the original model largely untouched.	The Vision Transformer (ViT), introduced by Dosovitskiy and colleagues in 2021, splits an image into fixed-size patches, treats those patches as a sequence, and applies the Transformer self-attention mechanism to image classification. Given enough training data, it surpasses convolutional neural networks (CNNs).
ScholarGateInsieme di dati ↗	v1 2 Fonti PUBLISHED	v1 2 Fonti PUBLISHED

Vai alla ricerca → Scarica le diapositive