Machine learningDeep Learning, Language Models, Parameter Efficient Fine-Tuning

QLoRA

QLoRA 是一种由 Dettmers 等人于 2023 年提出的高效微调方法，它通过量化和低秩适配（low-rank adaptation）实现了对大型语言模型的微调。通过结合 4 位量化与 LoRA，QLoRA 将内存需求降低了 75%，使得在单个 GPU 上微调拥有 650 亿参数的模型成为可能。

在 MethodMind 中打开即将推出视频即将推出Download slides

阅读完整方法

仅限会员

使用免费账户登录即可阅读本节。

Method map

The neighbourhood of related methods — select a node to explore.

QLoRA

直接偏好优化潜在扩散模型 Mamba（状态空间模型）掩码自编码器

来源

Dettmers, T., Pagnoni, A., Holtzman, A., & Contrastive, L. (2023). QLoRA: Efficient finetuning of quantized LLMs. arXiv preprint arXiv:2305.14314. link ↗

如何引用本页

ScholarGate. (2026, June 3). Efficient Finetuning of Quantized LLMs. ScholarGate. https://scholargate.app/zh/deep-learning/qlora

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side →

被引用于

直接偏好优化

发现本页有问题？报告或提出修改建议 →

阅读完整方法

Method map

来源

如何引用本页

相关方法

Which method?

被引用于