ScholarGate
助手

方法对比

并排查看您选择的方法;存在差异的行会高亮显示。

多任务学习×知识蒸馏×
领域深度学习深度学习
方法族Machine learningMachine learning
起源年份19972015
提出者Rich CaruanaHinton, G., Vinyals, O. & Dean, J.
类型Inductive transfer methodNeural network compression (teacher–student)
开创性文献Caruana, R. (1997). Multitask learning. Machine Learning, 28(1), 41–75. DOI ↗Hinton, G., Vinyals, O. & Dean, J. (2015). Distilling the Knowledge in a Neural Network. NeurIPS Deep Learning Workshop. link ↗
别名MTL, Joint Learning, Shared Representation Learning, Çok Görevli ÖğrenmeBilgi Damıtma (Knowledge Distillation), bilgi damıtma, teacher-student distillation, model distillation
相关35
摘要Multitask Learning (MTL) is a machine learning paradigm in which a model is trained simultaneously on multiple related tasks, sharing representations across them to improve generalization. Introduced formally by Rich Caruana in 1997, MTL draws on the intuition that auxiliary tasks act as inductive bias, providing extra supervision signals that help the shared layers learn richer, more robust feature representations than single-task training would yield.Knowledge Distillation is a model-compression technique, introduced by Geoffrey Hinton and colleagues in 2015, that trains a small student model using the soft-label outputs of a large teacher model. Distilled models such as DistilBERT and TinyBERT reach roughly 97% of the larger model's performance while running far faster.
ScholarGate数据集
  1. v1
  2. 1 来源
  3. PUBLISHED
  1. v1
  2. 2 来源
  3. PUBLISHED

前往搜索 下载幻灯片

ScholarGate方法对比: Multitask Learning · Knowledge Distillation. 于 2026-06-15 检索自 https://scholargate.app/zh/compare