ScholarGate

Method Comparison

Review the selected methods side by side. Rows that differ between methods are highlighted.

| | Multitask Learning (MTL) | Curriculum Learning | Knowledge Distillation |
|---|---|---|---|
| Field | Deep learning | Deep learning | Deep learning |
| Family | Machine learning | Machine learning | Machine learning |
| Year of origin | 1997 | 2009 | 2015 |
| Originator | Rich Caruana | Yoshua Bengio et al. | Hinton, G., Vinyals, O. & Dean, J. |
| Type | Inductive transfer method | Training strategy | Neural network compression (teacher–student) |
| Original source | Caruana, R. (1997). Multitask learning. Machine Learning, 28(1), 41–75. | Bengio, Y., Louradour, J., Collobert, R., & Weston, J. (2009). Curriculum learning. International Conference on Machine Learning (ICML), 41–48. | Hinton, G., Vinyals, O. & Dean, J. (2015). Distilling the Knowledge in a Neural Network. NeurIPS Deep Learning Workshop. |
| Aliases | MTL, Joint Learning, Shared Representation Learning, Çok Görevli Öğrenme | Scheduled Training, Difficulty-Based Training, Self-Paced Learning, Müfredat Öğrenimi | Bilgi Damıtma, teacher-student distillation, model distillation |
| Related | 3 | 3 | 5 |

Summary

Multitask Learning: Multitask Learning (MTL) is a machine learning paradigm in which a model is trained simultaneously on multiple related tasks, sharing representations across them to improve generalization. Introduced formally by Rich Caruana in 1997, MTL rests on the intuition that auxiliary tasks act as an inductive bias: they provide extra supervision signals that help the shared layers learn richer, more robust feature representations than single-task training would yield.

Curriculum Learning: Curriculum Learning is a training strategy for machine learning models, introduced by Bengio et al. in 2009, in which training examples are presented in a meaningful order, typically from easy to hard, rather than at random. Inspired by how humans and animals learn progressively, it organizes training data into a curriculum that starts with simpler, cleaner, or more representative samples and gradually introduces harder or more complex examples as the model matures.

Knowledge Distillation: Knowledge Distillation is a model-compression technique, introduced by Geoffrey Hinton and colleagues in 2015, that trains a small student model using the soft-label outputs of a large teacher model. Distilled models such as DistilBERT and TinyBERT are reported to retain roughly 97% of the larger model's performance while being smaller and running far faster.
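The core mechanism in each of the three summaries can be illustrated with a few lines of plain Python. The sketch below is a minimal illustration under stated assumptions, not the formulation used in any of the cited papers: the function names (`multitask_loss`, `curriculum_order`, `distillation_loss`), the fixed task weights, the difficulty score, and the temperature value are all hypothetical choices made for this example. The distillation function covers only the soft-label term; practical setups usually combine it with a loss on the true labels.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax; a higher temperature gives softer probabilities."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def multitask_loss(task_losses, task_weights):
    """Multitask learning: one scalar loss built from several per-task losses.

    Assumes every task loss was computed from the same shared representation,
    so minimizing the weighted sum updates the shared layers for all tasks.
    """
    return sum(task_weights[name] * loss for name, loss in task_losses.items())


def curriculum_order(examples, difficulty):
    """Curriculum learning: present examples from easy to hard.

    `difficulty` is a hypothetical scoring function supplied by the caller.
    """
    return sorted(examples, key=difficulty)


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """Knowledge distillation: cross-entropy of the student against teacher soft labels."""
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    return -sum(t * math.log(s) for t, s in zip(p_teacher, p_student))
```

For example, `curriculum_order(sentences, difficulty=len)` would use sentence length as a stand-in for difficulty, and `distillation_loss` is smallest when the student's logits reproduce the teacher's distribution.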
ScholarGate dataset
Multitask Learning: v1, 1 source, published
Curriculum Learning: v1, 1 source, published
Knowledge Distillation: v1, 2 sources, published


ScholarGate method comparison: Multitask Learning · Curriculum Learning · Knowledge Distillation. Retrieved 2026-06-17 from https://scholargate.app/ko/compare