ScholarGate
Асистент

Сравнение на методи

Прегледайте избраните методи един до друг; редовете с разлики са откроени.

Полу-наблюдавано обучение с подсилване×Слабо контролирано обучение с подкрепление×
ОбластДълбоко обучениеДълбоко обучение
СемействоMachine learningMachine learning
Година на възникване2020s2010s–present
СъздателMultiple contributors (Laskin, Srinivas, Abbeel et al.)Multiple contributors; reward-learning framing: Christiano et al. (2017)
ТипSemi-supervised training paradigm for RL agentsReinforcement learning with imperfect or partial reward supervision
Основополагащ източникZhan, X., Zhu, X., & Shi, H. (2022). Deepthermal: Combustion optimization for thermal power generating units using offline reinforcement learning. Proceedings of the AAAI Conference on Artificial Intelligence, 36(4), 4680–4688. link ↗Sutton, R. S. & Barto, A. G. (2018). Reinforcement Learning: An Introduction (2nd ed.). MIT Press. ISBN: 978-0-262-03924-6
Други названияSSRL, semi-supervised RL, RL with unlabeled data, label-efficient reinforcement learningWSRL, weak-reward RL, imperfect-reward reinforcement learning, reward-impoverished RL
Свързани63
РезюмеSemi-supervised reinforcement learning (SSRL) combines standard reinforcement learning — where an agent learns from sparse reward signals — with semi-supervised techniques that extract structure from unlabeled environment interactions. The goal is to improve sample efficiency and generalization when reward feedback is costly, delayed, or available only for a fraction of the agent's experience.Weakly supervised reinforcement learning (WSRL) trains agents in environments where the reward signal is imperfect, sparse, delayed, or only partially informative — unlike dense fully-supervised RL. The agent must learn effective policies despite incomplete feedback, using auxiliary signals, reward modeling, or preference learning to compensate for the weak supervision.
ScholarGateНабор от данни
  1. v1
  2. 2 Източници
  3. PUBLISHED
  1. v1
  2. 2 Източници
  3. PUBLISHED

Към търсенето Изтегляне на слайдове

ScholarGateСравнение на методи: Semi-supervised Reinforcement Learning · Weakly supervised reinforcement learning. Извлечено на 2026-06-17 от https://scholargate.app/bg/compare